spotify/featran
A Scala library for feature transformation and engineering in machine learning workflows.

Velocity (7d): +0.1 ★/day · Trend: steady
Featran is a Scala library that simplifies feature engineering for machine learning by providing common transformations such as scaling, encoding, and normalization. It implements a two-step pattern: a global aggregation that summarizes the data, followed by an element-wise mapping that transforms each value. Min-Max scaling and one-hot encoding both fit this pattern. The library integrates with several data processing backends, including Spark, Flink, Scalding, and Scio.
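To make the two-step pattern concrete, here is a minimal sketch in plain Scala over in-memory collections. It does not use Featran's API; the object and method names (`TwoStepSketch`, `aggregateMinMax`, `minMaxScale`, `aggregateLabels`, `oneHot`) are hypothetical and chosen only for illustration. It also assumes that a constant column scales to 0.0, which is a choice made for this sketch.

```scala
// Illustrative only: a plain-Scala sketch of "aggregate, then map".
// None of these names come from Featran; they are assumptions for this example.
object TwoStepSketch {

  // Step 1 for Min-Max scaling (global aggregation):
  // one pass over the column to find its minimum and maximum.
  def aggregateMinMax(xs: Seq[Double]): (Double, Double) =
    xs.foldLeft((Double.PositiveInfinity, Double.NegativeInfinity)) {
      case ((lo, hi), x) => (math.min(lo, x), math.max(hi, x))
    }

  // Step 2 for Min-Max scaling (element-wise mapping):
  // rescale a single value into [0, 1] using the aggregated summary.
  // Assumption: a constant column (hi == lo) maps to 0.0.
  def minMaxScale(x: Double, lo: Double, hi: Double): Double =
    if (hi == lo) 0.0 else (x - lo) / (hi - lo)

  // Step 1 for one-hot encoding (global aggregation):
  // collect the distinct labels, sorted so the output columns are stable.
  def aggregateLabels(xs: Seq[String]): Vector[String] =
    xs.distinct.sorted.toVector

  // Step 2 for one-hot encoding (element-wise mapping):
  // emit 1.0 in the position of the matching label and 0.0 elsewhere.
  def oneHot(x: String, labels: Vector[String]): Vector[Double] =
    labels.map(l => if (l == x) 1.0 else 0.0)
}
```

In both cases the aggregation step needs the whole dataset, while the mapping step needs only one record plus the aggregated summary. That split is what lets the same logic run on a distributed backend: the first step becomes a reduce and the second a map.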