← all repositories
ScottfreeLLC/AlphaPy

AutoML for traders who'd rather not hand-roll pipelines

A scikit-learn wrapper that tries to automate the boring parts of building trading and sports-betting models.

1.7k stars Python Domain AppsML Frameworks
AlphaPy
Velocity · 7d
+0.5
★ / day
Trend
steady
star history

What it does

AlphaPy is a Python framework that wraps scikit-learn, Keras, XGBoost, LightGBM, and CatBoost into two domain-specific pipelines: MarketFlow for financial time-series and SportFlow for sports prediction. It handles feature engineering, model training, ensemble stacking, and portfolio tear-sheet generation via Quantopian’s pyfolio. The goal is letting speculators run ML without writing boilerplate.

The interesting bit

The project explicitly splits its audience in two: “speculators and data scientists.” That honesty is refreshing. Most of the value is in the glue — standardizing a config-driven workflow so you can swap between Random Forest and a Keras net without rewriting data loaders. The sports-betting angle (SportFlow) is also unusual; most AutoML tools ignore that niche entirely.

Key highlights

  • Supports blended and stacked ensembles out of the box
  • Includes portfolio analysis via pyfolio integration
  • Domain-specific pipelines: MarketFlow (trading) and SportFlow (sports)
  • Legacy repo; active development moved to AlphaPy Pro (Python 3.12+, UV package management, NLP features, MetaLabeling)
  • Pro version adds CI/CD, docs, and “enterprise-grade capabilities” (vague, but noted)

Caveats

  • This repository is legacy; the README itself redirects users to AlphaPy Pro
  • XGBoost requires manual platform-specific installation on Mac and Windows
  • pyfolio dependency has a known numpy compatibility bug requiring a git install workaround
  • No candidate images provided, and the README’s referenced image URLs appear to point to a different fork (Alpha314/AlphaPy)

Verdict

Worth a look if you want opinionated scaffolding for trading or sports ML and don’t mind migrating to the Pro fork for active support. Skip it if you need bleeding-edge research flexibility — this is automation for practitioners, not framework builders.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.