← all repositories
AI4Finance-Foundation/FinGPT

Wall Street won't open-source its LLM. These researchers did it anyway.

FinGPT fine-tunes open-source models on financial data for under $300, aiming to democratize what Bloomberg spent millions building.

20.4k stars Jupyter Notebook Language ModelsDomain Apps
FinGPT
Velocity · 7d
+17
★ / day
Trend
steady
star history

What it does

FinGPT is a collection of open-source financial LLMs built by fine-tuning general-purpose models like Llama2 and ChatGLM2 on domain-specific datasets. The project covers sentiment analysis, headline classification, relation extraction, named-entity recognition, and a robo-advisor forecaster that predicts stock price movement from news and financials. Everything is published on HuggingFace, with training notebooks you can run on a single RTX 3090.

The interesting bit

The project’s core argument is economic, not architectural. BloombergGPT reportedly cost ~$3M and 53 days to train; FinGPT’s LoRA fine-tuning runs under $300 and updates weekly. The README is essentially a cost-accounting takedown dressed up as research — complete with GPU-hour spreadsheets comparing their $17.25 sentiment run against Bloomberg’s estimated $2.67M. Whether the comparison is fair (different scopes, different base models) is left as an exercise for the reader.

Key highlights

  • FinGPT v3.3 (Llama2-13B + LoRA) claims weighted F1 of 0.882 on financial sentiment, trained in 17 hours on one RTX 3090
  • FinGPT-Forecaster demo takes a ticker, date, and news window, then outputs analysis plus next-week price prediction
  • Six instruction-tuning datasets published on HuggingFace, from 511-row NER to 82K-row headline tasks
  • RLHF is flagged as the “secret ingredient” missing from BloombergGPT, though implementation details are sparse
  • Multiple papers accepted at NeurIPS 2023 (Instruction Workshop) and ICAIF-23

Caveats

  • The “better than GPT-4” claim applies only to fine-tuned sentiment on specific benchmarks, not general capability
  • README mixes aspirational roadmap items with shipped features; some sections read like grant proposals
  • RTX 3090 cost assumptions ($1/hour) rely on third-party GPU rental platforms, not stable cloud pricing

Verdict

Worth exploring if you’re building financial NLP on a budget or need a sentiment baseline that beats generic models. Skip if you need production-grade reliability guarantees or audited trading logic — this is research tooling with ambitious marketing.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.