mlfoundations/open_flamingo
Open-source PyTorch framework for training and evaluating large multimodal visual-language models.

Velocity (7d): +3.1 ★/day · Trend: steady
This repository provides an implementation of DeepMind’s Flamingo visual-language model, enabling training and evaluation of models that process both images and text. It supports in-context learning and includes tools for dataset handling, model initialization, text generation, and benchmarking. The project is designed for researchers working on multimodal deep learning and large language model development.
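To make the in-context learning idea concrete, here is a minimal sketch of how a few-shot captioning prompt might be assembled for a Flamingo-style model: each demonstration is an image placeholder followed by its text, and the final image is left with an open-ended prefix for the model to complete. The helper `build_few_shot_prompt` is hypothetical (it is not part of the repository), and the special tokens `<image>` and `<|endofchunk|>` are assumed here; check them against the tokenizer of the checkpoint you actually use.

```python
from typing import List

# Assumed special tokens; verify against the tokenizer of your checkpoint.
IMAGE_TOKEN = "<image>"
END_OF_CHUNK_TOKEN = "<|endofchunk|>"


def build_few_shot_prompt(demo_captions: List[str], query_prefix: str = "") -> str:
    """Hypothetical helper: interleave image placeholders with demo captions.

    Each demonstration becomes "<image>caption<|endofchunk|>". The query image
    gets a placeholder plus an optional text prefix and no end-of-chunk token,
    so the model continues the text from there.
    """
    parts = [
        f"{IMAGE_TOKEN}{caption}{END_OF_CHUNK_TOKEN}" for caption in demo_captions
    ]
    parts.append(f"{IMAGE_TOKEN}{query_prefix}")
    return "".join(parts)


# Two demonstrations followed by one query image to be captioned.
prompt = build_few_shot_prompt(
    ["An image of two cats.", "An image of a bathroom sink."],
    query_prefix="An image of",
)
print(prompt)
```

The prompt string only carries the text side. The corresponding images (two demonstrations plus one query, in the same order as the placeholders) would be passed to the model separately as a tensor.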