← all repositories

georgian-io/Multimodal-Toolkit

A toolkit that extends HuggingFace transformers to combine text embeddings with tabular categorical and numerical features for classification and regression.

620 stars Python ML FrameworksLanguage Models
Multimodal-Toolkit
Velocity · 7d
+0.3
★ / day
Trend
steady
star history

This library builds multimodal models on top of pretrained transformer architectures (BERT, ALBERT, etc.) by adding fusion layers that combine transformer outputs with categorical and numerical tabular features. It supports end-to-end training where both the combining module and transformer parameters are fine-tuned for supervised downstream tasks. The toolkit is built on PyTorch and integrates directly with HuggingFace Transformers.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.