← all repositories

bakrianoo/aravec

AraVec provides 16 pre-trained Word2Vec word embedding models for Arabic text from Twitter and Wikipedia.

423 stars Jupyter Notebook Language ModelsData Tooling
aravec
Velocity · 7d
+0.1
★ / day
Trend
steady
star history

AraVec is an open-source Arabic word embedding project that offers distributed word representations (vectors) trained on over 1.1 billion Arabic tokens from tweets and Wikipedia articles. It provides both unigram and n-gram models built using the gensim library’s Word2Vec implementation. These pre-trained embeddings can be loaded and used for various Arabic NLP tasks such as text classification, sentiment analysis, and information retrieval.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.