← all repositories

AdityaNG/kan-gpt

A PyTorch implementation of GPT models using Kolmogorov-Arnold Networks (KANs) as an alternative to standard MLPs for language modeling.

724 stars Python Language ModelsML Frameworks
kan-gpt
Velocity · 7d
+0.9
★ / day
Trend
steady
star history

This repository implements Generative Pre-trained Transformers using Kolmogorov-Arnold Networks (KANs) instead of traditional multi-layer perceptrons. KANs replace linear weight parameters with spline-based functions, offering an alternative approach to transformer architectures. The project provides a working GPT implementation with text generation capabilities using the GPT2 tokenizer.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.