AdityaNG/kan-gpt
A PyTorch implementation of GPT models using Kolmogorov-Arnold Networks (KANs) as an alternative to standard MLPs for language modeling.

Velocity · 7d: +0.9 ★ / day · Trend: steady
This repository implements Generative Pre-trained Transformers (GPTs) that use Kolmogorov-Arnold Networks (KANs) in place of traditional multi-layer perceptrons. Where an MLP layer applies scalar linear weights followed by a fixed activation, a KAN layer places a learnable spline-based function on each input-output connection and sums the results. The project provides a working GPT implementation with text generation, using the GPT-2 tokenizer.
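To make the spline idea concrete, here is a minimal sketch of a KAN-style layer. It is not taken from this repository: the names `hat_basis` and `KANLayerSketch` are hypothetical, and it is written in dependency-free Python rather than PyTorch. It also assumes piecewise-linear "hat" basis functions on a uniform grid, a simplification swapped in for the higher-order B-splines that KAN implementations typically use.

```python
import random


def hat_basis(x, grid):
    """Piecewise-linear (hat) basis values of x on a uniform, sorted grid."""
    h = grid[1] - grid[0]  # uniform knot spacing (assumed)
    return [max(0.0, 1.0 - abs(x - g) / h) for g in grid]


class KANLayerSketch:
    """Hypothetical KAN-style layer: one learnable 1-D function per edge.

    Output j is the sum over inputs i of phi[j][i](x_i), where each phi is
    a weighted sum of hat basis functions. A real layer would train the
    coefficients by gradient descent; here they are only initialised.
    """

    def __init__(self, in_features, out_features, grid_size=5,
                 lo=-1.0, hi=1.0, seed=0):
        rng = random.Random(seed)
        step = (hi - lo) / (grid_size - 1)
        self.grid = [lo + k * step for k in range(grid_size)]
        # coeffs[j][i][k]: weight of basis function k on edge i -> j
        self.coeffs = [
            [[rng.uniform(-0.1, 0.1) for _ in range(grid_size)]
             for _ in range(in_features)]
            for _ in range(out_features)
        ]

    def edge(self, j, i, x):
        """Evaluate the learnable function on the edge from input i to output j."""
        basis = hat_basis(x, self.grid)
        return sum(c * b for c, b in zip(self.coeffs[j][i], basis))

    def forward(self, xs):
        """Map a list of in_features scalars to out_features scalars."""
        return [
            sum(self.edge(j, i, x) for i, x in enumerate(xs))
            for j in range(len(self.coeffs))
        ]


layer = KANLayerSketch(in_features=3, out_features=2)
outputs = layer.forward([0.3, -0.7, 0.0])
```

In this sketch the parameter count per edge is `grid_size` rather than one, which is the main cost of replacing a scalar weight with a function. Inputs outside `[lo, hi]` fall outside the grid and contribute zero, so a practical layer would need to normalise or extend the grid.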