
pochih/RL-Chatbot

A conversational chatbot trained using deep reinforcement learning with seq2seq architecture and policy gradient methods.

422 stars · Python · Chat Assistants
Velocity (7d): +0.1 ★/day · Trend: steady

This repository implements a chatbot trained with deep reinforcement learning. The base model is a seq2seq network whose LSTM encoder and decoder share weights. After initial seq2seq training, policy gradient reinforcement learning fine-tunes response generation against a reward function, with the aim of producing more engaging responses.
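To make the policy gradient step concrete, here is a minimal sketch of a REINFORCE update. It is not the repository's code: instead of a seq2seq LSTM it uses a toy categorical policy over three fixed candidate responses, and the names `RESPONSES`, `toy_reward`, `reinforce_step`, and `train` are hypothetical. The reward (favoring a non-dull reply) is an assumption for illustration only.

```python
import math
import random

# Hypothetical candidate responses; a real chatbot generates token sequences.
RESPONSES = ["i don't know", "tell me more about that", "ok"]


def toy_reward(idx):
    """Assumed reward: +1 for the engaging reply (index 1), -1 for dull ones."""
    return 1.0 if idx == 1 else -1.0


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_step(logits, reward_fn, rng, lr=0.5, baseline=0.0):
    """One REINFORCE update on a categorical policy parameterized by logits."""
    probs = softmax(logits)
    # Sample an action (a response index) from the current policy.
    action = rng.choices(range(len(logits)), weights=probs, k=1)[0]
    advantage = reward_fn(action) - baseline
    # Gradient of log pi(action) w.r.t. logit i is 1[i == action] - probs[i].
    new_logits = [
        logit + lr * advantage * ((1.0 if i == action else 0.0) - p)
        for i, (logit, p) in enumerate(zip(logits, probs))
    ]
    return new_logits, action


def train(steps=200, seed=0):
    """Run repeated REINFORCE updates and return the final policy probabilities."""
    rng = random.Random(seed)
    logits = [0.0] * len(RESPONSES)
    for _ in range(steps):
        logits, _ = reinforce_step(logits, toy_reward, rng)
    return softmax(logits)


if __name__ == "__main__":
    for text, p in zip(RESPONSES, train()):
        print(f"{p:.3f}  {text}")
```

In this toy setting the policy shifts probability mass toward the rewarded response. In a seq2seq model the same idea would apply per generated token, with the reward computed on the full response.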
