4 projects
nanoPPO
A flexible and efficient implementation of the Proximal Policy Optimization (PPO) algorithm for reinforcement learning.
nanoDPO
A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM model for time series data, inspired by the paper of DPO in fine-tuning unsupervised Language Models
nChain
nchain is a flexible and efficent framework to create LLM bots using embeddings over extensible dataset
nanoLLM
nanoLLM is a flexible and efficent framework to create LLMs