Packages

Packages developed in BayJarvis - autonomous agent, deep learning, reinforcement learning, preference learning, retrieval-augmented generation and more.

Sort by most recent release · downloads this week · stars

nanoPPO by jamesliu

An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention policy for reinforcement learning.

⭐ 6

Latest: 0.15.post2 on 28th November 2023

nanoDPO by jamesliu

A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM model, inspired by the paper of DPO in fine-tuning unsupervised Language Models

⭐ 5

Latest: 0.1.post1 on 25th November 2023

nChain by jamesliu

a flexible and efficient implementation to create LLM bots over extensible dataset.

⭐ 2

Latest: 0.13.post4 on 9th November 2023