Change the repository type filter
All
Repositories list
72 repositories
oat
Public🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.Rigging-ChatbotArena
Public- Automatic Functional Differentiation in JAX
- 🚢 Data Toolkit for Sailor Language Models
Meta-Unlearning
Publicsailcompass
Public- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)
optim4rl
PublicOptim4RL is a Jax framework of learning to optimize for reinforcement learning.VocabularyParallelism
Publicsdft
Public[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".P-DoS
Public- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View
regmix
Public- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
dice
PublicOfficial implementation of Bootstrapping Language Models via DPO Implicit Rewardslorahub
PublicAdan
Publiczero-bubble-megatron-deepspeed
Public archive- MetaFormer Baselines for Vision (TPAMI 2024)