Stars
TheMCPCompany: Creating General-purpose Agents with Task-specific Tools
Stanford NLP Python library for understanding and improving PyTorch models via interventions
Our library for RL environments + evals
Crosslingual Reasoning through Test-Time Scaling
Training Large Language Models to Reason in a Continuous Latent Space
Scalable RL solution for advanced reasoning of language models
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
Repo of paper "Free Process Rewards without Process Labels"
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024
llama3 implementation one matrix multiplication at a time
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Multilingual Large Language Models Evaluation Benchmark
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
A curated list of fellowships for graduate students in Computer Science and related fields.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
BLOOM+1: Adapting BLOOM model to support a new unseen language
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, along with new ideas, new problems, new resources, and new projects from work and research.
Implementation of popular ML algorithms from scratch