Highlights

  • Pro

Organizations

@Pilot-NER @FirstNetHack

TheMCPCompany: Creating General-purpose Agents with Task-specific Tools

Python 16 2 Updated Dec 19, 2025

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 868 102 Updated Mar 6, 2026

Our library for RL environments + evals

Python 3,932 522 Updated Mar 27, 2026

structured outputs for llms

Python 12,610 978 Updated Mar 23, 2026
54 4 Updated Jan 30, 2024

Crosslingual Reasoning through Test-Time Scaling

Python 19 3 Updated May 13, 2025

A Flexible Toolkit for Dense Retrieval

Python 44 3 Updated Nov 12, 2025

s1: Simple test-time scaling

Python 6,650 765 Updated Jun 25, 2025

Training Large Language Models to Reason in a Continuous Latent Space

Python 1,545 170 Updated Aug 12, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,831 107 Updated Mar 18, 2025

OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.

Python 175 22 Updated Jan 16, 2025

Repo of paper "Free Process Rewards without Process Labels"

Python 171 11 Updated Mar 14, 2025

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.

Python 5,578 478 Updated May 21, 2025

Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024

Jupyter Notebook 18 Updated Mar 25, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,252 1,287 Updated May 23, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,903 367 Updated Dec 17, 2025

Sphynx Hallucination Induction

Python 54 2 Updated Jan 31, 2025

The Art of Debugging Open Book

Python 1,334 67 Updated Mar 16, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,678 9,492 Updated Nov 12, 2025

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 454 38 Updated Nov 2, 2025

Multilingual Large Language Models Evaluation Benchmark

Python 132 18 Updated Aug 21, 2024

A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.

Jupyter Notebook 85 18 Updated Mar 7, 2025

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.

Python 97 55 Updated Mar 16, 2026
JavaScript 4,061 1,818 Updated Jun 21, 2024

A curated list of fellowships for graduate students in Computer Science and related fields.

84 4 Updated Aug 11, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

77,425 8,950 Updated Feb 5, 2026

BLOOM+1: Adapting BLOOM model to support a new unseen language

Python 74 17 Updated Mar 2, 2024

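AIGC-interview/CV-interview/LLMs-interview: a collection of interview questions and answers, together with new ideas, new questions, new resources, and new projects from work and research.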

2,802 247 Updated Mar 5, 2026

Implementation of popular ML algorithms from scratch

Python 954 272 Updated Jan 9, 2024