Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".

Python 74 5 Updated Jun 23, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python 1,735 157 Updated Feb 27, 2026

PyTorch implementation of "Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models", AAAI 2025

Python 38 3 Updated Feb 4, 2026

PyTorch implementation of "Sinkhorn Distance Minimization for Knowledge Distillation", COLING 2024 and TNNLS 2024

Python 120 2 Updated Apr 27, 2025

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 744 45 Updated Jun 6, 2025

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 334 22 Updated Apr 24, 2025

Official Repo for Open-Reasoner-Zero

Python 2,088 119 Updated Jun 2, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,830 107 Updated Mar 18, 2025

Speech recognition

C 1,317 185 Updated Mar 19, 2026

An MCP-based chatbot

C++ 25,125 5,403 Updated Mar 26, 2026

Efficient Triton Kernels for LLM Training

Python 6,240 505 Updated Mar 26, 2026

O1 Replication Journey

2,001 61 Updated Jan 14, 2025
Python 1,347 53 Updated Nov 21, 2024

[ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)

Python 128 13 Updated Mar 7, 2024

POT: Python Optimal Transport

Python 2,774 542 Updated Mar 11, 2026

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,839 130 Updated Jan 17, 2025

[ICLR 2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models

Python 174 14 Updated Dec 14, 2023

Implementation of Sinkhorn algorithms in Torch.

Python 4 1 Updated Aug 16, 2024

Code for the paper "BiLD: Bi-directional Logits Difference Loss for Large Language Model Distillation"

Python 11 1 Updated Dec 17, 2024

An Open Source Toolkit For LLM Distillation

Python 896 120 Updated Mar 14, 2026

LLM deployment project based on MNN. This project has been merged into MNN.

C++ 1,618 178 Updated Jan 20, 2025

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python 3,277 376 Updated Sep 7, 2025

A curated list of neural network pruning resources.

2,492 332 Updated Apr 4, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 1,113 131 Updated Oct 7, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,269 72 Updated Mar 9, 2025

A simple and effective LLM pruning approach.

Python 859 122 Updated Aug 9, 2024

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Python 643 57 Updated Mar 4, 2024

A family of compressed models obtained via pruning and knowledge distillation

374 18 Updated Nov 6, 2025

A flexible and efficient training framework for large-scale alignment tasks

Python 452 39 Updated Oct 23, 2025