Skip to content
View lewtun's full-sized avatar
🤠
🤠

Highlights

  • Pro

Organizations

@huggingface @nlp-with-transformers

Block or report lewtun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,891 422 Updated Mar 24, 2026

labbench2

Python 31 10 Updated Mar 23, 2026

Pure-PyTorch Parakeet TDT inference

Python 32 6 Updated Mar 10, 2026

Give your agents the power of the Hugging Face ecosystem

Python 9,867 598 Updated Mar 24, 2026

Training tiny models to prove hard theorems

Python 65 10 Updated Mar 5, 2026

Build local voice agents with open-source models

Python 4,611 529 Updated Mar 23, 2026

OpenSource Claude Cowork. A desktop AI assistant that helps you with programming, file management, and any task you can describe.

TypeScript 3,063 413 Updated Mar 21, 2026

Route Claude Code requests to local or alternative AI models while maintaining the same interface.

JavaScript 4 Updated Dec 11, 2025

Ludic – an LLM-RL library for the era of experience

Python 62 8 Updated Jan 9, 2026

Package to calculate scattering phase shifts for arbitrary radial potentials using the phase shift method as well as resulting crosssections as mainly used in the context of elastic electron nucleu…

Python 3 3 Updated Dec 11, 2025

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,011 136 Updated Mar 24, 2026

A Gym for Agentic LLMs

Python 469 30 Updated Jan 21, 2026

An interface library for RL post training with environments.

Python 1,317 212 Updated Mar 20, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,710 283 Updated Mar 24, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 383 39 Updated Mar 23, 2026

A simple yet powerful agent framework that delivers with open-source models

Python 4,483 460 Updated Mar 21, 2026

Post-training with Tinker

Python 2,972 359 Updated Mar 24, 2026

Provider-agnostic, open-source evaluation infrastructure for language models

Python 751 99 Updated Mar 16, 2026

A 3D game engine designed for vibe coding

TypeScript 67 10 Updated Dec 21, 2025
Python 128 17 Updated Sep 23, 2025

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 458 62 Updated Mar 21, 2026
Python 172 24 Updated Oct 29, 2025

MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.

Python 573 75 Updated Mar 23, 2026

Train LLM on Hugging Face infra

Python 71 9 Updated Nov 13, 2025

A PyTorch native platform for training generative AI models

Python 5,179 757 Updated Mar 24, 2026

LiveMCPBench is a benchmark for evaluating the ability of agents to navigate and utilize a large-scale MCP toolset. It provides a comprehensive set of tasks that challenge agents to effectively use…

Python 95 14 Updated Dec 18, 2025

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Python 464 56 Updated Oct 7, 2025

Collection of scripts and notebooks for OpenAI's latest GPT OSS models

Jupyter Notebook 500 52 Updated Aug 25, 2025

Build compute kernels and load them from the Hub.

Python 533 57 Updated Mar 24, 2026

Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).

Jupyter Notebook 101 5 Updated Aug 8, 2025
Next