lewtun

🤠

Cowboy post-training @ Hugging Face

1.5k followers · 0 following

Achievements

x4 x4 x4

Lists (5)

Stars

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,891 422 Updated Mar 24, 2026

EdisonScientific / labbench2

labbench2

Python 31 10 Updated Mar 23, 2026

andimarafioti / nano-parakeet

Pure-PyTorch Parakeet TDT inference

Python 32 6 Updated Mar 10, 2026

huggingface / skills

Give your agents the power of the Hugging Face ecosystem

Python 9,867 598 Updated Mar 24, 2026

CMU-AIRe / QED-Nano

Training tiny models to prove hard theorems

Python 65 10 Updated Mar 5, 2026

huggingface / speech-to-speech

Build local voice agents with open-source models

Python 4,611 529 Updated Mar 23, 2026

DevAgentForge / Open-Claude-Cowork

OpenSource Claude Cowork. A desktop AI assistant that helps you with programming, file management, and any task you can describe.

TypeScript 3,063 413 Updated Mar 21, 2026

nico-martin / hermit

Route Claude Code requests to local or alternative AI models while maintaining the same interface.

JavaScript 4 Updated Dec 11, 2025

hallerite / ludic

Ludic – an LLM-RL library for the era of experience

Python 62 8 Updated Jan 9, 2026

Partynumbers42 / phasr

Package to calculate scattering phase shifts for arbitrary radial potentials using the phase shift method, as well as the resulting cross sections, as mainly used in the context of elastic electron nucleu…

Python 3 3 Updated Dec 11, 2025

radixark / miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,011 136 Updated Mar 24, 2026

axon-rl / gem

A Gym for Agentic LLMs

Python 469 30 Updated Jan 21, 2026

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 1,317 212 Updated Mar 20, 2026

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,710 283 Updated Mar 24, 2026

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 383 39 Updated Mar 23, 2026

TencentCloudADP / youtu-agent

A simple yet powerful agent framework that delivers with open-source models

Python 4,483 460 Updated Mar 21, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 2,972 359 Updated Mar 24, 2026

groq / openbench

Provider-agnostic, open-source evaluation infrastructure for language models

Python 751 99 Updated Mar 16, 2026

dylanebert / VibeGame

A 3D game engine designed for vibe coding

TypeScript 67 10 Updated Dec 21, 2025

huggingface / smol2operator

Python 128 17 Updated Sep 23, 2025

facebookresearch / meta-agents-research-environments

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 458 62 Updated Mar 21, 2026

chenchen0103 / ACEBench

Python 172 24 Updated Oct 29, 2025

SalesforceAIResearch / MCP-Universe

MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.

Python 573 75 Updated Mar 23, 2026

huggingface / trl-jobs

Train LLMs on Hugging Face infra

Python 71 9 Updated Nov 13, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 5,179 757 Updated Mar 24, 2026

icip-cas / LiveMCPBench

LiveMCPBench is a benchmark for evaluating the ability of agents to navigate and utilize a large-scale MCP toolset. It provides a comprehensive set of tasks that challenge agents to effectively use…

Python 95 14 Updated Dec 18, 2025