Starred repositories
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
OpenClaw-RL: Train any agent simply by talking
An open-source AI agent that brings the power of Gemini directly into your terminal.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
OpenMMLab Pre-training Toolbox and Benchmark
OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards
Fully open reproduction of DeepSeek-R1
Source code for NoteLLM and NoteLLM-2
verl: Volcano Engine Reinforcement Learning for LLMs
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
State-of-the-Art Text Embeddings
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Official PyTorch implementation of the paper "Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval"
Transformer: PyTorch Implementation of "Attention Is All You Need"
AndroidWorld is an environment and benchmark for autonomous agents
Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".
Recommend new arxiv papers of your interest daily according to your Zotero libarary.
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents