Stars
Typer, build great CLIs. Easy to code. Based on Python type hints.
Companion Jupyter Notebooks for the RFSoC-Book.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode, openclaw & Gemini CLI.
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Open-source, low-cost 10.5 GHz PLFM phased array RADAR system
GitNexus: The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a GitHub repo or ZIP file, and get an interactive …
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
A library of reinforcement learning components and agents
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
A fast and simple implementation of learning algorithms for robotics.
🤖 / 🏪 Agent Index - This is the agent index for LobeChat. It accesses index.json from this repository to display a list of available agents for LobeChat to the agent market.
Python productivity for RFSoC platforms
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The China Weather Radar Toolkit, support most of China's radar formats(WSR98D, CINRAD/SA/SB/CB, CINRAD/CC/CCJ, CINRAD/SC/CD)
Author's PyTorch implementation of TD3 for OpenAI gym tasks
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
Laser for control mosquito, weed, and pest
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)