Stars
Generative Models by Stability AI
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Algorithm powering the For You feed on X
Assignments adapted from Stanford CS336: Language Modeling from Scratch for UH Manoa ECE491B: Introduction to large-scale AI systems
Fully open reproduction of DeepSeek-R1
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!
AirLLM: 70B inference with a single 4GB GPU
A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress
Understanding Deep Learning - Simon J.D. Prince
Qwen3-Coder is the code version of Qwen3, the large language model series developed by the Qwen team.
A simple but efficient method to approximately calculate the users' vocabulary level
Official implementation for the paper "KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs"
[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Learn Reinforcement Learning - A short repo of resources for studying reinforcement learning
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Simulator for training and evaluation of Recommender Systems
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
Code and data for the time series analysis videos on my YouTube channel