

Generative Models by Stability AI

Python 27,042 3,064 Updated Dec 16, 2025

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

Python 1,340 285 Updated Mar 24, 2026

PersonaPlex code.

Python 5,873 911 Updated Mar 2, 2026

Algorithm powering the For You feed on X

Rust 16,133 2,789 Updated Jan 20, 2026

Assignments adapted from Stanford CS336: Language Modeling from Scratch for UH Manoa ECE491B: Introduction to large-scale AI systems

Python 4 2 Updated May 21, 2025

Python 18,006 1,750 Updated Jan 23, 2026

Fully open reproduction of DeepSeek-R1

Python 25,964 2,417 Updated Nov 24, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,068 774 Updated Mar 24, 2026

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 14,383 1,445 Updated Mar 10, 2026

A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress

JavaScript 12,590 521 Updated Mar 23, 2026

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 9,243 2,178 Updated Feb 24, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 16,124 1,147 Updated Mar 24, 2026

A simple but efficient method to approximately calculate the users' vocabulary level

Python 41 16 Updated Sep 6, 2018

Machine Learning Paper Implementations

Python 4 Updated Dec 27, 2025

Official implementation for the paper "KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs"

Python 22 2 Updated Jan 18, 2026

[ICLR 2026 Blogpost Track Poster] JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Python 257 12 Updated Mar 11, 2026

Armchr is a set of tools for AI coding agents.

JavaScript 24 3 Updated Feb 19, 2026

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 57,097 4,721 Updated Mar 24, 2026

Learn Reinforcement Learning - A short repo of resources for studying reinforcement learning

3 1 Updated Aug 28, 2019

Python 67 7 Updated Feb 16, 2023

RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems

Jupyter Notebook 125 15 Updated Apr 26, 2022

Simulator for training and evaluation of Recommender Systems

Jupyter Notebook 57 5 Updated Mar 24, 2025

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,000 4,892 Updated Mar 24, 2026

AllenAI's post-training codebase

Python 3,651 515 Updated Mar 24, 2026

Python 4 6 Updated Oct 13, 2025

code and data for the time series analysis vids on my YouTube channel

Jupyter Notebook 769 698 Updated Apr 18, 2024

18.S096 three-week course at MIT

Jupyter Notebook 269 51 Updated Mar 25, 2023