Stars
PyTorch implementation of the Mamba-3 architecture
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)
Companion webpage to the book "Mathematics For Machine Learning"
Completed research on semantic retrieval augmented generation through novel semantic similarity graph traversal algorithms.
The fastest and highest-quality deep learning powered Sora2 watermark cleaner.
Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning
[ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective
High Resolution Depth Maps for Stable Diffusion WebUI
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…
100% Automated Anime Character Lora Training Pipeline
WentseChen / Verlog
Forked from verl-project/verl
Verlog: A Multi-turn RL framework for LLM agents
main-horse / hnet-old
Forked from goombalab/hnet
H-Net Dynamic Hierarchical Architecture
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
[NeurIPS 2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality