Stars
Accelerating MoE with IO and Tile-aware Optimizations
Implementation of DeepCrossAttention, proposed by Heddes et al. at Google Research, in PyTorch
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Spiking Brain-inspired Large Models, integrating hybrid efficient attention, MoE modules, and spike encoding into their architecture
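As a rough illustration of what spike encoding means here (a generic toy rate-coding sketch, not this project's actual scheme), continuous activations can be turned into binary spike trains whose firing rates track the activation magnitudes:

```python
import torch

def rate_code(x: torch.Tensor, timesteps: int = 4) -> torch.Tensor:
    """Toy rate coding: at each timestep, emit a binary spike with probability
    proportional to the clamped, normalized activation value."""
    p = x.clamp(min=0)
    p = p / (p.max() + 1e-8)                 # normalize to [0, 1]
    return (torch.rand(timesteps, *x.shape) < p).float()

acts = torch.randn(2, 8)                     # continuous activations
spikes = rate_code(acts, timesteps=100)      # (100, 2, 8) binary spike trains
print(spikes.mean(dim=0))                    # empirical firing rates track the normalized activations
```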
Pretraining and inference code for a large-scale depth-recurrent language model
Deep and online learning with spiking neural networks in Python
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Efficient Triton Kernels for LLM Training
Hierarchical Reasoning Model Official Release
A PyTorch-native platform for training generative AI models
Minimalistic large language model 3D-parallelism training
🔥 A minimal training framework for scaling FLA models
Build compute kernels and load them from the Hub.
Everything about the SmolLM and SmolVLM family of models
Continuous Thought Machines, because thought takes time and reasoning is a process.
CUDA Python: Performance meets Productivity
Fused Qwen3 MoE layer for faster training, compatible with Transformers, LoRA, bitsandbytes (bnb) 4-bit quantization, and Unsloth. Training LoRA on top of GGUF models is also possible.
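For context, a top-k mixture-of-experts forward pass in plain PyTorch looks roughly like the sketch below; a fused layer computes the same routing and expert projections with grouped GEMMs inside one kernel instead of a Python loop. All names and sizes here are made up, and this is not the project's code:

```python
import torch
import torch.nn.functional as F

def moe_forward(x, gate_w, w_up, w_down, top_k=2):
    """Naive top-k MoE: route each token to its top_k experts, run the chosen
    experts, and combine the outputs with the routing weights."""
    scores = x @ gate_w                                     # (tokens, n_experts)
    weights, idx = scores.softmax(dim=-1).topk(top_k, dim=-1)
    out = torch.zeros_like(x)
    for e in range(gate_w.shape[1]):                        # the loop a fused kernel removes
        token_ids, slot = (idx == e).nonzero(as_tuple=True)
        if token_ids.numel() == 0:
            continue
        h = F.silu(x[token_ids] @ w_up[e]) @ w_down[e]      # toy 2-layer MLP expert
        out.index_add_(0, token_ids, h * weights[token_ids, slot, None])
    return out

x = torch.randn(16, 32)                                     # 16 tokens, hidden size 32
out = moe_forward(x, torch.randn(32, 4), torch.randn(4, 32, 64), torch.randn(4, 64, 32))
```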
[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models
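The "2:4" semi-structured pattern means keeping at most 2 non-zero weights in every contiguous group of 4, which sparse tensor cores can accelerate. MaskLLM learns which mask to select per group; the sketch below only uses weight magnitude as a stand-in criterion to show the pattern itself:

```python
import torch

def two_to_four_mask(w: torch.Tensor) -> torch.Tensor:
    """Zero all but the 2 largest-magnitude weights in every group of 4
    along the last dimension (the 2:4 semi-structured sparsity pattern)."""
    rows, cols = w.shape
    groups = w.reshape(rows, cols // 4, 4)
    keep = groups.abs().topk(2, dim=-1).indices             # 2 survivors per group
    mask = torch.zeros_like(groups).scatter_(-1, keep, 1.0)
    return (groups * mask).reshape(rows, cols)

w = torch.randn(8, 16)
w_sparse = two_to_four_mask(w)
assert ((w_sparse.reshape(8, -1, 4) != 0).sum(dim=-1) <= 2).all()
```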
RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". It combines the best of RNNs and transformers.
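The "RNN that trains like a transformer" idea rests on linear recurrences being computable both step by step (constant-size state at inference) and in parallel over the whole sequence (for training). The toy sketch below shows this for the simplest decaying-state recurrence; it is not the RWKV-7 formulation:

```python
import torch

def recurrent(x, a):                     # RNN mode: O(1) state per step
    s, out = torch.zeros(x.shape[1]), []
    for t in range(x.shape[0]):
        s = a * s + x[t]                 # s_t = a * s_{t-1} + x_t
        out.append(s)
    return torch.stack(out)

def parallel(x, a):                      # training mode: all timesteps at once
    T = torch.arange(x.shape[0], dtype=x.dtype)
    decay = (a ** (T[:, None] - T[None, :]).clamp(min=0)).tril()
    return decay @ x                     # s_t = sum_{i<=t} a^(t-i) * x_i

x = torch.randn(6, 3)
assert torch.allclose(recurrent(x, 0.9), parallel(x, 0.9), atol=1e-5)
```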
Understand and test language model architectures on synthetic tasks.
Training Sparse Autoencoders on Language Models
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
Sparsify transformers with SAEs and transcoders
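For the sparse-autoencoder entries above, the core object is a small overcomplete autoencoder trained to reconstruct a layer's activations under a sparsity penalty, so that only a few features fire per input. A generic sketch of the idea, not any particular library's API:

```python
import torch
import torch.nn as nn

class TinySAE(nn.Module):
    """Overcomplete encoder + ReLU + linear decoder; the L1 penalty on the
    codes pushes most features to zero for any given activation."""
    def __init__(self, d_model=512, d_sae=4096):
        super().__init__()
        self.enc = nn.Linear(d_model, d_sae)
        self.dec = nn.Linear(d_sae, d_model)

    def forward(self, acts):
        codes = torch.relu(self.enc(acts))   # sparse feature activations
        return self.dec(codes), codes

sae = TinySAE()
acts = torch.randn(64, 512)                  # activations captured from one layer
recon, codes = sae(acts)
loss = (recon - acts).pow(2).mean() + 1e-3 * codes.abs().mean()
loss.backward()
```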