

The best ChatGPT that $100 can buy.

Python 50,403 6,617 Updated Mar 26, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 614 67 Updated Mar 24, 2026

Implementation of the proposed DeepCrossAttention by Heddes et al. at Google Research, in PyTorch

Python 96 5 Updated Feb 24, 2025

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,322 4,919 Updated Mar 26, 2026

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,436 135 Updated Apr 24, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 5,430 486 Updated Mar 26, 2026

Spiking Brain-inspired Large Models, integrating hybrid efficient attention, MoE modules and spike encoding into its architecture

Python 1,276 174 Updated Dec 1, 2025

Pretraining and inference code for a large-scale depth-recurrent language model

Python 868 78 Updated Dec 29, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,886 588 Updated Feb 1, 2026

Deep and online learning with spiking neural networks in Python

Python 1,912 284 Updated Nov 4, 2025

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,548 730 Updated Mar 26, 2026

🌺 Minimalist Vim Plugin Manager

Vim Script 35,598 1,949 Updated Feb 18, 2026

Efficient Triton Kernels for LLM Training

Python 6,240 505 Updated Mar 26, 2026

Hierarchical Reasoning Model Official Release

Python 12,384 1,807 Updated Sep 9, 2025

A PyTorch native platform for training generative AI models

Python 5,190 759 Updated Mar 26, 2026

Minimalistic large language model 3D-parallelism training

Python 2,626 290 Updated Feb 19, 2026

🔥 A minimal training framework for scaling FLA models

Python 359 62 Updated Nov 15, 2025

Build compute kernels and load them from the Hub.

Python 536 58 Updated Mar 26, 2026

A simple and effective LLM pruning approach.

Python 859 122 Updated Aug 9, 2024

Everything about the SmolLM and SmolVLM family of models

Python 3,682 282 Updated Jan 13, 2026

Continuous Thought Machines, because thought takes time and reasoning is a process.

Python 1,807 278 Updated Dec 29, 2025

CUDA Python: Performance meets Productivity

Cython 3,196 263 Updated Mar 26, 2026

Python 126 4 Updated Feb 4, 2026

Fused Qwen3 MoE layer for faster training, compatible with Transformers, LoRA, bnb 4-bit quant, Unsloth. Also possible to train LoRA over GGUF

Python 242 14 Updated Feb 19, 2026

[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models

Python 187 12 Updated Jan 1, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,441 998 Updated Mar 26, 2026

Understand and test language model architectures on synthetic tasks.

Python 263 47 Updated Mar 22, 2026

Training Sparse Autoencoders on Language Models

Python 1,278 220 Updated Mar 19, 2026

Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling

Python 477 25 Updated May 17, 2025

Sparsify transformers with SAEs and transcoders

Python 703 97 Updated Mar 23, 2026