Skip to content
View whlook's full-sized avatar
🐢
s..l...o....w.....l......y.......
🐢
s..l...o....w.....l......y.......

Block or report whlook

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,205 13,602 Updated Mar 21, 2026

On demand communication

Python 33 2 Updated Mar 3, 2026

32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.

Python 50 1 Updated Jun 16, 2023

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

Python 426 45 Updated Jul 5, 2025

The open source coding agent.

TypeScript 130,018 13,782 Updated Mar 25, 2026

[NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

Python 738 27 Updated Nov 27, 2025

NCCL Tests

Cuda 1,468 360 Updated Mar 11, 2026

推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Python 2,089 264 Updated Mar 25, 2026

Tile primitives for speedy kernels

Cuda 3,261 263 Updated Mar 25, 2026

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,557 76 Updated Oct 16, 2025

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

1,523 86 Updated Jan 17, 2026

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,240 673 Updated Mar 25, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,745 466 Updated Mar 25, 2026

Efficient Triton Kernels for LLM Training

Python 6,238 505 Updated Mar 25, 2026

A Quirky Assortment of CuTe Kernels

Python 865 100 Updated Mar 24, 2026

NanoGPT (124M) in 2 minutes

Python 5,000 683 Updated Mar 17, 2026

Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".

Python 74 5 Updated Jun 23, 2025

<Foundations of Computer Vision> Book

PostScript 470 117 Updated Mar 22, 2026

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 18,032 3,060 Updated Mar 21, 2026

Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache).

Python 200 14 Updated Nov 17, 2025

Spark-TTS Inference Code

Python 10,959 1,171 Updated Apr 9, 2025

Finetune VITS and MMS using HuggingFace's tools

Python 194 72 Updated Mar 31, 2024

Collection of leaked system prompts

14,318 1,999 Updated Mar 23, 2026

[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

Python 110 8 Updated Jun 2, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 977 50 Updated Feb 5, 2026

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 18,913 1,436 Updated Sep 21, 2025

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 18,473 1,415 Updated Mar 25, 2026

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,972 288 Updated May 15, 2025

《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译

Python 22,816 4,520 Updated Feb 24, 2026

[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,904 312 Updated Feb 19, 2025
Next