🔜
Making progress

Highlights

  • Pro

Organizations

@Level @hyperonym


The best ChatGPT that $100 can buy.

Python 50,471 6,619 Updated Mar 26, 2026

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 19,284 2,250 Updated Mar 20, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 55,517 9,697 Updated Feb 11, 2026

How to optimize some algorithms in CUDA.

Cuda 2,891 266 Updated Mar 24, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,941 453 Updated Mar 27, 2026

Header-only C++/Python library for fast approximate nearest neighbors

C++ 5,136 801 Updated Mar 25, 2026

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,754 269 Updated Jul 18, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 991 103 Updated Sep 23, 2025

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

HTML 9,678 812 Updated Mar 26, 2026

Everything we actually know about the Apple Neural Engine (ANE)

2,434 93 Updated Mar 12, 2026

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,320 298 Updated May 11, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,150 8,432 Updated Mar 27, 2026

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,262 2,717 Updated Mar 26, 2026

Minimalist ML framework for Rust

Rust 19,829 1,491 Updated Mar 26, 2026

Fast, flexible LLM inference

Rust 6,738 548 Updated Mar 21, 2026

A natural language interface for computers

Python 62,877 5,420 Updated Feb 9, 2026

A framework for building realtime voice AI agents 🤖🎙️📹

Python 9,857 2,951 Updated Mar 27, 2026

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 4,725 364 Updated Aug 10, 2024

A generative speech model for daily dialogue.

Python 38,998 4,232 Updated Jan 18, 2026

Minimal container for Chrome's headless shell, useful for automating / driving the web

Shell 622 76 Updated Mar 21, 2026

A collective list of free APIs

Python 416,651 45,186 Updated Mar 18, 2026

📖 100 Go Mistakes and How to Avoid Them

Go 7,849 504 Updated Sep 24, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,394 1,025 Updated Jul 1, 2024

Fast and accurate AI powered file content types detection

Python 10,175 500 Updated Mar 26, 2026

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,003 1,944 Updated Jan 9, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,343 13,636 Updated Mar 26, 2026

Make images smaller using best-in-class codecs, right in the browser.

TypeScript 24,985 1,875 Updated Mar 17, 2026

Leaked prompts of GPTs

31,969 4,409 Updated Sep 27, 2024

A blazing fast inference solution for text embeddings models

Rust 4,628 375 Updated Mar 23, 2026