khotyn
😌 Focusing

Organizations

@acug @sofastack

Starred repositories

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

TypeScript 51,423 2,435 Updated Apr 2, 2026

🔥 The Web Data API for AI - Power AI agents with clean web data

TypeScript 102,867 6,775 Updated Apr 2, 2026

Markdown Architectural Decision Records

Markdown 2,101 451 Updated Mar 30, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 345,615 68,706 Updated Apr 2, 2026

A simple, performant and scalable Jax LLM!

Python 2,194 494 Updated Apr 2, 2026

An open-source, extensible AI agent that goes beyond code suggestions: install, execute, edit, and test with any LLM

Rust 33,936 3,181 Updated Apr 2, 2026

Incredibly fast JavaScript runtime, bundler, test runner, and package manager – all in one

Zig 88,673 4,255 Updated Apr 2, 2026

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 1,161 165 Updated Apr 2, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,258 608 Updated Apr 2, 2026

Public repository for Agent Skills

Python 109,018 12,195 Updated Mar 25, 2026

The best ChatGPT that $100 can buy.

Python 50,875 6,686 Updated Mar 27, 2026

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 3,581 593 Updated Apr 2, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 105,592 16,793 Updated Apr 1, 2026

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 752 193 Updated Apr 2, 2026

Open Source Landscapes and Insights Produced by AntOSS

TypeScript 401 20 Updated Mar 20, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,295 909 Updated Mar 30, 2026

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,485 305 Updated Jul 17, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,286 3,499 Updated Apr 2, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,392 3,556 Updated Apr 2, 2026

iTerm2 is a terminal emulator for Mac OS X that does amazing things.

Objective-C 17,319 1,345 Updated Apr 1, 2026

[HPCA 2026] AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python 323 105 Updated Mar 20, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,963 446 Updated Apr 2, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,308 850 Updated Mar 22, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,524 1,768 Updated Apr 2, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,549 1,005 Updated Mar 31, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,873 1,251 Updated Apr 1, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,335 5,127 Updated Apr 2, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,388 8,445 Updated Apr 1, 2026

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Python 334 22 Updated Apr 24, 2025

Module, Model, and Tensor Serialization/Deserialization

Python 297 49 Updated Feb 6, 2026