wangyikewxgm
  • alibabacloud
  • Beijing

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 334,607 65,290 Updated Mar 25, 2026

JobSet: a k8s native API for distributed ML training and HPC workloads

Python 321 110 Updated Mar 24, 2026

Algorithm powering the For You feed on X

Rust 16,138 2,789 Updated Jan 20, 2026

Kubernetes-native AI serving platform for scalable model serving.

Go 272 73 Updated Mar 19, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,712 364 Updated Mar 24, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,688 544 Updated Mar 24, 2026

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,273 98 Updated Aug 28, 2025

Nano vLLM

Python 12,408 1,775 Updated Nov 3, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,969 623 Updated Mar 25, 2026

A collection of compiler learning resources.

Python 2,698 367 Updated Mar 19, 2025

My learning notes for ML SYS.

Python 5,766 374 Updated Mar 19, 2026

Deep Reinforcement Learning

4,570 677 Updated Dec 10, 2022

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,177 3,488 Updated Mar 25, 2026

Train transformer language models with reinforcement learning.

Python 17,778 2,587 Updated Mar 25, 2026

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,000 61 Updated Mar 3, 2026

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 951 47 Updated Oct 29, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,823 2,327 Updated Mar 9, 2026

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 166,072 15,144 Updated Mar 25, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 24,973 4,979 Updated Mar 25, 2026

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,360 891 Updated Dec 17, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 31,762 3,347 Updated Mar 24, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,354 32,597 Updated Mar 24, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,847 2,226 Updated Mar 24, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,209 14,733 Updated Mar 25, 2026

Fast and memory-efficient exact attention

Python 22,959 2,548 Updated Mar 25, 2026

Kubernetes community content

Jupyter Notebook 12,800 5,353 Updated Mar 23, 2026

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Go 5,257 1,417 Updated Mar 24, 2026

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 182,791 46,215 Updated Mar 25, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,523 9,459 Updated Nov 12, 2025