NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,826 2,327 Updated Mar 9, 2026

gpustack / gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 4,717 483 Updated Mar 25, 2026

Infrasys-AI / AISystem

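AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.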

Jupyter Notebook 16,512 2,342 Updated Sep 3, 2025

koordinator-sh / koordinator

A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, and AI jobs.

Go 1,666 410 Updated Mar 24, 2026

modelscope / easydistill

A toolkit for knowledge distillation of large language models

Python 302 32 Updated Mar 10, 2026

arcee-ai / DistillKit

An Open Source Toolkit For LLM Distillation

Python 897 120 Updated Mar 14, 2026

boto / boto3

Boto3, an AWS SDK for Python

Python 9,744 1,962 Updated Mar 24, 2026

openstack / ironic

A service for managing and provisioning Bare Metal servers. Mirror of code maintained at opendev.org.

Python 540 368 Updated Mar 22, 2026

alauda / kubeflow-chart

Kubeflow helm chart

Dockerfile 145 29 Updated Jun 30, 2023

minio / minio

MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.

Go 60,550 7,304 Updated Feb 12, 2026

kubeflow / katib

Automated Machine Learning on Kubernetes

Python 1,669 518 Updated Mar 20, 2026

pytorch / serve

Serve, optimize and scale PyTorch models in production

Java 4,360 888 Updated Aug 6, 2025

SeldonIO / alibi

Algorithms for explaining machine learning models

Python 2,621 263 Updated Oct 17, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 18,763 2,698 Updated Mar 25, 2026

HaojueWang haojue

Stars

halide / Halide

PX4 / eigen

OpenMathLib / OpenBLAS

NVIDIA / nvidia-container-toolkit

jax-ml / jax

pytorch / torchdynamo

NVIDIA / Megatron-LM

sgl-project / sglang

vllm-project / vllm

onnx / onnx

microsoft / onnxruntime

openai / CLIP

langchain-ai / langgraph

microsoft / autogen

agentscope-ai / agentscope

apache / tvm

NVIDIA / TensorRT