haojue
  • IBM
  • Beijing


a language for fast, portable data-parallel computation

C++ 6,611 1,095 Updated Mar 24, 2026

Eigen is a C++ template library for linear algebra: matrices, vectors, numerical solvers, and related algorithms.

C++ 988 179 Updated Oct 18, 2023

OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.

C 7,350 1,656 Updated Mar 24, 2026

Build and run containers leveraging NVIDIA GPUs

Go 4,194 497 Updated Mar 22, 2026

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,214 3,489 Updated Mar 25, 2026

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Python 1,078 129 Updated Apr 17, 2024

Ongoing research training transformer models at scale

Python 15,795 3,747 Updated Mar 25, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 24,994 4,986 Updated Mar 25, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,243 14,745 Updated Mar 25, 2026

Open standard for machine learning interoperability

Python 20,533 3,903 Updated Mar 24, 2026

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 19,653 3,788 Updated Mar 25, 2026

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,946 3,969 Updated Feb 18, 2026

Build resilient language agents as graphs.

Python 27,437 4,715 Updated Mar 24, 2026

A programming framework for agentic AI

Python 56,173 8,445 Updated Mar 21, 2026

Build and run agents you can see, understand and trust.

Python 19,455 1,831 Updated Mar 24, 2026

Open Machine Learning Compiler Framework

Python 13,220 3,831 Updated Mar 25, 2026

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,826 2,327 Updated Mar 9, 2026

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 4,717 483 Updated Mar 25, 2026

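AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.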

Jupyter Notebook 16,512 2,342 Updated Sep 3, 2025

A QoS-based scheduling system that brings optimal layout and status to workloads such as microservices, web services, big data jobs, and AI jobs.

Go 1,666 410 Updated Mar 24, 2026

a toolkit on knowledge distillation for large language models

Python 302 32 Updated Mar 10, 2026

An Open Source Toolkit For LLM Distillation

Python 897 120 Updated Mar 14, 2026

Boto3, an AWS SDK for Python

Python 9,744 1,962 Updated Mar 24, 2026

A service for managing and provisioning Bare Metal servers. Mirror of code maintained at opendev.org.

Python 540 368 Updated Mar 22, 2026

Kubeflow helm chart

Dockerfile 145 29 Updated Jun 30, 2023

MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.

Go 60,550 7,304 Updated Feb 12, 2026

Automated Machine Learning on Kubernetes

Python 1,669 518 Updated Mar 20, 2026

Serve, optimize and scale PyTorch models in production

Java 4,360 888 Updated Aug 6, 2025

Algorithms for explaining machine learning models

Python 2,621 263 Updated Oct 17, 2025

Development repository for the Triton language and compiler

MLIR 18,763 2,698 Updated Mar 25, 2026