Skip to content
View minitu's full-sized avatar

Organizations

@UIUC-PPL

Block or report minitu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 879 76 Updated Mar 19, 2026

Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end reference examples to build with Nemotron models

Jupyter Notebook 810 170 Updated Mar 30, 2026

Training library for Megatron-based models with bidirectional Hugging Face conversion capability

Python 539 240 Updated Mar 30, 2026

LOA Logs - Modern DPS Meter for Lost Ark

Rust 201 69 Updated Mar 29, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,270 909 Updated Mar 30, 2026

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,249 679 Updated Mar 25, 2026

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 275 48 Updated Mar 30, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,008 3,394 Updated Mar 30, 2026

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,940 1,518 Updated Mar 25, 2026

PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.

Python 1,450 111 Updated Mar 29, 2026

Development repository for the Triton language and compiler

MLIR 18,798 2,715 Updated Mar 30, 2026

Ongoing research training transformer models at scale

Python 15,854 3,771 Updated Mar 30, 2026

Color effects manager for Razer devices for macOS. Supports High Sierra (10.13) to Monterey (12.0). Made by the community, based on openrazer.

JavaScript 2,558 295 Updated Apr 5, 2024

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

5,026 498 Updated Jul 30, 2024

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,655 660 Updated Mar 30, 2026
Jupyter Notebook 2 2 Updated Apr 25, 2024

[DEPRECATED] Moved to ROCm/rocm-systems repo

C++ 146 43 Updated Mar 28, 2026

An HPC-oriented, parallel programming language targeting Charm++. Aims to be to C++ as Scala is to Java.

Scala 3 Updated Mar 10, 2022

DaCe - Data Centric Parallel Programming

Python 581 155 Updated Mar 30, 2026

Near-optimal Prefetching System

33 6 Updated Nov 17, 2021

Graph Neural Network Library for PyTorch

Python 23,623 3,973 Updated Mar 27, 2026

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 14,256 3,056 Updated Jul 31, 2025

Menubar Tool to set Charge Limits and Prolong Battery Lifespan

Swift 9,007 331 Updated Mar 27, 2026

HPC Container Maker

Python 512 100 Updated Mar 13, 2026

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

C++ 1,054 204 Updated Mar 12, 2026

An implementation of a deep learning recommendation model (DLRM)

Python 4,027 868 Updated Jan 12, 2026

Open MPI main development repository

C 2,556 957 Updated Mar 30, 2026

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

C++ 2,307 191 Updated Feb 7, 2024

CLI11 is a command line parser for C++11 and beyond that provides a rich feature set with a simple and intuitive interface.

C++ 4,217 436 Updated Mar 30, 2026

GPUDirect Async support for IB Verbs

C++ 136 19 Updated Nov 10, 2022
Next