- Santa Clara, California, United States
-
11:29
(UTC -07:00) - https://www.linkedin.com/in/jaemincs/
Stars
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end reference examples to build with Nemotron models
Training library for Megatron-based models with bidirectional Hugging Face conversion capability
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.
Development repository for the Triton language and compiler
Ongoing research training transformer models at scale
Color effects manager for Razer devices for macOS. Supports High Sierra (10.13) to Monterey (12.0). Made by the community, based on openrazer.
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
An HPC-oriented, parallel programming language targeting Charm++. Aims to be to C++ as Scala is to Java.
Graph Neural Network Library for PyTorch
Python package built to ease deep learning on graph, on top of existing DL frameworks.
Menubar Tool to set Charge Limits and Prolong Battery Lifespan
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
An implementation of a deep learning recommendation model (DLRM)
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
CLI11 is a command line parser for C++11 and beyond that provides a rich feature set with a simple and intuitive interface.