Skip to content
View hvy's full-sized avatar
🏃‍♂️
Focusing
🏃‍♂️
Focusing

Highlights

  • Pro

Organizations

@pfnet @pfnet-research @chainer @cupy @optuna

Block or report hvy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,146 13,597 Updated Mar 21, 2026

AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.

Python 187 47 Updated Mar 23, 2026

Inference server benchmarking tool

Rust 145 27 Updated Oct 2, 2025

Preferred Generation Benchmark

Python 92 16 Updated Mar 6, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,239 262 Updated Dec 15, 2025

Nano vLLM

Python 12,404 1,775 Updated Nov 3, 2025
TypeScript 48 Updated May 12, 2025

The code of several works on oimo.io/works

Haxe 1,458 60 Updated Jan 15, 2025

An Intel 8086 Emulator created in Rust.

Rust 430 67 Updated Feb 16, 2024

Collective communications library with various primitives for multi-machine training.

C++ 1,407 352 Updated Mar 20, 2026

Pipeline Parallelism for PyTorch

Python 786 88 Updated Aug 21, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,925 605 Updated May 3, 2024

The registry of the OptunaHub packages

Jupyter Notebook 51 56 Updated Mar 23, 2026

Python library to use and implement packages in OptunaHub

Python 55 14 Updated Mar 2, 2026

DiscoGrad - automatically differentiate across conditional branches in C++ programs

C++ 211 5 Updated Sep 12, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,872 70 Updated Jun 22, 2025

Development repository for the Triton language and compiler

MLIR 18,753 2,695 Updated Mar 24, 2026

A curated list for Efficient Large Language Models

Python 1,970 155 Updated Jun 17, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,361 891 Updated Dec 17, 2024

LLM inference in C/C++

C++ 99,172 15,758 Updated Mar 24, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,179 2,211 Updated Mar 24, 2026

Code release for NeuS

Python 1,767 221 Updated Feb 28, 2024

Google Research

Jupyter Notebook 37,530 8,365 Updated Mar 24, 2026

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 550 70 Updated Mar 17, 2026

Inference code for Llama models

Python 59,255 9,823 Updated Jan 26, 2025

Extended functionalities for Optuna in combination with third-party libraries.

Python 65 42 Updated Mar 16, 2026

A curated list of awesome neural radiance fields papers

TeX 6,771 601 Updated Jan 6, 2025

CPU assembly examples

Assembly 88 6 Updated May 19, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,387 775 Updated Mar 18, 2026
Next