Organizations

@apache @dmlc @octoml

A TUI Git client inspired by Magit

Rust 2,663 141 Updated Mar 28, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,446 491 Updated Mar 30, 2026

CUDA/Metal accelerated language model inference

C 631 32 Updated May 29, 2025

RPyC (Remote Python Call) - A transparent and symmetric RPC library for python

Python 1,695 250 Updated Aug 14, 2025

📚 Jupyter notebook tutorials for OpenVINO™

Jupyter Notebook 3,079 1,010 Updated Mar 30, 2026

Embree ray tracing kernels repository.

C++ 2,670 422 Updated Mar 24, 2026

Universal LLM Deployment Engine with ML Compilation

Python 22,293 1,979 Updated Mar 29, 2026

Build system, successor to Buck

Rust 4,295 335 Updated Mar 31, 2026

MoonRay is DreamWorks’ open-source, award-winning, state-of-the-art production MCRT renderer.

CMake 4,612 289 Updated Feb 4, 2026

Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052

C++ 478 37 Updated Mar 15, 2024

Language Modeling with the H3 State Space Model

Assembly 522 51 Updated Sep 29, 2023

An open-source, efficient deep learning framework/compiler, written in Python.

Python 740 68 Updated Sep 4, 2025

An efficient vector-graphics renderer

Rust 2,641 55 Updated May 16, 2023

A GPU compute-centric 2D renderer.

Rust 3,886 230 Updated Mar 30, 2026

A modern cross-platform low-level graphics library and rendering framework

Batchfile 4,248 374 Updated Mar 29, 2026

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,713 382 Updated Mar 16, 2026

Real-time GPU path tracing with an OpenUSD Hydra render delegate

C++ 605 49 Updated Aug 8, 2025

This is the development repository for the OpenFHE library. The current development version is 1.5.0 (released on February 26, 2026). The current stable version is 1.4.2 (released on October 20, 20…

C++ 1,097 285 Updated Mar 30, 2026

3D fluid simulation experiments in Rust, using WebGPU-rs (WIP)

Rust 477 16 Updated Dec 17, 2022

HLSL 492 78 Updated Mar 10, 2026

Python 50 8 Updated Mar 29, 2023

A STARK prover and verifier for arbitrary computations

Rust 888 226 Updated Jul 19, 2025

The Flutter engine

C++ 7,581 5,979 Updated Feb 25, 2025

A General-purpose Task-parallel Programming System in C++

C++ 11,849 1,384 Updated Mar 29, 2026

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,610 302 Updated Mar 27, 2026

Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer

C 3,290 284 Updated Aug 28, 2024

Vulkan and Rust experiments, including a spectral path tracer using Vulkan ray tracing extensions

Rust 131 5 Updated Sep 13, 2025

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 17,346 2,061 Updated Feb 2, 2026

magic-trace collects and displays high-resolution traces of what a process is doing

OCaml 5,275 128 Updated Mar 17, 2026

3D engine with modern graphics

C 6,954 733 Updated Mar 26, 2026