Skip to content
View sott0n's full-sized avatar
  • Tenstorrent
  • Tokyo
  • 11:26 (UTC +09:00)

Block or report sott0n

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Claude Opus 4.6 wrote a dependency-free C compiler in Rust, with backends targeting x86 (64- and 32-bit), ARM, and RISC-V, capable of compiling a booting Linux kernel.

Rust 2,568 197 Updated Feb 5, 2026

Anthropic's original performance take-home, now open for you to try!

Python 3,722 847 Updated Jan 22, 2026

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

C++ 882 65 Updated Mar 24, 2026

MLIR-based partitioning system

MLIR 174 32 Updated Mar 25, 2026
6 1 Updated Nov 12, 2025

Low Level Software Documentation

4 Updated Mar 23, 2026

Simple MPI implementation for prototyping or learning

C 305 11 Updated Aug 6, 2025

IREE's PyTorch Frontend, based on Torch Dynamo.

Python 106 80 Updated Mar 24, 2026

Frontend integration for PyTorch with tt-mlir

Python 23 10 Updated Mar 2, 2026

The TT-Forge ONNX is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their performance and efficiency.

Python 54 27 Updated Mar 24, 2026

COCONUT-SVSM

Rust 208 81 Updated Mar 24, 2026

Retargetable ML compilers for the twenty-first century!

Python 13 4 Updated Apr 22, 2025

Tenstorrent MLIR compiler

MLIR 253 118 Updated Mar 25, 2026

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 12,017 1,097 Updated Aug 18, 2024

Inference Llama 2 in one file of pure C

C 19,317 2,473 Updated Aug 6, 2024

LLM inference in C/C++

C++ 99,204 15,773 Updated Mar 24, 2026

LLM training in simple, raw C/CUDA

Cuda 29,257 3,445 Updated Jun 26, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,733 324 Updated Oct 19, 2024

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,761 568 Updated Dec 18, 2025

LLMPerf is a library for validating and benchmarking LLMs

Python 1,097 203 Updated Dec 9, 2024

🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.

C++ 1,385 391 Updated Mar 25, 2026

Shared Middle-Layer for Triton Compilation

MLIR 330 94 Updated Dec 5, 2025

A compiler for homomorphic encryption

C++ 694 127 Updated Mar 25, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 25,803 2,784 Updated Mar 24, 2026

Backward compatible ML compute opset inspired by HLO/MHLO

MLIR 630 185 Updated Mar 24, 2026
MLIR 172 54 Updated Mar 25, 2026

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C 947 202 Updated Mar 18, 2026

TPP experimentation on MLIR for linear algebra

MLIR 146 38 Updated Mar 22, 2026
Next