Lists (21)
Claude
COMFY UI
CUDA GPU
DiT Acceleration
Collection of acceleration methods specifically for DiT
GLSL
INTER
Kinect
Linux
ML Library
PYTHON TOOLS
RAG
Resources
Splat
STABLE DIFFUSION
List of SD workflows and useful components
STREAM_DIFFUSION
Sync
TensorRT
Touchdesigner
TRITON
UPSCALING
WINDOWS
Stars
This is a list of useful libraries and resources for CUDA development.
Fast SAM 3D Body: Accelerating SAM 3D Body for Real-Time Full-Body Human Mesh Recovery
Faster Green Screen Keys — async multi-GPU inference engine for professional VFX pipelines
pprofile + matplotlib = Python program profiled as an awesome heatmap!
A powerful set of Python debugging tools, based on PySnooper
Comprehensive GPU specifications database with 2,824 GPUs across NVIDIA, AMD, and Intel
Adobe's reference implementation of the OpenPBR BSDF
Tangle is a web app that allows users to build and run machine learning pipelines without having to set up a development environment.
This is a series of GPU optimization topics that explains in detail how to optimize CUDA kernels, covering several basic kernel optimizations, including: elementwise, reduce, s…
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Example models using DeepSpeed
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Simple real time visualisation of the execution of a Python program.
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research; we are continuously improving the project. Welcome to PR the works (p…
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
[ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching
AI agents running research on single-GPU nanochat training automatically
Courses on building, compressing, evaluating, and deploying efficient AI models.
Unbearably fast near-real-time pure-Python runtime-static type-checker.
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
Official Repository of the paper "Trajectory Consistency Distillation"