Skip to content
View hargup's full-sized avatar

Organizations

@AGV-IIT-KGP @metakgp @Azad-Hall

Block or report hargup

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

amdgpu example code in hip/asm

C++ 58 28 Updated Mar 18, 2026

A skill for thinking

468 34 Updated Mar 23, 2026

🦄 ai that works - every tuesday 10 AM PST

TypeScript 1,629 126 Updated Mar 20, 2026

Machine Learning Engineering Open Book

Python 17,515 1,111 Updated Mar 16, 2026

🤗 smolagents: a barebones library for agents that think in code.

Python 26,256 2,395 Updated Mar 13, 2026

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 2,164 185 Updated Mar 24, 2026

It is an LLM-based AI agent, which can write correct and efficient gpu kernels automatically.

Python 80 13 Updated Mar 24, 2026

Verified tensor graph optimization in Lean 4: constructive soundness proofs + equality saturation + verified extraction via e-graph↔circuit bijection + multi-target code generation.

Lean 3 Updated Mar 7, 2026
Lean 8 1 Updated Mar 2, 2026

Verified GPU programming framework for Lean 4. Write type-safe WebGPU shaders with formal verification, hardware-accelerated matrix ops, and cross-platform support (Metal/Vulkan/D3D12). Build prova…

Lean 15 1 Updated Feb 19, 2026

Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.

Cuda 18 1 Updated Feb 9, 2026

A lightweight multi-GPU inference engine for LLMs on mid/low-end GPUs.

C++ 6 1 Updated Mar 12, 2026

cuGraph - RAPIDS Graph Analytics Library

Cuda 2,148 346 Updated Mar 24, 2026

KV Cache & LoRA for minGPT

Python 60 8 Updated Mar 4, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,826 1,240 Updated Mar 24, 2026

Heterogeneous GPU Sharing on Kubernetes

Go 3,134 491 Updated Mar 23, 2026
Coq 73 3 Updated May 29, 2019
SystemVerilog 64 6 Updated Feb 25, 2026

Open-source CUDA compiler targeting multiple GPU architectures. Compiles .cu to AMD and Tenstorrent GPU's

C 1,514 64 Updated Mar 17, 2026

Assembler for NVIDIA Maxwell architecture

Sass 1,060 171 Updated Jan 3, 2023

Intel® Nervana™ reference deep learning framework committed to best performance on all hardware

Python 3,870 810 Updated Dec 23, 2020

An ARC-AGI solution using Agentica from Symbolica

Python 167 16 Updated Feb 12, 2026

ALMA (Automated meta-Learning of Memory designs for Agentic systems) is a framework that meta-learns memory designs to replace human-engineered designs for agentic system.

Python 184 23 Updated Mar 19, 2026

A huge collection of VHDL/Verilog open-source IP cores scraped from the web

576 169 Updated Jan 18, 2023

SQLite bindings for Lean

C 39 1 Updated Mar 12, 2026

[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations

Python 150 7 Updated Mar 24, 2026

Multi-agent communication extension for pi coding agent

TypeScript 404 32 Updated Mar 19, 2026

A collection of GPU kernels and other experiments comparing Torch, Triton etc to Modular/Mojo

Jupyter Notebook 3 Updated Jan 8, 2026

The World's First Agentic IDE. Visual dashboard: live sessions, task management, code editor, terminal. Epic Swarm parallel workflows. Auto-proceed rules. Automation patterns. Beads + Agent Mail +…

Svelte 181 21 Updated Mar 24, 2026
Python 75 9 Updated Mar 15, 2026
Next