Skip to content
View rootfs's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@openshift @ceph @coreos-inc @rook @fast-ml @redhat-et @os-climate @llm-d

Block or report rootfs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 6,721 542 Updated Mar 23, 2026

TabICLv2: A state-of-the-art tabular foundation model

Python 658 84 Updated Mar 24, 2026

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Python 5,922 599 Updated Mar 24, 2026

Implementation of the sap-rpt-1-oss deep learning model with inference pipeline as described in the paper "ConTextTab: A Semantics-Aware Tabular In-Context Learner".

Python 156 22 Updated Nov 27, 2025

TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models

Python 72 7 Updated Mar 16, 2026

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 3,129 231 Updated Mar 24, 2026

A high-performance and light-weight router for vLLM large scale deployment

Rust 161 57 Updated Mar 20, 2026

Modeling, training, eval, and inference code for OLMo

Python 6,415 723 Updated Nov 24, 2025
Python 108 11 Updated Jun 2, 2025

Bringing BERT into modernity via both architecture changes and scaling

Python 1,647 144 Updated Mar 1, 2026

A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval

JavaScript 13,720 1,373 Updated Mar 11, 2026

A framework for efficient model inference with omni-modality models

Python 3,734 617 Updated Mar 24, 2026

RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.

Python 72 15 Updated Feb 18, 2026

The Future of Data Engineering — A CLI SQL client for the modern data stack, enabling AI-native context engineering for data.

Python 1,124 166 Updated Mar 21, 2026

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 3,523 584 Updated Mar 24, 2026

LLM Semantic Router: Intelligent Mixture-of-Models (MoM) System with Privacy Preservation and Prompt Guard. The semantic router intelligently directs OpenAI compliant API requests to the most suita…

Python 21 12 Updated Aug 30, 2025

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

Swift 2,930 451 Updated Jan 4, 2025

CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on Apple's MobileCLIP-S0 architecture, it ensures optimal perfor…

Swift 90 11 Updated Jul 25, 2024

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,701 364 Updated Mar 24, 2026

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 483 57 Updated Apr 19, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,776 1,016 Updated Mar 9, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,935 318 Updated Jan 14, 2026
Python 4,626 372 Updated Mar 16, 2026

Cloud Native Observability and Policy Engine for LLM Applications

Python 7 1 Updated Mar 19, 2026

GitHub Action to Create an AWS EC2 Self-hosted Runner

Shell 2 1 Updated Mar 19, 2026

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

2,210 200 Updated Apr 30, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,162 13,595 Updated Mar 21, 2026

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks.This framework enables Claud…

Python 11,168 1,146 Updated Dec 12, 2024

Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget

Python 167 9 Updated Aug 11, 2025
Next