Stars
Beginner, advanced, expert level Rust training material
AutoGaze automatically removes redundant patches in a video, reducing #tokens in ViT/MLLM by 4x-100x.
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
[CVPR 2026] Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
VLA-0: Building State-of-the-Art VLAs with Zero Modification
[RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.
[CoRL 2025] Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild"
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
[CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
GigaWorld-Policy: An Efficient Action-Centered World–Action Model
An interface library for RL post training with environments.
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
This is the official **live** mirror of Ascend-native hardware plugin for ray
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
[NeurIPS2025 Spotlight] Implementation of "GaussianFusion: Gaussian-Based Multi-Sensor Fusion for End-to-End Autonomous Driving"
[ASPLOS 2026] CLM: Removing the GPU Memory Barrier for 3D Gaussian Splatting with CPU Offloading
Offical implementation of CVPR 2026 paper SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving.
Accelerating MoE with IO and Tile-aware Optimizations
DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images
A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node