Stars
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!
OCR model that handles complex tables, forms, handwriting with full layout.
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
An awesome & curated list of best LLMOps tools for developers
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
The AI Browser Automation Framework
A Kubernetes deployable instance of GroundX for document parsing, storage, and search.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
Command-line program to download videos from YouTube.com and other video sites
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
SwarmZero's SDK for building AI agents, swarms of agents and much more.
The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
OCR, layout analysis, reading order, table recognition in 90+ languages
A proxy server for multiple ollama instances with Key security
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Python hands on tutorial with 50+ Python Application (10 lines of code) By @xiaowuc2
500 AI Machine learning Deep learning Computer vision NLP Projects with code