Starred repositories
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
ID-based RAG FastAPI: Integration with Langchain and PostgreSQL/pgvector
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…
A collection of learning resources for curious software engineers
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our Discord: https://discord.gg/5xXzkMu8Zk
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
A collection of AI-related books and PDFs for learning and downloading, plus some playground models for learning
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
A high-throughput and memory-efficient inference and serving engine for LLMs
Awesome Reasoning LLM Tutorial/Survey/Guide
Vane is an AI-powered answering engine.
A library for efficient similarity search and clustering of dense vectors.
Fast and memory-efficient exact attention
An annotated implementation of the Transformer paper.
The official Python library for the OpenAI API
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
A course on aligning smol models.
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Fast and accurate automatic speech recognition (ASR) for edge devices
An app that brings language models directly to your phone.
On-device AI across mobile, embedded and edge for PyTorch
Best practices & guides on how to write distributed PyTorch training code
An Arduino library for the Nano 33 BLE Sense that leverages Mbed OS to automatically place sensor measurements in a ring buffer that can be integrated into programs in a simple manner.
A curated list of awesome information retrieval resources
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends