Stars
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A Model Context Protocol (MCP) server that provides Claude with advanced mathematical calculation capabilities
A fork to add multimodal model training to open-r1
Awesome MCP Servers - A curated list of Model Context Protocol servers
Instant voice cloning by MIT and MyShell. Audio foundation model.
PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Discover Unknown Unsafe Events via Generative Simulation
No fortress, purely open ground. OpenManus is Coming.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Robust Speech Recognition via Large-Scale Weak Supervision
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
A template repo for Python packages
Toolkit for linearizing PDFs for LLM datasets/training
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
Get your documents ready for gen AI
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
All-in-one LLM CLI tool featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, with access to OpenAI, Claude, Gemini, Ollama, Groq, and more.
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows