Starred repositories
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
an OpenClaw skill that can generate paper search-review-critque expert-agent relevant to specific topics (we use Scientific ML and 3D geometry surrogate modeling as a demo).
Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library
AI Paper Review Prompts
Reverse Engineering Resources About All Platforms(Windows/Linux/macOS/Android/iOS/IoT) And Every Aspect! (More than 3500 open source tools and 2300 posts&videos)
Dataset of conversations, generated by prompting Gemini Ultra. These are conversations between a teacher and a student, where the teacher is prompted with specific topic to teach the student, and t…
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors, EMNLP 2024
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral
Easy token price estimates for 400+ LLMs. TokenOps.
OpenRefine is a free, open source power tool for working with messy data and improving it
An AI-powered task-management system you can drop into Cursor, Lovable, Windsurf, Roo, and others.
TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.
Push docker images directly to remote servers without an external registry
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
An open-source AI agent that lives in your terminal.
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
[ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
Draw pretty maps from OpenStreetMap data! Built with osmnx +matplotlib + shapely