Stars
AI agents running research on single-GPU nanochat training automatically
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent
A game theoretic approach to explain the output of any machine learning model.
Application and blog explaining my interpretations of In-run Data Shapley
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)
Unbiased Learning To Rank Algorithms (ULTRA)
This repository implements a fast method for auditing the robustness of LLM ranking systems, such as Chatbot Arena, to dropping very small number of preferences.
A package for Displaying and Computing Benchmarking Results of Algorithmic Recourse and Counterfactual Explanation Algorithms
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Connect AI models like Claude & GPT with robots using MCP and ROS.
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
A high-throughput and memory-efficient inference and serving engine for LLMs
A Python Toolkit for Explainable IR methods
Code for visualizing the loss landscape of neural nets
[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
A large-scale face dataset for hair segmentation, hair recognition, and GANs for hair generation and editing.
[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features
Code for Debiasing Vision-Language Models via Biased Prompts
ir_explain: a Python Library of Explainable IR Methods
This repository is a curated collection of information (keywords, papers, libraries, books, etc.) about counterfactual explanations🙃 Contributions are welcome! Our maintenance capacity is limited, …
SpuCo is a Python package developed to further research to address spurious correlations.