This repository implements a fast method for auditing the robustness of LLM ranking systems, such as Chatbot Arena, to dropping a very small number of preferences.

Jupyter Notebook 2 Updated Jan 20, 2026

charmlab / recourse_benchmarks

A package for Displaying and Computing Benchmarking Results of Algorithmic Recourse and Counterfactual Explanation Algorithms

Python 8 6 Updated Feb 10, 2026

Farama-Foundation / chatarena

ChatArena (or Chat Arena) is a library of multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.

Python 1,540 146 Updated Aug 11, 2025

robotmcp / ros-mcp-server

Connect AI models like Claude & GPT with robots using MCP and ROS.

Python 1,119 161 Updated Mar 27, 2026

facebookresearch / DPR

Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

Python 1,863 316 Updated Apr 6, 2023

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,516 14,868 Updated Mar 28, 2026

souravsaha / ir_explain

A Python Toolkit for Explainable IR methods

JavaScript 6 Updated Apr 29, 2025

tomgoldstein / loss-landscape

Code for visualizing the loss landscape of neural nets

Python 3,161 436 Updated Apr 5, 2022

hila-chefer / Transformer-MM-Explainability

[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…

Jupyter Notebook 903 116 Updated Aug 24, 2023

hila-chefer / Transformer-Explainability

[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

Jupyter Notebook 1,988 259 Updated Jan 24, 2024

cpuimage / CelebAHairMask-HQ

A large-scale face dataset for hair segmentation, hair recognition, and GANs for hair generation and editing.

88 7 Updated Mar 1, 2025

tding1 / Neural-Collapse

[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features

Python 61 8 Updated Jul 19, 2022

izmailovpavel / spurious_feature_learning

Python 48 6 Updated Jan 17, 2023

chingyaoc / debias_vl

Code for Debiasing Vision-Language Models via Biased Prompts

Python 60 5 Updated May 16, 2023

souravsaha / ir_explain_old

ir_explain: a Python Library of Explainable IR Methods

JavaScript 6 1 Updated Jul 5, 2024

daikikatsuragawa / awesome-counterfactual-explanations

This repository is a curated collection of information (keywords, papers, libraries, books, etc.) about counterfactual explanations🙃 Contributions are welcome! Our maintenance capacity is limited, …

23 Updated Oct 27, 2022

BigML-CS-UCLA / SpuCo

SpuCo is a Python package developed to further research on addressing spurious correlations.

Jupyter Notebook 26 10 Updated Jan 16, 2025

Hosna Oyarhoseini hosnahoseini

Stars

karpathy / autoresearch

MrHuff / PREF-SHAP

castorini / umbrela

texttron / BrowseComp-Plus

shap / shap

parthshr370 / Data-Shapley-in-One-Training-Run-Code

kohpangwei / group-influence-release

lmarena / arena-rank

karpathy / nanochat

ovg-project / kvcached

sail-sg / Rigging-ChatbotArena

ULTR-Community / ULTRA

dustalov / evalica

JennyHuang19 / IsRankingRobust