Skip to content
View romanlutz's full-sized avatar

Organizations

@fairlearn

Block or report romanlutz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Repository for "Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models"

Python 1 Updated Mar 30, 2026

Squad: AI agent teams for any project

TypeScript 1,510 197 Updated Mar 30, 2026

A tool that validates academic paper references

Python 276 34 Updated Mar 29, 2026

Gas Town - multi-agent workspace manager

Go 13,264 1,170 Updated Mar 30, 2026

An extremely fast Python type checker and language server, written in Rust.

Python 18,106 275 Updated Mar 30, 2026

This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…

Python 5,518 3,274 Updated Mar 30, 2026

Playwright MCP server

TypeScript 30,004 2,417 Updated Mar 28, 2026

[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

Python 47 7 Updated Feb 9, 2026

Benchmarking LLM agents on Cyber Threat Investigation.

Jupyter Notebook 118 20 Updated Feb 5, 2026

Simple Prompt Injection Kit for Evaluation and Exploitation

HTML 163 35 Updated Mar 27, 2026

Recursively scan a Python module and export numpydoc docstrings to JSON

TypeScript 3 Updated May 14, 2025

Open-Source Apprenticeship Program

3 Updated May 22, 2025

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python 113 11 Updated Dec 2, 2024
Python 10 3 Updated Jun 3, 2025

File support for asyncio

Python 3,237 164 Updated Oct 9, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,593 2,154 Updated Sep 12, 2025

AI Red Teaming playground labs to run AI Red Teaming trainings including infrastructure.

TypeScript 1,886 281 Updated Feb 13, 2026

Gather metrics on issues/prs/discussions such as time to first response, count of issues opened, closed, etc.

Python 528 91 Updated Mar 29, 2026

A Text-Based Environment for Interactive Debugging

Python 296 40 Updated Mar 23, 2026
Jupyter Notebook 3 Updated Feb 25, 2025
TypeScript 72 6 Updated Feb 11, 2026

Library for building WebSocket servers and clients in Python

Python 5,651 587 Updated Mar 8, 2026

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,917 204 Updated May 21, 2025

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 316 61 Updated Sep 16, 2024

This repository curates a collection of monthly white papers focused on the latest LLM attack and defenses.

25 2 Updated Oct 10, 2024

Creating a non-player character in a game backed by generative AI that will stay focused on its goals

Python 4 Updated Sep 27, 2024

Results and Analysis of Single-Turn Crescendo Attacks (STCA) on Large Language Models: Evaluating vulnerabilities in content moderation through adversarial techniques.

2 1 Updated Sep 12, 2024

A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.

Python 181 37 Updated Feb 26, 2026

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line …

TypeScript 18,793 1,611 Updated Mar 30, 2026

Test Software for the Characterization of AI Technologies

Python 282 58 Updated Mar 29, 2026
Next