Roman Lutz romanlutz

Responsible AI Engineering Lead | AI Red Teaming | Open Source

232 followers · 230 following

Microsoft
Greater Seattle Area, WA, USA
https://romanlutz.github.io
in/romanlutz
@romanlutz.bsky.social

Achievements

x4 x3

Starred repositories

Social-AI-Studio / ComicJailbreak

Repository for "Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models"

Python 1 Updated Mar 30, 2026

bradygaster / squad

Squad: AI agent teams for any project

TypeScript 1,510 197 Updated Mar 30, 2026

markrussinovich / refchecker

A tool that validates academic paper references

Python 276 34 Updated Mar 29, 2026

steveyegge / gastown

Gas Town - multi-agent workspace manager

Go 13,264 1,170 Updated Mar 30, 2026

astral-sh / ty

An extremely fast Python type checker and language server, written in Rust.

Python 18,106 275 Updated Mar 30, 2026

Azure / azure-sdk-for-python

This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…

Python 5,518 3,274 Updated Mar 30, 2026

microsoft / playwright-mcp

Playwright MCP server

TypeScript 30,004 2,417 Updated Mar 28, 2026

OSU-NLP-Group / RedTeamCUA

[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

Python 47 7 Updated Feb 9, 2026

microsoft / SecRL

Benchmarking LLM agents on Cyber Threat Investigation.

Jupyter Notebook 118 20 Updated Feb 5, 2026

ReversecLabs / spikee

Simple Prompt Injection Kit for Evaluation and Exploitation

HTML 163 35 Updated Mar 27, 2026

stefanv / npdoc2json

Recursively scan a Python module and export numpydoc docstrings to JSON

TypeScript 3 Updated May 14, 2025

cicscareers / OSAP

Open-Source Apprenticeship Program

3 Updated May 22, 2025

allenai / wildguard

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Python 113 11 Updated Dec 2, 2024

KutalVolkan / ai_recruiter

Python 10 3 Updated Jun 3, 2025

Tinche / aiofiles

File support for asyncio

Python 3,237 164 Updated Oct 9, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,593 2,154 Updated Sep 12, 2025

microsoft / AI-Red-Teaming-Playground-Labs

AI Red Teaming playground labs to run AI Red Teaming trainings including infrastructure.

TypeScript 1,886 281 Updated Feb 13, 2026

github-community-projects / issue-metrics

Gather metrics on issues/prs/discussions such as time to first response, count of issues opened, closed, etc.

Python 528 91 Updated Mar 29, 2026

microsoft / debug-gym

A Text-Based Environment for Interactive Debugging

Python 296 40 Updated Mar 23, 2026

CarsonDon / Multilingual-Vuln-LLMs

Jupyter Notebook 3 Updated Feb 25, 2025

microsoft / agdebugger

TypeScript 72 6 Updated Feb 11, 2026

python-websockets / websockets

Library for building WebSocket servers and clients in Python

Python 5,651 587 Updated Mar 8, 2026

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,917 204 Updated May 21, 2025

AI-secure / DecodingTrust

A Comprehensive Assessment of Trustworthiness in GPT Models

Python 316 61 Updated Sep 16, 2024

0din-ai / 0Din-Curated-Monthly-White-Papers

This repository curates a collection of monthly white papers focused on the latest LLM attack and defenses.

25 2 Updated Oct 10, 2024

jennifermarsman / FocusedNPC

Creating a non-player character in a game backed by generative AI that will stay focused on its goals

Python 4 Updated Sep 27, 2024

alanaqrawi / STCA

Results and Analysis of Single-Turn Crescendo Attacks (STCA) on Large Language Models: Evaluating vulnerabilities in content moderation through adversarial techniques.

2 1 Updated Sep 12, 2024

microsoft / eureka-ml-insights

A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.

Python 181 37 Updated Feb 26, 2026

promptfoo / promptfoo

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line …

TypeScript 18,793 1,611 Updated Mar 30, 2026