Skip to content
View muupan's full-sized avatar

Organizations

@pfnet @chainer

Block or report muupan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Android feed reader app

Kotlin 2,720 179 Updated Mar 26, 2026

Flexible evaluation tool for language models

Python 58 5 Updated Mar 18, 2026

Enable SSH access to Kuberntes pods

Rust 12 1 Updated Feb 2, 2026

Python bindings to the Zstandard (zstd) compression library

C 623 111 Updated Sep 14, 2025

A feature-rich command-line audio/video downloader

Python 153,364 12,435 Updated Mar 21, 2026
Python 1,117 53 Updated Jan 10, 2026

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 119,595 13,009 Updated Mar 25, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 384 40 Updated Mar 25, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,373 114 Updated Mar 25, 2026

text window manager, shell multiplexer, integrated DevOps environment

Shell 1,530 134 Updated Feb 16, 2026

CLI tool which enables you to login and retrieve AWS temporary credentials using a SAML IDP

Go 2,204 608 Updated Nov 20, 2025

Chrome extension to disable youtube video titles autotranslation

JavaScript 431 21 Updated Jan 18, 2026

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

Jupyter Notebook 2,079 122 Updated Dec 3, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,360 443 Updated Mar 9, 2026

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Python 274 22 Updated Apr 26, 2024

Web extension to set a default speed for video and audio

TypeScript 2,466 294 Updated Mar 14, 2026

The official implementation of "Horizon Reduction Makes RL Scalable"

Python 184 11 Updated Aug 2, 2025

OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation

Python 34 6 Updated May 23, 2025
Jupyter Notebook 12 6 Updated Jul 17, 2025

OCR Benchmark

TypeScript 627 52 Updated Oct 21, 2025

日本の祝日を取得するライブラリ

Python 245 16 Updated Feb 2, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 15,062 1,414 Updated Mar 26, 2026

MR.Q is a general-purpose model-free reinforcement learning algorithm.

Python 144 10 Updated Jun 23, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,984 1,584 Updated Feb 27, 2026

Really Fast End-to-End Jax RL Implementations

Python 1,035 85 Updated Sep 9, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,411 1,029 Updated Jul 8, 2025

Python tool for converting files and office documents to Markdown.

Python 92,525 5,540 Updated Mar 16, 2026

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

TypeScript 1,426 52 Updated Mar 21, 2026

A small library of LLM judges

Python 331 33 Updated Jul 31, 2025

Evaluating Reward Models in Multilingual Settings (ACL Main '25)

Python 41 4 Updated May 16, 2025
Next