An in-the-wild benchmark for AI agents in the OpenClaw Environment.

Python 69 1 Updated Mar 25, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,769 364 Updated Mar 10, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 3,947 316 Updated Mar 24, 2026

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 22,654 4,069 Updated Mar 25, 2026

VIGA: Vision-as-Inverse-Graphics Agent

Python 907 84 Updated Feb 25, 2026

A unified framework for easy reinforcement learning in Flow-Matching models

Python 284 18 Updated Mar 25, 2026

This repository provides FlashPortrait custom nodes for ComfyUI.

Python 26 2 Updated Dec 29, 2025

A Unified Visual Generator with Interleaved OmniModal Context

Python 206 3 Updated Mar 5, 2026

[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 2,238 189 Updated Jan 19, 2026

Official code for StoryMem: Multi-shot Long Video Storytelling with Memory

Python 704 70 Updated Jan 22, 2026

The official implementation of InfiniteVGGT

Python 334 17 Updated Jan 19, 2026

Mixture-of-Groups Attention for End-to-End Long Video Generation

94 Updated Oct 22, 2025

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,410 300 Updated Jan 5, 2026

Qwen-Image-Layered: Layered Decomposition for Inherent Editability

Python 1,707 131 Updated Dec 31, 2025

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 192 7 Updated Dec 29, 2025

[NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control

Python 60 2 Updated Mar 31, 2025

[Siggraph Asia 25] SS4D: Native 4D Generative Model via Structured Spacetime Latents

Python 33 3 Updated Dec 17, 2025

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,325 117 Updated Mar 24, 2026

[CVPR 2026] We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6× acceleration in inference…

Python 461 34 Updated Feb 21, 2026

[CVPR 2026] V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Python 135 6 Updated Jan 17, 2026

[CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"

Python 87 2 Updated Feb 13, 2026

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

114 4 Updated Dec 11, 2025

[ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"

Python 40 3 Updated Jan 17, 2026

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Python 35 3 Updated Aug 28, 2025

We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…

Python 1,215 107 Updated Jan 20, 2026

Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"

Python 235 25 Updated Aug 7, 2025

[CVPR 2025 Oral & Best Paper Finalist] Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Python 1,120 95 Updated Jun 28, 2025

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python 1,342 126 Updated Jan 11, 2026

The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".

Python 50 1 Updated May 23, 2025

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 7,861 1,777 Updated May 26, 2025