Yaohui wyhsirius

Research Scientist

153 followers · 8 following

Shanghai AI Laboratory
China
https://wyhsirius.github.io
@yaohuiwang_yh

Achievements

Stars

Vchitect / RAPO

[CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Python 112 1 Updated Oct 27, 2025

hmwang2002 / InternSVG

[ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".

Python 96 1 Updated Feb 6, 2026

wyhsirius / LIA-X

LIA-X: Interpretable Latent Portrait Animator

Python 100 12 Updated Sep 17, 2025

zhuangshaobin / Video-GPT

[ICLR2026] Video-GPT via Next Clip Diffusion.

Python 44 1 Updated Jun 2, 2025

maxin-cn / Cinemo

[CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models

Python 296 22 Updated May 17, 2025

Costwen / Ouroboros3D

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion (CVPR2025)

Python 147 8 Updated Oct 22, 2025

InternLM / xtuner

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,107 413 Updated Mar 25, 2026

Vchitect / Latte

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,930 190 Updated Oct 30, 2025

Vchitect / VideoBooth

[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts

Python 311 12 Updated Jun 9, 2024

Vchitect / VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,548 107 Updated Mar 23, 2026

Vchitect / LaVie

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 951 64 Updated Nov 13, 2024

Vchitect / SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 970 65 Updated Nov 13, 2024

walker1126 / Latent_Action_Composition

[ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation

Python 21 1 Updated Oct 25, 2023

pengbo807 / ConditionVideo

Training-Free Condition-Guided Text-to-Video Generation

Python 63 1 Updated Oct 23, 2025

OpenGVLab / LORIS

[ICML2023] Long-Term Rhythmic Video Soundtracker

Python 62 1 Updated Jul 28, 2025

OpenMOSS / MOSS

An open-source tool-augmented conversational language model from Fudan University

Python 12,096 1,133 Updated Jul 13, 2024

NVlabs / long-video-gan

Official PyTorch implementation of LongVideoGAN

Python 321 30 Updated Nov 5, 2022

GuyTevet / motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 3,934 449 Updated Oct 1, 2025

NVlabs / eg3d

Python 3,339 357 Updated Jun 10, 2023

sherwinbahmani / 3dvideogeneration

3D-Aware Video Generation

Python 75 3 Updated Nov 15, 2022

wyhsirius / LIA

[ICLR 22, TPAMI 24] LIA: Latent Image Animator

Python 650 68 Updated Oct 22, 2025

rosinality / vq-vae-2-pytorch

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,797 284 Updated Feb 15, 2023

lucidrains / imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,406 797 Updated Oct 7, 2024

microsoft / StyleSwin

[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation

Python 540 54 Updated Jul 30, 2024

justimyhxu / awesome-3D-generation

A curated list of awesome 3d generation papers

1,191 60 Updated Mar 9, 2023

willi-menapace / PlayableEnvironments

Official PyTorch implementation of "Playable Environments: Video Manipulation in Space and Time", CVPR 2022

Python 72 10 Updated Oct 16, 2022

vsitzmann / awesome-implicit-representations

A curated list of resources on implicit neural representations.

2,625 145 Updated Feb 11, 2024

zhanghengdev / GAFF

[WACV 2021] "Guided Attentive Feature Fusion for Multispectral Pedestrian Detection"

29 2 Updated Jan 13, 2021

zhanghengdev / MutualGuide

Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection

Python 113 12 Updated Jan 28, 2023

YangDi666 / UNIK

[BMVC 2021 Oral] Official implementation of our paper "A Unified Framework for Real-world Skeleton-based Action Recognition" on Toyota Smarthome/Penn Action/NTU-RGB+D/Posetics datasets

Python 52 11 Updated Sep 2, 2022