[CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Python 112 1 Updated Oct 27, 2025

[ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".

Python 96 1 Updated Feb 6, 2026

LIA-X: Interpretable Latent Portrait Animator

Python 100 12 Updated Sep 17, 2025

[ICLR2026] Video-GPT via Next Clip Diffusion.

Python 44 1 Updated Jun 2, 2025

[CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models

Python 296 22 Updated May 17, 2025

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion (CVPR2025)

Python 147 8 Updated Oct 22, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,107 413 Updated Mar 25, 2026

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,930 190 Updated Oct 30, 2025

[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts

Python 311 12 Updated Jun 9, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,548 107 Updated Mar 23, 2026

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 951 64 Updated Nov 13, 2024

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 970 65 Updated Nov 13, 2024

[ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation

Python 21 1 Updated Oct 25, 2023

Training-Free Condition-Guided Text-to-Video Generation

Python 63 1 Updated Oct 23, 2025

[ICML2023] Long-Term Rhythmic Video Soundtracker

Python 62 1 Updated Jul 28, 2025

An open-source tool-augmented conversational language model from Fudan University

Python 12,096 1,133 Updated Jul 13, 2024

Official PyTorch implementation of LongVideoGAN

Python 321 30 Updated Nov 5, 2022

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 3,934 449 Updated Oct 1, 2025
Python 3,339 357 Updated Jun 10, 2023

3D-Aware Video Generation

Python 75 3 Updated Nov 15, 2022

[ICLR 22, TPAMI 24] LIA: Latent Image Animator

Python 650 68 Updated Oct 22, 2025

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,797 284 Updated Feb 15, 2023

Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch

Python 8,406 797 Updated Oct 7, 2024

[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation

Python 540 54 Updated Jul 30, 2024

A curated list of awesome 3d generation papers

1,191 60 Updated Mar 9, 2023

Official PyTorch implementation of "Playable Environments: Video Manipulation in Space and Time", CVPR 2022

Python 72 10 Updated Oct 16, 2022

A curated list of resources on implicit neural representations.

2,625 145 Updated Feb 11, 2024

[WACV 2021] "Guided Attentive Feature Fusion for Multispectral Pedestrian Detection"

29 2 Updated Jan 13, 2021

Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection

Python 113 12 Updated Jan 28, 2023

[BMVC 2021 Oral] Official implementation of our paper "A Unified Framework for Real-world Skeleton-based Action Recognition" on Toyota Smarthome/Penn Action/NTU-RGB+D/Posetics datasets

Python 52 11 Updated Sep 2, 2022