Skip to content
View txytju's full-sized avatar
  • Ytech Kwai
  • Beijing,China

Block or report txytju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.

Python 176 7 Updated Mar 18, 2026

PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.

Python 163 11 Updated Apr 5, 2024
Python 40 3 Updated Jan 2, 2025

[CVPR2025]

JavaScript 1 Updated Mar 4, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,135 68 Updated Mar 20, 2025

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 322 8 Updated Jul 9, 2024

The best OSS video generation models, created by Genmo

Python 3,630 476 Updated Nov 14, 2025

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,131 70 Updated Feb 7, 2025

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

438 23 Updated Mar 8, 2025

Official inference repo for FLUX.1 models

Python 25,353 1,871 Updated Jul 31, 2025

[CVPR2024] Official implementation of SplattingAvatar.

Python 548 52 Updated Oct 28, 2024

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Python 374 45 Updated Mar 15, 2025

[CVPR '24] DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars

Jupyter Notebook 171 17 Updated Jun 26, 2025

[Siggraph '23] NeRSemble: Neural Radiance Field Reconstruction of Human Heads

Python 246 13 Updated Apr 29, 2025

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,719 143 Updated Dec 17, 2024

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Python 759 39 Updated Nov 16, 2023

[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"

Python 577 54 Updated Mar 26, 2024

Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024

Python 112 7 Updated Dec 23, 2024

Kolors Team

Python 4,611 357 Updated Nov 13, 2024

[ICLR 2025 Oral] On Scaling Up 3D Gaussian Splatting Training

Python 658 40 Updated Sep 24, 2025

Decoupled Video Instance Segmentation Framework, improved version of dvis

Python 11 2 Updated May 22, 2024

Decoupled Video Instance Segmentation Framework

Python 8 1 Updated May 22, 2024

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,862 382 Updated Apr 7, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 5,041 409 Updated Jan 9, 2026

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Python 356 23 Updated Jul 4, 2023

Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"

Python 232 13 Updated Jun 12, 2023
Python 137 13 Updated Jul 4, 2024

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Python 1,970 143 Updated Dec 1, 2025

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 305 15 Updated Apr 22, 2024
Next