txytju

Computer vision engineer. Interested in machine learning and mixed reality.

77 followers · 456 following

Ytech Kwai
Beijing, China

Stars

zelaki / eqvae

[ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.

Python 176 7 Updated Mar 18, 2026

sayakpaul / cmmd-pytorch

PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.

Python 163 11 Updated Apr 5, 2024

bytedance / Hybrid-SD

Python 40 3 Updated Jan 2, 2025

wpy1999 / IV-VAE

[CVPR2025]

JavaScript 1 Updated Mar 4, 2025

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,135 68 Updated Mar 20, 2025

FoundationVision / OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 322 8 Updated Jul 9, 2024

genmoai / mochi

The best OSS video generation models, created by Genmo

Python 3,630 476 Updated Nov 14, 2025

rhymes-ai / Allegro

Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.

Python 1,131 70 Updated Feb 7, 2025

facebookresearch / MovieGenBench

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

438 23 Updated Mar 8, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 25,353 1,871 Updated Jul 31, 2025

initialneil / SplattingAvatar

[CVPR2024] Official implementation of SplattingAvatar.

Python 548 52 Updated Oct 28, 2024

Fictionarry / TalkingGaussian

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Python 374 45 Updated Mar 15, 2025

tobias-kirschstein / diffusion-avatars

[CVPR '24] DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars

Jupyter Notebook 171 17 Updated Jun 26, 2025

tobias-kirschstein / nersemble

[Siggraph '23] NeRSemble: Neural Radiance Field Reconstruction of Human Heads

Python 246 13 Updated Apr 29, 2025

adobe-research / VideoDoodles

Python 436 35 Updated Aug 16, 2024

TencentARC / BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,719 143 Updated Dec 17, 2024

SHI-Labs / Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Python 759 39 Updated Nov 16, 2023

aipixel / GaussianAvatar

[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"

Python 577 54 Updated Mar 26, 2024

ai-med / StablePose

Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024

Python 112 7 Updated Dec 23, 2024

Kwai-Kolors / Kolors

Kolors Team

Python 4,611 357 Updated Nov 13, 2024

nyu-systems / Grendel-GS

[ICLR 2025 Oral] On Scaling Up 3D Gaussian Splatting Training

Python 658 40 Updated Sep 24, 2025

KlingAIResearch / DVIS_Plus

Decoupled Video Instance Segmentation Framework, improved version of dvis

Python 11 2 Updated May 22, 2024

KlingAIResearch / DVIS

Decoupled Video Instance Segmentation Framework

Python 8 1 Updated May 22, 2024

ant-research / CoDeF

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,862 382 Updated Apr 7, 2024

AILab-CVC / VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 5,041 409 Updated Jan 9, 2026

baaivision / vid2vid-zero

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Python 356 23 Updated Jul 4, 2023

thu-ml / controlvideo

Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"

Python 232 13 Updated Jun 12, 2023

zhang-tao-whu / DVIS_Plus

Python 137 13 Updated Jul 4, 2024

adobe-research / custom-diffusion

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Python 1,970 143 Updated Dec 1, 2025

bytedance / GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 305 15 Updated Apr 22, 2024
