Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,566 65 Updated Jun 14, 2025

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,904 89 Updated Jan 8, 2026

lzyhha / HSSL

Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)

Python 15 Updated May 2, 2025

stepfun-ai / Step1X-Edit

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 2,173 95 Updated Dec 29, 2025

Alpha-VLLM / Lumina-Accessory

Python 115 3 Updated Apr 25, 2025

lzyhha / diffusers

Forked from huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 1 Updated May 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhong-Yu Li lzyhha

Achievements

Achievements

Block or report lzyhha

Stars

HVision-NKU / ASID-Caption

AIDASLab / Awesome-Diffusion-LLM

yczhou001 / Awesome-Diffusion-LLM

ZHZisZZ / dllm

Tongyi-MAI / Z-Image

huggingface / diffusers

Alpha-VLLM / Lumina-DiMOO

HVision-NKU / OneVAE

openai / gpt-oss

NK-CS-ZZL / DiscretizedSDF

Kwai-Keye / Keye

HVision-NKU / DepthAnythingAC

HumanMLLM / LLaVA-Scissor

ByteDance-Seed / Bagel

ByteDance-Seed / Seed1.5-VL

showlab / Show-o

lzyhha / HSSL

stepfun-ai / Step1X-Edit

Alpha-VLLM / Lumina-Accessory

lzyhha / diffusers

FishAndWasabi / Real-LOD

synbol / MaskGIL

lzyhha / VisualCloze

showlab / FAR

Alpha-VLLM / Lumina-mGPT-2.0

Visual-AI / Mr.DETR

mims-harvard / ToolUniverse

mims-harvard / TxAgent

HumanMLLM / HumanOmni

Alpha-VLLM / Lumina-Video