Skip to content
View lyh-18's full-sized avatar

Block or report lyh-18

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image editing into a single framework.

Python 249 12 Updated Mar 21, 2026

Accelerating Masked Image Generation by Learning Latent Controlled Dynamics

Python 8 1 Updated Mar 2, 2026

TeleMem is a high-performance drop-in replacement for Mem0, featuring semantic deduplication, long-term dialogue memory, and multimodal video reasoning.

Python 458 28 Updated Mar 22, 2026

The official code of Yume

Python 636 38 Updated Jan 14, 2026

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Python 160 4 Updated Jan 19, 2026

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Python 103 Updated Feb 5, 2026

PICABench: How Far Are We from Physically Realistic Image Editing?

Python 36 Updated Nov 5, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,919 788 Updated Mar 11, 2026
Jupyter Notebook 131 5 Updated Nov 8, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 861 26 Updated Dec 23, 2025

The official repo of TeleEgo - A Benchmark for Egocentric AI Assistants.

Python 59 1 Updated Dec 19, 2025

Recommend new arxiv papers of your interest daily according to your Zotero libarary.

Python 4,987 4,407 Updated Mar 25, 2026

[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny co…

Python 1,465 119 Updated Dec 23, 2025
Python 54 Updated Oct 15, 2025

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model(1.7B, 4B, 8B, 30B)

Python 342 17 Updated Mar 16, 2026

UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

Python 127 3 Updated Mar 2, 2026

ALLWEONE® Open source AI presentation generator Gamma Alternative. Create professional slides with customizable themes and AI-generated content in minutes.

TypeScript 2,680 475 Updated Mar 25, 2026
Python 10 Updated Sep 9, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 954 59 Updated Mar 20, 2026

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

Python 665 68 Updated Mar 24, 2026

[CVPR 2026] ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding(书生 · 妙析多模态美学理解大模型)

Python 165 3 Updated Feb 25, 2026

A list of awesome all-in-one image restoration methods. Updating...!

96 5 Updated Jun 9, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,136 347 Updated Mar 25, 2026

Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)

Python 1,391 86 Updated Feb 7, 2026

[ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers

129 4 Updated Jun 26, 2025

Temporally Consistent Video Colorization with Deep Feature Propagation and Self-regularization Learning

Jupyter Notebook 54 13 Updated Oct 28, 2024

[CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 157 9 Updated Apr 15, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,030 8,416 Updated Mar 25, 2026

[ICCV 2025] FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration

Python 224 6 Updated Nov 26, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,873 1,215 Updated Nov 21, 2025
Next