Skip to content
View dongyh20's full-sized avatar

Block or report dongyh20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

EgoVerse: Egocentric Data for Robot Learning from Around the World

Python 174 7 Updated Mar 25, 2026

[Awesome] 🔥🔥🔥 Latest Papers, Codes and Datasets on Streaming / Online Video Understanding

146 10 Updated Jan 13, 2026

你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.

TypeScript 11,887 621 Updated Mar 25, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 4,029 323 Updated Mar 25, 2026
Python 20 Updated Mar 18, 2026

VisualToolChain-Bench

Python 36 1 Updated Mar 21, 2026

An hardware-aware Efficient Implementation for "Mixture-of-Depths Attention".

Python 143 3 Updated Mar 23, 2026
38 Updated Mar 17, 2026

Streaming Thinking for VideoLLM Streaming Video Understanding

91 Updated Mar 13, 2026
Python 65 2 Updated Mar 16, 2026

[CVPR 2026] PersonaVLM: Long-Term Personalized Multimodal LLMs

Jupyter Notebook 9 1 Updated Mar 16, 2026

🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Python 2,651 271 Updated Mar 25, 2026

Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Python 158 5 Updated Mar 13, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,203 416 Updated Mar 25, 2026

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

250 3 Updated Mar 22, 2026

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]

Jupyter Notebook 162 9 Updated Mar 20, 2026

HY-WU (Part I): An Extensible Functional Neural Memory Framework and An Instantiation in Text-Guided Image Editing

Python 257 10 Updated Mar 18, 2026
Python 1,996 234 Updated Feb 26, 2026

Scalable data generation for video reasoning models.

Python 18 3 Updated Feb 26, 2026

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetil…

Rust 5 1 Updated Mar 18, 2026

GLM-5: From Vibe Coding to Agentic Engineering

1,889 165 Updated Mar 25, 2026

Agentic LaTeX Writer - Local-first editor for AI-assisted academic writing

TypeScript 104 11 Updated Feb 23, 2026

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Python 34 Updated Mar 3, 2026

构建受监督的、自我进化的 Agent 组织的基础设施 | Infrastructure for supervised, self-improving agent organization. 从飞书/Telegram 运行 Claude Code,共享记忆、Agent 工厂、定时任务、通信总线。

TypeScript 421 44 Updated Mar 23, 2026
Python 110 2 Updated Feb 4, 2026

[ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement l…

Python 202 3 Updated Dec 10, 2025

Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support.

Python 906 88 Updated Feb 5, 2026

Moonshot's most powerful model

1,551 169 Updated Jan 31, 2026
Next