Skip to content
View ericlewis's full-sized avatar

Sponsors

@hotelvictorcharlie

Sponsoring

@agg23
@swiftwasm
@kean

Highlights

  • Pro

Block or report ericlewis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Playdate to Pocket Operator Sync

Lua 2 Updated Feb 19, 2023

A powerful toolkit for creating concise and expressive Swift macros

Swift 304 18 Updated Feb 17, 2026

Advanced AI-Powered Reverse Engineering Tool with Agent Skills Integration

Python 7 1 Updated Oct 23, 2025

Developer documentation for writing new firmware for the SP-1 stem player by Teenage Engineering.

C 21 2 Updated Mar 23, 2026

[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

Python 226 16 Updated Mar 31, 2025

DeepSeek-OCR as Vision Tower

Python 1 Updated Nov 21, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 32,001 4,005 Updated Mar 26, 2026

A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating the potential of cross-task information transfer in persona…

Python 128 3 Updated Dec 25, 2025

Research implementation to investigate methods of integrating the speech modality into pre-trained language models

Python 1 Updated Mar 25, 2026

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。

Python 1,874 204 Updated Jan 16, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,961 661 Updated Mar 25, 2026

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 958 977 Updated Jul 4, 2024

A paper list of some recent works about Token Compress for Vit and VLM

863 39 Updated Mar 25, 2026

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,391 213 Updated May 19, 2025

Reading list for research topics in multimodal machine learning

6,845 898 Updated Aug 20, 2024

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 390 102 Updated Mar 26, 2026

BLIP-2 implementation for training vision-language models. Q-Former + frozen encoders + any LLM. Colab-ready notebooks with MoE variant.

Jupyter Notebook 3 Updated Dec 19, 2025

An API-compatible, drop-in replacement for Apple's Foundation Models framework with support for custom language model providers.

Swift 809 63 Updated Mar 24, 2026

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,575 201 Updated May 7, 2025

A Framework of Small-scale Large Multimodal Models

Python 967 99 Updated Mar 26, 2026

Trying to study the effect of different connectors , (linear, MLP and Cross Attention) to analyze what paradigms do LLM'S use or make a best guess

3 Updated Nov 26, 2025

A curated list of vision-and-language pre-training (VLP). :-)

62 7 Updated Jul 6, 2022

Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Python 307 14 Updated Mar 2, 2026

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 412 22 Updated Aug 26, 2025

Turn Apple's CVPR-25 FastVLM encoder into a reproducible baseline for mobile apps. First complete implementation achieving <250ms multimodal inference on iPhone.

Python 11 2 Updated Mar 19, 2026

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025

Python 278 9 Updated May 26, 2025
Python 4,614 452 Updated Sep 14, 2025

Fully Open Framework for Democratized Multimodal Reinforcement Learning.

Python 44 3 Updated Dec 19, 2025

Famous Vision Language Models and Their Architectures

Markdown 1,214 56 Updated Jan 11, 2026

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…

Kotlin 7,883 972 Updated Feb 26, 2026
Next