Starred repositories

Code for openai.fm, a demo for the OpenAI Speech API

TypeScript 2,832 7,940 Updated Mar 3, 2026

A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, supporting 100+ languages, outperforming Silero-VAD, TEN-VAD, FunASR-VAD and WebRTC-VAD

Python 318 26 Updated Mar 24, 2026

Swift API for MLX

C++ 1,684 169 Updated Mar 26, 2026

Generates an image from a DOM node using HTML5 canvas

JavaScript 10,781 1,692 Updated Apr 8, 2024

Offline voice input app for macOS on Apple Silicon — powered by MLX-Audio (Whisper/Qwen3-ASR)

Rust 4 Updated Mar 11, 2026

Capture system loopback audio on macOS 12.3+, Windows and Linux

TypeScript 110 18 Updated Aug 3, 2025

Build ultra fast, tiny, and cross-platform desktop apps with TypeScript.

TypeScript 10,820 256 Updated Mar 22, 2026

The swiss army knife of lossless video/audio editing

TypeScript 39,362 1,915 Updated Mar 26, 2026

VOICE → WORDS

Swift 1,456 112 Updated Mar 26, 2026

C inference for Qwen3-ASR 0.6b and 1.7b transcription models

C 497 48 Updated Feb 17, 2026

A fast and soft pattern search for trillion-scale corpora.

Python 203 7 Updated Feb 28, 2026

Offline streaming speech-to-text in the browser

JavaScript 25 1 Updated Aug 28, 2025

A Streaming-Native Serving Engine for TTS/STS Models

Python 61 8 Updated Feb 22, 2026

Automates Money Forward ME and visualizes your asset holdings

TypeScript 235 25 Updated Mar 27, 2026

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,822 481 Updated Mar 16, 2026

A real-time and light-weight software for generation of non-linguistic behaviors (turn-taking, backchannel, and head-nodding) in conversational AIs

Python 84 13 Updated Mar 27, 2026

Zero-copy deserialization framework for Rust

Rust 4,109 220 Updated Feb 28, 2026

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 248 22 Updated Mar 7, 2025

VoiceBench: Benchmarking LLM-Based Voice Assistants

Python 345 23 Updated Mar 21, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 338,140 66,383 Updated Mar 27, 2026

🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no keyboard needed. 🆓 Powered by open source models, works offline, fast and accurate.

TypeScript 1,047 88 Updated Mar 26, 2026

Chrome extension that analyzes tweets on X timeline based on the X algorithm weights

JavaScript 111 8 Updated Feb 1, 2026

Massive open Japanese speech corpus

Python 375 34 Updated Jan 19, 2026
Python 1 Updated Mar 15, 2026

A lightning fast audio upsampler.

Python 745 71 Updated Feb 26, 2026

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 18,728 1,472 Updated Mar 26, 2026

Browser automation CLI for AI agents

Rust 25,166 1,517 Updated Mar 27, 2026

Curated list of design and UI resources from stock photos, web templates, CSS frameworks, UI libraries, tools and much more

65,110 11,992 Updated Mar 24, 2026

Training code for FAcodec presented in NaturalSpeech3

Python 240 21 Updated Aug 26, 2024

Unsupervised Speech Decomposition Via Triple Information Bottleneck

Python 699 96 Updated Oct 23, 2024