Skip to content
View faroit's full-sized avatar
🚀
Crafting narrow ai
🚀
Crafting narrow ai

Organizations

@RocketScienceAbteilung @sigsep

Block or report faroit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Claude skill that writes the accurate prompts for any AI tool. Zero tokens or credits wasted. Full context and memory retention

2,322 175 Updated Mar 23, 2026

Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models

Python 26 4 Updated Feb 25, 2026

Run OpenClaw more securely inside NVIDIA OpenShell with managed inference

JavaScript 17,067 1,917 Updated Mar 26, 2026

AI-GENERATED MUSIC DETECTION IN BROADCAST MONITORING

2 Updated Jan 26, 2026

Open-source orchestration for zero-human companies

TypeScript 33,864 4,833 Updated Mar 26, 2026

AI agents running research on single-GPU nanochat training automatically

Python 57,413 7,996 Updated Mar 26, 2026

One-command setup to turn a Raspberry Pi into a 24/7 AI agent with browser control, chat integrations, and local LLMs. Powered by OpenClaw.

Shell 3 Updated Feb 15, 2026

A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps,, has memory, scheduled jobs, and runs dir…

TypeScript 25,631 9,008 Updated Mar 26, 2026

A minimal, secure Python interpreter written in Rust for use by AI

Rust 6,556 265 Updated Mar 27, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 337,474 66,175 Updated Mar 27, 2026
Python 6 2 Updated Sep 16, 2025
Python 6 Updated Jan 7, 2026

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 6,394 511 Updated Mar 26, 2026

Target Speaker Extraction Toolkit

Python 251 35 Updated Oct 4, 2025

Flux Domain Manager is a service for handling Flux network request for domain assignment to Flux itself or specific application

JavaScript 19 22 Updated Mar 13, 2026

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 204 16 Updated Jul 29, 2025

Speaker diarization benchmark framework

Python 39 8 Updated Jan 8, 2026

Extract audio files from a parquet or arrow file generated by Hugging Face `datasets` library.

Rust 15 1 Updated Mar 16, 2026

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++ 14,670 2,261 Updated Mar 26, 2026

This is the audio sample repository for speech separation model "MossFormer2".

Python 176 11 Updated Nov 28, 2024

PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind

Python 93 4 Updated Nov 24, 2025

OpenFLAM: Framewise Language Audio Model

Python 101 6 Updated Jan 14, 2026

DACVAE

Python 207 17 Updated Dec 22, 2025
Python 3 Updated Dec 11, 2025

Repository to reproduce the separation experiments included in the paper "The Spheres Dataset: A Multitrack Orchestral Resource for Music Source Separation and Information Retrieval".

Python 8 Updated Nov 28, 2025

UTokyo-SaruLab MOS Prediction System

Python 305 30 Updated Feb 23, 2026

The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Python 763 86 Updated Dec 4, 2025

A novel data compression framework

C 2,968 139 Updated Mar 26, 2026

"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai Tech Report Link: https://arxiv.org/abs/2512.10971

Python 11,947 2,009 Updated Mar 23, 2026

Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with synthetic captions.

Python 109 9 Updated Mar 3, 2026
Next