Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Swarah: Indian-English speech dataset collected across the country
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
Vector (and Scalar) Quantization, in PyTorch
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
A repository for generating stylized talking 3D faces
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, try Sync Labs.
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Convert images of LaTeX math equations into LaTeX code.
Implementation of Differentiable Digital Signal Processing (DDSP) in PyTorch
MobileNet v1 trained on ImageNet for STM32 using extended CMSIS-NN with INT-Q quantization support
A higher-level Neural Network library for microcontrollers.
Kubernetes Native Edge Computing Framework (project under CNCF)
This repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
Implementation of CGMM-MVDR beamforming (for the Python version, see https://github.com/funcwj/setk)
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Open standard for machine learning interoperability
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder
Voice Conversion Challenge 2020 CycleVAE baseline system
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)