AI Recorder

This project is an AI-powered voice recorder that transcribes speech in real time, using either client-side (in-browser) or cloud-side AI.

Features

Real-time voice activity detection (VAD) using ONNX Runtime Web
Speech transcription using Whisper, via either ONNX Runtime Web or the Lepton serverless API
Responsive UI with recording and processing indicators

Prerequisites

Node.js
npm

Installation

Clone the repository:

```
git clone https://github.com/vthinkxie/ai-recorder.git
cd ai-recorder
```

Install dependencies:
```
npm install
```
Set up the workspace token:

Go to the Lepton dashboard to get your workspace token. Create a .env file in the root directory of the project and add the following:
```
LEPTON_TOKEN=your_workspace_token
```
Note: pricing for the Whisper service provided by Lepton AI can be found here

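As a minimal sketch of how server-side code might consume this token, the helper below reads `LEPTON_TOKEN` from an environment object and fails early if it is missing. The function name `buildLeptonHeaders` is hypothetical and not part of this project, and the bearer-token header format is an assumption; check the Lepton documentation for the exact scheme.

```typescript
// Hypothetical helper, not taken from the project source.
type Env = Record<string, string | undefined>;

function buildLeptonHeaders(env: Env): Record<string, string> {
  // Trim so that stray whitespace in the .env file does not break auth.
  const token = env.LEPTON_TOKEN?.trim();
  if (!token) {
    throw new Error("LEPTON_TOKEN is not set; add it to the .env file");
  }
  // Assumption: the API accepts a standard bearer token header.
  return { Authorization: `Bearer ${token}` };
}

// Typical server-side usage (Node.js):
// const headers = buildLeptonHeaders(process.env);
```

Failing at startup with a clear message is usually easier to debug than an authentication error returned by the remote API later on.
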
Start the development server:
```
npm start
```
This starts the development server. The application is available at http://localhost:3000, and the local (in-browser) Whisper version at http://localhost:3000/local.

References

Voice Activity Detection

The application uses voice activity detection (VAD) via ONNX Runtime Web to determine when the user is speaking. While speech is detected, the UI shows the red "Recording" text and icon.

For more details, see https://github.com/snakers4/silero-vad and https://github.com/DictationDaddy/VAD_WEB_DEMO
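
A VAD model such as Silero VAD outputs a speech probability for each short audio frame, and the application still has to turn that stream of probabilities into a stable speaking / not speaking state. The sketch below shows one common way to do this, using two thresholds plus a short "hangover" so that brief pauses do not end the recording. The class name `SpeechGate`, the option names, and the threshold values are illustrative assumptions, not code from this project.

```typescript
// Hypothetical sketch: names and thresholds are illustrative only.
interface VadOptions {
  startThreshold: number; // probability at or above which speech starts
  endThreshold: number;   // probability below which a frame counts as silence
  hangoverFrames: number; // consecutive silent frames required to end speech
}

class SpeechGate {
  private opts: VadOptions;
  private speaking = false;
  private silentFrames = 0;

  constructor(opts: VadOptions) {
    this.opts = opts;
  }

  // Feed one per-frame speech probability (0..1); returns the speaking state.
  update(probability: number): boolean {
    if (!this.speaking) {
      if (probability >= this.opts.startThreshold) {
        this.speaking = true;
        this.silentFrames = 0;
      }
      return this.speaking;
    }
    if (probability < this.opts.endThreshold) {
      // Count silence, and only stop after enough consecutive silent frames.
      this.silentFrames += 1;
      if (this.silentFrames >= this.opts.hangoverFrames) {
        this.speaking = false;
        this.silentFrames = 0;
      }
    } else {
      this.silentFrames = 0;
    }
    return this.speaking;
  }
}

const gate = new SpeechGate({ startThreshold: 0.5, endThreshold: 0.35, hangoverFrames: 2 });
const states = [0.1, 0.6, 0.4, 0.2, 0.6, 0.2, 0.2].map((p) => gate.update(p));
// states: [false, true, true, true, true, true, false]
```

Using a lower threshold to stop than to start keeps the indicator from flickering when the probability hovers around a single cutoff.
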

Whisper tiny

The application integrates with openai/whisper-tiny.en for speech transcription via ONNX Runtime Web. When the user speaks, the transcribed text appears in the designated area.
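
Whisper models expect 16 kHz mono audio, while browser microphones commonly capture at 44.1 or 48 kHz, so recorded samples generally need resampling before inference. The following is a minimal sketch of that step using linear interpolation. The function name `resampleLinear` is hypothetical, and this is not necessarily how this project prepares its audio.

```typescript
// Hypothetical helper, not taken from the project source.
const WHISPER_SAMPLE_RATE = 16000; // Whisper's expected input rate in Hz

function resampleLinear(
  input: Float32Array,
  inputRate: number,
  targetRate: number = WHISPER_SAMPLE_RATE
): Float32Array {
  if (inputRate <= 0 || targetRate <= 0) {
    throw new Error("sample rates must be positive");
  }
  // Nothing to do: return a copy so callers can mutate the result safely.
  if (inputRate === targetRate || input.length === 0) {
    return input.slice();
  }
  const outputLength = Math.max(1, Math.round((input.length * targetRate) / inputRate));
  const output = new Float32Array(outputLength);
  const step = inputRate / targetRate; // input samples per output sample
  for (let i = 0; i < outputLength; i++) {
    const pos = i * step;
    const left = Math.min(Math.floor(pos), input.length - 1);
    const right = Math.min(left + 1, input.length - 1);
    const frac = pos - left;
    // Blend the two neighbouring input samples.
    output[i] = input[left] * (1 - frac) + input[right] * frac;
  }
  return output;
}

// One second of 48 kHz audio becomes 16000 samples.
const resampled = resampleLinear(new Float32Array(48000), 48000);
```

Linear interpolation applies no low-pass filter, so downsampling this way can introduce aliasing. In a browser, resampling through an `OfflineAudioContext` created at 16 kHz is a common alternative that lets the platform handle filtering.
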

License

This project is licensed under the MIT License.
