Skip to content

A web application that converts speech to speech 100% private using VAD (voice activity detection)

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE
MIT
MOONSHINE_LICENSE
Notifications You must be signed in to change notification settings

rhulha/EchoMate

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

EchoMate - Voice Assistant Web Application

EchoMate is a web application that enables speech-to-speech conversations using Moonshine for speech recognition, Kokoro for text-to-speech synthesis, and Silero-VAD for voice activity detection. The app features a tabbed interface for conversation, settings, and credits.

Two Versions of this app

There are two versions of this app. One is pure client side but you need a PC with a very good graphics card and a browser that supports WebGPU. The other does most of the work on the server side using Python so that you can use a very simple browser like on a mobile phone.

pure client side version
server side version

This project is the pure client side version.

Warning

  • This project it pure client side and depends heavily on WebGPU support in the browser and is only tested with Chrome !!!!!
  • I also have a version that does everything on the server side (Python) except for VAD here:
  • EchoMate_ServerSide
  • Optionally you can use a local LLM server like llama.cpp in the settings.

Features

  • Real-time voice activity detection (VAD)
  • Speech-to-text (STT) using Moonshine
  • Text-to-speech (TTS) synthesis with Kokoro
  • Configurable server endpoint, system prompt, and voice selection
  • Modern, responsive web UI

Usage

  • You should be able to just talk.
  • Use the Settings tab to configure the chat server URL, system prompt, and select a TTS voice.
  • View open source credits in the Credits tab.

Credits

License

LICENSE for Me and Kokoro. MOONSHINE_LICENSE for Moonshine STT. SILERO_LICENSE for Silero VAD.

About

A web application that converts speech to speech 100% private using VAD (voice activity detection)

Topics

Resources

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE
MIT
MOONSHINE_LICENSE

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published