Ridiculously fast & accurate voice activity detection in pure Rust.
Achieves an RTF of 0.0007 (1,270x real time): 20x faster than Silero VAD v6 & TEN VAD - and more accurate, too!
If you find Earshot useful, please consider sponsoring pyke.io.
```rust
use earshot::Detector;

// Create a new VAD detector using the default NN.
let mut detector = Detector::default();

let mut frame_receiver = ...
while let Some(frame) = frame_receiver.recv() {
    // `frame` is a Vec<i16>; each frame passed to the detector must be exactly 256 samples.
    // f32 [-1, 1] frames are also supported with `predict_f32`.
    let score = detector.predict_i16(&frame);
    // The score is between 0 and 1; 0 = no voice, 1 = voice.
    if score >= 0.5 { // 0.5 is a good default threshold, but it can be customized.
        println!("Voice detected!");
    }
}
```
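If your pipeline produces floating-point audio instead, `predict_f32` works the same way. Below is a minimal offline sketch, assuming `predict_f32` mirrors `predict_i16` (takes a slice of samples, returns a score); the `detect_voiced_frames` helper, the `chunks_exact` framing, and dropping a trailing partial frame are just for illustration:

```rust
use earshot::Detector;

/// Returns the indices of the 256-sample frames in which voice was detected.
fn detect_voiced_frames(samples: &[f32]) -> Vec<usize> {
    let mut detector = Detector::default();
    let mut voiced = Vec::new();
    // Feed the recording to the detector in exact 256-sample frames;
    // any trailing partial frame is dropped for simplicity.
    for (i, frame) in samples.chunks_exact(256).enumerate() {
        // Samples are expected to be in the [-1, 1] range.
        if detector.predict_f32(frame) >= 0.5 {
            voiced.push(i);
        }
    }
    voiced
}
```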
Earshot is very embedded-friendly: each instance of `Detector` uses ~8 KiB of memory to store the audio buffer & neural network state. Binary footprint is ~100 KiB; the neural network accounts for ~75 KiB of that.

In contrast, Silero's model is 2 MiB and TEN's is 310 KiB, and both require ONNX Runtime, which adds an additional 8 MB to your binary (and a whole lot more memory at runtime).
Earshot supports `#![no_std]`, though it does require an allocator. The `std` feature is enabled by default, so set `default-features = false` to build for `#![no_std]`:
```toml
[dependencies]
earshot = { version = "1", default-features = false }
```
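As a rough sketch of what a `#![no_std]` consumer might look like - assuming the same `Detector` API shown above, and that your target registers its own global allocator (the `is_speech` helper and `embedded-alloc` mention are illustrative, not part of Earshot):

```rust
#![no_std]
// Note: Earshot still allocates internally, so the final binary must register
// a `#[global_allocator]` somewhere (e.g. via `embedded-alloc` on bare metal).

use earshot::Detector;

/// Returns `true` if a single 256-sample frame is classified as voice.
pub fn is_speech(detector: &mut Detector, frame: &[i16]) -> bool {
    detector.predict_i16(frame) >= 0.5
}
```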