🔊 Generic Audio Classifier

A powerful audio classification application using state-of-the-art deep learning models

🎯 What Can You Do?

This application allows you to classify audio files into various categories and subcategories using advanced machine learning models.

Upload your audio files for instant classification
Record audio directly through your microphone
Visualize classification results with detailed analytics
Contribute to the dataset by adding new labeled audio files
Explore the existing dataset structure and examples

🧠 Powered by Advanced Models

Model	Description	Accuracy
NASNet Mobile	Neural Architecture Search Network optimized for mobile	95%
EfficientNet V2 B0	Optimized CNN with balanced performance	87%
DualNet CX	Dual-pathway network for contextual features	99%
DualNet Xpert	Expert system with dual feature extraction	98%

📊 Dataset Overview

Metric	Count
Audio Files	23,303
Categories	4
Subcategories	23

📈 Classification Visualization

The application provides detailed visualizations of classification results, including confidence scores for each category.

Dataset Structure

GENERIC_AUDIO_CLASSIFIER
├── Animals
│   ├── CATS
│   ├── DOGS
│   ├── ELEPHANT
│   ├── HORSE
│   └── LIONS
├── Birds
│   ├── CROWS
│   ├── PARROT
│   ├── PEACOCK
│   └── SPARROW
├── Environment
│   ├── CROWD
│   ├── MILITARY
│   ├── OFFICE
│   ├── RAINFALL
│   ├── TRAFFIC
│   └── WIND
└── Vehicles
    ├── airplane
    ├── bicycle
    ├── bike
    ├── bus
    ├── car
    ├── helicopter
    ├── train
    └── truck

🔑 Key Features

Feature	Description
🎙️ Audio Processing	Process various audio formats with intelligent feature extraction
🔄 Real-time Classification	Get instant predictions with high accuracy and precision
📊 Advanced Visualization	See detailed analytics and confidence scores for each prediction
🔍 Dynamic Dataset	Flexible system that grows and improves with new data

Dataset Sources

The dataset includes audio samples from various sources:

Kaggle Generic Audio Samples Dataset - Collected and Organized by me🤘
Various YouTube videos (see Acknowledgements section)
Vehicle sounds from Kaggle Vehicle Sounds Dataset

Model Training Notebooks

🛠️ Setup Instructions

1️⃣ Install Required Dependencies

Ensure you have Python 3.8 or later installed.

Run the following command to install all required Python libraries:

pip install -r requirements.txt

2️⃣ Install FFmpeg (Required for `pydub` and `librosa`)

Windows:

Download FFmpeg from: https://ffmpeg.org/download.html
Extract it to a directory (e.g., C:\ffmpeg).
Add the bin folder to your system PATH:
- Search for "Edit the system environment variables" in Windows.
- Under System Properties > Advanced > Environment Variables, find Path and edit it.
- Click New and add:
```
C:\ffmpeg\bin
```
- Click OK and restart your system.

Mac/Linux (Using Homebrew):

brew install ffmpeg

Ubuntu/Debian (Using APT):

sudo apt update && sudo apt install ffmpeg -y

⚠️ Streamlit Limitations

Streamlit does not support FFmpeg and sounddevice in the cloud environment.
To enable audio recording, run the app locally.
Use app_local_record.py instead of app.py for full recording features.

🔧 Running the App Locally

To start the app, run:

streamlit run app.py

If you want local audio recording support, run:

streamlit run app_local_record.py

📜 License

This project is open-source and available under the Apache License.

Acknowledgements

Special thanks to the content creators who made their recordings available. The Vehicle sounds dataset was sourced from Kaggle user Jan Boubia Abderrahim.

Source Videos

Animals

Birds

Crows

"Crow Cawing Sound Effect" https://youtu.be/T8xQ-y2pfVo?si=aUvr_v8SgEhXtCEQ
"Crow Sounds" https://youtu.be/s1gxWM_E_D8?si=eKCUhO894EQzYqZx
"Crows Cawing" https://youtu.be/ujqJiFjbsOU?si=7VcTpDz3MTwnPLpM
"Crow Calls" https://youtu.be/WoRbb5zaThM?si=n-mMzIr3tNf5_tfn

Parrots

"Parrot Talking and Squawking" https://youtu.be/dBPu0MKa_vg?si=aairO4USAf-I2jQB
"Parrot Sounds" https://youtu.be/o74WN6HCocY?si=KkGM2xxU5eNgXwWR
"Parrot Vocalizations" https://youtu.be/6yoEvmlmQM0?si=zsZ1cIY7w1xSDosL
"Talking Parrot" https://youtu.be/dBPu0MKa_vg?si=2le9yhelwK3rjHM-
"Parrot Squawking" https://youtu.be/aj3ny_GTuhM?si=NyEofcmo-_vHTNYz
"Parrot Sound Effects" https://youtu.be/B9dUpGFc5Uc?si=lablnWLyizKbxvdS
"Parrot Calls" https://youtu.be/BHOUyvC-guc?si=SYsLhUR1IL3kylZm
"Parrot Noises" https://youtu.be/oDPwVz55zGg?si=wL1MVAC45tVzvE2n

Peacocks

"Peacock Calling Sound" https://youtu.be/MiF7v-gYXLE?si=h2CZiZzPqkjd2_Gs
"Peacock Sound Effect" https://youtu.be/walgy_1QQmY?si=Uz2GiEbNNronMgWo
"Peacock Mating Call" https://youtu.be/UgDw2iIcmQ0?si=Z2tT2cA9t_z604-p
"Peacock Sounds" https://youtu.be/AnImnX0DRNQ?si=fBet-NSx5a_RtCQP
"Peacock Screaming" https://youtu.be/LDoN7_Z5O-M?si=60J7k9DxIO0GIiYE
"Peacock Calls" https://youtu.be/xP8xK0ke7SE?si=7ucdGXaMOIrelb_5

Sparrows

"Sparrow Chirping Sound" https://youtu.be/h9AoB2JSoCg?si=8f2zJSu-z7lIynsC
"Sparrow Song" https://youtu.be/8MM6uX71ovU?si=7ftGmxPObHnRVfA0
"Sparrow Sounds" https://youtu.be/X3C_hpTxRd0?si=uhUv0exPSfN2fTIt
"Sparrow Calling" https://youtu.be/hLbVDJI80b0?si=XGepbsltxrafGJGk
"Sparrow Chirps" https://youtu.be/fKAhbrkiAPo?si=c2KBHUPtnsEIdqf4

Environment

Crowd

"Crowd Noise Sound Effect" https://youtu.be/3jYUp9LhiQ8?si=V2H-nFJzK7YANtna
"Crowd Ambient Sound" https://youtu.be/FnhJ2wARY4Q?si=9wbo9A55D4Wz0A1q
"Crowd Sounds" https://youtu.be/1Jh6SuKALt4?si=HtgDCfnjmtvKDuAb
"Stadium Crowd" https://youtu.be/88UwejHolJ8?si=Q2uYVAG1MLnnCmZW
"Crowd Chattering" https://youtu.be/a0Ud85Xdxn4?si=WBtOqweocqcECCqx
"Crowd Ambience" https://youtu.be/4h7tXm5b5KM?si=i8aQWenyivQon6Ji
"People Crowd" https://youtu.be/IKB3Qiglyro?si=ohSXY5BjyHXrgx0T

Military

"Military Sound Effects" https://youtu.be/qFxR1yvsvqQ?si=oUJV9LIx7xjBGBgJ
"Military Vehicles Sounds" https://youtu.be/RGtN2GIM-ig?si=GtS5SjSbeYFpun7H
"Military Operations Audio" https://youtu.be/0QmA_-uxaDE?si=-BrrVzVT49EVRhi-

Office

"Office Ambience Sounds" https://youtu.be/D7ZZp8XuUTE?si=QSlXeMzoZG3jziZQ

Rainfall

"Rain Sound Effect" https://youtu.be/y615vOsiG5w?si=nR1IS-o9KWddVN6x
"Heavy Rain Sounds" https://youtu.be/GA1D88HF0xE?si=ztSu-_-KUfUWSaLS
"Rainfall Audio" https://youtu.be/gP9sGBywjks?si=LpyUGSV6cvEsBq2S

Traffic

"Traffic Sound Effect" https://youtu.be/yrIRAd7E6qE?si=ykIq3lnR00Kdd7oQ
"City Traffic Sounds" https://youtu.be/GlCazmVBUMg?si=CKsUPjqG_m3rjudp
"Urban Traffic Noise" https://youtu.be/ET8SLcviq7s?si=CdGTfbfyJZ4WMa6M
"Highway Traffic" https://youtu.be/FdOrPosxbFU?si=zdcgBiY8vJOsuMHK
"Busy Street Sounds" https://youtu.be/L9Jl_AxzohQ?si=ChRu86EewVuwVWj6
"Traffic Ambience" https://youtu.be/zxxfvP8-lrU?si=xwPBKlaEJRKyZzDA
"Road Traffic" https://youtu.be/LqL_C29sGCY?si=Jo-wWII4J52bnh2R
"Traffic Jam Sounds" https://youtu.be/iJZcjZD0fw0?si=4-sXfx4b-Cd4s1IH
"Intersection Traffic" https://youtu.be/9wbA9FVtWF4?si=31mA4JNLenbotBjJ

Wind

"Wind Sound Effect" https://youtu.be/v2Zh0oGmmvo?si=yrsD8rInl9mxbanc
"Strong Wind Sounds" https://youtu.be/v2Zh0oGmmvo?si=ssj46IpptaqN6zP5

Vehicles

All vehicle sounds were sourced from the "Vehicle Sounds Dataset" https://www.kaggle.com/datasets/janboubiabderrahim/vehicle-sounds-dataset by Jan Boubia Abderrahim on Kaggle.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.vscode		.vscode
DATASET_FLAC		DATASET_FLAC
DualNet_CX		DualNet_CX
DualNet_Xpert		DualNet_Xpert
EfficientNet_V2_B0		EfficientNet_V2_B0
NasNet_Mobile		NasNet_Mobile
results		results
.env		.env
.gitattributes		.gitattributes
.python-version		.python-version
DATASET_FLAC.rar		DATASET_FLAC.rar
README.md		README.md
app.py		app.py
app_local_record.py		app_local_record.py
elephant_detect.py		elephant_detect.py
flac_conversion.py		flac_conversion.py
package-lock.json		package-lock.json
packages.txt		packages.txt
requirements.txt		requirements.txt
runtime.txt		runtime.txt
tflite_conversion.py		tflite_conversion.py
yolov5s.pt		yolov5s.pt

Folders and files

Latest commit

History

Repository files navigation

🔊 Generic Audio Classifier

🎯 What Can You Do?

🧠 Powered by Advanced Models

📊 Dataset Overview

📈 Classification Visualization

Dataset Structure

🔑 Key Features

Dataset Sources

Model Training Notebooks

🛠️ Setup Instructions

1️⃣ Install Required Dependencies

2️⃣ Install FFmpeg (Required for pydub and librosa)

Windows:

Mac/Linux (Using Homebrew):

Ubuntu/Debian (Using APT):

⚠️ Streamlit Limitations

🔧 Running the App Locally

📜 License

Acknowledgements

Source Videos

Cats

Dogs

Elephants

Horses

Lions

Crows

Parrots

Peacocks

Sparrows

Crowd

Military

Office

Rainfall

Traffic

Wind

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

2️⃣ Install FFmpeg (Required for `pydub` and `librosa`)

Packages