This project classifies the audio track of a video into 10 prominent urban sound classes, using Mel-spectrograms extracted from the audio at every second of the video. The result is shown as a graph overlaid on the same video, displaying the 3 most audible sounds in that time frame.
RheaSudesh/Audio-Dataset-Classification
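The per-second, top-3 step described above can be sketched as follows. This is a minimal illustration, not the repository's actual code: the helper names `split_into_seconds` and `top_k` are hypothetical, the class list is assumed to be the UrbanSound8K label set (the description only says "10 urban sound classes"), and the Mel-spectrogram extraction and the classifier are left out, with a hand-written score vector standing in for the classifier output of one window.

```python
from typing import List, Sequence, Tuple

# Assumption: the 10 classes are the UrbanSound8K labels; the project
# description does not list them.
CLASSES = [
    "air_conditioner", "car_horn", "children_playing", "dog_bark", "drilling",
    "engine_idling", "gun_shot", "jackhammer", "siren", "street_music",
]


def split_into_seconds(samples: Sequence[float], sample_rate: int) -> List[Sequence[float]]:
    """Split audio samples into consecutive one-second windows.

    A trailing window shorter than one second is dropped.
    """
    n_windows = len(samples) // sample_rate
    return [samples[i * sample_rate:(i + 1) * sample_rate] for i in range(n_windows)]


def top_k(scores: Sequence[float], k: int = 3) -> List[Tuple[str, float]]:
    """Return the k highest-scoring (class, score) pairs, best first."""
    if len(scores) != len(CLASSES):
        raise ValueError("expected one score per class")
    ranked = sorted(zip(CLASSES, scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical classifier output for a single one-second window.
example_scores = [0.05, 0.30, 0.02, 0.25, 0.01, 0.03, 0.00, 0.04, 0.20, 0.10]
for name, score in top_k(example_scores):
    print(f"{name}: {score:.2f}")
```

In a full pipeline, each window returned by `split_into_seconds` would be converted to a Mel-spectrogram and passed to the classifier, and the three pairs returned by `top_k` would be what the overlaid graph displays for that second.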