AudioSet

Tags
objects, reading
Contributor
James Parker
Date
March 4, 2024
Folgezettel
8a

AudioSet is a large dataset created by Google. It comprises around 2 million 10-second sound clips, extracted from user-uploaded YouTube videos and labelled by human annotators, for use in training machine learning models such as neural networks to recognize and categorize different types of sounds and audio events. The dataset was designed to support automatic audio event recognition systems that can label hundreds or thousands of different sound events in real-world recordings with a time resolution better than one second, much as human listeners recognize and relate the sounds they hear.
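To make the shape of a single entry concrete, here is a minimal Python sketch of reading one clip record. It assumes the segment lists are distributed as CSV files with `#` comment lines and four columns (video ID, start time, end time, and a quoted list of label IDs). The `Segment` class, the `parse_segments` function, and the sample row are hypothetical and invented for illustration; they are not taken from the dataset itself.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class Segment:
    """One labelled clip: a YouTube video ID, a time window, and label IDs."""
    video_id: str
    start: float
    end: float
    labels: list[str]


def parse_segments(text: str) -> list[Segment]:
    """Parse segment rows, skipping '#' comment lines and blank lines."""
    data_lines = (line for line in io.StringIO(text) if not line.startswith("#"))
    # skipinitialspace lets the quoted label field follow a ", " separator.
    rows = csv.reader(data_lines, skipinitialspace=True)
    return [
        Segment(row[0], float(row[1]), float(row[2]), row[3].split(","))
        for row in rows
        if row
    ]


# Made-up sample in the assumed layout; the video ID is not a real entry.
SAMPLE = '''# YTID, start_seconds, end_seconds, positive_labels
abc123XYZ_0, 30.000, 40.000, "/m/09x0r,/t/dd00088"
'''

segments = parse_segments(SAMPLE)
for seg in segments:
    print(seg.video_id, seg.end - seg.start, seg.labels)
```

Each row points to a video and a time window rather than containing audio, so a record like this holds only a reference and its labels.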

Readings:

Experiment

Dataset Audit: a score