General information |
Label |
Value |
Description |
|
Name |
ARCA23K |
Full dataset name |
|
ID |
sounds/arca23k
|
Datalist id for external indexing |
|
Abbreviation |
ARCA23K |
Official dataset abbreviation, e.g. one used in the original paper |
|
Provider |
CVSSP |
|
|
Year |
2021 |
Dataset release year |
|
Modalities |
Audio
|
Data modalities included in the dataset |
|
Collection name |
ARCA23K |
Common name for all related datasets, used to group datasets coming from the same source |
|
Research domain |
Tagging
Weak annotation
Noisy labels
|
Related domains, e.g., Scenes, Mobile devices, Audio-visual, Open set, Ambient noise, Unlabelled, Multiple sensors, SED, SELD, Tagging, FL, Strong annotation, Weak annotation, Multi-annotator |
|
Related datasets name |
|
|
|
License |
Creative Commons, CC BY 4.0 |
|
|
Download |
Download (8.72 GB)
|
|
|
Citation |
[Iqbal2021] ARCA23K: An audio dataset for investigating open-set label noise
|
|
Audio |
Label |
Value |
Description |
|
Data |
|
|
Data type |
Audio
|
Possible values: Audio | Features |
|
|
File format |
|
|
|
File format type |
Constant
|
Possible values: Constant | Variable |
|
|
|
File format |
wav
|
Possible values: wav | aiff | flac | mp3 | aac | ogg |
|
|
|
Lossy compression |
No
|
Whether the audio is compressed in a lossy manner |
|
|
|
Bit depth |
16 |
Bit depth of audio, possible values: 8 | 16 | 24 | 32 |
|
|
|
Sampling rate (kHz) |
44.1 kHz |
Sampling rate in kHz, possible values: 8 | 16 | 22.05 | 32 | 44.1 | 48 |
|
|
Channels |
|
|
|
Setup |
Mono
|
Possible values: Mono | Stereo | Binaural | Ambisonic | Array | Multi-Channel | Variable |
|
|
|
Number of channels |
1 |
|
|
|
Material |
|
|
|
Source |
Freesound
|
Possible values: Original | Youtube | Freesound | Online | Crowdsourced | [Dataset name] |
|
Content |
|
|
Content type |
Freefield
|
Possible values: Freefield | Synthetic | Isolated |
|
Recording |
|
|
Setup |
Unknown
|
Possible values: Near-field | Far-field | Mixed | Uncontrolled | Unknown |
|
|
Spot type |
Unknown
|
Possible values: Fixed | Moving | Unknown |
|
Files |
|
|
Count |
23727 files |
Total number of files |
|
|
Total duration (minutes) |
3108 min |
Total duration of the dataset in minutes |
|
|
File length |
Variable
|
Characterization of the file lengths, possible values: Constant | Quasi-constant | Variable |
|
|
File length (seconds) |
0.3 - 30 sec |
Approximate length of files |
Meta |
Label |
Value |
Description |
|
Types |
Tag
|
List of metadata types provided for the data, possible values: Event, Tag, Scene, Caption, Geolocation, Spatial location, Annotator, Timestamp, Presence, Proximity, etc. |
|
Event |
|
|
Classes |
70 |
Number of event classes |
|
|
Classes |
False
|
Possible values: True | False | Almost |
|
|
Annotation |
|
|
|
Type |
Weak
|
Possible values: Strong | Weak | Location | None |
|
|
|
Source |
Automatic
|
Possible values: Experts | Crowdsourced | Synthetic | Metadata | Automatic |
|
|
|
Annotations per item |
1 |
Number of annotations available per item (possible multi-annotator setup) |
|
|
|
Labelled amount (%) |
100 % |
Percentage of all data that is labelled |
|
|
|
Validated amount (%) |
0 % |
Percentage of all data that has been validated by a human |
|
|
|
Strong annotations amount (%) |
0 % |
Percentage of all data that has strong annotations |
|
|
|
Overlapping event instances |
No
|
|
|
|
Labeling |
|
|
|
Hierarchical |
No
|
|
|
|
|
Ontology name |
Yes
|
|
|
|
Instance |
|
|
|
Count |
23727 |
Count of all event instances in the dataset |
|
|
|
Average instances per class |
338.96 |
Average instance count per class
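The derived figures in the tables above can be cross-checked from the raw counts. The following is a minimal sketch that uses only the values listed here (23727 files, 70 classes, 3108 minutes); the variable names are illustrative and not part of the dataset.

```python
# Values copied from the tables above.
n_files = 23727        # total number of files (one event instance per file)
n_classes = 70         # number of event classes
total_minutes = 3108   # total duration in minutes

# Average number of instances per class, rounded to two decimals.
avg_instances_per_class = round(n_files / n_classes, 2)

# Average clip length in seconds, for comparison with the listed 0.3 - 30 sec range.
avg_clip_seconds = total_minutes * 60 / n_files

print(avg_instances_per_class)
print(round(avg_clip_seconds, 2))
```

The first value should agree with the listed average of 338.96, and the mean clip length should fall inside the listed file length range.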
Cross-validation setup |
Label |
Value |
Description |
|
|
Provided |
Yes
|
|
|
|
Folds |
1 |
|
|
|
Sets |
Train
Val
Test
|
Set types provided in the split, possible values: Train | Test | Val | Dev | Eval |
Info |
Label |
Value |
Description |
|
|
Comments |
Audio files for the validation and test sets are not distributed in the Zenodo release; only the ground truth data is provided. Because both sets are subsets of FSD50K, the omitted audio files can be obtained by downloading the FSD50K dataset. |
|
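As the comment above notes, the validation and test audio has to be taken from a local copy of FSD50K. The sketch below shows one way to match clip IDs from the ground truth data against extracted FSD50K audio folders. The function name `locate_clips`, the folder layout, and the assumption that each clip is stored as `<clip_id>.wav` are all hypothetical and should be checked against the actual downloads.

```python
from pathlib import Path


def locate_clips(clip_ids, audio_dirs, extension=".wav"):
    """Find each clip ID in the given audio directories.

    Assumes (hypothetically) that files are named "<clip_id><extension>".
    Returns a dict of found paths and a list of IDs that were not found.
    """
    found, missing = {}, []
    for clip_id in clip_ids:
        for directory in audio_dirs:
            candidate = Path(directory) / f"{clip_id}{extension}"
            if candidate.is_file():
                found[clip_id] = candidate
                break
        else:
            # No directory contained the file for this clip ID.
            missing.append(clip_id)
    return found, missing
```

Typical use would be to read the clip IDs from the validation or test ground truth file, call `locate_clips` with the directories where FSD50K was extracted, and confirm that the `missing` list is empty before training or evaluation.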