NAR


Label				Value	Description
General information
	Name			NAR	Full dataset name
	ID			sounds/inria_nar	Datalist id for external indexing
	Abbreviation			NAR	Official dataset abbreviation, e.g. one used in the original paper
	Provider			INRIA
	Year			2014	Dataset release year
	Modalities			Audio	Data modalities included in the dataset
	Collection name			NAR	Common name for all related datasets, used to group datasets coming from same source
	Research domain			Tagging, Weak annotation, Isolated sounds	Related domains, e.g., Scenes, Mobile devices, Audio-visual, Open set, Ambient noise, Unlabelled, Multiple sensors, SED, SELD, Tagging, FL, Strong annotation, Weak annotation, Multi-annotator
	License			Free
	Download			Download (35MB)
	Companion site			Site	Link to the companion site for the dataset
	Citation			[Janvier2014] Sound Representation and Classification Benchmark for Domestic Robots
Audio
Label				Value	Description
	Data
		Data type		Audio	Possible values: Audio | Features
		File format
			File format type	Constant	Possible values: Constant | Variable
			File format	wav	Possible values: wav | aiff | flac | mp3 | aac | ogg
			Lossy compression	No	Whether the audio is compressed in a lossy manner
			Bit depth	16	Bit depth of audio, possible values: 8 | 16 | 24 | 32
			Sampling rate (kHz)	48 kHz	Sampling rate in kHz, possible values: 8 | 16 | 22.05 | 32 | 44.1 | 48
		Channels
			Setup	Mono	Possible values: Mono | Stereo | Binaural | Ambisonic | Array | Multi-Channel | Variable
			Number of channels	1
		Material
			Source	Original	Possible values: Original | Youtube | Freesound | Online | Crowdsourced | [Dataset name]
	Content
		Content type		Isolated	Possible values: Freefield | Synthetic | Isolated
		Scene		Constant	Whether the scene class is constant within a single recording, possible values: Constant | Variable
	Recording
		Setup		Uncontrolled	Possible values: Near-field | Far-field | Mixed | Uncontrolled | Unknown
		Setup count		1	Number of different recording setups (microphone and recording device) used
		Spot type		Fixed	Possible values: Fixed | Moving | Unknown
	Files
		Count		852 files	Total number of files
		Total duration (minutes)		8 min	Total duration of the dataset in minutes
		File length		Variable	Characterization of the file lengths, possible values: Constant | Quasi-constant | Variable
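
The format listed in the Audio table (wav, 16-bit, 48 kHz, mono) can be checked against a downloaded file. The following is a minimal sketch using only Python's standard `wave` module; the function name `check_wav_format` and the `EXPECTED` dictionary are illustrative assumptions and not part of any tooling shipped with the dataset.

```python
import wave

# Expected values copied from the Audio table above. The names used here
# are hypothetical helpers, not part of the NAR distribution.
EXPECTED = {"channels": 1, "sample_width_bytes": 2, "sample_rate_hz": 48000}


def check_wav_format(path, expected=EXPECTED):
    """Return {field: (found, expected)} for every field that differs."""
    with wave.open(path, "rb") as wav:
        found = {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),  # 2 bytes = 16 bit
            "sample_rate_hz": wav.getframerate(),
        }
    return {
        key: (found[key], value)
        for key, value in expected.items()
        if found[key] != value
    }
```

An empty dictionary means the file matches the listed format; any entry shows the value found next to the value expected.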
Meta
Label				Value	Description
	Types			Tag	List of metadata types provided for the data, possible values: Event, Tag, Scene, Caption, Geolocation, Spatial location, Annotator, Timestamp, Presence, Proximity, etc.
	Scene
		Classes		2	Number of scene classes
		Balanced		False	Whether the classes are balanced, possible values: True | False | Almost
		Classes		Kitchen, Office	List of scene classes
	Event
		Classes		42	Number of event classes
		Balanced		Almost	Whether the classes are balanced, possible values: True | False | Almost
		Classes		choking, close microwave, cuttlery, door close, door open, eating, fill a glass, fingerclap, fridge, handclap, key, knock, microwave, move a chair, open microwave, open/close a drawer, ripped paper, running the tap, speech, toaster, tongue clic, zip	List of event classes
		Annotation
			Type	Weak	Possible values: Strong | Weak | Location | None
			Annotations per item	1	How many annotations there are available per item (possible multi-annotator setup)
			Labelled amount (%)	100 %	Percentage of all data that is labelled
			Strong annotations amount (%)	0 %	Percentage of all data that has strong annotations
			Overlapping event instances	No
		Labeling
			Hierarchical	Yes
		Instance
			Count	852	Count of all event instances in the dataset
			Average instances per class	20.2857142857	Average per class instance count
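
The average per class instance count is the total instance count divided by the number of classes; with the totals listed above (852 instances, 42 classes) that ratio is 852 / 42, about 20.3. The sketch below computes the same statistic from a list of tags. The helper name `class_statistics` and the toy labels are assumptions for illustration only and do not reflect the real tag distribution.

```python
from collections import Counter


def class_statistics(labels):
    """Return (per-class counts, average instances per class)."""
    counts = Counter(labels)
    # Guard against an empty label list to avoid division by zero.
    average = len(labels) / len(counts) if counts else 0.0
    return counts, average


# Toy labels, not the real tag list of the dataset.
labels = ["knock", "knock", "zip", "key", "key", "key"]
counts, average = class_statistics(labels)
print(counts.most_common())
print(f"average instances per class: {average:.2f}")
```

Comparing the per-class counts with the average gives a quick view of how balanced the classes are.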
