Send us your bug reports.
Use keywords to find the product you are looking for.
Purchase procedure & Conditions
Pricing & user licences
How to promote your resources ?
Catalog Reference : ELRA-S0395
Nautilus Speaker Characterization (NSC) Corpus
The Nautilus Speaker Characterization (NSC) Corpus comprises clean microphone recordings of conversational speech from 300 German speakers (126 males and 174 females) aged 18 to 35 years, with no marked dialect/accent. The recordings were performed in the acoustically-isolated room "Nautilus" (which gives name to this database) of the Quality and Usability Lab of the Technische Universität Berlin, Germany, in 2016/2017.
Four scripted and four semi-spontaneous dialogs were elicited from the speakers, simulating telephone call inquiries. Additionally, spontaneous neutral and emotional (predominantly excitement or frustration) speech utterances and questions were produced. Interactions between speakers and their interlocutor (a recording assistant) are provided in separate mono files, accompanied by timestamps and tags that define the speaker's turns. All speech is sampled at 48 kHz (audio/wav 16-bit 1-channel files). The microphone AKG C 414B-XLS was employed to record the speakers (95.6 hours of speech) and the headset Sennheiser HMD 46 to record the interlocutor (59.5 hours of speech).
The speech corresponding to one of the semi-spontaneous dialogs was later evaluated with respect to 34 continuous numeric labels of perceived interpersonal speaker characteristics (such as likable, attractive, competent, childish, etc.), for the 300 speakers by 15 different listeners, on average. For a set of 20 selected "extreme" speakers, also 34 naive voice descriptions (such as bright, creaky, articulate, melodious, etc.) were evaluated by 26 external raters.
All labels are provided, together with the speech recordings and the speakers' metadata (age, gender, place of birth, chronological places of residence and duration of stay, place of birth of the mother and of the father, self-assessed personality, etc.).
The material provided in the NSC Corpus is expected to be of broad interest to phoneticians and speech scientists working on the perceptual and acoustic correlates of personal attributes. Speech and prosody production and conversational behavior in human-human interactions can be studied by analyzing speaker's and interlocutor's turns of spontaneous speech. The NSC data may also be adequate for other speech-related research requiring high-quality clean recordings in German.
Period of coverage :
Version history :
Creation date :
Distribution medium :
Click on the arrow to display content.
TEXT_DURATION95.6 hours of speech for speakers and 59.5 hours of speech for interlocutors
TEXT_RECORDING_CHANNELSmicrophone AKG C 414B-XLS for speakers and headset Sennheiser HMD 46 for interlocutor
Source Channel :
TEXT_RECORDING_ENVIRONMENTThe recordings were performed in the acoustically-isolated room \"Nautilus\" (which gives name to this database) of the Quality and Usability Lab of the Technische Universität Berlin, Germany
TEXT_TASKtelephone call inquiries
Academic - Research Free
Non Member Prices
Academic - Research Free
Tuesday 19 February, 2019
24600446 requests since Monday 27 September, 2004
Copyright © 2008