    Catalog Reference: ELRA-S0412
    Japanese Kids Speech database (Upper Grade)
    The Japanese Kids Speech database (Upper Grade) contains recordings of 232 Japanese child speakers (104 male and 128 female) aged 9 to 13 (fourth, fifth and sixth graders in elementary school), recorded in quiet rooms using smartphones. This database may be combined with the Japanese Kids Speech database (Lower Grade), also available in the ELRA Catalogue under reference ELRA-S0411.

    The number of speakers, number of utterances, age range and total duration are as follows:
    Number of speakers: 232 (104 male, 128 female)
    Number of utterances (average): 385 utterances per speaker
    Total number of utterances: 89,454
    Age: 9 to 13 years old
    Total hours of data: 145.4

    1,018 sentences were used. Recordings were made with smartphones, and the audio data is stored in .wav files as 16 kHz, 16-bit, mono linear PCM.

    Database:
    ・Audio data: WAV format, 16 kHz, 16-bit, mono (recorded with smartphone)
    ・Recording scripts: TSV format (tab-delimited), UTF-8 (without BOM)
    ・Transcription data: TSV format (tab-delimited), UTF-8 (without BOM)
    ・Size: 16.2 GB
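
As a minimal sketch, the audio format stated above (16 kHz, 16-bit, mono) can be checked per file with Python's standard `wave` module. The function names and the `EXPECTED` dictionary are illustrative and not part of the database distribution.

```python
import wave

# Format stated in the catalogue entry: mono, 16-bit (2 bytes), 16 kHz.
EXPECTED = {"channels": 1, "sample_width_bytes": 2, "frame_rate_hz": 16000}


def wav_properties(path: str) -> dict:
    """Read channel count, sample width and sample rate from a WAV header."""
    with wave.open(path, "rb") as wav_file:
        return {
            "channels": wav_file.getnchannels(),
            "sample_width_bytes": wav_file.getsampwidth(),
            "frame_rate_hz": wav_file.getframerate(),
        }


def matches_spec(path: str) -> bool:
    """Return True if the file has the documented audio format."""
    return wav_properties(path) == EXPECTED
```

Running `matches_spec` over every file is a cheap way to catch a damaged or resampled file before training or evaluation.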

    Number of speakers per age:
    9 years old: 56 (21 male, 35 female)
    10 years old: 71 (30 male, 41 female)
    11 years old: 65 (28 male, 37 female)
    12 years old: 38 (24 male, 14 female)
    13 years old: 2 (1 male, 1 female)

    Structure of database:
    ├─ readme.txt
    ├─ Japanese Kids Speech Database.pdf    Description document of the database
    ├─ Transcription.tsv                    Transcription
    ├─ scripts.tsv                          Script
    └─ voices/                              directory of audio data
       ├─ high/                             directory of upper grade
       │  └─ (speaker_ID/)                  directory of speaker ID (six digits)
       │     └─ (audio_file)                audio file (WAV format, 16 kHz, 16-bit, mono)
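
The layout above can be traversed with a short script. This is a sketch that assumes the audio files carry a `.wav` extension and sit directly inside each speaker directory under `voices/high/`. The function name `utterances_per_speaker` is hypothetical.

```python
from collections import Counter
from pathlib import Path


def utterances_per_speaker(corpus_root: str) -> Counter:
    """Count WAV files in each speaker directory under voices/high/."""
    high_dir = Path(corpus_root) / "voices" / "high"
    counts: Counter = Counter()
    # One directory level for the speaker ID, then the audio files themselves.
    for wav_path in high_dir.glob("*/*.wav"):
        counts[wav_path.parent.name] += 1
    return counts
```

The resulting counts can be compared with the figures given in the description (232 speakers, 89,454 utterances in total) to confirm that a copy of the data is complete.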

    File naming conventions of audio files are as follows:
    Field number | Contents | Description | Remarks
    0 | Language ID | “JA” (fixed) | Japanese
    1 | Speaker ID | Six digits | 5XXXXX
    2 | Script ID | HXXXX | XXXX: four digits
    3 | Age | Two digits
    4 | Gender | M: male, F: female

    The field separator character is “_”.
    For example, if the audio file name is “JA_500002_H0001_10_F.wav”, the fields have the following meaning:
    JA: Language ID (Japanese)
    500002: speaker ID
    H0001: script ID
    10: age (ten years old)
    F: gender (female)
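
The naming convention above lends itself to a small parser. The following is a sketch rather than an official tool: `UtteranceName` and `parse_audio_name` are illustrative names, and the pattern assumes every file follows the five-field layout exactly, with a `.wav` extension and any six digits accepted as the speaker ID.

```python
import re
from typing import NamedTuple


class UtteranceName(NamedTuple):
    language: str    # field 0, fixed "JA"
    speaker_id: str  # field 1, six digits
    script_id: str   # field 2, "H" followed by four digits
    age: int         # field 3, two digits
    gender: str      # field 4, "M" or "F"


# Fields are separated by "_" as described in the naming table.
_NAME_RE = re.compile(r"^(JA)_(\d{6})_(H\d{4})_(\d{2})_([MF])\.wav$")


def parse_audio_name(file_name: str) -> UtteranceName:
    """Split an audio file name into its five documented fields."""
    match = _NAME_RE.match(file_name)
    if match is None:
        raise ValueError(f"unexpected audio file name: {file_name!r}")
    language, speaker_id, script_id, age, gender = match.groups()
    return UtteranceName(language, speaker_id, script_id, int(age), gender)


print(parse_audio_name("JA_500002_H0001_10_F.wav"))
```

Keeping the speaker ID and script ID as strings preserves any leading zeros, while the age is converted to an integer so that files can be filtered or grouped by age.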

    Members Prices
    Academic - Commercial 75000.00 EUR
    Academic - Research 12500.00 EUR
    Commercial - Commercial 75000.00 EUR
    Commercial - Research 50000.00 EUR
    Non Member Prices
    Academic - Commercial 75000.00 EUR
    Academic - Research 12500.00 EUR
    Commercial - Commercial 75000.00 EUR
    Commercial - Research 50000.00 EUR