ELRA ELRA
  Home Catalogue
Language Resources
Bug reports
Send us your bug reports.
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Languages
Anglais Français
Informations
  • Purchase procedure & Conditions

  • Pricing & user licences

  • How to promote your resources ?

  • Contact Us
  • Catalogue of Language Resources

    ELRA releases free Language Resources.


    The ELRA Catalogue of Language Resources offers a repository of Language Resources (LRs) made available through ELRA.


    (See full-size image)

    An increasing number of LRs in the various fields of Human Language Technology (see image on the left-hand side) are distributed on behalf of ELRA via its operational body ELDA, thanks to the contribution of various players of the HLT community.

    Our aim is to provide Language Resources, by means of this repository, so as to prevent researchers and developers from investing efforts to rebuild resources which already exist as well as help them identify and access those resources.

    Other resources identified, but not available through ELRA, can be viewed in the Universal Catalogue.

    If you have any suggestions or comments, or need any further details about ELRA and its Catalogue of Language Resources, please refer to the contact us section.

    ELRA is a partner of OLAC (Open Language Archives Community). The catalogue can be viewed as an OLAC repository.

    New Resources
  • ELRA-S0411 : Japanese Kids Speech database (Lower Grade)
    The Japanese Kids Speech database (Lower
    Grade) contains the total recordings of
    179 Japanese Kids speakers (71 males and
    108 females), from 6 to 9 years' old
    (first, second and third graders in
    elementary school), recorded in quiet
    rooms using smartphones. 1019 sentence
    were used. Recordings were made through
    smartphones and audio data stored in
    .wav files as sequences of 16KHz Mono,
    16 bits, Linear PCM.

  • ELRA-S0412 : Japanese Kids Speech database (Upper Grade)
    The Japanese Kids Speech database (Upper
    Grade) contains the total recordings of
    232 Japanese Kids speakers (104 males
    and 128 females), from 9 to 13 years’
    old (fourth, fifth and sixth graders in
    elementary school), recorded in quiet
    rooms using smartphones. 1018 sentences
    were used. Recordings were made through
    smartphones and audio data stored in
    .wav files as sequences of 16KHz Mono,
    16 bits, Linear PCM.

  • ELRA-S0410 : CAREGIVER Corpus
    A multi-lingual speech corpus used for
    modeling language acquisition called
    CAREGIVER has been designed and recorded
    within the framework of the EU funded
    Acquisition of Communication and
    Recognition Skills (ACORNS) project. The
    corpus contains nearly 66,000
    utterance-based audio files spoken over
    a two-year period by 16 male and 14
    female native speakers of Dutch, UK
    English, and Finnish. An orthographic
    transcription is available for every
    utterance. Also, time-aligned word and
    phone annotations for some of the
    sub-corpora exist.

  • ELRA-S0409-01 : MDT Mandarin Chinese Conversational Recognition Corpus – Complete set
    This dataset consists of 4.98 hours of
    transcribed conversational speech in
    Mandarin Chinese, where 30 conversations
    are uttered by 32 speakers (16 males and
    16 females). The audios are sampled at
    16 kHz and quantized at 16 bits.

  • ELRA-S0409-02 : MDT Mandarin Chinese Conversational Recognition Corpus – 1 channel
    This dataset consists of 4.98 hours of
    transcribed conversational speech in
    Mandarin Chinese, where 30 conversations
    are uttered by 32 speakers (16 males and
    16 females). The audios are sampled at
    16 kHz and quantized at 16 bits.

  • (last update: December 2020)
    145 - Table './catalog_elra/counter' is marked as crashed and should be repaired

    select startdate, counter from counter

    [TEP STOP]