  • Catalogue of Language Resources

    ELRA releases free Language Resources.


    The ELRA Catalogue of Language Resources offers a repository of Language Resources (LRs) made available through ELRA.


    An increasing number of LRs in the various fields of Human Language Technology (HLT) are distributed on behalf of ELRA via its operational body ELDA, thanks to contributions from various players in the HLT community.

    Through this repository, we aim to spare researchers and developers the effort of rebuilding resources that already exist, and to help them identify and access those resources.

    Other resources identified, but not available through ELRA, can be viewed in the Universal Catalogue.

    If you have any suggestions or comments, or need further details about ELRA and its Catalogue of Language Resources, please refer to the Contact Us section.

    ELRA is a partner of OLAC (Open Language Archives Community). The catalogue can be viewed as an OLAC repository.

    New Resources
  • ELRA-M0052 : EnToFrNE - a Parallel English-French Lexicon of Named Entities
    This lexicon consists of 1,167,263
    parallel named entities in English and
    French. The tags used are: PERSON,
    ORGANIZATION, LOCATION, PRODUCT and
    MISC. The lexicon comes in two formats:
    csv and xml.

  • ELRA-T0378 : English-Persian database of idioms and expressions
    This database consists of about 30,000
    bilingual parallel sentences and phrases
    in English and Persian (15,000 in each
    language). It comes with software that
    lets users search for a word, phrase or
    chunk and retrieve all idioms and
    expressions related to the query. The
    database is provided in Access format
    and the software runs on Windows
    systems.

  • ELRA-T0379 : English-Persian terminology database of computer and IT
    This bilingual terminology consists of
    around 25,000 terms in the fields of
    computer engineering, computer science
    and information technology. It comes
    with software that lets users search
    for a word, phrase or chunk and
    retrieve all entries related to the
    query. The database is provided in
    Access format and the software runs on
    Windows systems.

  • ELRA-T0380 : English-Persian terminology database of management and economics
    This bilingual terminology consists of
    around 15,000 terms in the fields of
    management and economics. It comes with
    software that lets users search for a
    word, phrase or chunk and retrieve all
    entries related to the query. The main
    database of the software is provided in
    Access format and the software itself
    runs on Windows systems.

  • ELRA-S0406 : Glissando-sp
    Glissando-sp includes more than 12 hours
    of speech in Spanish, recorded under
    optimal acoustic conditions,
    orthographically transcribed,
    phonetically aligned and annotated with
    prosodic information (location of the
    stressed syllables and prosodic
    phrasing). The corpus was recorded by 8
    professional and 20 non-professional
    speakers. The professionals comprise 4
    “news broadcaster” speakers (2 male and
    2 female) and 4 “advertising” speakers
    (2 male and 2 female); the
    non-professionals comprise 10 male and
    10 female speakers.
    Glissando-sp comprises three
    subcorpora: readings of real news texts
    (provided by the “Cadena Ser” radio
    station), goal-oriented interactions
    between two speakers in the domain of
    information requests, and conversations
    between people who have some degree of
    familiarity with each other.

  • (last update: October 2019)

    Copyright © 2008 ELRA