    Catalogue of Language Resources

    ELRA releases free Language Resources.


    The ELRA Catalogue of Language Resources offers a repository of Language Resources (LRs) made available through ELRA.


    A growing number of LRs across the various fields of Human Language Technology (HLT) (see image on the left-hand side) are distributed on behalf of ELRA by its operational body, ELDA, thanks to contributions from members of the HLT community.

    Through this repository, we aim to spare researchers and developers the effort of rebuilding resources that already exist, and to help them identify and access those resources.

    Other resources identified, but not available through ELRA, can be viewed in the Universal Catalogue.

    If you have any suggestions or comments, or need further details about ELRA and its Catalogue of Language Resources, please refer to the Contact Us section.

    ELRA is a partner of OLAC (Open Language Archives Community). The catalogue can be viewed as an OLAC repository.

    New Resources
  • ELRA-W0129 : Arbobanko (Esperanto Treebank)
    The Esperanto Arbobanko Treebank is a
    52,000-token dependency treebank of
    Esperanto with texts from the MONATO
    news magazine, consisting of random
    excerpts from the period 2000-2010. All
    words were annotated for lemma,
    part-of-speech, inflection, compounding
    and affixing, syntactic function,
    dependency links, NER types, semantic
    types of nouns and adjectives, and verb
    frame categories.

  • ELRA-M0052 : EnToFrNE - a Parallel English-French Lexicon of Named Entities
    This lexicon consists of 1,167,263
    parallel named entities in English and
    French. The tags used are: PERSON,
    ORGANIZATION, LOCATION, PRODUCT and
    MISC. The lexicon comes in two formats:
    CSV and XML.

  • ELRA-T0378 : English-Persian database of idioms and expressions
    This database consists of about 30,000
    bilingual parallel sentences and phrases
    in English and Persian (15,000 in each
    language). It comes with software that
    lets users search for a word, phrase or
    chunk and retrieve all idioms and
    expressions related to the query. The
    database is provided in Access format,
    and the software runs on Windows
    systems.

  • ELRA-T0379 : English-Persian terminology database of computer and IT
    This bilingual terminology database
    consists of around 25,000 terms in the
    fields of computer engineering, computer
    science and information technology. It
    comes with software that lets users
    search for a word, phrase or chunk and
    retrieve all entries related to the
    query. The database is provided in
    Access format, and the software runs on
    Windows systems.

  • ELRA-T0380 : English-Persian terminology database of management and economics
    This bilingual terminology database
    consists of around 15,000 terms in the
    fields of management and economics. It
    comes with software that lets users
    search for a word, phrase or chunk and
    retrieve all entries related to the
    query. The software's main database is
    provided in Access format, and the
    software itself runs on Windows systems.

    (last update: January 2020)

    Copyright © 2008 ELRA
    ELRACatalogue 0.8.0