Home Catalogue
Language Resources
Bug reports
Send us your bug reports.
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Anglais Français
  • Purchase procedure & Conditions

  • Pricing & user licences

  • How to promote your resources ?

  • Contact Us
  • Catalog Reference : ELRA-W0125
    TRAD Chinese-French Parallel Text - Blog
    TRAD Chinese-French Parallel Text - Blog was developed by ELDA as part of the PEA-TRAD project. It contains English translations of a subset of approximately 10,000 Chinese words from GALE Phase 1 Chinese Blog Parallel Text (available here: https://catalog.ldc.upenn.edu/LDC2008T06).

    The PEA-TRAD project (Translation as a Support for Document Analysis) was supported by the French Ministry of Defense (DGA). Its purpose was to develop speech-to-speech translation technology for multiple languages (e.g., Arabic, Chinese, Pashto) from a variety of domains.

    This release consists of 444 segments (translations units) from 17 documents. The source data is Chinese blog text collected and translated into English by LDC for the DARPA GALE (Global Autonomous Language Exploitation) program. Information about the ELDA translation team, translation guidelines and validation results is contained in the documentation accompanying this release.

    The Chinese source file contains 15,809 characters and the French reference translation contains 11,769 words. The data is presented in two unicode-encoded XML files along with an associated DTD.

    Click here for licensing instructions: https://catalog.ldc.upenn.edu/LDC2018T02.

    ISLRN : 713-266-631-883-0
    Project : PEA TRAD
    Technical Information
    Distribution medium : Downloadable
    Contents Click on the arrow to display content.
    written corpus 

    Copyright © 2008 ELRA
    ELRACatalogue 0.8.0