Cooperation for Arabic Language Resources and Tools – The MEDAR Project

Bente Maegaard, Mohamed Attia, Khalid Choukri, Olivier Hamon, Steven Krauwer, Mustafa Yaseen

2 Citations (Scopus)

Abstract

The paper describes some of the work carried out within the European-funded project MEDAR. The project has three streams of activity: the technical stream, the cooperation stream and the dissemination stream. MEDAR first updated the existing surveys and the BLARK for Arabic, after which the technical stream focused on machine translation (MT). The consortium identified a number of freely available MT systems and then customized two versions of the well-known MOSES package. The consortium also addressed the need to package MOSES for English to Arabic (while the main MT stream is on Arabic to English). For performance assessment, the partners produced test data that allowed an evaluation campaign to be carried out with 5 different systems (including some from outside the consortium) and two online ones. Both the MT baselines and the collected data will be made available via the ELRA catalogue. The cooperation stream focuses mostly on a cooperation roadmap for the region, directed towards Human Language Technologies (HLT) for Arabic in general. The purpose of the roadmap is to outline areas and priorities for collaboration, both between EU countries and Arabic-speaking countries and more generally: between countries, between universities, and, last but not least, between universities and industry.

Original language: English
Title: Proceedings of the Seventh International Conference on Language Resources and Evaluation
Editors: Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias
Number of pages: 5
Place of publication: Valletta, Malta
Publisher: European Language Resources Association
Publication date: 2010
Pages: 2863-2867
ISBN (Electronic): 2-9517408-6-7
Status: Published - 2010
Event: The 7th International Conference on Language Resources and Evaluation 2010 - Valletta, Malta
Duration: 19 May 2010 to 21 May 2010
Conference number: 7

Conference

Conference: The 7th International Conference on Language Resources and Evaluation 2010
Number: 7
Country/Territory: Malta
City: Valletta
Period: 19/05/2010 to 21/05/2010

Keywords

  • Faculty of Humanities
  • language resources, language tools, Arabic
