Ideas
Partners
Consultants
Calls
EU Projects
Blog NEW
- Europe towards City Innovation Hubs
  
  Written by Matteo Satta EU funding has always helped cities support their innovation processes, particularly on digital transformation and environment, but the approach is now moving toward a new ambitious […]
- Mistakes that Cities Should Be Avoiding in EU Projects
  
  By Matteo Satta Nowadays, we are seeing an increasing trend of people requesting to lower taxes and, as a result, public budgets. This leads many cities to try to find […]
- How a city can boost your Smart City project
  
  By Matteo Satta The commission has been heavily investing in pilot Smart City projects for a long time, but you may often have R&D or innovation projects pretending to have […]
- How to involve a city in your project? Here are a few tips.
  
  Article written by Matteo Satta (Smart) City projects are more and more funded by the European Commission, but consortia often struggle to get cities on board or really engage them. […]
- >> See all posts
Log In Sign Up

Want to see this project on homepage?
Propose a Picture

Multilingual Lexicon Extraction from Comparable Corpora (MULTILEX)
Start date: Sep 1, 2014, End date: Aug 31, 2018 PROJECT FINISHED

"Given large collections of parallel (i.e. translated) texts, it is well-known how to, by successively applying a sentence- and aword-alignment step, establish correspondences between words across languages. However, parallel texts are a scarceresource for most language pairs involving lesser-used languages. On the other hand, human second language acquisitionseems not to require the reception of large amounts of translated texts, which indicates that there must be another way ofcrossing the language barrier. Apparently, the human capabilities are based on looking at comparable resources, i.e. textsor speech on related topics in different languages, which, however, are not translations of each other. Comparable (writtenor spoken) corpora are far more common than parallel corpora, thus offering the chance to overcome the data acquisitionbottleneck. Despite its cognitive motivation, in the proposed project we will not attempt to simulate the complexities ofhuman second language acquisition, but will show that it is possible by purely technical means to automatically extractinformation on word- and multiword-translations from comparable corpora. The aim is to push the boundaries of currentapproaches, which typically utilize correlations between co-occurrence patterns across languages, in several ways: 1)Eliminating the need for initial lexicons by using a bootstrapping approach which only requires a few seed translations. 2)Implementing a new methodology which first establishes alignments between comparable documents across languages,and then computes cross-lingual alignments between words and multiword-units. 3) Improving the quality of computed wordtranslations by applying an interlingua approach, which, by relying on several pivot languages, allows a highly effectivemulti-dimensional cross-check. 4) We will show that, by looking at foreign citations, language translations can even bederived from a single monolingual text corpus."

Promote here

Your project activity

European Projects, Clusters and Open Calls

Open Calls and Tenders
A privileged space to disseminate your open Call to the Up2Europe community
European Projects
Consortium looking for a way to reach a wider audience for their activities
Consulting Firms
Promote your consulting services to the right target

Coordinator

JOHANNES GUTENBERG-UNIVERSITAT MAINZ

€ 100 000,00

Sascha Hofmann
SAARSTRASSE 21 55099 MAINZ (Germany)

Details

100% € 100 000,00
FP7-PEOPLE
Project on CORDIS Platform

Search for European Projects

Drop Images Here
Or click to add/replace

Multilingual Lexicon Extraction from Comparable Corpora (MULTILEX)
Start date: Sep 1, 2014, End date: Aug 31, 2018 PROJECT FINISHED

Promote here

Your project activity

European Projects, Clusters and Open Calls

Open Calls and Tenders
A privileged space to disseminate your open Call to the Up2Europe community

European Projects
Consortium looking for a way to reach a wider audience for their activities

Consulting Firms
Promote your consulting services to the right target

Coordinator

JOHANNES GUTENBERG-UNIVERSITAT MAINZ

Details

Search for European Projects

Multilingual Lexicon Extraction from Comparable Corpora (MULTILEX) Start date: Sep 1, 2014, End date: Aug 31, 2018 PROJECT FINISHED

Get Access to the 1st Network for European Cooperation Log In or Create an account to see this content

Promote here Your project activity European Projects, Clusters and Open Calls

Open Calls and Tenders A privileged space to disseminate your open Call to the Up2Europe community

European Projects Consortium looking for a way to reach a wider audience for their activities

Consulting Firms Promote your consulting services to the right target

Coordinator

JOHANNES GUTENBERG-UNIVERSITAT MAINZ

Details

Multilingual Lexicon Extraction from Comparable Corpora (MULTILEX)
Start date: Sep 1, 2014, End date: Aug 31, 2018 PROJECT FINISHED

Get Access to the 1st Network for European Cooperation
Log In

or

Create an account

to see this content

Promote here

Your project activity

European Projects, Clusters and Open Calls

Open Calls and Tenders
A privileged space to disseminate your open Call to the Up2Europe community

European Projects
Consortium looking for a way to reach a wider audience for their activities

Consulting Firms
Promote your consulting services to the right target