Your English writing platform
Discover LudwigSuggestions(3)
Exact(1)
Because the proceedings are translated into all of the official EU languages, they have been used to make a multilingual corpus known as Europarl.
Similar(59)
Sentences in this sense can be used to align multilingual corpora and probabilistic machine learning algorithms are able to latch onto the proto-tagging of punctuation.
In terms of methodology, rigorous attention will have to be paid to the quantitative profiles of systems across languages and, as a prerequisite to this, the development of multilingual corpora or a database of comparable registers.
The empirical evaluation results show that the proposed LSI-based MLDC technique achieves satisfactory clustering effectiveness, measured by both cluster recall and cluster precision, and is capable of maintaining a good balance between monolingual and cross-lingual clustering effectiveness when clustering a multilingual document corpus.
For each language pair, an instance of Moses is trained using Europarl [24] data and JRC-Acquis Multilingual Parallel Corpus [25].
These multilingual text corpora can be parallel: i.e. texts in different languages that are translations of each other such as those used to match English UMLS terms with their German translations [ 12] or to find French translations of MeSH terms [ 13].
The multilingual text corpora can be comparable: i.e. texts addressing the same general topic in different languages, to search for French translations of medical terms [ 14], to extend the German version of the MeSH [ 15] or to build a Japanese-French terminology [ 16].
HIT-BTEC is a Thai translated version of the multilingual Olympic-domain corpus developed by Harbin institute of technology (HIT) [22] and the Basic Travel Expression Corpus (BTEC) [23].
This article describes the MultiMedica corpus, a multilingual collection of Spanish, Japanese, and Arabic texts from the biomedical domain.
The proposed methodology of annotation can be used when creating multilingual integrated resources and parallel corpora of closely related languages.
Experimental results show the effectiveness of the proposed methods in a mono-lingual ASR setup on the Librispeech corpus and in a transfer learning setup from a multilingual ASR (MLASR) base model to a low-resourced language.
Write better and faster with AI suggestions while staying true to your unique style.
Since I tried Ludwig back in 2017, I have been constantly using it in both editing and translation. Ever since, I suggest it to my translators at ProSciEditing.

Justyna Jupowicz-Kozak
CEO of Professional Science Editing for Scientists @ prosciediting.com