CLARIN Tool Portal

Active filters:

Tool task: Machine translation
Keywords: NeMo

2 record(s) found

Search results

Neural Machine Translation model for Slovene-English language pair RSDO-DS4-NMT 1.2.6

3 resources

This Neural Machine Translation model for Slovene-English language pair was trained following the NVIDIA NeMo NMT AAYN recipe (for details see the official NVIDIA NeMo NMT documentation, https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/nlp/machine_translation/machine_translation.html, and NVIDIA NeMo GitHub repository https://github.com/NVIDIA/NeMo). It provides functionality for translating text written in Slovene language to English and vice versa. The training corpus was built from publicly available datasets, including Parallel corpus EN-SL RSDO4 1.0 (https://www.clarin.si/repository/xmlui/handle/11356/1457), as well as a small portion of proprietary data. In total the training corpus consisted of 32.638.758 translation pairs and the validation corpus consisted of 8.163 translation pairs. The model was trained on 64GPUs and on the validation corpus reached a SacreBleu score of 48.3191 (at epoch 37) for translation from Slovene to English and a SacreBleu score of 53.8191 (at epoch 47) for translation from English to Slovene.

Use "Neural Machine Translation model for Slovene-English language pair RSDO-DS4-NMT 1.2.6"
NeMo Neural Machine Translation service RSDO-DS4-NMT-API 1.0

2 resources

Neural Machine Translation service for NeMo AAYN Base models. For more details about building such models, see the official NVIDIA NeMo documentation (https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/nlp/machine_translation/machine_translation.html) and NVIDIA NeMo GitHub (https://github.com/NVIDIA/NeMo). A model for language pair SL-EN can be downloaded from http://hdl.handle.net/11356/1736. The service accepts the source language and target language, and either a single string or list of strings to be translated. The result will be in the same format as the request, either as a single string or list of strings. The maximal accepted text length is 5000c. Note that transcription of one 5000c text block on cpu will take advantage of all available cores, consume up to 3GB RAM and may take ~200s (on a system with 24 vCPU). See the service README.md for further details.

Use "NeMo Neural Machine Translation service RSDO-DS4-NMT-API 1.0"

Result filters

Metadata provider

Language

Resource type

Tool task

Project

Keywords

Active filters:

Search results

Neural Machine Translation model for Slovene-English language pair RSDO-DS4-NMT 1.2.6

NeMo Neural Machine Translation service RSDO-DS4-NMT-API 1.0