CLARIN Tool Portal

Active filters:

Keywords: language model
Language: Macedonian

5 record(s) found

Search results

The CLASSLA-StanfordNLP model for lemmatisation of standard Macedonian 1.0

2 resources

The model for lemmatisation of standard Macedonian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the 1984 training corpus (to be published). The estimated F1 of the lemma annotations is ~99.1.

Use "The CLASSLA-StanfordNLP model for lemmatisation of standard Macedonian 1.0"
The CLASSLA-Stanza model for lemmatisation of standard Macedonian 2.1

2 resources

The model for lemmatisation of standard Macedonian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the 1984 training corpus expanded with the Macedonian SETimes corpus (to be published). The estimated F1 of the lemma annotations is ~98.81. The difference from the previous version is that this version was trained using a larger training dataset.

Use "The CLASSLA-Stanza model for lemmatisation of standard Macedonian 2.1"
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.1

3 resources

This model for morphosyntactic annotation of standard Macedonian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the 1984 training corpus (to be published) and using the Macedonian CLARIN.SI word embeddings (http://hdl.handle.net/11356/1359). The model produces simultaneously UPOS, FEATS and XPOS (MULTEXT-East) labels. The estimated F1 of the XPOS annotations is ~97.6. The difference to the previous version of the model is that the pre-trained embeddings are limited to 250 thousand entries and adapted to the new code base.

Use "The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.1"
The CLASSLA-Stanza model for morphosyntactic annotation of standard Macedonian 2.1

3 resources

This model for morphosyntactic annotation of standard Macedonian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla) by training on the 1984 training corpus expanded with the Macedonian SETimes corpus (to be published) and using the Macedonian CLARIN.SI word embeddings (http://hdl.handle.net/11356/1788). The model produces simultaneously UPOS, FEATS and XPOS (MULTEXT-East) labels. The estimated F1 of the XPOS annotations is ~97.14. The difference from the previous version is that this version was trained using a larger training dataset and the new version of the Macedonian word embeddings.

Use "The CLASSLA-Stanza model for morphosyntactic annotation of standard Macedonian 2.1"
The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.0

3 resources

This model for morphosyntactic annotation of standard Macedonian was built with the CLASSLA-StanfordNLP tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the 1984 training corpus (to be published) and using the Macedonian CLARIN.SI word embeddings (http://hdl.handle.net/11356/1359). The model produces simultaneously UPOS, FEATS and XPOS (MULTEXT-East) labels. The estimated F1 of the XPOS annotations is ~97.6.

Use "The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.0"

Result filters

Metadata provider

Language

Resource type

Tool task

Availability

Project

Keywords

Active filters:

Search results

The CLASSLA-StanfordNLP model for lemmatisation of standard Macedonian 1.0

The CLASSLA-Stanza model for lemmatisation of standard Macedonian 2.1

The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.1

The CLASSLA-Stanza model for morphosyntactic annotation of standard Macedonian 2.1

The CLASSLA-StanfordNLP model for morphosyntactic annotation of standard Macedonian 1.0