The CLASSLA-Stanza model for lemmatisation of standard Macedonian 2.1
The model for lemmatisation of standard Macedonian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla-stanfordnlp) by training on the 1984 training corpus expanded with the Macedonian SETimes corpus (to be published). The estimated F1 of the lemma annotations is ~98.81.
The difference from the previous version is that this version was trained using a larger training dataset.