Active filters:

  • Project: MEZZANINE
  • Tool task: Word embeddings
1 record(s) found

Search results

  • Word embeddings CLARIN.SI-embed.mk 2.0

    CLARIN.SI-embed.mk contains word embeddings induced from a large collection of Macedonian texts crawled from the .mk top-level domain. The embeddings were trained with the fastText skip-gram model on 933,231,582 tokens of running text and cover 986,670 lowercased surface forms. This version differs from the previous one in that the original training dataset was expanded with the MaCoCu-mk web crawl corpus (http://hdl.handle.net/11356/1512).
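
    As an illustration of how embeddings of this kind are typically consumed, the following is a minimal sketch, assuming the vectors are available in the fastText text format (a "count dimension" header line followed by one "word v1 ... vN" line per surface form). The record above does not state the file format or file name, so the format, the helper names (load_vec, lookup, cosine) and the toy three-dimensional vectors are all assumptions made for the example, not part of the resource.

```python
import io
import math
from typing import Dict, Iterable, List, Optional


def load_vec(lines: Iterable[str]) -> Dict[str, List[float]]:
    """Parse fastText text-format vectors from an iterable of lines.

    Assumed layout: first line is "<count> <dim>", every following line is
    "<word> <v1> ... <vdim>". Lines with an unexpected length are skipped.
    """
    it = iter(lines)
    _count, dim = (int(x) for x in next(it).split())
    vectors: Dict[str, List[float]] = {}
    for line in it:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # malformed or empty line
        vectors[parts[0]] = [float(v) for v in parts[1:]]
    return vectors


def lookup(vectors: Dict[str, List[float]], word: str) -> Optional[List[float]]:
    """Look up a word after lowercasing, since the surface forms are lowercased."""
    return vectors.get(word.lower())


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy data standing in for the real file; the numbers are invented.
SAMPLE = "2 3\nскопје 1.0 1.0 0.0\nбитола 1.0 0.0 0.0\n"

vectors = load_vec(io.StringIO(SAMPLE))
similarity = cosine(lookup(vectors, "Скопје"), lookup(vectors, "Битола"))

# With a real file (hypothetical name), the same loader would be used as:
#   with open("embeddings.vec", encoding="utf-8") as fh:
#       vectors = load_vec(fh)
```

    Lowercasing the query before lookup mirrors the description's note that the vocabulary consists of lowercased surface forms; a mixed-case query would otherwise miss.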