Result filters

Metadata provider

  • DSpace

Language

Resource type

Active filters:

  • Metadata provider: DSpace
  • Keywords: MWE
Loading...
2 record(s) found

Search results

  • The Database of Lithuanian multiword expressions

    The Database of Lithuanian multiword expressions (MWEs) is freely accessible for online search at: https://resursai.pastovu.vdu.lt/paieska/paprastoji from 2019. It contains two-word and three-word MWEs extracted from the DELFI.lt corpus representing news texts on the various topics (https://klc.vdu.lt/pastovuSearch.html). First, 12,000 MWEs (mostly collocations, a few idioms) were included in the database. In 2022, the database was updated adding new collocations from the same corpus and filtering arbitrary collocations: out of appr. 19,000 collocations appr. 9000 are marked as arbitrary collocations, i.e., having lexical collocability restrictions. The database provides rich information about the usage of collocations: lemma, word forms, frequencies (in the DELFI.lt corpus), morphological information, syntactic relations, grammatical variants, text genres, and usage examples. Usage variation cases are also illustrated, for example, word order changes or insertions between collocation constituents.
  • Colloc -- A Tool for Automatic Identification of Multiword Expressions

    Colloc -- a tool for automatic identification of multiword expressions (MWE) is freely available for online use at http://resursai.mwe.lt/atpazintuvas. As material for training DELFI.lt corpus (http://tekstynas.mwe.lt/) was used. For identification combination of 2 trained models (RNN bi-LSTM and CRF) is used. Automatically identified MWE can be retrieved in 2 formats -- list of MWE or / and text with annotated MWE.