Result filters

Metadata provider

Language

Tool task

Organisation

  • CLARIN ERIC - Language Resource Switchboard

Keywords

  • Tokenisation

Active filters:

  • Organisation: CLARIN ERIC - Language Resource Switchboard
  • Keywords: Tokenisation
Loading...
2 record(s) found

Search results

  • WebLicht Tokenization TUR

    WebLicht Easy Chain for tokenization of Turkish texts. The pipeline makes use of WebLicht's TCF converter, and the tokenizer from the OpenNLP project. The 'newlineBounds' parameter treats newlines as a hard break (a sentence boundary). WebLicht's built-in viewer for annotations can be used to visualize the processing result.