Result filters

Metadata provider

Language

Tool task

  • Tokenisation

Organisation

  • CLARIN ERIC - Language Resource Switchboard

Keywords

Active filters:

  • Tool task: Tokenisation
  • Organisation: CLARIN ERIC - Language Resource Switchboard
Loading...
2 record(s) found

Search results

  • WebLicht Tokenization TUR

    WebLicht Easy Chain for tokenization of Turkish texts. The pipeline makes use of WebLicht's TCF converter, and the tokenizer from the OpenNLP project. The 'newlineBounds' parameter treats newlines as a hard break (a sentence boundary). WebLicht's built-in viewer for annotations can be used to visualize the processing result.