Result filters

Metadata provider

  • CLARIAH Tools

Tool task

  • Tokenisation

Field of study

Active filters:

  • Tool task: Tokenisation
  • Metadata provider: CLARIAH Tools
Loading...
1 record(s) found

Search results

  • ucto

    Ucto tokenizes text files: it separates words from punctuation, and splits sentences. This is one of the first tasks for almost any Natural Language Processing application. Ucto offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.