Result filters

Metadata provider

Tool task

  • Tokenisation

Field of study

Organisation

  • Humanities Cluster

Active filters:

  • Tool task: Tokenisation
  • Organisation: Humanities Cluster
1 record(s) found

Search results

  • ucto

    Ucto tokenizes text files: it separates words from punctuation and splits text into sentences. This is one of the first tasks in almost any Natural Language Processing application. Ucto also offers several other basic preprocessing steps, such as changing case, all of which you can use to prepare your text for further processing such as indexing, part-of-speech tagging, or machine translation.
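
To make the description above concrete, here is a minimal Python sketch of what tokenisation involves: separating words from punctuation, splitting sentences, and optionally changing case. It does not use Ucto and does not reproduce Ucto's rules. The names `TOKEN_RE`, `SENTENCE_END`, and `tokenize` are hypothetical and chosen only for illustration. The regular expression is a deliberate simplification that would, for example, mis-split abbreviations such as "e.g.".

```python
import re

# Illustrative only: a simple regex, not Ucto's language-specific rule sets.
# A token is either a word (optionally with internal apostrophes or hyphens)
# or a single non-space, non-word character such as punctuation.
TOKEN_RE = re.compile(r"\w+(?:['’-]\w+)*|[^\w\s]")

# Assumption: these three marks always end a sentence (real tokenisers are smarter).
SENTENCE_END = {".", "!", "?"}


def tokenize(text, lowercase=False):
    """Return a list of sentences, each a list of word and punctuation tokens."""
    sentences, current = [], []
    for match in TOKEN_RE.finditer(text):
        token = match.group()
        current.append(token.lower() if lowercase else token)
        if token in SENTENCE_END:
            # Close the current sentence at sentence-final punctuation.
            sentences.append(current)
            current = []
    if current:
        # Keep trailing tokens that were not followed by end punctuation.
        sentences.append(current)
    return sentences


if __name__ == "__main__":
    # Print one sentence per line, tokens separated by spaces.
    for sentence in tokenize("Hello, world! Ucto splits sentences.", lowercase=True):
        print(" ".join(sentence))
```

Output in this shape (one sentence per line, tokens separated by spaces) is a common input format for later steps such as indexing or part-of-speech tagging.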