Result filters

Metadata provider

Tool task

Field of study

Keywords

  • natural language processing
  • tokenizer

Active filters:

  • Keywords: natural language processing
  • Keywords: tokenizer
Loading...
1 record(s) found

Search results

  • ucto

    Ucto tokenizes text files: it separates words from punctuation, and splits sentences. This is one of the first tasks for almost any Natural Language Processing application. Ucto offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.