CLARIN Tool Portal

DARIAH DKPro-Wrapper: POS-Tagging und Lemmatization DE

1 resources

The DARIAH DKPro Wrapper is a wrapper for DKPro Core, a tool for linguistic annotation.

EXMARaLDA

1 resources

**EXMARaLDA** is a system for working with oral corpora on a computer. It consists of a transcription and annotation tool ([Partitur-Editor](https://exmaralda.org/en/partitur-editor-2/ "Partitur Editor")), a tool for managing corpora ([Corpus-Manager](https://exmaralda.org/en/corpus-manager-coma-2/ "Corpus-Manager (Coma)")) and a query and analysis tool ([EXAKT](https://exmaralda.org/en/exakt-3/ "EXAKT")). **EXMARaLDA's** features include, for instance: - time-aligned transcription of digital audio or video - flexible annotation for freely choosable categories, - systematic documentation of a corpus through metadata - flexible output of transcription data in various layouts and formats (notation, document) - computer-assisted querying of transcription, annotation and metadata - interoperable as it works XML based data formats that allow for data exchange with other tools (like Praat, ELAN, Transcriber etc.) and enable a flexible processing and sustainable usage of the data. **EXMARaLDA** is used by [researchers world wide](https://exmaralda.org/en/projects/ "Projekte") in different contexts in which spoken language is analysed, these include: - conversation and discourse analysis, - study of language acquisition and multilingualism, - phonetics and phonology, - dialectology and sociolinguistics. **EXMARaLDA** was developed in the project "Computer assisted methods for the creation and analysis of multilingual data" at the Collaborative Research Center "Multilingualism" (Sonderforschungsbereich "Mehrsprachigkeit" – SFB 538) at the University of Hamburg. Since July 2011, the development of EXMARaLDA is continued at the [Hamburg Centre for Language Corpora](https://corpora.uni-hamburg.de/drupal/en), since November 2011 in cooperation with the [Archive for Spoken German](http://agd.ids-mannheim.de/index.shtml) at the Institute for the German Language in Mannheim.

WebLicht Tokenization TUR

1 resources

WebLicht Easy Chain for tokenization of Turkish texts. The pipeline makes use of WebLicht's TCF converter, and the tokenizer from the OpenNLP project. The 'newlineBounds' parameter treats newlines as a hard break (a sentence boundary). WebLicht's built-in viewer for annotations can be used to visualize the processing result.

WebLicht POSTags Lemmas DE

1 resources

WebLicht Easy Chain for POS Tagging and Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.

WebLicht POSTags Lemmas EN

1 resources

WebLicht Easy Chain for POS Tagging and Lemmatization (English). The pipeline makes use of WebLicht's TCF converter, the Stanford tokenizer, the Jitar POS Tagger, and the lemmatizer service from MorphAdorner. WebLicht's Tundra can be used to visualize the result.

iDAI.vocab PT

1 resources

The German Archaeological Institute is a scientific institution of the confederation in the business area of the Department of Foreign Affairs.

WebLicht Dep Parsing DE

1 resources

WebLicht Easy Chain for Dependency Parsing (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, the POS Tagger from the OpenNLP projet, and the MaltParser, a system for data-driven dependency parsing. WebLicht's Tundra can be used to visualize the result.

WebLicht Lemmas DE

1 resources

WebLicht Easy Chain for Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.

Opener Tokenizer

1 resources

Tokenizer for Dutch, English, German, French, Spanish and Italian. Consumes Plain text and produces TCF.

WebLicht POSTags Lemmas IT

1 resources

WebLicht Easy Chain for POS Tagging and Lemmatization (Italian). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the POS Tagger from the OpenNLP project. The model for Italian is trained on a relatively small training corpus (MIDT) and should therefore be considered experimental. WebLicht's Tundra can be used to visualize the result.

Result filters

Metadata provider

Language

Resource type

Tool task

Organisation

Keywords

Active filters:

Search results

DARIAH DKPro-Wrapper: POS-Tagging und Lemmatization DE

EXMARaLDA

WebLicht Tokenization TUR

WebLicht POSTags Lemmas DE

WebLicht POSTags Lemmas EN

iDAI.vocab PT

WebLicht Dep Parsing DE

WebLicht Lemmas DE

Opener Tokenizer

WebLicht POSTags Lemmas IT