**EXMARaLDA** is a system for working with oral corpora on a computer. It consists of a transcription and annotation tool
([Partitur-Editor](https://exmaralda.org/en/partitur-editor-2/ "Partitur Editor")),
a tool for managing corpora
([Corpus-Manager](https://exmaralda.org/en/corpus-manager-coma-2/ "Corpus-Manager (Coma)"))
and a query and analysis tool
([EXAKT](https://exmaralda.org/en/exakt-3/ "EXAKT")).
**EXMARaLDA's** features include, for instance:
- time-aligned transcription of digital audio or video
- flexible annotation for freely choosable categories,
- systematic documentation of a corpus through metadata
- flexible output of transcription data in various
layouts and formats (notation, document)
- computer-assisted querying of transcription,
annotation and metadata
- interoperable as it works XML based data formats that allow
for data exchange with other tools (like Praat, ELAN,
Transcriber etc.) and enable a flexible processing and
sustainable usage of the data.
**EXMARaLDA** is used by [researchers world
wide](https://exmaralda.org/en/projects/ "Projekte") in
different contexts in which spoken language is analysed,
these include:
- conversation and discourse analysis,
- study of language acquisition and multilingualism,
- phonetics and phonology,
- dialectology and sociolinguistics.
**EXMARaLDA** was developed in the project "Computer
assisted methods for the creation and analysis of
multilingual data" at the Collaborative Research Center
"Multilingualism" (Sonderforschungsbereich
"Mehrsprachigkeit" – SFB 538) at the University of
Hamburg. Since July 2011, the development of EXMARaLDA
is continued at the [Hamburg Centre for Language
Corpora](https://corpora.uni-hamburg.de/drupal/en),
since November 2011 in cooperation with the [Archive for
Spoken German](http://agd.ids-mannheim.de/index.shtml)
at the Institute for the German Language in Mannheim.
WebLicht Easy Chain for tokenization of Turkish texts. The pipeline makes use of WebLicht's TCF converter, and the tokenizer from the OpenNLP project. The 'newlineBounds' parameter treats newlines as a hard break (a sentence boundary). WebLicht's built-in viewer for annotations can be used to visualize the processing result.
WebLicht Easy Chain for POS Tagging and Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.
WebLicht Easy Chain for POS Tagging and Lemmatization (English). The pipeline makes use of WebLicht's TCF converter, the Stanford tokenizer, the Jitar POS Tagger, and the lemmatizer service from MorphAdorner. WebLicht's Tundra can be used to visualize the result.
WebLicht Easy Chain for Dependency Parsing (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, the POS Tagger from the OpenNLP projet, and the MaltParser, a system for data-driven dependency parsing. WebLicht's Tundra can be used to visualize the result.
WebLicht Easy Chain for Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.
WebLicht Easy Chain for POS Tagging and Lemmatization (Italian). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the POS Tagger from the OpenNLP project. The model for Italian is trained on a relatively small training corpus (MIDT) and should therefore be considered experimental. WebLicht's Tundra can be used to visualize the result.