CLARIN Tool Portal

698 record(s) found

Search results

GaLAHaD

2 resources

GaLAHaD (Generating Linguistic Annotations for Historical Dutch) allows linguists to compare taggers, tag their own corpora, evaluate the results and export their tagged documents.

Use "GaLAHaD"
ineo-collaboration

1 resources

how to get metadata into INEO
FoLiA-Linguistic-Annotation-Tool

2 resources

FLAT is a web-based linguistic annotation environment based around the FoLiA format (https://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.

Use "FoLiA-Linguistic-Annotation-Tool"
ucto

1 resources

Ucto tokenizes text files: it separates words from punctuation, and splits sentences. This is one of the first tasks for almost any Natural Language Processing application. Ucto offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.

Use "ucto"
stamd

2 resources

Webservice for working with stand-off annotations on text (STAM)

Use "stamd"
auchann

2 resources

The AuChAnn (Automatic CHAT Annotation) package can generate CHAT annotations based on a transcript-correction pairs of utterances.

Use "auchann"
I-Analyzer

2 resources

I-analyzer is a tool for exploring corpora (large collections of texts). You can use I-analyzer to find relevant documents, or to make visualisations to understand broader trends in the corpus. The interface is designed to be accessible for users of all skill levels. I-analyzer is primarily intended for academic research and higher education. We focus on data that is relevant for the humanities, but we are open to datasets that are relevant for other fields.

Use "I-Analyzer"
Automatic Speech Recognition for Dutch

1 resources

This is a web-based automatic speech recogniser for Dutch, capable of transcribing dutch speech recordings using multiple models.

Use "Automatic Speech Recognition for Dutch"
A Blacklab Server CLARIN FCS 2.0 endpoint

1 resources

CLARIAH Federated content search corpora, developed by the Dutch Language Institute (INT), is a service to enable searching in multiple Dutch corpora at the same time. This application implements the CLARIN FCS 2.0 specification on top of Dutch language corpora. This repository hosts the source code.
DANE

1 resources

Utils for working with the Distributed Annotation and Enrichment system

Use "DANE"

Result filters

Metadata provider

Language

Resource type

Type of tool

Tool task

Field of study

Availability

Organisation

Project

Keywords

Search results

GaLAHaD

ineo-collaboration

FoLiA-Linguistic-Annotation-Tool

ucto

stamd

auchann

I-Analyzer

Automatic Speech Recognition for Dutch

A Blacklab Server CLARIN FCS 2.0 endpoint

DANE