OpenConvert
The OpenConvert tools convert to TEI or FOLiA from a number of input formats (ALTO, plain text, Word, HTML, ePub). The tools are available as a Java command line tool, a web service and a web application. They were created by IVDNT in the OpenConvert project. Furthermore, as a proof of concept, the website currently provides two annotation tools: a simple tokenizer for TEI files and a part-of-speech tagger for modern Dutch.
The service can be called as a REST web service that returns responses in XML, so it can be used as one step in a web service tool chain.
TEI, plain text, HTML input
ALTO XML input
ePub input
directory containing files of a valid input type
zip file (with extension .zip) containing files of a valid input type
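The zip input option above can be prepared with a few lines of Python. This is a minimal sketch, not part of OpenConvert itself: the function name `bundle_inputs` and the set of accepted file extensions are assumptions made for illustration, since the description lists input formats but not file extensions.

```python
import zipfile
from pathlib import Path

# Assumption: these extensions stand in for "a valid input type".
# The description names formats (TEI, ALTO, text, Word, HTML, ePub), not extensions.
VALID_EXTENSIONS = {".xml", ".txt", ".html", ".htm", ".doc", ".epub"}


def bundle_inputs(source_dir, zip_path):
    """Pack every file with an accepted extension under source_dir into zip_path.

    Returns the archive member names that were written, in sorted order.
    """
    zip_path = Path(zip_path)
    if zip_path.suffix.lower() != ".zip":
        # The description requires the archive to carry the .zip extension.
        raise ValueError("archive name must end in .zip")

    source_dir = Path(source_dir)
    added = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(source_dir.rglob("*")):
            if path.is_file() and path.suffix.lower() in VALID_EXTENSIONS:
                # Store paths relative to the source directory.
                name = path.relative_to(source_dir).as_posix()
                archive.write(path, arcname=name)
                added.append(name)
    return added
```

Calling `bundle_inputs("scans", "scans.zip")` would produce an archive that could then be uploaded as a single input file.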
Free for academic use. Not available to commercial parties.
CLARIN-based login required. The CLARIN federation accepts logins from many European institutions. Please see http://www.clarin.eu/content/service-provider-federation for more details.
input file name (File upload)
Format of input file
Format of output file
Tagger or tokenizer to use
input file mimetype is application/tei+xml
input file mimetype is text/html
input file mimetype is text/alto+xml
input file mimetype is application/msword
input file mimetype is application/epub+zip
input file mimetype is text/plain
output file mimetype is application/tei+xml
output file mimetype is text/folia+xml
Basic tagger-lemmatizer for modern Dutch
A TEI tokenizer
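The parameters and mimetypes listed above can be checked on the client side before a request is sent. The following is a minimal sketch under stated assumptions: the mimetype strings are taken from the lists above, but the short format labels (`"alto"`, `"word"`, and so on), the parameter keys (`input`, `format`, `outputFormat`, `tagger`) and the function name `build_parameters` are hypothetical and are not confirmed names of the OpenConvert service.

```python
# Mimetypes as listed in this description; the short labels are illustrative only.
INPUT_MIMETYPES = {
    "tei": "application/tei+xml",
    "html": "text/html",
    "alto": "text/alto+xml",
    "word": "application/msword",
    "epub": "application/epub+zip",
    "text": "text/plain",
}

OUTPUT_MIMETYPES = {
    "tei": "application/tei+xml",
    "folia": "text/folia+xml",
}


def build_parameters(input_name, input_format, output_format, tagger=None):
    """Assemble a parameter dictionary after validating both formats.

    The dictionary keys are hypothetical placeholders, not documented
    parameter names of the service.
    """
    if input_format not in INPUT_MIMETYPES:
        raise ValueError(f"unsupported input format: {input_format}")
    if output_format not in OUTPUT_MIMETYPES:
        raise ValueError(f"unsupported output format: {output_format}")

    params = {
        "input": input_name,
        "format": INPUT_MIMETYPES[input_format],
        "outputFormat": OUTPUT_MIMETYPES[output_format],
    }
    if tagger is not None:
        # Optional: selects the tagger or tokenizer, if one is wanted.
        params["tagger"] = tagger
    return params
```

For example, `build_parameters("page.xml", "alto", "folia")` yields the ALTO input mimetype paired with the FOLiA output mimetype, while an unlisted format such as `"pdf"` is rejected before any request is made.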