Description
MorphoDiTa (Morphological Dictionary and Tagger) is an open-source tool for morphological analysis of natural language texts. It performs morphological analysis, morphological generation, tagging, and tokenization, and is distributed as a standalone tool or a library, along with trained linguistic models. For Czech, MorphoDiTa achieves state-of-the-art results with a throughput of around 10-200K words per second. MorphoDiTa is free software under the LGPL license; the linguistic models are free for non-commercial use and are distributed under the CC BY-NC-SA license, although for some models the original data used to create the model may impose additional licensing conditions.
Tool type
Tool for creating own tools and resources
Tool task
tagging, analysis, tokenization
Key words
text processing, web-service
Research domain
Computational Linguistics, Linguistics, Morphology
Language
English, Czech
Country
Czech Republic
CLARIN centre
Charles University in Prague
Contact person
Milan Straka; Jana Straková
URL
http://lindat.mff.cuni.cz/services/morphodita
Similar to