Show simple item record

 
dc.contributor.author Marcińczuk, Michał
dc.date.accessioned 2019-02-07T15:28:19Z
dc.date.available 2019-02-07T15:28:19Z
dc.date.issued 2019-02-07
dc.identifier.uri http://hdl.handle.net/11321/626
dc.description The task consists in developing a tool for lemmatization of proper names and multi-word phrases. The generated lemmas should follow the KPWr guidelines [https://clarin-pl.eu/dspace/handle/11321/625]. The training dataset contains XX documents from the KPWr corpus and an index of phrases with lemmas.
dc.language.iso pol
dc.publisher Wroclaw University of Science and Technology
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label CC
dc.source.uri http://poleval.pl/tasks/task2
dc.subject named entities
dc.subject multi-word units
dc.subject lemmatization
dc.title PolEval 2019 Task 2: Lemmatization of proper names and multi-word phrases — training data
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding CLARIN-PL
contact.person Michał Marcińczuk marcinczuk@gmail.com Wroclaw University of Science and Technology
sponsor Ministry of Science and Higher Education (Poland) 6358/IA/119/2013 CLARIN-PL nationalFunds
size.info 24323 expressions
files.size 1698282
files.count 1


 Files in this item

This item is
Distributed under Creative Commons
and licensed under:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Attribution Required
Icon
Name
poleval2019_lemmatize_training.tar.gz
Size
1.62 MB
Format
application/gzip
Description
Unknown
 Download file

Show simple item record