Welcome to the 10th Global WordNet Conference – GWC 2019!

This is an opportunity for researchers and developers to present and discuss their latest results on the development, enrichment and exploitation of wordnets for various languages around the world. This conference is hosted by the Language Technology Research Group G4.19 at Wroclaw University of Science and Technology, Poland and the Global WordNet Association.
23 – 27 July, 2019
Faculty of Informatics and Management at the Wroclaw University of Science and Technology

More informationhttps://gwc2019.clarin-pl.eu

Conference Topics
1. Lexical semantics and meaning representation
* Critical analysis and applications of lexical and semantic relations
* Proposed new relations
* Definitions, semantic components, co-occurrence and frequency statistics
* Word and Sense Embeddings
* Necessity and completeness issues
* Ontology and wordnet
* Other lexicographical and lexicological questions pertaining to wordnet-style meaning representation
* Wordnets and Linked Open Data (LOD)

2. Architecture of lexical databases
* Language independent and language dependent components
* Integration of multi-wordnets in research infrastructures (like CLARIN) and LT networks (like META-NET)

3. Tools and Methods for wordnet development
* User and Data entry interfaces
* Methods for constructing, extending and enriching wordnets

4. Applications of wordnet
* Word sense disambiguation
* Machine translation
* Information extraction and retrieval
* Document structuring and categorisation
* Automatic hyperlinking
* Language pedagogy
* Psycholinguistic applications

5. Standardization, distribution and availability of wordnets and wordnet tools

Created by CLARIN-PL LTC, PolLinguaTech Knowledge Centre obtained the certificate of CLARIN K Centre

PolLinguaTech was founded to provide researchers with the crucial knowledge about the applications of natural language processing (NLP) with a special emphasis put on Polish language processing. The Centre provides indispensible documentation  (technical documents, user manuals), organises trainings and offers help and advice on each stage of research.

PolLinguaTech functions at CLARIN-PL Language Technology Centre – one of the leading language technology research and development centers focusing on Polish language. The offer of PolLinguaTech is identical with the current offer of LTC. Gaining the Clarin K Centre Certificate will allow to better coordinate activities dedicated to the promotion of knowledge about language technologies and the institutionalised form of user assistance.

We encourage you to take advantage of the offer of CTJ and PolLinguaTech and to contact our employees.


PolLinguaTech – Knowledge webpage: LINK

Clarin K Centre Certificate: LINK

The 6th CLARIN Annual Conference – 2nd CALL FOR PAPERS (Budapest, 18th-20th September 2017)


CLARIN ERIC is happy to announce the 6th CLARIN Annual Conference and invites submission of papers.


The 6th CLARIN Annual Conference will be held in Budapest, Hungary.


  • 1st February, 2017 First call published and submission system open
  • 8th May, 2017 Submission deadline
  • 24th June, 2017 Notification of acceptance
  • 1st September, 2017 Final version of extended abstracts due
  • 18th–20th September 2017 CLARIN Annual Conference


The CLARIN Annual Conference is organised for the Humanities and Social Sciences community in order to exchange ideas and experiences on the CLARIN infrastructure. This includes its design, construction and operation, the data and services that it contains or should contain, its actual use by researchers, its relation to other infrastructures and projects, and the CLARIN Knowledge Sharing Infrastructure.


Operation and use of the CLARIN infrastructure, e.g.

  • Use of the CLARIN infrastructure in humanities and social sciences research, including needs for updated and new functionality
  • Usability studies and evaluations of CLARIN services
  • Analysis of the CLARIN infrastructure usage, identification of user audience and impact studies
  • Showcases, demonstrators and research in humanities and social sciences that is relevant to CLARIN
  • Models for the sustainability of the infrastructure, including issues in curation, migration, financing and cooperation
  • Legal and ethical issues in operating the infrastructure

Design and construction of the CLARIN infrastructure, e.g.

  • Metadata and concept registries, cataloguing and browsing
  • Persistent identifiers
  • Access, including Single Sign On Authentication and Authorisation
  • Search, including Federated Content Search
  • Web applications, web services, workflows and use of the infrastructure
  • Standards and solutions for interoperability of language resources, tools and services

CLARIN Knowledge Infrastructure and Dissemination, e.g.

  • User assistance (helpdesks, user manuals, FAQs)
  • CLARIN portals and outreach to users
  • Videos, screen casts, recorded lectures
  • Researcher training activities
  • Knowledge infrastructure centres

CLARIN in relation with other infrastructures and projects, e.g.

  • Relations with other SSH research infrastructures such as DARIAH, CESSDA, etc.
  • Relations with meta-infrastructure projects such as EUDAT and RDA
  • Relations with national and regional initiatives

THEMATIC SESSION: Multilingual Processing for Humanities and Social Sciences

The Humanities and Social Sciences (H&SS) have formulated research questions pertaining to different languages. However, the number of research tasks in H&SS in which Language Technology has been applied to cross-language barriers and analyse the same phenomena on material expressed in different languages is relatively small. The situation is better in the case genuine linguistic research, but in multilingual research applications in H&SS are mostly based on a kind of ‘bag of words’ model, and very rarely utilise more advanced multilingual Language Technology methods.

The general aim of this thematic session is to present examples of multilingual approaches in H&SS research relevant to CLARIN, and to discuss infrastructural solutions to the problem of multilingual interoperability of the Language Technology that are necessary for more advanced research in H&SS. We expect to organise presentations and discussions during the session on the following aspects:

  1. Examples of applications of Language Technology to multilingual processing for the needs of research in H&SS.
  2. Research tasks and ongoing projects in H&SS on the basis of multilingual material and application of Language Technology.
  3. Interoperability of language resources and tools for the needs of multilingual applications in H&SS: models for linking, standards and formats, mapping and linking algorithms, complex processing methods, architectures and platforms.

We invite submissions describing CLARIN relevant work addressing these aspects. Submissions (for oral presentations, posters, or demos) intended for the thematic session should be marked as such, and will be evaluated with respect to their appropriateness for the theme, in addition to the general acceptance criteria listed below.


The scientific program both of the general sessions and the thematic session will include oral presentations, posters, and demos. There is no difference in quality between oral and poster presentations. Only the appropriateness of the type of communication (more or less interactive) to the content of the paper will be considered.


Submission of proposals for oral presentations, poster presentations and/or demos must be extended abstracts (length: up to four A4 pages including references) in PDF format, in accordance with the template provided on the website.

It is not required that the authors are or have been directly involved in national or international CLARIN projects, but their work must be clearly relevant to the CLARIN activities, resources, tools or services.

Extended abstracts must be submitted through the EasyChair submission system (link) and will be reviewed by the program committee.

All proposals will be reviewed on the basis of both individual criteria and global criteria. The latter include thematic, linguistic and geographical spread. Individual acceptance criteria are the following:

  • Appropriateness: the contribution must pertain to the CLARIN infrastructure or be relevant for it (e.g. use CLARIN, contribute to the CLARIN design, construction, operation, exploitation, illustrate possible applications etc.). In addition, submissions to the thematic session will be selected on the basis of their appropriateness to the theme.
  • Soundness and correctness: the content must be technically and factually correct and methods must be scientifically sound, according to best practice, and preferably evaluated.
  • Meaningful comparison: the abstract must indicate that the author is aware of alternative approaches, if any, and highlight relevant differences.
  • Substance: concrete work and experiences will be preferred over ideas and plans.
  • Impact: contributions with a higher impact on the research community and society at large will be preferred over papers with lower impact.
  • Clarity: the extended abstract must be informative, clear and understandable for the CLARIN audience.
  • Timeliness and novelty: the work must convey relevant new knowledge to the audience at this event.


One author per each accepted paper will be granted free participation in the conference and meals. Selected authors will be granted reimbursement of travel (up to 220 Euros) and accommodation costs.


If the submission is accepted, it will be published (possibly in revised form) in the conference Book of Abstracts. After the conference, the author(s) will be invited to submit a full paper (max. 12 pages) to be reviewed according to the same criteria as the abstracts. Accepted full papers will be digitally published in a conference proceedings volume at Linköping University Electronic Press within about 6 months after the conference.


The program committee for the conference consists of the following members:

  • Jan Theo Bakker, Dutch Language Union, The Netherlands/Flanders
  • Lars Borin, University of Gothenburg, Sweden
  • António Branco, University of Lisbon, Portugal
  • Koenraad De Smedt, University of Bergen, Norway
  • Tomaž Erjavec, Jožef Stefan Institute, Slovenia
  • Eva Hajičová, Charles University Prague, Czech Republic
  • Erhard Hinrichs, University of Tübingen, Germany
  • Krister Lindén, University of Helsinki, Finland
  • Bente Maegaard, University of Copenhagen, Denmark
  • Monica Monachini, Institute for Computational Linguistics «A. Zampolli», Italy
  • Karlheinz Mörth, Austrian Academy of Sciences, Austria
  • Jan Odijk, Utrecht University, the Netherlands
  • Maciej Piasecki, Wrocław University of Science and Technology, Poland (chair)
  • Stelios Piperidis, ILSP, Athena Research Center, Greece
  • Kiril Simov, IICT, Bulgarian Academy of Sciences, Bulgaria
  • Inguna Skadiņa, University of Latvia, Latvia
  • Jurgita Vaičenonienė, Vytautas Magnus University, Lithuania
  • Tamás Váradi, Research Institute for Linguistics, Hungarian Academy of Sciences
  • Kadri Vider, University of Tartu, Estonia
  • Martin Wynne, University of Oxford, UK


Challenges for Wordnets. A workshop co-located with LDK 2017

Challenges for Wordnets

A workshop co-located with LDK 2017:
The first conference on Language, Data and Knowledge

18 June, Galway, Ireland



Wordnets are increasingly widely used to model word meaning in natural language processing tasks. However, there are still many challenges in accurately describing word meanings and making these descriptions useful for both human and machine users. This workshop aims to identify, discuss and start to solve existing challenges.


Much research on wordnets focuses either on their construction or their use in some application. Few papers bridge the gap by discussing how different wordnet models and construction methods affect their effectiveness in use, or how different applications require different parts of language to be modeled.

For this reason, we are experimenting with a new workshop series challenges for wordnet, to give wordnet users and developers a chance to share experiences both good and bad. It will be co-located with the First Conference on Language, Data and Knowledge (LDK 2017) in Galway, Ireland. The workshop will start with some short presentations and then finish with an extended discussion based on the challenges presented. We welcome position statements on the following (or related) topics:

  • Issues for Modeling Languages
    – missing parts-of-speech in wordnet (e.g. prepositions, conjunctions)
    – incomplete representation (e.g. semantics of adverbs)
    – links to examples/corpora
    – what is a wordnet
    – basic building blocks of a wordnet
    – language/dialect differences
  • Issues of Compatibility
    – integration with other resources
    – wordnets vs ontologies
    – licenses
  • Application Issues
    – consistent coverage
    – named entities
    – scaling up
    – wordnet services: WSD, similarity…
    – maintenance
    – versionings and updating
  • Evaluation
    – quality measures
    – experts vs crowds
    – translation vs monolingual construction


Please submit papers of between 6-10 pages, excluding references, formatted using the Springer Lecture Notes in Artificial Intelligence formatting guidelines. Submissions should be anonymous. Submissions will be reviewed by at least 3 reviewers and will be made available on online prior to the workshop.

Authors of good submisions will be invited to submit extended versions for a special issue of the Cognitive Studies | Études cognitives journal. The extended versions will be carefully peer reviewed, but the scope of this special issue will be set in advance.

Papers should be submitted via EasyChair

The workshop is supported by the CLARIN-PL research infrastructure.

LDK 2017 conference received support from the Global WordNet Association Board


prof. Eduard Hovy, Carnegie Mellon University, Language Technologies Institute


Paper submission: 10 April
Notification of Acceptance: 30 April
Workshop Date: 18 June 14:00-18:00

Maciej Piasecki – Wroclaw University of Technology
Francis Bond – Nanyang Technological University
John P. McCrae – Insight Centre for Data Analytics, National University of Ireland, Galway
Jan Wieczorek – Wroclaw University of Technology


Eneko Agirre – University of the Basque Country
Sonja Bosch – University of South Africa, Pretoria
Christiane D. Fellbaum – Princeton University
Darja Fišer – University of Ljubljana
Antoni Oliver Gonzalez – Open University of Catalonia
Shu-Kai Hsieh – National Taiwan University
John P. McCrae – Insight Centre for Data Analytics, National University of Ireland, Galway
Verginica Mititelu – Romanian Academy
Monica Monachini – National Research Council of Italy
Adam Pease – Articulate Software
Bolette Sandford Pedersen – University of Copenhagen
Maciej Piasecki – Wroclaw University of Technology (chair)
German Rigau – Polytechnic University of Catalonia
Ewa Rudnicka – Wroclaw University of Technology
Shikhar Kr. Sarma – Gauhati University
Stanisław Szpakowicz – Emeritus Professor, University of Ottawa
Veronika Vincze – University of Szeged
Piek Th. J. M. Vossen – VU University Amsterdam



Address any questions to Maciej Piasecki and Francis Bond at <clarin-pl@pwr.edu.pl>.

We have made available a vector word model learned with neural networks

The published model describes words of natural language by means of multidimensional vectors. The vectors refer to words’ hidden properties which are motivated by the layers of the neural network. The model may be used for determining the degree of word similarity to which specific vectors relate and thus for the generation of word frequency lists.

The model is available at our repository on the D-Space platform: https://clarin-pl.eu/dspace/handle/11321/327

The Programme Board of Clarin-PL

The Programme Board has been appointed in accordance with the recommendation of the Ministry of Science and Higher Education in Poland and has an advisory character. Its major task is to indicate new research directions within CLARIN-PL and the evaluation of the current stage of the infrastructure from the perspective of its applications in Humanities and Social Sciences.

Members of the Board:

prof. dr hab. Marek Jeziński
Uniwersytet Mikołaja Kopernika

prof. dr hab. Kazimierz Wolny-Zmorzyński
Uniwersytet Warszawski

prof. dr hab. Jerzy Bartmiński
projekt EUROJOS

dr hab. Marek Troszyński
Instytut Badań Literackich PAN

prof. dr hab. Włodzimierz Gruszczyński
Uniwersytet SWPS

dr hab. Joanna Rączaszek-Leonardi
Instytut Psychologii PAN

dr hab. Kazimierz Krzysztofek, prof. Uniwersytetu SWPS
Uniwersytet SWPS

dr hab. Katarzyna Materska, prof. UKSW
Uniwersytet Kardynała Stefana Wyszyńskiego

prof. dr hab. Wojciech Burszta
Uniwersytet SWPS; Instytut slawistyki PAN

dr hab. Tomasz Wicherkiewicz, prof. UAM
Uniwersytet im. Adama Mickiewicza

dr hab. Katarzyna Klessa
Uniwersytet im. Adama Mickiewicza

CLARIN Workshop – Towards Interoperability of Lexico-Semantic Resources

CLARIN Workshop (type I) – Towards Interoperability of Lexico-Semantic Resources

Call for Participation in CLARIN Workshop (type I) — Towards Interoperability of Lexico-Semantic Resources
The goal of the workshop is to improve the interoperability, usability and ease of access of CLARIN lexico-semantic resources. To achieve this goal, we will work on the design of CLARIN open virtual lexical platform facilitating better access to lexical resources and research web applications by Humanities & Social Sciences (H&SS) users. The workshop will have a discussion and group-work based character. Most of the time is reserved for chaired discussions in small topic-related subgroups, concluded with reports presented to the general forum by their chairs during the summing-up session.
Researchers interested in the participation in the workshop should send a letter to the following address: clarin-pl@pwr.edu.pl — including a description of interest:
1. Short description of the background of the applicant, especially related to the workshop goal and topics. If an applicant comes from an institution which is a member of CLARIN, this should be clearly expressed.
2. Description of the motivation for the participation in the workshop.
The description should be approximately 500 words.
The deadline for submission is December 5th 2016. Decisions concerning acceptance will be sent by December 8th 2016.

The workshop is sponsored by CLARIN. Accommodation during the workshop (including one night in a university hostel) is covered. Partial reimbursement of travel costs up to 200 Euros will be offered to all participants.
The maximum capacity is limited to 25 persons who will be selected by PC on the basis of the delivered descriptions of interest. Priority will be given to participants from the CLARIN members (www.clarin.eu).

Workshop Short Description

Name workshopCLARIN Workshop (type I) -- Towards Interoperability of Lexico-Semantic Resources
TypeI: Thematic workshop (no follow-up implementation project envisaged)
Programming CommitteeMaciej Piasecki, Wrocław University of Science and Technology,
Erhard Hinrichs, Eberhard-Karls-Universität Tübingen
Kadri Vider, University of Tartu, CELR
LocationCELR, University of Tartu, Tartu, Estonia
Dates31st January (9 a.m. opening) - 1st February 2017 (1 p.m. closing)
SummaryThe goal of the workshop is to improve the interoperability, usability and ease of access of CLARIN lexico-semantic resources in order to build a CLARIN open virtual lexical platform facilitating better access to lexical resources by Humanities & Social Sciences (H&SS) users and research web applications.
The topics will include:
the choice of a few common formats for CLARIN lexico-semantic resources (LSR);
the design of a system of web services for accessing CLARIN LSRs,
the common virtual platform for linking, displaying and browsing LSRs;
gathering requirements from H&SS related to LRSs and the platform.
The outcome will be an integrated report with an action plan that will be offered to the wider CLARIN community for further discussion, testing and refinement. The workshop is planned for 3 invited experts and up to 25 participants in total.

Full Description
Workshop goal
The main goal of the workshop is to initiate the works on the improvement of interoperability, usability and ease of access of CLARIN LRSs for the needs of a their better visibility for H&SS users and their enhanced utilization in research applications. The key idea is to coordinate development of a system of web services for accessing LRSs and a common virtual lexical platform built on the top of them. The platform is intended to be an open generic solution that will allow for effective linking, displaying and browsing of the rich variety of data included in CLARIN LRSs. One of the functions of the platform will be a kind of federated search for LRSs. The platform will be an open solution with potentially many installations and many web application based on them.

The main topics will include:

1. the choice of a few common formats for CLARIN lexico-semantic resources (LSR);
2. the design of a system of web services for accessing CLARIN LSRs,
3. the common virtual platform for linking, displaying and browsing LSRs;
4. gathering requirements from H&SS related to LRSs and the platform.

The outcome of the workshop will be an integrated report with an action plan for each of the key topics discussed in the working groups. It will be next offered to the wider CLARIN community for further discussion, testing and refinement. The first, limited set of standards and requirements will be selected for further work. As a follow up we will also prepare a joint proposal of CLARIN type II workshop devoted to the construction of a first set of web services for accessing selected LRSs and a first prototype of the platform. The platform development is planned to be a collaborative work, but CLARIN-PL is prepared for serve as the main developer due to a similar goal in its action plan.

Link to CLARIN strategic priorities
The topics of the workshop are closely related to several CLARIN strategic pillars (cf. CE-2015-0657), namely:
integration of data – the common format and strategy of integration will form the basis for the interoperable multifaceted resources and will thus enable “the development of tools that allow for mapping between different standards”
integration of services – the common platform and system of web services will simplify providing multilingual web services for semantic annotation, exploration, exploitation, enhancement, analysis, manipulation and visualization of language data; support for mono- and multilingual processing workflows,
ease of access – the common platform and system of web services will simplify access to different types of lexico-semantic resources. They will will enhance multilingual browsing and visualization services.
crossing borders – the common format, platform and strategy of integration will facilitate crossing borders between languages, countries, and infrastructures; “international collaboration both at the RI and at the thematic level”.

Profile of proposing team
Maciej Piasecki is an Associated Professor at Wrocław University of Science and Technology and a coordinator of the G4.19 Computational Linguistics and Language Technology Research Group. He holds PhD in Computer Science for work on Natural Language Processing. Maciej has been the leader of the Polish wordnet project since its beginning in 2005 till now. Maciej’s main research topics are: extraction of the lexico-semantic knowledge from corpora, semi-automatic wordnet expansion, Distributional Semantics and relational lexical semantics. Maciej has been also working on morpho-syntactic processing of Polish (co-author of the first Polish tagger), Information Extraction, formal semantics and Machine Translation.
Erhard Hinrichs is a Full Professor (Ordinarius) for General and Computational Linguistics, Eberhard-Karls-Universität Tübingen, Seminar für Sprachwissenschaft, Tübingen, Germany, 1991 – present. He obtained Ph.D., Linguistics from The Ohio State University, Columbus, Ohio, USA. August 1985. He has been the leader of the GermaNet project since it beginning till now. Erhard Hinrichs is CLARIN National Coordinator of Germany.
Kadri Vider is a researcher of Language Technology at Institute of Computer Sciences in University of Tartu. She managed studies of Estonian Wordnet till 2007 when she started to work in Estonian Ministry of Education and Research arranging Estonian membership in CLARIN infrastructure also. Since coming back into University of Tartu in 2011 she manages Center of Estonian Language Resources – a research infrastructure consortium executing obligations of membership of Estonia in CLARIN ERIC. Kadri Vider is CLARIN National Coordinator of Estonia.

Indication of profile(s) and number of envisaged participants
We would like to bring together key wordnet developers from CLARIN, researches with experience in wordnet applications, as well as selected experts from the area of linking wordnets and lexical semantics technology from outside CLARIN. The external experts are meant to provide reference points facilitating the discussions during the workshop.

Participants from CLARIN will be selected on the basis of descriptions of interest collected in an open call. However, we will work on obtaining good coverage with respect to CLARIN wordnets and the workshop topics. We aim at 2 invited plenary talks. The rest of places will be open to moderated discussions and collaboration in working groups. We expect up to 25 participants for 2 days.

Workshop date(s), location and timetable
The workshop will be hosted by the Institute of Computer Science, University of Tartu, Estonia and will last for 2 days starting at 9 a.m. on 31st January and finishing at 1 p.m. on 1st February.

Provisional agenda
The core of the workshop will be chaired parallel working group meetings, each devoted to one of the key topics. They will start with short presentations of (selected) speakers, addressing questions included in the CfP. Presentations will be followed by extensive discussions. Their results will be consolidated into reports and action plans. The working group meetings will be preceded by 1-2 invited plenary talks focused on the experience related to the workshop topics. The workshop will close with the plenary session during which chairs of all the working groups will present their reports and action plans that will be subject to a general discussion. It will be next offered to the wider CLARIN community for further discussion, testing and refinement.

Program Committee:

Maciej Piasecki
Erhard Hinrichs
Kadri Vider

Invited Experts (confirmed so far):
Francis Bond – author of the Open Multilingual WordNet
Christiane Fellbaum – Princeton WordNet

Language Technology Centre is now a CLARIN B Centre

We are happy to announce that CLARIN-PL Language Technology Centre is now an officially certified CLARIN B centre. This means we were given a positive opinion in CLARIN and were granted a Data Seal of Approval.

Type B centres offer free access to language tools and repositories to registered users both in their home country and abroad. The services are listed in the repository, where users may also save and share (according to a license chosen) their own, defined language data. All services are integrated in the Centre, which enables multifaceted processing of texts. The Centre is concerned with accepted standards and laws regarding personal data, intellectual property, license, and ethical aspects.

The certificate recognizes our sustained effort to develop CLARIN in Poland. We would like to thank all our Co-operators who contributed to the creation of CLARIN-PL, as well as our registered users who prove our services are useful.

Polish WordNet 2.3 now available!

We are happy to inform that on July 9, 2015 our team has launched plWordNet 2.3 – the latest version of a large semantic dictionary of Polish. plWN contains 170 000 words, 240 000 unique senses interlinked by 600 000 relations and is now the largest wordnet in the world.

plWN 2.3 has been extended extended to adjectives, contains more multi-word units, definitions, and examples. What’s more, this is the first time we’ve published Emo WordNet – 30 000 lexical units described in terms of their emotional connotations (at the moment available only in the download version).

Together with plWordNet 2.3 we announce publication of enWordNet 0.1 – an extended version of Princeton WordNet 3.1 containing 7 000 new lemmas, 8 000 lexical units, 5 500 synsets, and WNLoom-Viewer – a browsing application for plWordNet.

plWN is a free and open-source app suitable for all uses, including commercial ones.

plWN 2.3 – Download

plWN – View online

plWN – Free mobile app

WNLoom-Viewer – Download

WNLoom-Viewer – User manual