Challenges for Wordnets. A workshop co-located with LDK 2017

Challenges for Wordnets

A workshop co-located with LDK 2017:
The first conference on Language, Data and Knowledge

18 June, Galway, Ireland

Submit now

Wordnets

Wordnets are increasingly widely used to model word meaning in natural language processing tasks. However, there are still many challenges in accurately describing word meanings and making these descriptions useful for both human and machine users. This workshop aims to identify, discuss and start to solve existing challenges.

CALL FOR PAPERS

Much research on wordnets focuses either on their construction or their use in some application. Few papers bridge the gap by discussing how different wordnet models and construction methods affect their effectiveness in use, or how different applications require different parts of language to be modeled.

For this reason, we are experimenting with a new workshop series challenges for wordnet, to give wordnet users and developers a chance to share experiences both good and bad. It will be co-located with the First Conference on Language, Data and Knowledge (LDK 2017) in Galway, Ireland. The workshop will start with some short presentations and then finish with an extended discussion based on the challenges presented. We welcome position statements on the following (or related) topics:

  • Issues for Modeling Languages
    – missing parts-of-speech in wordnet (e.g. prepositions, conjunctions)
    – incomplete representation (e.g. semantics of adverbs)
    – links to examples/corpora
    – what is a wordnet
    – basic building blocks of a wordnet
    – language/dialect differences
  • Issues of Compatibility
    – integration with other resources
    – wordnets vs ontologies
    – licenses
  • Application Issues
    – consistent coverage
    – named entities
    – scaling up
    – wordnet services: WSD, similarity…
    – maintenance
    – versionings and updating
  • Evaluation
    – quality measures
    – experts vs crowds
    – translation vs monolingual construction

 

Please submit papers of between 4-10 pages, excluding references, formatted using the Springer Lecture Notes in Artificial Intelligence formatting guidelines. Submissions should be anonymous. Submissions will be reviewed by at least 3 reviewers and will be made available on online prior to the workshop.

Authors of good submisions will be invited to submit extended versions for a special issue of the Cognitive Studies | Études cognitives journal (https://ispan.waw.pl/journals/index.php/cs-ec/index). The extended versions will be carefully peer reviewed, but the scope of this special issue will be set in advance.

Papers should be submitted via EasyChair

The workshop is supported by the CLARIN-PL research infrastructure (http://clarin-pl.eu)

LDK 2017 conference received support from the Global WordNet Association Board

IMPORTANT DATES

Paper submission: 30 March
Notification of Acceptance: 21 April
Workshop Date: 18 June 14:00-18:00
ORGANIZING COMMITTEE

Maciej Piasecki – Wroclaw University of Technology
Francis Bond – Nanyang Technological University
John P. McCrae – Insight Centre for Data Analytics, National University of Ireland, Galway
Jan Wieczorek – Wroclaw University of Technology
PROGRAM COMMITTEE

Maciej Piasecki – Wroclaw University of Technology (chair)
Francis Bond – Nanyang Technological University
John P. McCrae – Insight Centre for Data Analytics, National University of Ireland, Galway
Ewa Rudnicka – Wroclaw University of Technology
German Rigau – Polytechnic University of Catalonia
Darja Fišer – University of Ljubljana
Sonja Bosch – University of South Africa, Pretoria
Monica Monachini – National Research Council of Italy
Verginica Mititelu – Romanian Academy
Stanisław Szpakowicz – Emeritus Professor, University of Ottawa
Antoni Oliver Gonzalez – Open University of Catalonia
Veronika Vincze – University of Szeged
Eneko Agirre – University of the Basque Country
Piek Th. J. M. Vossen – VU University Amsterdam
Adam Pease – Articulate Software
Shikhar Kr. Sarma – Gauhati University
Shu-Kai Hsieh – National Taiwan University
Christiane D. Fellbaum – Princeton University
Bolette Sandford Pedersen – University of Copenhagen

 

Contact:

Address any questions to Maciej Piasecki and Francis Bond at <clarin-pl@pwr.edu.pl>.

We have made available a vector word model learned with neural networks

The published model describes words of natural language by means of multidimensional vectors. The vectors refer to words’ hidden properties which are motivated by the layers of the neural network. The model may be used for determining the degree of word similarity to which specific vectors relate and thus for the generation of word frequency lists.

The model is available at our repository on the D-Space platform: https://clarin-pl.eu/dspace/handle/11321/327

The Programme Board of Clarin-PL has been appointed

The Programme Board has been appointed in accordance with the recommendation of the Ministry of Science and Higher Education in Poland and has an advisory character. Its major task is to indicate new research directions within CLARIN-PL and the evaluation of the current stage of the infrastructure from the perspective of its applications in Humanities and Social Sciences.

Members of the Board:

prof. dr hab. Marek Jeziński
Uniwersytet Mikołaja Kopernika

prof. dr hab. Kazimierz Wolny-Zmorzyński
Uniwersytet Warszawski

prof. dr hab. Jerzy Bartmiński
projekt EUROJOS

dr hab. Marek Troszyński
Instytut Badań Literackich PAN

prof. dr hab. Włodzimierz Gruszczyński
Uniwersytet SWPS

dr hab. Joanna Rączaszek-Leonardi
Instytut Psychologii PAN

dr hab. Kazimierz Krzysztofek, prof. Uniwersytetu SWPS
Uniwersytet SWPS

CLARIN Workshop – Towards Interoperability of Lexico-Semantic Resources

CLARIN Workshop (type I) – Towards Interoperability of Lexico-Semantic Resources

Call for Participation in CLARIN Workshop (type I) — Towards Interoperability of Lexico-Semantic Resources
The goal of the workshop is to improve the interoperability, usability and ease of access of CLARIN lexico-semantic resources. To achieve this goal, we will work on the design of CLARIN open virtual lexical platform facilitating better access to lexical resources and research web applications by Humanities & Social Sciences (H&SS) users. The workshop will have a discussion and group-work based character. Most of the time is reserved for chaired discussions in small topic-related subgroups, concluded with reports presented to the general forum by their chairs during the summing-up session.
Researchers interested in the participation in the workshop should send a letter to the following address: clarin-pl@pwr.edu.pl — including a description of interest:
1. Short description of the background of the applicant, especially related to the workshop goal and topics. If an applicant comes from an institution which is a member of CLARIN, this should be clearly expressed.
2. Description of the motivation for the participation in the workshop.
The description should be approximately 500 words.
The deadline for submission is December 5th 2016. Decisions concerning acceptance will be sent by December 8th 2016.

The workshop is sponsored by CLARIN. Accommodation during the workshop (including one night in a university hostel) is covered. Partial reimbursement of travel costs up to 200 Euros will be offered to all participants.
The maximum capacity is limited to 25 persons who will be selected by PC on the basis of the delivered descriptions of interest. Priority will be given to participants from the CLARIN members (www.clarin.eu).

Workshop Short Description

Name workshopCLARIN Workshop (type I) -- Towards Interoperability of Lexico-Semantic Resources
TypeI: Thematic workshop (no follow-up implementation project envisaged)
Programming CommitteeMaciej Piasecki, Wrocław University of Science and Technology,
maciej.piasecki@pwr.edu.pl
Erhard Hinrichs, Eberhard-Karls-Universität Tübingen
Kadri Vider, University of Tartu, CELR
LocationCELR, University of Tartu, Tartu, Estonia
Dates31st January (9 a.m. opening) - 1st February 2017 (1 p.m. closing)
SummaryThe goal of the workshop is to improve the interoperability, usability and ease of access of CLARIN lexico-semantic resources in order to build a CLARIN open virtual lexical platform facilitating better access to lexical resources by Humanities & Social Sciences (H&SS) users and research web applications.
The topics will include:
the choice of a few common formats for CLARIN lexico-semantic resources (LSR);
the design of a system of web services for accessing CLARIN LSRs,
the common virtual platform for linking, displaying and browsing LSRs;
gathering requirements from H&SS related to LRSs and the platform.
The outcome will be an integrated report with an action plan that will be offered to the wider CLARIN community for further discussion, testing and refinement. The workshop is planned for 3 invited experts and up to 25 participants in total.

Full Description
Workshop goal
The main goal of the workshop is to initiate the works on the improvement of interoperability, usability and ease of access of CLARIN LRSs for the needs of a their better visibility for H&SS users and their enhanced utilization in research applications. The key idea is to coordinate development of a system of web services for accessing LRSs and a common virtual lexical platform built on the top of them. The platform is intended to be an open generic solution that will allow for effective linking, displaying and browsing of the rich variety of data included in CLARIN LRSs. One of the functions of the platform will be a kind of federated search for LRSs. The platform will be an open solution with potentially many installations and many web application based on them.

The main topics will include:

1. the choice of a few common formats for CLARIN lexico-semantic resources (LSR);
2. the design of a system of web services for accessing CLARIN LSRs,
3. the common virtual platform for linking, displaying and browsing LSRs;
4. gathering requirements from H&SS related to LRSs and the platform.

The outcome of the workshop will be an integrated report with an action plan for each of the key topics discussed in the working groups. It will be next offered to the wider CLARIN community for further discussion, testing and refinement. The first, limited set of standards and requirements will be selected for further work. As a follow up we will also prepare a joint proposal of CLARIN type II workshop devoted to the construction of a first set of web services for accessing selected LRSs and a first prototype of the platform. The platform development is planned to be a collaborative work, but CLARIN-PL is prepared for serve as the main developer due to a similar goal in its action plan.

Link to CLARIN strategic priorities
The topics of the workshop are closely related to several CLARIN strategic pillars (cf. CE-2015-0657), namely:
integration of data – the common format and strategy of integration will form the basis for the interoperable multifaceted resources and will thus enable “the development of tools that allow for mapping between different standards”
integration of services – the common platform and system of web services will simplify providing multilingual web services for semantic annotation, exploration, exploitation, enhancement, analysis, manipulation and visualization of language data; support for mono- and multilingual processing workflows,
ease of access – the common platform and system of web services will simplify access to different types of lexico-semantic resources. They will will enhance multilingual browsing and visualization services.
crossing borders – the common format, platform and strategy of integration will facilitate crossing borders between languages, countries, and infrastructures; “international collaboration both at the RI and at the thematic level”.

Profile of proposing team
Maciej Piasecki is an Associated Professor at Wrocław University of Science and Technology and a coordinator of the G4.19 Computational Linguistics and Language Technology Research Group. He holds PhD in Computer Science for work on Natural Language Processing. Maciej has been the leader of the Polish wordnet project since its beginning in 2005 till now. Maciej’s main research topics are: extraction of the lexico-semantic knowledge from corpora, semi-automatic wordnet expansion, Distributional Semantics and relational lexical semantics. Maciej has been also working on morpho-syntactic processing of Polish (co-author of the first Polish tagger), Information Extraction, formal semantics and Machine Translation.
Erhard Hinrichs is a Full Professor (Ordinarius) for General and Computational Linguistics, Eberhard-Karls-Universität Tübingen, Seminar für Sprachwissenschaft, Tübingen, Germany, 1991 – present. He obtained Ph.D., Linguistics from The Ohio State University, Columbus, Ohio, USA. August 1985. He has been the leader of the GermaNet project since it beginning till now. Erhard Hinrichs is CLARIN National Coordinator of Germany.
Kadri Vider is a researcher of Language Technology at Institute of Computer Sciences in University of Tartu. She managed studies of Estonian Wordnet till 2007 when she started to work in Estonian Ministry of Education and Research arranging Estonian membership in CLARIN infrastructure also. Since coming back into University of Tartu in 2011 she manages Center of Estonian Language Resources – a research infrastructure consortium executing obligations of membership of Estonia in CLARIN ERIC. Kadri Vider is CLARIN National Coordinator of Estonia.

Indication of profile(s) and number of envisaged participants
We would like to bring together key wordnet developers from CLARIN, researches with experience in wordnet applications, as well as selected experts from the area of linking wordnets and lexical semantics technology from outside CLARIN. The external experts are meant to provide reference points facilitating the discussions during the workshop.

Participants from CLARIN will be selected on the basis of descriptions of interest collected in an open call. However, we will work on obtaining good coverage with respect to CLARIN wordnets and the workshop topics. We aim at 2 invited plenary talks. The rest of places will be open to moderated discussions and collaboration in working groups. We expect up to 25 participants for 2 days.

Workshop date(s), location and timetable
The workshop will be hosted by the Institute of Computer Science, University of Tartu, Estonia and will last for 2 days starting at 9 a.m. on 31st January and finishing at 1 p.m. on 1st February.

Provisional agenda
The core of the workshop will be chaired parallel working group meetings, each devoted to one of the key topics. They will start with short presentations of (selected) speakers, addressing questions included in the CfP. Presentations will be followed by extensive discussions. Their results will be consolidated into reports and action plans. The working group meetings will be preceded by 1-2 invited plenary talks focused on the experience related to the workshop topics. The workshop will close with the plenary session during which chairs of all the working groups will present their reports and action plans that will be subject to a general discussion. It will be next offered to the wider CLARIN community for further discussion, testing and refinement.

Program Committee:

Maciej Piasecki
Erhard Hinrichs
Kadri Vider

Invited Experts (confirmed so far):
Francis Bond – author of the Open Multilingual WordNet
Christiane Fellbaum – Princeton WordNet

Language Technology Centre is now a CLARIN B Centre

We are happy to announce that CLARIN-PL Language Technology Centre is now an officially certified CLARIN B centre. This means we were given a positive opinion in CLARIN and were granted a Data Seal of Approval.

Type B centres offer free access to language tools and repositories to registered users both in their home country and abroad. The services are listed in the repository, where users may also save and share (according to a license chosen) their own, defined language data. All services are integrated in the Centre, which enables multifaceted processing of texts. The Centre is concerned with accepted standards and laws regarding personal data, intellectual property, license, and ethical aspects.

The certificate recognizes our sustained effort to develop CLARIN in Poland. We would like to thank all our Co-operators who contributed to the creation of CLARIN-PL, as well as our registered users who prove our services are useful.

Polish WordNet 2.3 now available!

We are happy to inform that on July 9, 2015 our team has launched plWordNet 2.3 – the latest version of a large semantic dictionary of Polish. plWN contains 170 000 words, 240 000 unique senses interlinked by 600 000 relations and is now the largest wordnet in the world.

plWN 2.3 has been extended extended to adjectives, contains more multi-word units, definitions, and examples. What’s more, this is the first time we’ve published Emo WordNet – 30 000 lexical units described in terms of their emotional connotations (at the moment available only in the download version).

Together with plWordNet 2.3 we announce publication of enWordNet 0.1 – an extended version of Princeton WordNet 3.1 containing 7 000 new lemmas, 8 000 lexical units, 5 500 synsets, and WNLoom-Viewer – a browsing application for plWordNet.

plWN is a free and open-source app suitable for all uses, including commercial ones.

plWN 2.3 – Download

plWN – View online

plWN – Free mobile app

WNLoom-Viewer – Download

WNLoom-Viewer – User manual

Invitation to the CLARIN Annual Conference

We are glad to inform that the CLARIN Annual Conference addressed to Partners and Users will be held between 15th and 17th October 2015 in Wroclaw, Poland.

As usual, the Conference will be dedicated to the possibilities for practical application of the infrastructure in humanities and in social sciences, as well as to challenges and experiences related to CLARIN development.

Extended abstracts may be submitted until and including 15th July 2015.

For more details please visit Conference website.

 

/Photo source/

Invitation to a conference

FACULTY OF HUMANITIES,  CENTRE FOR DISTANCE LEARNING, CENTRE FOR GAMES RESEARCH AT THE UNIVERSITY OF MARIE CURIE-SKŁODOWSKA
and „GRODZKA GATE – NN THEATRE CENTRE“ IN LUBLIN

under the auspices of DARIAH.PL

invite for a conference:

DIGITAL HUMANITIES – THEORY AND PRACTICE

November 26-27, 2015, University of Marie Curie-Skłodowska, Lublin

 Digital humanities constitutes the most rapidly developing research area. A vast number of innovative tools and methods for exploration of the contemporary digital culture and the digitized history have appeared within its realm. Achievements of the field remodel the work of a modern humanist – their methods, interpretation strategies, means of communication, ways of conducting research and presenting results.

By means of this conference we wish to enable researchers operating within this new optics to exchange their experiences and to promote the trend among researchers attached to traditional methods. In particular, we would like to focus on:

  1. Digital tools for humanists
  2. Research methods
  3. Theoretical consideration of digital humanities
  4. Sample projects held within the domain
  5. Humanistic education in the digital era

The second day of the conference will include workshops (RemiksLab) on tools and apps, devoted to digital humanists and led by qualified practicioners from all over the Poland.

Both plenary and profiled discussions will be broadcast live and translated simultaneously. The program will also include a panel discussion on the development of digital humanities, and building a definition of digital humanities to be published on Wikipedia.

The prospectus of the conference will take the forms of a reviewed e-book and a film.

To submit your proposal of presentation please contact us on polska@gmail.com using the application form by June 30, 2015.

Conference website: humanistykacyfrowa.teatrnn.pl

The Research Council of the conference will decide on admissions. All admitted participants will be entitled to take part in workshops, whose program will be posted on the conference website at the beginning of September.

 

Head of the Organizational Committee

Andrzej Radomski, DSc

andrzejradomski@gmail.com

 

Secretaries:

Ewa Sroka, PhD

emurska@gmail.com

Radosław Bomba, PhD

radoslawbomba@gmail.com

Successful partnership of humanities and information technologies

Maciej Piasecki, PhD, Eng., the coordinator of CLARIN in Poland, will give a lecture on „CLARIN-PL: successful parthership of humanities and information technologies” at  the international interdisciplinary conference „Culture and Professions of the Future: Humanities as the Arts and Technology“on May 4-5, 2015 in Łódź.