The DiscAn corpus is a collection of subcorpora of Dutch language that have been annotated at the level of discourse. These subcorpora form a set of Dutch corpus analyses of coherence relations and discourse connectives that have been compiled and annotated by researchers at several universities in The Netherlands and Belgium. In the DiscAn project, funded by CLARIN-NL, this set of corpus analyses has been standardized (both in terms of raw data – the texts – and analyses) and opened up for further scientific research.
data, corpus, text corpus, mono-lingual
Max Planck Institute for Psycholinguistics
Prof. Dr. T.J.M. Sanders (Utrecht University)