![]() |
|
||||||
|
![]() ![]() Description
The file below is the result of the manual annotation carried out in the framework of the comparison of distributional models, as described in (Tanguy et al., 2015). The dataset is based on the analysis of the version of the TALN corpus corresponding to the years 2007-2013. 4 different annotators have judged the relevancy of the neighbors computed by the distributional models for the 30 following words:
The notion of neighborhood is not restricted to a specific semantic relation: the neighbors may be synonyms, hypernyms, hyponyms, antonyms or words that are otherwise semantically related. The dataset is a tabulated file (UTF-8 encoded) whose lines contain:
Design
Person in charge
Franck SajousContact: Licence
Some rights are reserved.
The gold standard file is available under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. Download
gold-semdis-corpusTALN.csv
Reference
L. Tanguy, F. Sajous et N. Hathout (2015).
Évaluation sur mesure de modèles distributionnels sur un corpus spécialisé : comparaison des approches par contextes syntaxiques et par fenêtres graphiques.
TAL, 56(2), pp 103-127. [ Article ] [ Bibtex ]
|