PREF-IT

REsources Developed At CLLE
Homepage Resources Applications Corpora Lexicons Other resources About CLLE This website Legal notice Contact

PREF-IT
PREFixed ITalian verbs corpus

Description

PREF-IT is composed by 1680 Italian verbs morphologically constructed from nominal or adjectival bases by prefixation process. Data have been retrieved during the Giuseppina Todaro's PhD thesis (Todaro, 2017) at Université Toulouse Jean Jaurès under a joint supervision agreement with Università Roma Tre.

The prefixed verbs have been automatically extracted from the ItWaC corpus (Baroni et al., 2009) by regular expressions (see Todaro 2017, chap.3) then manually cleaned-up and integrated with the Treccani neologism list and some non-systematic web-based extractions.

Each verb has been manually tagged with respect to formal features (such as the prefix involved and the flection class of lexeme) and semantic aspects (see Todaro 2017, in the README.txt and the thesis document for more details on the adopted approach). For each verb and base, PREF-IT also reports the frequency count which is based on the ItWaC corpus.

Person in charge

Giuseppina Todaro

Licence

Some rights are reserved. PREF-IT is released under a Creative Commons BY-NC-SA 2.0 licence.

Download

paracorpus.zip contains the ressource as a spreadsheet in XLSX format and a README file in which every field is described.

Reference

Todaro, Giuseppina. (2017). Nomi (e aggettivi) che diventano verbi tramite prefissazione: quel che resta della parasintesi. Thèse de doctorat. Università Roma Tre / Université de Toulouse-Jean Jaurès.