REDAC
REsources Developed At the CLLE-ERSS research unit







PREF-IT
PREFixed ITalian verbs corpus
Description
PREF-IT consists of 1680 Italian verbs morphologically constructed from nominal or adjectival bases by prefixation. The data were collected as part of Giuseppina Todaro's PhD thesis (Todaro, 2017) at Université Toulouse Jean Jaurès, under a joint supervision agreement with Università Roma Tre.

The prefixed verbs were automatically extracted from the ItWaC corpus (Baroni et al., 2009) using regular expressions (see Todaro 2017, chap. 3), then manually cleaned up and supplemented with the Treccani neologism list and some non-systematic web-based extractions.
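
The extraction step can be pictured with a minimal sketch. It is not the procedure used for PREF-IT: the actual regular expressions are given in Todaro 2017, chap. 3 and are not reproduced here. The prefix list, the pattern and the function name `extract_candidates` are all assumptions made for illustration only.

```python
import re

# Hypothetical prefix inventory, for illustration only; the prefixes
# actually covered by PREF-IT are documented in the thesis and the README.
PREFIXES = ("de", "dis", "in", "s", "a")

# A candidate is a word that starts with one of the prefixes, continues with
# at least two letters, and ends in an infinitive ending (-are, -ere, -ire).
# Longer prefixes are tried first so that "dis" is preferred over "de".
CANDIDATE = re.compile(
    r"\b(?P<prefix>" + "|".join(sorted(PREFIXES, key=len, reverse=True)) + r")"
    r"[a-zàèéìòù]{2,}?(?:are|ere|ire)\b"
)


def extract_candidates(text):
    """Return the sorted, unique (prefix, word) pairs matched in the text."""
    found = set()
    for match in CANDIDATE.finditer(text.lower()):
        found.add((match.group("prefix"), match.group(0)))
    return sorted(found)


if __name__ == "__main__":
    sample = "Devo incorniciare il quadro e sbucciare le mele prima di andare."
    for prefix, word in extract_candidates(sample):
        print(prefix, word)
```

A purely formal pattern like this one also returns words such as "andare", which merely begin with the same letters as a prefix. This is why a manual clean-up pass is needed after automatic extraction.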

Each verb has been manually tagged for formal features (such as the prefix involved and the inflection class of the lexeme) and semantic aspects (see the README.txt and Todaro 2017 for more details on the approach adopted). For each verb and base, PREF-IT also reports a frequency count based on the ItWaC corpus.

Person in charge
Giuseppina Todaro

Licence

Some rights are reserved. PREF-IT is released under a Creative Commons BY-NC-SA 2.0 licence.

Download
  • paracorpus.zip contains the resource as a spreadsheet in XLSX format and a README file that describes every field.
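
The names of the files inside the archive are not stated here, so the sketch below makes no assumption about them. It uses only the Python standard library to list the archive members and group them by the two file types mentioned above. The function name `find_members` is hypothetical.

```python
import zipfile


def find_members(archive):
    """Group archive member names into spreadsheets and README files.

    `archive` is a path to the downloaded zip file (or a file-like object).
    """
    with zipfile.ZipFile(archive) as zf:
        names = zf.namelist()
    return {
        "spreadsheets": [n for n in names if n.lower().endswith(".xlsx")],
        "readmes": [n for n in names if "readme" in n.lower()],
    }


if __name__ == "__main__":
    # Assumes paracorpus.zip has been downloaded to the working directory.
    print(find_members("paracorpus.zip"))
```

Once the spreadsheet has been located and extracted, it can be opened with any XLSX-capable tool. The column names should be taken from the README, not guessed.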
Reference
Todaro, Giuseppina (2017). Nomi (e aggettivi) che diventano verbi tramite prefissazione: quel che resta della parasintesi. PhD thesis. Università Roma Tre / Université Toulouse Jean Jaurès.