REDAC
REsources Developed At CLLE CLLE: Cognition, Langues, Langage, Ergonomie







Version française
TALN Corpus
articles extracted from the proceedings of the TALN and RECITAL conferences between 2007 and 2013
Versions
This pages contains an old version of the corpus, that was developed within the SemdDis2014 workshop. A most recent version of the corpus is available here.

Description

The TALN proceedings corpus is a subset of the scientific articles presented at the TALN (Traitement Automatique des Langues Naturelles) and RECITAL (Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues) conferences from 2007 to 2013.
This corpus consists of 586 articles counting about 2 million words.

Sources and metadata have been compiled by Florian Boudin (LINA, Université de Nantes) Website... ]
Articles' selection and conversion has been performed by Ludovic Tanguy (CLLE-ERSS, Université de Toulouse).

Person in charge
Ludovic Tanguy :

Licence

Articles from TALN and RECITAL conferences are the property of the Association pour Traitement Automatique des LAngues (ATALA). Please read the corpus' licence and terms of use.


Download