ENGLAWI an ENglish Great Lexicon for Accessing Wiktionary Information
ENGLAWI is an English Machine-Readable Dictionary encoded in XML format.
It is a structured and normalized version of Wiktionary.
The dictionary includes:
simple words, compounds and multiword expressions
inflected forms and lemmas
pronunciations in API
definitions (glosses and examples)
ENGLAWI is supplied with G-PeTo, a series of Perl Scripts intended to help extract information from the large XML file. Ready-to-use lexicons that have been extracted from ENGLAWI are also provided (see the download section below).
ENGLAFF: an Inflectional Lexicon extracted from ENGLAWI
Franck Sajous, Basilio Calderone and Nabil Hathout (2020).
ENGLAWI: From Human- to Machine-Readable Wiktionary.
Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020),
Marseille, France, pp. 3016-3026. [ PDF ] [ Bibtex ]