This documentation is based on (Sajous and Hathout, 2015) and (Hathout and Sajous, 2016).
94% of GLAWI's entries contain one or several phonemic transcriptions in API. They correspond to pronunciations that occur at different places in Wiktionnaire's articles:1
The two screenshots below illustrate the pronunciation elements that can be found in the entry sorciÃ¨re `witch':Example of the entry sorciÃ¨re: transcriptions at the POS (noun) level:
Example of the entry sorciÃ¨re: transcriptions at the article level:
As shown by the second screenshot, transcriptions occurring in the Prononciation section at the end of the articles may mention diatopic variations. In the example above, national variants distinguish hexagonal French and French from Canada (Quebec). Regional variants may also be mentioned, as illustrated by the screenshot above: moins `minus, less' is pronounced differently in Paris, perceived as "standard" French and in Marseille area (Haut Languedoc):
1 471 missing pronunciations have been taken from the resource Lexique.org developed by Boris New. These pronunciations are identified by the attribute from=
The pronunciations elements may be found as children of
text and pos elements.
The optional area attribute (indicating diatopic variations) of the pron elements
may occur when the pronunciations element is a direct child of text.