REDAC
REsources Developed At CLLE CLLE: Cognition, Langues, Langage, Ergonomie






GLAWI

ENGLAWI's documentation
Definitions

Definitions

Description

Inside pos tags, the definitions (plural) element may include several definition (singular) children, each describing a word sense.

A definition contains a gloss and possibly one or more usage examples (example tag) or references (exampleRef tag), each of them possibly including labels that give attitudinal, diatopic, diachronic, diafrequential information or indicate that the word is a term belonging to a specialized domain.

Additionaly, a definition may mention synonyms, antonyms, hypernyms or hyponyms encoded in semRel tags

Each gloss, example and exampleRef element contains information available under 4 versions:

  • the original wikicode;
  • a plain text version;
  • an XML version that formally encodes various information: markups encode wiki typesetting (boldface, italic, etc.), dates, foreign words, and external/inner links, etc. (see further description). A documentation page is also dedicated to the quotations found in exampleRef elements;
  • a syntactic parsing of the text in CoNLL format produced by the Talismane parser.

The picture below illustrates a sense (= a given definition tag) of the adjective free, related to the domain of computer software.


Embedded definitions
In polysemous entries, the definition element corresponding to the headword's meanings are generally direct children of the definitions element. In headwords like free, head, form, product, etc., definitions may be embedded in other definitions.
For example, the noun product is described in Wiktionary as follows:
Noun
product (countable and uncountable, plural products)
    1. (countable, uncountable) A commodity offered for sale.

    2. (cosmetics, uncountable) Any preparation to be applied to the hair, skin, nails, etc.

    3. Anything that is produced; a result.
          1. The amount of an artifact that has been created by someone or some process.
          2. A consequence of someone's efforts or of a particular set of circumstances.
          3. (chemistry) A chemical substance formed as a result of a chemical reaction.
          4. (arithmetic) A quantity obtained by multiplication of two or more numbers.
          5. (mathematics) Any operation or a result thereof which generalises multiplication of numbers, like the multiplicative operation in a ring, product of types or a categorical product.
          6. Any tangible or intangible good or service that is a result of a process and that is intended for delivery to a customer or end user.

    4. (US, slang) Illegal drugs, especially cocaine, when viewed as a commodity.

Meaning #3 is divided into 6 subsenses. The corresponding (shortened) XML encoding in ENGLAWI is:

XML structure

<!ELEMENT definitions (definition)*> <!ELEMENT definition (gloss|example|exampleRef|semRel|definition)*> <!ATTLIST definition level CDATA #IMPLIED> <!ELEMENT gloss (labels?, wiki?, xml?, txt?, parsed?)> <!ELEMENT example (labels?, wiki?, xml?, txt?, parsed?)> <!ELEMENT exampleRef (labels?, wiki?, xml?, txt?, parsed?)> <!ELEMENT semRel (#PCDATA)> <!ATTLIST semRel type (ant|syn|hyper|hypo) #REQUIRED> <!ELEMENT labels (label)*> <!ELEMENT label (#PCDATA)> <!ATTLIST label type (attitudinal|diachronic|diafreq|diatopic|domain|gram|of|other|sem) #REQUIRED modifier (almost|chiefly|generally|largely|less|mildly|more|moreOften|now|nowChiefly |nowLargely|nowLess|nowMore|nowMostly|nowOften|nowOnly|nowUsually|often |only|originally|particularly|perhaps|possibly|primarily|slightly|sometimes |somewhat|usually|very) #IMPLIED value CDATA #REQUIRED> The description of wiki, xml, txt and parsed is available here.
Labels are also described in a separate page.

Semantic relations in definitions

After a gloss and possibly some examples, a definition may include a list of synonyms, antonyms, hypernyms or hyponyms. Semantic relations usually occur in the section child of pos elements (see description). Semantic relations found in a given definition are linked to a particular meaning.

Below is a shortened XML of the article book: while a section with a semRel type's value relates to the whole pos element (book as a noun), specific synonyms are linked to a particular meaning (tome and volume are synonyms of book when book means "a major division of a long work"; booklet is a synonym of book when book means "a convenient collection [...] of small paper items"; etc.)




At last, synonyms may occur in glosses of synonymic definitions, encoded by a formOf element. For example, the adjective rethoric is simply defined as "Synonym of rhetorical":


Some synonymic definitions are followed by an additional explanation, such as the fourth meaning of powder room, defined as a synonym of half bath, in turn defined as a "small room with...":



Back to ENGLAWI's [ main documentation page ] [ project page ]