Antidote 9 User Guide

Import formats

Introduction

This document describes the structure of the files used to export and import Antidote’s personal dictionaries.

Structure of the Export and Import Files

  • Text format
  • Each line must contain 1, 2 or 8 tokens separated by tabs.
  • Lines beginning with “//” are ignored, with some exceptions (see the sections Word Language and Encoding).
  • Empty lines are ignored.
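
The rules above can be sketched as a small line splitter. This is only an illustration: the function name split_import_line is hypothetical, and treating any other token count as an error is an assumption, since the guide does not say how Antidote itself reports malformed lines.

```python
from typing import List, Optional

# Token counts that the import format allows on a single line.
VALID_TOKEN_COUNTS = {1, 2, 8}


def split_import_line(line: str) -> Optional[List[str]]:
    """Return the tab-separated tokens of a line, or None if the line is skipped."""
    text = line.rstrip("\r\n")
    if not text.strip():
        return None  # empty lines are ignored
    if text.startswith("//"):
        return None  # comment; the special comments are handled separately
    tokens = text.split("\t")
    if len(tokens) not in VALID_TOKEN_COUNTS:
        # Assumption: any other count is rejected.
        raise ValueError(f"expected 1, 2 or 8 tokens, got {len(tokens)}")
    return tokens
```

For example, split_import_line("blog\tNoun") yields the two tokens blog and Noun, while a comment line or an empty line yields None.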

Word Language

  • To specify the language of the words to import, the following three identifiers are used:
    • FR: French
    • EN: English
    • ML: Multilingual (English and French)
  • These identifiers can be used in two places:

    • Globally, indicating the identifier in parentheses at the beginning of the file in the following comment:

      Name : DICTIONARY_NAME (LANGUAGE_IDENTIFIER)
    • For each word, indicating the language identifier as the first attribute (see the section Importing 8 tokens below).
    • If the language is not specified by either of these methods:
      • In multilingual Antidote, an import error will occur.
      • In monolingual Antidote, the default language will be that of Antidote.
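
As a rough sketch, the global language identifier could be read from the Name comment as follows. The helper name global_language is hypothetical, and because the guide does not show whether the comment carries the leading “//”, the pattern below assumes it is optional.

```python
import re
from typing import Iterable, Optional

# The three identifiers listed in this section.
LANGUAGE_IDENTIFIERS = {
    "FR": "French",
    "EN": "English",
    "ML": "Multilingual (English and French)",
}

# Assumption: the header may or may not start with "//".
_NAME_COMMENT = re.compile(
    r"^\s*(?://)?\s*Name\s*:\s*(?P<name>.*?)\s*\((?P<lang>[A-Za-z]+)\)\s*$"
)


def global_language(first_lines: Iterable[str]) -> Optional[str]:
    """Return FR, EN or ML if a Name comment declares it, else None."""
    for line in first_lines:
        match = _NAME_COMMENT.match(line)
        if match and match.group("lang") in LANGUAGE_IDENTIFIERS:
            return match.group("lang")
    return None
```

When this returns None, the fallback described above applies: an import error in multilingual Antidote, or Antidote’s own language in monolingual Antidote.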

Encoding

Antidote tries to guess the file’s encoding based on its content and your system platform. You can also declare the encoding explicitly so that Antidote does not have to guess: use the special comment coding: at the beginning of the file, followed by the encoding identifier.

The encoding identifiers are:

  •  mac_roman
  •  iso8859_15
  •  cp1252
  •  utf_8
  •  utf_16_le
  •  utf_16_be

Example:

// coding: utf_8
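
The following sketch shows one way a script might recognize the coding comment and produce a file that carries it. The names declared_encoding and encode_file are hypothetical, and the use of “\n” as the line ending is an assumption. The six identifiers listed above are also valid Python codec names, which is why they can be passed directly to str.encode here.

```python
import codecs
import re
from typing import Iterable, Optional

# Encoding identifiers listed in this section.
ENCODING_IDENTIFIERS = (
    "mac_roman", "iso8859_15", "cp1252", "utf_8", "utf_16_le", "utf_16_be",
)

_CODING_COMMENT = re.compile(r"^//\s*coding:\s*(\S+)\s*$")


def declared_encoding(line: str) -> Optional[str]:
    """Return the identifier named in a '// coding:' comment, if it is a listed one."""
    match = _CODING_COMMENT.match(line.strip())
    if match and match.group(1) in ENCODING_IDENTIFIERS:
        return match.group(1)
    return None


def encode_file(lines: Iterable[str], encoding: str = "utf_8") -> bytes:
    """Prefix the lines with the coding comment and encode the result."""
    if encoding not in ENCODING_IDENTIFIERS:
        raise ValueError(f"unsupported encoding identifier: {encoding}")
    codecs.lookup(encoding)  # raises LookupError if Python lacks the codec
    text = "".join(f"{line}\n" for line in [f"// coding: {encoding}", *lines])
    return text.encode(encoding)
```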

Importing One Token

If a line contains only one token, that token is considered the base form, and Antidote infers the word category and, if required by the category, the word morphology or verb model. In multilingual Antidote, the word language must be globally specified at the top of the file with a special comment (see the section Word Language).

BASE_FORM __END_OF_LINE__

Importing Two Tokens

If a line contains two tokens, the first token is considered the base form and the second is considered the category. Antidote infers the word morphology or verb model, if required by the category.

In multilingual Antidote, the word language must be globally specified at the top of the file with a special comment (see the section Word Language).

BASE_FORM __TAB__ CATEGORY __END_OF_LINE__
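
The one-token and two-token layouts can be produced with a helper along these lines. The name simple_entry is hypothetical, and rejecting tabs or line breaks inside a token is an assumption made so that the token count stays unambiguous.

```python
from typing import Optional


def simple_entry(base_form: str, category: Optional[str] = None) -> str:
    """Build a one-token line (base form) or a two-token line (base form, category)."""
    if not base_form:
        raise ValueError("the base form must not be empty")
    for token in (base_form, category or ""):
        # Assumption: a token may not contain the separator or a line break.
        if any(ch in token for ch in "\t\r\n"):
            raise ValueError("tokens must not contain tabs or line breaks")
    if category is None:
        return base_form
    return f"{base_form}\t{category}"
```

Here simple_entry("blog") returns the single token, and simple_entry("blog", "Noun") returns the base form and the category separated by a tab. In both cases Antidote infers whatever is not supplied.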

Importing 8 tokens

If a line contains 8 tokens, these tokens are considered in the following order:

  • Base form
  • Category
  • Inflection 1 field
  • Inflection 2 field
  • Inflection 3 field
  • Inflection 4 field
  • Attributes separated by +
  • Definition

    • To introduce a new definition, insert the “◊” character
    • To introduce a new paragraph, insert the “\” and “r” characters
    • For a multilingual word, it is possible to specify a definition for each language. All definitions are specified in the same field. The French definition must be preceded by the keyword |FR| and the English definition by the keyword |EN|.

      Example:

      |FR|Définition de « ABCDEFG »|EN|Definition of “ABCDEFG”
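
A minimal sketch of assembling an 8-token line in the order listed above. The helper names full_entry and bilingual_definition are hypothetical. Placing the language identifier first in the “+”-separated attribute field is one reading of “the first attribute” in the section Word Language, not a confirmed detail.

```python
from typing import Sequence


def bilingual_definition(french: str, english: str) -> str:
    """Combine a French and an English definition in a single field."""
    return f"|FR|{french}|EN|{english}"


def full_entry(
    base_form: str,
    category: str,
    inflections: Sequence[str],
    attributes: Sequence[str],
    definition: str,
) -> str:
    """Build an 8-token line: base form, category, 4 inflections, attributes, definition."""
    if len(inflections) != 4:
        raise ValueError("exactly four inflection fields are required (use '' for empty)")
    tokens = [base_form, category, *inflections, "+".join(attributes), definition]
    if any("\t" in token for token in tokens):
        raise ValueError("tokens must not contain tabs")
    return "\t".join(tokens)


# English noun: inflection fields 1 and 2 are singular and plural, 3 and 4 stay empty.
line = full_entry("blog", "Noun", ["blog", "blogs", "", ""], ["EN", "Thing"], "An online journal")
```

Empty inflection fields are still written as empty tokens, so the line always contains seven tabs.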

Accepted Categories

French Words

  • Acronyme (acronym or initialism)
  • Adj (adjective)
  • Adv (adverb)
  • Interj (interjection)
  • LocutionLatine (Latin expression)
  • Nom (common noun)
  • NP (proper noun)
  • Verbe (verb)
  • Pref (prefix)

English Words

  • Acronym (acronym or initialism)
  • Adj (adjective)
  • Adv (adverb)
  • Interj (interjection)
  • Noun (common noun)
  • PN (proper noun)
  • Verb (verb)

Multilingual Words

  • Acronyme (acronym or initialism)
  • Acronym (acronym or initialism)
  • NP (proper noun)
  • PN (proper noun)

Inflection Fields

French Words

  • Acronym, adjective, adjectival Latin expression, noun, proper noun
    • Inflection 1 field: Masculine singular
    • Inflection 2 field: Masculine plural
    • Inflection 3 field: Feminine singular
    • Inflection 4 field: Feminine plural
  • Verb
    • Inflection 1 field: Verb model
    • Inflection 2 field: Leave empty
    • Inflection 3 field: Leave empty
    • Inflection 4 field: Leave empty
  • Adverb, adverbial Latin expression, interjection, prefix
    • Inflection 1 field: Leave empty
    • Inflection 2 field: Leave empty
    • Inflection 3 field: Leave empty
    • Inflection 4 field: Leave empty

English Words

  • Noun, acronym
    • Inflection 1 field: Singular
    • Inflection 2 field: Plural
    • Inflection 3 field: Leave empty
    • Inflection 4 field: Leave empty
  • Adjective
    • Inflection 1 field: Singular
    • Inflection 2 field: Comparative
    • Inflection 3 field: Superlative
    • Inflection 4 field: Leave empty
  • Verb
    • Inflection 1 field: 3rd person
    • Inflection 2 field: Past
    • Inflection 3 field: Past participle
    • Inflection 4 field: Present participle
  • Adverb, interjection, proper noun
    • Inflection 1 field: Leave empty
    • Inflection 2 field: Leave empty
    • Inflection 3 field: Leave empty
    • Inflection 4 field: Leave empty

Multilingual Words

  • Acronym, proper noun
    • Inflection 1 field: Masculine singular
    • Inflection 2 field: Masculine plural
    • Inflection 3 field: Feminine singular
    • Inflection 4 field: Feminine plural

Attributes Accepted for Each Category

French Words

  • Noun
    • Chose
      •  Entite
      •  Diplome
    • Personne
      •  Fonction
    •  Animal
  • Adjective
    • Habitant
      •  Langue
  • Interjection
    •  Sensation
    •  Message
    •  Bruit
  • Adverb
    •  Maniere
    •  Temps
    •  Lieu
  • Latin expression
    •  Adjectif
    •  Adverbe
  • Proper Noun
    •  Famille
    •  Prenom
    •  Compagnie
    •  Marque
    • Lieu
      •  VoieDeCirculation
      •  Ville
      •  Ile
      •  Region
      •  CoursDeau
      •  CorpsCeleste
    • Habitant
      •  Langue
    •  TitreOeuvre
    •  Autre
  • Acronym
    •  Compagnie
    • Chose
      •  NonComptable
      •  Diplome
    • Personne
      •  FonctionSociale
  • Verb
    • No attribute is accepted
    • Accepted models
      •  Aimer
      •  Finir
      •  Courir
      •  Rendre
  • Prefix
    •  FacultTDU
    •  BesoinTDU
    •  ErreurTDU
    •  FacultTDUReforme
    •  BesoinTDUReforme
    •  ErreurTDUReforme

English Words

  • Noun
    • Thing
      •  Entity
      •  Diploma
    • Person
      •  Profession
    •  Animal
    •  Group
  • Adjective
    • Demonym
      •  Language
  • Interjection
    •  Emotional
    •  Message
    •  Sound
  • Adverb
    •  Manner
    •  Time
    •  Place
  • Proper Noun
    •  LastName
    •  FirstName
    •  Company
    •  Brand
    • Place
      •  Street
      •  Town
      •  Island
      •  Region
      •  CourseOfWater
      •  CelestialBody
    • Demonym
      •  Language
    •  TitleOfAWork
    •  Other
  • Acronym
    •  Company
    • Thing
      •  Uncountable
      •  Diploma
    • Person
      •  Profession
  • Verb
    • No attribute is accepted

Multilingual Words

  • Proper Noun
    •  Famille, LastName
    •  Prenom, FirstName
    •  Compagnie, Company
    •  Marque, Brand
    • Lieu, Place
      •  VoieDeCirculation, Street
      •  Ville, Town
      •  Ile, Island
      •  Region, Region
      •  CorpsCeleste, CelestialBody
    • Habitant, Demonym
      •  Langue, Language
    •  TitreOeuvre, TitleOfAWork
    •  Autre, Other
  • Acronym
    •  Compagnie, Company
    • Chose, Thing
      •  NonComptable, Uncountable
      •  Diplome, Diploma
    • Personne, Person
      •  FonctionSociale, Profession