Lexicon

The lexicon (dictionary) is where the entry for each word is stored. Along with each word is some info telling whether it is a noun, verb, adjective, &c., and how to deal with it with regards to inflections. The simple one or two word English translations come from Foclóir Póca, Foclóir Gaeilge-Béarla (Ó Dónaill), Learner's Irish-English / English-Irish Dictionary (Ó Siochfhradha), and Téarmaí Ríomhaireachta (An Gum). Many of the words have been included in the lexicon to show the rules of a subset of the word-type in question. The rest are ones I encounter and look up often enough to make me want to put them in.

There are utilities that convert the lexicon entries from a format that is easy for the computer to read to one that is easy for a human to read, and back again. The human-readable format is called a dump-format, the other a lex-format.

Currently there are two programs that deal directly with the lexicon.

ltod

ltod (lex to dump) reads in a lexicon and prints out selected items in the long (dump) format. If the word has multiple entries in the lexicon (for example 'do', which is among other things a possessive adjective and a preposition) then all are printed. Also, if the word has irregular forms, all the forms will be printed. Click here to see a sample printout.

Enter a word to dump:

dtol

dtol (dump to lex) reads the long dump format and turns it info the compact format used by the programs. By filling in the form below, you can add a word to the lexicon - then use ltod to check that the info is correct, and clár to look at the inflectional pattern.

DTOL is not yet available.

If you have words you'd like added to the lexicon, please send mail to lexicon@ceantar.org.


jtm