

Several options are available to display or not some details: here the analyses are given, but one can choose to display only the lemmas with their translation. Collatinus’ window with the beginning of Aeneid in the upper frame (text under study) and the result of lemmatization (bottom part).

By the way, this encoding allows also to translate the analysis into different languages. The number is converted in its human readable form when needed, i.e., for the display as in Figure 2. As a matter of fact, there are only 416 of these analyses.
COLLATINUS ART CODE
Instead of an explicit analysis as e.g., nominative singular, we just make a list of the morphosyntactical analyses that are possible in Latin and code the analysis with a simple number. The word-endings carry part of the information for the analysis, which is then stored in the file. And, obviously, we have also to check that the two parts fit together (i.e., that they are associated to the same paradigm with a proper matching rule). For a given form, we have to split it in all the possible ways and to check that the first part coincides with a root and the last one with a known word-ending. However, the lemmatization of a form requires the reverse process. During the search, the diacritics are not taken into account, but here we have chosen to display only the exact match, in red. One of them contains the text under study (bottom-left) and the other one (right) presents the result of the lemmatization. Its main window, top-left, is devoted to the consultation of the dictionaries. An option has also been introduced to show only this exact match, as seen in Figure 1.įigure 1. The exact match, if it exists, appears first and is red-colored. On the other hand, the search of a word that comes from a Greek text, with its diacritic signs, gives too many results. In a case where the Greek word is typed with Latin characters, it produces a simple match. Forgetting the “decoration” of the forms reduces their number to 853,000. The case of the iota subscript may be discussed, but, as it is encoded with a special character, it is treated as any other diacritic. To alleviate the problem of accent and breathing marks, these diacritic signs are not taken into account in the search of a form. The Greek words in the list are transliterated using betacode. But, we will consider here the features of the off-line versions. Both programs have an online version, developed with the help of Régis Robineau, webmaster of the Biblissima project. As open-source programs, Collatinus and Eulexis can be programmed to meet any particular problem. Eulexis is newer and is intended as Collatinus’ counterpart for Greek texts, though it is based on different mechanisms. The program also allows word search in the dictionaries, either in a digital format or as fixed-page images. It generates a complete lexical aid, with a short translation and the morphological analyses of the forms, for any text that might be given to students. Collatinus was originally developed by Yves Ouvrard for teaching.
COLLATINUS ART FULL
They also contain full digital dictionaries that give all the details about a word’s usage.

COLLATINUS ART FREE
Collatinus and Eulexis are free open-source programs, designed to lemmatize Latin or Greek texts and to give a short translation of the parsed words.
