What does "normalised" mean?

For the Leipzig co-occurrence analysis, the machine-readable CAMENA texts were loaded into a database. For the time being, the character strings stored in the database have been standardised according to the following letter-replacement rules:


Please note: the search form does not apply these rules automatically. When choosing search terms for the Leipzig co-occurrence analysis, users should apply the rules to their search terms themselves.

Caveat lector: this standardisation is experimental. It does not take account of the orthographic peculiarities of text elements in other languages (German, French, Italian, English, Spanish, Hebrew, Arabic, etc.), with the exception of Greek (in Beta Code, see above).

On the subject of standardisation of the machine-readable CAMENA texts, see also Regeln für die Abschrift und Auszeichnung der CAMENA-Texte (rules for transcription and markup of the CAMENA texts).