Normalizing Chinese Text

Author: Christian Wittern


Normaliz

Normaliz is a little tool, that allows you to create a new, normalized version of a text. It was developed for our texts inputted in Shanghai and knows only of characters that we have encountered so far.

Normaliz is invoked on a DOS prompt by typing
normaliz <filename>
where. On a Macintosh, you can just drag your file on the Normaliz icon.

Normaliz can work in three different modes. It will ask you which one you want to invoke.

In the silent mode, the characters will be silently replaced and you have no way to check them later.

The report mode creates a list of the characters replaced in a new file ending with .rep.

If you want to review these changes later, you can select the verbose mode, where some markers are inserted into your text, to show where revisions have been made. These markers take the following form:
<APP><LEM wit="normalize">replaced Character</LEM><RDG wit="original"></RDG></APP> which is, by the way, one way to encode variants with the TEI Guidelines.

Author:Christian Wittern
Last updated: 95/05/01