Section "unicode"

The Dogelog player accepts Prolog texts in Unicode code points based on Unicode general categories and numeric values. We rolled our own Unicode database in that we compressed the desired Unicode character properties. The following utilities written in Java are provided to generate the Unicode database:

The output of these utilities is to be placed in the corresponding "unicode" file. It will provide the Unicode database in compressed form to reduce the file footprint and allow delivery over the wire. The compression is possible since the Unicode database needs only to provide general category and numeric value.

The Unicode database provides the following host language calls:

code_type(C): (host language)
Retrieve a Unicode code point general category. The general category is an integer in the range 0 to 16 and 18 to 30.
code_numeric(C): (host language)
Retrieve a Unicode code point numeric value. The numeric value is an integer in the range -1 or 0 to 36.

Kommentare