12
M_T08_EN_B
Introduction
Practical consequences
The old systems using the ASCII 8 bit fonts may communicate with the T08 program, which uses 16 bit fonts.
To allow this interoperability, 2 methods are possible:
• The ASCII and UCS tables are common for the characters numbered from 0 to 127. For these characters,
compatibility is guaranteed.
• For other characters, UTF-8 encoding is used to specify the 16 bit character code by using the sequences
of several 8 bit "characters", which can be managed by 8 bit systems.
Using the T08 program
As long as the program doesn't exchange data with the exterior, it uses the 16 native bit fonts. The user can
benet from extended linguistic support.
Manualeditingofthemarkingles
The marking les (.tml les) in TML format (Technifor Marking Language) are saved in UTF-8 format. This
format is automatically recognized when a le is opened with Note Pad (notepad.exe) in Windows (2000 or
more recent).
For an exact display, select a compatible display font, such as "Arial Unicode MS".
When saving, make sure the UTF-8 format is well selected (especially when creating a new le).
Data reception/emission (RS232, telnet)
The characters received must be encoded in UTF-8. Characters emitted are encoded in UTF-8.
If a device sends to the machine characters in ASCII encoding, they are only recognized if their codes are
included between 0 and 127. In this manner, this device may correctly receive the characters whose codes
are included between 0 and 127.
To use other characters, these must be encoded in UTF-8.
Example: to send the "é" character, the device cannot use the ASCII encoding of the Latin 1 character page
which assigns this character the code 130.
It must use UCS encoding which assigns it the code 133. In UTF-8, the code is written as follows:
11000011 10101001
It must thus send 2 bytes: 195 - 169.