1 |
TECkit (Text Encoding Conversion toolkit) is a toolkit for converting data |
2 |
between 8-bit legacy encodings and Unicode. It can also be used for |
3 |
transliteration of Unicode between different scripts. |
4 |
|
5 |
TECkit uses a mapping description language (mapping byte encodings to Unicode). |
6 |
Mapping rules can be extended by (1) the use of character sequences rather than |
7 |
single characters on either side; (2) by the addition of contextual constraints |
8 |
(environments) determining when a rule should apply; (3) and by the use of |
9 |
character classes, optional and repeatable elements, grouping and alternation |
10 |
to express more complex patterns to be matched and processed. |
11 |
|
12 |
TECkit is particularly useful with XeTeX (Unicode-aware derivate of TeX). |
13 |
|
14 |
The following binaries are provided: |
15 |
|
16 |
teckit_compile mapping compiler that allows binary mapping tables (.tec) |
17 |
to be built from TECkit description files (.map) |
18 |
sfconv a tool for converting Standard Format (SF) files |
19 |
txtconv a utility to apply TECkit mappings to plain-text files |
20 |
|
21 |
WWW: http://scripts.sil.org/TECkit |
22 |
http://scripts.sil.org/TECkitDownloads#5b6cf869 |