Choosing a Conversion Type

By default, SAP ASE uses direct conversions to convert data between different character sets.

Set the enable unicode conversions option to either 1 or 2 to use the Unicode conversions:

If all character sets fall into column 2 in the table below, set enable unicode conversions to 2 to always use Unicode conversions.

For SAP ASE versions 15.0 and later, the default value for enable unicode conversions is 1.

If the server default is UTF-8, the server automatically uses Unicode conversions only.

Non-Unicode Client/Server Systems

In a non-Unicode system, the character sets of the server and clients are native character sets; therefore, you can use the SAP ASE direct conversions.

However, there are some character sets for which there is no SAP ASE direct conversion; in this situation, you must use Unicode conversions.
  • If all character sets in your client/server system are column 1 in the table below, use the SAP ASE direct conversions. The character sets must all belong to the same language group.

  • If the character sets in your client/server system are in column 2 in the table below, or some combination of columns 1 and 2, configure your server to use Unicode conversions. Again, the character sets must all belong to the same language group.

For example, assume the server default character set is CP 850 and the clients’ character sets are either ISO 8859-1 or ROMAN 8. The table below shows that direct conversions exist between CP 850 and the client character sets. Now, suppose you add a client using CP 1252. Since there is no direct conversion between CP 1252 and CP 850, (the default server character set), you must use Unicode conversions to convert between CP 1252 and CP 850. When you have a mixture of character sets—some where you can use SAP ASE direct conversions and others where you must use Unicode conversions—you can specify that a combination of SAP ASE direct conversion and Unicode conversion be used.

Unicode Client/Server Systems

If your server default is Unicode UTF-8, then all conversions are between UTF-8 and the native character set being used on the client systems. In a Unicode system, Unicode conversions are used exclusively.

Conversion methods for character sets

Language group

Column 1 – SAP ASE direct conversions and Unicode conversions

Column 2 – Unicode conversions only

Group 1

CP 437, CP 850, ISO 8859-1, Macintosh Roman

CP 860, CP 1252, ISO 8859-15, CP 863

Group 2

CP 852, CP 1250, CP 8859-1, Macintosh Central European

ISO 8859-2

Group 4

No conversions needed (only one character set supported)

Group 5

CP 855, CP 866, CP 1251, ISO 8859-5, Koi8, Macintosh Cyrillic

Group 6

CP 864, CP 1256, ISO 8859-6

Group 7

CP 869, CP 1253, GREEK8, ISO 8859-7, Macintosh Greek

Group 8

CP 1255, ISO 8859-8

Group 9

CP 857, CP 1254, ISO 8859-9, Macintosh Turkish, TURKISH8

Group 101

DEC Kanjii, EUC-JIS, Shift-JIS

CP 932

Group 102

CP 936, EUG-GB, GB18303

Group 103

Big 5, CP 950, EUC-CNS

Group 104

EUCKSC, CP 949

Group 105

CP 874, TIS 620

Group 106

No conversions needed (only one character set supported)

Unicode

No conversions needed (only one character set supported)