|
Format of Elementary Data in XML Files | XML Support | Suppressing Identical Values before Repeating Elements |
The XML parser can tell whether the file contains one, two or four byte characters when importing data from XML to a database. The characters that do not belong to ISO-646 (ASCII) are decoded according to the encoding specification in the declaration. If you set the correct coding in the declaration, all charsets that are supported by the SAX parser will be supported during import. You may reliably transfer multi-language data in UTF-8 and UTF-16 coding.
The prolog specified in the DAD global parameters will be written to the XML file when exported from a database to XML. The encoding entry is searched in the prolog. If found, the file will be coded according to this setting. If the encoding entry is not found, UTF-8 coding will be used (according to the XML standard).
UTF-8 and UTF-16 may always be used for output. Other code pages may be used if they are installed in the operating system. Currently the ISO-8859-1, ISO-8859-2, windows-1252, windows-1250 and ibm852 code pages are supported.
The UCS-2 coding used in the 602SQL Server is not supported in XML. Use UTF-16 instead. UTF-16 is a superset of UCS-2 and differs only in some Asian languages.
The DAD is always coded in UTF-8.
Format of Elementary Data in XML Files | XML Support | Suppressing Identical Values before Repeating Elements |