diff options
author | waker <wakeroid@gmail.com> | 2011-01-22 19:30:21 +0100 |
---|---|---|
committer | waker <wakeroid@gmail.com> | 2011-01-22 19:30:21 +0100 |
commit | 37fac17dab627cfbca21660e3232c9295bb490dd (patch) | |
tree | 6cd836ad0bbd1bbbc5c7dd2421314d503d96d002 /ConvertUTF/readme.txt | |
parent | 6f520312e87f7a485cd841997d4cfe4f1c3034c9 (diff) |
added utf16 converters, fixed few bugs in existing charset converters
Diffstat (limited to 'ConvertUTF/readme.txt')
-rw-r--r-- | ConvertUTF/readme.txt | 43 |
1 files changed, 43 insertions, 0 deletions
diff --git a/ConvertUTF/readme.txt b/ConvertUTF/readme.txt new file mode 100644 index 00000000..b9f17fb8 --- /dev/null +++ b/ConvertUTF/readme.txt @@ -0,0 +1,43 @@ + +The accompanying C source code file "ConvertUTF.c" and the associated header +file "ConvertUTF.h" provide for conversion between various transformation +formats of Unicode characters. The following conversions are supported: + + UTF-32 to UTF-16 + UTF-32 to UTF-8 + UTF-16 to UTF-32 + UTF-16 to UTF-8 + UTF-8 to UTF-16 + UTF-8 to UTF-32 + +In addition, there is a test harness which runs various tests. + +The files "CVTUTF7.C" and "CVTUTF7.H" are for archival and historical purposes +only. They have not been updated to Unicode 3.0 or later and should be +considered obsolescent. "CVTUTF7.C" contains two functions that can convert +between UCS2 (i.e., the BMP characters only) and UTF-7. Surrogates are +not supported, the code has not been tested, and should be considered +unsuitable for general purpose use. + +Please submit any bug reports about these programs here: + + http://www.unicode.org/unicode/reporting.html + +Version 1.0: initial version. + +Version 1.1: corrected some minor problems; added stricter checks. + +Version 1.2: corrected switch statements associated with "extraBytesToRead" + in 4 & 5 byte cases, in functions for conversion from UTF8. + Note: formally, the 4 & 5 byte cases are illegal in the latest + UTF8, but the table and this code has always catered for those, + cases since at one time they were legal. + +Version 1.3: Updated UTF-8 legality check; + updated to use UNI_MAX_LEGAL_UTF32 in UTF-32 conversions + Updated UTF-8 legality tests in harness.c + + +Last update: October 19, 2004 + + |