#
5d0e1b53 |
|
18-Jan-2016 |
Michael Norrish <Michael.Norrish@nicta.com.au> |
substring function for UTF8 strings
|
#
7e7280ec |
|
01-Jul-2015 |
Michael Norrish <Michael.Norrish@nicta.com.au> |
Provide a UTF8-aware padRight function This can then be used in the testing routines that want pretty output that might include non-ASCII characters.
|
#
893569d9 |
|
22-Mar-2015 |
Michael Norrish <Michael.Norrish@nicta.com.au> |
UTF8.getChar was too liberal. In particular, it allowed UTF8 encodings that used more bytes than they should. Fixed this and added a bunch of test-cases. Also, stopped getChar from decoding past the maximum allowed code-point. Thanks to Rob Arthan for the bug report. Closes #237
|
#
88236957 |
|
19-Apr-2010 |
Michael Norrish <Michael.Norrish@nicta.com.au> |
Get UTF8 conversion to handle codes above the 16 bit 'BMP'.
|
#
95103b87 |
|
20-Mar-2010 |
Michael Norrish <Michael.Norrish@nicta.com.au> |
Add translate function (analogous to String.translate) to UTF8 module.
|
#
7154aa49 |
|
31-Jul-2009 |
Michael Norrish <Michael.Norrish@nicta.com.au> |
Start to implement the Basis 97 extensions that Moscow ML hasn't got in order to force our codebase to get up-to-date. It should also mean less bodging around for the Poly/ML code. I haven't checked that my changes to tools-poly/poly/poly-init2.ML have done all that is required yet. Feel free to fix problems arising there (I hope it will just be a matter of deleting things).
|
#
94821ca4 |
|
03-Sep-2008 |
Michael Norrish <Michael.Norrish@nicta.com.au> |
Add some simplistic code for decoding and encoding UTF8 strings. Also add even more naive "character" categorisation functions to UnicodeChars.
|