History log of /seL4-l4v-master/HOL4/src/portableML/UnicodeChars.sig
Revision Date Author Comments
# c80fc0a8 22-Nov-2018 Michael Norrish <Michael.Norrish@nicta.com.au>

More utility entry-points in modules to do with Unicode and UTF8


# ac7236c2 22-Aug-2017 Michael Norrish <Michael.Norrish@nicta.com.au>

Provide a Unicode ++ symbol for APPEND

Happy to revert this if people hate the look of it - this is mostly an
experiment.


# a26902b5 23-Aug-2016 Michael Norrish <Michael.Norrish@nicta.com.au>

Provide support for Unicode subscript 'r'

Thinking that this might be a reasonable annotation for relationScript's
set operators. For example, R1 RUNION R2 might be written R1 ∪ᵣ R2


# 32bf1e49 02-Feb-2015 Piotr Trojanek <piotr.trojanek@gmail.com>

extra semicolons removed from (some) .sig files


# d36e6395 25-Jul-2012 Michael Norrish <michael.norrish@nicta.com.au>

Add UnicodeChars entries for quotation marks.

These correspond to the HTML entities &{l,r}{s,d}quo; hence the names
chosen.


# fdf9e918 16-Feb-2012 Michael Norrish <Michael.Norrish@nicta.com.au>

Add some nice Unicode brackets to our list of useful characters.


# 5fc11730 02-Nov-2011 Michael Norrish <Michael.Norrish@nicta.com.au>

Clean up UnicodeChars implementation.

In particular:
* Use U function everywhere so that Unicode code-points are more
obvious. Instead of "\u00e2\u0088\u0080", U 0x2200 is much more
obviously a reference to U+2200 (forall). This also allows us to
switch away from the UTF-8 encoding more easily if necessary.
* Include more Greek letters.


# 6d87ca0c 19-Apr-2010 Michael Norrish <Michael.Norrish@nicta.com.au>

Fix handling of special 'U' constant, retaining better backwards compatibility.

In particular, there is now a special form 'univ' that behaves as 'U' did.
Further, with Unicode on, the character U+1D54C is used in the same way.
This character may not be in many fonts, so add another trace to turn its
use off. If you can see it, it's a nice 'double-struck' U. I would have
gone for a script-style U, but it leant over too much, and so overlapped
the following left-paren character.


# c33eab22 02-Mar-2010 Michael Norrish <Michael.Norrish@nicta.com.au>

Fix value of UnicodeChars.sup_1 and add more superscript bindings.


# 594607ed 05-Feb-2010 Thomas Tuerk <Thomas.Tuerk@cl.cam.ac.uk>

some more unicode characters added, mainly arrows and stars


# f8646a44 23-Dec-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

Modify the lexer to handle Unicode tokens properly. For example, you
can type the equivalent of \forall\tau_1 and it will split this blob
of three Unicode characters (and some larger number of bytes) into two
tokens (\forall followed by \tau_1). The pretty-printer also knows
about token categories.


# d9003833 26-Nov-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

Fairly dramatic change, supporting the use of "syntactic patterns".
It is now possible to overload a name to term (typically a
lambda-abstraction) in order to create syntax-only definitions. For
example, there is a definition of not-equals now in the system:
val _ = overload_on ("<>", ``\x y. ~(x = y)``)
val _ = set_fixity "<>" (Infix(NONASSOC, 450))
This definition is parsed and pretty-printed.

Two other annoying changes. The interface for adding unicode variants
has changed, and I have made precedence level 450 of the parser
non-associative. This latter change has caused most of the (minor)
adjustments to the files below. The regression test doesn't go
through yet. (Currently there is a failure in
quotient/examples/sigma that I don't understand.)


# a1d77c67 06-Nov-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

Add some more Unicode characters to the "standard list" in
UnicodeChars. In particular, add superscript minus, and superscript
1, thinking of setting up a concatenation of the two as an inversion
suffix operator at some later date.


# 8aca6036 09-Sep-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

More time-wasting, err, addition of arrow characters to UnicodeChars.


# 0123e623 08-Sep-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

Add some symbols for the numeric types.


# 20a25cb8 08-Sep-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

Use superscript plus to make transitive closure look nice in "Unicode mode".


# 94821ca4 03-Sep-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

Add some simplistic code for decoding and encoding UTF8 strings. Also
add even more naive "character" categorisation functions to
UnicodeChars.


# 205f7240 01-Sep-2008 Michael Norrish <Michael.Norrish@nicta.com.au>

Move the structure defining the "useful" Unicode characters into
portableML, but keep Unicode.UChar as a reference to the same
information. (Gotta love the SML module system.)