Characters are objects that represent printed characters such as letters and digits. In LispKit, characters are UTF-16 code units.
Character literals are written using the notation
#\xhex-scalar-value. Characters written using this
#\notation are self-evaluating, i.e. they do not have to be quoted.
The following standard character names are supported by LispKit:
#\newline: the linefeed character U+000A
#\null: the null character U+0000
#\return: the return character U+000D
#\space: the space character U+0020
#\tab: the tab character U+0009
Here are some examples using the
#\m: lowercase letter 'm'
#\M: uppercase letter 'M'
#\(: left parenthesis ')'
#\\: backslash '\'
#\: space character ' '
#\x03BB: the lambda character 'λ' (equivalent to
Case is significant in
#\character, and in
#\character-name, but not in
#\xhex-scalar-value. If character in
#\character is alphabetic, then any character immediately following character cannot be one that can appear in an identifier. This rule resolves the ambiguous case where, for example, the sequence of characters
#\spacecould be taken to be either a representation of the space character or a representation of the character
#\sfollowed by a representation of the symbol
Some of the procedures that operate on characters ignore the difference between upper case and lower case. The procedures that ignore case have "-ci" (for "case insensitive") embedded in their names.
#tif obj is a character, otherwise returns
(char=? char ...)
(char<? char ...) (char>? char ...) (char<=? char ...) (char>=? char ...)
These procedures return
#tif the results of passing their arguments to
char->integerare respectively equal, monotonically increasing, monotonically decreasing, monotonically non-decreasing, or monotonically non-increasing. These predicates are transitive.
(char-ci=? char ...)
(char-ci<? char ...) (char-ci>? char ...) (char-ci<=? char ...) (char-ci>=? char ...)
These procedures are similar to
char=?etc., but they treat upper case and lower case letters as the same. For example,
(char-ci=? #\A #\a)returns
#t. Specifically, these procedures behave as if
char-foldcasewere applied to their arguments before they were compared.
#tif its argument is a alphabetic character, otherwise it returns
#f. Note that many Unicode characters are alphabetic but neither upper nor lower case.
#tif its argument is a numeric character, otherwise it returns
#tif its argument is a whitespace character, otherwise it returns
#tif its argument is an upper-case character, otherwise it returns
#tif its argument is a lower-case character, otherwise it returns
char-upcaseprocedure, given an argument that is the lowercase part of a Unicode casing pair, returns the uppercase member of the pair, provided that both characters are supported by LispKit. Note that language-sensitive casing pairs are not used. If the argument is not the lowercase member of such a pair, it is returned.
char-downcaseprocedure, given an argument that is the uppercase part of a Unicode casing pair, returns the lowercase member of the pair, provided that both characters are supported by LispKit. If the argument is not the uppercase member of such a pair, it is returned.
char-foldcaseprocedure applies the Unicode simple case-folding algorithm to its argument and returns the result. Note that language-sensitive folding is not used. If the argument is an uppercase letter, the result will be either a lowercase letter or the same as the argument if the lowercase letter does not exist or is not supported by LispKit.
digit-valuereturns the numeric value (0 to 9) of its argument if it is a numeric digit (that is, if
#fon any other character.
(digit-value #\3) ⇒ 3
(digit-value #\x0EA6) ⇒ #f
Given a Unicode character,
char->integerreturns an exact integer between 0 and #xD7FF or between #xE000 and #x10FFFF which is equal to the Unicode scalar value of that character. Given a non-Unicode character, it returns an exact integer greater than #x10FFFF.
Given an exact integer that is the value returned by a character when
char->integeris applied to it,
integer->charreturns that character.
(char-name char encoded?)
If character char has a corresponding named XML entity, then procedure
char-namereturns the name of this entity. Otherwise,
#f. If parameter encoded is set to
#t, then the name is returned including the full entity encoding.
(char-name #\<) ⇒ "LT"
(char-name #\&) ⇒ "AMP"
(char-name #\") ⇒ "quot"
(char-name #\a) ⇒ #f
(char-name #\> #t) ⇒ ">"