
Re: [Scheme-reports] digit-value

To: Pierpaolo Bernardi <olopierpa@x>
Subject: Re: [Scheme-reports] digit-value
From: John Cowan <cowan@x>
Date: Tue, 3 Jul 2012 10:44:50 -0400
Cc: scheme-reports@x

Pierpaolo Bernardi scripsit:

> string->number and read accepting as numbers all characters satisfying
> the current char-numeric? would be crazy.
> 
> Think roman numerals, just to take a simple example we all know about.
> Assuming the I are the right unicode character for the roman numeral 1:

The Roman-numeral characters (which are only present for backward
compatibility with certain other character sets) aren't decimal digits,
and therefore R7RS `char-numeric?` returns #f on them.  So things aren't
as bad as that: all the decimal digits are used in exactly the same way,
only the shapes differ from European digits.
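
A minimal sketch of that distinction, written in Python rather than
Scheme only because its standard `unicodedata` module exposes the
relevant Unicode properties directly. The helper name `digit_value` is
hypothetical, and testing for general category Nd is an assumption about
how an implementation might decide what counts as a decimal digit; it is
not taken from the R7RS text.

```python
import unicodedata

def digit_value(ch):
    """Return the value 0-9 of a Unicode decimal digit, or None otherwise.

    Hypothetical helper: it approximates the behaviour discussed in this
    thread by testing for general category Nd (decimal digit).
    """
    if unicodedata.category(ch) == "Nd":
        return unicodedata.decimal(ch)
    return None

# ASCII 7, ARABIC-INDIC DIGIT FOUR, THAI DIGIT FIVE,
# FULLWIDTH DIGIT SEVEN, ROMAN NUMERAL ONE
samples = ["7", "\u0664", "\u0E55", "\uFF17", "\u2160"]
for ch in samples:
    print(unicodedata.name(ch), unicodedata.category(ch), digit_value(ch))
```

The first four samples are in category Nd and yield a digit value; the
Roman numeral is in category Nl (letter number), so the helper returns
None for it even though Unicode does assign it a numeric value.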

Here is the complete list of scripts that have their own digits (sometimes
only in older texts; modern texts use the European style):

Arabic (two flavors, one for Arabic language, one for Persian and Urdu
languages), N'ko, Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil,
Telugu, Kannada, Malayalam, Thai, Lao, Tibetan, Myanmar (two flavors,
one for Myanmar language, one for Shan language), Khmer, Mongolian, Limbu,
New Tai Lue, Tai Tham (two flavors, secular and ecclesiastical), Balinese,
Sundanese, Lepcha, Ol Chiki, Vai, Saurashtra, Kayah Li, Javanese, Cham,
Meetei Mayek, Osmanya, Brahmi, Sora Sompeng, Chakma, Sharada, Takri.

In addition, there is a set of fullwidth European digits (same shape,
but the size of Chinese characters), and the following sets of specially
fonted digits for mathematical use only:  math bold, math double-struck,
sans-serif, sans-serif bold, monospace.
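
A list like the one above can be regenerated, at least roughly, by
scanning the Unicode character database for every decimal digit whose
value is zero: each hit marks the start of one set of digits. The sketch
below is in Python, and `digit_zeros` is a hypothetical name. Its output
depends on the Unicode version bundled with the interpreter, so a current
Python will report more sets than the 2012 list does.

```python
import sys
import unicodedata

def digit_zeros():
    """Code points of every category-Nd character whose decimal value is 0."""
    return [cp for cp in range(sys.maxunicode + 1)
            if unicodedata.category(chr(cp)) == "Nd"
            and unicodedata.decimal(chr(cp)) == 0]

zeros = digit_zeros()
for cp in zeros:
    # One line per digit set, e.g. the code point and name of its zero.
    print(f"U+{cp:04X} {unicodedata.name(chr(cp))}")
```

The fullwidth and mathematical sets show up in the same scan as the
script-specific ones, since they are all in category Nd.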


-- 
I am expressing my opinion.  When my            John Cowan
honorable and gallant friend is called,         cowan@x
he will express his opinion.  This is           http://www.ccil.org/~cowan
the process which we call Debate.                   --Winston Churchill

