[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Scheme-reports] DISCUSSION/VOTE: The character tower
- To: Per Bothner <per@x>
- Subject: Re: [Scheme-reports] DISCUSSION/VOTE: The character tower
- From: Jussi Piitulainen <jpiitula@x>
- Date: 08 May 2014 08:31:38 +0300
- Cc: scheme-reports@x
- In-reply-to: <536ADE44.6090509@bothner.com>
- References: <20140506005154.GF5011@mercury.ccil.org> <1399355074.9426.107.camel@excessive.dsl.static.sonic.net> <20140506062205.GI5011@mercury.ccil.org> <1399397365.9426.168.camel@excessive.dsl.static.sonic.net> <20140506184520.GR5011@mercury.ccil.org> <1399409125.19292.31.camel@excessive.dsl.static.sonic.net> <20140506220452.GY5011@mercury.ccil.org> <1399415476.21129.6.camel@excessive.dsl.static.sonic.net> <CAGUt3y7c5xWL2oTu2_ipfBQYJ0j1RincatF3a0OEh3YgVMV1tQ@mail.gmail.com> <1399494419.29733.35.camel@excessive.dsl.static.sonic.net> <536ADE44.6090509@bothner.com>
Per Bothner writes:
> On 05/07/2014 01:26 PM, Bear wrote:
> > (string #\A #\x301) and (string #\xc1) are both expressions returning
> > a string. Unicode canonical equivalence says they should return the
> > same string.
>
> In which programming languages, if any, do these expression (or similar
> expressions in appropriate syntax) return the same string?
> I don't know of any.
They don't in Python. I tried in 3.2.3 (in Ubuntu) and 3.4.0 (in Red
Hat).
>>> 'A' + '\u0301', len('A' + '\u0301')
('Á', 2)
>>> 'Á', len('Á')
('Á', 1)
Both print as a single character but the glyphs look slightly
different to me. The string lengths differ as shown.
_______________________________________________
Scheme-reports mailing list
Scheme-reports@x
http://lists.scheme-reports.org/cgi-bin/mailman/listinfo/scheme-reports