[Python] Usare Unicode e charset

Marco Mariani marco.mariani a prometeia.it
Gio 3 Dic 2009 12:45:35 CET


Manlio Perillo wrote:

> Uff, questo 6 bits ora da dove è uscito? ...
>   

Ho controllato. Con UTF-8 il massimo e' 4 bytes

Da wikipedia:

The original specification allowed for sequences of up to six bytes 
covering numbers up to 31 bits (the original limit of the Universal 
Character Set <http://en.wikipedia.org/wiki/Universal_Character_Set>). 
However, UTF-8 was restricted by RFC 3629 
<http://tools.ietf.org/html/rfc3629> to use only the area covered by the 
formal Unicode definition, U+|0000| to U+|10FFFF|, in November 2003.

-- 
This e-mail (and any attachment(s)) is strictly confidential and for use only by intended recipient(s). Any use, distribution, reproduction or disclosure by any other person is strictly prohibited. The content of this e-mail does not constitute a commitment by the Company except where provided for in a written agreement between this e-mail addressee and the Company. If you are not an intended recipient(s), please notify the sender promptly and destroy this message and its attachments without reading or saving it in any manner. Any non authorized use of the content of this message constitutes a violation of the obligation to abstain from learning of the correspondence among other subjects, except for more serious offence, and exposes the person responsible to the relevant consequences.



Maggiori informazioni sulla lista Python