encoding

Dotan Cohen dotancohen at gmail.com
Tue Nov 25 12:34:24 EST 2008


2008/11/25 Benjamin Kaplan <benjamin.kaplan at case.edu>:
> On Tue, Nov 25, 2008 at 11:53 AM, luca72 <lucaberto at libero.it> wrote:
>>
>> Hello, I'm writing an IRC client. I use UTF-8 for encoding, but I
>> never see all the typed characters displayed correctly. Does anyone
>> know the standard encoding for IRC channels?
>>
>> Regards
>>
>> Luca
>
> This is something where Google is very helpful. I haven't dealt with this
> before, but a look at the definition of the protocol says there isn't any
> standard encoding. As a matter of fact, the RFC specifically says there is
> no standard character set[1]. The only thing it specifies is that IRC uses
> an 8-bit protocol. Since UTF-8 doesn't work for you, I'd try ISO-8859 and
> CP1252 and see if those work better.
>
> [1] http://www.faqs.org/rfcs/rfc1459.html (section 2.2)
>

I would suggest that you use UTF-8 anyway. You will save your users a
lot of headaches. While I am a big standards nut, this is one case
where you should deviate.
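
For what it's worth, here is a rough sketch of how a client might combine
both suggestions: always send UTF-8, and on receive try UTF-8 first, then
fall back to a legacy code page. The function names and the choice of
cp1252 as the fallback are my own assumptions for illustration, not
anything the RFC or an existing IRC library prescribes.

```python
def decode_irc_line(raw: bytes, fallbacks=("cp1252",)) -> str:
    """Decode one raw IRC line, preferring UTF-8 (hypothetical helper)."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        pass
    # Assumed fallback order; adjust for the networks you actually use.
    for name in fallbacks:
        try:
            return raw.decode(name)
        except UnicodeDecodeError:
            continue
    # latin-1 maps every byte to a code point, so this last resort never fails.
    return raw.decode("latin-1")


def encode_irc_line(text: str) -> bytes:
    """Encode an outgoing line as UTF-8 plus the CR-LF terminator."""
    return text.encode("utf-8") + b"\r\n"
```

Text from other UTF-8 clients then comes through intact, and bytes that
are not valid UTF-8 are most likely from a legacy client, so they are
shown as their closest guess instead of raising an error.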

-- 
Dotan Cohen

http://what-is-what.com
http://gibberish.co.il

א-ב-ג-ד-ה-ו-ז-ח-ט-י-ך-כ-ל-ם-מ-ן-נ-ס-ע-ף-פ-ץ-צ-ק-ר-ש-ת
ا-ب-ت-ث-ج-ح-خ-د-ذ-ر-ز-س-ش-ص-ض-ط-ظ-ع-غ-ف-ق-ك-ل-م-ن-ه‍-و-ي
А-Б-В-Г-Д-Е-Ё-Ж-З-И-Й-К-Л-М-Н-О-П-Р-С-Т-У-Ф-Х-Ц-Ч-Ш-Щ-Ъ-Ы-Ь-Э-Ю-Я
а-б-в-г-д-е-ё-ж-з-и-й-к-л-м-н-о-п-р-с-т-у-ф-х-ц-ч-ш-щ-ъ-ы-ь-э-ю-я
ä-ö-ü-ß-Ä-Ö-Ü

