char 128? no... 256

Roman Suzi rnd at onego.ru
Wed Feb 12 15:52:44 EST 2003


On Wed, 12 Feb 2003, Afanasiy wrote:

>>That is why your Windows doesn't use latin-1.
>>(Hmmm... I thought cp1250 is latin1.)
>
>Also, should this be reported as a bug?

No. latin-1 (aka ISO-8859-1) is different from cp1250.

I've made diff between them (according to GNU recode):

$ diff <(recode -l latin1) <(recode -l cp1250)
< ISO-8859-1                          
---
> CP1250
20,35c20,35
< 128 PA   144 DC   160 NS   176 DG   192 A!   208 D-   224 a!   240 d-
< 129 HO   145 P1   161 !I   177 +-   193 A'   209 N?   225 a'   241 n?
< 130 BH   146 P2   162 Ct   178 2S   194 A>   210 O!   226 a>   242 o!
< 131 NH   147 TS   163 Pd   179 3S   195 A?   211 O'   227 a?   243 o'
< 132 IN   148 CC   164 Cu   180 ''   196 A:   212 O>   228 a:   244 o>
< 133 NL   149 MW   165 Ye   181 My   197 AA   213 O?   229 aa   245 o?
< 134 SA   150 SG   166 BB   182 PI   198 AE   214 O:   230 ae   246 o:
< 135 ES   151 EG   167 SE   183 .M   199 C,   215 *X   231 c,   247 -:
< 136 HS   152 SS   168 ':   184 ',   200 E!   216 O/   232 e!   248 o/
< 137 HJ   153 GC   169 Co   185 1S   201 E'   217 U!   233 e'   249 u!
< 138 VS   154 SC   170 -a   186 -o   202 E>   218 U'   234 e>   250 u'
< 139 PD   155 CI   171 <<   187 >>   203 E:   219 U>   235 e:   251 u>
< 140 PU   156 ST   172 NO   188 14   204 I!   220 U:   236 i!   252 u:
< 141 RI   157 OC   173 --   189 12   205 I'   221 Y'   237 i'   253 y'
< 142 S2   158 PM   174 Rg   190 34   206 I>   222 TH   238 i>   254 th
< 143 S3   159 AC   175 'm   191 ?I   207 I:   223 ss   239 i:   255 y:
---
> 128 Eu            160 NS   176 DG   192 R'   208 D/   224 r'   240 d/
>          145 '6   161 '<   177 +-   193 A'   209 N'   225 a'   241 n'
> 130 .9   146 '9   162 '(   178 ';   194 A>   210 N<   226 a>   242 n<
>          147 "6   163 L/   179 l/   195 A(   211 O'   227 a(   243 o'
> 132 :9   148 "9   164 Cu   180 ''   196 A:   212 O>   228 a:   244 o>
> 133 .3   149 sb   165 A;   181 My   197 L'   213 O"   229 l'   245 o"
> 134 /-   150 -N   166 BB   182 PI   198 C'   214 O:   230 c'   246 o:
> 135 /=   151 -M   167 SE   183 .M   199 C,   215 *X   231 c,   247 -:
>                   168 ':   184 ',   200 C<   216 R<   232 c<   248 r<
> 137 %0   153 TM   169 Co   185 a;   201 E'   217 U0   233 e'   249 u0
> 138 S<   154 s<   170 S,   186 s,   202 E;   218 U'   234 e;   250 u'
> 139 <1   155 >1   171 <<   187 >>   203 E:   219 U"   235 e:   251 u"
> 140 S'   156 s'   172 NO   188 L<   204 E<   220 U:   236 e<   252 u:
> 141 T<   157 t<   173 --   189 '"   205 I'   221 Y'   237 i'   253 y'
> 142 Z<   158 z<   174 Rg   190 l<   206 I>   222 T,   238 i>   254 t,
> 143 Z'   159 z'   175 Z.   191 z.   207 D<   223 ss   239 d<   255 '.

- there are some common chars but they are nonetheless different.

*

Thank you for bringing up the subject. Now I know that cp1250 != latin1.


Sincerely yours, Roman Suzi
-- 
rnd at onego.ru =\= My AI powered by Linux RedHat 7.3






More information about the Python-list mailing list