[XML-SIG] how to get xml parse with non ascii charset

pita jyllyj at gcom-cn.com
Mon Jun 30 13:49:24 EDT 2003


in winxp/python23/lastest cjkcodecs
i found some problem in xml parse
from xml.dom import minidom
minidom.parse(file) can't undenstand gb2312
i post sample'code that you can test it


begin 666 xmlparse_gb2312.py
M9G)O;2!X;6PN9&]M(&EM<&]R="!M:6YI9&]M#0H-"G-T<E]A<V-I:3TB(B(\
M/WAM;"!V97)S:6]N/2(Q+C B(&5N8V]D:6YG/2)A<V-I:2(_/@T*/&%S8VEI
M7W)O;W0^#0H)/&-H:6QD,3X-"@D)=&AI<R!I<R!A(&-H:6QD<F5N+ at T*"3PO
M8VAI;&0Q/@T*"3QC:&EL9#(^#0H)"71H:7,@:7,@;W1H97(@8VAI;&1R96XN
M#0H)/"]C:&EL9#(^#0H\+V%S8VEI7W)O;W0^#0HB(B(-"G!R:6YT("=P87)S
M92!S='(@=VET:"!E;F-O9&EN9R!A<V-I:2<-"F1O;3UM:6YI9&]M+G!A<G-E
M4W1R:6YG*'-T<E]A<V-I:2D-"G!R:6YT(&1O;2YC:&EL9$YO9&5S#0IP<FEN
M=" G+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM
M+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2<-"@T*<W1R7VQA=&EN7S$](B(B/#]X
M;6P@=F5R<VEO;CTB,2XP(B!E;F-O9&EN9STB;&%T:6Y?,2(_/@T*/&QA=&EN
M7S%?<F]O=#X-"@D\8VAI;&0Q/@T*"0ET:&ES(&ES(&$@8VAI;&1R96XN#0H)
M/"]C:&EL9#$^#0H)/&-H:6QD,CX-"@D)=&AI<R!I<R!O=&AE<B!C:&EL9')E
M;BX-"@D\+V-H:6QD,CX-"CPO;&%T:6Y?,5]R;V]T/@T*(B(B#0H-"@T*#0IP
M<FEN=" G<&%R<V4@<W1R('=I=&@@96YC;V1I;F<@;&%T:6Y?,2<-"F1O;3UM
M:6YI9&]M+G!A<G-E4W1R:6YG*'-T<E]L871I;E\Q*0T*<')I;G0 at 9&]M+F-H
M:6QD3F]D97,-"G!R:6YT("<M+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM
M+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM)PT*#0IS=')?
M9V(R,S$R/2(B(CP_>&UL('9E<G-I;VX](C$N,"(@96YC;V1I;F<](F=B,C,Q
M,B(_/@T*/&=B,C,Q,E]R;V]T/@T*"3QC:&EL9#$^#0H)"71H:7,@:7, at 82!C
M:&EL9')E;BX-"@D\+V-H:6QD,3X-"@D\8VAI;&0R/@T*"0ET:&ES(&ES(&]T
M:&5R(&-H:6QD<F5N+ at T*"3PO8VAI;&0R/@T*/"]G8C(S,3)?<F]O=#X-"B(B
M(@T*#0H-"@T*<')I;G0@)W!A<G-E('-T<B!W:71H(&5N8V]D:6YG(&=B,C,Q
M,B<-"F1O;3UM:6YI9&]M+G!A<G-E4W1R:6YG*'-T<E]G8C(S,3(I#0IP<FEN
M="!D;VTN8VAI;&1.;V1E<PT*<')I;G0@)RTM+2TM+2TM+2TM+2TM+2TM+2TM
M+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TM+2TG
3#0ID;VTN=6YL:6YK*"D-"@D)"0``
`
end






More information about the XML-SIG mailing list