[Tutor] man pages parsing (still)

Tiago Saboga tiagosaboga at terra.com.br
Tue Sep 12 22:59:52 CEST 2006


On Monday 11 September 2006 19:45, Kent Johnson wrote:
> Tiago Saboga wrote:
> > Ok, the guilty line (279) has a "&copy;" that was probably defined in the
> > dtd, but as it doesn't know what is the right dtd... But wait... How does
> > python read the dtd? It fetches it from the net? I tried it
> > (disconnected) and the answer is yes, it fetches it from the net. So
> > that's the problem!
> >
> > But how do I avoid it? I'll search. But if you can spare me some time,
> > you'll make me a little happier.
> >
> > [1] - The line is as follows:
> > <!DOCTYPE refentry PUBLIC "-//OASIS//DTD DocBook XML V4.1.2//EN"
> >     "http://www.oasis-open.org/docbook/xml/4.1.2/docbookx.dtd">
>
> I'm just guessing, but I think if you find the right combination of
> handlers and feature settings you can at least make it just pass through
> the external entities without looking up the DTDs.

I got it! I just set feature_external_ges to false on the parser, and it no
longer fetches the DTD. Thanks!!! ;-)
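
For the archives, here is a minimal sketch of that setting, assuming the
standard xml.sax module with its default expat-based reader. The TextCollector
class and the parse_without_external_entities function are names made up for
this illustration, not part of my actual script. Recent Python versions
reportedly leave this feature off by default, so the explicit call may only
matter on older ones.

```python
import io
import xml.sax
from xml.sax.handler import ContentHandler, feature_external_ges


class TextCollector(ContentHandler):
    """Hypothetical handler: gathers character data and skipped entity names."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self.skipped = []

    def characters(self, content):
        self.chunks.append(content)

    def skippedEntity(self, name):
        # Called for entities the parser could not expand because the
        # DTD that declares them was not read.
        self.skipped.append(name)


def parse_without_external_entities(xml_bytes):
    """Parse XML bytes without resolving external general entities."""
    parser = xml.sax.make_parser()
    # The setting from this thread: do not load external entities,
    # which with the expat reader also stops the DTD from being fetched.
    parser.setFeature(feature_external_ges, False)
    handler = TextCollector()
    parser.setContentHandler(handler)
    parser.parse(io.BytesIO(xml_bytes))
    return "".join(handler.chunks), handler.skipped
```

With the DTD left unread, I would expect a reference such as &copy; to be
reported through skippedEntity instead of being expanded, but I have only
checked that on my own files.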

>
> Take a look at these pages for some hints:
> http://www.cafeconleche.org/books/xmljava/chapters/ch07s02.html#d0e10350
> http://www.cafeconleche.org/books/xmljava/chapters/ch06s11.html

It looks very interesting, and it was exactly what I needed. But I couldn't
grasp it at first; I need some more time to understand it all.

Thanks again!!!

Tiago.

