unicode .replace not working - why?
Kurt Peters
nospampeterskurt at msn.com
Sun Oct 12 21:22:46 EDT 2008
Thanks...
On a side note, do you really think the function call wouldn't interpret
the unichr before the function call?
Kurt
"Peter Otten" <__peter__ at web.de> wrote in message
news:gcsaqq$5bf$01$1 at news.t-online.com...
> Kurt Peters wrote:
>
>> I had done that about 21 revisions ago.
>
> If you litter your module with code that is commented out it is hard to
> keep
> track of what works and what doesn't.
>
>> Nevertheless, why would you think
>> that would work, when the code as shown doesn't?
>
> Because he knows Python? Why don't /you/ try it before asking that
> question?
>
> A good place to do "exploratory" programming is Python's interactive
> interpreter. Here's a sample session:
>
> Python 2.5.1 (r251:54863, Jul 31 2008, 23:17:43)
> [GCC 4.1.3 20070929 (prerelease) (Ubuntu 4.1.2-16ubuntu2)] on linux2
> Type "help", "copyright", "credits" or "license" for more information.
>>>> from pyPdf import PdfFileReader as PFR
>>>> doc = PFR(open("SUA.pdf"))
>>>> text = doc.getPage(3).extractText()
>>>> type(text)
> <type 'unicode'>
>>>> text[:200]
> u'2/16/08 7400.8P Table of Contents - Continued Section
> Page
> \ xa773.49 New Hampshire (NH) 50
> \xa773.50 New Jersey (NJ) 50 \xa773.51 New Mex
> ico (NM) 51 \xa773.52 New York (NY) 56 \xa773.53 North '
>>>> print text[:200].replace(u"\xa7", u"\n")
> 2/16/08 7400.8P Table of Contents - Continued Section Page
> 73.49 New Hampshire (NH) 50
> 73.50 New Jersey (NJ) 50
> 73.51 New Mexico (NM) 51
> 73.52 New York (NY) 56
> 73.53 North
>
> Peter
More information about the Python-list
mailing list