Finding non ascii characters in a set of files

John Machin sjmachin at lexicon.net
Fri Feb 23 10:35:40 EST 2007


On Feb 24, 2:12 am, "Peter Bengtsson" <pete... at gmail.com> wrote:
> On Feb 23, 2:38 pm, b... at yahoo.com wrote:
>
> > Hi,
>
> > I'm updating my program to Python 2.5, but I keep running into
> > encoding problems. I have no encodings defined at the start of any of
> > my scripts. What I'd like to do is scan a directory and list all the
> > files in it that contain a non ascii character. How would I go about
> > doing this?
>
> How about something like this:
> content = open('file.py', 'rb').read()  # read raw bytes
> try:
>     # decode, not encode: fails on any byte >= 0x80
>     content.decode('ascii')
> except UnicodeDecodeError:
>     print "file.py contains non-ascii characters"
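
The original question asked for a whole directory rather than one file, so here is a rough sketch of how that scan might look. It is only an illustration, not something from the posts above: the names find_non_ascii and scan_directory are made up, it is written for Python 3 (files are read as bytes), and limiting the scan to names ending in '.py' is an assumption about what counts as a script.

```python
import os

def find_non_ascii(path):
    """Return (line_number, byte_offset) of the first non-ASCII byte, or None.

    line_number is 1-based; byte_offset is 0-based within that line.
    """
    with open(path, 'rb') as f:          # binary mode: no decoding happens
        for lineno, line in enumerate(f, 1):
            for offset, byte in enumerate(bytearray(line)):
                if byte > 127:           # outside the 7-bit ASCII range
                    return lineno, offset
    return None

def scan_directory(dirname, suffix='.py'):
    """List (filename, line_number, byte_offset) for files with non-ASCII bytes.

    Only regular files directly inside dirname whose names end with
    suffix are checked; subdirectories are not walked.
    """
    hits = []
    for name in sorted(os.listdir(dirname)):
        path = os.path.join(dirname, name)
        if name.endswith(suffix) and os.path.isfile(path):
            where = find_non_ascii(path)
            if where is not None:
                hits.append((name, where[0], where[1]))
    return hits
```

Calling scan_directory('.') would return one tuple per offending file. Reporting the line number as well as the file name should make it easier to decide which encoding declaration each script needs.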




