Python script that does batch find and replace in txt files
Syed Khalid
khalidness at gmail.com
Sun Nov 9 15:52:35 EST 2014
Hi Albert,
Thank you for script.
I am getting the below error :
File "EamClean.log", line 12
with codecs.open(txt + "_out.txt", "wb", encoding="utf-8") as w:
^
SyntaxError: invalid syntax
Kindly do the needful.
On Mon, Nov 10, 2014 at 1:53 AM, Albert-Jan Roskam <fomcl at yahoo.com> wrote:
>
>
>
>
> ----- Original Message -----
> > From: Syed Khalid <khalidness at gmail.com>
> > To: python-list at python.org
> > Cc:
> > Sent: Sunday, November 9, 2014 8:58 PM
> > Subject: Python script that does batch find and replace in txt files
> >
> > Python script that does batch find and replace in txt files Need a
> python script
> > that opens all .txt files in a folder find replace/delete text and save
> files.
> >
> > I have text files and I need to perform below steps for each file.
> >
> > Step 1: Put cursor at start of file and Search for "Contact's
> > Name:". Delete all the rows before it.
> > Step 2: Put cursor at end of file, Search for "Contact's Name:"
> > select option UP.
> > Step 3: Search for "Photo of the" Replace with blanks
> > Step 4: Search for "Contact is" Replace with blanks
> > Step 5: Search for "Contact's Name:" Replace with blanks
> > Step 6: Search for "Age:" Replace with blanks
> > Step 7: Search for "Sex:" Replace with blanks
> > Step 8: Search for "House No:" Replace with blanks
> > Step 9: Search for "available" Replace with blanks
> > Step 10: Remove Empty Lines Containing Blank Characters from file
> > Step 11: Trim Leading Space for each line
> > Step 12: Trim Trailing Space after each line
> > Step 13: Search for - (hyphen) Replace with _ (underscore)
>
> > Step 14: Save file.
>
> something like (untested)
>
>
> import glob, codecs, re, os
>
> regex = re.compile(r"Age: |Sex: |House No: ") # etc etc
>
> for txt in glob.glob("/some/path/*.txt"):
> with codecs.open(txt, encoding="utf-8") as f:
> oldlines = f.readlines()
> for i, line in enumerate(oldlines):
> if "Contact's Name: " in line:
> break
> newlines = [regex.sub("", line).strip().replace("-", "_") for line in
> oldlines[i:]
> with codecs.open(txt + "_out.txt", "wb", encoding="utf-8") as w:
> w.write(os.linesep.join(newlines))
>
>
>
>
> >
> > Currently I have recorded a macro in Notepad++.
> > I open each file, run macro and save file.
> > As there are many files I was looking for a program to automate the
> process.
> >
> > I posted the same query in Notepad++ forum. I got a reply that it can be
> done by
> > using Python script.
> >
> > Kindly do the needful.
> >
> > Thank you.
> > khalidness
> >
> > --
> > https://mail.python.org/mailman/listinfo/python-list
> >
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-list/attachments/20141110/073e6b8c/attachment.html>
More information about the Python-list
mailing list