[Tutor] Considering translating bash script to Python to learn

Dotan Cohen dotancohen at gmail.com
Tue May 12 21:57:21 CEST 2009


> Thats rarely a good approach. While you can replace bash with Python you
> will just wind up calling a bunch of external programs and thats what shell
> scripts are best at.

That is why is has been a bash script until now.

> The only time its worthwhile is where the bash code is
> structurally complex with lots of loops and conditionals and local state.
>

That's where it is going, I am starting to have date-dependant
functions and such. The bash is getting messy.

>> rearranges, then sends the whole thing to the printer. However, I seem
>> to depend on many bash-specific functions (os programs) and I cannot
>> find Python equivalents. Here are some lines of the script, and the
>> relevant questions:
>
> The key is what you say here. The core code depends on os programs (not bash
> functions!). Python can probably replace some of those os programs but why
> bother unless you need to retire them or are paying money for them?

Because the direction of the script is getting complex. But I still
have my reservations.

>> gnome-web-print --mode=print --files $FILE /tmp/weeklyCalendar.pdf
>> Here, I am converting an HTML file to PDF. I rely on the external
>> program gnome-web-print to do the conversion. Does Python have a
>> native way to handle this, or would I just wind up calling
>> gnome-web-print from within python?
>
> You could write a program to do this in python but it would be bigger than
> your entire bash script replacement.
>

I see!

>> pdftops -f 2 -l 2 $HOME/.bin/todo/todo.odf.pdf /tmp/weeklyTodo.ps
>> Here I am exporting the second page of a PDF file as a PS file.
>> Actually, the file in question is a hybrid ODF-PDF file and if Python
>> has a way of exporting a specific page of a ODF file that is fine too.
>> I could export to PDF as well, the important bit is too isolate a
>> single page of either an ODF or PDF file and have it as either a PS or
>> PDF.
>
> Same here although ReportLab (a non standard lib Python module) may have
> some helper functions for converting PDF to ps.
>

Thanks, I did see that mentioned today while googling.

>> gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite
>> -sOutputFile=/tmp/weeklyCalendarPrintMe.pdf
>> $HOME/.bin/todo/weeklyCalendar-blank.pdf /tmp/weeklyCalendar.pdf
>> /tmp/weeklyTodo.ps $HOME/.bin/todo/weeklyCalendar-blank.pdf
>> Here, I am combining several PDF and PS files as a single PDf file.
>> Can Python do this?
>
> Again you probably could since PDF and PS are both text like formats
> internally but it would be quite a lot of work.
>

The intention was to have a library function to use, not to code a
parser myself.

>> lpr -P printer -o number-up=2 /tmp/weeklyCalendarPrintMe.pdf
>> Here I am sending the final file to the printer, and having it print 2
>> pages on each page. Again, is this something that Python can handle
>> internally?
>
> This is so OS and local environment specific that Python usually just
> delegates this to lpr in my experience!
>

That is logical and what I expected to hear.

>> Thanks for the help. I know that Python is not specifically designed
>> to manipulate PDF files, but as the script is getting more complex and
>> performs logic in other sections (not shown here) I would like to take
>> advantage of other Python properties.
>
> For PDF you should look at ReportLab. It is really designed for creating
> PDFs from non PDF data (eg out of a database or with graphics etc). Combined
> with a html parser such as Beautiful Soup
> that aspect may be worth converting to Python.
>

Thanks.

-- 
Dotan Cohen

http://what-is-what.com
http://gibberish.co.il


More information about the Tutor mailing list