any lib to extract pages from pdf and then merge?

CTO debatem1 at gmail.com
Tue May 26 01:39:20 EDT 2009


On May 26, 12:47 am, oyster <lepto.pyt... at gmail.com> wrote:
> I want to extract some pages from various PDF files, then write them
> with/without rotation into one new PDF file. Something like this:
> [py]
> import gfx
> doc = gfx.open("pdf", r"Theory.pdf")
> pdf = gfx.PDF()
> for pagenr in [1,5,7]:
>     page = doc.getPage(pagenr)
>
>     if pagenr == 1:
>         page.rotate(90)  # for some pages
>
>     pdf.startpage(page.width, page.height)
>     page.render(pdf)
>     pdf.endpage()
> pdf.save("new pdf.pdf")
> [/py]
>
> I have tried pypdf, but it raises an error and exits on some of my PDFs
> (no, the files are not password-protected).
>
> Can someone suggest such a lib for Python on Windows, or a pure C DLL?
> (I mean PDF page -> PDF, not PDF page -> picture -> PDF.)
>
> thanx

I'd recommend reportlab <URL: http://www.reportlab.org/>. It is mostly
geared towards creating new PDFs, but it is pretty much best-of-breed
AFAICT.
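
If pypdf can be made to read the files after all, the extract/rotate/merge
loop from the question could look roughly like the sketch below. It is
untested against the PDFs in question and rests on several assumptions: it
uses the current pypdf package (version 3 or later, with PdfReader,
PdfWriter, page.rotate and add_page), it treats page numbers as 1-based,
and plan_pages and merge_pages are hypothetical names chosen for
illustration. Opening with strict=False asks pypdf to tolerate some
malformed files; whether that gets past the errors reported above is only
a guess.

```python
def plan_pages(page_numbers, rotations=None):
    """Turn 1-based page numbers into (0-based index, rotation) pairs.

    `rotations` maps a 1-based page number to a clockwise angle in degrees.
    Both the 1-based convention and this helper are assumptions of the sketch.
    """
    rotations = rotations or {}
    plan = []
    for number in page_numbers:
        if number < 1:
            raise ValueError(f"page numbers start at 1, got {number}")
        angle = rotations.get(number, 0)
        if angle % 90 != 0:
            raise ValueError(f"rotation must be a multiple of 90, got {angle}")
        plan.append((number - 1, angle))
    return plan


def merge_pages(jobs, dst):
    """Copy the selected pages of each source PDF into one new file.

    `jobs` is a list of (source_path, page_numbers, rotations) tuples.
    """
    # Third-party dependency (pip install pypdf), imported here so that
    # plan_pages stays usable without it.
    from pypdf import PdfReader, PdfWriter

    writer = PdfWriter()
    for src, page_numbers, rotations in jobs:
        # strict=False: tolerate some malformed files instead of raising
        reader = PdfReader(src, strict=False)
        for index, angle in plan_pages(page_numbers, rotations):
            page = reader.pages[index]
            if angle:
                page.rotate(angle)  # clockwise, in multiples of 90 degrees
            writer.add_page(page)
    with open(dst, "wb") as out:
        writer.write(out)
```

With the file names from the question, a call would be
merge_pages([("Theory.pdf", [1, 5, 7], {1: 90})], "new pdf.pdf"). Pages are
copied as PDF objects, so nothing is rendered to an image along the way.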

Geremy Condra
