Extracting Pages

Printed From: Debenu Quick PDF Library - PDF SDK Community Forum
Category: For Users of the Library
Forum Name: I need help - I can help
Forum Description: Problems and solutions while programming with the Debenu Quick PDF Library and Debenu PDF Viewer SDK
URL: http://www.quickpdf.org/forum/forum_posts.asp?TID=692
Printed Date: 09 Mar 26 at 6:54PM

Topic: Extracting Pages

Posted By: jose.cavalcanti
Subject: Extracting Pages
Date Posted: 27 Mar 07 at 11:04PM

Hi, when I extract pages using QuickPDF, the resulting files are much bigger than when I extract the same pages with Adobe Acrobat itself.

Page 102 - QuickPDF: 2106 KB - Acrobat: 61 KB

Page 411 - QuickPDF: 2100 KB - Acrobat: 55 KB

Does anyone know why?

Regards,

Replies:

Posted By: Ingo
Date Posted: 28 Mar 07 at 1:06AM

Hi Jose!

Are there any images on these pages?
If the PDF holds a single image that every page references, could QuickPDF be putting it into each extracted page? Do the pages extracted by Acrobat and by QuickPDF look identical?

Best regards,
Ingo

Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 10:00AM

A little one at the end of the page.
The pages are absolutely identical.
If you don't mind, I can send them to you.
I have a procedure that takes forever because of the size of the files...

Regards,

Posted By: Ingo
Date Posted: 28 Mar 07 at 10:03AM

Okay... send it.

Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 10:04AM

One thing that may help: when I was zipping the 4 files to send to you (just in case), the ratio for the Acrobat pages was 16% and the ratio for the QuickPDF pages was 92%!!!
The ZIP file size is 427 KB.

Regards,
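
The ZIP ratios above are a useful diagnostic. Assuming the "ratio" is the percentage of space saved, a file that shrinks by 92% probably contains streams stored without compression, while one that shrinks by only 16% is already mostly compressed. A minimal sketch in Python, using only the standard zlib module, of measuring that saving; the function names and the sample data are illustrative and not part of QuickPDF or Acrobat:

```python
import os
import zlib


def deflate_saving(data: bytes) -> float:
    """Return the percentage of space saved by deflating data (0.0 for empty input)."""
    if not data:
        return 0.0
    compressed = zlib.compress(data, 9)
    return 100.0 * (1.0 - len(compressed) / len(data))


def file_deflate_saving(path: str) -> float:
    """Read a whole file and report how much deflate would shrink it."""
    with open(path, "rb") as handle:
        return deflate_saving(handle.read())


if __name__ == "__main__":
    # Repetitive text, similar to an uncompressed PDF content stream, shrinks a lot.
    plain = b"0 0 m 100 100 l S\n" * 5000
    # Bytes that are already compressed barely shrink at all.
    packed = zlib.compress(os.urandom(50000))
    print(f"uncompressed-looking data: {deflate_saving(plain):.0f}% saved")
    print(f"already-compressed data:   {deflate_saving(packed):.0f}% saved")
```

Calling file_deflate_saving on each extracted page would show in the same way whether the QuickPDF output is simply stored uncompressed.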

Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 10:05AM

What is your e-mail address?

Posted By: Ingo
Date Posted: 28 Mar 07 at 12:07PM

ingo[dot]schmoekel[at]ewetel[dot]net

Posted By: Ingo
Date Posted: 28 Mar 07 at 12:13PM

jose.cavalcanti wrote:

the Ratio for Acrobat pages was 16% and the Ratio for QuickPDF pages was 92%!!!

Hi Jose!

In this case perhaps you should try functions like
CompressContent
CompressPage
CompressFonts
CompressImages
first.

Best regards,
Ingo

Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 1:39PM

Sent.

Regards,

Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 1:41PM

The problem is that it'll take more time to extract pages in my procedure...
I think that I'll need another way to do this.

Regards,

Posted By: Ingo
Date Posted: 29 Mar 07 at 12:58AM

Hi Jose!

Did you try the compress-functions?

Best regards,
Ingo

Posted By: jose.cavalcanti
Date Posted: 29 Mar 07 at 9:27AM

No. The problem itself is not the size of the extracted page. The problem is that extracting these pages takes much longer than with other PDF files, so extracting all the pages of the file takes a long time.
I think that the problem is in the source file.
Today I'll be studying this problem.
I'll let you know if I find a solution.
Thanks anyway.

Best Regards,

Posted By: jose.cavalcanti
Date Posted: 29 Mar 07 at 8:17PM

Do you know of any other PDF Library that I can use to extract pages from a PDF document?
I want to see where the problem is.

Regards,

Posted By: jose.cavalcanti
Date Posted: 26 Apr 07 at 12:38PM

I'm here again, Ingo.

As you may remember, I need to extract pages from PDF documents. Everything was going fine, but with ONE publication, the extracted page was 1.215 MB, from a 240-page document of 13.226 MB.
When I extract the same page with Adobe Acrobat Professional 7, the size is 39 KB.
For this publication I wrote a piece of code using the Adobe Acrobat SDK, but it's giving me a lot of problems (Out of Memory Error, Server Error, etc.). The code is:

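LPARAMETERS    toPDFDoc, tcFileNameFrom, tcFileNameTo, tnNumerodaPagina

LOCAL    llSourceDoc, loJSObject, loExtractedDoc

* Open the source document; stop if Acrobat cannot open it
llSourceDoc    = toPDFDoc.Open(tcFileNameFrom)
IF NOT llSourceDoc
    RETURN .F.
ENDIF

* Acrobat JavaScript page numbers are zero-based, hence the "- 1"
loJSObject     = toPDFDoc.GetJSObject()
loExtractedDoc = loJSObject.ExtractPages(tnNumerodaPagina - 1, tnNumerodaPagina - 1)
               = loExtractedDoc.SaveAs(tcFileNameTo)
               = loExtractedDoc.CloseDoc()
               = toPDFDoc.Close()

* Drop the COM references once both documents are closed
STORE .NULL. TO loExtractedDoc, loJSObject

RETURN .T.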

Do you know of any other method to extract pages from a PDF Document?

Regards,

Posted By: chicks
Date Posted: 26 Apr 07 at 12:48PM

Have you tried pdftk (http://www.pdfhacks.com/pdftk/)? It's based on iText, and comes with complete source code. There's a .NET port of iText which you may want to use if pdftk works for you.
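
For reference, pdftk extracts pages with its cat operation, for example: pdftk in.pdf cat 102 output page102.pdf. A minimal sketch in Python of wrapping that call, assuming pdftk is installed and on the PATH; the helper names and file names are hypothetical and only illustrate the command shape:

```python
import shutil
import subprocess
from typing import List


def pdftk_extract_args(source: str, first: int, last: int, target: str) -> List[str]:
    """Build the pdftk command that copies pages first..last (1-based) into target."""
    if first < 1 or last < first:
        raise ValueError("page numbers are 1-based and last must be >= first")
    page_range = str(first) if first == last else f"{first}-{last}"
    return ["pdftk", source, "cat", page_range, "output", target]


def extract_pages(source: str, first: int, last: int, target: str) -> None:
    """Run pdftk to write the requested pages to target; raises if pdftk is missing or fails."""
    if shutil.which("pdftk") is None:
        raise FileNotFoundError("pdftk was not found on PATH")
    subprocess.run(pdftk_extract_args(source, first, last, target), check=True)


if __name__ == "__main__":
    # Show the command for a single page without running it (file names are placeholders).
    print(" ".join(pdftk_extract_args("publication.pdf", 102, 102, "page102.pdf")))
```

Building the argument list separately from running it makes it easy to check the command before pointing it at a large publication.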

Posted By: Ingo
Date Posted: 27 Apr 07 at 2:42AM

Hi Jose!

The real PDF-expert here is Chicks ;-)
You should try pdftk/iText.
There's a C# version, iTextSharp, too.

Best regards,
Ingo