
Extracting Pages

Printed From: Debenu Quick PDF Library - PDF SDK Community Forum
Category: For Users of the Library
Forum Name: I need help - I can help
Forum Description: Problems and solutions while programming with the Debenu Quick PDF Library and Debenu PDF Viewer SDK
URL: http://www.quickpdf.org/forum/forum_posts.asp?TID=692
Printed Date: 15 May 24 at 2:24PM
Software Version: Web Wiz Forums 11.01 - http://www.webwizforums.com


Topic: Extracting Pages
Posted By: jose.cavalcanti
Subject: Extracting Pages
Date Posted: 27 Mar 07 at 11:04PM
Hi, when I extract pages using QuickPDF, the resulting files are much bigger than when I extract the same pages with Adobe Acrobat itself.

Page 102: 2106 KB (QuickPDF) vs. 61 KB (Acrobat)

Page 411: 2100 KB (QuickPDF) vs. 55 KB (Acrobat)

Does anyone know why?

Regards,




Replies:
Posted By: Ingo
Date Posted: 28 Mar 07 at 1:06AM
Hi Jose!

Are there any images on these pages?
If there is one image referenced from every page in the PDF, perhaps QuickPDF copies it into each extracted page? Do the extracted pages from Acrobat and QuickPDF look identical?

Best regards,
Ingo



Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 10:00AM
A little one at the end of the page.
The pages are absolutely identical.
If you don't mind, I can send them to you.
I have a procedure that takes forever because of the size of the files...

Regards,


Posted By: Ingo
Date Posted: 28 Mar 07 at 10:03AM
Okay... send it.



Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 10:04AM
One thing that may help: when I was zipping the 4 files to send to you (just in case), the compression ratio for the Acrobat pages was 16% and the ratio for the QuickPDF pages was 92%!
The ZIP file size is 427 KB.


Regards,
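
A ZIP ratio that high usually means the file still contains a lot of easily compressible data. That would be consistent with the QuickPDF output storing its streams uncompressed, although nothing in this thread confirms it. The same check can be made without a ZIP tool. Below is a minimal Python sketch; the function names `deflate_saving` and `report` are hypothetical, and the saving is computed as "fraction of space saved", which is an assumption about how the ZIP tool reports its ratio.

```python
import zlib


def deflate_saving(data: bytes) -> float:
    """Return the fraction of space saved by Deflate (0.0 = none, 1.0 = all)."""
    if not data:
        return 0.0
    compressed = zlib.compress(data, 9)
    # Already-compressed input can grow slightly; clamp the saving at zero.
    return max(0.0, 1.0 - len(compressed) / len(data))


def report(path: str) -> None:
    """Print the size and Deflate saving for one file on disk."""
    with open(path, "rb") as fh:
        data = fh.read()
    print(f"{path}: {len(data)} bytes, saving {deflate_saving(data):.0%}")
```

Calling `report()` on the QuickPDF page and on the Acrobat page (file names of your choice) should show a large saving for the first and a small one for the second if uncompressed streams are the cause.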


Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 10:05AM
What is your e-mail address?


Posted By: Ingo
Date Posted: 28 Mar 07 at 12:07PM
ingo[dot]schmoekel[at]ewetel[dot]net



Posted By: Ingo
Date Posted: 28 Mar 07 at 12:13PM
Originally posted by jose.cavalcanti:

the Ratio for Acrobat pages was 16% and the Ratio for QuickPDF pages was 92%!!!


Hi Jose!

In this case perhaps you should try functions like
CompressContent
CompressPage
CompressFonts
CompressImages
first.

Best regards,
Ingo



Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 1:39PM
Sent.

Regards,


Posted By: jose.cavalcanti
Date Posted: 28 Mar 07 at 1:41PM
The problem is that it'll take more time to extract pages in my procedure...
I think that I'll need another way to do this.

Regards,


Posted By: Ingo
Date Posted: 29 Mar 07 at 12:58AM
Hi Jose!

Did you try the compress-functions?

Best regards,
Ingo



Posted By: jose.cavalcanti
Date Posted: 29 Mar 07 at 9:27AM
No. The problem itself is not the size of the extracted page. The problem is that extracting pages from this file takes much longer than from other PDF files, so extracting all of its pages takes a long time.
I think that the problem is in the source file.
Today I'll be studying this problem.
I'll let you know of any solution.
Thanks anyway.

Best Regards,


Posted By: jose.cavalcanti
Date Posted: 29 Mar 07 at 8:17PM
Do you know of any other PDF Library that I can use to extract pages from a PDF document?
I want to see where the problem is.

Regards,


Posted By: jose.cavalcanti
Date Posted: 26 Apr 07 at 12:38PM
I'm here again, Ingo.

As you may remember, I need to extract pages from PDF documents. Everything was going fine, but with ONE publication the extracted page was 1.215 MB, from a 240-page document of 13.226 MB.
When I extract the same page with Adobe Acrobat Professional 7, the size is 39 KB.
For this publication I wrote a piece of code using the Adobe Acrobat SDK, but it's giving me a lot of problems (Out of Memory Error, Server Error, etc.). The code is:


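LPARAMETERS toPDFDoc, tcFileNameFrom, tcFileNameTo, tnNumerodaPagina
* toPDFDoc: Acrobat document automation object created by the caller.
* tnNumerodaPagina: 1-based number of the page to extract.

LOCAL llSourceDoc, loJSObject, loExtractedDoc

llSourceDoc    = toPDFDoc.Open(tcFileNameFrom)
loJSObject     = toPDFDoc.GetJSObject()
* The Acrobat JavaScript API counts pages from 0, hence the "- 1".
loExtractedDoc = loJSObject.ExtractPages(tnNumerodaPagina - 1, tnNumerodaPagina - 1)
= loExtractedDoc.SaveAs(tcFileNameTo)
= loExtractedDoc.CloseDoc()
= toPDFDoc.Close()

RETURN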

Do you know of any other method to extract pages from a PDF Document?

Regards,


Posted By: chicks
Date Posted: 26 Apr 07 at 12:48PM
Have you tried pdftk (http://www.pdfhacks.com/pdftk/)? It's based on iText and comes with complete source code. There's a .NET port of iText which you may want to use if pdftk works for you.
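
For reference, pdftk extracts pages with its `cat` operation, for example `pdftk in.pdf cat 102 output page102.pdf`. Below is a minimal Python sketch that wraps this call, assuming pdftk is installed and on the PATH. The helper names `pdftk_extract_cmd` and `extract_pages` and all file names are hypothetical.

```python
import subprocess
from typing import List


def pdftk_extract_cmd(src: str, first: int, last: int, dst: str) -> List[str]:
    """Build the pdftk command line that copies pages first..last (1-based)."""
    if first < 1 or last < first:
        raise ValueError("page range must satisfy 1 <= first <= last")
    page_range = str(first) if first == last else f"{first}-{last}"
    return ["pdftk", src, "cat", page_range, "output", dst]


def extract_pages(src: str, first: int, last: int, dst: str) -> None:
    """Run pdftk; raises CalledProcessError if pdftk exits with an error."""
    subprocess.run(pdftk_extract_cmd(src, first, last, dst), check=True)
```

Extracting a single page, as in the original question, would then be `extract_pages("publication.pdf", 102, 102, "page102.pdf")`. Comparing the size of that output with the QuickPDF and Acrobat results may help narrow down where the difference comes from.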


Posted By: Ingo
Date Posted: 27 Apr 07 at 2:42AM
Hi Jose!

The real PDF-expert here is Chicks ;-)
You should try pdftk/iText.
There's a C# version, iTextSharp, too.

Best regards,
Ingo



