ExtractText stopped working

bernhardz (Beginner), posted 30 Nov 05 at 5:38PM

Hi,

I originally posted this message on the QuickPDF forum, but decided to post it here as well.

I switched from PDF documents generated by Crystal 8.5 to documents generated by Crystal 10.0, and GetPageText no longer retrieves the correct text.

There must have been some changes in the PDF output of Crystal V10.0. I have sample documents available. Has anybody else experienced this and possibly found a solution?

TIA

Bernhard

chicks (Debenu Quick PDF Library Expert), posted 30 Nov 05 at 6:01PM

Don't know for sure, but here are some possibilities:

1. The PDFs are now created as images instead of text streams (are the files considerably larger than before?)

2. The new PDF creation engine uses a newer compression algorithm (zbig?) that QuickPDF may not be able to uncompress.

To test, see if the standard free command-line tools pdftk and pdftotext (part of xpdf) can uncompress and extract text from them.

Also, after uncompressing with pdftk, view the PDFs in a text editor; you should see the text inside unless it's in an image.
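
The command-line checks suggested above could be scripted roughly as follows. This is only a minimal sketch, assuming pdftk and pdftotext are installed and on the PATH; the function names and the default output file names are made up for illustration.

```python
import shutil
import subprocess


def uncompress_cmd(src, dst):
    # pdftk's "uncompress" operation writes a copy with uncompressed page streams
    return ["pdftk", src, "output", dst, "uncompress"]


def pdftotext_cmd(src, dst_txt):
    # pdftotext (part of xpdf) writes the extracted text to dst_txt
    return ["pdftotext", src, dst_txt]


def run_checks(src, uncompressed="uncompressed.pdf", text_out="extracted.txt"):
    """Run both tools on src.

    Returns {tool name: True on success, False on failure, None if not installed}.
    """
    results = {}
    for cmd in (uncompress_cmd(src, uncompressed), pdftotext_cmd(src, text_out)):
        tool = cmd[0]
        if shutil.which(tool) is None:
            results[tool] = None  # tool not found on PATH
            continue
        results[tool] = subprocess.run(cmd).returncode == 0
    return results
```

Calling run_checks on one of the Crystal 10.0 samples and then opening the uncompressed copy in a text editor would cover both suggestions. If pdftotext also produces little or no text, possibility 1 probably becomes the more likely explanation.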

Ingo (Moderator Group), posted 01 Dec 05 at 1:43AM

Hi Bernhard,

Try this workaround: LoadFromFile(originalfile)... SaveToFile(workfile)... LoadFromFile(workfile)... GetPageText...
SaveToFile writes the PDF content back to disk using QuickPDF's own routines. Loading the newly saved file a second time should solve your problem, I think.
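
In code, the round trip might look something like the sketch below. It assumes `pdf` is an already-created QuickPDF library object that exposes the methods named above. The argument lists shown are assumptions and may differ between library versions, and `extract_text_via_resave` is a made-up helper name.

```python
def extract_text_via_resave(pdf, original_file, work_file, *text_args):
    """Re-save a PDF through the library, then extract text from the re-saved copy.

    `pdf` is assumed to be a QuickPDF object (for example obtained via COM).
    The single-path arguments are an assumption; check the reference for
    your library version. Extra arguments are passed through to GetPageText.
    """
    pdf.LoadFromFile(original_file)     # load the problematic PDF
    pdf.SaveToFile(work_file)           # let the library write it back out itself
    pdf.LoadFromFile(work_file)         # reload the rewritten copy
    return pdf.GetPageText(*text_args)  # extract text as before
```

The work file is kept separate from the original so the source document is never overwritten.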

Cheers,
Ingo

bernhardz (Beginner), posted 01 Dec 05 at 11:10AM

Thanks for the help. The LoadFromFile / SaveToFile / LoadFromFile sequence did the trick, getting me back into a format that QuickPDF could read.

Bernhard
