Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!

Debenu Quick PDF Library - PDF SDK Community Forum Homepage
Forum Home Forum Home > For Users of the Library > I need help - I can help
  New Posts New Posts RSS Feed - GetPageText returns not all text
  FAQ FAQ  Forum Search   Register Register  Login Login

GetPageText returns not all text

 Post Reply Post Reply
Author
Message
Mykhaylo Boreyko View Drop Down
Beginner
Beginner
Avatar

Joined: 20 Nov 13
Location: Ukraine
Status: Offline
Points: 3
Post Options Post Options   Thanks (0) Thanks(0)   Quote Mykhaylo Boreyko Quote  Post ReplyReply Direct Link To This Post Topic: GetPageText returns not all text
    Posted: 20 Nov 13 at 3:39PM
Hello guys,

I'm implementing search functionality for my PDF viewer.
I'm using GetPageText function to get text on page and then find target string.
When target string is found, text is highlighted on page using returned in GetPageText text bounds.

Everything works fine. But for some documents, pages GetPageText returns not all words on page, so search fails.

In reference guide there is statement "The SetTextExtractionWordGap, SetTextExtractionOptions and SetTextExtractionArea functions can be used to adjust the text extraction process".
But what function I must call, with what parameters, to return all words on page?

Or maybe GetPageText is obsolete and ExtractPageTextBlocks must be used instead?

P.S.
I'm using Quick PDF Library 9.16.
Back to Top
Mykhaylo Boreyko View Drop Down
Beginner
Beginner
Avatar

Joined: 20 Nov 13
Location: Ukraine
Status: Offline
Points: 3
Post Options Post Options   Thanks (0) Thanks(0)   Quote Mykhaylo Boreyko Quote  Post ReplyReply Direct Link To This Post Posted: 20 Nov 13 at 7:39PM
I've realized that problem was in my csv parsing code.
So for now everything is normal.
Back to Top
AndrewC View Drop Down
Moderator Group
Moderator Group
Avatar

Joined: 08 Dec 10
Location: Geelong, Aust
Status: Offline
Points: 841
Post Options Post Options   Thanks (0) Thanks(0)   Quote AndrewC Quote  Post ReplyReply Direct Link To This Post Posted: 26 Nov 13 at 1:50AM
Mykhaylo,

Using ExtractPageTextBlocks removes the needs to parse a CSV file which can be a little trick at times.

The SetTextExtractionWordGap function is very very rarely needed and the SetTextExtractionOptions is useful when you need the results to be reformatted a little to make extraction easier.

Andrew.
Back to Top
 Post Reply Post Reply
  Share Topic   

Forum Jump Forum Permissions View Drop Down

Forum Software by Web Wiz Forums® version 11.01
Copyright ©2001-2014 Web Wiz Ltd.

Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. AboutContactBlogSupportOnline Store