Print Page | Close Window

GetPageText returns not all text

Printed From: Debenu Quick PDF Library - PDF SDK Community Forum
Category: For Users of the Library
Forum Name: I need help - I can help
Forum Description: Problems and solutions while programming with the Debenu Quick PDF Library and Debenu PDF Viewer SDK
URL: http://www.quickpdf.org/forum/forum_posts.asp?TID=2786
Printed Date: 27 Jan 26 at 2:03PM
Software Version: Web Wiz Forums 11.01 - http://www.webwizforums.com


Topic: GetPageText returns not all text
Posted By: Mykhaylo Boreyko
Subject: GetPageText returns not all text
Date Posted: 20 Nov 13 at 3:39PM
Hello guys,

I'm implementing search functionality for my PDF viewer.
I'm using GetPageText function to get text on page and then find target string.
When target string is found, text is highlighted on page using returned in GetPageText text bounds.

Everything works fine. But for some documents, pages GetPageText returns not all words on page, so search fails.

In reference guide there is statement "The SetTextExtractionWordGap, SetTextExtractionOptions and SetTextExtractionArea functions can be used to adjust the text extraction process".
But what function I must call, with what parameters, to return all words on page?

Or maybe GetPageText is obsolete and ExtractPageTextBlocks must be used instead?

P.S.
I'm using Quick PDF Library 9.16.



Replies:
Posted By: Mykhaylo Boreyko
Date Posted: 20 Nov 13 at 7:39PM
I've realized that problem was in my csv parsing code.
So for now everything is normal.


Posted By: AndrewC
Date Posted: 26 Nov 13 at 1:50AM
Mykhaylo,

Using ExtractPageTextBlocks removes the needs to parse a CSV file which can be a little trick at times.

The SetTextExtractionWordGap function is very very rarely needed and the SetTextExtractionOptions is useful when you need the results to be reformatted a little to make extraction easier.

Andrew.



Print Page | Close Window

Forum Software by Web Wiz Forums® version 11.01 - http://www.webwizforums.com
Copyright ©2001-2014 Web Wiz Ltd. - http://www.webwiz.co.uk