Retrieving PDF content

Printed From: Debenu Quick PDF Library - PDF SDK Community Forum
Category: For Users of the Library
Forum Name: I need help - I can help
Forum Description: Problems and solutions while programming with the Debenu Quick PDF Library and Debenu PDF Viewer SDK
URL: http://www.quickpdf.org/forum/forum_posts.asp?TID=372
Printed Date: 19 May 24 at 2:01AM


Topic: Retrieving PDF content
Posted By: Gerald manickam
Subject: Retrieving PDF content
Date Posted: 19 Mar 06 at 6:57AM

Hi,

I want to extract the content of a PDF by passing in coordinates.

Is this possible with Quick PDF? Please help.

With Regards,

P.Gerald Manickam




Replies:
Posted By: swb1
Date Posted: 24 Mar 06 at 10:12AM

Yes, it’s possible. I’ve done it.

It is not necessarily straightforward or predictable. I use GetPageText( 3 ), which returns a list of text elements and their coordinates. You can use this list to determine whether a given piece of text lies within certain boundaries.
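Here is a rough sketch of that filtering step in Python. It does not call the library itself; it only works on the string that comes back. It assumes each line of the returned list is comma-separated as font, color, size, four corner points (x1,y1 to x4,y4), then the text. That layout, the sample data, and the names parse_elements and text_in_rect are assumptions made for illustration, so check the field order against your version of the library before relying on it.

```python
import csv
from io import StringIO


def parse_elements(raw):
    """Parse lines assumed to look like:
    font,color,size,x1,y1,x2,y2,x3,y3,x4,y4,text
    (an assumed layout; verify it against your library version)."""
    elements = []
    for row in csv.reader(StringIO(raw)):
        if len(row) < 12:
            continue  # skip blank or malformed lines
        xs = [float(v) for v in row[3:11:2]]  # x1, x2, x3, x4
        ys = [float(v) for v in row[4:11:2]]  # y1, y2, y3, y4
        elements.append({
            "text": row[11],
            "left": min(xs), "right": max(xs),
            "bottom": min(ys), "top": max(ys),
        })
    return elements


def text_in_rect(elements, left, bottom, right, top):
    """Return the text of every element whose box lies fully inside the rectangle."""
    return [
        e["text"] for e in elements
        if e["left"] >= left and e["right"] <= right
        and e["bottom"] >= bottom and e["top"] <= top
    ]


# Made-up sample data in the assumed layout.
sample = '''"Arial",#000000,12,72,700,110,700,110,712,72,712,"Invoice"
"Arial",#000000,12,72,650,130,650,130,662,72,662,"Total, due"
"Arial",#000000,12,400,650,440,650,440,662,400,662,"42.00"
'''

elements = parse_elements(sample)
print(text_in_rect(elements, 50, 640, 200, 670))  # ['Total, due']
```

Requiring the whole box to sit inside the rectangle is the strict choice. If your regions are tight, testing for overlap instead may work better.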

 

The gotcha is that not all PDFs are created equal. How the text is laid out in the document is highly dependent on how the PDF was created. For example, I have seen at least one PDF print driver that creates PDFs ONE LETTER AT A TIME! While the result displays just fine, it was almost impossible to determine where one word stopped and the next word started. You will need tight control over how your PDFs are produced.
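If you are stuck with letter-at-a-time output, one possible workaround is to glue fragments back together by looking at the horizontal gap between neighbours. The sketch below is only a heuristic, not something the library provides. The function name group_into_words, the (text, left, right, baseline) tuple shape, and the gap_ratio and line_tolerance thresholds are all made up for illustration and would need tuning per document.

```python
def group_into_words(fragments, gap_ratio=0.3, line_tolerance=2.0):
    """Merge text fragments into words.

    fragments: list of (text, left, right, baseline) tuples (hypothetical shape).
    A fragment joins the previous word when it is on the same line and the
    gap to the previous fragment is small relative to its character width.
    """
    # Top of page first (larger y), then left to right.
    ordered = sorted(fragments, key=lambda f: (-f[3], f[1]))
    words = []
    for text, left, right, baseline in ordered:
        char_width = (right - left) / max(len(text), 1)
        if words:
            prev = words[-1]
            same_line = abs(prev["baseline"] - baseline) <= line_tolerance
            gap = left - prev["right"]
            if same_line and gap <= gap_ratio * char_width:
                prev["text"] += text
                prev["right"] = right
                continue
        words.append({"text": text, "left": left, "right": right,
                      "baseline": baseline})
    return [w["text"] for w in words]


# Made-up fragments: "Hi PDF" on one line and "ok" on the next, one letter each.
letters = [
    ("H", 72, 78, 700), ("i", 78, 81, 700),
    ("P", 85, 91, 700), ("D", 91, 97, 700), ("F", 97, 103, 700),
    ("o", 72, 78, 686), ("k", 78, 84, 686),
]
print(group_into_words(letters))  # ['Hi', 'PDF', 'ok']
```

This assumes fragments on the same line share nearly the same baseline and that word spaces are clearly wider than letter spacing. Justified or heavily kerned text can break both assumptions.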

 

Regards,

sb


