<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Debenu Quick PDF Library - PDF SDK Community Forum : Text Extraction</title>
  <link>http://www.quickpdf.org/forum/</link>
  <description><![CDATA[This is an XML content feed of; Debenu Quick PDF Library - PDF SDK Community Forum : I need help - I can help : Text Extraction]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Thu, 11 Jun 2026 21:07:20 +0000</pubDate>
  <lastBuildDate>Thu, 19 Jan 2012 21:42:55 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.01</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>www.quickpdf.org/forum/RSS_post_feed.asp?TID=2104</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Debenu Quick PDF Library - PDF SDK Community Forum]]></title>
   <url>http://www.quickpdf.org/forum/forum_images/QPDF_Forum_Title.png</url>
   <link>http://www.quickpdf.org/forum/</link>
  </image>
  <item>
   <title><![CDATA[Text Extraction : If you are using LoadFromFile...]]></title>
   <link>http://www.quickpdf.org/forum/text-extraction_topic2104_post8968.html#8968</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1483">AndrewC</a><br /><strong>Subject:</strong> 2104<br /><strong>Posted:</strong> 19 Jan 12 at 9:42PM<br /><br /><div>If you are using LoadFromFile then it would be better to use the GetPageText(8); call.</div><div><br></div><div><div style="text-align: left; : rgb251, 251, 253; ">&nbsp; nPage = 1</div><div style="text-align: left; : rgb251, 251, 253; ">&nbsp; strInputFilePath="c:\rnw.pdf"</div><div style="text-align: left; : rgb251, 251, 253; ">&nbsp; tt = QP.LoadFromFile(strInputFilePath, â€œâ€)</div></div><div style="text-align: left; : rgb251, 251, 253; "><br></div><div style="text-align: left; : rgb251, 251, 253; ">&nbsp; QP.SelectPage(1);</div><div style="text-align: left; : rgb251, 251, 253; "><br></div><div>&nbsp; QP.SetOrigin(1);</div><div>&nbsp; QP.SetTextExtractionArea(1, 1, QP.PageWidth(), 3 * 72); &nbsp; // 72pts = 1 inch</div><div>&nbsp; strText = QP.GetPageText(8);</div><div><br></div><div>Note: If you use the ExtractFilePageText call then it gets a little more complicated as you need to use the QP.DASetTextExtractionArea function and the native PDF coordinate system as SetOrigin doesn't work with the DA functions and you don't have access to the Page Height and Width until you open the file. &nbsp;It is easier to work with the standard functions and GetPageText().</div>]]>
   </description>
   <pubDate>Thu, 19 Jan 2012 21:42:55 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/text-extraction_topic2104_post8968.html#8968</guid>
  </item> 
  <item>
   <title><![CDATA[Text Extraction : I am using the following code...]]></title>
   <link>http://www.quickpdf.org/forum/text-extraction_topic2104_post8965.html#8965</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1820">rnw</a><br /><strong>Subject:</strong> 2104<br /><strong>Posted:</strong> 19 Jan 12 at 4:52PM<br /><br />I am using the following code to extract text from a pdf file in Visual Basic, and it works great.&nbsp; But I can not figure out how to set the "text&nbsp;Extraction area" to only a portion of the pdf file, say like the firts 3 inches of the pdf file.<DIV>&nbsp;</DIV><DIV>nPage = 1</DIV><DIV>strInputFilePath="c:\rnw.pdf"</DIV><DIV>tt = QP.LoadFromFile(strInputFilePath, “”)<BR>strtext = QP.ExtractFilePageText(strInputFilePath, "", nPage, 8)</DIV><DIV>&nbsp;</DIV><DIV>Any help on where to find this would be great.</DIV><DIV>&nbsp;</DIV><DIV>Roger</DIV><DIV>&nbsp;</DIV><DIV><BR>&nbsp;</DIV>]]>
   </description>
   <pubDate>Thu, 19 Jan 2012 16:52:05 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/text-extraction_topic2104_post8965.html#8965</guid>
  </item> 
 </channel>
</rss>