<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Debenu Quick PDF Library - PDF SDK Community Forum : ocr - where is the recognized text?</title>
  <link>http://www.quickpdf.org/forum/</link>
  <description><![CDATA[This is an XML content feed of: Debenu Quick PDF Library - PDF SDK Community Forum : I need help - I can help : ocr - where is the recognized text?]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Thu, 11 Jun 2026 21:14:25 +0000</pubDate>
  <lastBuildDate>Wed, 25 Nov 2015 07:56:36 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.01</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>www.quickpdf.org/forum/RSS_post_feed.asp?TID=2102</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Debenu Quick PDF Library - PDF SDK Community Forum]]></title>
   <url>http://www.quickpdf.org/forum/forum_images/QPDF_Forum_Title.png</url>
   <link>http://www.quickpdf.org/forum/</link>
  </image>
  <item>
   <title><![CDATA[ocr - where is the recognized text? :   AndrewC wrote:OCR text is often...]]></title>
   <link>http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post12827.html#12827</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=2803">wubuer</a><br /><strong>Subject:</strong> 2102<br /><strong>Posted:</strong> 25 Nov 15 at 7:56AM<br /><br /><table width="99%"><tr><td class="BBquote"><img src="forum_images/quote_box.png" title="Originally posted by AndrewC" alt="Originally posted by AndrewC" style="vertical-align: text-bottom;" /> <strong>AndrewC wrote:</strong><br /><br />OCR text is often inserted as an invisible text object: it cannot be seen on the page, but it can be extracted with the GetPageText text extraction function in QPL.<div><br></div><div>&nbsp; int ret = QP.LoadFromFile("ocred.pdf", "");</div><div>&nbsp; string s = QP.GetPageText(3); &nbsp; &nbsp;// you can also try option 7 or 8.</div><div><br></div><div><br></div></td></tr></table><div><br></div><div>Thanks, it helps a lot.</div>]]>
   </description>
   <pubDate>Wed, 25 Nov 2015 07:56:36 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post12827.html#12827</guid>
  </item> 
  <item>
   <title><![CDATA[ocr - where is the recognized text? : Many thanks for your precious...]]></title>
   <link>http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8960.html#8960</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1817">vladob</a><br /><strong>Subject:</strong> 2102<br /><strong>Posted:</strong> 17 Jan 12 at 7:43AM<br /><br />Many thanks for your precious help.<DIV>It works.</DIV><DIV>Have a nice day.</DIV><DIV>V.</DIV>]]>
   </description>
   <pubDate>Tue, 17 Jan 2012 07:43:06 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8960.html#8960</guid>
  </item> 
  <item>
   <title><![CDATA[ocr - where is the recognized text? : OCR text is often inserted into...]]></title>
   <link>http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8959.html#8959</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1483">AndrewC</a><br /><strong>Subject:</strong> 2102<br /><strong>Posted:</strong> 16 Jan 12 at 2:22PM<br /><br />OCR text is often inserted as an invisible text object: it cannot be seen on the page, but it can be extracted with the GetPageText text extraction function in QPL.<div><br></div><div>&nbsp; int ret = QP.LoadFromFile("ocred.pdf", "");</div><div>&nbsp; string s = QP.GetPageText(3); &nbsp; &nbsp;// 3 is the extraction option; you can also try option 7 or 8.</div><div><br></div><div><br></div>]]>
   </description>
   <pubDate>Mon, 16 Jan 2012 14:22:44 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8959.html#8959</guid>
  </item> 
  <item>
   <title><![CDATA[ocr - where is the recognized text? : Hi Vladimir! Don't know if...]]></title>
   <link>http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8958.html#8958</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=111">Ingo</a><br /><strong>Subject:</strong> 2102<br /><strong>Posted:</strong> 13 Jan 12 at 6:47PM<br /><br />Hi Vladimir!<br><br>Don't know if I understand your question correctly, but ...<br>Take a scanned invoice, for example.<br>It is first scanned as an image into a PDF.<br>You can view this PDF via QuickPDF, change its properties and so on, but text extraction isn't possible.<br>Then there are OCR tools that go through this PDF and produce readable text content from the "image PDF".<br>The "image PDF" itself remains the same, but the OCR tool additionally inserts real text content.<br>Now you can extract this text with QuickPDF, and things like full-text search become possible.<br><br>With QuickPDF you can determine whether a PDF has been OCR-ed, because text extraction has an option to extract with font names... OCR fonts are very special fonts, and the font name usually contains "ocr" too.<br>Another way to identify an OCR-ed PDF:<br>the number of inserted images equals the page count, and the images have the same dimensions as the pages.<br><br>I hope I could help a little bit and perhaps now you have further ideas ;-)<br><br>Cheers and welcome here,<br>Ingo<br>&nbsp;<br>]]>
   </description>
   <pubDate>Fri, 13 Jan 2012 18:47:43 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8958.html#8958</guid>
  </item> 
  <item>
   <title><![CDATA[ocr - where is the recognized text? : Hi all  I have following question,...]]></title>
   <link>http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8957.html#8957</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1817">vladob</a><br /><strong>Subject:</strong> 2102<br /><strong>Posted:</strong> 13 Jan 12 at 5:36PM<br /><br />Hi all<DIV>&nbsp;</DIV><DIV>I have the following question: when you ask OCR software to read an image PDF (pictures scanned into a PDF), the OCR engine injects the recognized text into the PDF file. Can you let me know where? I mean, how can I access that recognized text with QuickPDF?</DIV><DIV>&nbsp;</DIV><DIV>Many thanks</DIV><DIV>&nbsp;</DIV><DIV>&nbsp;</DIV><DIV>Vladimir</DIV>]]>
   </description>
   <pubDate>Fri, 13 Jan 2012 17:36:52 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/ocr-where-is-the-recognized-text_topic2102_post8957.html#8957</guid>
  </item> 
 </channel>
</rss>