<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Debenu Quick PDF Library - PDF SDK Community Forum : Extracting text problem</title>
  <link>http://www.quickpdf.org/forum/</link>
  <description><![CDATA[This is an XML content feed of; Debenu Quick PDF Library - PDF SDK Community Forum : I need help - I can help : Extracting text problem]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Wed, 20 May 2026 20:48:26 +0000</pubDate>
  <lastBuildDate>Thu, 14 May 2009 20:34:21 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.01</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>www.quickpdf.org/forum/RSS_post_feed.asp?TID=1085</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Debenu Quick PDF Library - PDF SDK Community Forum]]></title>
   <url>http://www.quickpdf.org/forum/forum_images/QPDF_Forum_Title.png</url>
   <link>http://www.quickpdf.org/forum/</link>
  </image>
  <item>
   <title><![CDATA[Extracting text problem : Excellent -- note, we have also...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5079.html#5079</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1003">deabrew</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 8:34PM<br /><br />Excellent -- note, we have also added support for this functionality within the next build (7.14) of QPL.<br /><br />Cheers, -Karl]]>
   </description>
   <pubDate>Thu, 14 May 2009 20:34:21 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5079.html#5079</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem : I just recreated the PDF sample...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5078.html#5078</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1068">RobertN</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 8:17PM<br /><br /><P>I just recreated the PDF sample using DoPDF print driver instead of PrimoPDF and everything works now in detecting the text using QuickPDF.</P><DIV>Thank you again for the quick responses.</DIV>]]>
   </description>
   <pubDate>Thu, 14 May 2009 20:17:52 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5078.html#5078</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem : Hello Robert, Ingo,  I&amp;#039;d...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5076.html#5076</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1003">deabrew</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 5:53PM<br /><br />Hello Robert, Ingo,<br /><br />I'd like to confirm that Ingo has notified me, and that we will support this issue in a future version (fairly shortly).<br /><br />Regards, Karl.]]>
   </description>
   <pubDate>Thu, 14 May 2009 17:53:58 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5076.html#5076</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem : I&amp;#039;ve sent an email in this...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5074.html#5074</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=111">Ingo</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 10:01AM<br /><br />I've sent an email in this case to Debenu ... ;-)<br><br>Cheers, Ingo<br>]]>
   </description>
   <pubDate>Thu, 14 May 2009 10:01:54 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5074.html#5074</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem : here is essentially what i&amp;#039;m...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5073.html#5073</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1068">RobertN</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 9:47AM<br /><br />here is essentially what i'm doing in Delphi 7.<DIV>&nbsp;</DIV><DIV>&nbsp;</DIV><DIV>procedure TForm1.Button2Click(Sender: TObject);<BR>var oDoc : TQuickPDF0713;<BR>&nbsp;&nbsp;&nbsp; sTemp,sFilename : string;<BR>begin<BR>&nbsp; sFilename := 'c:\Temperature_Transmitter_Template.pdf';<BR>&nbsp; oDoc := TQuickPDF0713.Create;<BR>&nbsp; try<BR>&nbsp; if oDoc.UnlockKey('...') = 1<BR>&nbsp; then begin<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; if oDoc.LoadFromFile(sFilename) = 1<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; then begin<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; sTemp := oDoc.GetPageText(0);<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ShowMessage(sTemp);<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; // this returns an empty string<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; end<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; else begin<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ShowMessage('invalid PDF');<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; end;<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; end<BR>&nbsp; else begin<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; ShowMessage('Invalid KEY');<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; end;<BR>&nbsp; finally<BR>&nbsp;&nbsp;&nbsp; FreeAndNil(oDoc);<BR>&nbsp; end;<BR>end;<BR></DIV><DIV>The output is blank for GetPagetext() 0,1</DIV><DIV>for 2 - I get the text coordinates,etc in CSV format</DIV><DIV>for 3 and 4 - I get the same as 2, but all text is garbled.</DIV><DIV>Do i need to convert it.</DIV><DIV>&nbsp;</DIV><DIV>sample output :</DIV><DIV>"UBTAOI+Arial",#000000,6.71,60.1272,118.3487,295.7056,118.3487,295.7056,124.6588,60.1272,124.6588,"())*++,-../*, )0 +-)0)+*.("</DIV><DIV>&nbsp;</DIV><DIV>Thanks,</DIV><DIV>Robert</DIV>]]>
   </description>
   <pubDate>Thu, 14 May 2009 09:47:04 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5073.html#5073</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem :  Hi!I would be careful about...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5072.html#5072</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=111">Ingo</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 8:51AM<br /><br />Hi!<br><br>I would be careful about the versions of PrimoPDF. They are using the ghostscript-library and with older versions (before 8.15) QuickPDF still has problems while extracting! Your pdf was made with PrimoPDF and ghostscript-version 8.50 ... so this is okay. Looking in the extracted text i can find many variables beginning with "@" ... so i think basically it's working. <br>Adobe Reader (8.1) and Foxit (3.0) can't find "@sometext", too.<br>Is it a special moment while adding "@sometext" to the content? <br>How do you do this? <br>Any code parts for us here to check?<br><br>Cheers, Ingo<br><br><span style="font-size:10px"><br /><br />Edited by Ingo - 14 May 09 at 8:53AM</span>]]>
   </description>
   <pubDate>Thu, 14 May 2009 08:51:06 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5072.html#5072</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem : Hi Ingo,  here is a sample pdf...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5071.html#5071</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1068">RobertN</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 8:39AM<br /><br />Hi Ingo,<DIV>&nbsp;</DIV><DIV>here is a sample pdf file with the&nbsp;"@sometext" in&nbsp;it.</DIV><DIV><a href="http://www.mediafire.com/file/wzmmoznzuwf/Temperature_Transmitter_Template.pdf" target="_blank">http://www.mediafire.com/file/wzmmoznzuwf/Temperature_Transmitter_Template.pdf</A></DIV><DIV>it was generated using excel and printed to PDF via PrimoPDF.</DIV><DIV>I have tried a few other printer drivers, but the result was the same.</DIV><DIV>&nbsp;</DIV><DIV>I tried &nbsp;GetPageText() with 0,1,2,3,4 but all with the same result.</DIV><DIV>I can open it in Acrobat Reader and extract the text without a problem.</DIV><DIV>&nbsp;</DIV><DIV>&nbsp;</DIV><DIV>Thank you,</DIV><DIV>Robert</DIV>]]>
   </description>
   <pubDate>Thu, 14 May 2009 08:39:30 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5071.html#5071</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem : Hi Robert!In your case i think...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5068.html#5068</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=111">Ingo</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 14 May 09 at 1:48AM<br /><br />Hi Robert!<br><br>In your case i think the content of "@some..." will be single strings/words ...<br>So it should be better to use GetPageText(4).<br><br>Perhaps it's possible for you to send me a sample of your files and then i'll try to extract the strings with "@some..."?<br><br>ingo&nbsp; &#091; dot &#093;&nbsp; schmoekel&nbsp; ( at )&nbsp; ewetel&nbsp; &#091; dot &#093;&nbsp; net<br><br>Cheers, Ingo<br>&nbsp;<br>]]>
   </description>
   <pubDate>Thu, 14 May 2009 01:48:35 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5068.html#5068</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text problem : I have created a simple form in...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5067.html#5067</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1068">RobertN</a><br /><strong>Subject:</strong> 1085<br /><strong>Posted:</strong> 13 May 09 at 3:04PM<br /><br /><P>I have created a simple form in Excel with cells that have '@VariableName'</P><DIV></DIV>in them. I print to PDF and then open the pdf using QuickPDF and delphi.<DIV>I want to scan the pdf for all text that has <a href="mailto:@somevariablename" target="_blank">'@somevariablename'</A> and get the fontsize,coordinates,etc and then convert them into formfields.</DIV><DIV>The purpose is to create a pdf form filler that i can save the results from.</DIV><DIV>&nbsp;</DIV><DIV>I tried to do a GetPageText(3) but the results don't have any readable text. If I try a pdf with formfields i get the extracted text properly. </DIV><DIV>&nbsp;</DIV><DIV>How do I extract this text ?</DIV><DIV>&nbsp;</DIV><DIV>Thank you,</DIV><DIV>Robert</DIV>]]>
   </description>
   <pubDate>Wed, 13 May 2009 15:04:27 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5067.html#5067</guid>
  </item> 
 </channel>
</rss>