Debenu Quick PDF Library - PDF SDK Community Forum : Extracting text problem

Debenu Quick PDF Library - PDF SDK Community Forum : Extracting text problem http://www.quickpdf.org/forum/ Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved. Wed, 20 May 2026 20:48:26 +0000 Thu, 14 May 2009 20:34:21 +0000 http://blogs.law.harvard.edu/tech/rss Web Wiz Forums 11.01 360 www.quickpdf.org/forum/RSS_post_feed.asp?TID=1085 <![CDATA[Debenu Quick PDF Library - PDF SDK Community Forum]]> http://www.quickpdf.org/forum/forum_images/QPDF_Forum_Title.png http://www.quickpdf.org/forum/ <![CDATA[Extracting text problem : Excellent -- note, we have also...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5079.html#5079 Author: deabrew
Subject: 1085
Posted: 14 May 09 at 8:34PM

Excellent -- note, we have also added support for this functionality within the next build (7.14) of QPL.

Cheers, -Karl]]> Thu, 14 May 2009 20:34:21 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5079.html#5079 <![CDATA[Extracting text problem : I just recreated the PDF sample...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5078.html#5078 Author: RobertN
Subject: 1085
Posted: 14 May 09 at 8:17PM

I just recreated the PDF sample using DoPDF print driver instead of PrimoPDF and everything works now in detecting the text using QuickPDF.

Thank you again for the quick responses.

]]> Thu, 14 May 2009 20:17:52 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5078.html#5078 <![CDATA[Extracting text problem : Hello Robert, Ingo, I'd...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5076.html#5076 Author: deabrew
Subject: 1085
Posted: 14 May 09 at 5:53PM

Hello Robert, Ingo,

I'd like to confirm that Ingo has notified me, and that we will support this issue in a future version (fairly shortly).

Regards, Karl.]]> Thu, 14 May 2009 17:53:58 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5076.html#5076 <![CDATA[Extracting text problem : I've sent an email in this...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5074.html#5074 Author: Ingo
Subject: 1085
Posted: 14 May 09 at 10:01AM

I've sent an email in this case to Debenu ... ;-)

Cheers, Ingo
]]> Thu, 14 May 2009 10:01:54 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5074.html#5074 <![CDATA[Extracting text problem : here is essentially what i'm...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5073.html#5073 Author: RobertN
Subject: 1085
Posted: 14 May 09 at 9:47AM

here is essentially what i'm doing in Delphi 7.

procedure TForm1.Button2Click(Sender: TObject);
var oDoc : TQuickPDF0713;
    sTemp,sFilename : string;
begin
sFilename := 'c:\Temperature_Transmitter_Template.pdf';
oDoc := TQuickPDF0713.Create;
try
if oDoc.UnlockKey('...') = 1
then begin
         if oDoc.LoadFromFile(sFilename) = 1
         then begin
                sTemp := oDoc.GetPageText(0);
                ShowMessage(sTemp);
                // this returns an empty string
              end
         else begin
                ShowMessage('invalid PDF');
              end;
       end
else begin
         ShowMessage('Invalid KEY');
       end;
finally
    FreeAndNil(oDoc);
end;
end;

The output is blank for GetPagetext() 0,1

for 2 - I get the text coordinates,etc in CSV format

for 3 and 4 - I get the same as 2, but all text is garbled.

Do i need to convert it.

sample output :

"UBTAOI+Arial",#000000,6.71,60.1272,118.3487,295.7056,118.3487,295.7056,124.6588,60.1272,124.6588,"())*++,-../*, )0 +-)0)+*.("

Thanks,

Robert

]]> Thu, 14 May 2009 09:47:04 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5073.html#5073 <![CDATA[Extracting text problem : Hi!I would be careful about...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5072.html#5072 Author: Ingo
Subject: 1085
Posted: 14 May 09 at 8:51AM

Hi!

I would be careful about the versions of PrimoPDF. They are using the ghostscript-library and with older versions (before 8.15) QuickPDF still has problems while extracting! Your pdf was made with PrimoPDF and ghostscript-version 8.50 ... so this is okay. Looking in the extracted text i can find many variables beginning with "@" ... so i think basically it's working.
Adobe Reader (8.1) and Foxit (3.0) can't find "@sometext", too.
Is it a special moment while adding "@sometext" to the content?
How do you do this?
Any code parts for us here to check?

Cheers, Ingo

Edited by Ingo - 14 May 09 at 8:53AM]]> Thu, 14 May 2009 08:51:06 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5072.html#5072 <![CDATA[Extracting text problem : Hi Ingo, here is a sample pdf...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5071.html#5071 Author: RobertN
Subject: 1085
Posted: 14 May 09 at 8:39AM

Hi Ingo,

here is a sample pdf file with the "@sometext" in it.

http://www.mediafire.com/file/wzmmoznzuwf/Temperature_Transmitter_Template.pdf

it was generated using excel and printed to PDF via PrimoPDF.

I have tried a few other printer drivers, but the result was the same.

I tried GetPageText() with 0,1,2,3,4 but all with the same result.

I can open it in Acrobat Reader and extract the text without a problem.

Thank you,

Robert

]]> Thu, 14 May 2009 08:39:30 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5071.html#5071 <![CDATA[Extracting text problem : Hi Robert!In your case i think...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5068.html#5068 Author: Ingo
Subject: 1085
Posted: 14 May 09 at 1:48AM

Hi Robert!

In your case i think the content of "@some..." will be single strings/words ...
So it should be better to use GetPageText(4).

Perhaps it's possible for you to send me a sample of your files and then i'll try to extract the strings with "@some..."?

ingo [ dot ] schmoekel ( at ) ewetel [ dot ] net

Cheers, Ingo

]]> Thu, 14 May 2009 01:48:35 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5068.html#5068 <![CDATA[Extracting text problem : I have created a simple form in...]]> http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5067.html#5067 Author: RobertN
Subject: 1085
Posted: 13 May 09 at 3:04PM

I have created a simple form in Excel with cells that have '@VariableName'

in them. I print to PDF and then open the pdf using QuickPDF and delphi.

I want to scan the pdf for all text that has '@somevariablename' and get the fontsize,coordinates,etc and then convert them into formfields.

The purpose is to create a pdf form filler that i can save the results from.

I tried to do a GetPageText(3) but the results don't have any readable text. If I try a pdf with formfields i get the extracted text properly.

How do I extract this text ?

Thank you,

Robert

]]> Wed, 13 May 2009 15:04:27 +0000 http://www.quickpdf.org/forum/extracting-text-problem_topic1085_post5067.html#5067