Debenu Quick PDF Library - PDF SDK Community Forum Homepage
Encryption in GetPageText?

jwinkl (Beginner)
Posted: 01 May 14 at 4:53PM
I'm completely at a loss with GetPageText. When I call it with parameter 2, the first line I get for page 1 of Debenu's own "GettingStarted.pdf" document is

250.53,715.64,#000000,12,"GBONLS+Verdana [Bold]","Delphi Edition"

which is fairly clear to me. With parameter 3, the same line comes back as

"GBONLS+Verdana [Bold]",#000000,12,250.5258,128.7699,344.7498,128.7699,344.7498,114.1899,250.5258,114.1899,"'HOSKL (GLWLRQ"

so it seems that "Delphi Edition" is somehow (not very intelligently) encrypted to "'HOSKL (GLWLRQ".
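
Comparing the two strings character by character suggests a fixed shift rather than real encryption: every non-space character in the garbled text sits exactly 29 code points below the expected one ('D' is 68 and ''' is 39, 'e' is 101 and 'H' is 72, and so on). A minimal Python sketch of that observation follows. It assumes the offset of 29 seen in this single sample holds and that spaces are passed through unchanged; both may fail for other fonts or documents. `shift_text` is a hypothetical helper, not part of Quick PDF Library.

```python
def shift_text(text: str, offset: int = 29) -> str:
    """Shift every non-space character up by `offset` code points.

    The default of 29 is only what this one sample shows; it is an
    assumption, not a documented property of the library or the font.
    """
    return "".join(ch if ch == " " else chr(ord(ch) + offset) for ch in text)


garbled = "'HOSKL (GLWLRQ"
print(shift_text(garbled))  # prints "Delphi Edition"
```

This is a diagnostic aid for spotting the pattern, not a substitute for correct font decoding by the library.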

I would be obliged if anyone could tell me why this happens and how the clear text can be retrieved. For my purposes parameter 3 is mandatory, because I need the bounding rectangle of the text as well as the text itself.
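
To illustrate what is needed from the parameter 3 output, here is a rough Python sketch that splits one such line and derives a bounding rectangle. The field layout (font name, color, size, four corner points as x,y pairs, then the text) is inferred from the sample line above and is an assumption, as is the helper name `parse_option3_row`.

```python
import csv


def parse_option3_row(row: str) -> dict:
    """Parse one CSV line of the form seen in the sample above.

    Assumed layout: font, color, size, eight coordinates
    (four x,y corner points), text.
    """
    fields = next(csv.reader([row]))
    coords = [float(v) for v in fields[3:11]]
    xs, ys = coords[0::2], coords[1::2]  # even indices are x, odd are y
    return {
        "font": fields[0],
        "color": fields[1],
        "size": float(fields[2]),
        # (left, bottom, right, top) as the min/max of the corner points
        "bounds": (min(xs), min(ys), max(xs), max(ys)),
        "text": fields[11],
    }


sample = ('"GBONLS+Verdana [Bold]",#000000,12,250.5258,128.7699,344.7498,'
          '128.7699,344.7498,114.1899,250.5258,114.1899,"\'HOSKL (GLWLRQ"')
result = parse_option3_row(sample)
print(result["bounds"])
print(result["text"])
```

Taking the min and max of the corner points gives an axis-aligned rectangle; for rotated text the four corners themselves would be the more faithful bounds.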

I'm using version 10.13

AndrewC (Moderator Group)
Posted: 02 May 14 at 9:59AM
jwinkl,

Could you please send the PDF file to support@debenu.com?

Text extraction is complex and it looks like this PDF is using encoding tables.  It is strange that Option 2 is working better than Option 3 as it is usually the other way around.  Option 3 does a lot more work and can extract quite complex encoded fonts.

Andrew.