Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!
![]() |
DAExtractPageText losing characters |
Post Reply
|
| Author | |
Mike4ql
Beginner
Joined: 26 Jul 10 Status: Offline Points: 5 |
Post Options
Thanks(0)
Quote Reply
Topic: DAExtractPageText losing charactersPosted: 08 Oct 10 at 11:39AM |
|
I am trying to extract the text from a PDF and most of it works fine but occasionally letters are missed in the extract. This appears to be because the PDF is using octal codes for the characters. This is the text which should be produced and is rendered correctly by DARenderPageToString: Here is the command extract for this same section BT The DAExtractPageText (option 3) returns 2 lines with an empty string and a space (or perhaps 2) for the Top Line and misses out the "fl" from the begining of the Next Line. Is there any way I can correct this? |
|
![]() |
|
Mike4ql
Beginner
Joined: 26 Jul 10 Status: Offline Points: 5 |
Post Options
Thanks(0)
Quote Reply
Posted: 12 Oct 10 at 7:15PM |
|
Has nobody else seen this?
It seems to be a fundamental flaw preventing anyone from using PDF Quick to extract text from a PDF.
I would be grateful for any suggestions.
Mike
|
|
![]() |
|
Post Reply
|
|
|
Tweet
|
| Forum Jump | Forum Permissions ![]() You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. About — Contact — Blog — Support — Online Store