Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!
![]() |
GetPageText get repeat char |
Post Reply ![]() |
Author | |
purple ![]() Beginner ![]() Joined: 24 Aug 12 Status: Offline Points: 7 |
![]() ![]() ![]() ![]() ![]() Posted: 25 Mar 13 at 10:41AM |
Hi, I'v got a file from customer, using qp.GetPageText to get text, each word is repeat all char 3 times, and copy text from adobe reader is ok. Have any ideas? |
|
![]() |
|
AndrewC ![]() Moderator Group ![]() ![]() Joined: 08 Dec 10 Location: Geelong, Aust Status: Offline Points: 841 |
![]() ![]() ![]() ![]() ![]() |
Some PDF libraries use a Normal font and draw it 3 or 4 times at a small offset to simulate Bold font. This is the most likely reason for multiple repeated character.
You should try calling either/or QP.SetTextExtractionOptions(7, 1); and / or QP.SetTextExtractionOptions(8, 1); or Andrew. |
|
![]() |
|
purple ![]() Beginner ![]() Joined: 24 Aug 12 Status: Offline Points: 7 |
![]() ![]() ![]() ![]() ![]() |
The version I use is 8.16, seems this option 7 and 8 is new feature in 9.xx?
I 'v got another file, extract mess text like 'ÁÃÃÖäÕã âäÔÔÁÙè', the origin text is 'ACCOUNT SUMMARY', can be copy from adobe reader and that is OK. can this be fix? Thanks
purple.
|
|
![]() |
Post Reply ![]() |
|
Tweet
|
Forum Jump | Forum Permissions ![]() You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. About — Contact — Blog — Support — Online Store