How to optimize text extraction?
Dmitry (Team Player, joined 21 Sep 06)
Posted: 11 Mar 07 at 3:40AM
Hi all!
I have a question: how can I reduce the execution time of the function GetPageText? It averages about one second per page, which is too long for me :-) If I understand correctly, the qPDF library also extracts all images from the page during text extraction. Would it be faster not to extract the images and save them to the hard drive?
marian_pascalau (Debenu Quick PDF Library Expert, joined 28 Mar 06, Germany)
Dmitry,
there is only one way to influence the text extraction: the Option parameter.
As you may know, it accepts five values:
0: contents scan
1: internally the same as 0
2: contents scan, CSV output
3: CSV text collection with rendering (may read image dictionary)
4: CSV text collection with rendering and word separation.
For your information, using options 0-2 may bring some improvement.
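One way to check which option is fastest on your own documents is to time each one. Below is a minimal Python sketch, assuming a hypothetical `extract(option)` callable that wraps your own GetPageText call for the page under test. The helper name `time_options` and the callable are illustrative and not part of the library.

```python
import time
from typing import Callable, Dict, Iterable


def time_options(
    extract: Callable[[int], str],
    options: Iterable[int],
    repeats: int = 3,
) -> Dict[int, float]:
    """Return the best wall-clock time in seconds observed for each option.

    `extract` is a hypothetical wrapper around your own GetPageText call;
    it receives the option value and returns the extracted text.
    """
    results: Dict[int, float] = {}
    for option in options:
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            extract(option)  # the call being measured
            best = min(best, time.perf_counter() - start)
        results[option] = best
    return results
```

Taking the best of several runs reduces the influence of disk caching and other one-off delays, so the comparison between options is more stable.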
Dmitry (Team Player, joined 21 Sep 06)
marian_pascalau, yes, I know. But I need exactly parameter 5.
Ingo (Moderator Group, joined 29 Oct 05)
"... qPDF library extract also all images from page ..."
Hi! The current library version no longer extracts the images.
Best regards, Ingo
marian_pascalau (Debenu Quick PDF Library Expert, joined 28 Mar 06, Germany)
Hi Dmitry, hi Ingo,
I cannot follow either of you:
Dmitry, what do you mean by parameter 5?
Ingo, is it now working as expected, or is this an error?
Marian
Ingo (Moderator Group, joined 29 Oct 05)
Hi Marian!
It's working as expected. I think this was fixed months ago. Here's a thread pointing in the same direction: http://www.quickpdf.org/forum/search_results_posts.asp?SearchID=20070312070924&KW=asachoi
Best regards, Ingo
Dmitry (Team Player, joined 21 Sep 06)
marian_pascalau, sorry, I meant parameter 4.
Ingo, please just give me a direct link to the thread.
marian_pascalau (Debenu Quick PDF Library Expert, joined 28 Mar 06, Germany)
Dmitry, if you would consider a sponsorship, I will try to optimize the text extraction (Option=4) for you. Otherwise you should use option 2 and split the text with your own program.
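As a rough illustration of splitting the text yourself, here is a minimal Python sketch. It assumes the option 2 output is ordinary comma-separated rows whose last field holds the text; the thread does not describe the actual column layout, so check it against real output and adjust `text_column`. The function name `words_from_csv` is hypothetical.

```python
import csv
import io
from typing import List


def words_from_csv(csv_text: str, text_column: int = -1) -> List[str]:
    """Split the text field of each CSV row into whitespace-separated words.

    Assumption: the text is stored in `text_column` (the last field by
    default). The real layout of the library's CSV output may differ.
    """
    words: List[str] = []
    for row in csv.reader(io.StringIO(csv_text)):
        if not row:  # skip blank lines
            continue
        words.extend(row[text_column].split())
    return words
```

Using the `csv` module rather than a plain `split(",")` keeps quoted fields that contain commas intact before the words are separated.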
Dmitry (Team Player, joined 21 Sep 06)
marian_pascalau,
no, thanks.