Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!

Debenu Quick PDF Library - PDF SDK Community Forum Homepage
Forum Home Forum Home > For Users of the Library > I need help - I can help
  New Posts New Posts RSS Feed - full line text extraction
  FAQ FAQ  Forum Search   Register Register  Login Login

full line text extraction

 Post Reply Post Reply
Author
Message
alinux View Drop Down
Team Player
Team Player


Joined: 09 Dec 08
Location: France
Status: Offline
Points: 20
Post Options Post Options   Thanks (0) Thanks(0)   Quote alinux Quote  Post ReplyReply Direct Link To This Post Topic: full line text extraction
    Posted: 25 Nov 10 at 10:39AM
Hi,

I'm using text extraction functions (activex v7.22 b2) to extract words with coordinates.
I'll need to extract full-line text with line coordinates. Do I have any solution for doing this?

Alin
Back to Top
Ingo View Drop Down
Moderator Group
Moderator Group
Avatar

Joined: 29 Oct 05
Status: Offline
Points: 3524
Post Options Post Options   Thanks (0) Thanks(0)   Quote Ingo Quote  Post ReplyReply Direct Link To This Post Posted: 25 Nov 10 at 3:34PM
Hi Alin!

QuickPDF doesn't have this option but it will support you...
You can extract complete pages, strings like they were inserted and single words.
Take the word-option and concatenate the complete lines regarding the position data of each word.
If you have this algorithm completed please insert it here in the samples-section 'cause i need it, too ;-)

Cheers and thanks in advance,
Ingo



Edited by Ingo - 25 Nov 10 at 3:36PM
Back to Top
alinux View Drop Down
Team Player
Team Player


Joined: 09 Dec 08
Location: France
Status: Offline
Points: 20
Post Options Post Options   Thanks (0) Thanks(0)   Quote alinux Quote  Post ReplyReply Direct Link To This Post Posted: 26 Nov 10 at 8:11AM
Hi Ingo,

Thanks for you answer.  I'll post the algo soon; I have some problems with the tables (the OCR engine "read" tables by line/column about a random?! criteria).

Alin
Back to Top
alinux View Drop Down
Team Player
Team Player


Joined: 09 Dec 08
Location: France
Status: Offline
Points: 20
Post Options Post Options   Thanks (0) Thanks(0)   Quote alinux Quote  Post ReplyReply Direct Link To This Post Posted: 26 Nov 10 at 8:04PM
Hi Ingo,

I've posted a basic sample (see code sample) of text assembling lines based on Y1 or Y2 coordinate of words (GetPageText(4) function); for a more accurate result, I think that it'll need a control variable for Y coordinate different values for the words of the same line.

Cheers,
Alin

Back to Top
 Post Reply Post Reply
  Share Topic   

Forum Jump Forum Permissions View Drop Down

Forum Software by Web Wiz Forums® version 11.01
Copyright ©2001-2014 Web Wiz Ltd.

Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. AboutContactBlogSupportOnline Store