text extraction
rajeev
Beginner
Joined: 09 Nov 10 Location: India Status: Offline Points: 1
Topic: text extraction
Posted: 10 Nov 10 at 2:49PM
Hi,
I used PHP to successfully read lines from a PDF file of a newspaper. The problem is that it reads the text only character by character or word by word. I would like to read the file paragraph by paragraph. Is there any way to do this?
I was also able to extract the images from the PDF, but I also need the coordinates where each image was placed.
Any help will be appreciated.
Ingo
Moderator Group
Joined: 29 Oct 05 Status: Offline Points: 3530
Posted: 10 Nov 10 at 7:32PM
Hi!
With QuickPDF you can extract text from a PDF word by word, string by string, and/or page by page. Have a look at the online reference, accessible via www.quickpdf.org. You can get the image coordinates via the relevant media boxes: read the PDF with QuickPDF, decrypt it, and then read the actual content (like looking into the PDF with Notepad).
Cheers and welcome here,
Ingo
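Since the library hands text back word by word, one way to get paragraphs is to rebuild them from the word positions yourself. Below is a minimal sketch of that idea, not QuickPDF code: it assumes you have already turned the extracted words into records holding the text, the left edge, a vertical position measured from the top of the page, and a text height. The `Word` type, the `group_paragraphs` function, and both thresholds are hypothetical names and values chosen for illustration. It is written in Python for brevity; the same logic should port to PHP directly.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Word:
    """Hypothetical word record; fill it from whatever your extractor returns."""
    text: str
    x: float       # left edge of the word
    y: float       # vertical position, measured from the top of the page
    height: float  # text height, used to scale the thresholds


def group_paragraphs(words: List[Word],
                     line_tol: float = 0.5,
                     para_gap: float = 1.5) -> List[str]:
    """Group words into lines, then lines into paragraphs.

    line_tol and para_gap are multiples of the text height and are
    assumptions that need tuning for a real document.
    """
    # Step 1: collect words whose vertical positions are close into lines.
    lines: List[List[Word]] = []
    for w in sorted(words, key=lambda w: (w.y, w.x)):
        if lines and abs(w.y - lines[-1][0].y) <= line_tol * w.height:
            lines[-1].append(w)
        else:
            lines.append([w])

    # Step 2: order each line left to right.
    for line in lines:
        line.sort(key=lambda w: w.x)

    # Step 3: start a new paragraph when the gap between lines is large.
    paragraphs: List[List[str]] = []
    prev_y = None
    for line in lines:
        text = " ".join(w.text for w in line)
        y, height = line[0].y, line[0].height
        if prev_y is not None and (y - prev_y) <= para_gap * height:
            paragraphs[-1].append(text)
        else:
            paragraphs.append([text])
        prev_y = y

    return [" ".join(p) for p in paragraphs]
```

Note that this only works within a single column. A newspaper page usually has several, so you would first split the words by column (for example by their x range) and run the grouping on each column separately.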