Do you own a Debenu Quick PDF Library version 7, 8, 9, 10, 11, 12, 13 or iSEDQuickPDF license? Upgrade to Debenu Quick PDF Library 14 today!

Debenu Quick PDF Library - PDF SDK Community Forum Homepage
Forum Home Forum Home > For Users of the Library > I need help - I can help
  New Posts New Posts RSS Feed - Extract web / email links
  FAQ FAQ  Forum Search   Register Register  Login Login

Extract web / email links

 Post Reply Post Reply
Author
Message
ZarkoGajic View Drop Down
Beginner
Beginner
Avatar

Joined: 18 Mar 09
Location: Croatia
Status: Offline
Points: 19
Post Options Post Options   Thanks (0) Thanks(0)   Quote ZarkoGajic Quote  Post ReplyReply Direct Link To This Post Topic: Extract web / email links
    Posted: 18 May 10 at 2:01PM
Hi,

What would be the easiest way to extract web links like "http://" or "www.site.com" and email links like "mailto:mail@domain.com" or "mail@domain.com" from an existing PDF document?

The GetAnnotStrProperty(111) would retrieve the annotation link value.

I am looking for a way to extract those "web-like" links that a PDF reader would represent as web links and ask to open a web browser or start the default email client.

-zarko
-zarko gajic
Back to Top
Ingo View Drop Down
Moderator Group
Moderator Group
Avatar

Joined: 29 Oct 05
Status: Offline
Points: 3524
Post Options Post Options   Thanks (0) Thanks(0)   Quote Ingo Quote  Post ReplyReply Direct Link To This Post Posted: 18 May 10 at 3:31PM
Hi Zarko!

You need this page:
http://www.quickpdflibrary.com/help/quickpdf/AnnotationsAndHotspotLinks.php
Additional there was an older newsletter from Rowan or Karl explaining how to separate these links.
I think you should go on the official supportpages. There you'll find these things.

Cheers, Ingo

Back to Top
ZarkoGajic View Drop Down
Beginner
Beginner
Avatar

Joined: 18 Mar 09
Location: Croatia
Status: Offline
Points: 19
Post Options Post Options   Thanks (0) Thanks(0)   Quote ZarkoGajic Quote  Post ReplyReply Direct Link To This Post Posted: 18 May 10 at 3:35PM
Ingo,

Thanks. I'm aware of the Annotations related function.

I'm looking for the best way to extract text and look for "web-alike"  patterns :)


-zarko gajic
Back to Top
dsola View Drop Down
Team Player
Team Player


Joined: 28 Oct 05
Location: Croatia
Status: Offline
Points: 34
Post Options Post Options   Thanks (0) Thanks(0)   Quote dsola Quote  Post ReplyReply Direct Link To This Post Posted: 27 May 10 at 3:57PM
Hi,

brute force ?
GetPageText or direct access equivalent method and then search.
If all links have same font or colour search would be simple.

Pozdrav iz Nove

Davor
registered QuickPDF user
Back to Top
ZarkoGajic View Drop Down
Beginner
Beginner
Avatar

Joined: 18 Mar 09
Location: Croatia
Status: Offline
Points: 19
Post Options Post Options   Thanks (0) Thanks(0)   Quote ZarkoGajic Quote  Post ReplyReply Direct Link To This Post Posted: 27 May 10 at 4:09PM
Davore, thanks ... that's how it was done :)
-zarko gajic
Back to Top
Ingo View Drop Down
Moderator Group
Moderator Group
Avatar

Joined: 29 Oct 05
Status: Offline
Points: 3524
Post Options Post Options   Thanks (0) Thanks(0)   Quote Ingo Quote  Post ReplyReply Direct Link To This Post Posted: 27 May 10 at 6:10PM
Hi!

I don't think that GetPageText will work in every case.
With GetPageText you'll get things like "please klick on this link to enter the website"
but you won't get the real link behind.

You can do this things by yourself, too.
I've made an Unencryption and search for things like http and so on in the real file-content.

Cheers, Ingo
 
Back to Top
 Post Reply Post Reply
  Share Topic   

Forum Jump Forum Permissions View Drop Down

Forum Software by Web Wiz Forums® version 11.01
Copyright ©2001-2014 Web Wiz Ltd.

Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved. AboutContactBlogSupportOnline Store