Unicode text extraction? |
phildick | Beginner | Joined: 14 Oct 09 | Location: Poland | Status: Offline | Points: 2
Posted: 14 Oct 09 at 2:36PM
Welcome,
I need a PDF component for Delphi 2009 to extract text from PDF files. I installed and tested QuickPDF. I tried both the Delphi 2009 and ActiveX versions, and both extracted only ASCII text, without any international characters (Polish in my case). I am a little disappointed, especially because the feature list (http://www.quickpdflibrary.com/products/quickpdf/features.php) advertises "Full Unicode support". Is there any way I can extract the full text with all characters?
Best regards, Bartek
shimax | Beginner | Joined: 03 Oct 09 | Location: Japan | Status: Offline | Points: 6
Hello, Bartek
As discussed in
it seems that Unicode text extraction does not work as expected.
In my case as well, Japanese characters are not extracted at all.
I contacted support, but after a week I have received nothing beyond a confirmation that they received my email. So I think that implementing Unicode support is either a very difficult task for some reason, or they are busy with other problems or with developing new features.
Unicode-related problems in QuickPDF seem to affect not only text extraction but other features as well. Regrettably, the claim of full Unicode support does not hold, at least as of version 7.16.
Wheeley | Senior Member | Joined: 30 Oct 05 | Location: United States | Status: Offline | Points: 146
The next release should have more support for Unicode. I was told they are removing the ToPDFUnicode function. If they do that, then Unicode support must be getting enhanced in some other way.
Wheeley
Michel_K17 | Newbie | www.exp-systems.com | Joined: 25 Jan 03 | Status: Offline | Points: 297
I have received the same assurances. Debenu has been very good at addressing specific issues as we bring them up. On the Unicode front, at least we can now save and merge PDF files with Unicode characters in the path.
Support for Unicode characters in the metadata is coming with the next beta (which is what I was waiting for). For text extraction, I don't know.
Michel
![]() |
|
Ingo ![]() Moderator Group ![]() ![]() Joined: 29 Oct 05 Status: Offline Points: 3529 |
![]() ![]() ![]() ![]() ![]() |
Hi All!
QuickPDF is a very complete and extensive library, and Unicode support has to touch nearly all of its modules. So please be a bit patient. I'm pretty sure that it's only a matter of time ;-)
Cheers, Ingo
phildick | Beginner | Joined: 14 Oct 09 | Location: Poland | Status: Offline | Points: 2
Hi Ingo,
Maybe it is, but I installed the demo modules in my Delphi 2009 (which is fully Unicode now), and all the string parameters are declared as AnsiString, not String. Even if that is for backward compatibility, which I completely understand, there could be a "wide string" version of every string routine, as was done in the Windows API years ago. BTW, the last non-Unicode Windows OS was released in 2000 (Windows ME), so it has been almost ten years since.
Furthermore, I imported the ActiveX version, in which all parameters are passed as WideString (so it should be fully Unicode), and it produced the same result as before: only ANSI characters in the extracted text.
Best regards, Bartek
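The character loss described in this post can be reproduced outside QuickPDF. The following is only a minimal Python sketch of what happens when Unicode text is narrowed to a single-byte "ANSI" code page, which is what an AnsiString parameter implies. It is not QuickPDF code and says nothing about how the library works internally. The helper names `through_ansi` and `through_wide` and the sample strings are made up for illustration, and the two code pages (`cp1250`, `cp1252`) are just examples of Windows ANSI code pages.

```python
# Illustration only: simulates narrowing Unicode text to a single-byte
# "ANSI" code page. The helpers below are hypothetical and are not part
# of QuickPDF or Delphi.

def through_ansi(text: str, codepage: str) -> str:
    """Encode text to a single-byte code page and decode it back.

    Characters the code page cannot represent come back as '?'.
    """
    return text.encode(codepage, errors="replace").decode(codepage)


def through_wide(text: str) -> str:
    """Round-trip text through UTF-16, the encoding used by WideString."""
    return text.encode("utf-16-le").decode("utf-16-le")


polish = "Zażółć gęślą jaźń"
japanese = "日本語"

# Central European code page: Polish survives, Japanese does not.
print(through_ansi(polish, "cp1250"))
print(through_ansi(japanese, "cp1250"))

# Western European code page: most Polish letters are lost.
print(through_ansi(polish, "cp1252"))

# UTF-16 keeps both strings intact.
print(through_wide(polish), through_wide(japanese))
```

Whether text survives a narrow string type therefore depends on the active code page, while a UTF-16 string type carries all of it. If a WideString interface still returns only ANSI characters, as reported above, one possible explanation is that the narrowing happens somewhere inside the library rather than at the parameter boundary, but the thread does not confirm this.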
Copyright © 2017 Debenu. Debenu Quick PDF Library is a PDF SDK. All rights reserved.