<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Debenu Quick PDF Library - PDF SDK Community Forum : Extracting text from different charsets</title>
  <link>http://www.quickpdf.org/forum/</link>
  <description><![CDATA[This is an XML content feed of: Debenu Quick PDF Library - PDF SDK Community Forum : I need help - I can help : Extracting text from different charsets]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Sat, 04 Apr 2026 23:17:52 +0000</pubDate>
  <lastBuildDate>Tue, 22 May 2012 12:21:31 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.01</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>www.quickpdf.org/forum/RSS_post_feed.asp?TID=2271</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Debenu Quick PDF Library - PDF SDK Community Forum]]></title>
   <url>http://www.quickpdf.org/forum/forum_images/QPDF_Forum_Title.png</url>
   <link>http://www.quickpdf.org/forum/</link>
  </image>
  <item>
   <title><![CDATA[Extracting text from different charsets : Can you try using ExtractPageText(3)...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-from-different-charsets_topic2271_post9628.html#9628</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1483">AndrewC</a><br /><strong>Subject:</strong> 2271<br /><strong>Posted:</strong> 22 May 12 at 12:21PM<br /><br />Can you try using ExtractPageText(3) or ExtractPageText(7) to see whether the text is extracted correctly? Option 0 is a very fast extraction method, but it is not aware of font encodings.<div><br></div><div>In QPL 8.xx we have added option 8, which outputs the text in the same format as option 0 but can handle various font encodings and mappings.</div><div><br></div><div>Andrew.</div>]]>
   </description>
   <pubDate>Tue, 22 May 2012 12:21:31 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-from-different-charsets_topic2271_post9628.html#9628</guid>
  </item> 
  <item>
   <title><![CDATA[Extracting text from different charsets : Hi. I'm using v.7.26 under...]]></title>
   <link>http://www.quickpdf.org/forum/extracting-text-from-different-charsets_topic2271_post9621.html#9621</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=1944">miguele</a><br /><strong>Subject:</strong> 2271<br /><strong>Posted:</strong> 21 May 12 at 11:20AM<br /><br />Hi. I'm using v.7.26 under Delphi XE to extract text from PDF files in different charsets (usually with accented characters). "ExtractFilePageText" with option 0 usually reads the text accurately, but for some older PDF files the accented characters are extracted incorrectly. Is there a way to prevent this? Can you provide some sample code?<div><br></div><div>Thanks!</div>]]>
   </description>
   <pubDate>Mon, 21 May 2012 11:20:55 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/extracting-text-from-different-charsets_topic2271_post9621.html#9621</guid>
  </item> 
 </channel>
</rss>