<?xml version="1.0" encoding="utf-8" ?>
<?xml-stylesheet type="text/xsl" href="RSS_xslt_style.asp" version="1.0" ?>
<rss version="2.0" xmlns:WebWizForums="http://syndication.webwiz.co.uk/rss_namespace/">
 <channel>
  <title>Debenu Quick PDF Library - PDF SDK Community Forum : Image Legibility</title>
  <link>http://www.quickpdf.org/forum/</link>
  <description><![CDATA[This is an XML content feed of; Debenu Quick PDF Library - PDF SDK Community Forum : I need help - I can help : Image Legibility]]></description>
  <copyright>Copyright (c) 2006-2013 Web Wiz Forums - All Rights Reserved.</copyright>
  <pubDate>Wed, 20 May 2026 05:08:42 +0000</pubDate>
  <lastBuildDate>Tue, 18 Mar 2008 11:37:03 +0000</lastBuildDate>
  <docs>http://blogs.law.harvard.edu/tech/rss</docs>
  <generator>Web Wiz Forums 11.01</generator>
  <ttl>360</ttl>
  <WebWizForums:feedURL>www.quickpdf.org/forum/RSS_post_feed.asp?TID=878</WebWizForums:feedURL>
  <image>
   <title><![CDATA[Debenu Quick PDF Library - PDF SDK Community Forum]]></title>
   <url>http://www.quickpdf.org/forum/forum_images/QPDF_Forum_Title.png</url>
   <link>http://www.quickpdf.org/forum/</link>
  </image>
  <item>
   <title><![CDATA[Image Legibility : Thanks for the reply. I came-up...]]></title>
   <link>http://www.quickpdf.org/forum/image-legibility_topic878_post4199.html#4199</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=312">bb46970</a><br /><strong>Subject:</strong> 878<br /><strong>Posted:</strong> 18 Mar 08 at 11:37AM<br /><br />Thanks for the reply.&nbsp; I came-up with an option that may work.&nbsp; I believe that they do scan the documents as 1-bit.&nbsp; However, I went with a grey scale option.&nbsp; I use scanline, to examine each pixel.&nbsp; I add the red, green, and blue values, for each pixel.&nbsp; If it falls below 50%, I assume that it is "dark."&nbsp; If necessary, I can adjust the 50%.&nbsp; I keep a tally of all of the dark pixels.&nbsp; Then I set a threshold for the page.&nbsp; For example, if 45% of the&nbsp;page is dark pixels, I flag it as "suspect."&nbsp; I do that for each page in the document.&nbsp; My only concern is finding suspect pages, for a human to examine, and determine if the document needs to be rescanned.<DIV>&nbsp;</DIV><DIV>If anyone has a better option - particularly faster - I am open to it.&nbsp; Some of these documents are hundreds of pages long, and I may have to process hundreds of documents at a time.</DIV>]]>
   </description>
   <pubDate>Tue, 18 Mar 2008 11:37:03 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/image-legibility_topic878_post4199.html#4199</guid>
  </item> 
  <item>
   <title><![CDATA[Image Legibility : First of all, your problem is...]]></title>
   <link>http://www.quickpdf.org/forum/image-legibility_topic878_post4198.html#4198</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=589">peteratoce</a><br /><strong>Subject:</strong> 878<br /><strong>Posted:</strong> 18 Mar 08 at 8:49AM<br /><br />First of all, your problem is more in the field of image processing than in PDF handling. Further, it is almost impossible to generate a good image from a really bad scan. <DIV>&nbsp;</DIV><DIV>That said, you can have a look at e.g. PixEdit, which allows you to load PDFs and offers a COM interface to do all kinds of operations on your (hopefully monochrome?) images. </DIV><DIV>If you want to identify images that are too dark you probably would have to look at a region in the margin that should be white = without pixels.&nbsp;</DIV><DIV>&nbsp;</DIV><DIV>There is unfortunately no API function in PixEdit that returns the &nbsp;number of black pixels in a given area, but you can excerpt an area to file and then perhaps turn to ImageMagick for the counting of black pixels.</DIV><DIV>On second thought, simply save to format "Uncompressed, No header", read in the bytes and count the number of "1"-bits in each byte yourself.</DIV><DIV>&nbsp;</DIV><DIV>Peter</DIV><span style="font-size:10px"><br /><br />Edited by peteratoce - 18 Mar 08 at 9:10AM</span>]]>
   </description>
   <pubDate>Tue, 18 Mar 2008 08:49:31 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/image-legibility_topic878_post4198.html#4198</guid>
  </item> 
  <item>
   <title><![CDATA[Image Legibility : I have a client who scans text...]]></title>
   <link>http://www.quickpdf.org/forum/image-legibility_topic878_post4192.html#4192</link>
   <description>
    <![CDATA[<strong>Author:</strong> <a href="http://www.quickpdf.org/forum/member_profile.asp?PF=312">bb46970</a><br /><strong>Subject:</strong> 878<br /><strong>Posted:</strong> 14 Mar 08 at 5:32PM<br /><br />I have a client who scans text documents.&nbsp; They do not perform OCR.&nbsp; They just save the text as images, in the PDFs.&nbsp; Sometimes the people scanning do poor jobs, resulting in some pages that are really dark or black.&nbsp; I am looking for a way to programmatically check the pages and see if any of the pages are "suspect."]]>
   </description>
   <pubDate>Fri, 14 Mar 2008 17:32:31 +0000</pubDate>
   <guid isPermaLink="true">http://www.quickpdf.org/forum/image-legibility_topic878_post4192.html#4192</guid>
  </item> 
 </channel>
</rss>