Commit graph

8 commits

Author SHA1 Message Date
joerg
3769fa0bfc Add a number of includes hidden by libstdc++'s name space pollution. 2013-04-29 21:31:09 +00:00
marino
f50901c296 graphics/tesseract: #include <unistd.h>
Fixes out-of-scope errors seen on gcc 4.7.x
2012-11-23 23:52:33 +00:00
dholland
2313ef244d Add missing <stdio.h>, should fix or improve linux build 2011-11-14 02:44:40 +00:00
wiz
d417e89789 Update to 2.04. Set LICENSE.
June 30 2009 - V2.04
	  Integrated bug fixes and patches and misc changes for portability.
	  Integrated a patch to remove some of the "access" macros.
	  Removed dependence on lua from the viewer, speeding it up
	  dramatically.
	  Fixed the viewer so it compiles and runs properly!
	  Specifically fixing issues: 1, 63, 67, 71, 76, 81, 82, 106, 111,
	  112, 128, 129, 130, 133, 135, 142, 143, 145, 147, 153, 154, 160,
	  165, 170, 175, 177, 187, 192, 195, 199, 201, 205, 209, 108, 169
2009-07-22 20:57:47 +00:00
wiz
4358c8cac0 Replace patch-ab with a post-extract rule. No change to the binary package,
just one file less in pkgsrc ;)
2008-10-30 22:12:59 +00:00
wiz
b4a554e958 Update to 2.03:
January 23 2008 - V2.02
          Improvements to clustering, training and classifier.
          Major internationalization improvements for large-character-set
          languages, eg Kannada.
          Removed some compiler warnings.
          Added multipage tiff support for training and running.
          Updated graphics output to talk to new java-based viewer.
          Added ability to save n-best lists.
          Added leptonica support for more file types.
          Improved Init/End to make them safe.
          Reduced memory use of dictionaries.
          Added some new APIs to TessBaseAPI.
April 21 2008 - V2.02 (again)
          Fixed namespace collisions with jpeg library (INT32).
          Portability fixes for Windows for new code.
          Updates to autoconf system for new code.
April 22 2008 - V2.03
          Fixed crash introduced in 2.02.
	  Fixed lack of tessembedded.cpp in distribution.
	  Added test for leptonica header files and conditional test for lib.
2008-05-30 13:06:26 +00:00
wiz
1da043e250 Update to 2.00, provided by Rumko on pkgsrc-users.
July 02 2007 - V2.00
	  Converted internal character handling to UTF8.
	  Trained with 6 languages.
	  Added unicharset_extractor, wordlist2dawg.
	  Added boxfile creation mode.
	  Added UNLV regression test capability.
	  Fixed problems with copyright and registered symbols.
	  Fixed extern "C" declarations problem.
2007-07-28 01:02:14 +00:00
wiz
e899e6021c Initial import of tesseract-1.04b from pkgsrc-wip (packaged by heinz@
and myself):

This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO
OUTPUT FORMATTING, and NO UI. It can only process an image of a
single column and create text from it. It can detect fixed pitch
vs proportional text.  Having said that, in 1995, this engine was
in the top 3 in terms of character accuracy, and it compiles and
runs on both Linux and Windows. Another current limitation is that
it only recognizes English and its character set is only US-ASCII.
Training code IS included in the open source release however, and
will be included in a future release.
2007-05-18 06:39:27 +00:00