Problems found with existing digests:
Package fotoxx distfile fotoxx-14.03.1.tar.gz
ac2033f87de2c23941261f7c50160cddf872c110 [recorded]
118e98a8cc0414676b3c4d37b8df407c28a1407c [calculated]
Package ploticus-examples distfile ploticus-2.00/plnode200.tar.gz
34274a03d0c41fae5690633663e3d4114b9d7a6d [recorded]
da39a3ee5e6b4b0d3255bfef95601890afd80709 [calculated]
Problems found locating distfiles:
Package AfterShotPro: missing distfile AfterShotPro-1.1.0.30/AfterShotPro_i386.deb
Package pgraf: missing distfile pgraf-20010131.tar.gz
Package qvplay: missing distfile qvplay-0.95.tar.gz
Otherwise, existing SHA1 digests verified and found to be the same on
the machine holding the existing distfiles (morden). All existing
SHA1 digests retained for now as an audit trail.
- Grayscale native support (PGM format), plus four binarization methods.
- Internal preprocessor including deskewing, balancing, thresholding and
interpolation (by Giulio Lunati).
- Various new features: border path computing, barcode search,
detection of extremities, PAGE only mode, the flea, the spyhole,
instant threshold, per-depth optimized X code, etc.
- Documentation updated (but not finished). Glossary added.
- Many bugfixes and interface enhancements.
by me. From the DESCR:
Clara OCR is a free (GPL) Optical Character Recognition (OCR) program
for systems that support the C library and the X windows system (e.g.
most flavours of Unix). The development platform of Clara OCR is
32-bit Intel running GNU/Linux.
Clara OCR is intended for large scale digitalization projects. It
features a powerful GUI and a web interface for cooperative
digitalization of books. Clara OCR development started in 1999 and
is approaching production quality.
Features:
Converts pbm/pgm image files to text (ISO-8859)
Can process scans in batch for large documents
Can run from the command-line
Is relatively easy to train
Non-features:
Is not "omnifont"; you must train it for each document
Does not scan the images
Does not support unicode
Cannot read handwriting