This code is a raw OCR engine. It has NO PAGE LAYOUT ANALYSIS, NO OUTPUT FORMATTING, and NO UI. It can only process an image of a single column and create text from it. It can detect fixed pitch vs. proportional text. Having said that, in 1995 this engine was in the top 3 in terms of character accuracy, and it compiles and runs on both Linux and Windows. Another current limitation is that it only recognizes English, and its character set is US-ASCII only. Training code is NOT included in the open source release, however, and will be included in a future release.

TODO:
Compiles fine, but dumps core on NetBSD-4.99.3/amd64. Backtrace:

Program terminated with signal 11, Segmentation fault.
#0 0x00000000004c1c70 in reverse32 ()
(gdb) bt
#0 0x00000000004c1c70 in reverse32 ()
#1 0x00000000004aed12 in read_squished_dawg ()
#2 0x00000000004aaded in init_permute ()
#3 0x0000000000485779 in program_editup ()
#4 0x0000000000485869 in start_recog ()
#5 0x0000000000403d04 in init_tesseract ()
#6 0x000000000040309b in main ()
(gdb)
@comment $NetBSD: PLIST,v 1.1.1.1 2006/10/27 22:30:56 thomasklausner Exp $
bin/tesseract
libexec/tessdata/DangAmbigs
libexec/tessdata/blackText.params
libexec/tessdata/configs/api_config
libexec/tessdata/configs/api_resaljet
libexec/tessdata/configs/box.train
libexec/tessdata/configs/inter
libexec/tessdata/configs/oldapi_config
libexec/tessdata/configs/oldbox.train
libexec/tessdata/configs/var_api_config
libexec/tessdata/configs/var_box.train
libexec/tessdata/configs/variable_config
libexec/tessdata/confsets
libexec/tessdata/fmtable.cls
libexec/tessdata/fnetwts
libexec/tessdata/freq-dawg
libexec/tessdata/inttemp
libexec/tessdata/netwts
libexec/tessdata/newdiff.asccodes
libexec/tessdata/normproto
libexec/tessdata/pffmtable
libexec/tessdata/soptable.cls
libexec/tessdata/tessconfigs/batch
libexec/tessdata/tessconfigs/matdemo
libexec/tessdata/tessconfigs/old_batch
libexec/tessdata/tessconfigs/segdemo
libexec/tessdata/tessconfigs/var_batch
libexec/tessdata/test_matrix
libexec/tessdata/user-words
libexec/tessdata/word-dawg
libexec/tesseract
share/doc/tesseract/README
@dirrm share/doc/tesseract
@dirrm libexec/tessdata/tessconfigs
@dirrm libexec/tessdata/configs
@dirrm libexec/tessdata
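
The packing list above mixes installed file paths with `@comment` and `@dirrm` directives, and lists each directory to be removed before its parent. The following is a minimal Python sketch of how such a list could be read and sanity-checked. The helper names `parse_plist` and `dirrm_order_ok` are hypothetical and not part of pkgsrc, and the sketch only understands the two directives that appear in this PLIST; the sample is an abbreviated copy of the list above.

```python
def parse_plist(text):
    """Split PLIST text into installed files and @dirrm directories.

    Hypothetical helper: handles only @comment and @dirrm, the two
    directives present in this packing list. Other @-directives are skipped.
    """
    files, dirs = [], []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("@comment"):
            continue
        if line.startswith("@dirrm "):
            dirs.append(line.split(None, 1)[1])
        elif not line.startswith("@"):
            files.append(line)
    return files, dirs


def dirrm_order_ok(dirs):
    """Return True if no directory is listed before one of its subdirectories."""
    for i, parent in enumerate(dirs):
        prefix = parent.rstrip("/") + "/"
        if any(later.startswith(prefix) for later in dirs[i + 1:]):
            return False
    return True


# Abbreviated sample taken from the packing list above.
SAMPLE = """\
@comment $NetBSD: PLIST,v 1.1.1.1 2006/10/27 22:30:56 thomasklausner Exp $
bin/tesseract
libexec/tessdata/configs/inter
libexec/tessdata/tessconfigs/batch
share/doc/tesseract/README
@dirrm share/doc/tesseract
@dirrm libexec/tessdata/tessconfigs
@dirrm libexec/tessdata/configs
@dirrm libexec/tessdata
"""

files, dirs = parse_plist(SAMPLE)
print(len(files), "files;", len(dirs), "directories to remove")
print("children removed before parents:", dirrm_order_ok(dirs))
```

The ordering check reflects why `libexec/tessdata` comes last in the list: its subdirectories `configs` and `tessconfigs` have to be removed first.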