Tesseract OCR 3.0 安装记录

On 01/11/2012, in linux, by kilobug

下载:
1、安装包:Tesseract-3.01.tar.gz
2、语言包:Eng.traineddata.gz(这个是英文语言包)
3、图像分析库:Leptonica-1.68.tar.gz(版本必须>=1.67)

Ubuntu:

sudo apt-get install autoconf automake libtool
sudo apt-get install libpng12-dev
sudo apt-get install libjpeg62-dev
sudo apt-get install libtiff4-dev
sudo apt-get install zlib1g-dev

CentOS:

yum install gcc gcc-c++ autoconf automake libtool libpng libjpeg libtiff zlib-devel

解压Leptonica-1.68.tar.gz并进入目录:

./configure && make && make install
ln -s /usr/local/lib/liblept.* /usr/lib/
ln -s /usr/local/lib/liblept.* /usr/lib32/
ln -s /usr/local/lib/liblept.* /usr/lib64/
ln -s /usr/local/lib/liblept.* /lib/
ln -s /usr/local/lib/liblept.* /lib32/
ln -s /usr/local/lib/liblept.* /lib64/

解压Tesseract-3.01.tar.gz并进入目录
删除ccutil/strngs.h第一行第一个字符
执行以下操作:

sh autogen.sh
CPPFLAGS="-I/usr/local/include" LDFLAGS="-L/usr/local/lib" ./configure
make && make install
ln -s /usr/local/lib/libtesseract.* /usr/lib/
ln -s /usr/local/lib/libtesseract.* /usr/lib32/
ln -s /usr/local/lib/libtesseract.* /usr/lib64/
ln -s /usr/local/lib/libtesseract.* /lib/
ln -s /usr/local/lib/libtesseract.* /lib32/
ln -s /usr/local/lib/libtesseract.* /lib64/

解压Eng.traineddata.gz并拷贝到/usr/local/share/tessdata/目录下

gzip -d eng.traineddata.gz
cp eng.traineddata /usr/local/share/tessdata/

End.

PS:容易把数字解析成字母,如数字2=字母Z,数字0=字母o等

 

Leave a Reply

Your email address will not be published. Required fields are marked *

*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre lang="" line="" escaped="" highlight="">

无觅相关文章插件,快速提升流量