下载:
1、安装包:Tesseract-3.01.tar.gz
2、语言包:Eng.traineddata.gz(这个是英文语言包)
3、图像分析库:Leptonica-1.68.tar.gz(版本必须>=1.67)
Ubuntu:
sudo apt-get install autoconf automake libtool sudo apt-get install libpng12-dev sudo apt-get install libjpeg62-dev sudo apt-get install libtiff4-dev sudo apt-get install zlib1g-dev
CentOS:
yum install gcc gcc-c++ autoconf automake libtool libpng libjpeg libtiff zlib-devel
解压Leptonica-1.68.tar.gz并进入目录:
./configure && make && make install ln -s /usr/local/lib/liblept.* /usr/lib/ ln -s /usr/local/lib/liblept.* /usr/lib32/ ln -s /usr/local/lib/liblept.* /usr/lib64/ ln -s /usr/local/lib/liblept.* /lib/ ln -s /usr/local/lib/liblept.* /lib32/ ln -s /usr/local/lib/liblept.* /lib64/
解压Tesseract-3.01.tar.gz并进入目录
删除ccutil/strngs.h第一行第一个字符
执行以下操作:
sh autogen.sh CPPFLAGS="-I/usr/local/include" LDFLAGS="-L/usr/local/lib" ./configure make && make install ln -s /usr/local/lib/libtesseract.* /usr/lib/ ln -s /usr/local/lib/libtesseract.* /usr/lib32/ ln -s /usr/local/lib/libtesseract.* /usr/lib64/ ln -s /usr/local/lib/libtesseract.* /lib/ ln -s /usr/local/lib/libtesseract.* /lib32/ ln -s /usr/local/lib/libtesseract.* /lib64/
解压Eng.traineddata.gz并拷贝到/usr/local/share/tessdata/目录下
gzip -d eng.traineddata.gz cp eng.traineddata /usr/local/share/tessdata/
End.
PS:容易把数字解析成字母,如数字2=字母Z,数字0=字母o等
