linux下安装tesseract-ocr

Yoara

17871人浏览 · 2015-01-04 15:18:10

Yoara · 2015-01-04 15:18:10 发布

1. 在ubuntu下可以自动安装

 sudo apt-get install tesseract-ocr

2.编译安装

a.编译环境: gcc gcc-c++ make(这个环境一般机器都具备,可以忽略)

yum install gcc gcc-c++ make

b.安装tesseract-ocr编译必须的包

yum/apt-get install autoconf automake libtool

c.增加图像解析需要的包，可以按照指定的格式选择包

yum install libjpeg-devel libpng-devel libtiff-devel zlib-devel

ubuntu

sudo apt-get install libpng12-dev
sudo apt-get install libjpeg62-dev
sudo apt-get install libtiff4-dev

d.下载 leptonica 包: http://www.leptonica.org/source/leptonica-1.71.tar.gz

wget http://www.leptonica.org/source/leptonica-1.71.tar.gz
tar -zxvf ...
./configure
make
make install

需要注意，leptonica的版本问题

3.01 requires at least v1.67 of Leptonica.
3.02 requires at least v1.69 of Leptonica. (Both available in Ubuntu 12.04 Precise Pangolin.)
3.03 requires at least v1.70 of Leptonica. (Both available in Ubuntu 14.04 Trusty Tahr.)

如果版本不一致，会出现问题如下：

Tesseract Open Source OCR Engine v3.02.02 with Leptonica
Error in findTiffCompression: function not present
Error in pixReadStreamTiff: function not present
Error in pixReadStream: tiff: no pix returned
Error in pixRead: pix not read
Unsupported image type.

e.下载 tesseract-3.02 安装包: http://tesseract-ocr.googlecode.com/files/tesseract-3.02.02.tar.gz

wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.02.tar.gz
./autogen.sh
./configure
make
make install
ldconfig

f.下载 tesseract-3.02 英文语言包: http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz，解压后将 tesseract-ocr/tessdata 下的所有文件全部拷贝到 /usr/local/share/tessdata 下。

测试

tesseract phototest.tif phototest -l eng

这时应该在当前目录生成一个 phototest.txt 文本文件,内容就是 phototest.tif 显示的文字.

GitCode 开源社区

新一代开源开发者平台 GitCode，通过集成代码托管服务、代码仓库以及可信赖的开源组件库，让开发者可以在云端进行代码托管和开发。旨在为数千万中国开发者提供一个无缝且高效的云端环境，以支持学习、使用和贡献开源项目。

更多推荐

GitCode 9 月：小程序新增三大模型专属频道；百大开源项目结果公布；GitCodeAI 社区战略升级全景发布会圆满召开

GitCode 开源社区

混元世界模型 1.1 在 GitCode 开源！秒级生成 3D 世界，开发者速来体验！

GitCode 开源社区

双星开源：Astron-Agent 与 Astron-RPA 在 GitCode 上线，加速 AI 智能体时代！

GitCode 开源社区

所有评论(0)

查看更多评论

Yoara

@Yoara

已为社区贡献1条内容