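I am using tess4j, the Java wrapper for Tesseract. I also have the regular Tesseract installed. I'm not sure exactly how tess4j works, but since it ships with a tessdata folder, I assumed that is where the language data files go. However, tess4j only works when the language data files are in the "real" tessdata folder (the one that comes with Tesseract, not the one that comes with tess4j). If I delete that folder, I get this error message: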

Error opening data file C:\Program Files\Tesseract-OCR\tessdata/jpn.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory.
Failed loading language 'jpn'
Tesseract couldn't load any languages!
#
# A fatal error has been detected by the Java Runtime Environment:
#
# EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x631259dc, pid=5108, tid=10148
#
# JRE version: 7.0_06-b24
# Java VM: Java HotSpot(TM) Client VM (23.2-b09 mixed mode, sharing windows-x86 )
# Problematic frame:
# C [libtesseract302.dll+0x59dc] STRING::strdup+0x467c
#
# Failed to write core dump. Minidumps are not enabled by default on client versions of Windows
#
# An error report file with more information is saved as:
# D:\School\Programs\OCRTest\v1.0.0\hs_err_pid5108.log
#
# If you would like to submit a bug report, please visit:
# http://bugreport.sun.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#

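Does this mean I need to have Tesseract installed in order to use tess4j? If so, why? Or maybe my tess4j tessdata folder is in the wrong place (currently it sits alongside my .java files, and the tess4j jars are in a lib folder that I have added to the classpath).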
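To narrow this down, here is a minimal sketch of a check that could be run before calling tess4j. It assumes, based only on the error message above, that the native library looks for the file at TESSDATA_PREFIX, then "tessdata", then the language code plus ".traineddata". The class and method names (TessdataCheck, trainedDataPath, languageAvailable) are hypothetical helpers for illustration and are not part of tess4j.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of tess4j: checks whether a language data
// file exists where the error message suggests Tesseract looks for it.
class TessdataCheck {

    // Assumed layout (inferred from the error message):
    // <prefix>/tessdata/<lang>.traineddata
    static Path trainedDataPath(String prefix, String lang) {
        return Paths.get(prefix, "tessdata", lang + ".traineddata");
    }

    // Returns true only if the prefix is set and the data file is a regular file.
    static boolean languageAvailable(String prefix, String lang) {
        return prefix != null && Files.isRegularFile(trainedDataPath(prefix, lang));
    }

    public static void main(String[] args) {
        String prefix = System.getenv("TESSDATA_PREFIX");
        String lang = args.length > 0 ? args[0] : "jpn";
        if (prefix == null) {
            System.out.println("TESSDATA_PREFIX is not set");
        } else {
            Path file = trainedDataPath(prefix, lang);
            String status = languageAvailable(prefix, lang) ? "found" : "missing";
            System.out.println(file + " " + status);
        }
    }
}
```

Running this with the language code as the only argument (for example jpn) should show which folder is actually being consulted. If the file is reported missing, the OCR call can be skipped so that the native library does not crash the JVM.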
