tesseract使用时遇到的坑；TesseractNotFoundError_客气__的博客

相关文章推荐
温文尔雅的菠萝 · iOS开发系列-SQLite - ...· 7 月前 ·
怕老婆的帽子 · AI编程应用实战 - 知乎· 8 月前 ·
阳光的金鱼 · 由于CORS政策，WordPress网站无法 ...· 1 年前 ·
逃跑的棒棒糖 · java list ...· 1 年前 ·
潇洒的大熊猫 · 可解释性论文汇总_2snoopy的博客-CS ...· 1 年前 ·
from PIL import Image
import pytesseract.pytesseract
text = pytesseract.image_to_string(Image.open(r'C:\Users\Desktop\portrait_image\123.png'))
print(text)
使用pytesseract时报错，提示tesseract没有安装或者不在PATH中
 
Traceback (most recent call last):
  File "D:\test_tesseract\venv\lib\site-packages\pytesseract\pytesseract.py", line 252, in run_tesseract
    proc = subprocess.Popen(cmd_args, **subprocess_args())
  File "C:\Users\AppData\Local\Programs\Python\Python39\lib\subprocess.py", line 951, in __init__
    self._execute_child(args, executable, preexec_fn, close_fds,
  File "C:\Users\AppData\Local\Programs\Python\Python39\lib\subprocess.py", line 1420, in _execute_child
    hp, ht, pid, tid = _winapi.CreateProcess(executable, args,
FileNotFoundError: [WinError 2] 系统找不到指定的文件。
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "D:\test_tesseract\test.py", line 9, in <module>
    text = pytesseract.image_to_string(Image.open(r'C:\Users\aike\Desktop\portrait_image\123.png'))
  File "D:\test_tesseract\venv\lib\site-packages\pytesseract\pytesseract.py", line 413, in image_to_string
    return {
  File "D:\test_tesseract\venv\lib\site-packages\pytesseract\pytesseract.py", line 416, in <lambda>
    Output.STRING: lambda: run_and_get_output(*args),
  File "D:\test_tesseract\venv\lib\site-packages\pytesseract\pytesseract.py", line 284, in run_and_get_output
    run_tesseract(**kwargs)
  File "D:\test_tesseract\venv\lib\site-packages\pytesseract\pytesseract.py", line 256, in run_tesseract
    raise TesseractNotFoundError()
pytesseract.pytesseract.TesseractNotFoundError: 11 is not installed or it's not in your PATH. See README file for more information.
解决办法：
 
 打开方式可以在pycharm 输入import pytesseract.pytesseract
 然后按住ctrl键鼠标对着pytesseract右键点击进去 

 将tesseract_cmd = 'tesseract’修改为：tesseract_cmd = ‘D:\Tesseract-OCR\tesseract.exe’
 其中tesseract_cmd是我已经修改了的地址，tesseract.exe在刚刚安装位置里面。修改后保存，再去运行python代码，就可以成功了。
                    测试代码from PIL import Imageimport pytesseract.pytesseracttext = pytesseract.image_to_string(Image.open(r'C:\Users\Desktop\portrait_image\123.png'))print(text)使用pytesseract时报错，提示tesseract没有安装或者不在PATH中Traceback (most recent call last):  File "D:\test_t
				Tesseract的安装及配置
在Python爬虫过程中，难免遇到各种各样的验证码问题，最简单的就是这种验证码了，那么在遇到验证码的时候该怎么办呢？我们就需要OCR技术了，OCR-即Optical Character Recognition光学字符识别，是指通过扫描字符，然后将其形状翻译成电子文本的过程。而tesserocr是Python的一个OCR识别库，所以在安装tesserocr之前，我们需要安装tesseract这个东西
下载地址：https://digi.bib.uni-mannheim.de/tesseract/可以选择下载不带dev的稳定版本，我下载的是3.05.01版本的，不
				## 使用pytesseract库训练时，报错如下：
  File "C:\Users\Acer\AppData\Roaming\Python\Python36\site-packages\pytesseract\pytesseract.py", line 232, in run_tesseract
    raise TesseractNotFoundError()
pytesseract.pytesseract.TesseractNotFoundError: tesseract is not installed or it's not in your path
我们按照提示打开pytes
Leptonica，tesseract
 您将需要使用培训工具和匹配的leptonica绑定构建的tesseract的最新版本（> = 4.0.0beta1）。 可以在找到和更多内容。
 另外，您可以在此项目中构建leptonica和tesseract并将其安装到./usr中的./usr子目录中：
  make leptonica tesseract
 Tesseract将从git存储库构建，该存储库需要CMake，自动工具（包括autotools-archive）和一些其他培训工具库。 请参阅的。
Python
 您需要最新版本的Python3.x。 为了进行图像处理，使用了Python库Pillow 。 如果没有全局安装，请使用提供的需求文件pip install -r r
				前言：春节期间，无法全身心投入地去写爬虫，那就玩玩验证码吧，应该比较有趣！首次接触验证码识别，用pytesser接触一下最简单的验证码先，代码参照：使用python以及工具包进行简单的验证码识别。具体细节可以参见原文，里面安装和报错处理没有详细记录，我在此处主要记录一下自己的安装及处理过程。 
效果：可识别以下类型的验证码： 
正文：代码：# encoding=utf-8
				安装pytesseract和tessercat后运行如下代码出现错误
import sys
sys.path.append(r'E:\anaconda\Lib\site-packages')
import pytesseract 
from PIL import Image
for i in range(100):
    image = Image.open(r'C:\Users\Admini...
					pytesseract报错pytesseract.TesseractNotFoundError: tesseract is not installed or it‘s not in your path
				python中使用pytesseract，遇到以下错误：pytesseract.pytesseract.TesseractNotFoundError: tesseract is not installed or it's not in your path，本文主要介绍解决此错误的方法。
原文地址：Python中使用pytesseract(tesseract OCR)报错(TesseractNotFoundError)解决方法
C:\Users\User-name\AppData\Local\Programs\Python\Python36\python.exe C:/Users/User-name/PycharmProjects/orc/testforpackets.py
Traceback (most recent call last):
  File &amp;quot;C:...
				Tesseract 是一个开源的 OCR（光学字符识别）引擎，它可以将图像中的文本转换为可编辑的文本。它支持多种语言，包括中文。
要使用 Tesseract 识别文本，你需要安装 Tesseract 并准备一张带有文本的图像。然后，你可以使用 Tesseract 的命令行工具或者在 Python 中使用 Tesseract 库来进行文本识别。
例如，如果你使用的是 Python，你可以使用 pytesseract 库来识别文本。首先，你需要安装 pytesseract 库：
pip install pytesseract
然后，你可以使用以下代码来识别图像中的文本：
import pytesseract
from PIL import Image
# 读取图像
image = Image.open('image.png')
# 使用 Tesseract 识别文本
text = pytesseract.image_to_string(image, lang='chi_sim')
print(text)
其中，`lang` 参数用于指定识别的语言，在这里我们使用了 `chi_sim`，表示简体中文。
Tesseract 还提供了许多其他的配置选项，你可以使用这些选项来调整 Tesseract 的行为，比如调整文本识别的精度或者调整识别的速度。你可以在 Tesseract 的文档中了解更多信息。