Failed to get text from image using pytesseract

Question

When I run the code below, I get the error "WindowsError: [Error 2] The system cannot find the file specified." Please help me get the text from the image.

from pytesseract import image_to_string
from PIL import Image

print image_to_string(Image.open(r'D:\\name.jpg'),lang='eng')

ERROR:

WindowsError                              Traceback (most recent call last)
in <module>()
      2 from PIL import Image
      3
----> 4 print image_to_string(Image.open(r'D:\\name.jpg'),lang='eng')

C:\ProgramData\Anaconda2\Lib\site-packages\pytesseract\pytesseract.pyc in image_to_string(image, lang, boxes, config)
    120                                    lang=lang,
    121                                    boxes=boxes,
--> 122                                    config=config)
    123     if status:
    124         errors = get_errors(error_string)

C:\ProgramData\Anaconda2\Lib\site-packages\pytesseract\pytesseract.pyc in run_tesseract(input_filename, output_filename_base, lang, boxes, config)
     44         command += shlex.split(config)
     45
---> 46     proc = subprocess.Popen(command, stderr=subprocess.PIPE)
     47     status = proc.wait()
     48     error_string = proc.stderr.read()

C:\ProgramData\Anaconda2\lib\subprocess.pyc in __init__(self, args, bufsize, executable, stdin, stdout, stderr, preexec_fn, close_fds, shell, cwd, env, universal_newlines, startupinfo, creationflags)
    388                                 p2cread, p2cwrite,
    389                                 c2pread, c2pwrite,
--> 390                                 errread, errwrite)
    391         except Exception:
    392             # Preserve original exception in case os.close raises.

C:\ProgramData\Anaconda2\lib\subprocess.pyc in _execute_child(self, args, executable, preexec_fn, close_fds, cwd, env, universal_newlines, startupinfo, creationflags, shell, to_close, p2cread, p2cwrite, c2pread, c2pwrite, errread, errwrite)
    638                                          env,
    639                                          cwd,
--> 640                                          startupinfo)
    641             except pywintypes.error, e:
    642                 # Translate pywintypes.error to WindowsError, which is

WindowsError: [Error 2] The system cannot find the file specified

+3

python

siksha Jul 12 17 at 4:52

3 answers

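Install Google's Tesseract-OCR from the tesseract-ocr project. Your code may be missing that dependency: pytesseract is only a Python wrapper, and it needs the separate tesseract executable to be installed.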
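To check whether that is the problem, here is a minimal sketch that tries to launch the executable the same way the traceback shows pytesseract doing it, through subprocess.Popen. The function name tesseract_runs is made up for illustration, and it assumes the executable is called tesseract and accepts a --version flag; the exit status is ignored, so only the "cannot be found" case returns False.

```python
from __future__ import print_function  # keeps print() working on Python 2 as well

import subprocess


def tesseract_runs(command="tesseract"):
    """Return True if the OS can launch `command`, False if it cannot be started."""
    try:
        proc = subprocess.Popen([command, "--version"],
                                stdout=subprocess.PIPE,
                                stderr=subprocess.PIPE)
        proc.communicate()  # wait for the process to finish and drain its output
        return True
    except OSError:
        # WindowsError (Python 2) and FileNotFoundError (Python 3) are both
        # subclasses of OSError, so this covers the error from the question.
        return False


print("tesseract launchable:", tesseract_runs())
```

If this prints False, the image path is not the issue: the operating system cannot find the tesseract program itself.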

+2

Bodhi94 Jul 12 17 at 5:12 am

You don't need to double the backslashes if you write the path as a raw string. Without a raw string:

print image_to_string(Image.open('D:\\name.jpg'),lang='eng')

With a raw string:

print image_to_string(Image.open(r'D:\name.jpg'),lang='eng')

0

Bodhi94 Jul 12 17 at 5:01

cSharma · Accepted Answer · 2017-07-13T04:15:40+0000

After installing the entire Tesseract-OCR package and application, you need to restart your computer. I tried your code and got the same problem, but after restarting my computer, it worked for me. Try it.
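If restarting is not an option, one possible alternative is to tell pytesseract where the executable is. This assumes the restart helped only because it made the new install location visible to Python, which the answer does not confirm. The sketch below sets pytesseract.pytesseract.tesseract_cmd, the module attribute pytesseract uses as the command to run. The helper names first_existing and ocr_image and the paths in CANDIDATE_PATHS are assumptions for illustration; replace the paths with the folder where Tesseract-OCR was actually installed.

```python
import os

# Assumed install locations, not taken from the question; adjust as needed.
CANDIDATE_PATHS = [
    r"C:\Program Files\Tesseract-OCR\tesseract.exe",
    r"C:\Program Files (x86)\Tesseract-OCR\tesseract.exe",
]


def first_existing(paths):
    """Return the first path in `paths` that is an existing file, else None."""
    for path in paths:
        if os.path.isfile(path):
            return path
    return None


def ocr_image(image_path, lang="eng"):
    """Run OCR on `image_path` with an explicitly configured executable."""
    # Imported here so first_existing() can be used without these packages.
    import pytesseract
    from PIL import Image

    exe = first_existing(CANDIDATE_PATHS)
    if exe is None:
        raise RuntimeError("tesseract.exe not found; install Tesseract-OCR first")
    # Point pytesseract at the executable instead of relying on PATH.
    pytesseract.pytesseract.tesseract_cmd = exe
    return pytesseract.image_to_string(Image.open(image_path), lang=lang)
```

With that in place, ocr_image(r'D:\name.jpg') would take the place of the image_to_string call from the question.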
