PyTesseract - Simple Python Optical Character Recognition

This repository contains the code for this blog post.

Getting Started

Docs

Prerequisites

Kindly ensure you have the following installed on your machine:

Python 3
Tesseract
Git
An IDE or Editor of your choice
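
As a quick sanity check before continuing, the command-line prerequisites can be looked up on your PATH. The following is only a minimal sketch: the helper name `missing_tools` is made up for illustration, and the executable names `tesseract` and `git` are assumptions about how the tools are installed on your machine. On Windows in particular, Tesseract may be installed without being on PATH.

```python
import shutil


def missing_tools(tools):
    """Return the names in `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    # Assumed executable names; adjust them if your system uses different ones.
    missing = missing_tools(["tesseract", "git"])
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("All required command-line tools were found.")
```

A tool reported as missing is not necessarily absent; it may simply live in a directory that is not on PATH.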

Running the Application

Clone the repository

$ git clone https://github.com/ro6ley/python-ocr-example.git

Change into the cloned repository

$ cd python-ocr-example

If you are using Pipenv, set up the virtual environment and activate it as follows:

$ pip install pipenv
$ pipenv install && pipenv shell

If you are on Windows, install and configure Tesseract as follows:

1. Install Tesseract using the Windows installer available at: https://github.com/UB-Mannheim/tesseract/wiki

2. Note the path Tesseract was installed to. At the time of this edit the default installation path was C:\Users\USER\AppData\Local\Tesseract-OCR, but it may change, so check the path on your own machine.

3. Install the Python wrapper: pip install pytesseract

4. Set the Tesseract path in the script before calling image_to_string:

pytesseract.pytesseract.tesseract_cmd = r'C:\Users\USER\AppData\Local\Tesseract-OCR\tesseract.exe'
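
The path configuration in step 4 could be combined with a PATH lookup so the same script works whether or not Tesseract is on PATH. This is a sketch under stated assumptions, not the repository's own code: `find_tesseract` and `image_to_text` are hypothetical helper names, the fallback path is the example path from step 2, and the Pillow package (`PIL`) is assumed to be installed alongside pytesseract.

```python
import os
import shutil

# Fallback location taken from the example above; replace it with your own path.
CANDIDATE_PATHS = [
    r"C:\Users\USER\AppData\Local\Tesseract-OCR\tesseract.exe",
]


def find_tesseract(candidates=CANDIDATE_PATHS):
    """Return a Tesseract executable path, or None if none is found."""
    # Prefer an executable that is already on PATH.
    on_path = shutil.which("tesseract")
    if on_path:
        return on_path
    # Otherwise fall back to the first candidate file that exists.
    for path in candidates:
        if os.path.isfile(path):
            return path
    return None


def image_to_text(image_path):
    """Run OCR on one image file and return the recognized text."""
    # Imported here so find_tesseract() can be used without these packages.
    import pytesseract
    from PIL import Image

    tesseract_path = find_tesseract()
    if tesseract_path is None:
        raise FileNotFoundError("Tesseract executable not found")
    # Must be set before image_to_string is called.
    pytesseract.pytesseract.tesseract_cmd = tesseract_path
    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img)
```

Calling `image_to_text("sample.png")`, where `sample.png` stands in for any image of your own, would return the recognized text as a string.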

Install the Python dependencies:

$ pip install -r requirements.txt

Run the OCR server

$ python app.py

Test

With the server running, open http://localhost:5000/upload in your browser.
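
The same check can be scripted. The sketch below only confirms that the upload page answers; it does not upload an image. It assumes the page responds to a plain GET request with HTTP 200, which this README does not state, and `page_is_up` is a hypothetical helper name.

```python
import urllib.request


def page_is_up(url, timeout=5.0):
    """Return True if a GET request to `url` answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Covers connection errors, timeouts, and HTTP error statuses
        # (urllib's URLError and HTTPError both derive from OSError).
        return False


if __name__ == "__main__":
    url = "http://localhost:5000/upload"
    print(url, "is reachable" if page_is_up(url) else "is not reachable")
```

If the script reports that the page is not reachable, check that `python app.py` is still running and that nothing else is using port 5000.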

Contribution

Please feel free to raise issues using this template and I'll get back to you.

You can also fork the repository, make changes and submit a Pull Request using this template.
