OCR of text stamped on a metal plate

I am working on an OCR project that aims to read a stamped serial number from a metal plate:

Example of stamped text.

I use OpenCV to prepare the image and Tesseract for the OCR itself. This is the ideal process:

  • In the image of the whole plate, crop to the region where the serial number is usually located.
  • Prepare cropped image for OCR.
  • Apply OCR.
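
A minimal sketch of the cropping step, assuming the serial number always sits in roughly the same place on the plate. The function name `crop_serial_region` and the `SERIAL_ROI` coordinates are hypothetical placeholders, not values measured from my images:

```python
import numpy as np

# Hypothetical region of interest as (x, y, width, height) in pixels.
# These numbers are placeholders and would have to be measured on real plates.
SERIAL_ROI = (120, 340, 400, 80)


def crop_serial_region(plate_image, roi=SERIAL_ROI):
    """Return a copy of the part of the plate image covered by the ROI."""
    x, y, w, h = roi
    img_h, img_w = plate_image.shape[:2]

    # Clamp the ROI to the image bounds so slicing never runs off the edge.
    x0, y0 = max(0, x), max(0, y)
    x1, y1 = min(img_w, x + w), min(img_h, y + h)
    if x0 >= x1 or y0 >= y1:
        raise ValueError("ROI lies outside the image")

    return plate_image[y0:y1, x0:x1].copy()
```

An image loaded with `cv2.imread` is a NumPy array, so it can be passed to this function directly.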

My current process:

  • Crop the serial number manually.
  • Convert to grayscale.
  • Sharpen.
  • Use Canny edge detection.
  • Launch Tesseract OCR.

However, I have very limited success. My main questions:

  • What preprocessing improves OCR results? Is edge detection a good start?
  • Can I use the dotted font to my advantage?
  • Can I use the "color" of the text (as opposed to gray metal or black / white labels) to my advantage?
opencv computer-vision ocr tesseract opencv3.0

