Which font works best with the OCR service?

Chris Parker 21 Reputation points
2020-09-28T22:14:15.907+00:00

Hello,

Just started using the OCR from Cognitive Services and I've found that it's, to be frank, not that good. But it's very easy to use in Logic Apps and it's very inexpensive so I'd like to see how I might be able to improve the readings.

We are taking pictures of a product with a serial number label on it created by the product's manufacturer. The string is long and has a character that looks like a zero throughout the serial number. Sometimes the letter that looks like a zero is adjacent to numbers and other times letters. When this character is next to numbers, OCR interprets it as a zero. When it's next to letters, it's interpreted as an uppercase letter 'O'.

I tried making my own labels with various methods: I have my own small label maker and I took pictures of my screen using different fonts. I even tried using a font that renders the zero with a diagonal line through it and uppercase 'O' as a mostly round circle.

None of the fonts I used (even Word's OCRB font) worked any better and the worst performer was the zero with the line through it. Those characters were interpreted as some kind of UTF-8 character.

My goal here is two fold:

  1. Assist the Cognitive Services team in improving their service
  2. Discover a font that definitely works well so I can urge the manufacturer to use that font instead
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,833 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. YutongTie-MSFT 51,696 Reputation points
    2020-09-28T22:37:18.763+00:00

    Hello,

    Thanks for your feedback and insight for the product. I will forward your idea to product team and let you know the response.

    Regards,
    Yutong

    0 comments No comments

  2. Chris Parker 21 Reputation points
    2020-10-05T15:09:24.44+00:00

    Thanks for your response. What about the main part of my post to find the best font? Which font is the system trained on currently?

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.