Azure Vision (Text Extraction OCR) Issue

Question

I’m reaching out to you with a request around text extraction from scanned documents using Azure Vision API. We had built a solution to extract data from scanned documents (Architectural drawings), for which we are using Azure Vision API. The requirement is to extract Drawing Title, Drawing Number and Revision from Architectural drawings. For certain images the Azure api is unable to extract data (images with size less than 5kb).

Can you help us in getting this issue resolved? I can schedule the call to discuss this issue further.

Answer

@Acharya, Rakesh Thanks for the question, Azure Cognitive Services provides Industry's best optical character recognition (OCR) capability with Read API. The Computer Vision Read API is Azure's latest OCR technology (learn what's new) that extracts printed text (in several languages), handwritten text (English only), digits, and currency symbols from images and multi-page PDF documents. It's optimized to extract text from text-heavy images and multi-page PDF documents with mixed languages. If possible can you please share the sample input images and the output that is unable to extract data.

Also we have built a form recognition service seems promising for your application. Can you please try with the Form Recognizer Layout API that Detects and extracts text and layout of documents.
https://azure.microsoft.com/en-us/services/cognitive-services/form-recognizer/

In the following outlines the traditional challenges of doing OCR in the wild, and what are the ways in which deep learning algorithms are being applied to transform these solutions.
https://twimlai.com/how-deep-learning-has-revolutionized-ocr-with-cha-zhang/
Resources
• Computer Vision
• Microsoft Form Recognizer
• Paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
• Paper: LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Share via

Azure Vision (Text Extraction OCR) Issue

1 answer