Azure Read API
Provides the ability for optical character recognition upon documents
The Computer Vision Read API is Azure's latest OCR technology (learn what's new) that extracts printed text (in several languages), handwritten text (in several languages), digits, and currency symbols from images and multi-page PDF documents.
It's optimized to extract text from text-heavy images and multi-page PDF documents with mixed languages. It supports extracting both printed and handwritten text in the same image or document.
You can list a component in the marketplace, and define if you want it to be a template.
✅ Available in Marketplace
❌ Can not be used as a template
What is a Model?
This component is a model, which is a type of store that is specialized for handling AI/ML model storage, which includes both the implementation, and the results of training.
Models are a foundational part of Kodexa, and are used in many different ways. For example, a model can be used to classify documents, or to extract data from documents. Models can also be used to train other models.
✅ Atomic Deployment (Recommended)
❌ Not trainable
A model needs to reference a model runtime to use.
The model also has the following model runtime parameters configured. This influences how the model is run, see the model runtime references to determine what parameters are available.
When you use the model for inference, you can use the following options:
|store_azure_output||False||None||boolean||Store the JSON response from Azure in the document as a feature on the root node|
|keep_azure_lines||True||None||boolean||Retain the line layout from Azure in the document|