A group of actions: Vision
The action recognizes text in a document, divides it into blocks, and returns it in tabular form. Works with Robin OCR 1.
Settings
Property | Description | Type | Filling example | Mandatory field |
Parameters | ||||
Path to file | The path to the file to extract the data from. Supported formats: jpg, jpeg, bmp, png, tif, pdf. | Robin.FilePath | | Yes |
Page number | The page number of the file to retrieve data from. If the field is empty, data will be retrieved from all pages. | Robin.Numeric | | No |
Language | The suggested language of the text to extract. | Robin.String | | No |
Algorithm | The format of the data that the action should recognize. If «Table» or «Text and Table» is selected, text from tables will be returned in a tabular layout. | Robin.String | | Yes |
Distance between words | The maximum distance between words in a text. Used when dividing the source text into columns for the resulting table. In the «Table» and «Text and Table» algorithms, the parameter is not taken into account for text from tables. The default value is 20 pixels. | Robin.Numeric | | No |
Distance between lines | The maximum distance between lines in a text. Used when dividing the source text into lines for the resulting table. In the «Table» and «Text and Table» algorithms, the parameter is not taken into account for text from tables. The default value is 1 pixel. | Robin.Numeric | | No |
Folder path | Path to the folder where the image of the original text will be saved. The image shows the source text with the blocks into which the recognition action divided it superimposed on top. To save the image, you also need to fill in the «File name» field. | Robin.FolderPath | C:\doc\img | No |
File name | Name of the image of the original text with superimposed blocks (without extension). The image will be created with the .png extension. If the action extracts text from several pages, each page will have its own image, with an index added to its name. To save the image, you also need to fill in the «Folder path» field. | Robin.String | | No |
Overwrite | If the value is «true» and an image with the same name and extension exists in the specified folder, then the new block image will overwrite it. If «false», the image will not be overwritten and the action will return an error. | Robin.Boolean | true | No |
Parameters | Additional parameters that affect the result and quality of text recognition. | Robin.String | | No |
Sign | If the value is «true», the word «Part» followed by the ordinal number of the part will be added before each recognized table or text part. If the value is «false», a blank line will be inserted before these parts. | Robin.Boolean | true | No |
Results | ||||
Table | A table generated from data retrieved from a source file. | Robin.DataTable | | |
Path to image with blocks | A collection of image paths with overlaid blocks. | Robin.Collection | | |
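
The effect of the «Distance between words» parameter can be illustrated with a minimal Python sketch. It is not the action's actual implementation, which is not described here. The word structure, the function name `split_into_cells`, and the sample coordinates are all hypothetical. The sketch assumes each recognized word comes with its left and right x-coordinates in pixels, and that words on one line are merged into the same cell while the gap between them does not exceed the maximum distance.

```python
from typing import List, Optional, Tuple

# Hypothetical word representation: (text, left x, right x), in pixels.
Word = Tuple[str, int, int]


def split_into_cells(words: List[Word], max_gap: int = 20) -> List[str]:
    """Split the words of one text line into table cells.

    Neighbouring words stay in the same cell while the horizontal gap
    between them is at most max_gap; a larger gap starts a new cell.
    The default of 20 mirrors the documented default, but the grouping
    rule itself is an assumption made for illustration.
    """
    cells: List[List[str]] = []
    prev_right: Optional[int] = None
    # Process words left to right.
    for text, left, right in sorted(words, key=lambda w: w[1]):
        if prev_right is None or left - prev_right > max_gap:
            cells.append([text])      # gap too large: start a new column
        else:
            cells[-1].append(text)    # close enough: same column
        prev_right = right
    return [" ".join(cell) for cell in cells]


# Made-up coordinates for a single recognized line.
line = [("Invoice", 10, 70), ("No.", 78, 100), ("12345", 180, 230)]
print(split_into_cells(line))             # two cells with the default gap
print(split_into_cells(line, max_gap=5))  # a smaller gap yields more cells
```

Under these assumptions, raising the distance merges more words into one column and lowering it produces more columns. «Distance between lines» could be reasoned about the same way along the vertical axis.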