DocumentAnalyzerSchema
Type: object
No Additional Properties
Paragraphs
Type: array
List of detected paragraphs
No Additional Items
Each item of this array must be:
ParagraphSchema
Type: object
No Additional Properties
Box
Type: array of integer
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Must contain exactly 4 items
Each item of this array must be an integer
Contents
Text content of the paragraph
Direction
Text direction, e.g., 'horizontal' or 'vertical'
Order
Order of the paragraph in the document
Role
Role of the paragraph, e.g., 'section_headings', 'page_header', 'page_footer'
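As an illustration of how the Order and Role fields might be used together, the following is a minimal sketch that joins paragraph text in reading order while skipping page headers and footers. The lowercase JSON keys (`contents`, `order`, `role`) and the role strings are assumptions inferred from the property titles above; `body_text` is a hypothetical helper, not part of any library.

```python
def body_text(paragraphs, skip_roles=("page_header", "page_footer")):
    """Join paragraph contents in document order, skipping the given roles.

    Assumes each paragraph is a dict with "contents" and "order" keys and an
    optional "role" key (key names are an assumption, not confirmed by the schema).
    """
    ordered = sorted(paragraphs, key=lambda p: p["order"])
    return "\n".join(
        p["contents"] for p in ordered if p.get("role") not in skip_roles
    )


# Hypothetical sample data shaped like ParagraphSchema items.
sample_paragraphs = [
    {"box": [40, 980, 560, 1000], "contents": "Page 1",
     "direction": "horizontal", "order": 2, "role": "page_footer"},
    {"box": [40, 40, 560, 80], "contents": "Introduction",
     "direction": "horizontal", "order": 0, "role": "section_headings"},
    {"box": [40, 100, 560, 300], "contents": "Body text.",
     "direction": "horizontal", "order": 1, "role": None},
]

print(body_text(sample_paragraphs))
```

Passing `skip_roles=()` keeps every paragraph, which can be useful when headers and footers carry content of interest.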
Tables
Type: array
List of detected tables
No Additional Items
Each item of this array must be:
TableStructureRecognizerSchema
Type: object
No Additional Properties
Box
Type: array of integer
Bounding box of the table in the format [x1, y1, x2, y2]
Must contain exactly 4 items
Each item of this array must be an integer
N Row
Type: integer
Number of rows in the table
N Col
Type: integer
Number of columns in the table
Rows
Type: array
List of table lines representing rows
No Additional Items
Each item of this array must be:
TableLineSchema
Type: object
No Additional Properties
Box
Type: array of integer
Bounding box of the table line in the format [x1, y1, x2, y2]
Must contain exactly 4 items
Each item of this array must be an integer
Score
Type: number
Confidence score of the table line detection
Cols
Type: array
List of table lines representing columns
No Additional Items
Each item of this array must be a TableLineSchema, as defined under Rows
Spans
Type: array
List of table lines representing spans
No Additional Items
Each item of this array must be a TableLineSchema, as defined under Rows
Cells
Type: array
List of table cells
No Additional Items
Each item of this array must be:
TableCellSchema
Type: object
No Additional Properties
Col
Type: integer
Column index of the cell
Row
Type: integer
Row index of the cell
Col Span
Type: integer
Number of columns spanned by the cell
Row Span
Type: integer
Number of rows spanned by the cell
Box
Type: array of integer
Bounding box of the cell in the format [x1, y1, x2, y2]
Must contain exactly 4 items
Each item of this array must be an integer
Contents
Text content of the cell
Order
Type: integer
Order of the table in the document
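To show how Row, Col, Row Span and Col Span relate, here is a minimal sketch that expands each cell over every grid position it covers. The key names (`row`, `col`, `row_span`, `col_span`, `contents`) are assumptions inferred from the property titles, and `cell_lookup` is a hypothetical helper. The sketch deliberately does not assume whether indices start at 0 or 1, since the schema does not say.

```python
def cell_lookup(cells):
    """Map every covered (row, col) position to the contents of its cell.

    A cell with row_span=2 and col_span=3 fills a 2 x 3 block of positions
    starting at its own (row, col). Index base is whatever the input uses.
    """
    lookup = {}
    for cell in cells:
        for r in range(cell["row"], cell["row"] + cell["row_span"]):
            for c in range(cell["col"], cell["col"] + cell["col_span"]):
                lookup[(r, c)] = cell["contents"]
    return lookup


# Hypothetical sample: a header spanning two columns above two plain cells.
sample_cells = [
    {"row": 1, "col": 1, "row_span": 1, "col_span": 2, "contents": "Header"},
    {"row": 2, "col": 1, "row_span": 1, "col_span": 1, "contents": "a"},
    {"row": 2, "col": 2, "row_span": 1, "col_span": 1, "contents": "b"},
]

grid = cell_lookup(sample_cells)
```

A dictionary keyed by position avoids committing to an index base; converting it to a list of lists is straightforward once the base is known for a given output.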
Words
Type: array
List of recognized words
No Additional Items
Each item of this array must be:
WordPrediction
Type: object
No Additional Properties
Points
Type: array of array
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Must contain exactly 4 items
Each item of this array must be an array that contains exactly 2 items
Content
Type: string
Text content of the word
Direction
Type: string
Text direction, e.g., 'horizontal' or 'vertical'
Rec Score
Type: number
Confidence score of the word recognition
Det Score
Type: number
Confidence score of the word detection
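Words use a four-point outline rather than the [x1, y1, x2, y2] box used elsewhere in this schema. The sketch below shows one way to convert Points to an axis-aligned box and to filter words by their two scores. The key names (`points`, `content`, `rec_score`, `det_score`), the helper names, and the 0.5 thresholds are all illustrative assumptions.

```python
def quad_to_box(points):
    """Return the axis-aligned [x1, y1, x2, y2] box enclosing four [x, y] points."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return [min(xs), min(ys), max(xs), max(ys)]


def confident_words(words, min_rec=0.5, min_det=0.5):
    """Keep words whose recognition and detection scores both meet the thresholds."""
    return [
        w for w in words
        if w["rec_score"] >= min_rec and w["det_score"] >= min_det
    ]


# Hypothetical sample data shaped like WordPrediction items.
sample_words = [
    {"points": [[10, 5], [50, 6], [49, 20], [9, 19]], "content": "total",
     "direction": "horizontal", "rec_score": 0.98, "det_score": 0.91},
    {"points": [[60, 5], [70, 5], [70, 20], [60, 20]], "content": "l",
     "direction": "horizontal", "rec_score": 0.31, "det_score": 0.88},
]

kept = confident_words(sample_words)
boxes = [quad_to_box(w["points"]) for w in kept]
```

The enclosing box loses any rotation or skew present in the original outline, so it is best suited to comparisons against the rectangular boxes of paragraphs, tables, and figures.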
Figures
Type: array
List of detected figures
No Additional Items
Each item of this array must be:
FigureSchema
Type: object
No Additional Properties
Box
Type: array of integer
Bounding box of the figure in the format [x1, y1, x2, y2]
Must contain exactly 4 items
Each item of this array must be an integer
Order
Order of the figure in the document
Paragraphs
Type: array
List of paragraphs associated with the figure
No Additional Items
Each item of this array must be a ParagraphSchema, as defined under the top-level Paragraphs
Direction
Text direction, e.g., 'horizontal' or 'vertical'
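Every Box in this schema shares the same constraint: an array of exactly 4 integers. As a rough sketch of checking that constraint across a whole result, the following walks the top-level paragraphs, tables, and figures arrays. The lowercase key names (`paragraphs`, `tables`, `figures`, `box`) are assumptions inferred from the property titles, and both functions are hypothetical helpers rather than a replacement for a full JSON Schema validator.

```python
def is_valid_box(box):
    """True if box is a list of exactly 4 integers (booleans are rejected)."""
    return (
        isinstance(box, list)
        and len(box) == 4
        and all(isinstance(v, int) and not isinstance(v, bool) for v in box)
    )


def invalid_boxes(result):
    """Return (section, index) pairs for items whose box breaks the constraint."""
    bad = []
    for section in ("paragraphs", "tables", "figures"):
        for index, item in enumerate(result.get(section, [])):
            if not is_valid_box(item.get("box")):
                bad.append((section, index))
    return bad


# Hypothetical sample: one well-formed box, one with a float, one too short.
sample_result = {
    "paragraphs": [{"box": [0, 0, 100, 20]}, {"box": [0, 30, 100.5, 50]}],
    "tables": [],
    "figures": [{"box": [10, 10, 90]}],
}

problems = invalid_boxes(sample_result)
```

Word outlines are not covered here because they use the nested Points format instead of a flat box.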