Module Output¶
各モジュールの出力について説明します。
Document Analyzer¶
Document Analyzer モジュールは以下の変数を tuple で出力します。
| 変数名 | 型 | 説明 |
|---|---|---|
| results | DocumentAnalyzerSchema |
モジュールの出力結果 |
| ocr_vis | np.ndarray | None |
AI-OCR の出力可視化画像(visualizer=True の時のみ) |
| layout_vis | np.ndarray | None |
Layout Analyzer の出力可視化画像(visualizer=True の時のみ) |
results 変数の準拠するスキーマ DocumentAnalyzerSchema の仕様は以下の通りです。
DocumentAnalyzerSchema
No Additional Propsobject
#
DocumentAnalyzerSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
paragraphs
List of detected paragraphs
Requiredarray
#
paragraphs
List of detected paragraphs
Requiredarray
List of detected paragraphs
Nested fields
ParagraphSchema
No Additional Propsobject
#
ParagraphSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
direction
Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
#
direction
Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
Text direction, e.g., ['horizontal' or 'vertical']
role
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer'])
RequiredanyOf
#
role
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer'])
RequiredanyOf
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer'])
tables
List of detected tables
Requiredarray
#
tables
List of detected tables
Requiredarray
List of detected tables
Nested fields
TableStructureRecognizerSchema
No Additional Propsobject
#
TableStructureRecognizerSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
rows
List of table lines representing rows
Requiredarray
#
rows
List of table lines representing rows
Requiredarray
List of table lines representing rows
Nested fields
TableLineSchema
No Additional Propsobject
#
TableLineSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table line in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the table line detection
Requirednumber
#
score
Confidence score of the table line detection
Requirednumber
Confidence score of the table line detection
cols
List of table lines representing columns
Requiredarray
#
cols
List of table lines representing columns
Requiredarray
List of table lines representing columns
Nested fields
TableLineSchema
No Additional Propsobject
#
TableLineSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table line in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the table line detection
Requirednumber
#
score
Confidence score of the table line detection
Requirednumber
Confidence score of the table line detection
spans
List of table lines representing spans
Requiredarray
#
spans
List of table lines representing spans
Requiredarray
List of table lines representing spans
Nested fields
TableLineSchema
No Additional Propsobject
#
TableLineSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table line in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the table line detection
Requirednumber
#
score
Confidence score of the table line detection
Requirednumber
Confidence score of the table line detection
cells
List of table cells
Requiredarray
#
cells
List of table cells
Requiredarray
List of table cells
Nested fields
TableCellSchema
No Additional Propsobject
#
TableCellSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
col_span
Number of columns spanned by the cell
Requiredinteger
#
col_span
Number of columns spanned by the cell
Requiredinteger
DocumentAnalyzerSchema › tables › TableStructureRecognizerSchema › cells › TableCellSchema › col_span
Number of columns spanned by the cell
row_span
Number of rows spanned by the cell
Requiredinteger
#
row_span
Number of rows spanned by the cell
Requiredinteger
DocumentAnalyzerSchema › tables › TableStructureRecognizerSchema › cells › TableCellSchema › row_span
Number of rows spanned by the cell
box
Bounding box of the cell in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the cell in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the cell in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
contents
Text content of the cell
RequiredanyOf
#
contents
Text content of the cell
RequiredanyOf
DocumentAnalyzerSchema › tables › TableStructureRecognizerSchema › cells › TableCellSchema › contents
Text content of the cell
words
List of recognized words
Requiredarray
#
words
List of recognized words
Requiredarray
List of recognized words
Nested fields
WordPrediction
No Additional Propsobject
#
WordPrediction
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
points
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Requiredarray
#
points
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Requiredarray
direction
Text direction, e.g., 'horizontal' or 'vertical'
Requiredstring
#
direction
Text direction, e.g., 'horizontal' or 'vertical'
Requiredstring
Text direction, e.g., 'horizontal' or 'vertical'
rec_score
Confidence score of the word recognition
Requirednumber
#
rec_score
Confidence score of the word recognition
Requirednumber
Confidence score of the word recognition
det_score
Confidence score of the word detection
Requirednumber
#
det_score
Confidence score of the word detection
Requirednumber
Confidence score of the word detection
figures
List of detected figures
Requiredarray
#
figures
List of detected figures
Requiredarray
List of detected figures
Nested fields
FigureSchema
No Additional Propsobject
#
FigureSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the figure in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the figure in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the figure in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
paragraphs
List of paragraphs associated with the figure
Requiredarray
#
paragraphs
List of paragraphs associated with the figure
Requiredarray
List of paragraphs associated with the figure
Nested fields
ParagraphSchema
No Additional Propsobject
#
ParagraphSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
direction
Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
#
direction
Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
Text direction, e.g., ['horizontal' or 'vertical']
role
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer'])
RequiredanyOf
#
role
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer'])
RequiredanyOf
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer'])
direction
Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
#
direction
Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
Text direction, e.g., ['horizontal' or 'vertical']
AI-OCR¶
AI-OCR モジュールは以下の変数を tuple で出力します。
| 変数名 | 型 | 説明 |
|---|---|---|
| results | OCRSchema |
モジュールの出力結果 |
| ocr_vis | np.ndarray | None |
AI-OCR の出力可視化画像(visualizer=Trueの時のみ) |
results 変数の準拠するスキーマ OCRSchema の仕様は以下の通りです。
OCRSchema
No Additional Propsobject
#
OCRSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
words
List of recognized words with their bounding boxes, content, direction, and scores
Requiredarray
#
words
List of recognized words with their bounding boxes, content, direction, and scores
Requiredarray
List of recognized words with their bounding boxes, content, direction, and scores
Nested fields
WordPrediction
No Additional Propsobject
#
WordPrediction
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
points
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Requiredarray
#
points
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Requiredarray
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Constraints
- Minimum items:
4 - Maximum items:
4
direction
Text direction, e.g., 'horizontal' or 'vertical'
Requiredstring
#
direction
Text direction, e.g., 'horizontal' or 'vertical'
Requiredstring
Text direction, e.g., 'horizontal' or 'vertical'
rec_score
Confidence score of the word recognition
Requirednumber
#
rec_score
Confidence score of the word recognition
Requirednumber
Confidence score of the word recognition
det_score
Confidence score of the word detection
Requirednumber
#
det_score
Confidence score of the word detection
Requirednumber
Confidence score of the word detection
Layout Analyzer¶
Layout Analyzer モジュールは以下の変数を tuple で出力します。
| 変数名 | 型 | 説明 |
|---|---|---|
| results | LayoutAnalyzerSchema |
モジュールの出力結果 |
| layout_vis | np.ndarray | None |
Layout Analyzer の出力可視化画像(visualizer=Trueの時のみ) |
results 変数の準拠するスキーマ LayoutAnalyzerSchema の仕様は以下の通りです。
LayoutAnalyzerSchema
No Additional Propsobject
#
LayoutAnalyzerSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
paragraphs
List of detected paragraphs
Requiredarray
#
paragraphs
List of detected paragraphs
Requiredarray
List of detected paragraphs
Nested fields
Element
No Additional Propsobject
#
Element
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the layout element in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the layout element in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the layout element in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the layout element detection
Requirednumber
#
score
Confidence score of the layout element detection
Requirednumber
Confidence score of the layout element detection
role
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
RequiredanyOf
#
role
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
RequiredanyOf
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
tables
List of detected tables
Requiredarray
#
tables
List of detected tables
Requiredarray
List of detected tables
Nested fields
TableStructureRecognizerSchema
No Additional Propsobject
#
TableStructureRecognizerSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
rows
List of table lines representing rows
Requiredarray
#
rows
List of table lines representing rows
Requiredarray
List of table lines representing rows
Nested fields
TableLineSchema
No Additional Propsobject
#
TableLineSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table line in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the table line detection
Requirednumber
#
score
Confidence score of the table line detection
Requirednumber
Confidence score of the table line detection
cols
List of table lines representing columns
Requiredarray
#
cols
List of table lines representing columns
Requiredarray
List of table lines representing columns
Nested fields
TableLineSchema
No Additional Propsobject
#
TableLineSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table line in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the table line detection
Requirednumber
#
score
Confidence score of the table line detection
Requirednumber
Confidence score of the table line detection
spans
List of table lines representing spans
Requiredarray
#
spans
List of table lines representing spans
Requiredarray
List of table lines representing spans
Nested fields
TableLineSchema
No Additional Propsobject
#
TableLineSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the table line in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the table line detection
Requirednumber
#
score
Confidence score of the table line detection
Requirednumber
Confidence score of the table line detection
cells
List of table cells
Requiredarray
#
cells
List of table cells
Requiredarray
List of table cells
Nested fields
TableCellSchema
No Additional Propsobject
#
TableCellSchema
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
col_span
Number of columns spanned by the cell
Requiredinteger
#
col_span
Number of columns spanned by the cell
Requiredinteger
Number of columns spanned by the cell
box
Bounding box of the cell in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the cell in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the cell in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
figures
List of detected figures
Requiredarray
#
figures
List of detected figures
Requiredarray
List of detected figures
Nested fields
Element
No Additional Propsobject
#
Element
No Additional Propsobject
Constraints
- Additional properties are not allowed
Nested fields
box
Bounding box of the layout element in the format [x1, y1, x2, y2]
Requiredarray
#
box
Bounding box of the layout element in the format [x1, y1, x2, y2]
Requiredarray
Bounding box of the layout element in the format [x1, y1, x2, y2]
Constraints
- Minimum items:
4 - Maximum items:
4
Nested fields
Item
integer
score
Confidence score of the layout element detection
Requirednumber
#
score
Confidence score of the layout element detection
Requirednumber
Confidence score of the layout element detection
role
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
RequiredanyOf
#
role
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
RequiredanyOf
Auto-generated from JSON Schema files.