OCRPageObject
June 23, 2026 ยท View on GitHub
Fields
| Field | Type | Required | Description |
|---|---|---|---|
index | int | :heavy_check_mark: | The page index in a pdf document starting from 0 |
markdown | str | :heavy_check_mark: | The markdown string response of the page |
images | List[models.OCRImageObject] | :heavy_check_mark: | List of all extracted images in the page |
tables | List[models.OCRTableObject] | :heavy_minus_sign: | List of all extracted tables in the page |
hyperlinks | List[str] | :heavy_minus_sign: | List of all hyperlinks in the page |
header | OptionalNullable[str] | :heavy_minus_sign: | Header of the page |
footer | OptionalNullable[str] | :heavy_minus_sign: | Footer of the page |
dimensions | Nullable[models.OCRPageDimensions] | :heavy_check_mark: | The dimensions of the PDF Page's screenshot image |
confidence_scores | OptionalNullable[models.OCRPageConfidenceScores] | :heavy_minus_sign: | Confidence scores for the OCR page (populated when confidence_scores_granularity is set) |
blocks | List[models.Block] | :heavy_minus_sign: | Paragraph-level bounding boxes for all content blocks in reading order (populated when include_blocks is True) |