Document structure analysis

A typical scanned image or photo of a text document may contain many different content blocks: text paragraphs, tables, illustrations, formulas, and the like. Detecting, ordering, and classifying areas of interest on a page is the cornerstone of successful and accurate OCR. This process is called document structure and layout analysis (DSR).

Document structure analysis and recognition

Aspose.OCR Cloud offers several document structure analysis algorithms, allowing you to choose the one that works best for your specific content.

You can manually override the default document area detection method if you are unhappy with the results or get unwanted artifacts. The document structure analysis algorithm is specified in the optional recognition setting dsrMode:

| Value | Description | Use cases |
| --- | --- | --- |
| NoDsrNoFilter | Do not analyze document structure. | Simple images containing a few lines of text without illustrations or formatting; applications requiring maximum recognition speed; web applications |
| DsrNoFilter | Detect large blocks of text, such as paragraphs and columns. Optimal for multi-column documents with illustrations. Can be further tuned with the DSR confidence recognition setting to filter out dim and blurry areas that can lead to unreliable recognition results. See Complex structure analysis for additional details. | Contracts; books; articles; newspapers; high-quality scans |
| DsrAndFilter | Do not analyze document structure for small images to maximize recognition speed; use complex structure analysis for large images only. Use this algorithm to accelerate the batch recognition of a large number of diverse images. Can be further tuned with the DSR confidence recognition setting to filter out dim and blurry areas that can lead to unreliable recognition results. | Batch recognition |
| TextDetector | Find small text blocks (individual words, phrases, or lines) inside complex images and then position these blocks relative to each other in recognition results. This algorithm works best with sparse irregular text and low-quality photos. See Text area analysis for additional details. | Invoices; screenshots; driver’s licenses; identity cards; visas; math formulas |
| DsrPlusDetector | The combination of complex structure analysis and text area analysis. Can be further tuned with the DSR confidence recognition setting to filter out dim and blurry areas that can lead to unreliable recognition results. | Posters; billboards; datasheets; random photos; batch recognition |
| Regions | Detect blocks of text, such as paragraphs, columns, annotations, and so on. | Regions detection |
| CraftPoly | Automatically straighten curved or distorted lines and find small text blocks (individual words, phrases, or lines) inside the resulting image. See Curved text detection for additional details. | Photos of books, magazine articles, and other curved pages |
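
As a rough illustration of how the dsrMode values in the table above might be selected in client code, the following Python sketch maps a few content types to a mode and serializes the setting as JSON. Only the setting name dsrMode and its values come from this page. The content-type labels, the helper build_settings, and the idea of sending the setting as a standalone JSON object are assumptions made for the example, not part of the Aspose.OCR Cloud API; check the API reference for the actual request format.

```python
import json

# Values of the dsrMode setting, copied from the table above.
DSR_MODES = {
    "NoDsrNoFilter",
    "DsrNoFilter",
    "DsrAndFilter",
    "TextDetector",
    "DsrPlusDetector",
    "Regions",
    "CraftPoly",
}

# Hypothetical content-type labels mapped to a mode, following the
# "Use cases" column. The labels are illustrative, not API identifiers.
SUGGESTED_MODE = {
    "simple_image": "NoDsrNoFilter",
    "newspaper": "DsrNoFilter",
    "batch": "DsrAndFilter",
    "invoice": "TextDetector",
    "poster": "DsrPlusDetector",
    "curved_page": "CraftPoly",
}


def build_settings(dsr_mode: str) -> str:
    """Return a JSON string holding the dsrMode setting.

    Hypothetical helper: the real request body may carry more fields
    or use a different shape. Unknown mode names are rejected early.
    """
    if dsr_mode not in DSR_MODES:
        raise ValueError(f"Unknown dsrMode: {dsr_mode!r}")
    return json.dumps({"dsrMode": dsr_mode})


# Pick the mode suggested for invoices and print the resulting setting.
print(build_settings(SUGGESTED_MODE["invoice"]))
```

Validating the mode name locally catches typos before a request is made, and keeping the mapping in one place makes it easy to switch algorithms when the default detection gives poor results.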