Document structure analysis

A typical scanned image or photo of a text document may contain a large number of different content blocks - text paragraphs, tables, illustrations, formulas, and the like. Detecting, ordering, and classifying areas of interest on a page is the cornerstone of a successful and accurate OCR. This process is called document structure and layout analysis (DSR).

Document structure analysis and recognition

Aspose.OCR Cloud offers several document structure analysis algorithms, allowing you to choose the one that works best for your specific content.

Aspose.OCR Cloud automatically selects the document structure analysis algorithms for you that is suitable for most common use cases. However, you can manually override or tune up the defaults through the API if you are unhappy with the results or get unwanted artifacts.

You can manually override the default document areas detection method if you are unhappy with the results or get unwanted artifacts. Document structure analysis algorithm is specified in an optional recognition setting dsrMode:

Value	Description	Use cases
`NoDsrNoFilter`	Do not analyze document structure.	Simple images containing a few lines of text without illustrations or formatting. Applications requiring maximum recognition speed Web applications
`DsrNoFilter`	Detect large blocks of text, such as paragraphs and columns. Optimal for multi-column documents with illustrations. Can be further tuned up with DSR confidence recognition setting to filter out dim and blurry areas that can lead to unreliable recognition results. See Complex structure analysis for additional details.	Contracts Books Articles Newspapers High-quality scans
`DsrAndFilter`	Do not analyze document structure for small images to maximize recognition speed; use complex structure analysis for large images only. Use this algorithm to accelerate the batch recognition of a large number of diverse images. Can be further tuned up with DSR confidence setting to filter out dim and blurry areas that can lead to unreliable recognition results.	Batch recognition
`TextDetector`	Find small text blocks (individual words, phrases, or lines) inside complex images and then position these blocks relative to each other in recognition results. This algorithm works best with sparse irregular text and low-quality photos. See Text area analysis for additional details.	Invoices Screenshots Driver’s licenses Identity cards Visas Math formulas
`DsrPlusDetector`	The combination of complex structure analysis and text area analysis. Can be further tuned up with DSR confidence recognition setting to filter out dim and blurry areas that can lead to unreliable recognition results.	Posters Billboards Datasheets Random photos Batch recognition
`Regions`	Detect blocks of text, such as paragraphs, columns, annotations, and so on.	Regions detection
`CraftPoly`	Automatically straighten curved or distorted lines and find small text blocks (individual words, phrases, or lines) inside the resulting image. See Curved text detection for additional details.	Photos of books, magazine articles, and other curved pages.

Language Processing result formats