Ground Truth for LRDE DBD OCR

From TC11

Created: 2013-05-30
Last updated: 2013-07-03

Keywords

scanned, magazine, documents, binarization


Description

125 binarized images for "clean documents".

The ground-truth images were produced with a semi-automatic process: global thresholding followed by manual adjustments.
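
As a rough illustration of the global thresholding step, the sketch below applies one threshold to every pixel of a grayscale page. The function name, the default threshold of 128, and the 0 = ink / 255 = background convention are assumptions made for this example; the page does not say how the threshold was chosen or which tool was used, and the manual adjustment step is not modelled.

```python
def global_threshold(gray, threshold=128):
    """Binarize a grayscale image with a single threshold for the whole page.

    gray: list of rows, each a list of intensities in 0..255.
    Pixels darker than `threshold` become 0 (ink); all others become 255
    (background). The default of 128 is an arbitrary illustrative value.
    """
    return [[0 if px < threshold else 255 for px in row] for row in gray]


# Tiny hypothetical page: bright background with two dark (ink) pixels.
page = [
    [250, 240, 30],
    [245, 20, 235],
]
binary = global_threshold(page)
```

Because the same threshold is used everywhere, this approach suits evenly lit, high-contrast pages, which is consistent with following it by manual corrections.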


Purpose of the three document qualities:

  • Original : evaluate the binarization quality on perfect documents mixing text and images.
  • Clean : evaluate the binarization quality on perfect documents with text only.
  • Scanned : evaluate the binarization quality on slightly degraded documents with text only.
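
One possible way to score a binarization result against a ground-truth image is a pixel-level F-measure over ink pixels, sketched below. The page does not name an evaluation metric, so the choice of F-measure, the function name, and the 0 = ink convention are all assumptions made for illustration.

```python
def pixel_f_measure(result, truth, ink=0):
    """Pixel-level F-measure of `result` against `truth`.

    Both images are lists of rows of pixel values with identical dimensions.
    `ink` is the value that marks foreground pixels (assumed to be 0 here).
    """
    tp = fp = fn = 0
    for result_row, truth_row in zip(result, truth):
        for r, t in zip(result_row, truth_row):
            if r == ink and t == ink:
                tp += 1  # ink correctly detected
            elif r == ink:
                fp += 1  # background wrongly marked as ink
            elif t == ink:
                fn += 1  # ink that was missed
    if tp == 0:
        return 0.0  # no correctly detected ink; avoids division by zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical 2x3 example: one missed ink pixel and one spurious ink pixel.
truth = [[255, 0, 0], [255, 255, 0]]
result = [[255, 0, 255], [0, 255, 0]]
score = pixel_f_measure(result, truth)
```

A score of 1.0 means the result matches the ground truth on every ink pixel; lower values reflect missed or spurious ink.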

Related Dataset

Related Tasks

Submitted Files

Version 1.0

