Difference between revisions of "Ground Truth for LRDE DBD OCR"

From TC11
Jump to: navigation, search
(Created page with "Datasets -> Datasets List -> Current Page {| style="width: 100%" |- | align="right" | {| |- | '''Created: '''2013-05-30 |- | {{Last updated}} |} |} =Keywords= scann…")
(No difference)

Latest revision as of 01:27, 4 July 2013

Datasets -> Datasets List -> Current Page

Created: 2013-05-30
Last updated: 2013-007-04


scanned, magazine, documents, binarization


125 binarized images for "clean documents".

Image groundtruths have been produced using a semi-automatic process: a global thresholding followed by some manual adjustments.

Purpose of the three document qualities :

  • Original : evaluate the binarization quality on perfect documents mixing text and images.
  • Clean : evaluate the binarization quality on perfect document with text only.
  • Scanned : evaluate the binarization quality on slightly degraded documents with text only.

Related Dataset

Related Tasks

Submitted Files

Version 1.0

This page is editable only by TC11 Officers .