Ground Truth for LRDE DBD OCR

Revision as of 01:27, 4 July 2013 by Liwicki


Created: 2013-05-30
Last updated: 2013-07-04

Keywords

scanned, magazine, documents, binarization


Description

125 binarized images for "clean documents".

The ground-truth images were produced by a semi-automatic process: global thresholding followed by manual adjustments.
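
As a rough illustration of such a semi-automatic process, the sketch below applies a single global threshold and then overrides individual pixels by hand. It is not the procedure actually used for this dataset: the threshold value of 128, the function names, and the dictionary of manual fixes are all assumptions made for the example.

```python
def global_threshold(gray, threshold=128):
    """Binarize a grayscale image given as rows of 0-255 intensities.

    Pixels darker than `threshold` become 0 (ink); all others become
    255 (background). The default of 128 is an assumed value.
    """
    return [[0 if px < threshold else 255 for px in row] for row in gray]


def apply_manual_fixes(binary, fixes):
    """Return a copy of `binary` with hand-made corrections applied.

    `fixes` is a hypothetical mapping from (row, col) to the corrected
    pixel value (0 or 255), standing in for the manual adjustment step.
    """
    fixed = [row[:] for row in binary]
    for (r, c), value in fixes.items():
        fixed[r][c] = value
    return fixed


# Tiny 3x3 grayscale example: the right column is dark text, and the
# centre pixel is a grey smudge that the threshold wrongly marks as ink.
gray = [
    [250, 240, 30],
    [245, 120, 20],
    [255, 200, 10],
]
binary = global_threshold(gray, threshold=128)
groundtruth = apply_manual_fixes(binary, {(1, 1): 255})
```

Here the automatic step classifies the centre pixel as ink, and the manual step resets it to background.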


Purpose of the three document qualities:

  • Original: evaluate the binarization quality on perfect documents mixing text and images.
  • Clean: evaluate the binarization quality on perfect documents with text only.
  • Scanned: evaluate the binarization quality on slightly degraded documents with text only.
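
This page does not say which measure is used to evaluate binarization quality. One common choice is a pixel-level F-measure against the ground truth. The sketch below assumes that measure, and also assumes images are stored as rows of 0 (ink) and 255 (background); the function name is hypothetical.

```python
def binarization_scores(result, groundtruth, ink=0):
    """Return pixel-level (precision, recall, f_measure) for the ink class.

    `result` and `groundtruth` are same-sized binary images given as
    rows of pixel values; `ink` is the value that marks foreground.
    """
    tp = fp = fn = 0
    for res_row, gt_row in zip(result, groundtruth):
        for res_px, gt_px in zip(res_row, gt_row):
            if res_px == ink and gt_px == ink:
                tp += 1  # ink correctly detected
            elif res_px == ink:
                fp += 1  # background wrongly marked as ink
            elif gt_px == ink:
                fn += 1  # ink that was missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# 2x2 example: one false positive and one missed ink pixel.
reference = [[255, 0], [0, 0]]
candidate = [[0, 0], [255, 0]]
precision, recall, f_measure = binarization_scores(candidate, reference)
```

The same function could be run separately on the Original, Clean, and Scanned sets to compare how a binarization method behaves on each document quality.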

Related Dataset

Related Tasks

Submitted Files

Version 1.0

