<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://www.iapr-tc11.org/mediawiki/index.php?action=history&amp;feed=atom&amp;title=NEOCR%3A_Natural_Environment_OCR_Dataset</id>
	<title>NEOCR: Natural Environment OCR Dataset - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://www.iapr-tc11.org/mediawiki/index.php?action=history&amp;feed=atom&amp;title=NEOCR%3A_Natural_Environment_OCR_Dataset"/>
	<link rel="alternate" type="text/html" href="http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;action=history"/>
	<updated>2026-04-10T08:08:20Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.31.16</generator>
	<entry>
		<id>http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1590&amp;oldid=prev</id>
		<title>Dimos: /* Metadata and Ground Truth Data */</title>
		<link rel="alternate" type="text/html" href="http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1590&amp;oldid=prev"/>
		<updated>2011-10-04T11:08:21Z</updated>

		<summary type="html">&lt;p&gt;‎&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;Metadata and Ground Truth Data&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 11:08, 4 October 2011&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l44&quot; &gt;Line 44:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 44:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Global image metadata includes the filename, folder, source information, image width, height, depth, brightness and contrast. Textfield (local, bounding box) metadata contains the visible text and optical, geometrical and typographical characteristics. Bounding boxes are rectangular and parallel to the axes. Additionally distortion quadrangles are provided which enclose the visible text more precisely.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Global image metadata includes the filename, folder, source information, image width, height, depth, brightness and contrast. Textfield (local, bounding box) metadata contains the visible text and optical, geometrical and typographical characteristics. Bounding boxes are rectangular and parallel to the axes. Additionally distortion quadrangles are provided which enclose the visible text more precisely.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Optical characteristics include texture, brightness, contrast, inversion, resolution, noise and blur information. Texture, noise and inversion were annotated manually, the rest was computed automatically using ImageMagick. Geometrical characteristics cover distortion, rotation, character arrangement and occlusion information. Typographical characteristics contain typeface and language metadata. Please see the CBDAR paper [[#References|[1]]] or the [http://www.iapr-tc11.org/dataset/NEOCR/neocr_metadata_doc.pdf &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;technical report&lt;/del&gt;] for further details on the metadata.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Optical characteristics include texture, brightness, contrast, inversion, resolution, noise and blur information. Texture, noise and inversion were annotated manually, the rest was computed automatically using ImageMagick. Geometrical characteristics cover distortion, rotation, character arrangement and occlusion information. Typographical characteristics contain typeface and language metadata. Please see the CBDAR paper [[#References|[1&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;]]], the technical report [[#References|[2&lt;/ins&gt;]]] or the [http://www.iapr-tc11.org/dataset/NEOCR/neocr_metadata_doc.pdf &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;metadata documentation&lt;/ins&gt;] for further details on the metadata.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=Related Tasks=&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=Related Tasks=&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key mediawiki:diff::1.12:old-1550:rev-1590 --&gt;
&lt;/table&gt;</summary>
		<author><name>Dimos</name></author>
		
	</entry>
	<entry>
		<id>http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1550&amp;oldid=prev</id>
		<title>Dimos: /* Version 1.0 */</title>
		<link rel="alternate" type="text/html" href="http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1550&amp;oldid=prev"/>
		<updated>2011-10-02T18:12:26Z</updated>

		<summary type="html">&lt;p&gt;‎&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;Version 1.0&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 18:12, 2 October 2011&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l58&quot; &gt;Line 58:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 58:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Version 1.0==&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Version 1.0==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [http://www.iapr-tc11.org/dataset/NEOCR/neocr_dataset.tar.gz The complete NEOCR dataset with annotations] (&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;XXXX KB&lt;/del&gt;)&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [http://www.iapr-tc11.org/dataset/NEOCR/neocr_dataset.tar.gz The complete NEOCR dataset with annotations] (&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;1.3 GB&lt;/ins&gt;)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Disjoint split of the NEOCR images for training and testing [http://www.iapr-tc11.org/dataset/NEOCR/test.txt Test Set Image Listing] (5 KB), [http://www.iapr-tc11.org/dataset/NEOCR/train.txt Training Set Image Listing] (5 KB)&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Disjoint split of the NEOCR images for training and testing [http://www.iapr-tc11.org/dataset/NEOCR/test.txt Test Set Image Listing] (5 KB), [http://www.iapr-tc11.org/dataset/NEOCR/train.txt Training Set Image Listing] (5 KB)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [http://www.iapr-tc11.org/dataset/NEOCR/annotation.xsd The NEOCR XML-Schema definitions for the annotations] (10 KB)&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [http://www.iapr-tc11.org/dataset/NEOCR/annotation.xsd The NEOCR XML-Schema definitions for the annotations] (10 KB)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key mediawiki:diff::1.12:old-1549:rev-1550 --&gt;
&lt;/table&gt;</summary>
		<author><name>Dimos</name></author>
		
	</entry>
	<entry>
		<id>http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1549&amp;oldid=prev</id>
		<title>Dimos: /* References */</title>
		<link rel="alternate" type="text/html" href="http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1549&amp;oldid=prev"/>
		<updated>2011-10-02T18:11:28Z</updated>

		<summary type="html">&lt;p&gt;‎&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;References&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 18:11, 2 October 2011&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l50&quot; &gt;Line 50:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 50:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=References=&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=References=&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# R. Nagy, A. Dicker and K. Meyer‐Wegener, &amp;quot;NEOCR: A Configurable Dataset for Natural Image Text Recognition&amp;quot;. In CBDAR Workshop 2011 at ICDAR 2011. pp. 53‐58, September 2011.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# R. Nagy, A. Dicker and K. Meyer‐Wegener, &amp;quot;NEOCR: A Configurable Dataset for Natural Image Text Recognition&amp;quot;. In CBDAR Workshop 2011 at ICDAR 2011. pp. 53‐58, September 2011. &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;[http://www.iapr-tc11.org/dataset/NEOCR/cbdar_paper.pdf (PDF)], [http://www.iapr-tc11.org/dataset/NEOCR/cbdar_presentation.pdf (Presentation)]&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# R. Nagy, A. Dicker, and K. Meyer‐Wegener, &amp;quot;Definition and Evaluation of the NEOCR Dataset for Natural‐Image Text Recognition&amp;quot;. University of Erlangen, Dept. of Computer Science, Technical Reports, CS‐2011‐07, September 2011.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# R. Nagy, A. Dicker, and K. Meyer‐Wegener, &amp;quot;Definition and Evaluation of the NEOCR Dataset for Natural‐Image Text Recognition&amp;quot;. University of Erlangen, Dept. of Computer Science, Technical Reports, CS‐2011‐07, September 2011. &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;[http://www.iapr-tc11.org/dataset/NEOCR/neocr_techrep.pdf (PDF)]&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=Submitted Files=&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;=Submitted Files=&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Dimos</name></author>
		
	</entry>
	<entry>
		<id>http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1548&amp;oldid=prev</id>
		<title>Dimos: Created page with 'Datasets -&gt; Datasets List -&gt; Current Page  {| style=&quot;width: 100%&quot; |- | align=&quot;right&quot; |   {|  |- | '''Created: '''2011-10-02 |- | {{Last updated}} |}  |}  =Contact Author=…'</title>
		<link rel="alternate" type="text/html" href="http://www.iapr-tc11.org/mediawiki/index.php?title=NEOCR:_Natural_Environment_OCR_Dataset&amp;diff=1548&amp;oldid=prev"/>
		<updated>2011-10-02T17:56:28Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;#039;&lt;a href=&quot;/mediawiki/index.php/Datasets&quot; title=&quot;Datasets&quot;&gt;Datasets&lt;/a&gt; -&amp;gt; &lt;a href=&quot;/mediawiki/index.php/Datasets_List&quot; title=&quot;Datasets List&quot;&gt;Datasets List&lt;/a&gt; -&amp;gt; Current Page  {| style=&amp;quot;width: 100%&amp;quot; |- | align=&amp;quot;right&amp;quot; |   {|  |- | &amp;#039;&amp;#039;&amp;#039;Created: &amp;#039;&amp;#039;&amp;#039;2011-10-02 |- | {{Last updated}} |}  |}  =Contact Author=…&amp;#039;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;[[Datasets]] -&amp;gt; [[Datasets List]] -&amp;gt; Current Page&lt;br /&gt;
&lt;br /&gt;
{| style=&amp;quot;width: 100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
| align=&amp;quot;right&amp;quot; | &lt;br /&gt;
&lt;br /&gt;
{| &lt;br /&gt;
|-&lt;br /&gt;
| '''Created: '''2011-10-02&lt;br /&gt;
|-&lt;br /&gt;
| {{Last updated}}&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
=Contact Author=&lt;br /&gt;
 Robert Nagy&lt;br /&gt;
 University of Erlangen-Nuremberg&lt;br /&gt;
 Chair for Computer Science 6 (Data Management)&lt;br /&gt;
 Matrensstr. 3&lt;br /&gt;
 D-91058 Erlangen&lt;br /&gt;
 Germany&lt;br /&gt;
 Email: robert[dot]nagy [at] cs[dot]fau[dot]de&lt;br /&gt;
&lt;br /&gt;
=Current Version=&lt;br /&gt;
[[Image:neocr_examples.jpg|400px|thumb|right| Example images from the NEOCR dataset. Note that the dataset also includes images with text in different languages, text with vertical character arrangement, light text on dark and dark text on light background, occlusion,&lt;br /&gt;
good and bad contrast..]]&lt;br /&gt;
[[Image:neocr_examples_bb.jpg|400px|thumb|right| Example of different text characteristics present in images of the NEOCR dataset, along with ground truth bounding boxes and distortion quadrangles.]]&lt;br /&gt;
[[Image:neocr_labeling_screenshot.jpg|400px|thumb|right| The [http://labelme.csail.mit.edu/ LabelMe] interface used for ground truthing.]]&lt;br /&gt;
&lt;br /&gt;
1.0 (also available from the [http://www6.cs.fau.de/research/projects/pixtract/neocr/ NEOCR Web site])&lt;br /&gt;
&lt;br /&gt;
=Keywords=&lt;br /&gt;
OCR, Natural Scene, Scene Text, Word Spotting, Scene Text Recognition, Scene Text Detection, Scene Text Localization&lt;br /&gt;
&lt;br /&gt;
=Description=&lt;br /&gt;
The NEOCR dataset contains 659 real world images with 5238 annotated bounding boxes (textfields). The images were taken by several people independently from the dataset, so the dataset covers a broad range of characteristics which distinguish real world images from scanned documents. All text recognizable by humans has been annotated for all images. The dataset creation process was stopped when for each metadata dimension at least 100 textfields were included in the dataset.&lt;br /&gt;
&lt;br /&gt;
The ground truth contains not only the visible text, but also distortion quadrangles, which enclose the visible text much more precisely than bounding boxes. The dataset is enriched with metadata consisting of brightness, contrast, inversion, texture, resolution, noise, blur, distortion, rotation, character arrangement, occlusion, typeface and language information. The annotation is provided in XML based on the schema of LabelMe.&lt;br /&gt;
&lt;br /&gt;
=Metadata and Ground Truth Data=&lt;br /&gt;
The annotation was created manually by an adaptation of the LabelMe annotation tool. All text visible and recognizable by humans has been annotated for all images. The annotation is provided in XML, the schema of LabelMe was extended to our needs. The extended XMLschema is also provided as part of the dataset. Metadata is provided globally and locally.&lt;br /&gt;
&lt;br /&gt;
Global image metadata includes the filename, folder, source information, image width, height, depth, brightness and contrast. Textfield (local, bounding box) metadata contains the visible text and optical, geometrical and typographical characteristics. Bounding boxes are rectangular and parallel to the axes. Additionally distortion quadrangles are provided which enclose the visible text more precisely.&lt;br /&gt;
&lt;br /&gt;
Optical characteristics include texture, brightness, contrast, inversion, resolution, noise and blur information. Texture, noise and inversion were annotated manually, the rest was computed automatically using ImageMagick. Geometrical characteristics cover distortion, rotation, character arrangement and occlusion information. Typographical characteristics contain typeface and language metadata. Please see the CBDAR paper [[#References|[1]]] or the [http://www.iapr-tc11.org/dataset/NEOCR/neocr_metadata_doc.pdf technical report] for further details on the metadata.&lt;br /&gt;
&lt;br /&gt;
=Related Tasks=&lt;br /&gt;
* [[Text Recognition in Natural Scenes]]&lt;br /&gt;
&lt;br /&gt;
=References=&lt;br /&gt;
# R. Nagy, A. Dicker and K. Meyer‐Wegener, &amp;quot;NEOCR: A Configurable Dataset for Natural Image Text Recognition&amp;quot;. In CBDAR Workshop 2011 at ICDAR 2011. pp. 53‐58, September 2011.&lt;br /&gt;
# R. Nagy, A. Dicker, and K. Meyer‐Wegener, &amp;quot;Definition and Evaluation of the NEOCR Dataset for Natural‐Image Text Recognition&amp;quot;. University of Erlangen, Dept. of Computer Science, Technical Reports, CS‐2011‐07, September 2011.&lt;br /&gt;
&lt;br /&gt;
=Submitted Files=&lt;br /&gt;
==Disclaimer==&lt;br /&gt;
By downloading and using the dataset you agree to acknowledge it's source and cite the above papers in related publications. Please link to the authors' Web page of the set as http://www6.cs.fau.de/neocr.&lt;br /&gt;
&lt;br /&gt;
==Version 1.0==&lt;br /&gt;
* [http://www.iapr-tc11.org/dataset/NEOCR/neocr_dataset.tar.gz The complete NEOCR dataset with annotations] (XXXX KB)&lt;br /&gt;
* Disjoint split of the NEOCR images for training and testing [http://www.iapr-tc11.org/dataset/NEOCR/test.txt Test Set Image Listing] (5 KB), [http://www.iapr-tc11.org/dataset/NEOCR/train.txt Training Set Image Listing] (5 KB)&lt;br /&gt;
* [http://www.iapr-tc11.org/dataset/NEOCR/annotation.xsd The NEOCR XML-Schema definitions for the annotations] (10 KB)&lt;br /&gt;
* [http://www.iapr-tc11.org/dataset/NEOCR/neocr_metadata_doc.pdf NEOCR Metadata Documentation PDF] (7.5 MB)&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
This page is editable only by [[IAPR-TC11:Reading_Systems#TC11_Officers|TC11 Officers ]].&lt;/div&gt;</summary>
		<author><name>Dimos</name></author>
		
	</entry>
</feed>