============================================================================ IAPR TC-11 Newsletter June 2011 http://www.iapr-tc11.org ========== Contents ======================================================== * Message from the Editor * Dates 'n' Deadlines - HIP 2011 (at ICDAR, Bejing), June 20 (extended!) - MOCR 2011 (at ICDAR 2011), June 20 (extended!) - AND 2011 (at ICDAR 2011), June 26 (extended!) * Calls for Papers - Document Recognition and Retrieval XIX, San Francisco, 22-26 January 2012 - AND 2011 (at ICDAR 2011), September 17 (updated) - HIP 2011 (at ICDAR 2011), September 16-17 (updated) - MOCR 2011 (at ICDAR 2011), September 17 (updated) * IJDAR Contents Telegram (Volume 14 Number 2) * News: - Advertisement possibilities in IJDAR (by S. Marinai) * Call for Contributions ============================================================================ ========== Message from the Editor ========================================= Welcome to the June edition of our TC-11 newsletter. This edition brings good news to all researchers who would like to submit their latest results to some satelite workshop of ICDAR 2011: The submission deadlines for HIP, MOCR, and AND have been extended - to June 20 for HIP and MOCR and to June 26 for AND. CBDAR does not allow new paper submissions any more but existing abstracts and manuscript drafts may be updated until June 29. Still open is the call for papers for DRR 2012, which will be held in San Francisco in January 2012. All prospective authors of ICDAR papers I would like to remind that the camera ready versions of those papers are due July 1. Furthermore, as most of us will need to apply for a visa in order to be able to travel to Bejing, I would like to advise you to take care of the necessary formalities early. Finally, you will again find an IJDAR Contents Telegram in this issue together with an announcement by IJDAR Editor-in-Chief Simone Marinai. Best regards, Gernot A. Fink, IAPR-TC11 Newsletter Editor Gernot.Fink@udo.edu ============================================================================ ========== Dates 'n' Deadlines ============================================= Event/Location/Web: Event Date: Deadline (paper submission): ---------------------------------------------------------------------------- * ICDAR 2011, Beijing September 18-21 --- (http://www.icdar2011.org/) - Doctoral Consortium: July 1 (http://www.icdar2011.org/EN/column/column39.shtml) * HIP 2011 (at ICDAR 2011) September 16-17 June 20 (ext.) (http://www.comp.nus.edu.sg/~hdocp) * MOCR 2011 (at ICDAR 2011) September 17 June 20 (ext.) (http://www.cubs.buffalo.edu/MOCR/) * AND 2011, (at ICDAR 2011) September 17 June 26 (ext.) (http://sites.google.com/site/and2011workshop/) * CBDAR 2011, (at ICDAR 2011) September 22 June 15 (http://imlab.jp/cbdar2011/) * AFHA 2011, (at ICDAR 2011) September 17-18 June 15 (http://forensic.to/webhome/afha/) * DRR 2012, San Francisco January 22-26, 2012 June 30 (http://drr2012.irccyn.ec-nantes.fr/) * DAS 2012, Gold Coast, Australia March 27-29, 2012 September 30 (http://www.ict.griffith.edu.au/das2012) ============================================================================ ========== Call for Papers: DRR 2012 ======================================= Document Recognition and Retrieval XIX Part of the IS&T/SPIE Annual Symposium on Electronic Imaging 22-26 January 2012 . San Francisco, CA USA http://drr2012.irccyn.ec-nantes.fr/ Important dates: *30 June 2011*:*Full papers due* 1 September 2011 : Acceptance notice 14 November 2011 : Final manuscripts due Papers are solicited in, but not limited to, the following areas Document Recognition 1. Text recognition: machine-printed, handwritten documents; paper, tablet, camera, video sources 2. Writer/style identification, verification, adaptation 3. Graphics recognition: vectorization (e.g. for line-art, maps and technical drawings), signature, logo and graphical symbol recognition, figure, chart and graph recognition, diagrammatic notations (e.g. music, mathematical notation) 4. Document layout analysis and understanding: document and page region segmentation, form and table recognition, document understanding through combined modalities (e.g. speech and images) 5. Evaluation: performance metrics, document degradation models 6. Additional topics: document image filtering, enhancement and compression, document clustering and classification, machine learning (e.g. integration and optimization of recognition modules), historical and degraded document images (e.g. fax), multilingual document recognition, web page analysis (including wikis and blogs) Document Retrieval 1. Indexing and Summarization : (noisy) text documents (messages, blogs, etc.), imaged documents, entity tagging from OCR'ed text, text categorization 2. Query Languages and Modalities: Content-Based Image Retrieval (CBIR) for documents, keyword spotting, non-textual query-by-example (e.g. tables, figures, math), querying by document geometry and/or logical structure, approximate string matching algorithms for OCR'ed text, retrieval of noisy text documents (messages, blogs, etc.), cross and multi- lingual retrieval 3. Evaluation: relevance and performance metrics, evaluation protocols, benchmarking 4. Additional topics: relevance feedback, impact of recognition accuracy on retrieval performance, and digital libraries including systems engineering and quality assurance Program Committee: /Conference Chairs:/ /*Christian Viard-Gaudin*/, Univ. of Nantes (France); /*Richard Zanibbi*/, Rochester Institute of Technology (USA)/ //Program Committee:/ /*Gady Agam*/, Illinois Institute of Technology; /*Elisa Barney Smith*/, Boise State Univ.; /*Bill Barrett*/, Brigham Young Univ.; /*Kathrin Berkner*/, Ricoh Innovations, Inc.; /*Bertrand Couasnon*/, IRISA/INSA Rennes (France); /*Hervé Dejean*/, Xerox Research (France); /*Xiaoqing Ding*/, Tsinghua Univ. (China); /*David Doermann*/, Univ.of Maryland/College Park; /*Oleg Golubitsky*/, Google, Inc.; /*Jianying Hu*/, IBM Thomas J. Watson Research Ctr.; /*Laurence Likforman-Sulem*/, Telecom ParisTech (France); /*Marcus Liwicki*/, DFKI (Germany); /*Xiaofan Lin*/, Vobile Inc; /*Daniel Lopresti*/, Lehigh Univ.; /*Hiroshi Sako*/, Hitachi,Ltd. (Japan); /*Sargur Srihari*/, Univ. at Buffalo; /*Venkata Subramaniam*/, IBM India Research Lab. (India); /*Kazem Taghva*/, Univ. of Nevada/Las Vegas; /*George Thoma*/, National Library of Medicine; /*Alessandro Vinciarelli*/, University of Glasgow (United Kingdom); /*Berrin Yanikoglu*/, Sabanci Univ. (Turkey); /*Jie Zou*/, National Library of Medicine ============================================================================ ========== Call for Papers: AND 2011 (updated) ============================= Fifth Workshop on Analytics for Noisy Unstructured Text Data (AND 2011) 11th Int. Conf. on Document Analysis and Recognition (ICDAR) September 17th, 2011 Beijing, China Noisy unstructured text data is ubiquitous and abundant in real-world situations. Handling noisy text poses new challenges for Information Extraction (IE), Natural Language Processing (NLP), Information Retrieval (IR) and Knowledge Management (KM). Special handling of noise as well as noise-robust IR and KM techniques are essential to overcome these challenges. As in the case of AND 07, 08, 09, and 10, we intend that AND 2011 will provide researchers an opportunity to present their latest results toward addressing these challenges. We seek papers dealing with all aspects of noisy unstructured text data, its processing and applications. We particularly encourage contributions that look toward solving real life problems. Topics of interest (but are not limited to): * Noise induced by document analysis techniques and its impact on downstream applications * Formal theory on characterization of noise * Genre recognition based on the type of noise * Robust parsing and Part of Speech (POS) tagging * Characterizing, modelling and accounting for historical language change * Methods for detecting and correcting errors in noisy text * Information extraction and retrieval from noisy text data * Automatic classification and clustering of noisy unstructured data * Noise-invariant document summarization techniques * Issues in keyword search in presence of noise in unstructured data * Machine Translation for noisy text * Analyzing very short communications like those on Twitter * Techniques for analysis and mining of call-logs, transcribed calls, web logs, chat logs, emails, tweets * Business Intelligence (BI) applications dealing with noisy text data * Surveys relating to noisy text analytics Submission Guidelines We invite papers up to 8 pages in length in the style specified at (http://and2011.cse.lehigh.edu/). Accepted papers will be included in the ACM Digital Library. The best student paper of AND 2011 will receive the IAPR Best Student Paper Award. Important Dates Abstract Submission: EXTENDED to June 19th, 2011 Paper Submission: EXTENDED to June 26th, 2011 Notification of Acceptance: July 25th, 2011 Camera-Ready papers due: August 8th, 2011 ============================================================================ ========== Call for Papers: HIP 2011 (updated) ============================= First International Workshop on Historical Document Imaging and Processing (HIP) Beijing, China, Sept. 16-17, 2011, (at ICDAR 2011) (http://www.comp.nus.edu.sg/~hdocp) HIP seeks to bring together archivists, curators and researchers, from private industry, government, and academia, who are involved in all aspects of imaging, collection and processing of historical documents. The workshop will feature a keynote by Mr. ZHANG Zhiqing, Deputy Director of the National Library of China (NLC) (http://www.nlc.gov.cn/en/service/acbooks.htm) Workshop attendees will also participate in a post-workshop visit to the NLC to view a variety of special collections. HIP topics of interest include: * Imaging and Image Acquisition * Digital Archiving Considerations * Historical Collections * Document Restoration/Improving readability * Content Extraction * Family History Documents and Genealogies * Automated Classification, Grouping and Hyperlinking of Historical Documents See http://www.comp.nus.edu.sg/~hdocp for full details. Authors should submit original work using guidelines found at http://www.comp.nus.edu.sg/~hdocp/. Accepted papers will be published in the printed conference proceedings, available to participants at the workshop, and electronically as part of the ACM International Conference Proceedings Series. Important NEW Dates: June 20, 2011 Manuscripts due July 15, 2011 Acceptance notification August 15, 2011 Camera-ready manuscript due August 15, 2011 Advance registration due Registration: Full Registration USD $100 Student Registration USD $50 General Chairs: William A. Barrett (Brigham Young University) Michael S. Brown (National University of Singapore) Program Chair: R. Manmatha (University of Massachusetts, Amherst, USA) Organizing Chair: Jake Gehring (Manager, FamilySearch Data Operations, USA) Ms. Long Wei (Research Librarian, National Library of China) Program Committee: Apostolos Antonacopoulos* Josep Lladós Mohamed Cheriet Shijian Lu Xiaoqing Ding Simone Marinai David Doermann Heath Nielson Scott Eldredge Liangrui Peng Basilis Gatos Alexandra Psarrou Venu Govindaraju Jack Reese C.J. Jawahar Eric Ringger LianWen Jin Zhixin Shi George Landon Chew Lim Tan Cheng-Lin Liu Eugene Walach Competition: Historical Document Layout Analysis* For details visit: http://www.primaresearch.org/ICDAR2011_competition ============================================================================ ========== Call for Papers: MOCR 2011 (updated) ============================ 3rd International Workshop on Multilingual OCR ----------------------------------------------- (In conjunction with ICDAR 2011) Beijing Friendship Hotel - Beijing, China September 17, 2011 http://cubs.buffalo.edu/MOCR/ Please Note: Submission deadline has been extended to 06/20/2011 MOCR 2011 invites the submission of original, previously unpublished work in any of the areas of interest to the multilingual OCR community. Submissions could also include preliminary experimental results based on new approaches, descriptions of newly created open data sets, proposals for competitions and extended work highlighting recent results that were too late for the ICDAR 2011 deadline. The workshop will explore methodologies for multilingual document analysis systems with particular focus on OCR. The scope of Multilingual OCR is defined to include systems that are capable of reading more than one language in the same document, as well as one-language-per-document systems that can be easily retargeted to new languages. The workshop will provide a forum for technical discussions on three important themes: (i) recent progress in the field of multilingual OCR (ii) adaptation and repurposing of proven methods for multilingual OCR (iii) hard open research problems and promising new approaches Contributions addressing (but not limited to) the following areas are invited: * Recognition Domains : Script and Language identification, Machine-print and handwriting recognition * Evaluation Methodologies: Metrics, Standards, Ground truthing, Benchmark datasets * Multiple languages: Techniques applicable/retargetable to multiple languages/scripts * Document Analysis: Layout analysis, Reading order, Structured objects such as tables, forms * Document types: Contemporary as well as historical documents, video text * Recognition methodologies: Techniques such as HMMs, features for recognition, use of language models etc. For submission and other details, please visit the Workshop web-site: http://cubs.buffalo.edu/MOCR/ Registration for the workshop will be handled by the ICDAR 2011 registration system. Important Dates: * Paper submission due: June 20, 2011 * Reviews completed: July 15, 2011 * Notification of acceptance: July 20, 2011 * Camera ready copies due: August 1, 2011 * Workshop: September 17, 2011 Organizing Committee: General Co-Chairs: * Santanu Chaudhury, IIT Delhi, India * Venu Govindaraju, University at Buffalo - SUNY, USA * Daniel Lopresti, Lehigh University, USA * Prem Natarajan, Raytheon BBN Technologies, USA Technical Program and Publications Chair: * Srirangaraj Setlur, University at Buffalo - SUNY, USA _____________________ ICDAR 2011 Secretariat Email: icdar2011@gmail.com ============================================================================ ========== Call for Contributions ========================================== This newsletter needs your support in order to provide useful information to the TC11 community. Therefore, please contribute relevant news by sending a short notice to the newsletter editor Gernot A. Fink . Such news could be the obvious announcements of conferences and workshops, job opportunities, reports on past conferences, book reviews, or anything that might be of interest to a wider audience involved in the construction of reading systems. ============================================================================ ========== IJDAR Contents Telegram (Volume 14 Number 2) ==================== IJDAR Table of Contents. Volume 14 Number 2 Editorial Special issue on noisy text analytics Daniel Lopresti, Shourya Roy, Klaus Schulz & L. Venkata Subramaniam Report from the AND 2009 working group on noisy text datasets Simone Marinai & Dimosthenis Karatzas Text retrieval from early printed books Simone Marinai A word spotting framework for historical machine-printed documents A. L. Kesidis, E. Galiotou, B. Gatos & I. Pratikakis Unconstrained handwritten document retrieval Huaigu Cao, Venu Govindaraju & Anurag Bhardwaj Towards information retrieval on historical document collections: the role of matching procedures and special lexica Annette Gotscharek, Ulrich Reffle, Christoph Ringlstetter, Klaus U. Schulz & Andreas Neumann Character confusion versus focus word-based correction of spelling and OCR variants in corpora Martin W. C. Reynaert Robust named entity detection from optical character recognition output Krishna Subramanian, Rohit Prasad & Prem Natarajan Domain-specific entity extraction from noisy, unstructured data using ontology-guided search Sergey Bratus, Anna Rumshisky, Alexy Khrabrov, Rajenda Magar & Paul Thompson Supervised semantic relation mining from linguistically noisy text documents Cristina Giannone, Roberto Basili, Paolo Naggar & Alessandro Moschitti Digital weight watching: reconstruction of scanned documents Maarten Marx & Tim Gielissen ============================================================================ ========== News: Advertisement possibilities in IJDAR ====================== News from IJDAR Starting with Volume 14 we have some pages in each printed issue for advertisement of events relevant for IJDAR, such as call for papers. Interested people can contact Simone Marinai (simone.marinai@unifi.it) ============================================================================ ========== Subscription Information ======================================== This newsletter is sent to subscribers of the IAPR TC11 mailing list. To manage your subscription, please visit the mailing list homepage at: https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=IAPR-TC11 The homepage for IAPR TC11 is http://www.iapr-tc11.org ============================================================================