Automatic Digital Document Processing and Management: Problems, Algorithms and Techniques

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

Computer-readable documents have become ubiquitous in everyday life - from legacy documents that have been digitized, to new documents that have been created electronically. As the number of electronic documents continues to grow, so does the importance of digital methods for processing and managing these documents.

This comprehensive text/reference provides a broad review of the issues involved in handling and processing digital documents. Examining the full range of a document's lifetime, the book covers acquisition, representation, security, pre-processing, layout analysis, understanding, analysis of single components, information extraction, filing, indexing and retrieval. A background knowledge of the area is not required, beyond familiarity with basic concepts of computer science and mathematics; deeper technical content is provided in discrete subsections that are not essential for an understanding of other parts of the book.

Topics and features:

  • With a Foreword by Professor George Nagy of Rensselaer Polytechnic Institute, New York, USA
  • Provides a list of acronyms and a glossary of technical terms
  • Contains appendices covering key concepts in machine learning, and providing a case study on building an intelligent system for digital document and library management
  • Discusses issues of security, and legal aspects of digital documents
  • Examines core issues of document image analysis, and image processing techniques of particular relevance to digitized documents
  • Reviews the resources available for natural language processing, in addition to techniques of linguistic analysis for content handling
  • Investigates methods for extracting and retrieving data/information from a document, including representation at a semantic level

Undergraduate and graduate students will find the text a valuable general reference on the subject, and researchers will discover how their specific area of interest is interrelated with other disciplines involved in digital document processing. The book also supplies a repertoire of potential technological solutions for professionals working on digital documents.

Dr. Stefano Ferilli is an associate professor at the University of Bari, Italy, where he is Director of the Interdepartmental Center for Logic and Applications.

Author(s): Stefano Ferilli (auth.)
Series: Advances in Pattern Recognition
Edition: 1
Publisher: Springer-Verlag London
Year: 2011

Language: English
Pages: 297
Tags: Image Processing and Computer Vision

Front Matter....Pages I-XXVI
Front Matter....Pages 1-2
Documents....Pages 3-13
Digital Formats....Pages 15-71
Legal and Security Aspects....Pages 73-109
Front Matter....Pages 111-112
Image Processing....Pages 113-143
Document Image Analysis....Pages 145-196
Front Matter....Pages 197-198
Natural Language Processing....Pages 199-222
Information Management....Pages 223-255
Back Matter....Pages 257-297