Document Analysis Systems VII: 7th International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006. Proceedings

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

This book constitutes the refereed proceedings of the 7th International Conference on Document Analysis Systems, DAS 2006, held in Nelson, New Zealand, in February 2006.

The 33 revised full papers and 22 poster papers presented were carefully reviewed and selected from 78 submissions. The papers are organized in topical sections on digital libraries, image processing, handwriting, document structure and format, tables, language and script identification, systems and performance evaluation, and retrieval and segmentation.

Author(s): A. Balasubramanian, Million Meshesha, C. V. Jawahar (auth.), Horst Bunke, A. Lawrence Spitz (eds.)
Series: Lecture Notes in Computer Science 3872
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2006

Language: English
Pages: 632
Tags: Pattern Recognition; Information Storage and Retrieval; Image Processing and Computer Vision; Simulation and Modeling; Computer Appl. in Administrative Data Processing

Front Matter....Pages -
Retrieval from Document Image Collections....Pages 1-12
A Semi-automatic Adaptive OCR for Digital Libraries....Pages 13-24
Contribution to the Discrimination of the Medieval Manuscript Texts: Application in the Palaeography....Pages 25-37
Restoring Ink Bleed-Through Degraded Document Images Using a Recursive Unsupervised Classification Technique....Pages 38-49
Networked Document Imaging with Normalization and Optimization....Pages 50-61
Gray-Scale Thinning Algorithm Using Local Min/Max Operations....Pages 62-70
Automated Scoring of Handwritten Essays Based on Latent Semantic Analysis....Pages 71-83
Aligning Transcripts to Automatically Segmented Handwritten Manuscripts....Pages 84-95
Virtual Example Synthesis Based on PCA for Off-Line Handwritten Character Recognition....Pages 96-105
Extraction of Handwritten Text from Carbon Copy Medical Form Images....Pages 106-116
Document Logical Structure Analysis Based on Perceptive Cycles....Pages 117-128
A System for Converting PDF Documents into Structured XML Format....Pages 129-140
XCDF: A Canonical and Structured Document Format....Pages 141-152
Structural Analysis of Mathematical Formulae with Verification Based on Formula Description Grammar....Pages 153-163
Notes on Contemporary Table Recognition....Pages 164-175
Handwritten Artefact Identification Method for Table Interpretation with Little Use of Previous Knowledge....Pages 176-185
Writer Identification for Smart Meeting Room Systems....Pages 186-195
Extraction and Analysis of Document Examiner Features from Vector Skeletons of Grapheme ‘th’....Pages 196-207
Segmentation of On-Line Handwritten Japanese Text Using SVM for Improving Text Recognition....Pages 208-219
Application of Bi-gram Driven Chinese Handwritten Character Segmentation for an Address Reading System....Pages 220-231
Language Identification in Degraded and Distorted Document Images....Pages 232-242
Bangla/English Script Identification Based on Analysis of Connected Component Profiles....Pages 243-254
Script Identification from Indian Documents....Pages 255-267
Finding the Best-Fit Bounding-Boxes....Pages 268-279
Towards Versatile Document Analysis Systems....Pages 280-290
Exploratory Analysis System for Semi-structured Engineering Logs....Pages 291-301
Ground Truth for Layout Analysis Performance Evaluation....Pages 302-311
On Benchmarking of Invoice Analysis Systems....Pages 312-323
Semi-automatic Ground Truth Generation for Chart Image Recognition....Pages 324-335
Efficient Word Retrieval by Means of SOM Clustering and PCA....Pages 336-347
The Effects of OCR Error on the Extraction of Private Information....Pages 348-357
Combining Multiple Classifiers for Faster Optical Character Recognition....Pages 358-367
Performance Comparison of Six Algorithms for Page Segmentation....Pages 368-379
HVS Inspired System for Script Identification in Indian Multi-script Documents....Pages 380-389
A Shared Fragments Analysis System for Large Collections of Web Pages....Pages 390-401
Offline Handwritten Arabic Character Segmentation with Probabilistic Model....Pages 402-412
Automatic Keyword Extraction from Historical Document Images....Pages 413-424
Digitizing a Million Books: Challenges for Document Analysis....Pages 425-436
Toward File Consolidation by Document Categorization....Pages 437-448
Finding Hidden Semantics of Text Tables....Pages 449-461
Reconstruction of Orthogonal Polygonal Lines....Pages 462-473
A Multiclass Classification Framework for Document Categorization....Pages 474-483
The Restoration of Camera Documents Through Image Segmentation....Pages 484-495
Cut Digits Classification with k-NN Multi-specialist....Pages 496-505
The Impact of OCR Accuracy and Feature Transformation on Automatic Text Classification....Pages 506-517
A Method for Symbol Spotting in Graphical Documents....Pages 518-528
Groove Extraction of Phonographic Records....Pages 529-540
Use of Affine Invariants in Locally Likely Arrangement Hashing for Camera-Based Document Image Retrieval....Pages 541-552
Robust Chinese Character Recognition by Selection of Binary-Based and Grayscale-Based Classifier....Pages 553-563
Segmentation-Driven Recognition Applied to Numerical Field Extraction from Handwritten Incoming Mail Documents....Pages 564-575
Performance Evaluation of Text Detection and Tracking in Video....Pages 576-587
Document Analysis System for Automating Workflows....Pages 588-592
Automatic Assembling of Cadastral Maps Based on Generalized Hough Transformation....Pages 593-603
A Few Steps Towards On-the-Fly Symbol Recognition with Relevance Feedback....Pages 604-615
The Fuzzy-Spatial Descriptor for the Online Graphic Recognition: Overlapping Matrix Algorithm....Pages 616-627
Back Matter....Pages -