String Processing and Information Retrieval: 13th International Conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006. Proceedings

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

This volume contains the papers presented at the 13th International Symposium on String Processing and Information Retrieval (SPIRE), held October 11-13, 2006, in Glasgow, Scotland. The SPIRE annual symposium provides an opportunity for both new and established researchers to present original contributions to areas such as string processing (dictionary algorithms, text searching, pattern matching, text c- pression, text mining, natural language processing, and automata-based string processing); information retrieval languages, applications, and evaluation (IR modelling, indexing, ranking and ?ltering, interface design, visualization, cro- lingual IR systems, multimedia IR, digital libraries, collaborative retrieval, W- related applications, XML, information retrieval from semi-structured data, text mining, and generation of structured data from text); and interaction of biology and computation (sequencing and applications in molecular biology, evolution and phylogenetics, recognition of genes and regulatory elements, and sequen- driven protein structure prediction). The papers in this volume were selected from 102 papers submitted from over 20 di?erent countries in response to the Call for Papers. A total of 26 submissions were accepted as full papers, yielding an acceptance rate of about 25%. In view of the large number of good-quality submissions the Program Committee decided to accept 5 short papers, that have also been included in the proceedings. SPIRE 2006 also featured two talks by invited speakers: Jamie Callan (Carnegie Mellon University, USA) and Martin Farach-Colton (Rutgers University, USA).

Author(s): Andrea Esuli, Tiziano Fagni, Fabrizio Sebastiani (auth.), Fabio Crestani, Paolo Ferragina, Mark Sanderson (eds.)
Series: Lecture Notes in Computer Science 4209 : Theoretical Computer Science and General Issues
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2006

Language: English
Pages: 370
Tags: Information Storage and Retrieval; Artificial Intelligence (incl. Robotics); Database Management; Data Structures; Coding and Information Theory; Algorithm Analysis and Problem Complexity

Front Matter....Pages -
MP-Boost: A Multiple-Pivot Boosting Algorithm and Its Application to Text Categorization....Pages 1-12
TreeBoost.MH: A Boosting Algorithm for Multi-label Hierarchical Text Categorization....Pages 13-24
Cluster Generation and Cluster Labelling for Web Snippets: A Fast and Accurate Hierarchical Solution....Pages 25-36
Principal Components for Automatic Term Hierarchy Building....Pages 37-48
Computing the Minimum Approximate λ -Cover of a String....Pages 49-60
Sparse Directed Acyclic Word Graphs....Pages 61-73
On-Line Repetition Detection....Pages 74-85
Analyzing User Behavior to Rank Desktop Items....Pages 86-97
The Intention Behind Web Queries....Pages 98-109
Compact Features for Detection of Near-Duplicates in Distributed Retrieval....Pages 110-121
Inverted Files Versus Suffix Arrays for Locating Patterns in Primary Memory....Pages 122-133
Efficient Lazy Algorithms for Minimal-Interval Semantics....Pages 134-149
Output-Sensitive Autocompletion Search....Pages 150-162
A Compressed Self-index Using a Ziv-Lempel Dictionary....Pages 163-180
Mapping Words into Codewords on PPM....Pages 181-192
Improving Usability Through Password-Corrective Hashing....Pages 193-204
Word-Based Correction for Retrieval of Arabic OCR Degraded Documents....Pages 205-216
A Statistical Model of Query Log Generation....Pages 217-228
Using String Comparison in Context for Improved Relevance Feedback in Different Text Media....Pages 229-241
A Multiple Criteria Approach for Information Retrieval....Pages 242-254
English to Persian Transliteration....Pages 255-266
Efficient Algorithms for Pattern Matching with General Gaps and Character Classes....Pages 267-278
Matrix Tightness: A Linear-Algebraic Framework for Sorting by Transpositions....Pages 279-290
How to Compare Arc-Annotated Sequences: The Alignment Hierarchy....Pages 291-303
Structured Index Organizations for High-Throughput Text Querying....Pages 304-315
Adaptive Query-Based Sampling of Distributed Collections....Pages 316-328
Dotted Suffix Trees A Structure for Approximate Text Indexing....Pages 329-336
Phrase-Based Pattern Matching in Compressed Text....Pages 337-345
Discovering Context-Topic Rules in Search Engine Logs....Pages 346-353
Incremental Aggregation of Latent Semantics Using a Graph-Based Energy Model....Pages 354-359
A New Algorithm for Fast All-Against-All Substring Matching....Pages 360-366
Back Matter....Pages -