This book constitutes the proceedings of the First Information Retrieval Facility Conference, IRFC 2010, held in Vienna, Austria, im May 2010. The 11 papers presented were carefully reviewed and selected from 20 high-quality submissions. IRF conferences wish to resonate in particular with young researchers. This first conference aimed to tackle four complementary research areas: information retrieval, semantic web technologies for IT, natural language processing for IR, and large-scale or distributed computing for the above areas.
Author(s): Hamish Cunningham, Allan Hanbury, Stefan RĂ¼ger
Series: Lecture ... Applications, incl. Internet/Web, and HCI
Edition: 1st Edition.
Publisher: Springer
Year: 2010
Language: English
Pages: 175
Cover
......Page 1
Advances
in Multidisciplinary
Retrieval......Page 3
Lecture Notes in Computer Science 6107......Page 2
ISBN-10 3642130836......Page 4
Preface......Page 5
Table of Contents......Page 9
How Much Search Can You Afford?......Page 10
Patent Retrieval......Page 12
References......Page 13
Introduction......Page 15
Attachment Prediction......Page 16
The Corpus......Page 17
Amazon Mechanical Turk......Page 19
Task Design......Page 20
Quality of Annotations......Page 21
Learning to Predict Attachments......Page 22
Learning with Skewed Data......Page 23
Feature Selection......Page 24
Experiments on Attachment Prediction on the Document Level......Page 25
Discussion and Future Work......Page 26
References......Page 27
Introduction......Page 29
Matching Text to Readers......Page 30
Language......Page 31
Structure......Page 33
Readability Based Web Ranking......Page 34
Results......Page 36
Conclusions and Future Work......Page 38
References......Page 39
Introduction......Page 40
Related Work......Page 42
Constructing Knowledge Representations......Page 43
Extracting Representative Term Sets......Page 45
Integration Strategy......Page 47
Experimental Setup......Page 48
Retrieval Setup......Page 49
Query Dependency Analysis......Page 50
Parameter Optimization for BM25F......Page 51
References......Page 53
Introduction......Page 56
Latent Dirichlet Allocation......Page 57
Explicit Semantic Analysis......Page 58
Making Use of Concept Models for CLIR......Page 59
Experiments......Page 60
Mate Retrieval on Multext JOC......Page 61
Query-Based Retrieval with CLEF2000......Page 63
Conclusion......Page 66
References......Page 67
Introduction......Page 69
Distributed Dimensional Data Model......Page 71
Indexing Process......Page 73
Entity Queries......Page 74
Distributed Processing......Page 75
Results......Page 76
Conclusion......Page 77
References......Page 78
Introduction......Page 79
Background......Page 81
Statistical Significance Tests......Page 82
Experiments......Page 83
The Variance of a Bounded Metric......Page 84
The Variability of Transformed Scores......Page 85
Variability as a Tie Breaker......Page 87
The Effect of Topic Set Size on Measuring Variability in Effectiveness......Page 88
Summary and Discussion......Page 89
References......Page 91
Appendix A......Page 92
Introduction......Page 93
Related Work......Page 95
Discrete Fourier Transform......Page 97
Spectral Leakage......Page 98
Digital Filtering......Page 99
Query to Spectrum Transformation......Page 100
Document to Filter Transformation......Page 101
Document Ranking......Page 102
Some Preliminary Results......Page 103
Example......Page 104
Conclusions and Future Work......Page 105
References......Page 107
Introduction......Page 109
Logic-Based Retrieval......Page 111
Probabilistic Datalog......Page 112
Distributed Information Retrieval......Page 113
Data Source Selection......Page 114
DF-Based Selection......Page 115
LM-Based Selection......Page 117
Retrieval Result Fusion......Page 118
Retrieval Strategy Modelling......Page 120
Overview......Page 122
Parallel Processing......Page 123
Evaluation......Page 124
Summary and Conclusions......Page 126
References......Page 127
Introduction......Page 129
System Overview......Page 132
Training and Evaluation Corpora......Page 134
Conditional Random Fields for Bibliographical Information Extraction......Page 135
Patents References......Page 137
Non Patent Literature......Page 139
Patent References......Page 140
Non-patent Literature......Page 141
Conclusion......Page 143
References......Page 144
Introduction......Page 145
Related Work......Page 146
A Brief Survey of Probabilistic IR......Page 148
Trying to Depart from Classical Probabilistic IR......Page 150
A Mathematical Analysis......Page 151
Objectives and Research Questions......Page 153
Design of the Experiments......Page 154
Results......Page 156
Discussion......Page 158
Concluding Remarks and Future Directions......Page 159
References......Page 160
Introduction......Page 161
Named Entity Recognition......Page 163
Results......Page 165
PubMed Central......Page 167
TREC-CHEM......Page 169
Discussion......Page 172
References......Page 173
Author Index......Page 175