This book constitutes the thoroughly refereed post-proceedings of the First International Joint Conference on Natural Language Processing, IJCNLP 2004, held on Hainan Island, China, in March 2004.
The 84 revised full papers presented in this volume were carefully selected during two rounds of reviewing and improvement from 211 papers submitted. The papers are organized in topical sections on dialogue and discourse; FSA and parsing algorithms; information extraction and question answering; information retrieval; lexical semantics, ontologies, and linguistic resources; machine translation and multilinguality; NLP software and applications; semantic disambiguation; statistical models and machine learning; taggers, chunkers, and shallow parsers; text and sentence generation; text mining; theories and formalisms for morphology, syntax, and semantics; word segmentation; NLP in mobile information retrieval and user interfaces; and text mining in bioinformatics.
Author(s): Matthias Denecke, Kohji Dohsaka, Mikio Nakano (auth.), Keh-Yih Su, Jun’ichi Tsujii, Jong-Hyeok Lee, Oi Yee Kwong (eds.)
Series: Lecture Notes in Computer Science 3248 : Lecture Notes in Artificial Intelligence
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2005
Language: English
Pages: 822
Tags: Artificial Intelligence (incl. Robotics); Mathematical Logic and Formal Languages; Language Translation and Linguistics; Information Storage and Retrieval; Algorithm Analysis and Problem Complexity; Document Preparation and Text Processi
Front Matter
Fast Reinforcement Learning of Dialogue Policies Using Stable Function Approximation....Pages 1-11
Zero Pronoun Resolution Based on Automatically Constructed Case Frames and Structural Preference of Antecedents....Pages 12-21
Improving Noun Phrase Coreference Resolution by Matching Strings....Pages 22-31
Combining Labeled and Unlabeled Data for Learning Cross-Document Structural Relationships....Pages 32-41
Parsing Mixed Constructions in a Type Feature Structure Grammar....Pages 42-51
Iterative CKY Parsing for Probabilistic Context-Free Grammars....Pages 52-60
Causal Relation Extraction Using Cue Phrase and Lexical Pair Probabilities....Pages 61-70
A Re-examination of IR Techniques in QA System....Pages 71-80
A Novel Pattern Learning Method for Open Domain Question Answering....Pages 81-89
Chinese Named Entity Recognition Based on Multilevel Linguistic Features....Pages 90-99
Information Flow Analysis with Chinese Text....Pages 100-109
Phoneme-Based Transliteration of Foreign Names for OOV Problem....Pages 110-119
Window-Based Method for Information Retrieval....Pages 120-129
Improving Relevance Feedback in Language Modeling Approach: Maximum a Posteriori Probability Criterion and Three-Component Mixture Model....Pages 130-138
BBS Based Hot Topic Retrieval Using Back-Propagation Neural Network....Pages 139-148
How Effective Is Query Expansion for Finding Novel Information?....Pages 149-157
The Hinoki Treebank: A Treebank for Text Understanding....Pages 158-167
Building a Parallel Bilingual Syntactically Annotated Corpus....Pages 168-176
Acquiring Bilingual Named Entity Translations from Content-Aligned Corpora....Pages 177-186
Visual Semantics and Ontology of Eventive Verbs....Pages 187-196
A Persistent Feature-Object Database for Intelligent Text Archive Systems....Pages 197-205
Example-Based Machine Translation Without Saying Inferable Predicate....Pages 206-215
Improving Back-Transliteration by Combining Information Sources....Pages 216-223
Bilingual Sentence Alignment Based on Punctuation Statistics and Lexicon....Pages 224-232
Automatic Learning of Parallel Dependency Treelet Pairs....Pages 233-243
Practical Translation Pattern Acquisition from Combined Language Resources....Pages 244-253
An English-Hindi Statistical Machine Translation System....Pages 254-262
Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model....Pages 263-271
Selecting Prosody Parameters for Unit Selection Based Chinese TTS....Pages 272-279
Natural Language Database Access Using Semi-automatically Constructed Translation Knowledge....Pages 280-289
Korean Stochastic Word-Spacing with Dynamic Expansion of Candidate Words List....Pages 290-298
You Don’t Have to Think Twice if You Carefully Tokenize....Pages 299-309
Automatic Genre Detection of Web Documents....Pages 310-319
Statistical Substring Reduction in Linear Time....Pages 320-327
Detecting Sentence Boundaries in Japanese Speech Transcriptions Using a Morphological Analyzer....Pages 328-337
Specification Retrieval – How to Find Attribute-Value Information on the Web....Pages 338-347
Conceptual Information-Based Sense Disambiguation....Pages 348-357
Influence of WSD on Cross-Language Information Retrieval....Pages 358-366
Resolution of Modifier-Head Relation Gaps Using Automatically Extracted Metonymic Expressions....Pages 367-376
Word Sense Disambiguation Using Heterogeneous Language Resources....Pages 377-385
Improving Word Sense Disambiguation by Pseudo-samples....Pages 386-395
Long Distance Dependency in Language Modeling: An Empirical Study....Pages 396-405
Word Folding: Taking the Snapshot of Words Instead of the Whole....Pages 406-415
Bilingual Chunk Alignment Based on Interactional Matching and Probabilistic Latent Semantic Indexing....Pages 416-425
Learning to Filter Junk E-Mail from Positive and Unlabeled Examples....Pages 426-435
A Collaborative Ability Measurement for Co-training....Pages 436-445
Flexible Margin Selection for Reranking with Full Pairwise Samples....Pages 446-455
A Comparative Study on the Use of Labeled and Unlabeled Data for Large Margin Classifiers....Pages 456-465
Comparing Entropies within the Chinese Language....Pages 466-475
NTPC: N-fold Templated Piped Correction....Pages 476-486
A Three Level Cache-Based Adaptive Chinese Language Model....Pages 487-492
Using a Smoothing Maximum Entropy Model for Chinese Nominal Entity Tagging....Pages 493-499
Deterministic Dependency Structure Analyzer for Chinese....Pages 500-508
High Speed Unknown Word Prediction Using Support Vector Machine for Chinese Text-to-Speech Systems....Pages 509-517
Syntactic Analysis of Long Sentences Based on S-Clauses....Pages 518-526
Chinese Chunk Identification Using SVMs Plus Sigmoid....Pages 527-536
Tagging Complex NEs with MaxEnt Models: Layered Structures Versus Extended Tagset....Pages 537-544
A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity....Pages 545-554
Detection of Incorrect Case Assignments in Paraphrase Generation....Pages 555-565
Building a Pronominalization Model by Feature Selection and Machine Learning....Pages 566-575
Categorizing Unknown Text Segments for Information Extraction Using a Search Result Mining Approach....Pages 576-586
Mining Table Information on the Internet....Pages 587-595
Collecting Evaluative Expressions for Opinion Extraction....Pages 596-605
A Study of Semi-discrete Matrix Decomposition for LSI in Automated Text Categorization....Pages 606-615
Systematic Construction of Hierarchical Classifier in SVM-Based Text Categorization....Pages 616-625
Implementing the Syntax of Japanese Numeral Classifiers....Pages 626-635
A Graph Grammar Approach to Map Between Dependency Trees and Topological Models....Pages 636-645
The Automatic Acquisition of Verb Subcategorisations and Their Impact on the Performance of an HPSG Parser....Pages 646-654
Chinese Treebanks and Grammar Extraction....Pages 655-663
FML-Based SCF Predefinition Learning for Chinese Verbs....Pages 664-673
Deep Analysis of Modern Greek....Pages 674-683
Corpus-Oriented Grammar Development for Acquiring a Head-Driven Phrase Structure Grammar from the Penn Treebank....Pages 684-693
Unsupervised Segmentation of Chinese Corpus Using Accessor Variety....Pages 694-703
Chinese Unknown Word Identification Using Class-Based LM....Pages 704-713
An Example-Based Study on Chinese Word Segmentation Using Critical Fragments....Pages 714-722
The Use of SVM for Chinese New Word Identification....Pages 723-732
Chinese New Word Finding Using Character-Based Parsing Model....Pages 733-742
Thematic Session: Natural Language Technology in Mobile Information Retrieval and Text Processing User Interfaces....Pages 743-744
Spoken Versus Written Queries for Mobile Information Access: An Experiment on Mandarin Chinese....Pages 745-754
An Interactive Proofreading System for Inappropriately Selected Words on Using Predictive Text Entry....Pages 755-764
Dit4dah: Predictive Pruning for Morse Code Text Entry....Pages 765-775
Thematic Session: Text Mining in Biomedicine....Page 776
Unsupervised Event Extraction from Biomedical Literature Using Co-occurrence Information and Basic Patterns....Pages 777-786
Annotation of Gene Products in the Literature with Gene Ontology Terms Using Syntactic Dependencies....Pages 787-796
Mining Biomedical Abstracts: What’s in a Term?....Pages 797-806
SVM-Based Biological Named Entity Recognition Using Minimum Edit-Distance Feature Boosted by Virtual Examples....Pages 807-814
Back Matter