Advances in Knowledge Discovery and Data Mining: 11th Pacific-Asia Conference, PAKDD 2007, Nanjing, China, May 22-25, 2007. Proceedings

This document was uploaded by one of our users. The uploader already confirmed that they had the permission to publish it. If you are author/publisher or own the copyright of this documents, please report to us by using this DMCA report form.

Simply click on the Download Book button.

Yes, Book downloads on Ebookily are 100% Free.

Sometimes the book is free on Amazon As well, so go ahead and hit "Search on Amazon"

This book constitutes the refereed proceedings of the 11th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2007, held in Nanjing, China in May 2007.

The 34 revised full papers and 92 revised short papers presented together with four keynote talks or extended abstracts thereof were carefully reviewed and selected from 730 submissions. The papers are devoted to new ideas, original research results and practical development experiences from all KDD-related areas including data mining, machine learning, databases, statistics, data warehousing, data visualization, automatic scientific discovery, knowledge acquisition and knowledge-based systems.

Author(s): Jiawei Han (auth.), Zhi-Hua Zhou, Hang Li, Qiang Yang (eds.)
Series: Lecture Notes in Computer Science 4426 : Lecture Notes in Artificial Intelligence
Edition: 1
Publisher: Springer-Verlag Berlin Heidelberg
Year: 2007

Language: English
Pages: 1161
Tags: Artificial Intelligence (incl. Robotics); Data Mining and Knowledge Discovery; Information Storage and Retrieval; Probability and Statistics in Computer Science; Multimedia Information Systems; Computer Appl. in Administrative Data Proce

Front Matter....Pages -
Research Frontiers in Advanced Data Mining Technologies and Applications....Pages 1-5
Finding the Real Patterns....Pages 6-6
Class Noise vs Attribute Noise: Their Impacts, Detection and Cleansing....Pages 7-8
Multi-modal and Multi-granular Learning....Pages 9-10
Hierarchical Density-Based Clustering of Categorical Data and a Simplification....Pages 11-22
Multi-represented Classification Based on Confidence Estimation....Pages 23-34
Selecting a Reduced Set for Building Sparse Support Vector Regression in the Primal....Pages 35-46
Mining Frequent Itemsets from Uncertain Data....Pages 47-58
QC4 - A Clustering Evaluation Method....Pages 59-70
Semantic Feature Selection for Object Discovery in High-Resolution Remote Sensing Imagery....Pages 71-83
Deriving Private Information from Arbitrarily Projected Data....Pages 84-95
Consistency Based Attribute Reduction....Pages 96-107
A Hybrid Command Sequence Model for Anomaly Detection....Pages 108-118
σ - Algorithm : Structured Workflow Process Mining Through Amalgamating Temporal Workcases....Pages 119-130
Multiscale BiLinear Recurrent Neural Network for Prediction of MPEG Video Traffic....Pages 131-137
An Effective Multi-level Algorithm Based on Ant Colony Optimization for Bisecting Graph....Pages 138-149
A Unifying Method for Outlier and Change Detection from Data Streams Based on Local Polynomial Fitting....Pages 150-161
Simultaneous Tuning of Hyperparameter and Parameter for Support Vector Machines....Pages 162-172
Entropy Regularization, Automatic Model Selection, and Unsupervised Image Segmentation....Pages 173-182
A Timing Analysis Model for Ontology Evolutions Based on Distributed Environments....Pages 183-192
An Optimum Random Forest Model for Prediction of Genetic Susceptibility to Complex Diseases....Pages 193-204
Feature Based Techniques for Auto-Detection of Novel Email Worms....Pages 205-216
Multiresolution-Based BiLinear Recurrent Neural Network....Pages 217-223
Query Expansion Using a Collection Dependent Probabilistic Latent Semantic Thesaurus....Pages 224-235
Scaling Up Semi-supervised Learning: An Efficient and Effective LLGC Variant....Pages 236-247
A Machine Learning Approach to Detecting Instantaneous Cognitive States from fMRI Data....Pages 248-259
Discovering Correlated Items in Data Streams....Pages 260-271
Incremental Clustering in Geography and Optimization Spaces....Pages 272-283
Estimation of Class Membership Probabilities in the Document Classification....Pages 284-295
A Hybrid Multi-group Privacy-Preserving Approach for Building Decision Trees....Pages 296-307
A Constrained Clustering Approach to Duplicate Detection Among Relational Data....Pages 308-319
Understanding Research Field Evolving and Trend with Dynamic Bayesian Networks....Pages 320-331
Embedding New Data Points for Manifold Learning Via Coordinate Propagation....Pages 332-343
Spectral Clustering Based Null Space Linear Discriminant Analysis (SNLDA)....Pages 344-354
On a New Class of Framelet Kernels for Support Vector Regression and Regularization Networks....Pages 355-366
A Clustering Algorithm Based on Mechanics....Pages 367-378
DLDA/QR: A Robust Direct LDA Algorithm for Face Recognition and Its Theoretical Foundation....Pages 379-387
gPrune: A Constraint Pushing Framework for Graph Pattern Mining....Pages 388-400
Modeling Anticipatory Event Transitions....Pages 401-408
A Modified Relationship Based Clustering Framework for Density Based Clustering and Outlier Filtering on High Dimensional Datasets....Pages 409-416
A Region-Based Skin Color Detection Algorithm....Pages 417-424
Supportive Utility of Irrelevant Features in Data Preprocessing....Pages 425-432
Incremental Mining of Sequential Patterns Using Prefix Tree....Pages 433-440
A Multiple Kernel Support Vector Machine Scheme for Simultaneous Feature Selection and Rule-Based Classification....Pages 441-448
Combining Supervised and Semi-supervised Classifier for Personalized Spam Filtering....Pages 449-456
Qualitative Simulation and Reasoning with Feature Reduction Based on Boundary Conditional Entropy of Knowledge....Pages 457-464
A Hybrid Incremental Clustering Method-Combining Support Vector Machine and Enhanced Clustering by Committee Clustering Algorithm....Pages 465-472
CCRM: An Effective Algorithm for Mining Commodity Information from Threaded Chinese Customer Reviews....Pages 473-480
A Rough Set Approach to Classifying Web Page Without Negative Examples....Pages 481-488
Evolution and Maintenance of Frequent Pattern Space When Transactions Are Removed....Pages 489-497
Establishing Semantic Relationship in Inter-query Learning for Content-Based Image Retrieval Systems....Pages 498-506
Density-Sensitive Evolutionary Clustering....Pages 507-514
Reducing Overfitting in Predicting Intrinsically Unstructured Proteins....Pages 515-522
Temporal Relations Extraction in Mining Hepatitis Data....Pages 523-530
Supervised Learning Approach to Optimize Ranking Function for Chinese FAQ-Finder....Pages 531-538
Combining Convolution Kernels Defined on Heterogeneous Sub-structures....Pages 539-546
Privacy-Preserving Sequential Pattern Release....Pages 547-554
Mining Concept Associations for Knowledge Discovery Through Concept Chain Queries....Pages 555-562
Capability Enhancement of Probabilistic Neural Network for the Design of Breakwater Armor Blocks....Pages 563-570
Named Entity Recognition Using Acyclic Weighted Digraphs: A Semi-supervised Statistical Method....Pages 571-578
Contrast Set Mining Through Subgroup Discovery Applied to Brain Ischaemina Data....Pages 579-586
Intelligent Sequential Mining Via Alignment: Optimization Techniques for Very Large DB....Pages 587-597
A Hybrid Prediction Method Combining RBF Neural Network and FAR Model....Pages 598-605
An Advanced Fuzzy C-Mean Algorithm for Regional Clustering of Interconnected Systems....Pages 606-615
Centroid Neural Network with Bhattacharyya Kernel for GPDF Data Clustering....Pages 616-622
Concept Interconnection Based on Many-Valued Context Analysis....Pages 623-630
Text Classification for Thai Medicinal Web Pages....Pages 631-638
A Fast Algorithm for Finding Correlation Clusters in Noise Data....Pages 639-647
Application of Discrimination Degree for Attributes Reduction in Concept Lattice....Pages 648-655
A Language and a Visual Interface to Specify Complex Spatial Patterns....Pages 656-663
Clustering Ensembles Based on Normalized Edges....Pages 664-671
Quantum-Inspired Immune Clonal Multiobjective Optimization Algorithm....Pages 672-679
Phase Space Reconstruction Based Classification of Power Disturbances Using Support Vector Machines....Pages 680-687
Mining the Impact Factors of Threads and Participators on Usenet Using Link Analysis....Pages 688-695
Weighted Rough Set Learning: Towards a Subjective Approach....Pages 696-703
Multiple Self-Splitting and Merging Competitive Learning Algorithm....Pages 704-711
A Novel Relative Space Based Gene Feature Extraction and Cancer Recognition....Pages 712-719
Experiments on Kernel Tree Support Vector Machines for Text Categorization....Pages 720-727
A New Approach for Similarity Queries of Biological Sequences in Databases....Pages 728-736
Anomaly Intrusion Detection Based on Dynamic Cluster Updating....Pages 737-744
Efficiently Mining Closed Constrained Frequent Ordered Subtrees by Using Border Information....Pages 745-752
Approximate Trace of Grid-Based Clusters over High Dimensional Data Streams....Pages 753-760
BRIM: An Efficient Boundary Points Detecting Algorithm....Pages 761-768
Syntactic Impact on Sentence Similarity Measure in Archive-Based QA System....Pages 769-776
Semi-structure Mining Method for Text Mining with a Chunk-Based Dependency Structure....Pages 777-784
Principal Curves with Feature Continuity....Pages 785-792
Kernel-Based Linear Neighborhood Propagation for Semantic Video Annotation....Pages 793-800
Learning Bayesian Networks with Combination of MRMR Criterion and EMI Method....Pages 801-808
A Cooperative Coevolution Algorithm of RBFNN for Classification....Pages 809-816
ANGEL: A New Effective and Efficient Hybrid Clustering Technique for Large Databases....Pages 817-824
Exploring Group Moving Pattern for an Energy-Constrained Object Tracking Sensor Network....Pages 825-832
ProMail: Using Progressive Email Social Network for Spam Detection....Pages 833-840
Multidimensional Decision Support Indicator (mDSI) for Time Series Stock Trend Prediction....Pages 841-848
A Novel Support Vector Machine Ensemble Based on Subtractive Clustering Analysis....Pages 849-856
Keyword Extraction Based on PageRank....Pages 857-864
Finding the Optimal Feature Representations for Bayesian Network Learning....Pages 865-870
Feature Extraction and Classification of Tumor Based on Wavelet Package and Support Vector Machines....Pages 871-878
Resource Allocation and Scheduling Problem Based on Genetic Algorithm and Ant Colony Optimization....Pages 879-886
Image Classification and Segmentation for Densely Packed Aggregates....Pages 887-894
Mining Temporal Co-orientation Pattern from Spatio-temporal Databases....Pages 895-903
Incremental Learning of Support Vector Machines by Classifier Combining....Pages 904-911
Clustering Zebrafish Genes Based on Frequent-Itemsets and Frequency Levels....Pages 912-920
A Practical Method for Approximate Subsequence Search in DNA Databases....Pages 921-931
An Information Retrieval Model Based on Semantics....Pages 932-939
AttributeNets: An Incremental Learning Method for Interpretable Classification....Pages 940-947
Mining Personalization Interest and Navigation Patterns on Portal....Pages 948-955
Cross-Lingual Document Clustering....Pages 956-963
Grammar Guided Genetic Programming for Flexible Neural Trees Optimization....Pages 964-971
A New Initialization Method for Clustering Categorical Data....Pages 972-980
L0-Constrained Regression for Data Mining....Pages 981-988
Application of Hybrid Pattern Recognition for Discriminating Paddy Seeds of Different Storage Periods Based on Vis/NIRS....Pages 989-996
Density-Based Data Clustering Algorithms for Lower Dimensions Using Space-Filling Curves....Pages 997-1005
Transformation-Based GMM with Improved Cluster Algorithm for Speaker Identification....Pages 1006-1014
Using Social Annotations to Smooth the Language Model for IR....Pages 1015-1021
Affection Factor Optimization in Data Field Clustering....Pages 1022-1028
A New Algorithm for Minimum Attribute Reduction Based on Binary Particle Swarm Optimization with Vaccination....Pages 1029-1036
Graph Nodes Clustering Based on the Commute-Time Kernel....Pages 1037-1045
Identifying Synchronous and Asynchronous Co-regulations from Time Series Gene Expression Data....Pages 1046-1054
A Parallel Algorithm for Learning Bayesian Networks....Pages 1055-1063
Incorporating Prior Domain Knowledge into a Kernel Based Feature Selection Algorithm....Pages 1064-1071
Geo-spatial Clustering with Non-spatial Attributes and Geographic Non-overlapping Constraint: A Penalized Spatial Distance Measure....Pages 1072-1079
GBKII: An Imputation Method for Missing Values....Pages 1080-1087
An Effective Gene Selection Method Based on Relevance Analysis and Discernibility Matrix....Pages 1088-1095
Towards Comprehensive Privacy Protection in Data Clustering....Pages 1096-1104
A Novel Spatial Clustering with Obstacles Constraints Based on Particle Swarm Optimization and K-Medoids....Pages 1105-1113
Online Rare Events Detection....Pages 1114-1121
Structural Learning About Independence Graphs from Multiple Databases....Pages 1122-1130
An Effective Method For Calculating Natural Adjacency Relation in Spatial Database....Pages 1131-1139
K-Centers Algorithm for Clustering Mixed Type Data....Pages 1140-1147
Proposion and Analysis of a TCP Feature of P2P Traffic....Pages 1148-1155
Back Matter....Pages -