ISBN: 3-540-65866-1
TITLE: Methodologies for Knowledge Discovery and Data Mining
AUTHOR: Zhong, Ning; Zhou, Lizhu (Eds.)
TOC:

Invited Talks
KDD as an Enterprise IT Tool: Reality and Agenda 1
W. Kim
Computer Assisted Discovery of First Principle Equations from Numeric Data 2
H. Motoda
Emerging KDD Technology
Data Mining: A Rough Set Perspective 3
Z. Pawlak
Data Mining Techniques for Associations, Clustering and Classification 13
C. C. Aggarwal, P. S. Yu
Data Mining: Granular Computing Approach 24
T. Y. Lin
Rule Extraction from Prediction Models 34
H. Tsukimoto
Association Rules
Mining Association Rules on Related Numeric Attributes 44
X. Du, Z. Liu, N. Ishii
LGen - A Lattice-Based Candidate Set Generation Algorithm for I/O Efficient Association Rule Mining 54
C. L. Yip, K. K. Loo, B. Kao, D. W. Cheung, C. K. Cheng
Extending the Applicability of Association Rules 64
K. Rajamani, S. Sung, A. Cox
An Efficient Approach for Incremental Association Rule Mining 74
P. S. M. Tsai, C.-C. Lee, A. L. P. Chen
Association Rules in Incomplete Databases 84
M. Kryszkiewicz
Parallel SQL Based Association Rule Mining on Large Scale PC Cluster: Performance Comparison with Directly Coded C Implementation 94
I. Pramudiono, T. Shintani, T. Tamura, M. Kitsuregawa
H-Rule Mining in Heterogeneous Databases 99
Y. Yang, M. Singhal
An Improved Definition of Multidimensional, Inter-transaction Association Rule 104
A. Zhou, S. Zhou, W. Jin, Z. Tian
Incremental Discovering Association Rules: A Concept Lattice Approach 109
K. Hu, Y. Lu, C. Shi
Feature Selection and Generation
Induction as Pre-processing
X. Wu 114
Stochastic Attribute Selection Committees with Multiple Boosting: Learning More Accurate and More Stable Classifier Committees 123
Z. Zheng, G. I. Webb
On Information-Theoretic Measures of Attribute Importance 133
Y. Y. Yao, S. K. M. Wong, C. J. Butz
A Technique of Dynamic Feature Selection Using the Feature Group Mutual Information 138
K.-C. Lee
A Data Pre-processing Method Using Association Rules of Attributes for Improving Decision Tree 143
M. Terabe, O. Katai, T. Sawaragi, T. Washio, H. Motoda
Mining in Semi, Un-structured Data
An Algorithm for Constrained Association Rule Mining in Semi-structured Data 148
L. Singh, B. Chen, R. Haight, P. Scheuermann
Incremental Mining of Schema for Semi-structured Data 159
A. Zhou, W. Jin, S. Zhou, Z. Tian
Discovering Structure from Document Databases 169
M.-F. Jiang, S.-S. Tseng, C.-J. Tsai
Combining Forecasts from Multiple Textual Data Sources 174
V. Cho, B. Wthrich
Domain Knowledge Extracting in a Chinese Natural Language Interface to Databases: NChiql 179
X. Meng, Y. Zhou, S. Wang
Interestingness, Surprisingness, and Exceptions
Evolutionary Hot Spots Data Mining: An Architecture for Exploring for Interesting Discoveries 184
G. J. Williams
Efficient Search of Reliable Exceptions 194
H. Liu, H. Lu, L. Feng, F. Hussain
Heuristics for Ranking the Interestingness of Discovered Knowledge 204
R. J. Hilderman, H. J. Hamilton
Rough Sets, Fuzzy Logic, and Neural Networks
Automated Discovery of Plausible Rules Based on Rough Sets and Rough Inclusion 210
S. Tsumoto
Discernibility System in Rough Sets 220
Z. Liu, Z. Xie
Automatic Labeling of Self-Organizing Maps: Making a Treasure-Map Reveal Its Secrets 228
A. Rauber, D. Merkl
Neural Network Based Classifiers for a Vast Amount of Data 238
L. Zhang, B. Zhang
Accuracy Tuning on Combinatorial Neural Model 247
H. A. Prado, K. F. Machado, S. R. Frigeri, P. M. Engel
A Situated Information Articulation Neural Network: VSF Network 252
Y. Kakemoto, S. Nakasuka
Neural Method for Detection of Complex Patterns in Databases 258
C. Deng, F. Xiong
Preserve Discovered Linguistic Patterns Valid in Volatility Data Environment 263
X. Shi, M.-C. Chan, D. Li
An Induction Algorithm Based on Fuzzy Logic Programming 268
D. Shibata, N. Inuzuka, S. Kato, T. Matsui, H. Itoh
Rule Discovery in Databases with Missing Values Based on Rough Set Model 274
S. Tsumoto
Sustainability Knowledge Mining from Human Development Database 279
X. Wang, R. Wang, J. Wang
Induction, Classification, and Clustering
Characterization of Default Knowledge in Ripple Down Rules Method 284
T. Wada, T. Horiuchi, H. Motoda, T. Washio
Improving the Performance of Boosting for Naive Bayesian Classification 296
K. M. Ting, Z. Zheng
Convex Hulls in Concept Induction 306
D. A. Newlands, G. I. Webb
Mining Classification Knowledge Based on Cloud Models 317
J. Fan, D. Li
Robust Clustering of Large Geo-referenced Data Sets 327
V. Estivill-Castro, M. E. Houle
A Fast Algorithm for Density-Based Clustering in Large Database 338
B. Zhou, D. W. Cheung, B. Kao
A Lazy Model-Based Algorithm for On-Line Classification 350
G. Melli
An Efficient Space-Partitioning Based Algorithm for the K-Means Clustering 355
K. AlSabti, S. Ranka, V. Singh
A Fast Clustering Process for Outliers and Remainder Clusters 360
C.-M. Su, S.-S. Tseng, M.-F. Jiang, J. C. S. Chen
Optimising the Distance Metric in the Nearest Neighbour Algorithm on a Real-World Patient Classification Problem 365
H. He, S. Hawkins
Classifying Unseen Cases with Many Missing Values 370
Z. Zheng, B. T. Low
Study of a Mixed Similarity Measure for Classification and Clustering 375
T. B. Ho, N. B. Nguyen, T. Morita
Visualization
Visually Aided Exploration of Interesting Association Rules 380
B. Liu, W. Hsu, K. Wang, S. Chen
DVIZ: A System for Visualizing Data Mining 390
J. Han, N. Cercone
Causal Model and Graph-Based Methods
A Minimal Causal Model Learner 400
H. Dai
Efficient Graph-Based Algorithm for Discovering and Maintaining Knowledge in Large Databases 409
K. L. Lee, G. Lee, A. L. P. Chen
Basket Analysis for Graph Structured Data 420
A. Inokuchi, T. Washio, H. Motoda, K. Kumasawa, N. Arai
The Evolution of Causal Models: A Comparison of Bayesian Metrics and Structure Priors 432
J. R. Neil, K. B. Korb
KD-FGS: A Knowledge Discovery System from Graph Data Using Formal Graph System 438
T. Miyahara, T. Uchida, T. Kuboyama, T. Yamamoto, K. Takahashi, H. Ueda
Agent-Based, and Distributed Data Mining
Probing Knowledge in Distributed Data Mining 443
Y. Guo, J. Sutiwaraphun
Discovery of Equations and the Shared Operational Semantics in Distributed Autonomous Databases 453
Z. W. Ras, J. M. Zytkow
The Data-Mining and the Technology of Agents to Fight the Illicit Electronic Messages 464
A. Zighed, M. C ot, N. Troudi
Knowledge Discovery in SportsFinder: An Agent to Extract Sports Results from the Web 469
H. Lu, L. Sterling, A. Wyatt
Event Mining with Event Processing Networks 474
L. Perrochon, W. Mann, S. Kasriel, D. C. Luckham
Advanced Topics and New Methodologies
An Analysis of Quantitative Measures Associated with Rules 479
Y. Y. Yao, N. Zhong
A Strong Relevant Logic Model of Epistemic Processes in Scientific Discovery 489
J. Cheng
Discovering Conceptual Differences among Different People via Diverse Structures 494
T. Yoshida, T. Kondo, S. Nishida
Ordered Estimation of Missing Values 499
O. O. Lobo, M. Numao
Prediction Rule Discovery Based on Dynamic Bias Selection 504
E. Suzuki, T. Ohno
Discretization of Continuous Attributes for Learning Classification Rules 509
A. An, N. Cercone
BRRA: A Based Relevant Rectangle Algorithm for Mining Relationships in Databases 515
S. Ben Yahia, A. Jaoua
Mining Functional Dependency Rule of Relational Database 520
X. Tao, N. Wang, S. Zhou, A. Zhou, Y. Hu
Time-Series Prediction with Cloud Models in DMKD 525
R. Jiang, D. Li, H. Chen
Author Index 531
END
