HAN
21-bib-633-672-9780123814791
2011/6/1
3:27
Page 671
#39
Bibliography
671
[Zha08]
C. Zhai.
Statistical Language Models for Information Retrieval. Morgan and Claypool,
2008.
[ZHL
+
98]
O. R. Za¨ıane, J. Han, Z. N. Li, J. Y. Chiang, and S. Chee. MultiMedia-Miner: A sys-
tem prototype for multimedia data mining. In Proc. 1998 ACM-SIGMOD Int. Conf.
Management of Data (SIGMOD’98), pp. 581–583, Seattle, WA, June 1998.
[Zhu05]
X. Zhu. Semi-supervised learning literature survey. In
Computer Sciences Technical
Report 1530, University of Wisconsin–Madison, 2005.
[ZHZ00]
O. R. Za¨ıane, J. Han, and H. Zhu. Mining recurrent items in multimedia with progressive
resolution refinement. In Proc. 2000 Int. Conf. Data Engineering (ICDE’00), pp. 461–470,
San Diego, CA, Feb. 2000.
[Zia91]
W. Ziarko. The discovery, analysis, and representation of data dependencies in databases.
In G. Piatetsky-Shapiro and W. J. Frawley (eds.), Knowledge Discovery in Databases,
pp. 195–209. AAAI Press, 1991.
[ZL06]
Z.-H. Zhou and X.-Y. Liu. Training cost-sensitive neural networks with methods
addressing the class imbalance problem. IEEE Trans. Knowledge and Data Engineering,
18:63–77, 2006.
[ZPOL97]
M. J. Zaki, S. Parthasarathy, M. Ogihara, and W. Li. Parallel algorithm for discovery of
association rules. Data Mining and Knowledge Discovery, 1:343–374, 1997.
[ZRL96]
T. Zhang, R. Ramakrishnan, and M. Livny. BIRCH: An efficient data clustering method
for very large databases. In
Proc. 1996 ACM-SIGMOD Int. Conf. Management of Data
(SIGMOD’96), pp. 103–114, Montreal, Quebec, Canada, June 1996.
[ZS02]
N. Zapkowicz and S. Stephen. The class imbalance program: A systematic study.
Intelligence Data Analysis, 6:429–450, 2002.
[ZYH
+
07]
F. Zhu, X. Yan, J. Han, P. S. Yu, and H. Cheng. Mining colossal frequent patterns by
core pattern fusion. In Proc. 2007 Int. Conf. Data Engineering (ICDE’07), pp. 706–715,
Istanbul, Turkey, Apr. 2007.
[ZYHY07]
F. Zhu, X. Yan, J. Han, and P. S. Yu. gPrune: A constraint pushing framework for graph
pattern mining. In Proc. 2007 Pacific-Asia Conf. Knowledge Discovery and Data Mining
(PAKDD’07), pp. 388–400, Nanjing, China, May 2007.
[ZZ09]
Z. Zhang and R. Zhang.
Multimedia Data Mining: A Systematic Introduction to Concepts
and Theory. Chapman & Hall, 2009.
[ZZH09]
D. Zhang, C. Zhai, and J. Han. Topic cube: Topic modeling for OLAP on multi-
dimensional text databases. In Proc. 2009 SIAM Int. Conf. Data Mining (SDM’09),
pp. 1123–1134, Sparks, NV, Apr. 2009.
EDELKAMP
19-ch15-671-700-9780123725127
2011/5/28
14:50
Page 672
#2
This page intentionally left blank
HAN
22-ind-673-708-9780123814791
2011/6/1
3:27
Page 673
#1
Index
Numbers and Symbols
.632 bootstrap, 371
δ-bicluster algorithm, 517–518
δ-pCluster, 518–519
A
absolute-error criterion, 455
absolute support, 246
abstraction levels, 281
accuracy
attribute construction and, 105
boosting, 382
with bootstrap, 371
classification, 377–385
classifier, 330, 366
with cross-validation, 370–371
data, 84
with holdout method, 370
measures, 369
random forests, 383
with random subsampling, 370
rule selection based on, 361
activation function, 402
active learning, 25, 430, 437
ad hoc data mining, 31
AdaBoost, 380–382
algorithm illustration, 382
TrAdaBoost, 436
adaptive probabilistic networks, 397
advanced data analysis, 3, 4
advanced database systems, 4
affinity matrix, 520, 521
agglomerative hierarchical method, 459
AGNES, 459, 460
divisive hierarchical clustering versus,
459–460
Agglomerative Nesting (AGNES), 459, 460
aggregate cells, 189
aggregation, 112
bootstrap, 379
complex data types and, 166
cube computation and, 193
data cube, 110–111
at multiple granularities, 230–231
multiway array, 195–199
simultaneous, 193, 195
AGNES. See Agglomerative Nesting
algebraic measures, 145
algorithms. See specific algorithms
all confidence
measure, 268, 272
all-versus-all (AVA), 430–431
analysis of variance (ANOVA), 600
analytical processing, 153
ancestor cells, 189
angle-based outlier detection (ABOD), 580
angle-based outlier factor (ABOF), 580
anomalies. See outliers
anomaly mining. See outlier analysis
anomaly-based detection, 614
antimonotonic constraints, 298, 301
antimonotonic measures, 194
antimonotonicity, 249
apex cuboids, 111, 138, 158
application domain-specific semantics, 282
applications, 33, 607–618
business intelligence, 27
computer science, 613
domain-specific, 625
engineering, 613, 624
exploration, 623
financial data analysis, 607–609
intrusion detection/prevention, 614–615
recommender systems, 615–618
retail industry, 609–611
science, 611–613
social science and social studies, 613
673