Data mining techniques

Markus Hegland

doi:10.1017/S0962492901000058

Abstract

Methods for knowledge discovery in data bases (KDD) have been studied for more than a decade. New methods are required owing to the size and complexity of data collections in administration, business and science. They include procedures for data query and extraction, for data cleaning, data analysis, and methods of knowledge representation. The part of KDD dealing with the analysis of the data has been termed data mining. Common data mining tasks include the induction of association rules, the discovery of functional relationships (classification and regression) and the exploration of groups of similar data objects in clustering. This review provides a discussion of and pointers to efficient algorithms for the common data mining tasks in a mathematical framework. Because of the size and complexity of the data sets, efficient algorithms and often crude approximations play an important role.

Information

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Semenova, Tatiana Hegland, Markus Graco, Warwick and Williams, Graham 2004. Advances in Knowledge Discovery and Data Mining. Vol. 3056, Issue. , p. 659.

Hájek, Petr 2005. Data Mining and Knowledge Discovery Handbook. p. 589.

Kourgiantakis, Markos Mandalianos, Iraklis Migdalas, Athanasios and Pardalos, Panos M. 2006. Handbook of Optimization in Telecommunications. p. 1017.

Hájek, Petr 2009. Data Mining and Knowledge Discovery Handbook. p. 541.

Volnyansky, Ilya and Pestov, Vladimir 2009. Curse of Dimensionality in Pivot Based Indexes. p. 39.

Taub, Michelle Azevedo, Roger Bouchet, François and Khosravifar, Babak 2014. Can the use of cognitive and metacognitive self-regulated learning strategies be predicted by learners’ levels of prior knowledge in hypermedia-learning environments?. Computers in Human Behavior, Vol. 39, Issue. , p. 356.

Suh, Junyeub Kim, Changhyeon Sung, Wonjin So, Jaewoo and Heo, Seo Weon 2017. Construction of a Generalized DFT Codebook Using Channel-Adaptive Parameters. IEEE Communications Letters, Vol. 21, Issue. 1, p. 196.

Cheng, Meng-Tzu Rosenheck, Louisa Lin, Chen-Yen and Klopfer, Eric 2017. Analyzing gameplay data to inform feedback loops in The Radix Endeavor. Computers & Education, Vol. 111, Issue. , p. 60.

Sadeghzadeh, Keivan and Fard, Nasser 2017. Analytical clustering procedures in massive failure data. p. 1.

Bohn, Bastian and Griebel, Michael 2017. Error Estimates for Multivariate Regression on Discretized Function Spaces. SIAM Journal on Numerical Analysis, Vol. 55, Issue. 4, p. 1843.

Saoud, Manel Saad Boubetra, Abdelhak and Attia, Safa 2017. A Multi-Agent Based Modeling and Simulation Data Management and Analysis System for the Hospital Emergency Department. International Journal of Healthcare Information Systems and Informatics, Vol. 12, Issue. 3, p. 21.

Saoud, Manel Saad Boubetra, Abdelhak and Attia, Safa 2018. Handbook of Research on Emerging Perspectives on Healthcare Information Systems and Informatics. p. 347.

Bohn, Bastian 2018. Sparse Grids and Applications - Miami 2016. Vol. 123, Issue. , p. 19.

Ooms, Richard Spruit, Marco R. and Overbeek, Sietse 2019. 3PM Revisited. International Journal of Business Intelligence Research, Vol. 10, Issue. 1, p. 80.

Saoud, Manel Saad Boubetra, Abdelhak and Attia, Safa 2020. Hospital Management and Emergency Medicine. p. 192.

Durand, Taetse and Hattingh, Marie 2020. Data Mining and Artificial Intelligence Techniques Used to Extract Big Data Patterns. p. 1.

Saoud, Manel Saad Boubetra, Abdelhak and Attia, Safa 2020. Hospital Management and Emergency Medicine. p. 27.

Saoud, Manel Saad Boubetra, Abdelhak and Attia, Safa 2021. Research Anthology on Decision Support Systems and Decision Management in Healthcare, Business, and Engineering. p. 367.

Hiller, T. Deipenwisch, L. and Nyhuis, P. 2022. Systemising Data-driven Methods for Predicting Throughput Time within Production Planning & Control. p. 0716.

Choi, Soyoung 2023. Deriving the Types and Characteristics of Lost Children in South Korea Using the Sequential Association Rule. Behavioral Sciences, Vol. 13, Issue. 5, p. 393.

Download full list

Article contents

Data mining techniques

Abstract

Information

Access options

Article purchase

Temporarily unavailable

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

Data mining techniques

Abstract

Information

Access options

Article purchase

Temporarily unavailable

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests