INVESTIGATING DATA MINING METHODS FOR PATTERN AND RELATIONSHIP DETECTION IN LARGE DATASETS
Keywords:
Data Mining, Pattern Detection, Relationship Detection, Clustering, Neural Networks, Large Datasets, Association Rule Mining, Predictive Analytics, Comparative Analysis, Big DataAbstract
Data mining methods play a crucial role in uncovering hidden patterns and relationships in large datasets, providing valuable insights for decision-making and predictive analytics. This paper investigates various data mining techniques such as association rule mining, clustering, and neural networks, with a focus on their ability to detect patterns and relationships in large-scale data. A comparative analysis of these methods is presented, evaluating their performance on different types of data and metrics such as accuracy, precision, and computational efficiency. The results highlight the strengths and limitations of each method, suggesting areas for future improvement in the application of data mining for complex datasets.
References
Agrawal, R., Imieliński, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. ACM SIGMOD Record, 22(2), 207-216.
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273-297.
Bayyapu, S. (2022). Optimizing IT sourcing in healthcare: Balancing control, cost, and innovation. International Journal of Computer Applications, 3(1), 14-20.
Friedman, J., Hastie, T., & Tibshirani, R. (2001). The elements of statistical learning. Springer Series in Statistics.
Heckerman, D. (1996). A tutorial on learning with Bayesian networks. Innovations in Bayesian Networks, 156, 301-354.
Kaul, D. (2022). AI-Driven Decentralized Authentication System Using Homomorphic Encryption. International Journal of Advanced Research in Engineering and Technology (IJARET), 13(3), 74–84.
Hinton, G. E., Osindero, S., & Teh, Y. W. (2012). A fast learning algorithm for deep belief nets. Neural Computation, 18(7), 1527-1554.
Bayyapu, S. (2021). Bridging the gap: Overcoming data, technological, and human roadblocks to AI-driven healthcare transformation. Journal of Management (JOM), 8(1), 7-14.
Hsu, C. W., Chang, C. C., & Lin, C. J. (2003). A practical guide to support vector classification. Department of Computer Science, National Taiwan University.
Jain, A. K. (2010). Data clustering: 50 years beyond k-means. Pattern Recognition Letters, 31(8), 651-666.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436-444.
Loh, W. Y. (2011). Classification and regression trees. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1(1), 14-23.
Bayyapu, S. (2020). Blockchain healthcare: Redefining data ownership and trust in the medical ecosystem. International Journal of Advanced Research in Engineering and Technology (IJARET), 11(11), 2748-2755.
MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, 281-297.
Kaul, D. (2021). AI-Driven Dynamic Upsell in Hotel Reservation Systems Based on Cybersecurity Risk Scores. International Journal of Computer Engineering and Technology (IJCET), 12(3), 114–125.
Quinlan, J. R. (1986). Induction of decision trees. Machine Learning, 1(1), 81-106.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Salumanda, Christian (Author)
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.