CLOUD-BASED AI/ML MODEL DEPLOYMENT: A COMPARATIVE ANALYSIS OF MANAGED AND SELF-MANAGED PLATFORMS

Authors

  • Snehansh Devera Konda Visvesvaraya Technological University, India Author

Keywords:

Cloud-Based Machine Learning (ML) Infrastructure, AI Model Deployment Architecture, Managed ML Platforms, Enterprise AI/ML Operations, Cloud Computing Infrastructure

Abstract

The widespread adoption of artificial intelligence and machine learning (AI/ML) technologies has created an urgent need for efficient and scalable deployment solutions across industries. This article presents a comprehensive analysis of cloud-based AI/ML model deployment strategies, examining both managed platforms offered by major cloud providers (AWS SageMaker, Google Vertex AI, and Microsoft Azure Machine Learning) and self-managed infrastructure solutions. Through systematic evaluation of platform capabilities, infrastructure requirements, and organizational considerations, the article develops a decision framework to guide enterprises in selecting appropriate deployment architectures. The article analysis reveals that while managed platforms offer significant advantages in terms of reduced complexity, automated infrastructure management, and faster time-to-market, self-managed solutions provide superior customization capabilities and potential cost benefits at scale for organizations with sufficient technical expertise. The article synthesizes implementation data from multiple enterprise case studies to identify critical success factors in AI/ML deployment, including infrastructure scalability, monitoring capabilities, and resource optimization. Furthermore, the article proposes a novel evaluation matrix for assessing the total cost of ownership across different deployment scenarios, incorporating both direct infrastructure costs and indirect expenses related to expertise and maintenance. These findings contribute to the growing body of knowledge on enterprise AI/ML operations while providing practical guidance for organizations navigating the complex landscape of cloud-based model deployment strategies.

References

J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," Communications of the ACM, vol. 51, no. 1, pp. 107-113, 2008. https://dl.acm.org/doi/10.1145/1327452.1327492

Chengcheng Wan, Shicheng Liu, Henry Hoffmann, Michael Maire, and Shan Lu, "Are Machine Learning Cloud APIs Used Correctly?" IEEE/ACM 43rd International Conference on Software Engineering (ICSE), 2021. https://ieeexplore.ieee.org/abstract/document/9402073

Waldemar Hummer, Vinod Muthusamy, Thomas Rausch, Parijat Dube, Kaoutar El Maghraoui, and Anupama Murthi, "ModelOps: Cloud-Based Lifecycle Management for Reliable and Trusted AI," IEEE International Conference on Cloud Engineering (IC2E), 2019. https://ieeexplore.ieee.org/abstract/document/8790192

Igor L. Markov, Pavlos A. Apostolopoulos, Mia R. Garrard, Tanya Qie, Yin Huang, Tanvi Gupta, Anika Li, Cesar Cardoso, George Han, Ryan Maghsoudian, and Norm Zhou, "Scalable End-to-End ML Platforms: from AutoML to Self-serve," arXiv, 2023. https://arxiv.org/abs/2302.14139

Omid Gheibi, Danny Weyns, and Federico Quin, "Applying Machine Learning in Self-adaptive Systems: A Systematic Literature Review," ACM Transactions on Autonomous and Adaptive Systems, 2021. https://dl.acm.org/doi/fullHtml/10.1145/3469440

B. Mayer and K. M. Lackner, "Selection of an IoT Platform: A Framework for a Two-Stage Multi-Criteria Decision Making Process," IEEE International Conference on Computer Technology Applications (ICCTA), 2022. https://dl.acm.org/doi/fullHtml/10.1145/3543712.3543750

A. Rashid and A. Chaturvedi, "Cloud Computing Characteristics and Services: A Brief Review," International Journal of Computer Sciences and Engineering, vol. 7, no. 2, pp. 421-426, 2019. https://www.ijcseonline.org/full_paper_view.php?paper_id=3680

A. Géron, "Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems," O'Reilly Media, 2024. https://www.bibguru.com/b/how-to-cite-hands-on-machine-learning-with-scikit-learn-keras-and-tensorflow-concepts-tools-and-techniques-to-build-intelligent-systems/

Published

2024-12-16

How to Cite

Snehansh Devera Konda. (2024). CLOUD-BASED AI/ML MODEL DEPLOYMENT: A COMPARATIVE ANALYSIS OF MANAGED AND SELF-MANAGED PLATFORMS. INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING AND TECHNOLOGY (IJCET), 15(6), 1380-1396. https://mylib.in/index.php/IJCET/article/view/1745