A CONTINUOUS INTEGRATION AND CONTINUOUS DEPLOYMENT FRAMEWORK FOR SAGEMAKER REAL-TIME INFERENCE
Keywords:
Machine Learning Operations, Amazon Web Service, Real-Time InferenceAbstract
Real-time inference in a modern machine learning workflow requires robust deployment and monitoring to ensure models are delivering accurate and timely predictions. This paper elaborates on the implementation details of a CI/CD framework for deploying SageMaker real-time inference models by automating model packaging, deployment, and monitoring processes, integrating key approval steps that assure model performance and stakeholder involvement before production deployment. The workflow is designed to take full advantage of AWS Step Functions, SageMaker Model Registry, and other AWS services to make this transition from development to production as seamless as possible.
References
"TarBall - Debian Wiki," Debian. [Online]. Available: https://wiki.debian.org/TarBall.
"Using XGBoost with SageMaker — Write an inference script," Amazon SageMaker Documentation.
[Online]. Available: https://sagemaker.readthedocs.io/en/stable/frameworks/xgboost/using_xgboost.html#write-an-inference-script.
"SageMaker Python SDK — Model," Amazon SageMaker Documentation. [Online]. Available: https://sagemaker.readthedocs.io/en/stable/api/inference/model.html.
"Model Registry," Amazon SageMaker Documentation. [Online]. Available: https://docs.aws.amazon.com/sagemaker/latest/dg/model-registry.html.
"Using a Model Package in SageMaker," Amazon SageMaker Documentation. [Online]. Available: https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-mkt-model-pkg-model.html.
"What is Amazon CloudWatch?" Amazon CloudWatch Documentation. [Online]. Available: https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/WhatIsCloudWatch.html "AWS CodeCommit — Source Control Service," Amazon Web Services. [Online]. Available: https://aws.amazon.com/codecommit/.
"AWS CodeDeploy — User Guide," Amazon Web Services. [Online]. Available: https://docs.aws.amazon.com/codedeploy/latest/userguide/welcome.html.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Gautham Ram Rajendiran (Author)
![Creative Commons License](http://i.creativecommons.org/l/by-nc/4.0/88x31.png)
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.