Canvas CloudAI
Canvas Cloud AI

Model Serving

advanced
ai & ml
Enhanced Content

Definition

Making trained AI models available to applications through APIs or services for making predictions. Like opening a restaurant that serves dishes created from tested recipes.

Real-World Example

Model serving infrastructure hosts a language translation model that applications can call via API to translate text in real-time.

Related Terms

Cloud Provider Equivalencies

All four provide managed endpoints to deploy trained models and expose them via HTTPS APIs for real-time (and in some cases batch/async) predictions, with autoscaling, monitoring, and access control.

AWS
Amazon SageMaker (Real-Time Inference, Serverless Inference, Asynchronous Inference)
AZ
Azure Machine Learning (Online Endpoints, Managed Online Endpoints)
GCP
Vertex AI (Online Prediction Endpoints)
OCI
OCI Data Science (Model Deployment)

Explore More Cloud Computing Terms