Definition
Model serving is the process of making a trained machine learning (ML) model available in production through APIs or web services, so that applications can send it inputs and receive predictions. It covers load management, scaling, latency monitoring, and version management. Managed platforms such as Amazon SageMaker and Google Vertex AI, along with open source solutions, make deployment accessible to small and medium-sized enterprises (SMEs) as well.
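To make the definition concrete, here is a minimal sketch of a serving endpoint that uses only the Python standard library. It is an illustration under stated assumptions, not how any of the platforms named above work internally: the `MODELS` registry, the `predict` and `serve` functions, the `/predict/<version>` route, and port 8000 are all hypothetical, and the "models" are plain functions standing in for trained models. It shows three of the concerns listed above in miniature: exposing a model over an API, selecting a model version, and measuring latency per request.

```python
import json
import time
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical registry: each version is a plain function standing in
# for a trained model (an assumption made for illustration only).
MODELS = {
    "v1": lambda features: sum(features),
    "v2": lambda features: sum(features) / len(features),
}
DEFAULT_VERSION = "v2"


def predict(payload, version=DEFAULT_VERSION):
    """Run one prediction; report the serving version and the latency."""
    if version not in MODELS:
        raise KeyError(f"unknown model version: {version}")
    start = time.perf_counter()
    result = MODELS[version](payload["features"])
    latency_ms = (time.perf_counter() - start) * 1000.0
    return {"version": version, "prediction": result, "latency_ms": latency_ms}


class PredictHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Hypothetical routes: POST /predict/<version>, or POST /predict
        # to use the default version.
        parts = self.path.strip("/").split("/")
        if parts[0] != "predict":
            self.send_error(404, "unknown route")
            return
        version = parts[1] if len(parts) > 1 else DEFAULT_VERSION
        length = int(self.headers.get("Content-Length", 0))
        try:
            payload = json.loads(self.rfile.read(length))
            body = json.dumps(predict(payload, version)).encode("utf-8")
        except (KeyError, ValueError, TypeError, ZeroDivisionError) as exc:
            # Malformed JSON, a missing "features" key, an unknown version,
            # or an empty feature list are all reported as client errors.
            self.send_error(400, str(exc))
            return
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(host="127.0.0.1", port=8000):
    """Start the HTTP server; this call blocks until interrupted."""
    HTTPServer((host, port), PredictHandler).serve_forever()
```

After calling `serve()`, posting the JSON body `{"features": [1, 2, 3]}` to `/predict/v1` should return a response whose `prediction` field is 6, along with the version and the measured latency. A production setup would typically add what this sketch leaves out, such as load balancing across replicas, autoscaling, and exporting the latency figures to a monitoring system.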