Definition
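Inference is the process by which an already-trained AI model produces predictions or responses from new input data. Training is typically done once or only occasionally, and it is expensive; inference runs every time the model is used. For a deployed application, ongoing operating costs therefore tend to be driven mainly by inference, because they grow with the number of requests. This is why optimizing inference speed and cost is especially important for small and medium-sized enterprises (SMEs).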
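The cost relationship described above can be illustrated with a little arithmetic. The following is a minimal sketch, assuming a simple model in which training is a one-time fixed cost and every request has the same inference cost. The function names and all figures are hypothetical and chosen only for illustration; real pricing varies by model and provider.

```python
def total_cost(training_cost, cost_per_request, requests):
    """One-time training cost plus a flat per-request inference cost."""
    return training_cost + cost_per_request * requests


def inference_share(training_cost, cost_per_request, requests):
    """Fraction of total spend attributable to inference."""
    inference_cost = cost_per_request * requests
    return inference_cost / total_cost(training_cost, cost_per_request, requests)


def break_even_requests(training_cost, cost_per_request):
    """Request count at which cumulative inference spend equals the training cost."""
    return training_cost / cost_per_request


# Hypothetical figures (assumptions, not measurements).
TRAINING_COST = 10_000      # one-time cost, in any currency unit
COST_PER_REQUEST = 0.01     # inference cost per request, same unit

for requests in (100_000, 1_000_000, 10_000_000):
    share = inference_share(TRAINING_COST, COST_PER_REQUEST, requests)
    print(f"{requests:>10,} requests: inference is {share:.0%} of total spend")
```

Under these assumptions, the share of spend going to inference keeps rising with usage, while the training cost stays fixed. A business that uses a pre-trained model through a hosted service pays no training cost at all, in which case inference accounts for essentially all of the model-related spend.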