The verified answer is D. Model deployment and inference. The question describes a trained model being made available to production applications through an API endpoint. That is deployment, and the runtime use of the model to answer queries is inference. AWS SageMaker documentation states that after training, you can get predictions, or inferences, from trained machine learning models, and SageMaker provides model deployment options to support ML inference needs. This directly matches the phrase “available to production applications through an API endpoint for runtime queries.”
Data preparation is incorrect because that phase happens before training. It includes collecting, cleaning, transforming, labeling, and preparing datasets so the model can learn from them. The question already says the model is trained, so the workflow has moved beyond data preparation.
Model training and tuning is incorrect because training and tuning are where the model learns patterns from data and hyperparameters may be adjusted to improve performance. The question is not describing learning, optimization, or retraining. It describes exposing the trained model to applications.
Model evaluation and validation is incorrect because evaluation checks whether the model meets quality, accuracy, fairness, safety, or business criteria before or during production use. The scenario does not describe testing the model; it describes making the model callable by production applications.
In the ML lifecycle, deploying a model means placing it into an environment where applications can send requests and receive predictions. Inference is the process of using the deployed model to generate outputs from new inputs. Because the company is exposing the trained model through an API endpoint for runtime queries, the activity is model deployment and inference.