The verified answer is C. Incorporating continuous human feedback across model development, training, and deployment phases and using performance evaluation to improve model accuracy. The question asks for human-in-the-loop improvement throughout the ML lifecycle, not a one-time review and not fully automated correction. AWS AI Practitioner guidance describes the foundation model lifecycle as including data selection, model selection, pre-training, fine-tuning, evaluation, deployment, and feedback. Because feedback is part of the lifecycle, human review should not be limited to the first training stage only.
AWS generative AI lifecycle guidance also emphasizes continuous improvement and feedback loops. It explains that operationalizing a generative AI application is not a one-time event, but an ongoing process driven by real-world usage, feedback systems, automated feedback loops, direct user feedback mechanisms, and strategic human-in-the-loop interventions. This supports option C because it combines human feedback across phases with performance evaluation to improve model behavior and accuracy over time.
Option A is incorrect because it removes human intervention. Automated testing is valuable, but it is not human-in-the-loop. Option B is incorrect because collecting human feedback only during initial training is too narrow. It ignores post-training evaluation, deployment monitoring, user feedback, drift, and iterative improvement. Option D is incorrect because it relies on automated reinforcement learning and predefined metrics without human oversight. That again contradicts the “human-in-the-loop” requirement.
The strongest clue is the phrase “throughout the ML lifecycle.” A correct human-in-the-loop strategy must include human review, human feedback, and evaluation during development, training, deployment, and ongoing improvement. Therefore, option C is the only answer that satisfies both parts of the requirement: continuous human involvement and lifecycle-based model improvement.