You have trained a classification model in Snowflake using Snowpark ML to predict customer churn. After deploying the model, you observe that the model performs well on the training data but poorly on new, unseen data'. You suspect overfitting. Which of the following strategies can be applied within Snowflake to detect and mitigate overfitting during model validation , considering the model is already deployed and receiving inference requests through a Snowflake UDF?
Correct Answer: A,C
Options A and C are correct because they describe strategies for detecting and mitigating overfitting during the model validation process using Snowflake's capabilities. AUPRC is a good performance metric to compare the training vs validation set results to catch overfitting, and regularization can be used to avoid overfitting. Option C directly incorporates cross-validation into the model training workflow within Snowflake, allowing for early detection and mitigation of overfitting through hyperparameter tuning and model selection. Option B is incorrect because it focuses on performance optimization, not overfitting. Option D describes an AIB testing or champion-challenger setup which could be a strategy to use to detect data drift over time, but not overfitting. E is only partially correct as it describes one way to detect data drift, but not overfitting.