The stakeholders of a machine learning model have confirmed that they understand the objective and purpose of the model, and ensured that the proposed model aligns with their business priorities. They have also selected a framework and a machine learning model that they will be using.
What should be the next step to progress along the machine learning workflow?
Correct Answer: A
Themachine learning (ML) workflowfollows a structured sequence of steps. Once stakeholders have agreed on theobjectives, business priorities, and the framework/model selection, the next logical step is to prepare and pre-process the databefore training the model.
* Data Preparationis crucial becausemachine learning models rely heavily on the quality of input data. Poor data can result in biased, inaccurate, or unreliable models.
* The process involvesdata acquisition, cleaning, transformation, augmentation, and feature engineering.
* Preparing the dataensures it is in the right format, free from errors, and representative of the problem domain, leading to better generalization in training.
* A (Tune the ML Algorithm):Hyperparameter tuning occursafter the model has been trainedand evaluated.
* C (Agree on Acceptance Criteria):Acceptance criteria should already have been defined in theinitial objective-setting phasebefore framework and model selection.
* D (Evaluate the Framework and Model):The selection of the framework and ML model has already been completed. The next step isdata preparation, not reevaluation.
* ISTQB CT-AI Syllabus (Section 3.2: ML Workflow - Data Preparation Phase)
* "Data preparation comprises data acquisition, pre-processing, and feature engineering.
Exploratory data analysis (EDA) may be performed alongside these activities".
* "The data used to train, tune, and test the model must be representative of the operational data that will be used by the model".
Why Other Options Are Incorrect:Supporting References from ISTQB Certified Tester AI Testing Study Guide:Conclusion:Since the model selection is complete, thenext step in the ML workflow is to prepare and pre-process the datato ensure it is ready for training and testing. Thus, thecorrect answer is B.