NCA-GENM exam practice question, shared by ExamDiscuss.com.
You are building a system that takes an image of a scene and a short audio clip as input and generates a descriptive text. You want to evaluate the system's performance. Which of the following evaluation metrics are MOST suitable for assessing both the accuracy and the coherence of the generated descriptions in relation to the input image and audio?
Correct Answer: E
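BLEU, CIDEr, and SPICE are all suited to evaluating image captioning and similar text-generation tasks. BLEU measures clipped n-gram precision of the generated text against one or more reference texts, with a penalty for overly short outputs. CIDEr is a consensus-based metric designed for image description: it compares TF-IDF-weighted n-grams, so n-grams shared with the human references for that input count for more, while n-grams that are common across the whole dataset count for less. SPICE evaluates semantic propositional content by parsing captions into scene graphs and comparing object, attribute, and relationship tuples. ROUGE is recall-oriented and comes from summarization evaluation, so it is useful but adds less than the other three here. Perplexity measures how well a language model predicts text, and WER measures transcription errors in speech recognition; neither checks whether a description matches the input. Inception Score and FID evaluate the quality of generated images, not text.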
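To make the n-gram overlap idea behind BLEU concrete, here is a minimal sketch in plain Python. It is a simplified illustration, not a reference implementation: the function names (`ngrams`, `modified_precision`, `brevity_penalty`, `bleu`) are hypothetical, tokenization is a plain whitespace split, no smoothing is applied, and the default `max_n=2` is chosen for short examples (standard BLEU uses up to 4-grams). For real evaluation, an established toolkit is the safer choice.

```python
from collections import Counter
import math


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: each candidate n-gram is credited at most
    as many times as it appears in any single reference."""
    cand_counts = Counter(ngrams(candidate, n))
    if not cand_counts:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def brevity_penalty(candidate, references):
    """Penalize candidates shorter than the closest reference length."""
    c = len(candidate)
    if c == 0:
        return 0.0
    # Closest reference length; ties go to the shorter reference.
    r = min((len(ref) for ref in references), key=lambda rl: (abs(rl - c), rl))
    return 1.0 if c > r else math.exp(1 - r / c)


def bleu(candidate, references, max_n=2):
    """Unsmoothed BLEU: geometric mean of clipped precisions times the
    brevity penalty. Returns 0.0 if any n-gram order has no match."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0
    log_avg = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty(candidate, references) * math.exp(log_avg)


if __name__ == "__main__":
    refs = ["a dog barks in the park".split(),
            "a dog is barking at the park".split()]
    hyp = "a dog barks at the park".split()
    print(round(bleu(hyp, refs), 3))
```

The clipping step is what stops a degenerate caption that repeats one frequent word from scoring well. CIDEr and SPICE build on the same reference-comparison idea but weight n-grams by TF-IDF or compare scene-graph tuples, respectively.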