Valid NCA-GENM Dumps shared by ExamDiscuss.com for Helping Passing NCA-GENM Exam! ExamDiscuss.com now offer the newest NCA-GENM exam dumps, the ExamDiscuss.com NCA-GENM exam questions have been updated and answers have been corrected get the newest ExamDiscuss.com NCA-GENM dumps with Test Engine here:
You are working with a multimodal model that combines text and video data for action recognition. The text data consists of descriptions of the actions, and the video data consists of sequences of frames. You want to fuse these modalities at a late fusion stage. Which of the following approaches BEST describes late fusion?
Correct Answer: D
Late fusion involves processing each modality separately to obtain feature representations and then combining these representations at a later stage, typically by concatenation or averaging, before making a final prediction. Averaging predictions (option B) is a specific type of late fusion. Concatenating raw pixel values and word embeddings (option A) is an example of early fusion. Training a single model with a shared embedding space (option C) is also closer to early or intermediate fusion. Attention mechanisms can be used in various fusion strategies but do not define late fusion specifically.