NCA-GENL Exam Dumps | In the context of transformer-based large language models, how does the use of layer normalization mitigate

<< Prev Question Next Question >>

Question 12/23

In the context of transformer-based large language models, how does the use of layer normalization mitigate the challenges associated with training deep neural networks?

A. It reduces the computational complexity by normalizing the input embeddings.

B. It stabilizes training by normalizing the inputs to each layer, reducing internal covariate shift.

C. It increases the model's capacity by adding additional parameters to each layer.

D. It replaces the attention mechanism to improve sequence processing efficiency.

Question 12/23

LEAVE A REPLY

Download PDF File