NCA-GENM Exam Dumps | You are tasked with deploying a generative AI model for image inpainting using Triton Inference Server.


Question 105/192

You are tasked with deploying a generative AI model for image inpainting using Triton Inference Server. The model requires significant GPU memory, and you want to maximize throughput. Which Triton configuration parameters would be MOST important to tune, and why?

A. 'instance_group' (setting count to the number of available GPUs) and (setting a high value to accumulate requests).

B. 'instance_group' (setting count to the number of available GPUs and kind to KIND_GPU) and (increasing it to the largest value that fits in GPU memory).

C. 'dynamic_batching' (enabling it and setting and 'model_warmup' (specifying dummy inputs to pre-load the model).

D. 'optimization' (setting strategy to TRT to enable TensorRT optimization) and 'input_shape' (specifying the exact input shape).

E. Both B and C.
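
For reference, the parameters named in the options are set in a model's config.pbtxt file. The following is a minimal sketch in Python that renders such a file as text; the helper `render_triton_config`, the model name `inpaint`, the input name `IMAGE`, its shape, and all numeric values are illustrative assumptions, not taken from the question. Note that in Triton's configuration, `count` inside `instance_group` is the number of model instances placed on each GPU, so it may need to stay small for a memory-hungry model.

```python
def render_triton_config(name, max_batch_size, instances_per_gpu,
                         preferred_batch_sizes, max_queue_delay_us):
    """Render an illustrative config.pbtxt as a string.

    This helper is hypothetical (it is not part of Triton); it only
    formats text using field names from Triton's model configuration.
    """
    preferred = ", ".join(str(b) for b in preferred_batch_sizes)
    return f"""name: "{name}"
max_batch_size: {max_batch_size}
instance_group [
  {{
    count: {instances_per_gpu}
    kind: KIND_GPU
  }}
]
dynamic_batching {{
  preferred_batch_size: [ {preferred} ]
  max_queue_delay_microseconds: {max_queue_delay_us}
}}
model_warmup [
  {{
    name: "zero_warmup"
    batch_size: 1
    inputs {{
      key: "IMAGE"
      value: {{
        data_type: TYPE_FP32
        dims: [ 3, 512, 512 ]
        zero_data: true
      }}
    }}
  }}
]
"""


if __name__ == "__main__":
    # Assumed values: tune max_batch_size to what fits in GPU memory.
    print(render_triton_config("inpaint", 8, 1, [4, 8], 500))
```

Here `max_batch_size` caps the batch the server will form, `dynamic_batching` lets Triton combine queued requests up to that cap, and `model_warmup` sends a dummy request at load time so the first real request does not pay initialization cost.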

