Your AI team is deploying a large-scale inference service that must process real-time data 24/7. Given the high availability requirements and the need to minimize energy consumption, which approach would best balance these objectives?
A. Implement an auto-scaling group of GPUs that adjusts the number of active GPUs based on workload demand
B. Run a fixed GPU cluster at 50% capacity at all times
C. Process data in batches during off-peak hours only
D. Run a single GPU at full capacity continuously
Correct Answer: A
Implementing an auto-scaling group of GPUs (A) adjusts the number of active GPUs dynamically based on workload demand, balancing high availability and energy efficiency. This approach, supported by the NVIDIA GPU Operator in Kubernetes or cloud platforms like AWS/GCP with NVIDIA GPUs, sustains 24/7 real-time processing by scaling up during peak loads and scaling down during low demand, reducing idle power consumption. NVIDIA's power management features further optimize energy use per active GPU.
* Fixed GPU cluster at 50% capacity (B) wastes resources during low demand and may fall short during peaks, compromising availability.
* Batch processing off-peak (C) sacrifices real-time capability, making it unfit for 24/7 requirements.
* Single GPU at full capacity (D) risks overload, lacks redundancy, and consumes maximum power continuously.
Auto-scaling aligns with NVIDIA's recommended practices for efficient, high-availability inference (A).
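As a rough illustration of the scaling logic described above, the sketch below is a minimal, hypothetical Python example (not an NVIDIA or Kubernetes API); the bounds and target utilization are assumed values. It computes a desired GPU replica count from observed utilization, mirroring the proportional rule used by autoscalers such as the Kubernetes Horizontal Pod Autoscaler.

```python
import math

# Hypothetical bounds for the auto-scaling group (assumed values, not from the question).
MIN_GPUS = 2              # keep at least two replicas for high availability
MAX_GPUS = 16             # cap peak power draw and cost
TARGET_UTILIZATION = 0.6  # aim for ~60% average GPU utilization per replica

def desired_gpu_count(current_gpus: int, avg_utilization: float) -> int:
    """Proportional scaling rule: resize the group so average utilization
    moves toward the target (same idea as the Kubernetes HPA formula)."""
    if current_gpus <= 0:
        return MIN_GPUS
    desired = math.ceil(current_gpus * (avg_utilization / TARGET_UTILIZATION))
    # Clamp to the availability floor and the energy/cost ceiling.
    return max(MIN_GPUS, min(MAX_GPUS, desired))

if __name__ == "__main__":
    # 4 GPUs at 90% utilization -> scale up; 4 GPUs at 30% -> scale down.
    print(desired_gpu_count(4, 0.90))  # 6
    print(desired_gpu_count(4, 0.30))  # 2
```

Keeping a minimum of two replicas preserves redundancy during quiet periods, while the upper bound limits worst-case energy consumption at peak load.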