GES-C01 Exam Dumps | A development team is constructing a Gen AI application using Snowflake Cortex LLM functions, particularly

<< Prev Question Next Question >>

Question 81/171

A development team is constructing a Gen AI application using Snowflake Cortex LLM functions, particularly for conversational and text generation tasks. They are concerned about potential high costs due to token consumption. Which of the following strategies would most effectively help minimize token usage and optimize costs when working with these Cortex LLM functions?

A. For multi-turn conversational experiences using SNOWFLAKE .CORTEX.COMPLETE, only send the most recent user prompt in each API call, as the model automatically retains previous context.

B. When employing AI_COMPLETE for structured output tasks, providing concise and highly descriptive explanations for each field within the JSON schema will reduce the input tokens required for the LLM to understand and adhere to the schema accurately.

C. Utilize the COUNT_TOKENS (SNOWFLAKE .CORTEX) helper function to pre-validate the prompt length against the model's context window, thereby preventing truncation errors and subsequent re-runs.

D. To encourage more succinct LLM responses and reduce completion_tokens, configure the temperature option to a higher value (e.g., 0.7) in COMPLETE function calls.

E. In multi-turn conversations within Cortex Analyst, integrate a dedicated LLM summarization agent to rephrase follow-up questions, which reduces the total conversational history passed as context to the main LLM.

Question 81/171

LEAVE A REPLY

Download PDF File