DSA-C03 Exam Dumps | You are developing a real-time fraud detection system using Snowpark and deploying it as a Streamlit

<< Prev Question Next Question >>

Question 19/143

You are developing a real-time fraud detection system using Snowpark and deploying it as a Streamlit application connected to Snowflake. The system ingests transaction data continuously and applies a pre-trained machine learning model (stored as a binary file in Snowflake's internal stage) to score each transaction for fraud. You need to ensure the model loading process is efficient, and you're aiming to optimize performance by only loading the model once when the application starts, not for every single transaction. Which combination of approaches will BEST achieve this in a reliable and efficient manner, considering the Streamlit application's lifecycle and potential concurrency issues?

A. Use the 'st.cache_data' decorator in Streamlit to cache the loaded model and Snowpark session. Load the model directly from the stage within the cached function. This approach handles concurrency and ensures the model is only loaded once per session.

B. Load the model outside of the Streamlit application's execution context (e.g., in a separate script) and store it in a global variable. Access this global variable within the Streamlit application. This approach requires careful handling of concurrency.

C. Leverage the 'snowflake.snowpark.Session.read_file' to load the model binary directly into a Snowpark DataFrame and then convert to a Pandas DataFrame. Then, use the 'pickle' library for deserialization.

D. Use Python's built-in 'threading.Lock' to serialize access to the model loading code and the Snowpark session, preventing concurrent access from multiple Streamlit user sessions. Store the loaded model in a module-level variable.

E. Load the model within a try-except block and set the Snowpark session as a singleton that will guarantee model loads once for the entire application.

Question 19/143

LEAVE A REPLY

Download PDF File