You are designing a data ingestion system on Google Cloud that is expected to handle high volumes of streaming data. The system must provide reliable and accurate processing of the data. What should be your primary strategy?
The latest changes and updates from the administration for this exam.
Latest Update: Jun 15 2026
All questions are working fine.
You are designing a data ingestion system on Google Cloud that is expected to handle high volumes of streaming data. The system must provide reliable and accurate processing of the data. What should be your primary strategy?
You are a data engineer in a healthcare company that uses Google Cloud Platform. Your team has developed a machine learning model to predict patient outcomes. To ensure compliance with healthcare regulations, the model should only be accessible by certain team members. Which of the following would be the best way to control access to the model?
For a critical application your company uses Cloud SQL with MySQL for storing relational data. You have one instance in the zone that is closest to users. Your company's management is concerned about a single point of failure - when that instance becomes unavailable, your application will crash. What can you do to reduce this risk?
A web application generates thumbnails based on the uploaded photo by the user. The frontend application uploads photos to Cloud Storage. The backend is quite obsolete and runs a cron job that checks Cloud Storage buckets every 15 minutes for new photos. You want to improve this application and process the photos as soon as possible. Which Google Cloud service should you use?
You're designing a data pipeline for an IoT company that collects sensor data from devices worldwide. The data should be processed in real-time for anomaly detection and stored for historical analysis. Given the variability in data volume and the need for low-latency processing, what set of Google Cloud products would you recommend?
Your company has asked you to troubleshoot a machine learning model that has been underperforming. Upon reviewing the input data for the model, you notice that there are several assumptions about the data that have been violated. Which of the following is the most likely source of error?
As a data engineer, you must propose a solution for the following case. A development team wants to build an application that stores images in a Cloud Storage bucket and wants to compress those images for future use in machine learning models. They want to use a Google-managed service that can automatically scale up and down to zero with minimal effort. Which Google Cloud service should you recommend to do this?
An e-commerce company uses Apache Kafka for ingesting data and MongoDB for storage in an on-premises data center. They want to migrate to Google Cloud. Which services would you recommend as alternatives in GCP?
As a data engineer for a multinational organization, you are tasked with designing a hybrid cloud solution. The company has sensitive data stored on-premises that cannot be moved to the cloud due to regulatory constraints. However, they also want to take advantage of the scalability and powerful analytics capabilities of Google Cloud. The solution should enable real-time analytics on data both in the on-premises data center and in Google Cloud. Which of the following solutions would you recommend?
As a data engineer, you have built a real-time analytics pipeline using Pub/Sub for data ingestion, Dataflow for processing, and BigQuery for analysis. The system must have high reliability and fidelity and be capable of recovering from failures. What approach should you take for data recovery and fault tolerance in this scenario?