Google GOOGLE-PRODATAENG Practice Questions

The latest changes and updates from the administration for this exam.

verified

Latest Update: Jun 15 2026

All questions are working fine.

All Questions (420)bookmarkBookmarked (0)

All420Show all accessible questions in this exam.Missed420Questions you haven't attempted yet.Incorrect0Questions you answered incorrectly on your last attempt.

Question 61

You work for a mobile game developer and are responsible for designing a new in-game item trading system database. A user may purchase an item from the catalog or from other user. Users may have different items and can search for other users who have particular item. Future versions of the game may also contain new items. You want to use flexible schema data model. What storage solution would you recommend?

OLAP solution such as BigQuery.

Wide-column database such as Bigtable

Document database such as Datastore or MongoDB.

OLTP solution such as MySQL or PostgreSQL .

Question 62

As a data engineer, you have a private repository on GitHub where you push code related to your Google Cloud project. Recently, you accidentally included a service account private JSON key file in a commit and pushed it to the repository. What steps should you take to mitigate potential security risks?

Invalidate the exposed service account key and create a new key for the service account.

Delete the GitHub repository and create a new one.

Delete the key file from your local system and push another commit.

Change the visibility of your GitHub repository from private to public.

Question 63

You are developing a deep learning model that requires a large amount of computational resources during the training phase. The model will be deployed in a production environment and will be used to make predictions on a continuous basis. What is the most appropriate infrastructure to use for both training and serving the model?

A single, powerful GPU machine for both training and serving.

A cluster of GPU machines for training and a single CPU machine for serving.

A cluster of CPU machines for training and a single GPU machine for serving.

A cluster of GPU machines for both training and serving.

Question 64

During the yearly audit in your company, it is necessary to provide external auditors with access to the audit logs for the past five years. What actions can you take to reduce both cost and operational overhead? (select 3)

Select all that apply

Configure a lifecycle management policy on the logs bucket to delete objects older than 5 years.

Export audit logs to Cloud Storage.

Export audit logs to Cloud Filestore.

Export audit logs to BigQuery

Grant external auditors the role of Storage Object Viewer on the logs storage bucket.

Question 65

A financial services company is using machine learning models to detect fraudulent transactions. The models are currently running on standard CPU servers and the processing time is slowing down the overall system performance. What hardware accelerator should the company use to speed up the processing time of the machine learning models and improve system performance?

FPGA (Field-Programmable Gate Array)

TPU (Tensor Processing Unit)

GPU (Graphics Processing Unit)

ASIC (Application-Specific Integrated Circuit)

Question 66

You're designing a pipeline to ingest a vast volume of structured log data into BigQuery for analysis. The log data is generated by multiple services running in Compute Engine and is currently stored in Cloud Storage. You need a solution that can handle the large volume, ensures the data's availability immediately for querying, and minimizes cost. Which method should you use to ingest this data into BigQuery?

Use bq load command-line tool to load data from Cloud Storage to BigQuery.

Use Cloud Dataflow to stream data from Cloud Storage to BigQuery.

Use BigQuery Data Transfer Service to schedule daily transfers from Cloud Storage to BigQuery.

Use Cloud Storage Transfer Service to move data from Cloud Storage to BigQuery.

Question 67

You are a data engineer in a financial firm, tasked with developing a system that can generate credit score predictions. The system should be able to make real-time predictions for interactive online applications and also be able to process large batches of data for generating reports. The predictions must be made using a custom model developed in-house using Tensorflow. Which architecture should you use to satisfy these requirements in Google Cloud?

Use Cloud Run for online predictions and AI Platform Predictions for batch predictions.

Use AI Platform Predictions for online predictions and Cloud Dataproc for batch predictions.

Use AI Platform Predictions for both online and batch predictions.

Use Cloud Run for online predictions and Cloud Dataflow for batch predictions.

Question 68

You are designing a data processing solution for a company that uses Google Cloud. The company plans to run batch processing jobs on a 500 TB dataset stored in Google Cloud Storage. The jobs will perform complex queries and transformations on the data. You've been asked to design a solution that minimizes cost but ensures job completion within a 24-hour window. Which of the following would you recommend?

Use Cloud Bigtable with high-performance SSD storage and use Cloud Functions for processing.

Use BigQuery with on-demand pricing for query processing.

Use Cloud Dataflow with autoscaling and utilize FlexRS (Flexible Resource Scheduling) to run the jobs during off-peak hours.

Use Cloud Dataproc with preemptible VMs and high-memory machine types to speed up processing.

Question 69

You have a multi-terabyte data set stored in a Google Cloud Storage bucket that contains information about user transactions. Your manager has asked you to create a solution that will enable near real-time analysis of this data. The solution should also be able to handle spikes in traffic and scale automatically to handle the increased load. Which of the following solutions is the best for this use case?

Using a Cloud Pub/Sub topic to stream the data into BigQuery in near real-time.

Using a Cloud Functions trigger to run a Python script that inserts the data into BigQuery every time a new transaction is added to the Cloud Storage bucket.

Using a Cloud Dataproc cluster to run Apache Spark jobs that process the data and insert it into BigQuery in batch intervals.

Using a Cloud Dataflow pipeline to process the data and insert it into BigQuery in batch intervals.

Question 70

You are tasked with designing a robust job automation and orchestration process for a complex data pipeline in your organization. The data pipeline needs to ingest data from various sources, perform ETL operations, and push the processed data to Google BigQuery for analysis. It also has dependencies between tasks and requires error handling and alerting in case of failures. Which Google Cloud service would best suit this requirement?

Google Kubernetes Engine (GKE)

Cloud Functions

Cloud Composer

Cloud Pub/Sub

Update History

Update History