Snowflake · DEA-C02
Validates advanced expertise in designing, building, optimizing, and managing complex data pipelines within the Snowflake platform. Covers data movement, transformation, performance optimization, storage, and data governance at a production-level depth.
Questions
597
Duration
115 minutes
Passing Score
750/1000
Difficulty
ProfessionalLast Updated
Jun 2026
Use this DEA-C02 practice exam to prepare for SnowPro Advanced: Data Engineer (DEA-C02) with realistic questions, detailed explanations, and focused study modes. The practice bank includes 597 questions for Snowflake DEA-C02, so you can review the exam steadily instead of relying on one long cram session.
As you practice, pay extra attention to recurring topics such as Data Movement, Data Transformation, Performance Optimization, Storage & Data Protection, and Data Governance. Start with short sessions to identify weak areas, then move into timed quizzes once your accuracy is consistent.
The explanations are especially useful when you want to connect exam wording to the responsibilities and scenarios described in the official certification guidance. Use the free preview first, then unlock the full question bank when you are ready to build a complete study routine.
The SnowPro Advanced: Data Engineer (DEA-C02) is Snowflake's role-based advanced certification that validates production-level expertise in designing, building, optimizing, and managing complex data engineering workflows on the Snowflake platform. Released on February 18, 2025 as the successor to DEA-C01, this exam tests a candidate's ability to source data from data lakes, APIs, and on-premises systems; build near real-time streaming pipelines; execute ELT/ETL transformations at scale; and apply advanced performance tuning across Snowflake's compute and storage architecture. The 2025 revision places greater emphasis on streaming technologies, automation, data observability, and multi-cloud interoperability compared to its predecessor.
The certification spans five weighted content domains: Data Movement (26%), Data Transformation (25%), Performance Optimization (21%), Storage and Data Protection (14%), and Data Governance (14%). It covers advanced Snowflake-native capabilities including Snowpark for Python/Java/Scala-based transformations, User-Defined Functions (UDFs) and UDTFs, continuous data pipelines using Streams and Tasks, dynamic data masking, row-level security policies, and Snowflake's data sharing and replication features. Achieving this credential demonstrates that a practitioner can operate confidently in enterprise-grade production environments.
This certification is designed for experienced data engineers who work with Snowflake in production environments and wish to formally validate their advanced expertise. The ideal candidate has at least two years of hands-on Snowflake data engineering experience and is comfortable designing end-to-end pipelines, troubleshooting performance bottlenecks, and implementing governance frameworks at scale.
Relevant job titles include Data Engineer, Senior Data Engineer, Cloud Data Engineer, Data Platform Engineer, and Cloud Data Architect. Professionals working in data-intensive industries such as financial services, retail, healthcare, media, and technology who are responsible for building and maintaining Snowflake-based data infrastructure will find this credential directly applicable to their day-to-day work.
Snowflake does not list any hard prerequisites that must be completed before registering for DEA-C02, but strongly recommends holding an active SnowPro Core certification (COF-C02) as a foundational baseline. The SnowPro Core validates general Snowflake platform knowledge, and candidates who lack it will likely find the advanced-level content significantly more challenging.
Beyond the Core certification, Snowflake recommends a minimum of two years of hands-on, production-level experience in a data engineering role using Snowflake. Candidates should be proficient in SQL and at least one of Python, Java, or Scala for Snowpark-based transformations, and should have practical experience with cloud platforms (AWS, Azure, or GCP), data ingestion tools such as Snowpipe, and Snowflake's native pipeline constructs including Streams, Tasks, and Dynamic Tables.
The DEA-C02 exam consists of 65 scored questions delivered in 115 minutes, yielding a time allocation of approximately 1 minute 46 seconds per question. Question types include multiple choice (single correct answer), multiple select (two or more correct answers), and interactive question types such as drag-and-drop and matching scenarios that simulate real-world data engineering decisions. There is no announced beta or unscored survey question pool; Snowflake states that results are available immediately upon exam completion.
The exam is delivered online via remote proctoring through Pearson VUE, or at an authorized Pearson VUE testing center. The cost is $375 USD per attempt (approximately $300 USD in India with a regional discount). Scoring uses a scaled score from 100 to 1000, with a passing threshold of 750. Candidates who do not pass must wait a minimum of seven days before retaking the exam. The certification is valid for two years, after which holders can renew via the dedicated recertification exam (DEA-R02).
Earning the DEA-C02 credential signals to employers that a candidate can independently own and operate complex, production-scale Snowflake data platforms — a skill set in high demand as enterprises increasingly consolidate their data infrastructure on cloud-native platforms. Certified Snowflake data engineers in the United States report average annual salaries of approximately $125,000, with the range spanning roughly $101,000 at the 25th percentile to $156,000 at the 75th percentile, and senior roles reaching $190,000 or more. Industry analyses suggest that Snowflake-specific certification correlates with 20–30% higher compensation compared to equivalent roles without platform credentials.
The DEA-C02 positions holders for senior individual contributor and lead engineer roles across finance, retail, healthcare, and technology sectors where Snowflake is a primary data platform. Compared to broader cloud certifications such as AWS Certified Data Engineer or Google Professional Data Engineer, the SnowPro Advanced: Data Engineer is narrower in scope but commands a premium in organizations already standardized on Snowflake. It complements rather than competes with cloud-provider certifications and is frequently listed alongside them in job postings for Senior Data Engineer, Cloud Data Architect, and Data Platform Lead roles.
5 sample questions with answers and explanations. Start a practice session to test yourself across all 597 questions.
Preview — answers shown1. Northwind Traders is ingesting data from multiple file formats into a Snowflake table named PRODUCT_CATALOG. The engineering team uses MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE in their COPY INTO statement to automatically map source file columns to the target table columns. This approach works successfully for JSON and Parquet sources. A new pipeline must load CSV files using the same COPY command template. What will happen when the COPY command with MATCH_BY_COLUMN_NAME is executed against the CSV files? (Select one!)
Explanation
MATCH_BY_COLUMN_NAME is supported only for semi-structured file formats: JSON, Avro, ORC, and Parquet. It is explicitly NOT supported for CSV or TSV files. CSV files do not embed column names within the data payload itself — they rely on positional column ordering — so Snowflake cannot perform name-based column matching for this format. The COPY command will return an error when MATCH_BY_COLUMN_NAME is specified with a CSV file format. Teams loading CSV files must explicitly define column mapping using a column list or a SELECT expression in the COPY statement. JSON and Parquet formats embed field names directly in the file structure, which is what enables MATCH_BY_COLUMN_NAME to function correctly for those formats.
2. A data engineering team at Adatum is migrating their existing stream and task pipeline to Dynamic Tables for near-real-time reporting. Their current pipeline uses CURRENT_TIMESTAMP() to timestamp records and reads from a stream created on the source table. They attempt to configure a Dynamic Table with TARGET_LAG = '45 seconds'. Which two statements accurately describe limitations they will encounter with this migration? (Select two!)
Multiple correct answersExplanation
Dynamic Tables have two specific limitations relevant to this migration. First, Dynamic Tables cannot reference streams directly — the defining query must read from base tables, and Snowflake manages the incremental change detection automatically. This is a fundamental architectural difference from the stream-plus-task pattern. Second, the minimum TARGET_LAG for Dynamic Tables is 60 seconds; configuring a lag below this threshold is not supported. Regarding the other options: CURRENT_TIMESTAMP() and other non-deterministic functions prevent incremental refresh because Snowflake cannot determine which rows changed, forcing a full table scan instead — this is a limitation, not a valid option. Dynamic Tables are declarative and do not require manual CDC logic; Snowflake handles incremental refresh automatically. Stored procedures cannot be used within Dynamic Table definitions, making that option also incorrect.
3. A data governance team at Tailspin Toys needs to implement an enterprise-wide data sensitivity classification system in Snowflake. The team wants to label tables and columns containing personally identifiable information with sensitivity levels such as CONFIDENTIAL and HIGHLY_SENSITIVE. The labels must be queryable through Snowflake Account Usage views to generate automated compliance reports. The solution must not modify underlying data values or alter existing access policies. Which Snowflake feature should the team implement to satisfy these requirements? (Select one!)
Explanation
Object tags in Snowflake allow governance teams to attach key-value metadata labels directly to database objects including tables, columns, schemas, and databases. Tags can be defined with an enumerated set of allowed values such as CONFIDENTIAL and HIGHLY_SENSITIVE, enforcing consistent labeling across the organization. Tag assignments are queryable through SNOWFLAKE.ACCOUNT_USAGE.TAG_REFERENCES and information schema views, enabling automated compliance reporting without modifying underlying data or altering existing access policies. Row access policies control which rows a querying role can see based on conditional logic, but they are enforcement mechanisms and do not provide a metadata labeling or reporting framework. Dynamic data masking policies obscure column values at query time for data protection purposes but are not designed to serve as a metadata classification and compliance reporting system. Column-level privilege grants restrict which roles can query specific columns, functioning as an access control mechanism rather than a labeling or governance classification framework.
4. A data engineering team at Fabrikam is building a real-time IoT telemetry pipeline to ingest sensor readings from 10,000 devices. The ingestion application uses the Snowflake Snowpipe Streaming SDK to write individual row-level events directly to a Snowflake table. The team needs to ensure exactly-once delivery semantics and must understand which Snowpipe Streaming component maintains the position state that makes this possible. Which component is responsible for maintaining the offset token that enables exactly-once delivery in Snowpipe Streaming? (Select one!)
Explanation
In Snowpipe Streaming, a channel is the logical stream of rows directed to a specific table. Each channel maintains its own offset token, which tracks the position in the data stream and enables exactly-once delivery semantics for row-level ingestion. The migration process is a background optimization that reorganizes streaming data from temporary micro-partitions into fully optimized storage but plays no role in delivery guarantees. The pipe definition is a Snowpipe concept for file-based continuous loading and is not applicable to Snowpipe Streaming. Notification integrations are used with classic Snowpipe for cloud storage event triggers such as SQS or Azure Event Grid — they are not part of the Snowpipe Streaming architecture.
5. A data governance architect at Litware Inc. manages a Snowflake environment with hundreds of tables containing PII columns. Applying masking policies column-by-column is unsustainable at scale. The team wants a centralized model where simply tagging a column automatically enforces the appropriate masking policy with no additional per-column policy assignment required. Which Snowflake capability enables this approach? (Select one!)
Explanation
Snowflake supports tag-based masking policies, where a masking policy is associated with a tag using ALTER TAG ... SET MASKING POLICY. Once configured, any column that receives that tag automatically inherits and enforces the associated masking policy without requiring a separate ALTER TABLE ... ALTER COLUMN ... SET MASKING POLICY statement for each column. This centralizes governance so teams can scale data masking across hundreds of columns simply by applying tags. Masking policies cannot be applied at the schema level with automatic inheritance to all columns — schema-level cascading masking is not a supported Snowflake feature. Row access policies control row-level visibility and cannot substitute for column-level masking, which hides specific field values rather than entire rows. Automatic Data Classification identifies sensitive data categories and assigns Snowflake system tags to columns, but it does not automatically assign or enforce masking policies — a separate governance step connecting classification tags to masking policies is still required after classification.
$7.99
One-time access to this exam