AWS • MLA-C01
Validates ability to build, operationalize, deploy, and maintain machine learning solutions and pipelines using AWS Cloud services.
Questions
582
Duration
130 minutes
Passing Score
720/1000
Difficulty
AssociateLast Updated
Jan 2026
The AWS Certified Machine Learning Engineer – Associate (MLA-C01) validates a candidate's ability to build, operationalize, deploy, and maintain machine learning solutions and pipelines on AWS. Launched in October 2024, this role-based certification fills a critical gap in the AWS certification portfolio, targeting the engineer who bridges the gap between data science prototypes and production-ready ML systems. The exam tests practical knowledge of the full ML engineering lifecycle: data ingestion and preparation, model training and tuning, deployment and orchestration, and ongoing monitoring and security.
The certification covers a broad set of AWS services centered on Amazon SageMaker, alongside data storage and processing services, CI/CD and orchestration tools, monitoring and logging platforms, and security controls. Notably, the exam does not test deep domain expertise in NLP or computer vision, nor does it cover end-to-end ML solution architecture—those concerns fall to the AWS Certified Machine Learning – Specialty exam. MLA-C01 is specifically scoped to the operational and engineering tasks an ML engineer performs day-to-day in a cloud environment.
This certification is designed for ML engineers, MLOps engineers, DevOps engineers, data engineers, and backend software developers who work with machine learning systems on AWS. The ideal candidate has at least one year of hands-on experience with Amazon SageMaker and related AWS ML services, combined with at least one year of experience in a related engineering role. Data scientists looking to strengthen their deployment and operationalization skills will also benefit.
Candidates should be comfortable with software engineering best practices such as modular code design, debugging, and deployment, as well as CI/CD pipelines, Infrastructure as Code (IaC), and version control. This is not an entry-level credential—it assumes working knowledge of ML concepts, data engineering fundamentals, and cloud infrastructure provisioning.
There are no mandatory prerequisites to sit for the MLA-C01 exam; AWS does not require any prior certification. However, AWS recommends at least one year of hands-on experience using Amazon SageMaker and other AWS ML engineering services, as well as one year of experience in a related role such as backend development, DevOps, or data engineering.
Recommended foundational knowledge includes: common ML algorithms and their use cases, data engineering concepts (formats, ingestion pipelines, transformation), data querying and transformation skills, CI/CD pipeline design and orchestration, cloud resource provisioning and monitoring, and AWS security fundamentals including identity management and encryption. Candidates new to AWS may benefit from first earning the AWS Certified Cloud Practitioner or AWS Certified AI Practitioner, though neither is required.
The MLA-C01 exam consists of 65 total questions—50 scored and 15 unscored. The unscored questions are used by AWS to evaluate potential future exam content and are not identified during the exam. The time limit is 130 minutes. The exam is delivered through Pearson VUE at a testing center or via online proctoring. It is available in English, Japanese, Korean, and Simplified Chinese. The exam fee is $150 USD.
Four question types are used: multiple choice (one correct answer out of four), multiple response (two or more correct answers out of five or more options, requiring all correct selections for credit), ordering (arrange 3–5 steps in the correct sequence), and matching (match 3–7 response pairs). Scores are reported on a scaled range of 100–1,000, with a passing score of 720. The exam uses a compensatory scoring model, meaning no per-domain minimum is required. Unanswered questions are scored as incorrect; there is no penalty for guessing. The certification is valid for three years.
The MLA-C01 certification targets some of the fastest-growing roles in technology. The World Economic Forum's Future of Jobs Report projects demand for AI and ML Specialists to grow by more than 80% by 2030, and the U.S. Bureau of Labor Statistics forecasts 34% growth for data scientists between 2024 and 2034. The certification validates skills directly applicable to ML Engineer, MLOps Engineer, and AI/ML Platform Engineer roles. While the credential launched in late 2024 and direct salary correlation data is still emerging, related benchmarks are strong: ZipRecruiter places AWS Machine Learning Engineers at an average of approximately $145,000–$146,000 annually, and Payscale data shows that ML engineers with AWS skills earn roughly $5,600 more than peers without them. AWS Certified Machine Learning – Specialty holders average $213,000 according to Skillsoft's 2024 IT Skills and Salary Report, indicating the premium that AWS ML credentials command.
Within the AWS certification ecosystem, MLA-C01 sits between the AWS Certified AI Practitioner (foundational) and the AWS Certified Machine Learning – Specialty (advanced), making it a natural stepping stone for engineers building a structured AWS ML career path. It complements the AWS Certified Data Engineer – Associate and AWS Certified Solutions Architect – Associate, and is increasingly listed as a preferred qualification in ML engineering job postings on major platforms.
5 sample questions with correct answers and explanations. Start a practice session to test yourself across all 582 questions.
1. A retail company has a stable order-processing application. They are now launching an ML-powered recommendation engine. The engineering team observes that while the order-processing logic rarely changes, the recommendation engine's accuracy declines over time as new products are added and customer preferences evolve. What is the key operational difference the team must plan for with the new ML workload?
Explanation
The correct answer is that the ML workload requires a continuous, iterative cycle. Unlike traditional software with fixed logic, an ML model's performance is directly tied to the data it was trained on. When real-world data changes (a phenomenon known as 'drift'), the model's predictions become less accurate. To combat this, the team must implement a lifecycle (often called MLOps) to constantly monitor for this degradation and retrain the model with fresh data to maintain its performance. Why the others are wrong: - The other options describe characteristics of a static, traditional application, which is the opposite of how a real-world ML system behaves. The scenario explicitly states that performance declines over time, proving that it cannot be a single deployment and is highly dependent on input data.
2. A key reliability practice for ML systems is to use version control for all components to enable rollbacks and auditing. This includes versioning the data, the source code, and the model artifacts. This practice directly enables which important ML design principle?
Explanation
Strict version control across all components is the foundation for enabling reproducibility. Reproducibility means you can reliably recreate a past result, such as a specific trained model. By versioning the data, code, and other artifacts, you can always check out a specific state of your project to debug an issue, audit a result, or rollback to a previous working version. Why the others are wrong: - The other options are also important design principles, but version control is most directly linked to being able to reproduce a specific state or result.
3. A data scientist is working on a complex project with dozens of different model training runs. They need a way to systematically log, track, and compare the hyperparameters, metrics, and artifacts associated with each run. Which SageMaker capability provides this functionality within the SageMaker Studio IDE?
Explanation
SageMaker Experiments is designed to solve this exact problem. It allows you to organize, track, compare, and evaluate your machine learning training runs. It automatically captures input parameters, configurations, and results, and stores them as 'experiments'. You can then easily query and compare different runs in SageMaker Studio to identify the best-performing models. Why the others are wrong: - Model Registry is for versioning and cataloging final model candidates for deployment, not for tracking all experimental runs. - Pipelines orchestrate workflows. - Projects are templates for setting up MLOps resources.
4. A team needs to provide access to a SageMaker notebook instance for a new data scientist. According to the principle of least privilege, which is the best way to grant the necessary permissions?
Explanation
Creating a new, specific IAM role is the correct approach. The principle of least privilege dictates that an entity should have only the permissions required to perform its task. By creating a role with a policy that only allows access to the specific S3 buckets and other resources the data scientist needs, you minimize the potential security risk. Why the others are wrong: - The other options are all examples of overly permissive and insecure practices.
5. A research team has a model that takes up to 10 minutes to process a single large medical image. They need an on-demand endpoint that can accept the large image file and process it without the client application timing out. Which SageMaker deployment option is designed for this use case?
Explanation
An Asynchronous Inference Endpoint is specifically designed for workloads with large payloads and long processing times. The client sends a request and immediately gets an acknowledgment with a location to poll for the result. SageMaker processes the request in the background and places the output in S3, avoiding client timeouts. This is ideal for the 10-minute processing time described. Why the others are wrong: - A Real-time Endpoint would time out after about 60 seconds. - Batch Transform is for processing entire datasets, not single on-demand requests. - A Multi-Model Endpoint is for hosting many models, but it still follows the real-time request/response pattern and would time out.
One-time access to this exam