DeloitteEngineering Blog | Netherlands
2 October 2023

AWS Cloud Practitioner Study Guide

Not so long ago, I studied for the AWS Cloud Practitioner exam and I realized that even though there are tons of preparation material out there, there are not so many revision sheets. I really do find it useful after I am done with my initial deep dive into the study material to create revision sheets that serve as study material for the days immediately prior to the exam.

It’s something as simple as:

And voilà!

However, if you are about to do the exam and you have not prepared a revision sheet, worry not, because this is what this blog is all about.

In this blog I cover the compute, storage, and databases parts of the cloud practitioner exam. I decided to differentiate databases from storage (even though databases ARE storage :D), because the study material is a lot for these services and it’s an easy way to separate them in two groups.

In general the blog aims to group services that are closely related or deal with similar functionalities. Some of these services are notionally connected and are easy to confuse during the exam and these notes will hopefully help clear some of the confusion out.

compute icon

Compute

In the compute section, we will focus on Amazon Elastic Compute Cloud (EC2) and its different types and feature. This section also includes a small peak into Amazon Machine Image (AMI).

Amazon Elastic Compute Cloud (EC2)

Definition: EC2 is scalable computing capacity in the AWS Cloud. It provides virtual computing environments, known as instances.

EC2 auto-scaling: it is a feature of EC2 that allows the adjustment of the number of EC2 instances (in or out) automatically, based on demand. It needs to be activated manually and it doesn’t automatically deploy AWS shield.

EC2 user data script: it is a way for a system administrator to specify a bootstrap script to be run on an EC2 instance during launch.

EC2 instance types:

  1. General purpose: balance of compute, memory, and networking resources
  2. Compute optimized: ideal for compute-bound applications that benefit from high-performance processors
  3. Memory optimized: designed to deliver fast performance for workloads that process large data sets in memory
  4. Storage optimized: designed for workloads that require high, sequential read and write access to very large data sets on local storage. Optimized to deliver low-latency, random I/O operations per second (IOPS) to applications
  5. Accelerated computing: used for floating point number calculations, graphics processing, or data pattern matching

EC2 instance purchasing options:

  1. On-demand: pay, by the second, for the instances that you launch
  2. Savings plans: commitment to a consistent amount of usage for a term of 1 or 3 years
  3. Reserved instances: commitment to a consistent instance configuration for a term of 1 or 3 years
  4. Spot instances: lets you request unused EC2 instances
  5. Dedicated hosts: workload runs on a specific, physical host (server) that is fully dedicated to running your instances
  6. Dedicated instances: pay, by the hour, for instances that run on single-tenant hardware
  7. Capacity reservations: reserve capacity for your EC2 instances in a specific AZ for any duration

Reserved instances with reserved instances the benefit is the reserved capacity and the reduced cost. The most cost-effective option to purchase an EC2 reserved instance is with a partial upfront payment option with a standard 3-year term.

Standard reserved instances: they provide the most significant discount. They can be modified, but can’t be exchanged.

Convertible reserved instances: they provide a lower discount than standard reserved instances. However, they also let you exchange one or more convertible reserved instances for another convertible reserved instance with a different configuration.

Spot instances: suitable for workloads that can be interrupted. Available at up to 90% discount compared to on-demand.

Dedicated hosts: suitable for existing server-bound software licenses. Allows an organization to bring their own licensing on host hardware that is physically isolated from other AWS accounts.

EC2 information relevant to the exam:

Amazon Machine Image (AMI)

Definition: a supported and maintained imaged provided by AWS that can be used to launch a preconfigured EC2 instance.

AMI information relevant to the exam: user must use an AMI from the same region as that of the EC2 instance. AMI has no bearing on the performance of the EC2 instance.

storage icon

Storage

In the storage section, we will explore different storage options.

Initially, we will focus on Amazon Simple Storage Service (S3), its different storage classes and functionalities. Then, we will focus on file storage options, like EFS and FSx. Further, we will look into the AWS Storage Gateway and the different gateway types it supports. Finally, we will zoom in the EC2 instance store and Elastic Block Store (EBS) and its features.

Amazon Simple Storage Service (S3)

Definition: object storage service offering scalability, data availability, security, and performance. It stores objects in buckets.

S3 information relevant to the exam:

Amazon S3 storage classes:

  1. S3 Standard: high durability, availability, and performance object storage for frequently accessed data. There is a per GB/month storage fee + data egress fee.
  2. S3 Standard IA (Infrequent Access): suitable for data that are accessed less frequently, but requires rapid access when needed. There is a minimum capacity charge per object + a retrieval fee.
  3. S3 One-Zone IA (Infrequent Access): it stores data in a single Availability Zone (other classes store them in three). It is the class with the lowest availability. It costs 20% less than the Standard IA class. There is a minimum capacity charge per object + retrieval fee.
  4. S3 Intelligent Tiering: it automatically moves data to the most cost-effective access tier based on access frequency.
  5. S3 Glacier Instant Retrieval: it delivers the lowest-cost storage for long-lived data that is rarely accessed and required retrieval in milliseconds. There is a retrieval fee.
  6. S3 Glacier Flexible Retrieval: up to 10% lower cost than S3 Glacier Instant Retrieval. Suitable for archive data that is accessed 1-2 times per year and is retrieved asynchronously. There is a retrieval fee.
  7. S3 Glacier Deep Archive: lowest-cost storage class for data that may be accessed 1-2 times per year.

Glacier data access options: two options provided, the standard one (3–5 hours) or the expedited one (1–5 mins, unless the archive is bigger than 250MB).

Object lifecycle management: feature of S3 that enables you to set rules to automatically transfer objects between different storage classes at defined time intervals.

Amazon S3 replication:

Amazon Elastic File System (EFS)

Definition: a serverless, fully elastic file storage service.

EFS information relevant to the exam:

Amazon FSx for Windows File Server: it provides fully managed Microsoft Windows file servers, backed by a fully native Windows file system. It is using the Server Message Block (SMB) protocol.

AWS Storage Gateway

Definition: a hybrid cloud storage service that gives you on-premises access to virtually unlimited cloud storage.

Gateway types supported by AWS Storage Gateway:

EC2 instance store

Definition: it provides temporary block-level storage for the EC2 instance.

It is located on disks that are physically attached to the host computer. It is good for low-latency, fault-tolerant, architectures (data not persisting for long) and fast I/O performance.

Elastic Block Store (EBS)

Definition: it provides block level storage volumes for use with EC2 instances.

EBS information relevant to the exam:

databases icon

Databases

In the databases section, we will cover the databases that are available on AWS, as well as some data processing services that show up along the databases on the Cloud Practitioner exam questions.

We cover relational databases services (RDS and Aurora), no-SQL databases (DynamoDB and DocumentDB), data warehouses (Redshift), and in-memory caches (Elasticache). We also cover a couple of data processing services (Elastic MapReduce and Athena).

Relational Database Service (RDS)

Definition: a collection of managed services that makes it simple to set up, operate, and scale relational databases in the cloud.

RDS information relevant to the exam:

RDS Automated backups:

Aurora

Definition: a relational database management system (RDBMS) built for the cloud with full MySQL and PostgreSQL compatibility.

Aurora information relevant to the exam:

DynamoDB

Definition: is a fully managed, serverless, key-value NoSQL database designed to run high-performance applications at any scale.

DynamoDB information relevant to the exam:

DocumentDB

Definition: is a fully managed native JSON document database that makes it easy and cost effective to operate critical document workloads at virtually any scale without managing infrastructure.

DocumentDB information relevant to the exam:

Redshift

Definition: uses SQL to analyze structured and semi-structured data across data warehouses, operational databases, and data lakes, using AWS-designed hardware and machine learning.

Redshift information relevant to the exam:

Elasticache

Definition: is a fully managed, Redis- and Memcached-compatible caching service delivering real-time, cost-optimized performance for modern applications.

Elasticache information relevant to the exam:

Amazon Elastic MapReduce (EMR)

Definition: cloud big data solution for petabyte-scale data processing, interactive analytics, and machine learning using open-source frameworks such as Apache Spark, Apache Hive, and Presto.

EMR information relevant to the exam:

Athena

Definition: is an interactive query service that makes it easy to analyze data directly in Amazon Simple Storage Service (Amazon S3) using standard SQL.

Athena information relevant to the exam:

Final words

The preparation for the Cloud Practitioner exam can be overwhelming, due to the huge amount of information introduced, especially for someone just starting with the AWS cloud. However, the actual exam itself is pretty doable if you have a good overview and understanding of the AWS landscape — no need to get into specific service deep-dives at this stage of your learning curve.

I do hope that this revision sheet helped to put some structure in the final preparation prior to the exam and I wish you good luck!

Related vacancies