Complete the code to specify the EMR cluster's release label.
emr_cluster = client.run_job_flow(ReleaseLabel='[1]')
The release label emr-6.7.0 specifies a recent stable EMR version with Apache Spark support.
Complete the code to set the instance type for the master node in the EMR cluster.
Instances={'MasterInstanceType': '[1]', 'InstanceCount': 3}The m5.xlarge instance type is a balanced choice for the master node in EMR clusters.
Fix the error in the bootstrap action configuration to install a custom package.
BootstrapActions=[{'Name': 'InstallPackages', 'ScriptBootstrapAction': {'Path': '[1]'}}]Bootstrap scripts must be stored in an S3 bucket accessible by EMR, so the path must start with s3://.
Fill both blanks to configure the EMR cluster to use Spot instances for core nodes with a bid price.
Instances={'CoreInstanceType': '[1]', 'CoreInstanceMarket': '[2]', 'CoreBidPrice': '0.10', 'InstanceCount': 5}Core nodes use m5.xlarge instances and the market type SPOT enables Spot instances for cost savings.
Fill all three blanks to define an EMR step that runs a Spark application stored in S3.
Steps=[{'Name': 'SparkApp', 'ActionOnFailure': '[1]', 'HadoopJarStep': {'Jar': '[2]', 'Args': ['spark-submit', '[3]']}}]The step continues on failure (CONTINUE), uses command-runner.jar to run commands, and runs the Spark app located at s3://mybucket/apps/my_spark_app.py.