Data Analysis Python · ~20 mins

Reading from SQL databases in Data Analysis Python - Practice Problems & Coding Challenges

Challenge - 5 Problems
Problem 1 · Predict Output (intermediate)
Output of SQL query read into pandas DataFrame
What is the output of the following Python code that reads data from a SQL database into a pandas DataFrame?
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
cursor = conn.cursor()
cursor.execute('CREATE TABLE sales (id INTEGER, amount INTEGER)')
cursor.executemany('INSERT INTO sales VALUES (?, ?)', [(1, 100), (2, 200), (3, 300)])
conn.commit()

query = 'SELECT id, amount FROM sales WHERE amount > 150'
df = pd.read_sql_query(query, conn)
print(df)
A)
Empty DataFrame
Columns: [id, amount]
Index: []

B)
   id  amount
0   2     200
1   3     300

C)
   id  amount
0   1     100
1   2     200
2   3     300

D)
   id  amount
0   1     100
💡 Hint
Look at the SQL WHERE clause filtering amounts greater than 150.
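After committing to an answer, you can check your prediction by running the problem verbatim; this sketch is the same setup as above, runnable as-is with only the standard library and pandas:

```python
# Same setup as the problem: in-memory SQLite table, then a filtered read.
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE sales (id INTEGER, amount INTEGER)')
conn.executemany('INSERT INTO sales VALUES (?, ?)', [(1, 100), (2, 200), (3, 300)])
conn.commit()

# Only rows satisfying the WHERE clause ever reach pandas.
df = pd.read_sql_query('SELECT id, amount FROM sales WHERE amount > 150', conn)
print(df)
conn.close()
```

Note that the filtering happens inside SQLite, not in pandas, and the DataFrame index restarts at 0 for the surviving rows.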
Problem 2 · Data Output (intermediate)
Number of rows returned by SQL query
How many rows will the DataFrame contain after running this code?
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
cursor = conn.cursor()
cursor.execute('CREATE TABLE employees (name TEXT, age INTEGER)')
cursor.executemany('INSERT INTO employees VALUES (?, ?)', [('Alice', 30), ('Bob', 25), ('Charlie', 35)])
conn.commit()

query = 'SELECT * FROM employees WHERE age < 30'
df = pd.read_sql_query(query, conn)
print(len(df))
A) 2
B) 0
C) 3
D) 1
💡 Hint
Check which employees have age less than 30.
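To verify your count, the problem's setup can be rerun verbatim; `len(df)` gives the row count, and `WHERE age < 30` is a strict inequality:

```python
# Rebuild the employees table and count rows matching the strict inequality.
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE employees (name TEXT, age INTEGER)')
conn.executemany('INSERT INTO employees VALUES (?, ?)',
                 [('Alice', 30), ('Bob', 25), ('Charlie', 35)])
conn.commit()

# age < 30 is strict: a row with age exactly 30 does not match.
df = pd.read_sql_query('SELECT * FROM employees WHERE age < 30', conn)
print(len(df))
conn.close()
```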
Problem 3 · 🔧 Debug (advanced)
Identify the error when reading SQL data
What error will this code produce when trying to read from the SQL database?
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
cursor = conn.cursor()
cursor.execute('CREATE TABLE products (id INTEGER, price REAL)')
cursor.execute('INSERT INTO products VALUES (1, 9.99)')
conn.commit()

query = 'SELECT id, price FROM product'
df = pd.read_sql_query(query, conn)
print(df)
A) sqlite3.OperationalError: no such table: product
B) KeyError: 'product'
C) TypeError: read_sql_query() missing 1 required positional argument
D)
Empty DataFrame
Columns: [id, price]
Index: []
💡 Hint
Check the table name in the SQL query versus the created table.
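A practical way to catch this class of bug before it raises is to query SQLite's built-in `sqlite_master` catalog for the table names that actually exist. One caveat: depending on your pandas version, the underlying `sqlite3.OperationalError` may surface wrapped in `pandas.errors.DatabaseError`, but the "no such table" message is preserved either way. A sketch:

```python
# List real table names from SQLite's catalog, then show what a typo'd
# table name produces.
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE products (id INTEGER, price REAL)')
conn.execute('INSERT INTO products VALUES (1, 9.99)')
conn.commit()

# sqlite_master is SQLite's built-in catalog of schema objects.
tables = pd.read_sql_query(
    "SELECT name FROM sqlite_master WHERE type = 'table'", conn)
print(tables['name'].tolist())

msg = ''
try:
    pd.read_sql_query('SELECT id, price FROM product', conn)  # typo'd table name
except Exception as exc:  # OperationalError, or a pandas DatabaseError wrapper
    msg = f'{type(exc).__name__}: {exc}'
print(msg)
conn.close()
```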
Problem 4 · Visualization (advanced)
Plotting data read from SQL
Which option shows the correct matplotlib bar plot code to visualize the total sales amount per product from the DataFrame read from SQL?
import sqlite3
import pandas as pd
import matplotlib.pyplot as plt

conn = sqlite3.connect(':memory:')
cursor = conn.cursor()
cursor.execute('CREATE TABLE sales (product TEXT, amount INTEGER)')
cursor.executemany('INSERT INTO sales VALUES (?, ?)', [('A', 10), ('B', 20), ('A', 15), ('B', 25)])
conn.commit()

query = 'SELECT product, amount FROM sales'
df = pd.read_sql_query(query, conn)
A)
df.groupby('product')['amount'].sum().plot(kind='bar')
plt.show()

B)
df.plot.bar(x='product', y='amount')
plt.show()

C)
plt.bar(df['product'], df['amount'])
plt.show()

D)
df.plot(kind='line', x='product', y='amount')
plt.show()
💡 Hint
You need to sum amounts per product before plotting a bar chart.
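The aggregation step can be checked without a display: group by product, sum, and inspect the Series that `.plot(kind='bar')` would draw (matplotlib is left out here so the sketch runs headless):

```python
# Compute the total amount per product; a bar plot would draw one bar
# per entry of this Series.
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE sales (product TEXT, amount INTEGER)')
conn.executemany('INSERT INTO sales VALUES (?, ?)',
                 [('A', 10), ('B', 20), ('A', 15), ('B', 25)])
conn.commit()

df = pd.read_sql_query('SELECT product, amount FROM sales', conn)
totals = df.groupby('product')['amount'].sum()  # collapses repeated products
print(totals)
conn.close()
```

Without the groupby, repeated products would produce one bar per row rather than one bar per product.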
Problem 5 · 🚀 Application (expert)
Efficiently reading large SQL tables in chunks
You want to process a large SQL table in parts to avoid memory issues. Which code snippet correctly reads the table in chunks of 1000 rows using pandas?
A)
df = pd.read_sql_query('SELECT * FROM big_table LIMIT 1000', conn)
process(df)

B)
chunks = pd.read_sql_query('SELECT * FROM big_table', conn)
for i in range(0, len(chunks), 1000):
    process(chunks[i:i+1000])

C)
for chunk in pd.read_sql_query('SELECT * FROM big_table', conn, chunksize=1000):
    process(chunk)

D)
while True:
    df = pd.read_sql_query('SELECT * FROM big_table', conn, chunksize=1000)
    process(df)
    break
💡 Hint
Look for the pandas function parameter that allows chunked reading.
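A minimal sketch of the chunked pattern, using an illustrative `big_table` of 2,500 rows: when `chunksize` is passed, `read_sql_query` returns an iterator of DataFrames instead of a single DataFrame, so only one chunk sits in memory at a time.

```python
# Build an illustrative 2,500-row table, then read it 1,000 rows at a time.
import sqlite3
import pandas as pd

conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE big_table (id INTEGER)')
conn.executemany('INSERT INTO big_table VALUES (?)', [(i,) for i in range(2500)])
conn.commit()

chunk_sizes = []
for chunk in pd.read_sql_query('SELECT * FROM big_table', conn, chunksize=1000):
    chunk_sizes.append(len(chunk))  # each chunk is an ordinary DataFrame
print(chunk_sizes)
conn.close()
```

The same pattern works with an on-disk database: the chunked form trades one large allocation for repeated small fetches, which is the point of the technique.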