Prompt Engineering / GenAIml~20 mins

Data extraction from text in Prompt Engineering / GenAI - Practice Problems & Coding Challenges

Choose your learning style10 modes available

Learn Why Deep Model Try Challenge Experiment Recall Metrics

Start learning this pattern below

Jump into concepts and practice - no test required

Recommended

Test this pattern10 questions across easy, medium, and hard to know if this pattern is strong

Challenge - 5 Problems

🎖️

Data Extraction Master

Get all challenges correct to earn this badge!

Test your skills under time pressure!

🧠 Conceptual

intermediate

1:30remaining

What is the main purpose of Named Entity Recognition (NER) in data extraction?

Named Entity Recognition (NER) is a common technique in data extraction from text. What does NER primarily do?

AIt identifies and classifies key information like names, dates, and locations in text.

BIt translates text from one language to another.

CIt summarizes long documents into short paragraphs.

DIt generates new text based on input prompts.

Attempts:

2 left

❓ Predict Output

intermediate

1:30remaining

Output of simple regex extraction code

What is the output of the following Python code that extracts all email addresses from a text?

Prompt Engineering / GenAI

import re
text = 'Contact us at support@example.com or sales@example.org.'
emails = re.findall(r'\b[\w.-]+@[\w.-]+\.\w+\b', text)
print(emails)

A['support@example.com', 'sales@example.org']

B['support@example.com sales@example.org']

C['support@example', 'sales@example']

D[]

Attempts:

2 left

❓ Model Choice

advanced

2:00remaining

Best model type for extracting structured data from unstructured text

You want to extract structured information like product names, prices, and dates from customer reviews. Which model type is best suited for this task?

AK-Means clustering for grouping similar texts

BGenerative Adversarial Network (GAN) for data generation

CConvolutional Neural Network (CNN) for image classification

DRecurrent Neural Network (RNN) or Transformer-based model for sequence labeling

Attempts:

2 left

❓ Metrics

advanced

2:00remaining

Evaluating extraction accuracy with precision and recall

You built a model to extract dates from text. On a test set, it found 80 dates, of which 60 were correct. The test set actually contains 100 dates. What are the precision and recall?

APrecision = 0.80, Recall = 0.60

BPrecision = 0.60, Recall = 0.75

CPrecision = 0.75, Recall = 0.60

DPrecision = 0.60, Recall = 0.80

Attempts:

2 left

🔧 Debug

expert

2:30remaining

Why does this extraction code raise an error?

Consider this Python code snippet for extracting phone numbers. Why does it raise an error?

Prompt Engineering / GenAI

import re
text = 'Call me at 123-456-7890 or 987-654-3210.'
pattern = r'\d{3}-\d{3}-\d{4}'
matches = re.match(pattern, text)
print(matches.group())

AThe regex pattern is invalid and causes a SyntaxError.

Bre.match only checks the start of the string, so it returns None causing an AttributeError.

CThe text variable is empty, so no matches are found.

Dmatches.group() is called before re.match is imported.

Attempts:

2 left

Practice

(1/5)

1. What is the main goal of data extraction from text in AI?

easy

A. To find and pull out useful information like names and dates from text

B. To translate text from one language to another

C. To generate new text based on a prompt

D. To compress text files to save space

Data extraction from text in Prompt Engineering / GenAI - Practice Problems & Coding Challenges

Start learning this pattern below

Practice

Solution

Step 1: Understand the purpose of data extraction

Step 2: Compare options to the definition

Final Answer:

Quick Check:

Solution

Step 1: Recall Python function call syntax

Step 2: Check each option

Final Answer:

Quick Check:

Solution

Step 1: Understand the function output format

Step 2: Match output to expected format

Final Answer:

Quick Check:

Solution

Step 1: Analyze the extraction logic

Step 2: Identify limitation

Final Answer:

Quick Check:

Solution

Step 1: Consider model choice for extraction

Step 2: Compare other options

Final Answer:

Quick Check: