Complete the code to import the library used for named entity recognition.
import [1]
The nltk library is commonly used for natural language processing tasks including named entity recognition.
Complete the code to tokenize the sentence for named entity recognition.
from nltk import word_tokenize

sentence = "Apple is looking at buying U.K. startup for $1 billion"
tokens = [1](sentence)
word_tokenize splits the sentence into words, which is the first step before tagging or chunking.
Fix the error in the code to perform named entity recognition on tokenized text.
from nltk import pos_tag, ne_chunk

tokens = ['Apple', 'is', 'looking', 'at', 'buying', 'U.K.', 'startup', 'for', '$', '1', 'billion']
pos_tags = [1](tokens)
named_entities = ne_chunk(pos_tags)
pos_tag is used to assign part-of-speech tags to tokens before named entity chunking.
Fill both blanks to extract named entities as a list of tuples (entity, type).
entities = []
for subtree in named_entities.[1]():
    if hasattr(subtree, '[2]'):
        entity_name = ' '.join([token for token, pos in subtree.leaves()])
        entity_type = subtree.label()
        entities.append((entity_name, entity_type))
We use subtrees() to iterate over the chunk tree and check for a label attribute to identify named entities. Note that subtrees() also yields the sentence-level root ('S'), which has a label as well, so practical code usually filters it out.
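A deterministic sketch of the extraction step, using a hand-built Tree that mimics ne_chunk's output for the sample sentence (the entity labels here are illustrative, not actual ne_chunk predictions). Every node yielded by subtrees() has a label, including the 'S' root, so this version skips the root explicitly:

```python
from nltk.tree import Tree

# Hypothetical chunked output: entity chunks are labelled subtrees,
# plain (token, POS) tuples are non-entity words
named_entities = Tree('S', [
    Tree('ORG', [('Apple', 'NNP')]),
    ('is', 'VBZ'), ('looking', 'VBG'), ('at', 'IN'), ('buying', 'VBG'),
    Tree('GPE', [('U.K.', 'NNP')]),
    ('startup', 'NN'), ('for', 'IN'), ('$', '$'), ('1', 'CD'), ('billion', 'CD'),
])

entities = []
for subtree in named_entities.subtrees():
    # subtrees() yields the 'S' root too, so filter on the label
    if subtree.label() != 'S':
        entity_name = ' '.join(token for token, pos in subtree.leaves())
        entities.append((entity_name, subtree.label()))

print(entities)  # [('Apple', 'ORG'), ('U.K.', 'GPE')]
```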
Fill all three blanks to train a simple NER model using spaCy.
import spacy
from spacy.training import Example

nlp = spacy.blank('en')
ner = nlp.add_pipe('[1]')

TRAIN_DATA = [
    ("Google is a tech company", {"entities": [(0, 6, '[2]')]}),
    ("I live in New York", {"entities": [(10, 18, '[3]')]})
]

optimizer = nlp.begin_training()
for itn in range(10):
    for text, annotations in TRAIN_DATA:
        doc = nlp.make_doc(text)
        example = Example.from_dict(doc, annotations)
        nlp.update([example], sgd=optimizer)
We add the 'ner' pipe to a blank English model and label the entities as ORG (organization) and GPE (geopolitical entity).
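A complete, runnable version of the training loop, assuming spaCy v3. It registers the ORG and GPE labels before initialization and uses nlp.initialize(), the v3 replacement for the deprecated begin_training(); the final lines show how to run the trained pipeline on new text:

```python
import spacy
from spacy.training import Example

nlp = spacy.blank('en')
ner = nlp.add_pipe('ner')

TRAIN_DATA = [
    ("Google is a tech company", {"entities": [(0, 6, 'ORG')]}),
    ("I live in New York", {"entities": [(10, 18, 'GPE')]}),
]

# Register entity labels so the model knows them before training
for _, annotations in TRAIN_DATA:
    for start, end, label in annotations["entities"]:
        ner.add_label(label)

optimizer = nlp.initialize()  # spaCy v3 API; replaces begin_training()
for itn in range(30):
    for text, annotations in TRAIN_DATA:
        doc = nlp.make_doc(text)
        example = Example.from_dict(doc, annotations)
        nlp.update([example], sgd=optimizer)

# Apply the trained pipeline to unseen text and inspect its predictions
doc = nlp("Google opened an office in New York")
print([(ent.text, ent.label_) for ent in doc.ents])
```

With only two training sentences the predictions are unreliable; real NER training needs substantially more annotated data.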