BERT uses a special pre-training task called Masked Language Model (MLM). What is the main goal of MLM?
Think about how BERT learns from words hidden in the middle of sentences.
MLM trains BERT to predict missing words by looking at words before and after the masked word, helping it understand context deeply.
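The masking step can be sketched in a few lines. This is a simplified illustration, not BERT's actual pre-processing (real BERT masks about 15% of tokens and, of those, replaces 80% with [MASK], 10% with a random token, and 10% left unchanged; here every selected token simply becomes [MASK]):

```python
import random

MASK = "[MASK]"

def mask_tokens(tokens, mask_prob=0.15, seed=0):
    """Replace roughly mask_prob of the tokens with [MASK]; return the
    masked sequence plus a map from position to the original token."""
    rng = random.Random(seed)
    masked, targets = [], {}
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:
            targets[i] = tok  # the model must recover this from both-side context
            masked.append(MASK)
        else:
            masked.append(tok)
    return masked, targets

tokens = "the cat sat on the mat".split()
masked, targets = mask_tokens(tokens)
# the MLM loss is computed only at the positions stored in `targets`
```

The key point is that the original token is hidden from the input but kept as the training target, forcing the model to use the surrounding words on both sides.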
Besides MLM, BERT uses Next Sentence Prediction (NSP) during pre-training. What does NSP help BERT learn?
Think about how BERT understands relationships between two sentences.
NSP trains BERT to decide if a second sentence naturally follows the first, helping it learn sentence relationships useful for tasks like question answering.
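Building NSP training pairs can be sketched as follows. This is an illustrative simplification (the corpus and pairing logic here are toy assumptions, not BERT's actual data pipeline): roughly half the examples pair a sentence with its true successor (label 1), and half pair it with a random sentence (label 0):

```python
import random

def make_nsp_pairs(sentences, seed=0):
    """Build (sentence_a, sentence_b, is_next) examples: about half use the
    real next sentence (label 1), half use a random one (label 0)."""
    rng = random.Random(seed)
    pairs = []
    for i in range(len(sentences) - 1):
        if rng.random() < 0.5:
            pairs.append((sentences[i], sentences[i + 1], 1))      # true next sentence
        else:
            pairs.append((sentences[i], rng.choice(sentences), 0)) # random sentence
    return pairs

corpus = [
    "He opened the door.",
    "The room was dark.",
    "She asked a question.",
    "No one answered.",
]
pairs = make_nsp_pairs(corpus)
```

During pre-training, BERT classifies each pair using the [CLS] token's final hidden state, learning whether the second sentence plausibly follows the first.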
BERT can look at words before and after a masked word simultaneously. Which part of BERT's architecture allows this?
Think about which architecture processes all words at once with attention.
BERT uses Transformer encoder layers that attend to all words in the input simultaneously, enabling bidirectional context understanding.
Which metric is commonly used to measure BERT's performance on the Masked Language Model task during pre-training?
Focus on how well the model guesses the hidden words correctly.
Accuracy measures the percentage of masked tokens correctly predicted, directly reflecting MLM task performance.
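Masked-token accuracy is straightforward to compute; the sketch below (function name and dict-based format are illustrative assumptions, not a library API) scores only the masked positions, ignoring all unmasked tokens:

```python
def mlm_accuracy(predictions, targets):
    """Fraction of masked positions where the predicted token matches the
    original token. `targets` maps position -> original token; `predictions`
    maps position -> the model's top guess."""
    if not targets:
        return 0.0
    correct = sum(predictions.get(i) == tok for i, tok in targets.items())
    return correct / len(targets)

targets = {1: "cat", 4: "mat"}      # positions that were masked
predictions = {1: "cat", 4: "rug"}  # model guessed one of two correctly
acc = mlm_accuracy(predictions, targets)  # -> 0.5
```

Note that perplexity (derived from the cross-entropy loss) is also widely reported for language-model pre-training; accuracy is simply the most direct measure of how often the hidden words are guessed exactly.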
Suppose you accidentally feed BERT input sequences without masking any tokens during MLM pre-training. What is the most likely outcome?
Think about what happens if the model never has to guess missing words.
If no tokens are masked, the MLM loss is computed over zero positions, so it is trivially zero and produces no gradient signal; the model never has to predict anything and learns no useful contextual representations from the MLM objective.
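The failure mode above can be seen directly in a toy loss function (a simplified sketch, assuming log-probabilities are stored per masked position; this is not BERT's actual loss code): the cross-entropy is averaged only over masked positions, so an empty target set yields no terms at all.

```python
import math

def mlm_loss(log_probs, targets):
    """Cross-entropy averaged over masked positions only. With nothing
    masked there are no terms: the loss is trivially zero and no gradient
    signal ever reaches the model."""
    if not targets:
        return 0.0
    return -sum(log_probs[i][tok] for i, tok in targets.items()) / len(targets)

log_probs = {1: {"cat": math.log(0.9)}, 4: {"mat": math.log(0.5)}}
normal = mlm_loss(log_probs, {1: "cat", 4: "mat"})  # averaged over 2 masked slots
degenerate = mlm_loss(log_probs, {})                # no masks -> 0.0, nothing to learn
```

The degenerate case mirrors the unmasked-input scenario: the objective is optimized from the start, so pre-training accomplishes nothing.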