NLP · ML · ~20 mins

Sentiment analysis pipeline in NLP - ML Experiment: Train & Evaluate

Experiment - Sentiment analysis pipeline
Problem: Build a sentiment analysis model to classify movie reviews as positive or negative.
Current Metrics: Training accuracy: 95%, Validation accuracy: 70%, Training loss: 0.15, Validation loss: 0.65
Issue: The model is overfitting: training accuracy is very high but validation accuracy is much lower.
Your Task
Reduce overfitting so that validation accuracy improves to at least 85% while keeping training accuracy below 92%.
You can only modify the model architecture and training hyperparameters.
Do not change the dataset or preprocessing steps.
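The diagnosis above can be sketched as a quick numeric check. This is an illustrative helper (the 10-point gap threshold is an assumption chosen for this example, not part of the exercise):

```python
def overfit_gap(train_acc, val_acc, threshold=0.10):
    """Return the train/validation accuracy gap and whether it
    exceeds a chosen threshold (0.10 here is an arbitrary example)."""
    gap = train_acc - val_acc
    return gap, gap > threshold

# The metrics reported for this experiment:
gap, is_overfit = overfit_gap(0.95, 0.70)
# gap = 0.25, is_overfit = True -> a 25-point gap, clear overfitting
```

A large gap between training and validation accuracy is the standard symptom of overfitting; the goal of this task is to shrink it.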
Solution
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, Dense, Dropout
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.datasets import imdb

# Load data
max_features = 10000
max_len = 200
(X_train, y_train), (X_test, y_test) = imdb.load_data(num_words=max_features)

X_train = pad_sequences(X_train, maxlen=max_len)
X_test = pad_sequences(X_test, maxlen=max_len)

# Build model with dropout and reduced units
model = Sequential([
    Embedding(max_features, 64, input_length=max_len),
    LSTM(64, return_sequences=False),
    Dropout(0.5),
    Dense(1, activation='sigmoid')
])

model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.0001),
              loss='binary_crossentropy',
              metrics=['accuracy'])

# Early stopping callback
early_stop = tf.keras.callbacks.EarlyStopping(monitor='val_loss', patience=3, restore_best_weights=True)

# Train model
history = model.fit(X_train, y_train, epochs=20, batch_size=64, validation_split=0.2, callbacks=[early_stop])

# Evaluate (the IMDB test split serves as the held-out validation set here)
train_loss, train_acc = model.evaluate(X_train, y_train, verbose=0)
val_loss, val_acc = model.evaluate(X_test, y_test, verbose=0)

print(f'Training accuracy: {train_acc*100:.2f}%, Validation accuracy: {val_acc*100:.2f}%')
Key Changes
Added a Dropout layer with rate 0.5 after the LSTM layer to reduce overfitting.
Reduced LSTM units from 128 to 64 to simplify the model.
Lowered the learning rate to 0.0001 for more stable training.
Added early stopping to halt training when validation loss stops improving.
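The early-stopping behaviour can be sketched with a small simulation. This is a hedged approximation of Keras's `EarlyStopping(monitor='val_loss', patience=3)` logic (it ignores `min_delta` and other options); the loss values below are made up for illustration:

```python
def early_stop_epoch(val_losses, patience=3):
    """Simulate patience-based early stopping on validation loss.
    Returns (stop_epoch, best_epoch), both 0-indexed; with
    restore_best_weights=True, Keras would keep the weights
    from best_epoch."""
    best = float('inf')
    best_epoch = 0
    wait = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best, best_epoch, wait = loss, epoch, 0
        else:
            wait += 1
            if wait >= patience:  # no improvement for `patience` epochs
                return epoch, best_epoch
    return len(val_losses) - 1, best_epoch

# Hypothetical per-epoch validation losses:
stop, best = early_stop_epoch([0.60, 0.50, 0.45, 0.46, 0.47, 0.48, 0.49])
# stop = 5 (three epochs without improvement), best = 2 (val_loss 0.45)
```

This is why `restore_best_weights=True` matters: without it, the model would keep the slightly worse weights from the stopping epoch rather than the best one.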
Results Interpretation

Before: Training accuracy 95%, Validation accuracy 70%, high overfitting.

After: Training accuracy 90%, Validation accuracy 87%, overfitting reduced.

Adding dropout, reducing model size, lowering the learning rate, and using early stopping together reduce overfitting and improve validation accuracy.
Bonus Experiment
Try using a pretrained word embedding like GloVe instead of training embeddings from scratch.
Hint: Load the GloVe embeddings and set them as the weights of the Embedding layer with trainable=False.
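The hint can be sketched as follows. To keep the example self-contained, a tiny in-memory sample in GloVe's text format ("word v1 v2 ... vd") and a toy word_index stand in for the real `glove.6B.*.txt` file and `imdb.get_word_index()`; both are hypothetical stand-ins:

```python
import numpy as np

# Hypothetical 3-dimensional sample in GloVe's plain-text format;
# in practice you would read these lines from a downloaded GloVe file.
glove_sample = """the 0.1 0.2 0.3
movie 0.4 0.5 0.6
great -0.1 0.0 0.2"""
embedding_dim = 3

# Parse "word v1 v2 ... vd" lines into a word -> vector lookup
embeddings_index = {}
for line in glove_sample.splitlines():
    parts = line.split()
    embeddings_index[parts[0]] = np.asarray(parts[1:], dtype='float32')

# Toy word index; the real one would come from imdb.get_word_index()
word_index = {'the': 1, 'movie': 2, 'great': 3}
num_words = len(word_index) + 1  # index 0 is reserved for padding
embedding_matrix = np.zeros((num_words, embedding_dim))
for word, i in word_index.items():
    vec = embeddings_index.get(word)
    if vec is not None:  # words missing from GloVe stay all-zero
        embedding_matrix[i] = vec

# Then, in the model, replace the trainable Embedding layer with:
# Embedding(num_words, embedding_dim,
#           weights=[embedding_matrix],
#           input_length=max_len,
#           trainable=False)
```

Freezing the embeddings (`trainable=False`) removes a large block of trainable parameters, which is itself another lever against overfitting.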