Overview - First Selenium script

What is it?

A Selenium script is a small program that controls a web browser automatically. It can open websites, click buttons, fill forms, and check if things work as expected. This helps testers check websites without doing everything by hand. Selenium scripts use code to tell the browser what to do step-by-step.

Why it matters

Without Selenium scripts, testers would have to check websites manually, which is slow and prone to mistakes. Automating tests saves time, finds bugs faster, and makes sure websites work well for users. This is important because websites change often and need quick, reliable checks.

Where it fits

Before learning Selenium scripts, you should know basic programming in Python and understand what web browsers and websites are. After this, you can learn how to write more complex tests, use test frameworks, and run tests automatically in teams.

Mental Model

Core Idea

A Selenium script is like a remote control that tells a web browser exactly what to do to test a website automatically.

Think of it like...

Imagine you have a robot helper who follows your exact instructions to open a door, press buttons, and check if the lights turn on. The Selenium script is the instruction list, and the browser is the robot doing the tasks.

┌─────────────────────┐
│ Selenium Script Code │
└──────────┬──────────┘
           │ sends commands
           ▼
┌─────────────────────┐
│   Web Browser (e.g., │
│    Chrome, Firefox)  │
└──────────┬──────────┘
           │ performs actions
           ▼
┌─────────────────────┐
│  Website Under Test  │
└─────────────────────┘

Build-Up - 7 Steps

1

FoundationSetting up Selenium and WebDriver

Concept: Learn how to install Selenium and prepare the browser driver to control the browser.

First, install Selenium using pip: pip install selenium. Then, download the correct WebDriver for your browser (like ChromeDriver for Chrome). Place the driver in a known folder or add it to your system path. This setup lets your Python code talk to the browser.

Result

You have Selenium installed and the browser driver ready to launch and control the browser.

Knowing how to set up Selenium and the driver is essential because without this, your script cannot communicate with the browser.

2

FoundationWriting a simple script to open a webpage

3

IntermediateFinding elements on the page

4

IntermediatePerforming actions on elements

5

IntermediateAdding assertions to verify results

6

AdvancedHandling waits for dynamic content

7

ExpertStructuring scripts for maintainability

Under the Hood

Selenium works by sending commands from your script to a browser driver, which then controls the browser using its automation interface. The driver translates commands like 'open URL' or 'click element' into browser actions. The browser executes these actions and returns results or errors back to the script. This communication happens over a WebDriver protocol using HTTP requests.

Why designed this way?

Selenium was designed to separate test code from browser control, allowing tests to run on many browsers without changing code. Using a driver for each browser respects browser security and architecture. The WebDriver protocol standardizes commands so tools and browsers can evolve independently.

┌───────────────┐       HTTP       ┌───────────────┐
│ Selenium Test │ ───────────────> │ Browser Driver│
│    Script     │ <────────────── │               │
└───────────────┘                  └──────┬────────┘
                                         │
                                         ▼
                                ┌─────────────────┐
                                │   Web Browser   │
                                └─────────────────┘

Myth Busters - 3 Common Misconceptions

Quick: Does Selenium automatically wait for all page elements to load before acting? Commit yes or no.

Common Belief:Selenium waits automatically for all elements to load before performing actions.

Tap to reveal reality

Quick: Can Selenium test websites without a real browser? Commit yes or no.

Common Belief:Selenium can test websites without opening a real browser window.

Tap to reveal reality

Quick: Is it best to write all test steps in one big script? Commit yes or no.

Common Belief:Writing one big script with all test steps is simpler and better.

Tap to reveal reality

Expert Zone

1

Tests can fail silently if element locators are brittle; using stable attributes like data-test-id improves reliability.

2

Implicit waits can cause unpredictable delays; explicit waits give precise control over timing and conditions.

3

Running tests in parallel requires careful browser session management to avoid conflicts and speed up test suites.

When NOT to use

Selenium is not ideal for testing non-web applications or APIs; use tools like Postman for APIs or Appium for mobile apps instead.

Production Patterns

In real projects, Selenium tests are integrated into CI/CD pipelines to run automatically on code changes. Tests use page object models to separate page structure from test logic, improving maintainability.

Connections

API Testing

Complementary testing approach

Knowing Selenium helps understand UI testing, while API testing focuses on backend logic; combining both gives full coverage.

Robotic Process Automation (RPA)

Similar automation pattern

Both Selenium and RPA automate user actions, but RPA targets business workflows beyond testing, showing automation's broad power.

Human-Computer Interaction (HCI)

User behavior simulation

Selenium scripts mimic how users interact with interfaces, linking software testing to understanding user experience in HCI.

Common Pitfalls

#1Trying to find elements before the page fully loads causes errors.

Wrong approach:element = browser.find_element('id', 'dynamic') # immediately after get()

Correct approach:from selenium.webdriver.support.ui import WebDriverWait from selenium.webdriver.support import expected_conditions as EC from selenium.webdriver.common.by import By wait = WebDriverWait(browser, 10) element = wait.until(EC.presence_of_element_located((By.ID, 'dynamic')))

Root cause:Misunderstanding that web pages load asynchronously and elements may not be present immediately.

#2Hardcoding element locators that change frequently leads to broken tests.

Wrong approach:button = browser.find_element('xpath', '//div[3]/button[1]')

Correct approach:button = browser.find_element('css selector', '[data-test-id="submit-button"]')

Root cause:Using fragile locators tied to page layout instead of stable, semantic attributes.

#3Not closing the browser after tests wastes resources and causes errors.

Wrong approach:browser = webdriver.Chrome() browser.get('https://example.com') # no browser.quit() call

Correct approach:browser = webdriver.Chrome() browser.get('https://example.com') browser.quit()

Root cause:Forgetting cleanup steps due to lack of test structure or automation framework.

Key Takeaways

Selenium scripts automate browsers by sending commands to control web pages step-by-step.

Setting up the browser driver correctly is essential for Selenium to work.

Locating page elements precisely and waiting for them to load prevents common test failures.

Assertions in scripts turn actions into tests by verifying expected results.

Organizing tests with functions and frameworks improves maintainability and scalability.