Complete the code to specify the main component responsible for fetching web pages.
The web crawler starts with the [1] component to download web pages.
The Fetcher is the component that downloads web pages from the internet.
Complete the code to identify the component that manages the list of URLs to visit.
The [1] manages the queue of URLs that the crawler needs to visit next.The URL Scheduler keeps track of URLs to crawl and controls the order of crawling.
Fix the error in the component that extracts links from downloaded pages.
The [1] extracts URLs and content from the fetched web pages.
The Parser analyzes the page content and extracts links and data.
Fill both blanks to complete the description of the crawler's storage components.
The [1] stores the raw web pages, while the [2] stores the processed data for search.
The Content Repository saves raw pages; the Index stores searchable data.
Fill all three blanks to complete the request flow in the crawler system.
First, the [1] selects a URL, then the [2] downloads the page, and finally the [3] extracts links and data.
The Scheduler picks URLs, the Fetcher downloads pages, and the Parser extracts data.
