Browser automation with Playwright

01

Introduction to Playwright with the NICAR schedule (uggghhhh)

When requests and BeautifulSoup can't see content because it's rendered by JavaScript, it's time to bring in a real browser. We'll use Playwright to scrape a JavaScript-heavy site, click 'Show More' buttons, and pull data into pandas.

Code-along Download .ipynb

Ref: Playwright for Python • NICAR Schedule

Open in Colab Read

02

Introduction to Playwright with OpenSyllabus

When requests and BeautifulSoup can't see content because it's rendered by JavaScript, it's time to bring in a real browser. We'll use Playwright to scrape a JavaScript-heavy site, click 'Show More' buttons, and pull data into pandas.

Code-along Download .ipynb

Ref: Playwright for Python • OpenSyllabus

Open in Colab Read

03

Scraping Texas tow truck licenses

A gentle warm-up: select an option from a dropdown, click search, and grab the results table. Just enough interaction to get comfortable with Playwright's selectors.

Code-along Download .ipynb

Ref: Texas TDLR license search

Open in Colab Read

04

Paginating through Iowa appraisal companies

What happens when results span multiple pages? We'll click 'Next Page' in a loop, collecting every table along the way and combining them with pandas.

Code-along Download .ipynb

Ref: Iowa Professional Licensing

Open in Colab Read

05

Looping through North Dakota oil well townships

Instead of paginating, this time we loop through every option in a dropdown. Each township gets its own search, and we stack all the results together.

Code-along Download .ipynb

Ref: ND Oil and Gas well search

Open in Colab Read

06

Filling forms to find Maryland locksmiths

Now we're typing into text fields instead of picking from dropdowns. We'll loop through a list of zip codes, fill in the search form each time, and collect the results.

Code-along Download .ipynb

Ref: Maryland electronic licensing

Open in Colab Read

07

Downloading PDFs from the NC State Bar

The final boss: navigate to a page, loop through some letters, click through some pages, and download PDFs.

Code-along Download .ipynb

Ref: NC State Bar Discipline Orders

Open in Colab Read

08

AI-powered scraper writer

Point an AI agent at a website and let it write a Playwright scraper for you. You describe what you want in plain English, it explores the page and produces a working script. Requires a free Google AI API key.

Code-along Download .ipynb

Ref: Google AI Studio (free API key) • PydanticAI

Open in Colab Read