Python Web Scraper & Automation Developer

I build reliable data collection pipelines and automation tools.

About

I'm Mike Dugan, a Python web scraper and automation developer. I help businesses and individuals collect, organize, and analyze online data, turning messy websites into clean, usable spreadsheets or databases. From eCommerce listings to market research and real-time tracking, I build fast, reliable scrapers tailored to each project. My goal is simple: deliver accurate data, on time, in the exact format you need.

  • Dynamic sites (Playwright/Selenium) & API integrations
  • Data cleaning/post-processing
  • Ethical scraping practices

Projects

Steam Sale Scraper

Screenshot of Steam Sale Scraper results

Scrapes Steam Store sale pages to collect the most discounted games, including title, tags, reviews, pricing, and discount percentage.

  • Handles JS-rendered pages with Playwright
  • Config-driven selectors; modular parsing
  • Validation & post-processing pipeline
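
To illustrate the validation and post-processing step, here is a minimal sketch of how a raw scraped record could be cleaned before export. It is not the project's actual code: the `SaleItem` fields, the raw dictionary keys, and the helper names are assumptions made for this example, and it uses only the standard library.

```python
import re
from dataclasses import dataclass


@dataclass
class SaleItem:
    """Cleaned record for one discounted game (hypothetical schema)."""
    title: str
    original_price: float
    sale_price: float
    discount_pct: int


def parse_price(text: str) -> float:
    """Extract the first number from a price string such as "$59.99"."""
    match = re.search(r"\d+(?:\.\d+)?", text.replace(",", ""))
    if match is None:
        raise ValueError(f"no price found in {text!r}")
    return float(match.group())


def clean_record(raw: dict) -> SaleItem:
    """Normalize raw scraped strings and reject inconsistent rows."""
    title = raw["title"].strip()
    if not title:
        raise ValueError("empty title")

    original = parse_price(raw["original_price"])
    sale = parse_price(raw["sale_price"])
    # Discount text is assumed to look like "-75%".
    discount = int(raw["discount"].strip().lstrip("-").rstrip("%"))

    if not 0 <= discount <= 100:
        raise ValueError(f"discount out of range: {discount}")
    if sale > original:
        raise ValueError("sale price exceeds original price")

    return SaleItem(title, original, sale, discount)


raw = {
    "title": "  Example Game ",
    "original_price": "$59.99",
    "sale_price": "$14.99",
    "discount": "-75%",
}
item = clean_record(raw)
```

Rows that fail a check raise `ValueError`, so a caller can log and skip them instead of writing bad data to the output file.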

OpenGovUS Business Scraper

Screenshot of OpenGovUS Business Scraper CSV output

Extracts business listings from OpenGovUS with robust pagination, deduplication, and cleaned CSV output suitable for analysis.

  • Pagination handling & duplicate detection
  • Cleaned datasets with consistent schema
  • Production-style code organization
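
The pagination and duplicate-detection pattern can be sketched roughly as follows. This is an illustration under assumptions, not the scraper's real code: `fetch_page` stands in for whatever function downloads and parses one results page, and the `name` and `state` key fields are hypothetical.

```python
from typing import Callable, Iterable, Iterator


def crawl_pages(
    fetch_page: Callable[[int], list],
    max_pages: int = 1000,
) -> Iterator[dict]:
    """Yield rows page by page until a page comes back empty."""
    for page in range(1, max_pages + 1):
        rows = fetch_page(page)
        if not rows:
            break  # an empty page is treated as the end of the listing
        yield from rows


def dedupe(rows: Iterable[dict], key_fields: tuple) -> list:
    """Keep the first row seen for each normalized key."""
    seen = set()
    unique = []
    for row in rows:
        # Normalize whitespace and case so near-identical rows collide.
        key = tuple(str(row.get(f, "")).strip().lower() for f in key_fields)
        if key in seen:
            continue
        seen.add(key)
        unique.append(row)
    return unique


# Stand-in for real HTTP requests: two pages with one repeated business.
fake_pages = {
    1: [{"name": "Acme LLC", "state": "TX"}, {"name": "Beta Inc", "state": "CA"}],
    2: [{"name": " acme llc", "state": "tx"}, {"name": "Gamma Co", "state": "NY"}],
}
businesses = dedupe(
    crawl_pages(lambda page: fake_pages.get(page, [])),
    key_fields=("name", "state"),
)
```

The `max_pages` cap is a safety net so a site that never returns an empty page cannot keep the crawler running indefinitely.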

BooksToScrape eCommerce Scraper

Screenshot of BooksToScrape product data

Scrapes product data from books.toscrape.com, including titles, prices, availability, ratings, and categories, across all paginated listing pages.

  • Pagination-aware crawler
  • Robust parsing of product attributes
  • CSV export with consistent fields
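
As a rough sketch of attribute parsing and consistent CSV export, the example below converts raw strings into typed fields and writes them with a fixed column order. The raw dictionary keys, the `FIELDS` list, and the assumption that the rating arrives as a class string like "star-rating Three" are illustrative choices for this example, not a description of the project's code.

```python
import csv
import io
import re

RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}
FIELDS = ["title", "price_gbp", "rating", "in_stock"]  # fixed column order


def parse_book(raw: dict) -> dict:
    """Convert raw scraped strings into typed, consistently named fields."""
    words = raw["rating_class"].split()
    rating_word = words[-1] if words else ""
    return {
        "title": raw["title"].strip(),
        # Drop the currency symbol and anything else that is not a digit or dot.
        "price_gbp": float(re.sub(r"[^\d.]", "", raw["price"])),
        "rating": RATING_WORDS.get(rating_word, 0),  # 0 means unknown
        "in_stock": raw["availability"].strip().lower().startswith("in stock"),
    }


def to_csv(rows: list) -> str:
    """Serialize rows to CSV text with the same header every time."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


book = parse_book({
    "title": " A Light in the Attic ",
    "price": "£51.77",
    "rating_class": "star-rating Three",
    "availability": "In stock (22 available)",
})
csv_text = to_csv([book])
```

Writing through `csv.DictWriter` with an explicit `fieldnames` list keeps the header and column order identical across runs, which makes the output easier to load into a spreadsheet or database.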

CoinGecko API Scraper

Screenshot of CoinGecko API data results

Retrieves the top 100 cryptocurrencies by market cap via the CoinGecko API, including price, market cap, volume, and 24h change.

  • API consumption with error handling
  • Rate-limit safety & retries
  • Schema-validated structured output
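
A minimal sketch of retrying with exponential backoff and validating the response shape might look like the following. It assumes CoinGecko's public `/coins/markets` endpoint and its usual field names; the helper names, retry counts, and delays are choices made for this example rather than the project's actual settings, and only the standard library is used.

```python
import json
import time
import urllib.error
import urllib.parse
import urllib.request

MARKETS_URL = "https://api.coingecko.com/api/v3/coins/markets?" + urllib.parse.urlencode(
    {"vs_currency": "usd", "order": "market_cap_desc", "per_page": 100, "page": 1}
)

# Fields assumed to be present in every row, with their expected types.
REQUIRED_FIELDS = {
    "id": str,
    "symbol": str,
    "name": str,
    "current_price": (int, float),
    "market_cap": (int, float),
    "total_volume": (int, float),
}


def http_get_json(url: str, timeout: float = 10.0):
    """Fetch a URL and decode the JSON body."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


def fetch_with_retries(url, get=http_get_json, retries=3, base_delay=1.0, sleep=time.sleep):
    """Call get(url), waiting 1x, 2x, 4x... base_delay between failed attempts."""
    for attempt in range(retries + 1):
        try:
            return get(url)
        except (urllib.error.URLError, TimeoutError):
            if attempt == retries:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)


def validate_coin(row: dict) -> dict:
    """Check required fields and return only the columns we keep."""
    for field, expected in REQUIRED_FIELDS.items():
        if not isinstance(row.get(field), expected):
            raise ValueError(f"bad or missing field: {field}")
    clean = {field: row[field] for field in REQUIRED_FIELDS}
    # The 24h change is treated as optional here because it may be null.
    clean["price_change_percentage_24h"] = row.get("price_change_percentage_24h")
    return clean
```

A typical call would be `coins = [validate_coin(row) for row in fetch_with_retries(MARKETS_URL)]`. Passing `get` and `sleep` as parameters lets the retry logic be tested without touching the network or actually waiting.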