Back
Similar todos
working with Pandas (Python) to update a few existing scrapers. It's really, really nice but I ran into a bug trying to pull URLs that I don't want to think about right now.
finished out my day updating scraper-proxy to support css and xpath selectors along with a few features I haven't seen anyone do yet.
started switching some nasty scraping code project over to Pandas because it works so damn well
🐼 reading up on the Python Pandas library since it makes scraping HTML tables easy peasy
worked on two new #jobs scrapers using RSS
working on some a few scrapers for pulling in data vs. just using sheets #daysuntillatenight
finally finished scraper and data looks good. a few hard refactors but I have a pretty good handle on spatula now.
worked on how to test/keep a scraping project updated more easily
Prepare scrapers code for batches and pagination #mrscraper
built two more article scrapers using spatula. It's really nice.
pulled in a new csv scraper data for #daysuntillatenight
build more scrapers #pagesonpages
late night hacking on a scraper project and mostly getting the UI updated
🔨 Updates scraper-proxy tool and rewriting/refactoring from the ground up to handle structured data
more scraper work and I finally figured out how to work around a few issues I couldn't quite wrap my brain around.
more scraper-proxy work (up to 31 deploys this month) and worked on selector logic and automation.
Updated the way I process the scraping data to minimize errors #bagsoup
implemented scraping from posts #scrapebook
looking into some new scraping issues