Back
Similar todos
more Pandas updates and I built out a reverse scraper which makes it easier to say what values you want on a page and then it builds out the CSS selectors backwards.
more scraper-proxy work (up to 31 deploys this month) and worked on selector logic and automation.
worked on a scraper service tonight to update it for a few common area layouts and the general collection system
finally finished scraper and data looks good. a few hard refactors but I have a pretty good handle on spatula now.
late night hacking on a scraper project and mostly getting the UI updated
worked on two new #jobs scrapers using RSS
🔨 Updates scraper-proxy tool and rewriting/refactoring from the ground up to handle structured data
worked on how to test/keep a scraping project updated more easily
make new scraper and test it #plumberjobs
more scraper work and I finally figured out how to work around a few issues I couldn't quite wrap my brain around.
overhauled another scraper/processor with much better tests and fixed several non-obvious bugs. I should have written tested upfront.
continue work on scrapers #pagesonpages
built two more article scrapers using spatula. It's really nice.
fix a few scrapers for #devopsprojectshq
working with Pandas (Python) to update a few existing scrapers. It's really, really nice but I ran into a bug trying to pull URLs that I don't want to think about right now.
working on some scraping for data analysis #linkhandy
fixed a bug in one of my job scrapers
finished job scraper
worked on job website tests on one of the scrapers