Back
Similar todos
working with Pandas (Python) to update a few existing scrapers. It's really, really nice but I ran into a bug trying to pull URLs that I don't want to think about right now.
🐼 reading up on the Python Pandas library since it makes scraping HTML tables easy peasy
more Pandas updates and I built out a reverse scraper which makes it easier to say what values you want on a page and then it builds out the CSS selectors backwards.
working on some a few scrapers for pulling in data vs. just using sheets #daysuntillatenight
finally finished scraper and data looks good. a few hard refactors but I have a pretty good handle on spatula now.
worked on how to test/keep a scraping project updated more easily
worked on another data importer using the new python spatula library. It's not bad, but proving to be a bit more difficult to get my 10+ year old project synced than I'd like
re-worked one of my job scrapers and trying to decide if I should keep it or keep moving forward on it
late night hacking on a scraper project and mostly getting the UI updated
🍔 built a project scraper over lunch that I don't hate. I used sqlite-utils with mixed results. might just port it back to django and call it a day
looking into some new scraping issues
dusted off my scraper project and shared it with a few machines
overhauled another scraper/processor with much better tests and fixed several non-obvious bugs. I should have written tested upfront.
pulled in a new csv scraper data for #daysuntillatenight
🐍 worked on a new python + django in the Fortune 500 project today as a soone to be released spreadsheet as a website data project #fortune500
Rewrote scraper. Second attempt is always significantly better than the first one.
worked on some more Python project automation to sync up a bunch of random changes
more scraper work and I finally figured out how to work around a few issues I couldn't quite wrap my brain around.
built two more article scrapers using spatula. It's really nice.