Back
Similar todos
working with Pandas (Python) to update a few existing scrapers. It's really, really nice but I ran into a bug trying to pull URLs that I don't want to think about right now.
fixed Pandas issue and I have better, cleaner data now
🐼 reading up on the Python Pandas library since it makes scraping HTML tables easy peasy
more Pandas updates and I built out a reverse scraper which makes it easier to say what values you want on a page and then it builds out the CSS selectors backwards.
working on some scraping for data analysis #linkhandy
working on some a few scrapers for pulling in data vs. just using sheets #daysuntillatenight
finally finished scraper and data looks good. a few hard refactors but I have a pretty good handle on spatula now.
worked on how to test/keep a scraping project updated more easily
updated pandas backend for #daysuntillatenight
worked on another data importer using the new python spatula library. It's not bad, but proving to be a bit more difficult to get my 10+ year old project synced than I'd like
re-worked one of my job scrapers and trying to decide if I should keep it or keep moving forward on it
some data scraping and analysis #life
late night hacking on a scraper project and mostly getting the UI updated
continue work on scrapers #pagesonpages
🍔 built a project scraper over lunch that I don't hate. I used sqlite-utils with mixed results. might just port it back to django and call it a day
did a little bit more django-fsm hacking by adding it to a page scraper project #mylife
dusted off my scraper project and shared it with a few machines
overhauled another scraper/processor with much better tests and fixed several non-obvious bugs. I should have written tested upfront.
refactored a scraper app to use django-q more effectively