Back
Yaël 🛸
#thecompaniesapi add wikipedia extraction ; feeding our existing flow ; funny how it takes no more than an afternoon to add such steps to our infra!
#thecompaniesapi prepare next deploy ; add stockExchange/stockSymbol & foundedYear to our existing extraction steps
#thecompaniesapi fix address extraction & resolving ; add stockExchange/stockSymbol
#thecompaniesapi deploy the latest robot update ; wouldnt be fun without a gc heap JS error at build time to reward me for all that hard work :)
#thecompaniesapi added model selection to our source system ; we can now run user queries on a model and our continuous extraction/datasets sources on another; became afraid of receiving a huge batch of user queries and make it create a huge Anthropic bill 😅 ; also merged my last 2 PRs to next production batch
#thecompaniesapi add sockets sync to our datasets builder page ; funny how the last mile feels like a reward when you build your own thing
#thecompaniesapi finish subdomains support & cover all cases; continue polishing fine tuning panel :)
#thecompaniesapi closing review checklist ; preparing for the merge of last PRs before running fine tuning!
#thecompaniesapi finished updating subdomains PR ; will soon be merged ; proper strategies for all domains redirection cases
#thecompaniesapi finish fine tuning panel ; submit branch for review
#thecompaniesapi work on datasets filtering & classification UI ; we can now review outputs from Claude/GPT and directly grab the dataset to send for fine-tuning
#thecompaniesapi work on dataset editor ui ; enable cross-worker communication thanks to tailscale ; working on your own timeline is super satisfying
#thecompaniesapi improve robot ui again ; display current scanning state on companies table ; improve progress display ; also keep improving the ideation step
#thecompaniesapi keep iterating on latest columns flows and ideation flow
#thecompaniesapi subdomains support ready for merge ; swap efforts back on AI to start creating the datasets ; added new extractions steps last missing main table datapoints, we can now fill all our columns with combination of website + ai extraction! next step fine tuning
#thecompaniesapi continue work on subdomains ; handling edge cases for this one aint easy
#thecompaniesapi polish subdomains support ; properly handle root domain aliasing
#thecompaniesapi added support for subdomains scanning in our whole stack ; we can now scan the whole web