I do quite a bit of scraping and have had good luck using various AI platforms. I haven't done real estate directly, but I have pulled restaurant details to build a few database and was happy with it.
I use jina.ai a lot for various projects. I have ZERO association with them, but it does a good enough job of cleaning up a URL to pass to OpenAI or Claude with great results. I'm using the Reader API, but they have some others that might be good.
I just pass the results to an LLM and rarely look back.
I use Python 99% of the time and I have had a ton of success with OpenAI + Structured data. I'm a few weeks into using PydanticAI for nice JSON/structured data out and I love using it. (also no association) ai.pydantic.dev/results/
I do quite a bit of scraping and have had good luck using various AI platforms. I haven't done real estate directly, but I have pulled restaurant details to build a few database and was happy with it.
I use jina.ai a lot for various projects. I have ZERO association with them, but it does a good enough job of cleaning up a URL to pass to OpenAI or Claude with great results. I'm using the Reader API, but they have some others that might be good.
I just pass the results to an LLM and rarely look back.
I use Python 99% of the time and I have had a ton of success with OpenAI + Structured data. I'm a few weeks into using PydanticAI for nice JSON/structured data out and I love using it. (also no association) ai.pydantic.dev/results/