I built this tool because I wanted a way to just take a bunch of URLs or domains, and query their content in RAG applications.It takes away the pain of crawling, extracting content, chunking, vectorizing, and updating periodically.I'm curious to see if it can be useful to others. I meant to launch this six months ago but life got in the way...
The Show HN product received mixed feedback. Users highlighted the need for better output, prompt control, and middle pricing tiers. Some see AI integration as a transitional phase, while others question the need for new tools given existing solutions like Google. Concerns were raised about scraping ethics, bot prevention, and respecting website owners' rights. Technical inquiries focused on sqlite-vss stability, handling website changes, and support for various file formats. Positive feedback mentioned the embedding management and website crawling features. There were also discussions on micropayment systems, fraud risks, and comparisons with similar services. A few comments were flagged for review, and some users shared links to alternative tools or services.
Users criticized the product for instability, outdated components, and limited functionality, including output and prompt adjustment. There's a significant disparity between free and enterprise versions, and ethical concerns over scraping and bot prevention. The product's niche is questioned, and it's seen as redundant with existing tools. Criticisms also target the lack of detail on processes, unclear pricing, and potential legal issues with copyright and GDPR. Users are concerned about the product's technical capabilities, compliance, and respect for website policies, as well as cloud-only functionality and hallucination issues in AI.