Queryable Web Research Series: Publishing a report about doctor web Ylli P, February 19, 2025 / April 16, 2025 As we come to the end of the first phase of doctor web (data retrieval web engine: https://pypi.org/project/dr-web-engine/), with the project now fully open source and published for end users on Docker Hub and the Python Package Index, we want to make sure we can share the motivations…
Queryable Web Ready for release: Making the dr-web-engine public Ylli P, February 13, 2025 / February 15, 2025 While we continue to add features to our data retrieval web engine (dr-web-engine, or doctor web for short), we are going to explore some of the steps needed to (a) make the new engine available to anyone who wants to use it, (b) make the source code available to anyone…
Queryable Web Building a Data Retrieval Web Engine: A Step-by-Step Journey Ylli P, February 2, 2025 / February 10, 2025 Introduction: In the Queryable Web Series, we’ve been exploring how to make web data as accessible and queryable as a database. The ultimate goal is to create tools that allow users to extract semi-structured data from websites using simple, declarative queries, without needing to write custom scripts for every use case…
Queryable Web Queryable Web Series – Introduction Ylli P, January 29, 2025 The web is an ocean of information, yet efficiently extracting structured data from it remains a challenge. Traditional approaches like web scraping often rely on brittle, ad-hoc scripts or centralized APIs with limited flexibility. In this new blog series, Queryable Web, I will explore an alternative approach: a structured, queryable…