Update DuckDB queries and parquet file loading #24

Open
lfoppiano wants to merge 3 commits into main from feature/luca/update-duckdb
@lfoppiano
This PR adds a parameter (and instructions) for querying local files with DuckDB.
It also changes how the files are loaded: the query itself now selects the crawl and the subset. This propagates the change from the Java Tour discussed here and in the follow-up PR.
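
For readers who have not opened the diff, the idea of selecting the crawl and the subset in the query can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the function names, the `crawl=.../subset=...` hive-partitioned directory layout, and the placeholder values are all assumptions made for the example.

```python
def build_local_index_query(local_prefix: str, crawl: str, subset: str) -> str:
    """Build SQL that scans local parquet files and filters by crawl and subset.

    Assumes (hypothetically) that files live under
    <local_prefix>/crawl=<id>/subset=<name>/*.parquet, so that DuckDB exposes
    `crawl` and `subset` as hive partition columns.
    """

    def quote(value: str) -> str:
        # Escape single quotes so the value is safe inside a SQL string literal.
        return "'" + value.replace("'", "''") + "'"

    # Recursive glob over every parquet file below the local prefix.
    glob = local_prefix.rstrip("/") + "/**/*.parquet"
    return (
        "SELECT count(*) AS n_rows "
        f"FROM read_parquet({quote(glob)}, hive_partitioning = true) "
        f"WHERE crawl = {quote(crawl)} AND subset = {quote(subset)}"
    )


def run_local_index_query(local_prefix: str, crawl: str, subset: str):
    """Execute the query with DuckDB and return the fetched rows."""
    import duckdb  # third-party dependency, imported lazily

    return duckdb.sql(build_local_index_query(local_prefix, crawl, subset)).fetchall()
```

Calling `run_local_index_query("/path/to/local/index", "EXAMPLE-CRAWL", "example-subset")` would then count the rows for that crawl and subset only; the path and both identifiers are placeholders. Filtering in the `WHERE` clause rather than in the file path keeps the crawl and subset visible in one place.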

Member

@laurieburchell laurieburchell left a comment

LGTM, the hard-coded variables are more obvious now

@lfoppiano lfoppiano requested a review from wumpus February 15, 2026 08:04
@lfoppiano
Author

@wumpus since this is public and adds some commands to download our data, could you please have a quick look at it?

2 participants