-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Open
Description
When running TPC-DS q72, I've noticed that regardless of the underlying file format, latency increases dramatically even with relatively modest scale factors like 10. I've measured the query at around 2.4 seconds with SF=1, but over 60s when SF=10.
When running in my benchmarking setup, the plan is (as you can see - its extremely join heavy) here.
Profiling the query using samply (This is with branch-50 over parquet, SF=1):

By playing around with it, seems like most time is spent in the loop inside the chain_traverse macro. I've tried a few common performance techniques - making it an explicitly inlined generic function, changing how the indices and values memory is managed/written to, but nothing made a noticeable difference.
Metadata
Metadata
Assignees
Labels
No labels