Skip to content

TPC-DS query 72 slow on modest (10) scale factors #17494

@AdamGS

Description

@AdamGS

When running TPC-DS q72, I've noticed that regardless of the underlying file format, latency increases dramatically even with relatively modest scale factors like 10. I've measured the query at around 2.4 seconds with SF=1, but over 60s when SF=10.

When running in my benchmarking setup, the plan is (as you can see - its extremely join heavy) here.

Profiling the query using samply (This is with branch-50 over parquet, SF=1):
Image

By playing around with it, seems like most time is spent in the loop inside the chain_traverse macro. I've tried a few common performance techniques - making it an explicitly inlined generic function, changing how the indices and values memory is managed/written to, but nothing made a noticeable difference.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions