Skip to content

Add having in prediction problem generation #133

@PatrikDurdevic

Description

@PatrikDurdevic

In the store dataset, the target entity is orderlines.

Currently, we can generate a problem like

  • Predict the number of records with <products.price> greater than 22.99 in next 1m days
  • SELECT count(*) from orderlines inner join products where products.price > 22.99

Which basically predicts how many individual products with a price > 22.99 will be ordered in a month.

With a having operation, we would be able to generate problems like

  • Predict the number of records having average <products.price> greater than 22.99 in the next 1m days
  • SELECT count(orderlines.orderid) from orderlines inner join products group by orderlines.orderid having avg(products.price) > 22.99

Which predicts how many orders there will be such that average product price in each order is > 22.99 in a month.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions