Implement PyArrow Dataset TableProvider #9

Merged
andygrove merged 6 commits into apache:master from kdbrooks:8-pyarrow-datasets
Jul 26, 2022
Contributor

@kdbrooks kdbrooks commented Jul 22, 2022

This implements a PyArrow Dataset TableProvider, which allows PyArrow Datasets to be registered and queried as tables in DataFusion.

Fixes #8 (originally datafusion-contrib/datafusion-python#10).

This is a rebase of datafusion-contrib/datafusion-python#59.

I have updated the TableProvider trait implementation for DataFusion 10.0.0, but I have not yet been able to confirm that it compiles because the master branch is not currently building.

Member

@andygrove andygrove left a comment

LGTM. Thanks @kylebrooks-8451.

@andygrove andygrove merged commit e0cbf48 into apache:master Jul 26, 2022
Development

Successfully merging this pull request may close these issues.

Support reading from PyArrow datasets

2 participants