Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QST] Evaluate nanoarrow as part of any future Arrow C Data Interface exploration #13678

Closed
beckernick opened this issue Jul 10, 2023 · 3 comments
Labels
0 - Backlog In queue waiting for assignment question Further information is requested

Comments

@beckernick
Copy link
Member

We've had some discussions in the past about exploring Arrow C Data Interface (cc @shwina @vyasr @GregoryKimball ) for a variety of reasons. If we used the C Data Interface, it might open the door to using the new, much lighter weight Arrow project nanoarrow:

The vision of nanoarrow is that it should be trivial for a library or application to implement an Arrow-based interface: if a library consumes or produces tabular data, Arrow should be the first place developers look. Developers shouldn’t have to be familiar with the details of the columnar specification—nor should they have to take on any build-time dependencies—to get started.

Nanoarrow is <350KB when compiled and can in theory safely interact with any recent PyArrow versions. If we return to exploring the Arrow C Data Interface, we should also explore Nanoarrow.

@beckernick beckernick added question Further information is requested Needs Triage Need team to review and classify labels Jul 10, 2023
@shwina
Copy link
Contributor

shwina commented Jul 10, 2023

Related: #5097

@paleolimbot
Copy link

Just a note if you aren't aware that there's now a Device extension of the C Data interface ( https://arrow.apache.org/docs/dev/format/CDeviceDataInterface.html ) and proof-of-concept nanoarrow implementation of it ( https://github.com/apache/arrow-nanoarrow/tree/main/extensions/nanoarrow_device#readme )!

@vyasr
Copy link
Contributor

vyasr commented Mar 21, 2024

We are now using nanoarrow in #15047 and I expect the usage to increase to cover the other relevant use cases as discussed in #15193.

@vyasr vyasr closed this as completed Mar 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
0 - Backlog In queue waiting for assignment question Further information is requested
Projects
None yet
Development

No branches or pull requests

5 participants