Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parquet allow user to specify column encoding #696

Open
norberttech opened this issue Oct 31, 2023 · 0 comments
Open

Parquet allow user to specify column encoding #696

norberttech opened this issue Oct 31, 2023 · 0 comments

Comments

@norberttech
Copy link
Member

Currently, PagesBuilder is deciding what type of encoding applies to the columns, and we only support two types:

  • RLE_DICTIONARY
  • PLAIN

We should allow users to overwrite this, but keep in mind that Booleans should not allow for RLE_DICTIONARY encoding.

I think encoding could be added to NestedColumn/FlatColumn static constructors and used later in PagesBuilder that should fallback to default logic when it's not specified.

@norberttech norberttech converted this from a draft issue Oct 31, 2023
@norberttech norberttech added this to the 0.6.0 milestone Nov 6, 2023
@norberttech norberttech modified the milestones: 0.6.0, 0.7.0 Jan 27, 2024
@norberttech norberttech removed this from the 0.7.0 milestone Mar 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Todo
Development

No branches or pull requests

1 participant