Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[HUDI-8892] Introduce projection push down for payload mode #12684

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

TheR1sing3un
Copy link
Member

In many wide table scenarios, there may be thousands of columns in a table, and there are multiple tasks processing different columns. When reading, the downstream only cares about some dimension columns. But now the payload mode does not support columns trimming, resulting in great performance regression when perform snapshot read on file slices with log files exist. This is because all columns of the base file are read, even though most of the columns are not needed by the user

Change Logs

  1. support projection push down for payload mode

Describe context and summary for this change. Highlight if any code was copied.

Impact

Improve mor read performance

Risk level (write none, low medium or high below)

medium

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

1. support projection push down for payload mode

Signed-off-by: TheR1sing3un <[email protected]>
@github-actions github-actions bot added the size:S PR with lines of changes in (10, 100] label Jan 21, 2025
@TheR1sing3un TheR1sing3un marked this pull request as draft January 21, 2025 10:29
1. refactor payload dummy instance

Signed-off-by: TheR1sing3un <[email protected]>
@hudi-bot
Copy link

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@TheR1sing3un TheR1sing3un marked this pull request as ready for review January 22, 2025 03:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
size:S PR with lines of changes in (10, 100]
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants