Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Runtime Stitching Progress #1743

Open
2 of 8 tasks
jnie-TT opened this issue Jan 10, 2025 · 0 comments
Open
2 of 8 tasks

Runtime Stitching Progress #1743

jnie-TT opened this issue Jan 10, 2025 · 0 comments
Assignees

Comments

@jnie-TT
Copy link
Contributor

jnie-TT commented Jan 10, 2025

This issue tracks the overall progress of runtime stitching.

Generality Features

These features reduce overhead and are generally applicable to most tasks, requiring minimal user intervention.

Performance Features

These features provide fine-grained control and aggressive performance optimization, but require task-specific user configuration.

  • Add compile hints in the compiler such that the user can toggle input/output layout (dram/l1, interleaved/sharded, row_major/tiled) [TODO]
  • Add compile hints in the compiler such that the user can toggle input persistency (volatile vs persistent). Persistent inputs will not be deallocated within the graph, whereas volatile inputs will be deallocated to free up memory once it has no more users [TODO]
  • Add compile hints in the compiler such that the user can toggle input/output device mesh grid and offset, this eliminates the need to redistribute tensors across multi-device for every program [TODO]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant