You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add device attribute to ttnn layout attr, so that runtime APIs know which device a tensor belongs to [TODO]
Performance Features
These features provide fine-grained control and aggressive performance optimization, but require task-specific user configuration.
Add compile hints in the compiler such that the user can toggle input/output layout (dram/l1, interleaved/sharded, row_major/tiled) [TODO]
Add compile hints in the compiler such that the user can toggle input persistency (volatile vs persistent). Persistent inputs will not be deallocated within the graph, whereas volatile inputs will be deallocated to free up memory once it has no more users [TODO]
Add compile hints in the compiler such that the user can toggle input/output device mesh grid and offset, this eliminates the need to redistribute tensors across multi-device for every program [TODO]
The text was updated successfully, but these errors were encountered:
This issue tracks the overall progress of runtime stitching.
Generality Features
These features reduce overhead and are generally applicable to most tasks, requiring minimal user intervention.
Performance Features
These features provide fine-grained control and aggressive performance optimization, but require task-specific user configuration.
The text was updated successfully, but these errors were encountered: