[runtime] initial support for running model on device #8
Conversation
It seems like |
Nvm, it makes sense after taking a look. I think the only thing that's a bit confusing is that there are 2 sets of |
Looks great!!
Force-pushed from 8fb96a7 to 47bb5ff
- moves device-specific code from `tt_torch_device/` into `runtime/`
- adds `TTSystem` class (singleton) holding all info on present devices
- runs the MLIR compiler as a separate compile stage, which generates a flatbuffer binary at the end
- implements `CompiledModel` class for running inference on a compiled model
- `run_binary()` is the function that invokes the tt-mlir runtime (see the sketch below)

NOTE: with this commit, the following tests are passing:
- pybuda/test/test_api.py
- pybuda/test/mlir/test_ops.py::test_add
- pybuda/test/mlir/test_ops.py::test_multiply
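For orientation, here is a minimal sketch of how these pieces could fit together. Only the names `TTSystem`, `CompiledModel`, and `run_binary()` come from the description above; the method signatures, the `compile_to_flatbuffer()` helper, and the usage at the bottom are assumptions for illustration, not the actual pybuda API.

```python
# Hypothetical sketch only: signatures and helpers are assumptions, not the
# real pybuda/runtime API. Only TTSystem, CompiledModel, and run_binary()
# are named in the PR description.


class TTSystem:
    """Singleton holding info on all devices present on the host."""

    _instance = None

    @classmethod
    def get_system(cls):
        # Create the singleton lazily on first access.
        if cls._instance is None:
            cls._instance = cls()
        return cls._instance

    def __init__(self):
        # The real runtime would query tt-mlir for attached devices;
        # a placeholder list stands in here.
        self.devices = ["tt_device_0"]


def run_binary(binary, inputs):
    # Stand-in for the call into the tt-mlir runtime.
    raise NotImplementedError("dispatch flatbuffer binary to tt-mlir runtime")


class CompiledModel:
    """Wraps a flatbuffer binary produced by the MLIR compile stage."""

    def __init__(self, binary):
        self.binary = binary

    def __call__(self, *inputs):
        # Inference on a compiled model goes through run_binary();
        # the exact signature is assumed.
        return run_binary(self.binary, inputs)


# Assumed usage, with compile_to_flatbuffer() as a hypothetical name for the
# compile stage that emits the flatbuffer binary:
#   compiled = CompiledModel(compile_to_flatbuffer(torch_module))
#   outputs = compiled(*torch_inputs)
```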
Force-pushed from 5ac06d4 to 35b0f5c
Yeah, I was thinking of moving it under runtime, e.g. |