Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better macOS build support #11

Closed
wants to merge 482 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
482 commits
Select commit Hold shift + click to select a range
c3214c0
Add backend overlay blob size error handling in recompile
jserbedzijaTT Mar 22, 2024
2ff96b4
[Perf] Add new sets of perf tracking models (focus on data formats an…
nvukobratTT Mar 20, 2024
4d9dd8d
Move conv2d transpose into sanity test
dzeng-tt Mar 21, 2024
750b700
Add ability to skip CI push jobs in perf run
tt-mpantic Mar 22, 2024
2a6731d
Fix net2pipe failure in perceiver on wh_b0
chandrasekaranpradeep Mar 22, 2024
5b767fa
Fix unit tests broken by MR1779
dgolubovicTT Mar 25, 2024
ea71701
Handle many different ways of falling back
ttssokorac Mar 25, 2024
1f26c6d
[Model] Fixup whisper after recent balancer update.
nobradovictt Mar 25, 2024
990d9d7
patch for yolov5m_640 failed job
kamalrajkannan78 Mar 25, 2024
7f94d1a
BBE added support for maximum. This PR removes decomposition of maxim…
mtopalovicTT Mar 25, 2024
86732ff
Adding Versim e2e test support in pybuda
sdjordjevicTT Mar 26, 2024
ba02f93
Patch for yolov5l(input size-640) timeout issue in CI job
kamalrajkannan78 Mar 26, 2024
4e1ee74
Investigate failing pybuda test pybuda-silicon-benchmark-tti-save-load
tt-mjudge Mar 26, 2024
d393e53
PT2: update conftest to account for list/tensor outputs
arui-yyz Mar 26, 2024
12860ab
Better CTA for our main README PyBuda #2520
nvukobratTT Mar 26, 2024
d45ed96
[Balancer] Refactor policy result and introduce BalancerPolicySolutio…
nobradovictt Mar 26, 2024
4ab1f89
[Model] Nightly model fixup after balancer update.
nobradovictt Mar 27, 2024
4e55a55
Fix tt-model-demos issue #2531
arui-yyz Mar 27, 2024
b18f398
Update TT-Buda logo for public docs #2520
nvukobratTT Mar 28, 2024
641857e
Modify yolov5 test script for GS testing
kamalrajkannan78 Mar 28, 2024
ee0aa5e
Update public installation document
rjakovljevicTT Mar 28, 2024
eb4fef0
[nightly] Fix #2445
svuckovicTT Mar 28, 2024
0551c36
[Balancer] Log balancer score(per epoch+total) to file for perf analy…
nobradovictt Mar 28, 2024
f7d19ae
Enable PYBUDA_PAD_OUTPUT_BUFFER for yolo_v5 [no_ci_perf_wh]
tt-mpantic Mar 24, 2024
fd0ce9c
Enable autotranspose in benchmarks
tt-mpantic Mar 28, 2024
73cd418
[perf_wh_fp16][perf_wh_bfp8_b] dummy change, check perf
vcanicTT Mar 12, 2024
5886984
[perf_wh_bfp8_b] change math fidelity from hifi2 to lofi for benchmar…
vcanicTT Mar 22, 2024
2899b35
[perf_wh_bfp8_b] change math fidelity from hifi2 to lofi for benchmar…
vcanicTT Mar 22, 2024
2eea21f
[perf_wh_bfp8_b] change math fidelity from hifi2 to lofi for benchmar…
vcanicTT Mar 22, 2024
e299c88
[perf_wh_fp16][perf_wh_bfp8_b] change math fidelity from hifi2 to lof…
vcanicTT Mar 27, 2024
ddd90cf
[perf_wh][perf_wh_fp16][perf_wh_bfp8_b] change math fidelity from hif…
vcanicTT Mar 27, 2024
8af171f
[perf_wh][perf_wh_fp16][perf_wh_bfp8_b] change math fidelity from hif…
vcanicTT Mar 27, 2024
7753aa9
[scripts] fix compare_perf to show whole table
pilkicTT Mar 28, 2024
0a5df0f
remove when: always from docs jobs
jwilde-tenstorrent Mar 28, 2024
db55d09
Significantly speed-up compile time of sparse matmuls
svuckovicTT Mar 29, 2024
bf8841c
[Benchmark] Tune and enable by default LoFi for unet and mv2.
nobradovictt Mar 29, 2024
ed85700
Pybuda BBE consume 20240314 week10
vmilosevic Mar 29, 2024
8b0ffe2
[nightly] Fix #2441, #2449, #2476
svuckovicTT Mar 28, 2024
3ad4e60
Benchmark overrides cleanup
tt-mpantic Mar 29, 2024
989b02f
Remove not needed files in benchmarks folder
tt-mpantic Mar 25, 2024
63ee2d0
[Model] DDRNet regression fix
jserbedzijaTT Mar 29, 2024
cb92c40
Update GitLab script to reference new pipeline naming #2540
nvukobratTT Mar 29, 2024
30d9950
Remove t5 base and flan-t5 base from benchmarks
tt-mpantic Mar 29, 2024
14eb6c2
Fix regressed fp16 benchmarks
tt-mpantic Apr 1, 2024
ecfc7d2
Remove duplicate jobs from job definitions
tt-mpantic Apr 1, 2024
264dcd2
Reset dram allocator before dynamic dram allocation fallback.
dgolubovicTT Apr 1, 2024
1ff4308
PT2: move PT module back to CPU before generating CPU outputs
arui-yyz Mar 21, 2024
6b71b9e
PT2: support graph without inputs / outputs
arui-yyz Mar 22, 2024
ef1f0dd
Add CI jobs that run all benchmarks in one job
tt-mpantic Apr 2, 2024
a7030f0
Removing Wormhole_A0 from the pybuda repository
sdjordjevicTT Apr 3, 2024
d6acc5a
Enable fusing of ops with output df override
odjuricicTT Apr 3, 2024
8766d56
PT2: Fix to dtype when both src / dest are TT tensors
arui-yyz Apr 2, 2024
2f4a8ee
Regression: Fix Whisper Medium on grayskull e300
arui-yyz Apr 3, 2024
6076038
Adding pre commit and post commit workflow and build action
vmilosevic Apr 10, 2024
cea2281
Fix typo in build actions
vmilosevic Apr 10, 2024
889c49d
Set env variable PYTHON_VERSION: "python3.10"
vmilosevic Apr 10, 2024
12126b9
Install deps (libboost-all-dev)
vmilosevic Apr 10, 2024
f5f3d01
Reorder actions
vmilosevic Apr 10, 2024
e8de218
Add libzmq3-dev
vmilosevic Apr 10, 2024
df98c1f
Merge pull request #21 from tenstorrent/vmilosevic/fix-actions
vmilosevic Apr 11, 2024
8b40e6d
Removing CI files
vmilosevic Apr 11, 2024
035b88f
Build pipeline image before running pipeline
tt-mjudge Apr 4, 2024
a4ffa91
Revert "Merge branch 'mjudge/docker-pipeline' into 'main'"
tt-mjudge Apr 4, 2024
940ace7
PT2: graph capture needs to cleanup all dicts in reset
arui-yyz Apr 4, 2024
bd12a8d
Add missing binary operators to the test
vobojevicTT Apr 4, 2024
c7977f2
Add csv export option to compare_perf.py
odjuricicTT Apr 5, 2024
ece9e04
Add missing unary operators in existing tests
kmilanovicTT Apr 1, 2024
0b977d1
Fix GS inception benchmark
tt-mpantic Apr 5, 2024
cc6127a
Add E75 perf jobs
tt-mpantic Apr 5, 2024
c306108
Pt2 fallback and other support
ttssokorac Apr 5, 2024
ad13059
[Balancer] Ribbon2 - introduce conservative optimization. Migrate MV1…
nobradovictt Apr 7, 2024
7f1cb11
Enable openpose hand on GS.
tt-mpantic Apr 5, 2024
9250ab9
Skip activation of venv and print a message when build fails
gfengTT Feb 1, 2024
6178b22
Fail benchmark CI job in case of RuntimeFailure.
tt-mpantic Apr 8, 2024
8a82a52
Align BERT with the version in Benchmarking repo
vcanicTT Apr 9, 2024
11a4ac1
Build pipeline image before running pipeline
tt-mjudge Apr 9, 2024
8f85218
PT2: don't overwrite as_strided to pad to buda shape
arui-yyz Apr 9, 2024
e4f2df4
Add tri_basic_2 semantic segmentation model on gs and nightly
chandrasekaranpradeep Apr 4, 2024
2ddf24a
Add config for comparing every op at each compilation stage
chandrasekaranpradeep Apr 5, 2024
ef83fcf
Run operator tests only in inference mode
kmilanovicTT Apr 5, 2024
0e255b1
[Verify] Make sure all TVM verification passes are included during fo…
nvukobratTT Apr 7, 2024
ada543d
[Codegen] Concatenate op: Patch to handle case where same input is re…
nvukobratTT Apr 7, 2024
9e9d488
[Lowering] Better handling for 5D TMs lowering + support for squeeze …
nvukobratTT Apr 7, 2024
8f19ef4
[Netlist] Cosine op: Better handling of vector argument for proper Ne…
nvukobratTT Apr 7, 2024
fadec8c
[Models] Gemma 2B: Full model bringup for WH B0 N150 + Bringup on 1x1…
nvukobratTT Apr 7, 2024
42d2815
[CI] Adding Gemma 2B to push commit (until transformers package is up…
nvukobratTT Apr 7, 2024
7bd5b3e
Patch for yolov5m (640x640) onnx model's CI job failure
kamalrajkannan78 Apr 10, 2024
e0e38a6
Removing CI files
vmilosevic Apr 11, 2024
f08e24c
[Balancer] Misc improvements to conservative optimization in ribbon2.
nobradovictt Apr 11, 2024
7aea0c2
TVM: INT8 op translation cleanup
arui-yyz Apr 11, 2024
27fc123
[Benchmark] Tune yolov3 bfp8_b. Remove input queue entry override.
nobradovictt Apr 12, 2024
ddeedc5
Revert the delete inputs condition in PyBudaWriter
chandrasekaranpradeep Apr 11, 2024
dc048f3
Separating symlink lock file to be per user
sdjordjevicTT Apr 11, 2024
64ef862
[Model] XFail Gemma 2B 1x1 model
nvukobratTT Apr 12, 2024
bba68c3
Add tests for SSD300-RESNET50(pytorch) model
kamalrajkannan78 Apr 12, 2024
19a23b7
Add op->op NOC bw estimation
rpavlovicTT Apr 15, 2024
6062125
Use noc bw estimates during balancing
rpavlovicTT Apr 15, 2024
48b6842
[Models] Skipping non-supported CodeGen variants #2584
nvukobratTT Apr 12, 2024
9bfa455
[build] makefile fixes
pilkicTT Apr 16, 2024
1716e35
Compile models in integration tests
vmilosevic Apr 17, 2024
355b446
Adding queue instead of padding nop with no valid grids, and some cle…
dgolubovicTT Apr 17, 2024
d80f9a2
Adding targets in makefile to simplify build and run unit tests
sdjordjevicTT Apr 17, 2024
4b014f9
[Pipeline] Remove cache_positions attribute during generative pipelin…
nvukobratTT Apr 17, 2024
cbe083c
[Tensor] Convert to appropriate data format for proper queue shape ca…
nvukobratTT Apr 17, 2024
2cb0089
[Model] Gemma 2B generative sample (without past-cache)
nvukobratTT Apr 17, 2024
75e98ce
[Balancer] NOC BW estimate followup. EpochSolution refactor. Bug fixes.
nobradovictt Apr 18, 2024
c919d5c
Expand unary test single
kmilanovicTT Apr 16, 2024
0d95729
Enable fusing for all data format overrides
odjuricicTT Apr 18, 2024
8a083ae
[Model] Gemma 2B: Generative run on 1x1 WH B0
nvukobratTT Apr 18, 2024
193b492
[Models] Gemma 2B - Skip redundant test runs
nvukobratTT Apr 18, 2024
223ecbc
[CI] Run only generative Gemma 2B models (full and 1x1 grid
nvukobratTT Apr 18, 2024
3d39229
Turn on PYBUDA_EXP_APPROX for t5 and flan-t5
tt-mpantic Apr 19, 2024
ef7d2b9
[docs] updating torchvision instructions
pilkicTT Apr 19, 2024
2c060cc
[fork-join] removing tiles threshold
pilkicTT Apr 19, 2024
7d204be
Add missing SPDX license headers
vmilosevic Apr 18, 2024
ef1566c
Update release jobs mb and loop count for April release
tt-mpantic Apr 19, 2024
10de904
Removing CI files
vmilosevic Apr 19, 2024
eca1753
Add benchmarking repo perf CI jobs
tt-mpantic Apr 19, 2024
bc69a10
Adding logic for choosing best of padding and adding queue
dgolubovicTT Apr 19, 2024
c5ca658
[fork-join] override models to buffer with queues
pilkicTT Apr 19, 2024
365ea12
[Models] Limit Gemma 2B compile depth on CI #2558
nvukobratTT Apr 19, 2024
31e74e3
Moving nary ops from test_sanity to a new module
vobojevicTT Apr 19, 2024
b0961ca
Patches to weekly tests
dzeng-tt Apr 18, 2024
0c88429
[TVM] Update Simplifyreshape pattern callback and add sanity test
chandrasekaranpradeep Apr 18, 2024
e6db099
Add perceiver conv and learned model on gs
chandrasekaranpradeep Apr 22, 2024
1b84571
Pybuda BBE consume 20240401 week14
vmilosevic Apr 22, 2024
74779df
Add tests for Yolov6(pytorch) model
kamalrajkannan78 Apr 22, 2024
17793ac
Add BW estimates to perf analysis tool
derdeljanTT Apr 21, 2024
9ad7ceb
Consuming backend changes for blackhole estimates
sdjordjevicTT Apr 22, 2024
3755ad3
[Models] ViT demo #2605
nvukobratTT Apr 22, 2024
d7a9a41
Implement DRAM read bandwidth estimator
derdeljanTT Apr 22, 2024
c59abcf
Patches to ccm and buda-mode-demos for incoming release
dzeng-tt Apr 23, 2024
0109861
Fix dram read BW estimate scaling based on dram channel usage
derdeljanTT Apr 22, 2024
6f1d5e5
Refactor buffering queue creation for functional unification
dgolubovicTT Apr 23, 2024
129f382
Patch for t5_compile_e300 failed job
kamalrajkannan78 Apr 23, 2024
afa4a73
fix typo in benchmark script
pilkicTT Apr 23, 2024
3cff8c0
[Model] Fixup yolov5x
jserbedzijaTT Apr 23, 2024
81d9491
[Model] Fixup tri_basic_2
jserbedzijaTT Apr 24, 2024
3cd00e4
[Balancer] Refactor/optimize EpochSolution eval.
nobradovictt Apr 24, 2024
3e4caeb
Fix producer mblock size
rpavlovicTT Apr 25, 2024
2dc9970
Fix data movement BW estimates report
derdeljanTT Apr 25, 2024
8d89fc7
[Benchmark] Misc override tune.
nobradovictt Apr 25, 2024
ab9f0c3
Adding Emulation e2e test support in pybuda
sdjordjevicTT Apr 25, 2024
0aabcbe
Move remove padding method call
dgolubovicTT Apr 26, 2024
a725eda
Removing CI files
vmilosevic Apr 26, 2024
35ee326
Update BBE to new snapshot bbe_to_pybuda_release_20240401_week14
vmilosevic Apr 26, 2024
e23b5f1
Remove bad merge from test
vmilosevic May 8, 2024
23f6d81
Merge pull request #24 from tenstorrent/vmilosevic/fix_bad_merge
vmilosevic May 8, 2024
41e4f4c
Bringup for base mistral7b model
ppadjinTT Apr 26, 2024
37019dc
[Model] Lower Gemma 2B DRAM usage to 2GB
nvukobratTT Apr 25, 2024
ea56c84
Fix dram prolog BW estimates normalization
derdeljanTT Apr 26, 2024
313481e
Handle multiple user in fuse_redundant_tm_sequence and add sanity test
chandrasekaranpradeep Apr 18, 2024
7471f01
Revert "Merge branch 'ppadjin/mistral-decoder-layer-test' into 'main'"
nvukobratTT Apr 29, 2024
2a86007
Fix perceiverio onnx and pytorch model nightly failures
chandrasekaranpradeep Apr 29, 2024
cdb1ff9
Remove additional conditions for devtype in verify_cfg
meenakshiramanathan1 Apr 29, 2024
88c6199
Remove seeded-venv job from PyBuda
tt-mjudge Apr 29, 2024
620132e
[Model] Gemma 2B: Reduce DRAM usage for 1x1 model variant
nvukobratTT Apr 30, 2024
e618688
[Model] Gemma 2B: Grayskull support
nvukobratTT Apr 30, 2024
db0d9ae
Fix dram scatter gather num tiles calculation
derdeljanTT Apr 27, 2024
cafa446
Fix tile clear granularity calculation for matmuls
derdeljanTT Apr 27, 2024
a9358b7
Fix ublock scan order calculation for dram queues
derdeljanTT May 8, 2024
fe1230d
[sparse] Add ability to calculate per-core sparse matmul metadata (us…
svuckovicTT May 8, 2024
d00a010
Revert "[sparse] Add ability to calculate per-core sparse matmul meta…
svuckovicTT May 8, 2024
234f588
[PerfModel] Protect init under FS.
nobradovictt May 9, 2024
0f9163f
Add segformer pytorch and onnx model on gs and wh_b0
chandrasekaranpradeep May 6, 2024
7787d08
[legalizer] Optimize prologue DRAM reads to be Post-TM whenever possible
jserbedzijaTT Feb 7, 2024
754b857
Add support for different scale factor in upsample2d and upsample3d
meenakshiramanathan1 Apr 22, 2024
c7989a3
[with fixes][sparse] Add ability to calculate per-core sparse matmul …
svuckovicTT May 10, 2024
4560359
Add test for Retinanet-Resnet(pytorch and onnx) model
kamalrajkannan78 May 10, 2024
bbad3bd
Improving legalizer failures to be more informative, and using them t…
dgolubovicTT Apr 24, 2024
d0a866c
Remove sparse buffer op
svuckovicTT May 10, 2024
fed38f7
Adding blackhole pybuda sanity golden tests in CI
sdjordjevicTT May 10, 2024
688bbd7
[Model] bert regression fix after unrolled parameters optimization
jserbedzijaTT May 13, 2024
d0930bc
Add Automation Script For Bisecting
pdeviTT May 13, 2024
465d27f
Fix run skipped test by pytest decorator
vobojevicTT May 13, 2024
2934106
Add test for the Where operator
vobojevicTT May 13, 2024
7505386
[Balancer] Benchmark override cleanup part 1
nobradovictt May 13, 2024
1c0301e
Add segformer image classification pytorch and onnx model
chandrasekaranpradeep May 13, 2024
675cbe4
Add model demos for DLA (Pytorch)
ashokkumarkannan1 May 6, 2024
eb76b71
Ppadjin/mistral7b fix
ppadjinTT May 14, 2024
0c9b407
Add option to log epoch evaluation estimates
derdeljanTT May 9, 2024
22552dd
Fix dram multicast BW estimates
derdeljanTT May 10, 2024
42f30bc
Return harvested config as a map to account for skip chip ids.
jnie-TT May 9, 2024
85316ab
[Balancer] Allow R tstream into output.
nobradovictt May 14, 2024
1e6a191
Override cleanup - PYBUDA_DISABLE_STREAM_OUTPUT.
nobradovictt May 16, 2024
875fded
Add queue without padding as last resort.
dgolubovicTT May 16, 2024
e3c94b4
Add tests for ddrnet(semantic segmentation-pytorch and customer's onn…
kamalrajkannan78 May 16, 2024
1b78c6d
Update the diffusers package version from 0.14.0 to 0.27.2
chandrasekaranpradeep May 9, 2024
760f251
Fix: KeyError in DLA model demos
ashokkumarkannan1 May 17, 2024
4fb9e63
Add support for saving perf analysis epoch summary and details screen…
derdeljanTT May 6, 2024
e4fd66d
Improve logic for scaling dram read BW w.r.t. fork factor
derdeljanTT May 9, 2024
5eb8294
Restructuring Grayskull Benchmark Models
vcanicTT May 20, 2024
0038976
[Model] Nightly model fixup after recent compiler default setting cha…
nobradovictt May 20, 2024
c69596c
[Balancer] Update limiter_cycles with pre vs post TM prologue calc. ...
nobradovictt May 20, 2024
d1ddf0a
Add tests for yolox(onnx)
kamalrajkannan78 May 18, 2024
4e2346b
Fixed overlapping DRAM queues bug and queue allocation logic refactor
jserbedzijaTT May 16, 2024
116dfc2
Add logging for dram channel free blocks at the beginning of queue al…
dgolubovicTT May 21, 2024
93e16c2
[Balancer] Cleanup PYBUDA_OP_MODEL_COMPARE_VERSION override from models.
nobradovictt May 21, 2024
3fe7a31
Add perceiverio fourier and learned model on wh_b0 and gs
chandrasekaranpradeep May 21, 2024
55a9f65
Update doc for Ubuntu 22.04
vmilosevic May 22, 2024
8868693
Mistral7b decode with KV cache
ppadjinTT May 22, 2024
4251d63
[fork-join] insertion instructions related refactor
pilkicTT May 22, 2024
ccccbf7
split segformer tests based on tasks to avoid timeout error
kamalrajkannan78 May 22, 2024
3e89283
Fix CI failed jobs
kamalrajkannan78 May 22, 2024
c0f2928
Fix Benchmark Model Resnet on Grayskull.
vcanicTT May 23, 2024
cb2191e
Jwilde/set up new doc pipeline
jwilde-tenstorrent May 23, 2024
0285ebb
[fork-join] minor cleanup & formatting changes
pilkicTT May 23, 2024
ee2eaed
use one if rule in docsite job
May 23, 2024
8a44ebe
Add model demos for DLA (onnx)
ashokkumarkannan1 May 24, 2024
932c4af
Improve dram read BW estimation during per-op OpModel selection
derdeljanTT May 24, 2024
8aa86cd
Adding blackhole TVM tests
sdjordjevicTT May 23, 2024
28897bf
[padding] Fix padding bugs discovered by veto run 21.5 .
dgolubovicTT May 24, 2024
22994ef
[fused-ops] Add new constraints to fused ops
jserbedzijaTT May 15, 2024
8cb7b49
Align recompile with recent backend changes
jserbedzijaTT May 22, 2024
e3d03ba
[Balancer] Remove legacy optimization function. Enable conservative o…
nobradovictt May 24, 2024
4be49c6
Added compile jobs for gemma model
dsudhakarTT May 27, 2024
c1f1033
Changing mistral7b policy to ribbon
ppadjinTT May 27, 2024
c2d233d
Enable T5 Benchmark Model for Grayskull
vcanicTT May 27, 2024
447ab27
Removing CI files
vmilosevic Jun 3, 2024
c11363d
Update tt-tvm version
vmilosevic Jun 3, 2024
f5ffdad
wip refactor
nsmithtt Jun 14, 2024
a6561dc
refactor
AleksKnezevic Jun 18, 2024
5de84f7
fix macos linker errors
nsmithtt Jun 18, 2024
3fa82b1
Refactor WIP, Passes though compiled_state
AleksKnezevic Jun 20, 2024
2a55491
Makefile link against mlir/llvm
nsmithtt Jun 24, 2024
a9343f3
WIP emit_mlir
AleksKnezevic Jun 28, 2024
8a0a364
Some runtime bootstrapping; mostly in place
nsmithtt Jun 28, 2024
ffb2179
WIP: register torch backend, close device on exit
pilkicTT Jul 1, 2024
bcc7b1c
bump tt-mlir module to use dynamic libs
pilkicTT Jul 1, 2024
55d2a99
fix: close devices on signal
pilkicTT Jul 4, 2024
a9f2ee6
Merge branch 'pilkic/close-device-fix' into 'main'
pilkicTT Jul 4, 2024
0b2df69
Emitting PyBuda graph TTIR MLIR
sdjordjevicTT Jul 4, 2024
52d98db
Merge branch 'sdjordjevic/emitting_pybuda_ttir_mlir' into 'main'
sdjordjevicTT Jul 5, 2024
a2cc5c1
Initial set of op tests required to compile for MNIST inference
nvukobratTT Jul 5, 2024
42c4806
Merge branch 'nvukobrat/initial_set_of_sanity_tests' into 'main'
nvukobratTT Jul 5, 2024
69f33ba
Running MLIR passes and generating flatbuffer binary in PyBuda
sdjordjevicTT Jul 12, 2024
6a796ce
Merge branch 'sdjordjevic/run_mlir_passes_and_generate_flatbuffer_bin…
sdjordjevicTT Jul 12, 2024
8f297c1
[api] initial work on new compile flow
pilkicTT Jul 12, 2024
1d1db4d
Merge branch 'pilkic/initial-compile' into 'main'
pilkicTT Jul 12, 2024
e6fd064
Integrating cmake
mtopalovicTT Jul 17, 2024
bd15c9b
Merge branch 'milant/integrating_cmake' into 'main'
mtopalovicTT Jul 17, 2024
20fe913
[Training] MNIST Linear: Inference sample using torch.compile and tra…
nvukobratTT Jul 18, 2024
179bbda
Merge branch 'nvukobrat/mnist_placeholder' into 'main'
nvukobratTT Jul 18, 2024
368f83f
Update README with build instructions for cmake.
mtopalovicTT Jul 22, 2024
10d3fdf
Merge pull request #4 from tenstorrent/milant/readme_fixup
mtopalovicTT Jul 22, 2024
2319f03
Minor fix
mtopalovicTT Jul 23, 2024
c79f5ff
Merge pull request #7 from tenstorrent/milant/fixing_readme
mtopalovicTT Jul 23, 2024
35b0f5c
[runtime] initial support for running model on device
pilkicTT Jul 12, 2024
e05b7e2
Merge pull request #8 from tenstorrent/pilkic/device-runtime
pilkicTT Jul 24, 2024
be98eb6
Better macOS build support
nsmithtt Jul 24, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
31 changes: 31 additions & 0 deletions .github/actions/install-deps/action.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
name: "Install dependencies"
description: "Installs dependencies on GitHub Actions runners"

inputs:
os:
description: 'Runner OS'
required: true

runs:
using: "composite"
steps:
- name: Verify ubuntu only
shell: bash
run: |
if ! echo ${{ inputs.os }} | grep -q "ubuntu"; then
echo "${{ inputs.os }} does not seem to be ubuntu"
fi
- name: Assert requested os exists in dependencies
shell: bash
run: |
if ! jq -e ".\"${{ inputs.os }}\" != null" $GITHUB_ACTION_PATH/dependencies.json; then
echo "${{ inputs.os }} does not exist as a supported os in $GITHUB_ACTION_PATH/dependencies.json"
fi
- name: Retrieve and install pkg deps based on OS
id: retrieve-pkg-deps
shell: bash
run: |
DEPENDENCIES=$(jq -r --arg os "${{ inputs.os }}" '.[$os] | .[]' $GITHUB_ACTION_PATH/dependencies.json)
echo $DEPENDENCIES
sudo apt update
sudo apt install $DEPENDENCIES
12 changes: 12 additions & 0 deletions .github/actions/install-deps/dependencies.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
{
"ubuntu-22.04": [
"software-properties-common",
"build-essential",
"python3.10-venv",
"libyaml-cpp-dev",
"libboost-all-dev",
"libsndfile1",
"libhwloc-dev",
"libzmq3-dev"
]
}
24 changes: 24 additions & 0 deletions .github/workflows/build-artifacts.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
name: Build artifacts

on:
workflow_dispatch:
workflow_call:

env:
PYTHON_VERSION: "python3.10"

jobs:
build-artifacts:
strategy:
matrix:
arch: ["grayskull"]
runs-on: ubuntu-22.04
steps:
- uses: actions/checkout@v4
- uses: ./.github/actions/install-deps
with:
os: ubuntu-22.04
- name: Update submodule
run: git submodule update --init --recursive
- name: Build for ${{ matrix.arch }}
run: source env_for_silicon.sh
13 changes: 13 additions & 0 deletions .github/workflows/post-commit-workflow.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
name: Post commit workflow

on:
workflow_dispatch:
workflow_call:
push:
branches:
- main

jobs:
build-artifacts:
uses: ./.github/workflows/build-artifacts.yml
secrets: inherit
13 changes: 13 additions & 0 deletions .github/workflows/pull-request-workflow.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
name: Pull request workflow

on:
workflow_dispatch:
workflow_call:
pull_request:
branches:
- main

jobs:
build-artifacts:
uses: ./.github/workflows/build-artifacts.yml
secrets: inherit
126 changes: 1 addition & 125 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -48,132 +48,8 @@ bp_data
third_party/llvm
device_images
generated_modules
build_deps/

# ClangD
compile_commands.json
\n\n# Exclude LFS files to keep the public repo small
third_party/budabackend/common_lib/libboost_filesystem.so.1.65.1
third_party/budabackend/common_lib/libboost_regex.so.1.65.1
third_party/budabackend/common_lib/libboost_system.so.1.65.1
third_party/budabackend/common_lib/libboost_thread.so.1.65.1
third_party/budabackend/common_lib/libicudata.so.60
third_party/budabackend/common_lib/libicui18n.so.60
third_party/budabackend/common_lib/libicuuc.so.60
third_party/budabackend/common_lib/libsqlite3.so.0
third_party/budabackend/common_lib/libz.so.1
third_party/budabackend/dbd/docs-md/images/debuda-buda.png
third_party/budabackend/dbd/docs-md/images/debuda-debuda-server.png
third_party/budabackend/dbd/docs-md/images/debuda-export-db.png
third_party/budabackend/dbd/docs-md/images/debuda-speed-dial.png
third_party/budabackend/dbd/docs-md/images/debuda-start-and-repl.png
third_party/budabackend/dbd/docs-md/images/debuda.png
third_party/budabackend/dbd/docs-md/images/tenstorrent-pdf-titlepage.png
third_party/budabackend/device/bin/silicon/grayskull/clkctl
third_party/budabackend/device/bin/silicon/grayskull/init
third_party/budabackend/device/bin/silicon/grayskull/pm_en.ttx
third_party/budabackend/device/bin/silicon/grayskull/setup_interface
third_party/budabackend/device/bin/silicon/grayskull/tt-script
third_party/budabackend/device/bin/silicon/reset-chip
third_party/budabackend/device/bin/silicon/tensix-reset
third_party/budabackend/device/bin/silicon/wormhole/boot
third_party/budabackend/device/bin/silicon/wormhole/create-ethernet-map
third_party/budabackend/device/bin/silicon/wormhole/flash-spi
third_party/budabackend/device/bin/silicon/wormhole/get-wormhole-interfaces
third_party/budabackend/device/bin/silicon/wormhole/imx8-reset
third_party/budabackend/device/bin/silicon/wormhole/load-eth-fw
third_party/budabackend/device/bin/silicon/wormhole/noc-overlay-dump
third_party/budabackend/device/bin/silicon/wormhole/setup-interface
third_party/budabackend/device/bin/silicon/wormhole/temp
third_party/budabackend/device/bin/silicon/wormhole/tt-flash
third_party/budabackend/device/bin/silicon/wormhole/tt-script
third_party/budabackend/docs/public/images/1.png
third_party/budabackend/docs/public/images/10.png
third_party/budabackend/docs/public/images/perf_ui_candlestick.png
third_party/budabackend/docs/public/images/11.png
third_party/budabackend/docs/public/images/perf_ui_wft.png
third_party/budabackend/docs/public/images/12.png
third_party/budabackend/docs/public/images/13.png
third_party/budabackend/docs/public/images/14.png
third_party/budabackend/docs/public/images/16.png
third_party/budabackend/docs/public/images/17.png
third_party/budabackend/docs/public/images/18.png
third_party/budabackend/docs/public/images/2.png
third_party/budabackend/docs/public/images/20.png
third_party/budabackend/docs/public/images/21.png
third_party/budabackend/docs/public/images/22.png
third_party/budabackend/docs/public/images/23.png
third_party/budabackend/docs/public/images/24.png
third_party/budabackend/docs/public/images/25.png
third_party/budabackend/docs/public/images/26.png
third_party/budabackend/docs/public/images/27.png
third_party/budabackend/docs/public/images/28.png
third_party/budabackend/docs/public/images/29.png
third_party/budabackend/docs/public/images/3.png
third_party/budabackend/docs/public/images/30.png
third_party/budabackend/docs/public/images/31.png
third_party/budabackend/docs/public/images/32.png
third_party/budabackend/docs/public/images/33.png
third_party/budabackend/docs/public/images/34.png
third_party/budabackend/docs/public/images/35.png
third_party/budabackend/docs/public/images/36.png
third_party/budabackend/docs/public/images/37.png
third_party/budabackend/docs/public/images/4.png
third_party/budabackend/docs/public/images/5.png
third_party/budabackend/docs/public/images/6.png
third_party/budabackend/docs/public/images/7.png
third_party/budabackend/docs/public/images/perf_ui_local_select.png
third_party/budabackend/docs/public/images/8.png
third_party/budabackend/docs/public/images/9.png
third_party/budabackend/docs/public/images/Screen_Shot_2023-05-24_at_11.56.12_PM.png
third_party/budabackend/docs/public/images/bfp-efficiency.png
third_party/budabackend/docs/public/images/bfp2_blocks.png
third_party/budabackend/docs/public/images/bfp4_blocks.png
third_party/budabackend/docs/public/images/bfp8_blocks.png
third_party/budabackend/docs/public/images/cropped-favicon-32x32.png
third_party/budabackend/docs/public/images/data-formats.png
third_party/budabackend/docs/public/images/logo.png
third_party/budabackend/docs/public/images/perf_ui_device.png
third_party/budabackend/docs/public/images/perf_ui_diff.png
third_party/budabackend/docs/public/images/perf_ui_front_page.png
third_party/budabackend/docs/public/images/perf_ui_host.png
third_party/budabackend/docs/public/images/perf_ui_inputs.png
third_party/budabackend/docs/public/images/perf_ui_select_test.png
third_party/budabackend/docs/public/images/perf_ui_select_workspace.png
third_party/budabackend/docs/public/images/tt_logo.png
third_party/budabackend/docs/public/images/tt_logo.svg
third_party/budabackend/docs/public/images/unpack_math_pack.png
third_party/budabackend/loader/tests/reference_tensor_conv_bin/expected_tensor_for_conv_bfp8_s2.bin
third_party/budabackend/loader/tests/reference_tensor_conv_bin/expected_tensor_for_conv_fp16b_s2.bin
third_party/budabackend/loader/tests/reference_tensor_conv_bin/expected_tensor_for_conv_fp32_s2.bin
third_party/budabackend/loader/tests/reference_tensor_conv_bin_mt/input_tensor_for_conv_fp16b.0.bin
third_party/budabackend/loader/tests/reference_tensor_conv_bin_mt/input_tensor_for_conv_fp16b_s1.0.bin
third_party/budabackend/loader/tests/reference_tensor_conv_bin_mt/input_tensor_for_conv_fp32.0.bin
third_party/budabackend/perf_lib/graph_tests/grayskull/inference/bert_large_hifi3_fp16b.yaml
third_party/budabackend/perf_lib/graph_tests/grayskull/inference/bert_large_lofi_bfp8b.yaml
third_party/budabackend/verif/graph_tests/netlists/pregenerated/group_1.zip
third_party/budabackend/verif/graph_tests/netlists/pregenerated/group_2.zip
third_party/budabackend/verif/graph_tests/netlists/pregenerated/group_3.zip
third_party/budabackend/verif/graph_tests/netlists/pregenerated/group_4.zip
third_party/budabackend/verif/graph_tests/netlists/pregenerated/group_5.zip
third_party/budabackend/verif/graph_tests/netlists/pregenerated/group_6.zip
third_party/budabackend/verif/graph_tests/netlists/pregenerated/group_7.zip
third_party/budabackend/verif/graph_tests/netlists/t5_large_wormhole_b0.yaml
third_party/budabackend/verif/graph_tests/netlists/wormhole_b0/netlist_bert_12x_encoder_1x_whb0_base_training.yaml
third_party/budabackend/verif/graph_tests/netlists/wormhole_b0/netlist_bert_24x_encoder_1x_whb0_large_training.yaml
third_party/budabackend/verif/multichip_tests/wh_multichip/large_cluster/falcon_60l_8chip.yaml
third_party/budabackend/verif/multichip_tests/wh_multichip/large_cluster/midbloom_inference_12chip_lab68.yaml
third_party/budabackend/verif/multichip_tests/wh_multichip/large_cluster/midbloom_inference_32chip_lab78.no_snakeplace.yaml
third_party/budabackend/verif/multichip_tests/wh_multichip/large_cluster/midbloom_inference_32chip_lab78.yaml
third_party/budabackend/verif/multichip_tests/wh_multichip/large_cluster/midbloom_inference_4chip_jb11.yaml
third_party/budabackend/verif/multichip_tests/wh_multichip/large_cluster/midbloom_inference_8chip_jb11.yaml
third_party/budabackend/verif/multichip_tests/wh_multichip/netlist_bert_concurrent_24x_encoder_2x_wh_large_inference.yaml
third_party/budabackend/verif/pipegen_tests/netlists/grayskull/nightly/baseline.zip
third_party/budabackend/verif/pipegen_tests/netlists/grayskull/push/baseline.zip
third_party/budabackend/verif/pipegen_tests/netlists/wormhole_b0/nightly/baseline.zip
third_party/budabackend/verif/pipegen_tests/netlists/wormhole_b0/push/baseline.zip
third_party/budabackend/verif/template_netlist/netlists/multi_tm_tests/weekly/wormhole_b0/test_dram_input_matmul_3tms_and_reblock.zip
third_party/budabackend/verif/template_netlist/netlists/test_datacopy_matmul_2tms_and_reblock_pregenerated.zip
third_party/budabackend/verif/template_netlist/netlists/test_datacopy_matmul_3tms_and_reblock_000_force_grayskull.zip
third_party/budabackend/verif/template_netlist/netlists/test_dram_input_matmul_3tms_and_reblock_grayskull.zip
third_party/budabackend/verif/tm_tests/directed/bert_large_inference_hifi.yaml
third_party/budabackend/verif/tm_tests/directed/packer_mcast.tar.gz
44 changes: 44 additions & 0 deletions .gitlab-ci.perf.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,44 @@
include:
- .gitlab-ci.wheels.yml

# PyBuda repo, Grayskull e150
- ci/gitlab-test-lists/.gitlab-ci.grayskull_e150_perf_bfp8_b_nightly.yml
- ci/gitlab-test-lists/.gitlab-ci.grayskull_e150_perf_fp16_nightly.yml
- ci/gitlab-test-lists/.gitlab-ci.grayskull_e150_perf_release_nightly.yml

# PyBuda repo, Grayskull e75
- ci/gitlab-test-lists/.gitlab-ci.grayskull_e75_perf_bfp8_b_nightly.yml
- ci/gitlab-test-lists/.gitlab-ci.grayskull_e75_perf_fp16_nightly.yml
- ci/gitlab-test-lists/.gitlab-ci.grayskull_e75_perf_release_nightly.yml

# PyBuda repo, Wormhole B0
- ci/gitlab-test-lists/.gitlab-ci.wormhole_b0_silicon_perf_bfp8_b_nightly.yml
- ci/gitlab-test-lists/.gitlab-ci.wormhole_b0_silicon_perf_fp16_nightly.yml
- ci/gitlab-test-lists/.gitlab-ci.wormhole_b0_silicon_perf_release_nightly.yml

- ci/gitlab-test-lists/.gitlab-ci.wormhole_b0_silicon_perf_bfp8_b_manual.yml
- ci/gitlab-test-lists/.gitlab-ci.wormhole_b0_silicon_perf_fp16_manual.yml
- ci/gitlab-test-lists/.gitlab-ci.wormhole_b0_silicon_perf_release_manual.yml

# Benchmarking repo
# Grayskull e75, Grayskull e150, Wormhole B0
- ci/gitlab-test-lists/benchmarking/.gitlab-ci.wormhole_b0_silicon_perf_release_public.yml
- ci/gitlab-test-lists/benchmarking/.gitlab-ci.grayskull_e75_perf_release_public.yml
- ci/gitlab-test-lists/benchmarking/.gitlab-ci.grayskull_e150_perf_release_public.yml

# Dissable other jobs from .gitlab-ci.wheels.yml
pybuda-gs-latest-bbe-wheel:
rules:
- if: ($CI_PIPELINE_SOURCE == "push" && $CI_COMMIT_MESSAGE !~ /\[no_ci_perf/)

pybuda-wh-b0-latest-bbe-wheel:
rules:
- if: ($CI_PIPELINE_SOURCE == "push" && $CI_COMMIT_MESSAGE !~ /\[no_ci_perf/)

pybuda-gs-unittests:
rules:
- if: ($CI_PIPELINE_SOURCE == "push" && $CI_COMMIT_MESSAGE !~ /\[no_ci_perf/)

pybuda-wh-b0-unittests:
rules:
- if: ($CI_PIPELINE_SOURCE == "push" && $CI_COMMIT_MESSAGE !~ /\[no_ci_perf/)
17 changes: 9 additions & 8 deletions .gitmodules
Original file line number Diff line number Diff line change
@@ -1,12 +1,13 @@
[submodule "third_party/tvm"]
path = third_party/tvm
url = ../tt-tvm.git
[submodule "third_party/budabackend"]
path = third_party/budabackend
url = ../tt-budabackend.git
url = https://github.com/tenstorrent/tt-tvm
[submodule "third_party/pybind11"]
path = third_party/pybind11
url = https://github.com/pybind/pybind11
[submodule "third_party/public-tt-buda"]
path = third_party/public-tt-buda
url = ../tt-buda-demos.git
url = https://github.com/pybind/pybind11.git
[submodule "third_party/buda-model-demos"]
path = third_party/buda-model-demos
url = https://github.com/tenstorrent/tt-buda-demos

[submodule "third_party/tt-mlir"]
path = third_party/tt-mlir
url = [email protected]:tenstorrent/tt-mlir.git
51 changes: 51 additions & 0 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,51 @@
cmake_minimum_required(VERSION 3.20.0)
project(pybuda LANGUAGES CXX)

find_program(CLANG_17 clang++-17)
find_program(CLANG clang)
if(CLANG_17 AND CLANG)
message(STATUS "Found Clang-17 here: ${CLANG_17}")
set(CMAKE_CXX_COMPILER "${CLANG_17}")
set(CMAKE_C_COMPILER "${CLANG}")
else()
message(WARNING "Clang++-17 or clang not found!!!")
endif()

set(CMAKE_CXX_STANDARD 17)
set(CMAKE_CXX_STANDARD_REQUIRED ON)
set(CMAKE_CXX_EXTENSIONS OFF)
set(CMAKE_EXPORT_COMPILE_COMMANDS ON)

list(APPEND CMAKE_MODULE_PATH "${CMAKE_SOURCE_DIR}/cmake")
include(Utils)

check_required_env_var(PYBUDA_PYTHON_VERSION)
check_required_env_var(PYBUDA_TOOLCHAIN_DIR)
check_required_env_var(PYBUDA_VENV_DIR)

set(PYBUDA_PYTHON_VERSION $ENV{PYBUDA_PYTHON_VERSION})
set(PYBUDA_TOOLCHAIN_DIR $ENV{PYBUDA_TOOLCHAIN_DIR})
set(PYBUDA_VENV_DIR $ENV{PYBUDA_VENV_DIR})

find_package(Python COMPONENTS REQUIRED Interpreter Development)

set(TTMLIR_TOOLCHAIN_DIR $ENV{TTMLIR_TOOLCHAIN_DIR})
set(TTMLIR_VENV_DIR $ENV{TTMLIR_VENV_DIR})

if (NOT CMAKE_BUILD_TYPE)
set(CMAKE_BUILD_TYPE Release)
endif()

add_compile_options(-Wall -Wextra -Wpedantic -Werror -Wno-unused-parameter)

set(PYBUDA_CSRC_WARNINGS -Wall -Wextra -Wno-pragmas -Wno-unused-parameter)
set(CFLAGS_NO_WARN -DFMT_HEADER_ONLY)
set(PYBUDA_CSRC_CFLAGS ${CFLAGS_NO_WARN} ${PYBUDA_CSRC_WARNINGS} -DUTILS_LOGGER_PYTHON_OSTREAM_REDIRECT=1)

set(CONFIG_LDFLAGS "")

set(STATIC_LIB_FLAGS -fPIC)
set(SHARED_LIB_FLAGS -fPIC)

add_subdirectory(third_party)
add_subdirectory(pybuda)
Loading
Loading