Actions: microsoft/DeepSpeed

nv-accelerate-v100

4,895 workflow runs

Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12551: Pull request #6773 synchronize by loadams
December 18, 2024 17:55 17m 44s deepcharm:stage3-use-new-grad-acc-api
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-accelerate-v100 #12550: Pull request #6803 synchronize by loadams
December 18, 2024 17:55 25m 53s nelyahu:zero2_param_idx
Update version.txt after 0.16.2 release
nv-accelerate-v100 #12549: Pull request #6893 opened by loadams
December 18, 2024 17:52 16m 35s AutoPR/0.16.2
Inference ops unit test failures/fixes
nv-accelerate-v100 #12546: Pull request #6879 synchronize by loadams
December 18, 2024 16:53 17m 55s loadams/inference-ops-test-repro
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12545: Pull request #6773 synchronize by loadams
December 18, 2024 16:51 15m 9s deepcharm:stage3-use-new-grad-acc-api
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-accelerate-v100 #12544: Pull request #6803 synchronize by loadams
December 18, 2024 16:51 12m 36s nelyahu:zero2_param_idx
Update code owners
nv-accelerate-v100 #12543: Pull request #6890 synchronize by loadams
December 18, 2024 16:30 11m 16s olruwase/code_owners
Use ds-specific module id to avoid conflicts
nv-accelerate-v100 #12541: Pull request #6847 synchronize by tjruwase
December 18, 2024 13:59 11m 37s olruwase/pr_6772
Update code owners
nv-accelerate-v100 #12540: Pull request #6890 opened by tjruwase
December 18, 2024 12:04 11m 5s olruwase/code_owners
Fix error caused by all_reduce call in domino
nv-accelerate-v100 #12539: Pull request #6880 synchronize by tjruwase
December 18, 2024 11:51 11m 49s hongwei/fix_domino_allreduce
Stage3: Use new torch grad accumulation hooks API
nv-accelerate-v100 #12538: Pull request #6773 synchronize by deepcharm
December 18, 2024 09:44 11m 24s deepcharm:stage3-use-new-grad-acc-api
Add arctic model support by adding w2 to all_reduce
nv-accelerate-v100 #12536: Pull request #6856 synchronize by loadams
December 18, 2024 01:31 2h 31m 47s pi314ever:arctic-enabling-upstream
nv-accelerate-v100
nv-accelerate-v100 #12534: Scheduled
December 18, 2024 00:07 1h 34m 28s master
Fix no-torch workflow and update real_accelerator
nv-accelerate-v100 #12533: Pull request #6885 opened by loadams
December 17, 2024 22:25 3h 10m 13s loadams/fix-real-accelerator-no-torch
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-accelerate-v100 #12531: Pull request #6803 synchronize by loadams
December 17, 2024 20:22 19m 35s nelyahu:zero2_param_idx
Add arctic model support by adding w2 to all_reduce
nv-accelerate-v100 #12530: Pull request #6856 synchronize by loadams
December 17, 2024 19:58 12m 54s pi314ever:arctic-enabling-upstream
Cleanup ops/transformer/inference tests
nv-accelerate-v100 #12529: Pull request #6830 synchronize by loadams
December 17, 2024 19:55 19m 49s loadams/transformers-inference
Inference ops unit test failures/fixes
nv-accelerate-v100 #12528: Pull request #6879 synchronize by loadams
December 17, 2024 19:54 11m 10s loadams/inference-ops-test-repro
Update transformers ops unit tests to use required_torch_version
nv-accelerate-v100 #12527: Pull request #6884 synchronize by loadams
December 17, 2024 18:22 11m 41s loadams/fix-transformers-inference
Inference ops unit test failures/fixes
nv-accelerate-v100 #12524: Pull request #6879 synchronize by loadams
December 17, 2024 18:00 11m 20s loadams/inference-ops-test-repro
[inf] Add config var to enable keeping module on host
nv-accelerate-v100 #12522: Pull request #6846 synchronize by oelayan7
December 17, 2024 07:46 3m 51s oelayan7:keep_module_on_host