Actions: huggingface/datatrove

Showing runs from all workflows
1,088 workflow runs

Address comments
Lint #454: Commit cb81215 pushed by mariosasko
December 13, 2023 12:57 35s optimize-parquet-reader
Optimize ParquetReader
Run tests #239: Pull request #40 opened by mariosasko
December 12, 2023 18:07 1m 29s optimize-parquet-reader
Optimize ParquetReader
Lint #453: Pull request #40 opened by mariosasko
December 12, 2023 18:07 24s optimize-parquet-reader
Optimize ParquetReader
Lint #452: Commit ba648c9 pushed by mariosasko
December 12, 2023 18:03 26s optimize-parquet-reader
Support Python 3.8
Run tests #238: Pull request #39 opened by mariosasko
December 12, 2023 17:18 1m 21s python3.8-support
Support Python 3.8
Lint #451: Pull request #39 opened by mariosasko
December 12, 2023 17:18 41s python3.8-support
Nit
Lint #450: Commit 7b6e3ba pushed by mariosasko
December 12, 2023 17:07 32s python3.8-support
bugfix to also read WET files with WarcReader
Run tests #237: Commit 6014a6f pushed by guipenedo
December 12, 2023 15:43 1m 36s main
bugfix to also read WET files with WarcReader
Lint #449: Commit 6014a6f pushed by guipenedo
December 12, 2023 15:43 24s main
bugfix outputfilename with gz
Run tests #236: Commit 476de37 pushed by guipenedo
December 12, 2023 03:43 1m 8s main
bugfix outputfilename with gz
Lint #448: Commit 476de37 pushed by guipenedo
December 12, 2023 03:43 22s main
recursive was not taken into account in fsspec
Lint #447: Pull request #38 synchronize by thomwolf
December 10, 2023 01:44 19s fix-recursive
recursive was not taken into account in fsspec
Run tests #235: Pull request #38 synchronize by thomwolf
December 10, 2023 01:44 1m 38s fix-recursive
updates
Lint #446: Commit 126a0b9 pushed by thomwolf
December 10, 2023 01:44 22s fix-recursive
recursive was not taken into account in fsspec
Run tests #234: Pull request #38 opened by thomwolf
December 7, 2023 08:26 1m 25s fix-recursive
recursive was not taken into account in fsspec
Lint #445: Pull request #38 opened by thomwolf
December 7, 2023 08:26 24s fix-recursive
recursive was not taken into account in fsspec
Lint #444: Commit baced94 pushed by thomwolf
December 6, 2023 23:44 22s fix-recursive
started work on the readme
Lint #443: Commit 35fc53e pushed by guipenedo
December 6, 2023 15:52 23s main
started work on the readme
Run tests #233: Commit 35fc53e pushed by guipenedo
December 6, 2023 15:52 1m 24s main
added check on rank 0 for input files on readers
Run tests #232: Commit c834995 pushed by guipenedo
December 6, 2023 12:48 1m 19s main
added check on rank 0 for input files on readers
Lint #442: Commit c834995 pushed by guipenedo
December 6, 2023 12:48 20s main
fix tokenization error on empty data
Run tests #231: Commit 3e3f0c8 pushed by guipenedo
December 6, 2023 12:38 1m 15s main
fix tokenization error on empty data
Lint #441: Commit 3e3f0c8 pushed by guipenedo
December 6, 2023 12:38 25s main
Merge pull request #37 from huggingface/labeling
Run tests #230: Commit 7e007ff pushed by thomwolf
December 6, 2023 12:24 1m 14s main
Merge pull request #37 from huggingface/labeling
Lint #440: Commit 7e007ff pushed by thomwolf
December 6, 2023 12:24 24s main