chore(dataobj): deduplicate decoder code across bucket and io.ReadSeeker #15945

rfratto · 2025-01-24T15:06:13Z

A previous comment pointed out that the code for BucketDecoder and ReadSeekerDecoder was extremely similar and could be deduplicated by introducing some kind of "range reader" interface.

This commit introduces such an interface, which maps directly onto bucket decoding. Implementations of the interface must tolerate multiple concurrent readers, which io.ReadSeeker cannot do because it carries a single shared offset. To meet this requirement while still allowing decoding of data objects that are in memory or backed by a file, ReadSeekerDecoder has been replaced by ReaderAtDecoder, which accepts a size argument indicating how large the object is.

rfratto · 2025-01-24T15:06:38Z

cmd/dataobj-inspect/go.mod

-	github.com/dustin/go-humanize v1.0.1
-	github.com/grafana/loki/v3 v3.3.2
-)
+require github.com/grafana/loki/v3 v3.3.2


@benclive Go told me I needed to run go mod tidy; let me know if anything here seems off.

rfratto · 2025-01-24T15:07:12Z

pkg/dataobj/internal/encoding/decoder_range.go

+	ReadRange(ctx context.Context, offset int64, length int64) (io.ReadCloser, error)
+}
+
+type rangeDecoder struct {


This is more or less a copy/paste from the old bucket decoder but it uses rangeReader instead.

rfratto requested a review from a team as a code owner January 24, 2025 15:06

pull-request-size bot added the size/XXL label Jan 24, 2025

rfratto force-pushed the dataobj-dedupe-decoders branch from edc9d87 to cf745b6 Compare January 24, 2025 15:08

rfratto requested review from cyriltovena and benclive January 24, 2025 15:09
