`O(n)` worst case sampling runtime #21

LilithHafner · 2024-12-11T23:15:01Z

The worst case sample time is also O(n) (or O(1) with a large constant factor, if you prefer to think about it that way), achieved when the list of levels starts with a lot of zeros:

julia> using DynamicSampling, Chairmarks

julia> ds = DynamicSampler()
DynamicSampler(Tuple{Int64, Float64}[])

julia> push!(ds, 1, 1.0)
DynamicSampler([(1, 1.0)])

julia> @b ds rand
8.429 ns

julia> for i in 2:1000
           push!(ds, i, 2.0^i)
       end

julia> @b ds rand
16.424 ns

julia> for i in 2:1000
           delete!(ds, i)
       end

julia> @b ds rand
1.292 μs (1 allocs: 48 bytes)

It's possible that this could be fixed by performing a partial sort while traversing the list during sampling, though maybe there's another way with less sampling overhead.

The text was updated successfully, but these errors were encountered:

LilithHafner · 2024-12-12T02:40:44Z

IIUC, the best this algorithm can get is O(log(n)) sampling time because of the case where weights are of the form [1, 1/2, 1/2, 1/4, 1/4, 1/4, 1/4, ...] and level selection alone takes log(n) time.

Tortar · 2024-12-12T22:08:21Z

yes my idea for this is to sort at some point after deleting/adding a certain number of elements e.g. now I'm using the really (too) simple https://github.com/Tortar/DynamicSampling.jl/blob/main/src/DynamicWeightedSampler.jl#L252, do you have suggestion for a better criterion? In general It would be cool to have some theoretical results on this I think

Tortar · 2024-12-13T00:23:37Z

IIUC, the best this algorithm can get is O(log(n)) sampling time because of the case where weights are of the form [1, 1/2, 1/2, 1/4, 1/4, 1/4, 1/4, ...] and level selection alone takes log(n) time.

Yes my earlier comment was wrong, you are right, it actually requires O(log(n)) as you say, and this method can't do better than that for this case

This was referenced Dec 13, 2024

Fix some o(n) cases when total weight lose precision #23

Merged

Fix some more o(n) sampling cases #31

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`O(n)` worst case sampling runtime #21

`O(n)` worst case sampling runtime #21

LilithHafner commented Dec 11, 2024

LilithHafner commented Dec 12, 2024

Tortar commented Dec 12, 2024 •

edited

Loading

Tortar commented Dec 13, 2024 •

edited

Loading

O(n) worst case sampling runtime #21

O(n) worst case sampling runtime #21

Comments

LilithHafner commented Dec 11, 2024

LilithHafner commented Dec 12, 2024

Tortar commented Dec 12, 2024 • edited Loading

Tortar commented Dec 13, 2024 • edited Loading

`O(n)` worst case sampling runtime #21

`O(n)` worst case sampling runtime #21

Tortar commented Dec 12, 2024 •

edited

Loading

Tortar commented Dec 13, 2024 •

edited

Loading