An Atomic Immutable Resource for Julia that is as light as air.
Noah C. Benson <[email protected]>
Air is an immutable data structure and software transactional memory tool for Julia. It provides several persistent data structures including dictionaries, sets, N-dimensional arrays, heaps, weighted dictionaries, and weighted sets, and it includes a transaction system that allows one to compose atomic operations over such data.
Air is currently under development but includes substantial testing and is generally stable. Inspiration for Air's design is derived largely from paradigms in Clojure.
Documentation for Air
is an ongoing endeavor and can be found
here.
The following code blocks demonstrate some simple examples of how one can use
the Air
library.
julia> using Air
# Creating a persistent dictionary is just like using Dict().
julia> cube = PDict{Symbol,Float64}()
PDict{Symbol,Float64} with 0 entries
# Instead of push!() and setindex!(), PDict objects efficiently produce
# persistent duplicates of themselves with any requested updates using
# functions like push() and setindex().
julia> cube = push(cube, :height => 0.2)
PDict{Symbol,Float64} with 1 entry:
:height => 0.2
julia> cube = push(cube, :width => 9.4, :depth => 1.1)
PDict{Symbol,Float64} with 3 entries:
:depth => 1.1
:height => 0.2
:width => 9.4
# The delete!() function is replaced by delete().
julia> delete(cube, :height)
PDict{Symbol,Float64} with 2 entries:
:depth => 1.1
:width => 9.4
# This does not modify the original object.
julia> cube
PDict{Symbol,Float64} with 3 entries:
:depth => 1.1
:height => 0.2
:width => 9.4
# Lookup operations are nearly as fast as with native Dict objects.
julia> cube[:height]
0.2
# The behavior of PDict is generally similar to Dict.
julia> cube[:name]
ERROR: KeyError: key :name not found
julia> using Air
# Persistent arrays are most commonly made from other arrays.
julia> v = PArray([1.0, 4.4, 2.9])
3-element PArray{Float64,1}:
1.0
4.4
2.9
# They can also be made using pfill(), pones(), and pzeros(), which are all
# similar to their non-persistent counterparts fill(), ones(), and zeros().
julia> m = pfill(5, (2,3))
2×3 PArray{Int64,2}:
5 5 5
5 5 5
# The operations push(), pushfirst(), pop(), and popfirst() are all similar to
# their mutable equivalents (push!(), pushfirst!(), pop!(), and popfirst!()),
# and all are very efficient with PVector (PArray{*,1}) objects.
julia> v = push(v, -1.8)
4-element PArray{Float64,1}:
1.0
4.4
2.9
-1.8
julia> v = pushfirst(v, -4.3)
5-element PArray{Float64,1}:
-4.3
1.0
4.4
2.9
-1.8
julia> pop(v)
4-element PArray{Float64,1}:
-4.3
1.0
4.4
2.9
# Updates can be performed with setindex(), which is like setindex!().
julia> setindex(m, -100, 2, 1)
2×3 PArray{Int64,2}:
5 5 5
-100 5 5
julia> using Air
# Volatile objects are like Ref objects. They are designed to be always safe to
# read but to only be settable inside a transaction.
julia> vol = Volatile(PDict(:a => 1, :b => 2, :c => 3))
Volatile{PDict{Symbol,Int64}}(@CjK4yBeEKUm: PDict(:c => 3,:a => 1,:b => 2))
# Like with Ref objects, Volatiles can be accessed with getindex(), inside or
# outside of a transaction.
julia> vol[]
PDict{Symbol,Int64} with 3 entries:
:c => 3
:a => 1
:b => 2
# However, outside of a transaction, it is an error to change a volatile.
julia> vol[] = push(vol[], :d => 4)
ERROR: cannot set volatile outside of transaction
# Inside of a transaction, it is guaranteed that no other thread will change any
# volatile that is read or changed during the transaction. This is the only safe
# time to edit volatiles.
julia> @tx vol[] = push(vol[], :d => 4)
PDict{Symbol,Int64} with 4 entries:
:d => 4
:c => 3
:a => 1
:b => 2
julia> vol[]
PDict{Symbol,Int64} with 4 entries:
:d => 4
:c => 3
:a => 1
:b => 2
# The tx() function is an alternate way to write transactions.
julia> tx() do
vol[] = push(vol[], :e => 5)
nothing
end
julia> vol[]
PDict{Symbol,Int64} with 5 entries:
:d => 4
:e => 5
:c => 3
:a => 1
:b => 2
# Actors are for managing side-effects of transactions or for managing data that
# needs to be updated through sequential single-threaded operations.
julia> notes = Actor{PVector{String}}(pfill("", 0))
Actor{PArray{String,1}}(@1j6NGQzgacH: String[])
# It is safe to read actors at any time, similar to reading Volatile values.
julia> notes[]
0-element PArray{String,1}
# We can define a function that sends a message to the log. The send function
# requires that the first argument be a function, to be run in the actor's
# thread, whose one parameter is the current value of actor and whose return
# value is the updated value for the actor.
julia> logmsg(s) = begin
send(val -> pushfirst(val, s), notes)
nothing
end
logmsg (generic function with 1 method)
# We can test out the logmsg function. Note that the pushfirst action in the
# logmsg function above is run in a separate thread (the actor's thread).
julia> logmsg("Log initialized.")
julia> notes[]
1-element PArray{String,1}:
"Log initialized."
# Actors can be run and read in transactions, and they are guaranteed to be
# unchanged for the duration of a transaction, and no other thread will send
# it any functions during the transaction.
julia> @tx logmsg("Next log message.")
julia> notes[]
2-element PArray{String,1}:
"Next log message."
"Log initialized."
# If a transaction fails for any reason, no sends occur. Actors are only sent
# functions when a transacton succeeds.
julia> tx() do
logmsg("Failure message.")
error()
end
ERROR:
julia> notes[]
2-element PArray{String,1}:
"Next log message."
"Log initialized."
Note that many of the core components for Air already have working
implementations. Others are currently undergoing testing. In particular, the
existing persistent data structures are fairly well tested and have a
performance comparable to their Clojure counterparts. Additionally, initial
tests of the thread-safe transaction system using the transaction block macro,
Actor
s and Volatile
s appear to work fine. However, as the author is not an
expert on testing multi-threaded code, some caution is advisable.
- Completed plans:
- Persistent data structures:
PArray
, a persistent array type that mimics Julia's nativeArray
PDict
, a persistent dictionary type that mimics Julia's nativeDict
PSet
, a persistent set type.PWSet
, a weighted persistent set type. Each element of aPWSet
has a weight, and the set itself acts both as a piority queue (first(pwset)
always yields the element with the highest weight inO(1)
time) and as a distribution (rand(pwset)
yields a random element of the set where the probability of each element is proportional to its weight).PWDict
, a persistent weighted dictionary type is like thePWSet
type, but instead of elements with weights, the dictionary contains key–value pairs with weights.LazyDict
, a persistent lazy dictionary type.
- Composable multi-threading utilities, inspired by Clojure:
- A
Volatile
type that operates within transaction blocks to ensure that all updates to references within a synchronized block are performed atomically. - An
Actor
type for sending asynchronous jobs to independent threads which also respects the atomic requirements of transactional blocks. - Thread-local
Var
type. - Thread-safe
Delay
type.
- A
- Persistent data structures:
- Plans with incomplete testing:
- Thread-safe
Promise
types.
- Thread-safe
- Plans that are not yet implemented:
- Improved Persistent Array Methods. Currently, most methods of persistent
arrays yield mutable Julia arrays instead of persistent arrays. This is
largely because the
PArray
class does not explicitly overload these functions or the broadcasting functions in order to ensure that it createsPArray
objects. For example,PArray([1,2,3]) .+ 1
should yield aPArray
; currently it yields anArray
. - Forms. Arbitrarily-deep nested persistent dictionaries and vectors are
a common data organization paradigm for persistent data. With
Volatile
objects embedded in such a data-structure, multi-threaded operations can be made almost transparent. TheForm
trait will be a subsystem for such structures, using the various persistent types. - Better query/build/update API tools for the data structures:
- The
setindex
andpush
functions are great, but it would be nice if nested persistent data structures had single coherent way of updating and querying them.
- The
- Persistent Record Types. Tool for defining data structures that support lazy reification of members and updates to their properties in the style of persistent data structure.
- Inheritable Method Parameters. A common problem in many languages is that an API function that calls another lower-level function often needs to support some subset of the argument preprocessing that occurs in the lower-level function. The preprocessing and documentation of the shared arguments shouldn't need to be duplicated in multiple places; rather a simple macro should make it easy for common parameters to occur in both places and be documented identically in both.
- Improved Persistent Array Methods. Currently, most methods of persistent
arrays yield mutable Julia arrays instead of persistent arrays. This is
largely because the
- Problems that needs to be worked out:
- It's not clear to me that Julia has entirely settled on or documented the
difference between the
@async
and the@spawn
macros. My suspicion is that theActor
constructor may need an additional parameterasync=false
that can be set to true in order to force threads to be created using@async
instead of@spawn
. In Clojure there's a similar distinction between thesend
and thesend-off
functions.
- It's not clear to me that Julia has entirely settled on or documented the
difference between the
MIT License
Copyright (c) 2019–2021 Noah C. Benson
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.