Roshi

Roshi is a set CRDT store modeled as a stateless web server in front of N (clusters) x M (shards) redis servers. It has 3 operations: Insert, Select, and Delete

Each element is a {"key", "member", "score"}

A key is the set name essentially. You select based on key. keys can be merged on select.
A member is the unique name of a member of the set
A score is a numeric value that has multiple uses.
- If two elements with the same key and member are inserted, a select will only return the one with the highest score
- If an element is deleted, the score of the delete must be at least as high as the score in the database.
- One can think of the score as a timestamp, because that's SoundCloud's use for it.

Using Roshi for Entry Counts

Option A: Member for each Score

key = <raffle_id>
member = <bundle_id>:<trigger_id>:<score_n> where score_n = 1,2,3,4 for an entry with a score of 4
score = <timestamp_of_action>
EntryCount(raffle_id) = Count(Select(raffle_id))
SelectRandom(raffle_id) = Nth(Select(raffle_id), Random() % EntryCount(raffle_id))

SelectRandom and EntryCount are fast, but memory usage increased by a factor of avg_entry_score over option B

Option B: Scores as Scores

key = <raffle_id>
member = <bundle_id>:<trigger_id>
score = <trigger_score>
EntryCount(raffle_id) = Sum(Map(Select(raffle_id), :score))
SelectRandom(raffle_id) = IterateUntil0(Select(raffle_id), Random() % EntryCount(raffle_id))

SelectRandom is still slow, but memory usage is less (by a factor of avg_entry_score

yanatan16/roshi.md

Roshi

Using Roshi for Entry Counts

Option A: Member for each Score

Option B: Scores as Scores

thealmightygrant commented Jun 18, 2015

Uh oh!