@dapangmao
Last active August 29, 2015 14:16

http://www.ninechapter.com/course/2/

##Basics

  • Evolution of Taobao
    • Remove IOE (IBM servers, Oracle databases, EMC storage)
    • Add memcached and Hadoop
    • Add a CDN
  • NoSQL
    • get(key)
    • put(key, value)
    • or CQL in Cassandra
    • Table
      • Column families in Cassandra, “Table” in HBase, “Collection” in MongoDB
      • Don’t always support joins or foreign keys
      • Still indexed
      • Unstructured / no schema / columns sometimes missing / no foreign keys
    • Column store
      • easy to read selected columns
      • faster range search
  • CAP
  • Cloud and AWS
  • Sharding vs. distributed
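The get(key)/put(key, value) interface above can be sketched with a plain in-memory map standing in for the distributed store (`KVStore` is a made-up name for illustration):

```python
# Minimal sketch of the NoSQL get/put interface: one key, one value,
# no joins and no foreign keys (in-memory stand-in, not a real store).
class KVStore:
    def __init__(self):
        self._data = {}

    def put(self, key, value):
        self._data[key] = value

    def get(self, key):
        # Returns None for a missing key instead of raising.
        return self._data.get(key)

store = KVStore()
store.put("user:1", {"name": "Ann"})
print(store.get("user:1"))  # {'name': 'Ann'}
```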

##Design Memcached

##Design topK and tiny URL

##Design a key-value store

###Cassandra

####Replication Strategy

  1. SimpleStrategy
    • RandomPartitioner: Chord-like hash partitioning
    • ByteOrderedPartitioner: assigns ranges of keys to servers
  2. NetworkTopologyStrategy: for multi-DC deployments (e.g., two or three replicas per DC); per DC:
    1. First replica placed according to Partitioner
    2. Then go clockwise around ring until you hit a different rack
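The two placement steps above can be sketched as follows. This is a hypothetical illustration, not Cassandra's code: the tokens, node names, and racks are made up, and MD5 stands in for the partitioner's hash.

```python
import hashlib

# Sketch of replica placement: hash the key to a ring position, put the
# first replica on the next node clockwise, then keep walking clockwise
# until a node on a *different* rack is found for the second replica.
def ring_position(key, ring_size=360):
    return int(hashlib.md5(key.encode()).hexdigest(), 16) % ring_size

def place_replicas(key, ring, racks):
    """ring: list of (token, node) sorted by token; racks: node -> rack."""
    pos = ring_position(key)
    idx = next((i for i, (tok, _) in enumerate(ring) if tok >= pos), 0)
    primary = ring[idx][1]
    for step in range(1, len(ring)):                # walk clockwise
        candidate = ring[(idx + step) % len(ring)][1]
        if racks[candidate] != racks[primary]:      # first different rack
            return primary, candidate
    return primary, None                            # only one rack exists

ring = [(90, "A"), (180, "B"), (270, "C"), (359, "D")]
racks = {"A": "r1", "B": "r1", "C": "r2", "D": "r2"}
primary, secondary = place_replicas("user:1", ring, racks)
print(primary, secondary)   # the two replicas land on different racks
```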

####Snitches

  • Maps IPs to racks and DCs; configured in the cassandra.yaml config file
  • Some options:
    • SimpleSnitch: Unaware of Topology (Rack-unaware)
    • RackInferring: assumes the network topology from the octets of a server’s IP address, x.<DC octet>.<rack octet>.<node octet> (e.g., 10.101.102.103 → DC 101, rack 102, node 103)
    • PropertyFileSnitch: uses a config file
    • EC2Snitch: uses EC2
      • EC2 Region = DC
      • Availability zone = rack
    • Other snitch options available
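The RackInferring rule above amounts to reading three octets out of the IP. A toy parser (illustrative only, not Cassandra's implementation):

```python
# Sketch of the RackInferringSnitch rule: the 2nd, 3rd, and 4th octets
# of a node's IP are read as DC, rack, and node respectively.
def infer_topology(ip):
    _, dc, rack, node = ip.split(".")
    return {"dc": dc, "rack": rack, "node": node}

print(infer_topology("10.101.102.103"))
# {'dc': '101', 'rack': '102', 'node': '103'}
```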

####Writes

  • Client sends write to one coordinator node in Cassandra cluster
    • Coordinator may be per-key, per-client, or per-query
    • Per-key Coordinator ensures writes for the key are serialized
    • Coordinator uses the Partitioner to send the query to all replica nodes responsible for the key
    • When X replicas respond, coordinator returns an acknowledgement to the client
  • Always writable: Hinted Handoff mechanism
    • If any replica is down, the coordinator writes to all other replicas, and keeps the write locally until the down replica comes back up.
    • When all replicas are down, the coordinator (front end) buffers writes (for up to a few hours).
  • One ring per data center
    • Per-DC coordinator elected to coordinate with other DCs
    • Election done via ZooKeeper, which runs a Paxos (consensus) variant
  • Workflow
    • When a write comes in, log it in the on-disk commit log (for failure recovery)
    • Make changes to the appropriate memtables
    • When a memtable is full or old, flush it to disk as an SSTable
    • Index the SSTable file and add a Bloom filter
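The commit-log → memtable → SSTable workflow above can be sketched like this. All structures here are in-memory stand-ins with made-up names; a real node writes the log and SSTables to disk.

```python
# Illustrative write path: append to a commit log for recovery, update the
# mutable memtable, and flush it as an immutable sorted "SSTable" when full.
class WritePath:
    def __init__(self, memtable_limit=3):
        self.commit_log = []        # durable log (here: a list)
        self.memtable = {}          # in-memory, mutable
        self.sstables = []          # immutable, sorted tables
        self.limit = memtable_limit

    def write(self, key, value):
        self.commit_log.append((key, value))   # 1. log for failure recovery
        self.memtable[key] = value             # 2. update the memtable
        if len(self.memtable) >= self.limit:   # 3. flush when full
            self.sstables.append(sorted(self.memtable.items()))
            self.memtable = {}

wp = WritePath()
for i in range(4):
    wp.write(f"k{i}", i)
print(len(wp.sstables), wp.memtable)  # 1 {'k3': 3}
```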

####Compaction and Delete

  • Delete: don’t delete the item right away; add a tombstone to the log; the item is physically removed during compaction
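A sketch of the tombstone idea: a delete just writes a marker, and the data is only dropped when SSTables are compacted together (hypothetical structures, newest-wins merge):

```python
# Tombstone-based deletion: compaction merges SSTables oldest-to-newest,
# lets newer entries overwrite older ones, then drops tombstoned keys.
TOMBSTONE = object()

def compact(sstables):
    merged = {}
    for table in sstables:        # oldest -> newest
        for k, v in table:
            merged[k] = v         # newer entries overwrite older ones
    return sorted((k, v) for k, v in merged.items() if v is not TOMBSTONE)

old = [("a", 1), ("b", 2)]
new = [("a", TOMBSTONE)]          # "delete a" is just another write
print(compact([old, new]))        # [('b', 2)]
```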

####Reads

  • Coordinator can contact X replicas (e.g., in same rack)
  • A row may be split across multiple SSTables => reads need to touch multiple SSTables => reads are slower than writes
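The point above is why reads cost more than writes: a read has to check the memtable and then potentially several SSTables, newest first. A minimal sketch (made-up structures):

```python
# Read path sketch: check the memtable first, then scan SSTables from
# newest to oldest and return the first match found.
def read(key, memtable, sstables):
    if key in memtable:
        return memtable[key]
    for table in reversed(sstables):   # newest SSTable first
        for k, v in table:
            if k == key:
                return v
    return None

sstables = [[("a", 1)], [("a", 2)]]    # "a" rewritten in a newer table
print(read("a", {}, sstables))         # 2
```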

####Membership

  • Any server in cluster could be the coordinator
  • So every server needs to maintain a list of all the other servers currently in the cluster
  • List needs to be updated automatically as servers join, leave, and fail
  • Cassandra uses gossip-based cluster membership
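A toy sketch of gossip-based membership: each round, every node pushes its heartbeat table to one random peer, and entries spread epidemically until every node knows every member. Node names and table shapes are made up for illustration.

```python
import random

# One gossip round: every node picks a random peer and pushes its whole
# membership table; the peer keeps the freshest heartbeat per member.
def gossip_round(tables, rng):
    nodes = list(tables)
    for node in nodes:
        peer = rng.choice([n for n in nodes if n != node])
        for member, heartbeat in tables[node].items():
            tables[peer][member] = max(tables[peer].get(member, 0), heartbeat)

tables = {n: {n: 1} for n in ("A", "B", "C")}   # each node knows only itself
rng = random.Random(0)
rounds = 0
while not all(len(t) == len(tables) for t in tables.values()):
    gossip_round(tables, rng)
    rounds += 1
print(rounds)   # a handful of rounds gives full membership
```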

####vs. RDBMS

  • On > 50 GB of data
    • MySQL
      • Writes: 300 ms avg
      • Reads: 350 ms avg
    • Cassandra
      • Writes: 0.12 ms avg
      • Reads: 15 ms avg

##Design a CDN

##Design a mobile application
