Data Replication

Data Replication — approach of storing the same data on multiple storage devices

Benefits
- +Availability
- +Reliability
- +Read/Write Throughput
- -Latency (in case of geo-distributed replication)

Replica

Replica — node in data replication approach
Type
- Active — process all requests from clients
- Passive — process only some of client requests or do not process at all and receive results from active replica

Data Consistency — property of a system to keep data the same at different places
Types
- Eventual Consistency — consistency is reached eventually
- Strong Consistency — consistency is always met from the client's point of view

in case of asynchronous replication

If user is able to modify only small subset of resources then allow user read those resources from leader and rest of them from replicas (e.g. profile in social media)
If application can suffer a replica lag then measure, for example, 99% percentile of resource replication lag and use it to switch read opeartions to replicas (e.g. 99% percentile of resource replication lag is 200ms then within 200ms after write operation read from leader and since 200ms have passed read from replicas)
Keep track of most recent update timestamp on client side and send it as a part of read request. Then ensure that replica from which we are reading store data fresh enough according to provided timestamp

Keep track of most recent update timestamp on client side and send it as a part of read request. Then ensure that replica from which we are reading store data fresh anough according to provided timestamp

also called as primary-secondary backup, active/passive, leader-follower or master-slave replications

Single Leader Replication — data replication with single leader (active replica)
Benefits
- +Read Throughput
- Drawbacks
- +Latency (synchronous mode)
- Reading stale data (asynchronous mode)
- Single point of failure

Client stores timestamps for read-after-write consistency
Sticky routing for monotonic read consistency (clients read from the same replica, if it dies, new replica is chosen for reading)
Quorum and fencing
- If a network is partitioned into two subsets, the subset with the majority of nodes remains active while shooting the minority subset (this approach is literally called STONITH — Shoot The Other Node In The Head) by sending out a special signal to power supply controller

also called active/active or multi-master replications

Multi Leader Replication — data replication with multiple leaders (active replicas)

Leaderless Replication — data replication with no leader (every replica is active)

What are advantages and disadvantages of synchronous replication compared to asynchronous?
- Advantages
  - It's guaranteed that all replicas store up-to-date data
  - no need for conflict resolution
- Disadvantages
  - works slowly
  - can't write if one replica disabled
What is replication strategy for Google Docs?
- Several leaders
  - one leader – big delay, especially in presence of network partition
  - no leader – big delay, especially if there are some of replicas in other region
- Asynchronous replication
  - synchronous replication - big delay, especially in presence of network partition
What are disadvantages of LWW?
- Clock drift causes wrong results of operations
- No conflict resolution logic
What is a difference between vector clock and Lamport Clock?
- Lamport clock doesn't respect causality
What is a usage example of vector clocks?
- Questions and replies in messenger's chats require causality