cassandraschedulerepair

cassandra: scheduling nodetool repair best practice


I have several questions regarding to nodetool repair and its scheduling.

Assumption:

Q1. what's the best practice to determine the groups with which repair is executed. (a)per node or (b)per table or (c)both?

example:

Q2. best pracetice for scheduling repair tasks I think two samples(calendar-base). Any advice or better schedule?

day M T W Th F St S

W1 I1 I2 I3 I4 I1 I2 F1

W2 I3 I4 I1 I2 I3 I4 F2

W3 I1 I2 I3 I4 I1 I2 F3

W4 I3 I4 I1 I2 I3 I4 F4

day M T W Th F St S

W1 I1 I2 I3 I1 I2 I3 F1

W2 I1 I2 I3 I1 I2 I3 F2

W3 I1 I2 I3 I1 I2 I3 F3

W4 I1 I2 I3 I1 I2 I3 spare

edit for clarity.


Solution

  • Q1. Repair in priority:

    Q2. It depends on your cluster size and your load. If your cluster can be repaired within 10 days with full repair then stick to it. Incremental repair has the inconvenience of splitting sstables. This will add extra compaction load later.