riak

Roll back a Riak node leaving the ring


I ran riak-admin cluster leave riak@gcp-07-internal.com on a node I want to remove from the Riak ring. The command ran successfully, but the partition transfer has been running for more than 2 days.

riak@gcp-07  =======>  media-riak@gcp-01
-internal.com                  -internal.com      
      |=============================================  | 106%  
                          16.96 MB/s     

Is there a way to accelerate this process? If not, I'd like to cancel it and keep the node in the ring.

I'm running riak_kv_version 2.1.7 and all my buckets have n_val=3. How can I safely kill this job?

I tried the force join command but got an error message: node already exists


Solution

  • Patrick,

    Thanks for your inquiry. Backing out of a node removal generally requires downtime and is not exactly a refined process. As such, I would avoid it at all costs.

    If you want to speed things up, I would suggest increasing your transfer limit as far as you deem safe, up to the maximum accepted value of 8, e.g. riak-admin transfer-limit 8, which allows more transfers to happen in parallel. Please note that this will increase load across the cluster, so if your cluster is already under heavy load, this is not advisable. In such a situation, I would recommend patience.
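
The suggestion above could be scripted roughly as follows. This is only a sketch: the set_transfer_limit function and the RIAK_ADMIN override variable are hypothetical names introduced here for illustration, and the ceiling of 8 is taken from the answer rather than checked against Riak itself. The only Riak commands used are riak-admin transfer-limit and riak-admin transfers.

```shell
#!/bin/sh
# RIAK_ADMIN is a hypothetical override (not part of Riak) so the sketch
# can be dry-run, e.g. with RIAK_ADMIN=echo.
RIAK_ADMIN="${RIAK_ADMIN:-riak-admin}"

# Set the cluster-wide transfer limit.
# Rejects non-numeric input and anything above 8, the maximum cited in
# the answer above (an assumption of this sketch, not verified here).
set_transfer_limit() {
    limit="$1"
    case "$limit" in
        ''|*[!0-9]*)
            echo "limit must be a whole number" >&2
            return 1
            ;;
    esac
    if [ "$limit" -gt 8 ]; then
        echo "limit above 8 refused" >&2
        return 1
    fi
    "$RIAK_ADMIN" transfer-limit "$limit"
}

# A cautious sequence might look like this:
#   riak-admin transfer-limit    # show the current limit on each node
#   set_transfer_limit 4         # raise the limit in steps, watching load
#   riak-admin transfers         # check active and pending transfers
```

Raising the limit in steps rather than jumping straight to 8 leaves room to back off if the cluster starts to struggle under the extra handoff traffic.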