hadoop3

Hadoop3 balancer vs disk balancer


I read Hadoop ver 3 document about disk balancer and it said

"Diskbalancer is a command line tool that distributes data evenly on all disks of a datanode.
This tool is different from Balancer which takes care of cluster-wide data balancing."

I really dont know whats difference between 'balancer' and 'disk balancer' yet.
Could you explain what is it?
Thank you!


Solution

  • Balancer deals with internodes data balancing present in multiple datanodes present in the cluster whereas disk balancer deals with data present disks of a single datanode.