HBase Performance Evaluation continued - The Smoking Gun
Update: See also part 1 and part 3 . In my last post I described my initial foray into testing our HBase cluster performance using the PerformanceEvaluation class. I wasn't happy with our conclusions, which could largely be summed up as "we're not sure what's wrong, but it seems slow". So in the grand tradition of people with itches that wouldn't go away, I kept scratching. Everything that follows is based on testing with PerformanceEvaluation (the jar patched as in the last post ) using a 300M row table built with PerformanceEvaluation sequentialWrite 300 , and tested with PerformanceEvaluation scan 300 . I ran the scan test 3 times, so you should see 3 distinct bursts of activity in the charts. And to recap our hardware setup - we have 3 regionservers and a separate master. The first unsettling ganglia metric that kept me digging was of ethernet bytes_in and bytes_out. I'll recreate those here: Figure 1 - bytes_in (MB/...