performancehbasetableau-apiopentsdb

TSDB vs HBase : What to choose in long term?


The problem we are trying to solve : Store all the datapoints that could be queried later point to deduce performance of a webservice. Give a visual representation of data.

Here is my understanding so far from what I have read on Hbase and on TSDB sites

One more issue with TSDB is it doesn't work with current HBase .96 (I know tsuna@ is working on it but I dont have timeline for a stable release of tsdb which works against current Hbase.96)

So your simple answer could be Hbase or TSDB, or in addition to choosing one over another you can justify your answer and help me choose one over another.

I am open for other solutions, if they can support about trillion data-points stored over a year.


Solution

  • In my honest opinion, If you need plotting data, you should consider using OpenTSDB. I'm not sure in a long term period what it could be better because OpenTSDB is continuosly developing and in November it was released the RC 2.0 version. There is also a fork of OpenTSDB that is called KairosDB and it uses Apache Cassandra as database, so that you can store points up to milliseconds, while OpenTSDB allows you to store points up to seconds.

    So I suggest you Timeseries Distributed Databases like OpenTSDB/KairosDB for plotting data in a quite simple way. Not sure about the long term issue, maybe something better could be developed over Hbase in the next month because of its opensource nature.