I'm becoming less and less enchanted with Hadoop as time goes by.
The whole ecosystem of Hadoop software seems half-baked. There's one good core idea: HDFS plus distributed computation. Then there is a raft of marginal software built around it.
Hive: On the plus side, you get some SQL and ad hoc query goodness. On the minus side, it's a marginal subset of SQL, with bugs.
HBase: I haven't heard anyone say this is fully baked. In fact, most people get bitten by something or other.
Cascading: A somewhat useful abstraction that, while not too buggy, doesn't quite do it for me. I'm writing way too much code to do simple things, and I have to go out of my way not to lose the metadata (Fields/typing).
I'm hoping that Cliff Click's 0xData doesn't completely suck.
http://0xdata.com/
In my dream world, Postgres gets a few new features and I never have to think about this again.