HOT TOPICS: Customer Experience Marketing Automation Social Business SharePoint 2013 Document Management Big Data Mobile DAM

Who Coined the Term 'Enterprise Data Hub' and Does It Matter?

Information Management, Who Coined the Term 'Enterprise Data Hub' and Does It Matter?“That’s not fair, it was my idea first …" Anyone who has kids has heard that line before, and it seems that some people don’t grow out of it.

Now granted, sometimes there are good reasons for these arguments and accusations. Apple says Samsung "slavishly" copied the iPhone and iPad in its Galaxy line of mobile phones and tablet, and the courts agree that its claim has merit. And the so-called Rockstar consortium, made up of Microsoft, Apple, Sony and others, is suing Google over patents.

But then there’s MapR, the company that develops and sells Apache Hadoop-derived software; a representative sent me an email yesterday. It states:

There has been a lot of attention lately given to enterprise data hubs or data lakes. Some vendors have suggested that their views are novel and first-to-market. Yet, MapR and others have been talking about enterprise data hub requirements for many months.”

It seems like the next line should be “That’s not fair.” Or “That’s not right.”

Sticks and Stones 

Who are these “some” vendors that MapR is referring to? We asked Jack Norris, the company’s chief marketing officer (CMO) this question, but he deflected it insisting that the spotlight be on the criteria that a “true” enterprise data hub requires — long term storage, high availability, data protection, full backup, full disaster recovery and so on…

Needless to say, these are features that MapR claims to have built into its Hadoop-based offerings. But does that mean it stands alone in providing them?

We decided to leave that question for later and to again ask Norris who specifically MapR referred to in the email where it stated, “Some vendors have suggested that their views are novel and first to market.”

Plenty of Room in the Data Hub Sandbox

A Google search for “Enterprise Data Hub” points primarily to one vendor — Cloudera. Yet Cloudera doesn’t seem to claim that the term belongs only to it or that its definition of “Enterprise Data Hub” is the same as everyone else’s.

In fact, at Strata + Hadoop World last month, speaking specifically about data lakes and data hubs, Cloudera founder Mike Olson said that, “This (Enterprise Data Hub) meme is much in the industry right now.”

He then went on to explain that Cloudera’s vision for the data hub is that it takes in diverse data, processes it, and serves it up to a variety of downstream systems. In other words, Cloudera has moved beyond seeing Hadoop as a digital sandbox for data scientists, to something that allows its customers to bring more diverse workloads to their data, beyond just MapReduce.

Matt Brandwein, director of product marketing at Cloudera, explains that it’s with this in mind that Cloudera (long ago) included HBase in its distribution, launched native interactive SQL for Hadoop with Cloudera Impala, provided integrated Search for Hadoop, and so on. He also says that Cloudera was first to market with these features.

He adds that Cloudera saw early on that without comprehensive security and data management — including access controls, auditing, lineage and discovery — that enterprises who wanted to utilize data hubs would never adopt the Hadoop platform and fully realize its potential. As a result they built Sentry, Cloudera Navigator, and doubled down on enhancements to the core open source platform to deliver rock-solid availability and data protection, to ensure that Hadoop could be trusted as a central data management platform.

In many cases, Cloudera’s products are open source and the company’s competitors have subsequently adopted them.

“We see what real customers need and build what the enterprise requires,” says Brandwein.

There’s no doubt that other Hadoop brands might make similar claims and for the most part, that’s not a problem for Cloudera. Partly because they believe they are ahead of the market, (“Other vendors talk about use cases. We have production reference customers”) and partly because what’s good for the marketplace is also good for Cloudera.

As Mike Haro, Hortonworks Director of Communications, has said, “We are way too early in this market to fight now,” meaning that, at this point, the market still has plenty of room to grow.

Gartner analyst, Merv Adrian, seems to agree.


Continue reading this article:

Useful article?
  Email It      

Tags: , , , , , , , , , , ,



Featured Events  View All Events | Add Your Event | feed Events RSS