We can’t remind you often enough that Apache Hadoop is still an emerging technology, and that it takes education, training, time and experience to build tools that work with it.

In a world where carpetbaggers sell training courses that teach you to master Hadoop in a day, it’s no wonder that there are cowboys who actually believe that the tools that they’ve built work like a charm.

And that there are enterprises that buy them.

Who gets the blame when they fail? Everyone: the tool provider, the underlying technology, and the Big Data industry in general.

Cloudera Said, Not On My Watch


While no one can effectively police the wild, wild west, Cloudera has made an effort to certify its partners so that its clients can confidently purchase and leverage solutions that integrate and operate with Cloudera Enterprise 5.

"Certification is a key benefit of Cloudera 5 and the enterprise data hub. Not only does certification significantly reduce deployment risk, it assures customers that building a secure and open enterprise data hub with Cloudera includes the most comprehensive, integrated ecosystem of leading partner applications, tools, frameworks, services, and platforms," said Tim Stevens, Vice President, Business and Corporate Development at Cloudera.

Though there’s no way that we can give you the skinny on all 100 certified partners, we’ve talked to several to learn about what their solutions do and what the certification means to them and to their mutual customers.


Steven Hillion, the chief product officer of Alpine Data Labs, uses a conductor/orchestra metaphor:

"We think of ourselves as an analytics platform sitting on top of a data platform. We're the layer that allows users to define their analytics workflows simply and visually. Cloudera is the data layer. I like to think of us as Herbert von Karajan, and Cloudera is the Berlin Philharmonic."

He adds that Cloudera offers a treasure trove of great technologies beyond Hadoop (Spark, Impala, Cloudera Manager, and so on) together with world-class reliability, making it easier for his team to build powerful analytics apps on top of a platform like that.

“More importantly,” adds Hillion, “it’s easier for customers of Cloudera and Alpine to get value out of these combined data and analytics layers.”


Peter Schlampp, vice president, Products at Platfora, said that the certification serves as an assurance that his company’s customers can get value from Hadoop almost immediately.

“It’s a guarantee to customers that our analytics platform will smoothly integrate with their existing Hadoop deployments,” he said. “By using Platfora with Cloudera's Hadoop-based enterprise data hub, business users of all levels can manipulate datasets and analyze them for actionable insights to improve operations and accelerate company growth.”

This is crucial in a world where the growth of big data, including transactions, customer interactions such as social and web clickstream, and machine data, presents an immense opportunity for businesses to analyze and uncover critical intelligence that will drive fact-based decisions in the new data economy.


Rick Schultz, senior vice president of Marketing at Alteryx, said that Alteryx’s intuitive workflow for data blending and advanced analytics in combination with Cloudera’s enterprise data hub empowers data analysts and business users to perform predictive analytics that drive decisions.

Now they can do that with trust and in short order.

Karen Hsu is director of Product Marketing at Datameer, which provides self-service, end-to-end big data analytics solutions.


She said that customers are the winners. Not only that, but they’ll be winning even bigger as a result of the Cloudera/Intel relationship. “It will empower them to ask bigger questions,” she said.


Splice Machine CEO Monte Zweben is charged up about providing solutions that leverage Cloudera. Splice Machine provides the only Hadoop RDBMS that is designed to scale real-time applications using commodity hardware without application rewrites.

It’s worth noting that, like many other emerging big data startups, Splice Machine aims to replace traditional RDBMSs, such as Oracle, MySQL, IBM DB2 and Microsoft SQL Server, that are experiencing scaling or cost issues. It can also extend the Hadoop ecosystem beyond batch analytics to power real-time web, mobile, social, and enterprise applications.

And when it comes to the Cloudera/Intel relationship, he says that it creates additional buzz for Hadoop and lends Hadoop the market muscle and credibility of Intel.

Hats Off to Cloudera

This is just a tiny fraction of the tools Cloudera has certified; they’re well worth checking out.