Pentaho Expands Big Data Support with EMC Greenplum UAP

December 8, 2011

Today EMC Greenplum announced the industry’s next-generation platform for big data analytics—the EMC Greenplum Unified Analytics Platform (UAP). This platform combines co-processing of structured and unstructured data with a productivity engine that empowers and shatters current barriers to collaboration among data science teams.

Pentaho is proud to participate as a preferred Data Integration Partner for the EMC Greenplum UAP. A great boost for the productivity of data scientists, Pentaho plus the UAP make it easy to extract business intelligence out of unstructured data with the exact same tools and interfaces they use to access, visualize and analyze structured data.

This adds to Pentaho’s big data strategy and another step in our ongoing relationship with EMC Greenplum. Just back in May, we announced Pentaho support for the EMC® Greenplum® distribution of Hadoop, as well as native support for the Greenplum Database GPLoad (bulk loader).

A hybrid architecture that combines Hadoop with data marts and warehouses are a critical part of any effective business analytics architecture, so providing powerful and easy-to-use data integration, which is natively integrated with Hadoop, is a key piece of UAP. That’s why we believe Pentaho Data Integration is perfect for use as a part of UAP, especially as it provides the deepest available native integration with Hadoop, as well as native integration with the Greenplum Database and bulk loader.

The EMC Greenplum UAP is a giant step forward in simplifying the management, integration, and analysis of big data. It’s all about lowering the barriers to adoption, by simplifying working with big data analytics for IT, through things like providing familiar graphical interfaces for managing and analyzing big data, and orchestrating big data tasks. Then for business users, it’s about making it quick and easy for them to access and analyze big data.

Quentin

Fore more details, check out the video I recorded with the EMC Greenplum team discussing our technology and participation as a Preferred Data Integration Partner (I appear at 0:42):


Pentaho’s support of EMC Greenplum HD – what it means and why you should care

May 12, 2011

On Monday we announced our support for the EMC Greenplum distribution of Hadoop called EMC Greenplum HD. You can read about all the details in our press release, Pentaho Makes Hadoop Faster, More Affordable and Easier to Use with EMC.

This week we have been at EMC World in Las Vegas as a sponsor in booth 211 (if you are at the conference come visit us). We’ve had a great crowd and interest in Pentaho BI Suite for Hadoop, Pentaho Data Integration for Hadoop and our new native support for the Greenplum Database GPLoad high performance bulk loader. Two questions that attendees keep asking are: “How is Pentaho supporting EMC Greenplum HD,” and “Why should I care?” You can read my answers below and more details about our announcement in the press release and Pentaho & EMC web page.

How Pentaho supports EMC Greenplum for Hadoop
Pentaho is the only EMC Greenplum partner to provide a complete BI solution from data integration through to reporting, analysis, dashboarding and data mining, from a single BI platform with shared metadata. Pentaho’s support and certification complements the Greenplum distribution of Hadoop by providing an end-to-end data integration and BI suite with the cost advantages of open source that enables:

  • An easy-to-use, graphical ETL environment for input, transformation, and output of Hadoop data;
  • Massively scalable deployment of ETL processing across the Hadoop cluster;
  • Coordination and execution of Hadoop tasks by enabling them to be managed from within the Pentaho management console;
  • Easy spinning off of high performance data marts for interactive analysis;
  • Integration of data from Hadoop with data from other sources for interactive analysis.

Why this is a good thing and how it changes the industry
EMC Greenplum, in combination with key technology partners, for the first time is giving the industry an integrated, supported and certified data management and BI stack that includes storage, a MapReduce framework for processing unstructured data, an analytic database, predictive analytics and business intelligence.

By combining Pentaho’s powerful BI suite with the strength of EMC Greenplum’s storage and data management domain expertise, the industry benefits from maximum data throughput and significantly shorter implementation cycles for new Hadoop deployments.

Already an industry leader in data and storage, EMC is now well-positioned to play a pivotal role in commercializing Hadoop and giving businesses a more cost-effective and simple way to perform advanced analytics in a massively scalable way. For Hadoop to truly get to the next level, it needs to be as easy-to-install and use as off-the-shelf software.

If you are interested to evaluate Pentaho BI Suite and Pentaho Data Integration for the EMC Greenplum distribution of Hadoop, contact us at Pentaho_EMC@pentaho.com

Ian Fyfe
Chief Technology Evangelist
Pentaho

Photos from the Pentaho booth at EMC World this week

This slideshow requires JavaScript.


Follow

Get every new post delivered to your Inbox.

Join 101 other followers