Pentaho 5.3 – Taming messy and extreme data

February 18, 2015

There has definitely been an evolution in how the industry talks about data. About five years ago the term ‘Big Data’ emerged to describe the sheer volume of data being collected. Soon after, the definition expanded to better explain what Big Data really is: not just big, but data that moves extremely fast, often lacks structure, varies greatly from existing data, doesn’t fit well with more traditional database technologies, and, frankly, is best described as “messy”.

Fast-forward to 2015 and Pentaho’s announcement this week of version 5.3, which delivers on-demand big data analytics at scale on Amazon Web Services and Cloudera Impala. This release is driven by what we see at more and more of our customers (a new data term for you): EXTREME data problems! Our customer NASDAQ is a very interesting example of where traditional relational data systems have maxed out and have been replaced by cloud architectures that include Hadoop, Pentaho and AWS Redshift. You can read their story here. What NASDAQ found was that moving vast amounts of data at extreme rates (10 billion rows every day) was more easily accomplished by combining cloud and big data technologies, creating a more scalable and highly elastic solution.

We’ve seen many of our customers processing vast volumes of data in Hadoop with the help of Pentaho to enable analytics at scale like never before. The biggest challenge these customers face is getting the results out of Hadoop and into the hands of the users who can make the most of fresh insights. That’s where Pentaho 5.3 comes into play. This release opens the data refinery to Amazon Redshift AND Cloudera Impala to push the limits of analytics through blended and governed data delivery on demand. In addition to adding Redshift and Impala support to the data refinery, 5.3 includes several other key features:

  1. Advanced Auto-Modeling – Advances in auto-modeling speed up the creation of generated data models and increase their sophistication, offering better analytics and ease of use.
  2. Additional Hadoop Support – Support for the latest Hadoop distributions from Cloudera and MapR, Hadoop cluster naming for simplified connectivity and management, and enhanced performance for scale-out integration jobs.
  3. Analyzer API Enhancements – Complete control over the end-user experience for highly tailored and easy-to-deliver embedded analytics.
  4. Simplified Customer Experience – A simpler mechanism for embedding analytics, plus documentation improvements that make learning easier.

If your data is big, messy, extreme or just plain annoying and needs to be tamed, I encourage you to learn more about Pentaho 5.3. Check out the great resources like the video and white paper to get started taming your data today.

Chuck Yarbrough
Product Marketing, Big Data
Pentaho

 


Why Pentaho Analyzer was ‘on the money’ at Strata

February 9, 2011


Last week, Pentaho was a sponsor at the Strata Conference in San Francisco, California. Over the two-day conference our booth was constantly packed with attendees. As I chatted with these eager visitors, I was pleased to hear comments such as: “My boss has asked me to look into Pentaho,” and “We currently have product X and we are looking into solutions that are easy to use and don’t charge an arm and a leg for it.” As I was demoing the Pentaho BI Suite Enterprise Edition, one thing was clear … the line between business users and technical users is fading fast!

Everyone wants access to data, but not everyone has it. Technical users, the ETL and BI developers, spend all their time working with data: integrating, cleansing, standardizing, and analyzing it. Business users, however, need their data in a different way. Data gives these business users power.

The days of traditional BI, when IT built standard reports and pushed them out to business users, are over. Canned reports are too slow to arrive, too limited in the information they provide, and often too far from what the business needed in the first place.

Business users want the power and flexibility to work with their data, and to compare and contrast anything to everything. But they want an intuitive and easy way to work with their data, similar to the tools they are most familiar with, such as Excel pivot tables. This all became very clear as I was demoing the Pentaho BI Suite Enterprise Edition. The reaction of people to Pentaho Analyzer was right on the money. Pentaho Analyzer has the same look and feel as Excel spreadsheets, with an easy drag-and-drop capability to pull business data (customers, sales, orders, payments, time, region, territory, etc.) up to the canvas and slice and dice it to the heart’s desire. “Why not Excel?” one attendee asked me. The answer is simple: because you can see the changes that your colleagues or IT staff make to data immediately, without having to cross-check, guess, or ask about them. Your Excel data is only as good as it once was!

Pentaho Analyzer’s easy-to-use and intuitive ad hoc analysis capabilities, with hyperlinks for drill-through to the detailed transaction data behind the scenes, along with its strong charting capabilities and the flexibility to export your final analysis results to Excel, PDF or any other popular format, topped it all off. I was pleased to see a lot of nods as I was demoing the software, a confirmation of Pentaho’s focus, strategy, and roadmap.

For additional highlights from the Strata Conference, read the blog post by Ian Fyfe, Thoughts on last week’s Strata Conference.

For a free 30-day evaluation of the Pentaho BI Suite (including Pentaho Analyzer), please go to http://www.pentaho.com/download/

Farnaz Erfan
Product Marketing Manager
Pentaho Corporation

