Edit Pentaho Mondrian Models Inline in your Browser

Originally posted on Big Data Analytics:

Our friends at Ivy Software (http://www.ivy-is.co.uk/ivy-labs/ivy-software/) have updated one of my favorite community marketplace tools available to Pentaho customers, Ivy Schema Editor. It is a simple but powerful tool: you can modify and edit your Mondrian semantic business models inline in your browser from the Pentaho User Console … Great job, guys!


I can now create new models inline and test the model through Analyzer in one place. To me, for anyone building an interactive BI solution with Pentaho, this seems like a must-have tool.
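For readers who have not seen one, a Mondrian model is an XML schema describing cubes, dimensions, and measures. The sketch below builds a minimal one with Python's standard library, only to illustrate the kind of document a schema editor works on. The schema, table, and column names (SalesDemo, sales_fact, region, amount) are hypothetical, and this is not taken from Ivy Schema Editor itself.

```python
import xml.etree.ElementTree as ET


def build_schema() -> ET.Element:
    """Build a minimal Mondrian-style schema: one cube, one dimension, one measure."""
    # All names below are made up for illustration.
    schema = ET.Element("Schema", name="SalesDemo")
    cube = ET.SubElement(schema, "Cube", name="Sales")

    # The fact table the cube reads from.
    ET.SubElement(cube, "Table", name="sales_fact")

    # A dimension joined to the fact table through a foreign key.
    dim = ET.SubElement(cube, "Dimension", name="Region", foreignKey="region_id")
    hier = ET.SubElement(dim, "Hierarchy", hasAll="true", primaryKey="region_id")
    ET.SubElement(hier, "Table", name="region")
    ET.SubElement(hier, "Level", name="Country", column="country", uniqueMembers="true")

    # A measure aggregated over a fact table column.
    ET.SubElement(
        cube,
        "Measure",
        name="Revenue",
        column="amount",
        aggregator="sum",
        formatString="#,###.00",
    )
    return schema


schema_xml = ET.tostring(build_schema(), encoding="unicode")
print(schema_xml)
```

Editing a model inline amounts to changing elements and attributes like these, then re-testing the cube in Analyzer.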


Spark on Fire! Integrating Pentaho and Spark


Spark is really cool stuff

Originally posted on Pentaho Business Analytics Blog:

One of Pentaho’s great passions is to empower organizations to take advantage of amazing innovations in Big Data to solve new challenges using the existing skill sets they have in their organizations today.  Our Pentaho Labs’ innovations around natively integrating data engineering and analytics with Big Data platforms like Hadoop and Storm have already led dozens of customers to deploy next-generation Big Data solutions. Examples of these solutions include optimizing data warehousing architectures, leveraging Hadoop as a cost effective data refinery, and performing advanced analytics on diverse data sources to achieve a broader 360-degree view of customers.

Not since the early days of Hadoop have we seen so much excitement around a new Big Data technology as we see right now with Apache Spark.  Spark is a Hadoop-compatible computing system that makes big data analysis drastically faster, through in-memory computation, and simpler to write, through easy APIs in…


Packt celebrates 10 years with a special $10 offer



This month marks 10 years since we embarked on our mission to deliver effective learning and information services to IT professionals. To celebrate this huge milestone, we are offering ALL of our eBooks and Videos at just $10 each for 10 days – this promotion covers every title and you can stock up on as many copies as you like until July 5th.

If you’ve already tried a Packt title in the past, you’ll know this is a great opportunity to explore what’s new and maintain your personal and professional development. If you’re new to Packt, then now is the time to try our extensive range – we’re confident that in our 2000+ titles you’ll find the knowledge you really need, whether that’s specific learning on an emerging technology or the key skills to keep you ahead of the competition in more established tech. To get you started, we’ve put together 20 of our latest and greatest titles for you to pick up right here. But don’t forget, you can get ANY eBook or Video for $10 in this offer – browse all our categories at the bottom of this page.


Book Review: Pentaho Analytics for MongoDB (February 2014)



Book review by: David Fombella Pombal (twitter: @pentaho_fan)

Book Title: Pentaho Analytics for MongoDB

Author: Bo Borland

Paperback: 146 pages

I recommend this book if you want to get started with the MongoDB document-oriented storage engine and the Pentaho open source BI suite.

Target Audience
This book is intended for business analysts, data architects, and developers new
to either Pentaho or MongoDB, who want to be able to deliver a complete solution
for storing, processing, and visualizing data. It’s assumed that you already have
experience in defining the data requirements needed to support business processes
and exposure to database modeling, SQL query, and reporting techniques.

Rating: 8 out of 10


Chapter 1, Getting Started with Pentaho and MongoDB, introduces you to the powerful combination of MongoDB and Pentaho and provides step-by-step guidance on how to install and configure both technologies and restore the sample MongoDB data provided with this book.
Chapter 2, MongoDB Database Fundamentals, expands on the topic of data modeling and explains MongoDB database concepts essential to querying MongoDB data with Pentaho.
Chapter 3, Using Pentaho Instaview, shows you how to visualize data by connecting Pentaho to MongoDB. You use Instaview with the sample MongoDB database to analyze and visualize the website clickstream data.
Chapter 4, Modifying and Enhancing Instaview Transformations, introduces Pentaho Data Integration (PDI)—the ETL tool used by Instaview to extract, load, and transform data from various data sources.
Chapter 5, Modifying and Enhancing Instaview Metadata, explores metadata by explaining dimensional modeling concepts and how to model metadata to better reflect business requirements.
Chapter 6, Pentaho Report Designer Fundamentals, teaches you the basics of Pentaho Report Designer (PRD) to build pixel-perfect reports sourced directly from MongoDB databases.
Chapter 7, Pentaho Report Designer Prompting and Charting, expands on the previous chapter by teaching you additional advanced PRD features. You can enhance your report with new queries, charts, and a prompt designed to make the report more interactive.
Chapter 8, Deploying Pentaho Analytics to the Web, is all about web-enabling your MongoDB data using Pentaho methods and web interfaces for connecting to, modeling, and analyzing our sample clickstream data in a web browser.



Chapter Contents List:

Chapter 1: Getting Started with Pentaho and MongoDB
MongoDB technology overview
Pentaho technology overview
Installing MongoDB
Installing MongoDB as a Windows service
Restoring the sample clickstream MongoDB database
Installing Pentaho

Chapter 2: MongoDB Database Fundamentals
MongoDB database objects
Sample clickstream database objects
MongoDB data modeling
Normalized models
Denormalized models
MongoDB query methods
Query exercise 1
Read operations
Query exercise 2
Query operators
Querying arrays

Chapter 3: Using Pentaho Instaview
Accessing and connecting Instaview to MongoDB
Parsing and profiling a MongoDB collection
Adding a MongoDB query expression
Creating and saving an analysis view and Instaview

Chapter 4: Modifying and Enhancing Instaview Transformations
Opening an existing Instaview
Data integration
Adding a new data source
CSV file input
Stream lookup
Creating a new analysis view from blended data

Chapter 5: Modifying and Enhancing Instaview Metadata
Model design with dimensions and measures
Open an existing Instaview
Modifying measures and dimensions
Session duration measure
Session count measure
Event count measure
Referring URL dimension
Other dimension changes
Creating a new analysis view

Chapter 6: Pentaho Report Designer Fundamentals
Pentaho Report Designer features
Data sources
Report elements
Aggregations and calculations
Formatting and output
Navigating through Pentaho Report Designer
Report workspace
The Structure tab
The Data tab
The Style and Attributes tabs
The palette
The main menu and toolbar
The tab toolbar
Interface reference
Creating a MongoDB connection and query
Adding a MongoDB data source
Adding and formatting report elements
Adding a message field to your report
Adding number-fields to your report
Adding calculated values to your report

Chapter 7: Pentaho Report Designer Prompting and Charting
Adding additional MongoDB queries
Adding a bar chart query
Adding a pie chart query
Visualizing your data with charts
JFreeChart chart types
Chart data collectors and properties
Creating a bar chart
Modifying bar chart properties
Creating a pie chart
Creating a report prompt
Creating a new parameter
Adding parameters to existing report queries
Creating subreport import parameters

Chapter 8: Deploying Pentaho Analytics to the Web
Publishing a Report Designer report to the Web
Publishing the clickstream report
An introduction to the Pentaho User Console
Running and scheduling the clickstream report
Enabling your Instaview output for the Web
Copying and modifying the Instaview transformation
Using the Data Source Wizard to model your data
Creating a JDBC connection and default metadata model
Customizing the metadata model
Creating Analyzer Views and Dashboard Designer dashboards
Creating a map view in Analyzer
Creating a heat grid in Analyzer
Creating a dashboard using Dashboard Designer

Pentaho Reporting Video Course

I would like to recommend this excellent video course created by my friend Francesco Corti and officially reviewed by Paul Hernandez and me.

Pentaho Reporting [Video]



Course Contents:

    1. Getting Started with Pentaho Reporting [15:57 minutes]
      • Installing Pentaho Reporting
      • Loading and Saving Reports and Having a Preview
      • Building a Report Using the Report Wizard
      • Building the ‘My First Report’
      • Customizing the ‘My First Report’
      • Advanced Customization on the ‘My First Report’

    2. Dive Deeper into the Pentaho Reporting Engine’s XML and Java APIs [11:44 minutes]
      • Setting the Java Development Environment
      • Embedding a Pentaho Report in an Enterprise Web Application
      • Embedding a Pentaho Report in a SWING Application
      • Introducing Serialized Reports
      • Building a Report Using Pentaho Reporting’s Java API

    3. Configuring the JDBC Database and Other Data Sources [12:43 minutes]
      • Configuring Your Data Source to a DBMS Using JDBC
      • Configuring Your Data Source to an OLAP Engine (Mondrian)
      • Configuring Your Data Source to an XML File and a Table
      • Configuring Your Data Source to Metadata and PDI
      • Working with Data Sources in Java

    4. Introducing Graphic Chart Types – Pie, Bar, Line, and Others [10:36 minutes]
      • Incorporating a Line Chart into a Pentaho Report
      • Incorporating Supported Charts and Common Properties
      • Incorporating and Customizing Charts into a Report
      • Incorporating Images into a Report

    5. Modifying Reports Using Parameters and Internationalization [11:14 minutes]
      • Parameterizing a Pentaho Report
      • Parameterizing a Pentaho Report Using Java
      • Working with Functions and Expressions
      • Working with Formulas
      • Internationalization and Localization of Pentaho Reports

    6. Adding Subreports and Cross Tabs in Your Reports [09:52 minutes]
      • Adding a Multi-page Subreport in a Pentaho Report
      • Parameterizing and Adding Chart Subreport in a Pentaho Report
      • Adding a Side-by-Side Subreport in a Pentaho Report
      • Adding Cross Tabs in a Pentaho Report

    7. Building Interactive SWING and HTML Reports [12:29 minutes]
      • Building Interactive Reports in SWING
      • Building Interactive Reports in HTML

    8. Using Pentaho Reporting in the Pentaho Suite [13:10 minutes]
      • Using Pentaho Reporting with Pentaho Business Intelligence Server
      • Using Pentaho Reporting with Pentaho Data Integration (Kettle)

What you will learn from this video course

  • Install Pentaho Reporting in your development or production environment
  • Create impressive reports with advanced charts, interaction, multi-language support and much more
  • Use the Pentaho Reporting engine in your Java environment for web and Swing applications
  • Interact with and customize your Pentaho reports using Java (in a web or Swing application)
  • Develop basic and advanced reports using several data sources, including OLAP engines
  • Deploy and use your Pentaho reports inside the Pentaho suite, in particular in the Pentaho Business Intelligence Server and Pentaho Data Integration

Who this video course is for

If you are a Java developer or IT professional who wants to assemble custom reporting solutions with Pentaho Reporting, this video course is ideal for you. Master the advanced concepts within Pentaho Reporting such as sub-reports, cross-tabs, data source configuration, and metadata-based reporting.

In Detail

Pentaho Report Designer is one of the most important core modules of the Pentaho BI Suite, which builds impressive reports using open source business intelligence solutions. Pentaho Report Designer helps you develop professional applications with multi-language support and parameterized reports.

You will learn exactly how to develop basic and advanced reports using the Pentaho Report Designer environment, and a more customized Java environment. All of the examples are described in depth with the source code, and you will be guided through this course using a step-by-step approach which will ensure that you’ll achieve impressive results.

This course begins with the installation of the Java development environment using practical examples, moving on to how to develop impressive reports using tables, charts and sub-reports. The examples will also be shown in a Java development environment for web and Swing applications.

Next, you will be taken on a practical run through the Pentaho Report Designer. This guide will then explain Java APIs, data source connections, and the development of several chart types. You will also learn the most relevant advanced features needed to make a report, such as internationalization, parameterization, interaction, functions, expressions, sub-reports and cross-tabs, leading the way to the use of reports in the Pentaho Suite (especially in the Pentaho BI server and Pentaho Data Integration).

With this video course on basic and advanced Pentaho Reporting development, you’ll get hands-on with enterprise report development using one of the most relevant open source business intelligence solutions.

Day against DRM Promotion


Packt Publishing is celebrating “Day against DRM” by providing all eBooks & Videos for just $10 from its site.

 DRM Info

DRM Frequently Asked Questions

If you have questions or answers to add or to improve upon, feel free to do so on the LibrePlanet wiki.

# What is DRM?

A basic explanation of DRM is here.

# What does DRM stand for?

Industry supporters of DRM refer to it as “digital rights management,” as if to suggest that users should be powerless and relinquish their ability to decide how they can use and interact with their media. DRM is a mechanism to enforce severe restrictions on users’ media that would otherwise be impossible, so DRM is about restrictions, not rights. Users should have control over their own media, not be left at the mercy of major media and technology companies. For that reason, opponents of DRM refer to it as “Digital Restrictions Management”.

# What are some examples of DRM?

Depending on the DRM system, various limits and controls are imposed on both hardware and software. Users may be forced to use certain hardware or software platforms, limited to accessing their media on a predetermined number of devices, required to have a persistent Internet connection to use local files, have their files tied to an online account, unable to use accessibility software such as screen readers, cut off from accessing media in certain locales, or even stripped of their media by having their files silently and remotely deleted at any time.

  • If you purchase electronic copies of games from Steam, you can’t sell them or share them with a friend after you’re done playing them. If you so much as try, Steam will disable your account, which takes away your entire game collection.
  • During the mid-2000s, Sony bundled its music CDs with DRM that tracked users’ listening habits, created security vulnerabilities in their computers, and prevented CD-copying software from functioning.
  • Netflix and YouTube have constructed anti-features to prevent customers from viewing their media in certain countries or on a certain number of devices.
  • In 2009, Amazon remotely deleted copies of George Orwell’s dystopian novel, 1984, that were distributed through the Kindle store. This chilling example of potentially malicious behavior would have never been possible without DRM.

# What is the purpose of DRM?

While it is advertised as a mechanism to prevent copyright infringement, DRM is actually designed to restrict all of the incredible possibilities enabled by digital technologies and place them under the control of a few, who can then micromanage and track every interaction with digital media. In other words, DRM is designed to take away every possible use of digital media, regardless of legal rights, and sell some of these functionalities back as severely limited services.

# Doesn’t DRM limit copyright infringement?

DRM is not about limiting copyright infringement. Such an argument attempts to make DRM appear beneficial to authors and is based entirely on a (very successfully advertised) misrepresentation of DRM’s purpose. To illustrate the absurdity of the argument, consider the nature of file sharing: to obtain a copy of a file without permission, downloaders go to a friend or a file sharing network, not a DRM-encumbered distribution platform. If DRM existed only to prevent unauthorized sharing, every copy of that particular piece of media would have to be distributed by an uncrackable DRM-encumbered distribution platform, which is impossible on its own. So long as one copy becomes available without DRM, countless more are easily produced. Industry proponents of DRM are well aware that DRM is not a copyright enforcement mechanism. DRM is only marketed as a copyright enforcement mechanism to mislead authors into tolerating and even defending it.

# What is the difference between DRM and copyright enforcement?

DRM restricts entirely different activities than copyright does, and serves an entirely separate function. While copyright restricts who can distribute media, DRM restricts how users can access their media. Copyright already provides leverage against illegal distribution, meaning that the largest distribution platforms must already adhere to the demands of large publishers, studios, music labels, and software companies. DRM provides antifeatures (features that exist only to worsen the service for users) and charges for their removal. This gives major media and technology companies much broader control over the use of media than is enabled by copyright law, while copyright allows them to force all legal media distribution services to use DRM.

# Who does DRM harm?

DRM only restricts and punishes those who have acquired their media legally through DRM-encumbered platforms. Even authors, along with independent labels, studios, and publishers suffer. When a distributor gains significant control over a particular market, DRM enables them to lock in their customers to their platform. Once customers are locked in, so are labels, studios, and publishers. If an independent publisher wants to switch away from a DRM-encumbered distributor, customers might have to re-purchase their media on the new platform. As with any instance of monopolization, businesses which dominate a market can arbitrarily dictate the price they charge, as well as the price they pay for media, because suppliers are dependent on them. Without DRM, users have control over their own media such as where, when, how, and on what platforms they choose to use their files.

# Doesn’t DRM make sense for streaming media and rental services?

The problem with this argument is that it invites a future in which nobody has any control over their devices, and can only access media through DRM-encumbered distribution services. This argument is also based on misinformed claims that DRM prevents copyright infringement (see above). Streaming media services are rising in popularity, and DRM turns this into an opportunity to bring an end to personal media ownership. Rather than having services that can stream a user’s media to any device using whatever software they choose, DRM consolidates distribution and services, such that all access to media must be through these services.

# Isn’t DRM ineffective anyway?

The argument that DRM “doesn’t work” because it can often be circumvented misses the point, because DRM is not about copyright enforcement. DRM is very effective at what it does: limiting the freedom of anyone who uses DRM-encumbered services so that some functionality can be sold back as severely limited services.

# Why is DRM bad for software user freedom?

DRM is incompatible with free software. DRM is only possible by keeping some parts of a computer secret from users and unmodifiable, which is a direct attack on users’ freedom. DRM cannot function while being free software, as this would allow the antifeatures enforced by DRM to be undone.

# Are Hollywood and the media companies to blame for DRM?

Not exclusively. Major media companies work in tandem with technology companies to create DRM and force all legal media distributors to encumber files with it. This way, all their customers remain dependent on them, which helps maintain their dominant position in the market.

# Which formats support DRM?

It’s important to remember that sometimes DRM is built into software and not part of a file format, and also file formats that support DRM do not necessarily require it. If you are wondering whether the file you are using could possibly be encumbered by DRM, we maintain the following list. Please note that this is only a list of formats which support DRM, and bears no weight on any other technical merits or restrictions of the formats.

Here is a list of formats that support DRM:



Pentaho and MongoDB Rendezvous in SF

Originally posted on Pentaho Business Analytics Blog:

The City Club of San Francisco is well known for hosting the swankiest of weddings in San Francisco. However, on May 1st, it will be the location of a union of a different sort – The Pentaho and MongoDB ‘Better Together’ luncheon. Pentaho and MongoDB have emerged as a powerful combination for scalable data storage, processing, and analytics. This special event is not to be missed, as the four hours are packed with access to industry leaders, peer use cases, and key points you need to know to make a difference in your organization right away – PLUS a free Pentaho Analytics for MongoDB book and a 5-star lunch. Here is a glance at the agenda:

10:30 a.m. to 11:00 a.m.
Welcome & Introductions: Pentaho & MongoDB Teams

11:00 a.m. to 12:00 p.m.
Pentaho Analytics for MongoDB & Demo – Bo Borland, Pentaho’s VP of Field Technical Sales…
