This article will explain how to use Arm Treasure Data with Pentaho Business Analytics using our JDBC driver. By combining Treasure Data with Pentaho, users can process huge volumes of data in the cloud using existing Pentaho instances.
Download Pentaho Report Designer
Download Pentaho Report Designer from the link below. Version 3.9.0-GA was tested for this article.
|Don't have time to setup Pentaho + Treasure Data? Use our Setup Consultation Service.|
Download our JDBC Driver
Download the driver from the link below. The driver is still in beta, so your feedback is appreciated.
|This driver works only with Treasure Data. It does not work with other environments such as your local Hadoop/Hive cluster.|
Add Treasure Data as Data Source
Let’s add Treasure Data as Pentaho’s data source. Please follow the procedure below.
Add JDBC Driver to Pentaho
Let’s copy the Treasure Data JDBC driver to Pentaho’s jdbc directory. The command for MacOS environment is shown below.
$ cp td-jdbc-VERSION.jar \ /Applications/pentaho/design-tools/Pentaho\ Report\ Designer.app/lib/jdbc/
Create New Data Source
Data -> Add Data Source -> Advanced -> JDBC (Custom). Create a new data source, click GREEN plus button.
Configure JDBC Connection
Generic database -> Native (JDBC) and configure the jdbc connection as shown below. Be sure to edit your credentials (your email + password).
You can now create reports using generic SQL queries. Open the window from
Data -> Add Datasource -> JDBC to add Treasure Data as the data source for the report. Once you hit the preview button, the job will start running on the cloud (you can confirm this using the
td jobs command). You can now design your reports using Treasure Data as your data source.