This article will explain how to use Arm Treasure Data with Pentaho Business Analytics using our JDBC driver. By combining Treasure Data with Pentaho, users can process huge volumes of data in the cloud using existing Pentaho instances.
Download Pentaho Report Designer
Download Pentaho Report Designer from the link below. Version 3.9.0-GA was tested for this article.
|Don't have time to setup Pentaho + Treasure Data? Use our Setup Consultation Service.|
Download our JDBC Driver
Download td-jdbc-jar-with-dependencies-(version).jar from the link below.
|This driver works only with Treasure Data. It does not work with other environments such as your local Hadoop/Hive cluster.|
If you want to know more about Treasure Data jdbc driver, please refer to the link below.
Add Treasure Data as Data Source
Let’s add Treasure Data as Pentaho’s data source. Please follow the procedure below.
Add JDBC Driver to Pentaho
Let’s copy the Treasure Data JDBC driver to Pentaho’s jdbc directory. The command for MacOS environment is shown below.
$ cp td-jdbc-jar-with-dependencies-(version).jar \ /Applications/pentaho/design-tools/Pentaho\ Report\ Designer.app/lib/
Create New Data Source
Data -> Add Data Source -> Advanced -> JDBC (Custom). Create a new data source, click GREEN plus button.
Configure JDBC Connection
Generic database -> Native (JDBC) and configure the jdbc connection as shown below. Be sure to edit your credentials (your email + password).
Connection Type: Generic Database
Custom Connection URL: jdbc:td://api.treasuredata.com/(database name)
Custom Driver Class Name: com.treasuredata.jdbc.TreasureDataDriver
User Name: Your TreasureData Account
Password: Your TreasureData Password
Note: Api endpoint is up to your account site.
If your account site is Tokyo, then Custom Connection URL would be "jdbc:td://api.treasuredata.co.jp/(database name)".
For details, please refer to our document.
You can now create reports using generic SQL queries. Open the window from
Data -> Add Datasource -> JDBC to add Treasure Data as the data source for the report. Once you hit the preview button, the job will start running on the cloud (you can confirm this using the
td jobs command). You can now design your reports using Treasure Data as your data source.