You define your data model in Data Workbench > Master Segments. In Treasure Data Profiles (Audience Studio > Profiles), a series of workflows is automatically generated to create and refresh the master segment data model. Data in the data model is refreshed according to the schedule that you define for the master segment.
It is possible to have all attributes and behaviors for the master segment contained within one master table. However, the data model is typically specified using the following types of tables:
- Master Table
- Attribute Tables
- Behavior Tables
Attributes and behaviors are often linked by a common ID (typically the td_global_id or
| Table type | Description | Example |
|---|---|---|
| Master | contains a master list of profiles. The list is typically imported from a CRM system. | |
| Attribute | contains additional attributes of the profile set, and enriches the information in | |
| Behavior | contains the behaviors of the customers. | Website Visits |
The following schema diagram is an example of all three tables and how they are linked:
The following table schema is an example of an attribute table and behavior table linked by customer ID.
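As a rough illustration of how the three table types relate, the sketch below builds a tiny in-memory database and joins a master table to one attribute table and one behavior table on a shared customer ID. The table names, column names, and sample rows are hypothetical; only the master/attribute/behavior split and the common ID come from the description above. It uses SQLite purely for convenience and is not how Treasure Data builds the data model.

```python
import sqlite3

# Hypothetical tables and columns; only the idea of a shared customer ID
# linking master, attribute, and behavior tables comes from the text.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE master (customer_id TEXT PRIMARY KEY, email TEXT);
CREATE TABLE attributes (customer_id TEXT, loyalty_tier TEXT);
CREATE TABLE website_visits (customer_id TEXT, url TEXT, visited_at INTEGER);

INSERT INTO master VALUES ('c1', 'a@example.com'), ('c2', 'b@example.com');
INSERT INTO attributes VALUES ('c1', 'gold');
INSERT INTO website_visits VALUES
    ('c1', '/home', 1700000000),
    ('c1', '/pricing', 1700000600),
    ('c2', '/home', 1700001200);
""")


def profile_rows(connection):
    """Return one row per profile: ID, email, attribute value, visit count."""
    query = """
        SELECT m.customer_id, m.email, a.loyalty_tier, COUNT(v.url) AS visits
        FROM master AS m
        LEFT JOIN attributes AS a ON a.customer_id = m.customer_id
        LEFT JOIN website_visits AS v ON v.customer_id = m.customer_id
        GROUP BY m.customer_id, m.email, a.loyalty_tier
        ORDER BY m.customer_id
    """
    return connection.execute(query).fetchall()


rows = profile_rows(conn)
for row in rows:
    print(row)
```

The left joins keep every profile from the master table even when it has no matching attribute row, which mirrors the idea that attribute and behavior tables enrich the master list rather than filter it.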
System-Generated Entities in Master Segment
Master Segments automatically creates entities within your accounts. Do not edit or modify these system-generated entities; otherwise, the creation process fails. Example master segment system-generated entities:
- Database: cdp_audience_(AUDIENCE_ID)
- Table: customers
- Table: partitioned_customers
- Table: behavior_(XYZ)
- Table: activities
- Table: cdp_new_web_access_(SEGMENT_ID)
- Workflow: cdp_audience_(AUDIENCE_ID)
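To help avoid touching these entities by accident, a script that manages your own tables could check names against the patterns listed above before making changes. The following is a minimal sketch of such a guard. The function name is hypothetical, and it assumes that the audience and segment IDs are numeric, that the behavior suffix consists of letters, digits, or underscores, and that the listed tables live inside the cdp_audience_(AUDIENCE_ID) database. None of these assumptions is confirmed by the list itself.

```python
import re
from typing import Optional

# Patterns derived from the example entity names listed above.
# Assumption: AUDIENCE_ID and SEGMENT_ID are numeric, and the behavior
# suffix is made of letters, digits, or underscores.
SYSTEM_DATABASE = re.compile(r"cdp_audience_\d+")
SYSTEM_TABLES = [
    re.compile(r"customers"),
    re.compile(r"partitioned_customers"),
    re.compile(r"behavior_\w+"),
    re.compile(r"activities"),
    re.compile(r"cdp_new_web_access_\d+"),
]


def is_system_generated(database: str, table: Optional[str] = None) -> bool:
    """Return True if the name looks like a system-generated entity.

    Hypothetical helper: with only a database name it checks the database
    pattern; with a table name it also checks the table patterns.
    """
    if not SYSTEM_DATABASE.fullmatch(database):
        return False
    if table is None:
        return True
    return any(pattern.fullmatch(table) for pattern in SYSTEM_TABLES)


print(is_system_generated("cdp_audience_123", "customers"))
print(is_system_generated("my_database", "customers"))
```

The same database pattern also covers the workflow name, because the list shows the workflow and the database sharing the cdp_audience_(AUDIENCE_ID) form.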
Master Segment Data Enrichment
In Master Segments, you can enrich your data by adding specific td_ columns to either the attribute or behavior tables. The inclusion of one of the following td_ columns triggers some automatic actions by Treasure Data:
Defining Master Segments
- From the TD Console, select Data Workbench > Master Segments.
- Select New Master Segment.
- Specify Master Table, Attribute Tables, and Behavior Tables.
- Specify the schedule by which the master segment data is refreshed. We recommend a daily refresh to keep the dataset current. The master table must also be updated before master segment creation begins.
- Optionally, specify the default timezone.
- Specify the Master database and table name. For example:
- Optionally, specify the resource pool. This allows you to define how your allotted Treasure Data resources are used to process the data.
We generally recommend Hive only, because it is designed for large-scale joins and is known for robustness. Presto and Hive might perform faster for smaller data sets.
- Optionally, specify the attribute tables. The Join Key is used to join Attribute Tables. Include a td_ column here for automatic data enrichment.
- Optionally, specify tables from which behaviors can be derived. Use the Name field to give a user-friendly name to the specific column. Include a td_ column here for automatic data enrichment.
- Save, name, and run your master segment.
- Optionally, view the progress of the workflow that is created by defining the master segment.
- To verify that the process ran and created your master segment, navigate to Profiles.
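Before working through the steps in the TD Console, it can help to write down the choices you intend to make. The sketch below models those choices as plain Python data classes and reports which required values are still empty. Every class, field, and value here is hypothetical and exists only for planning. It is not a Treasure Data API, and it does not create a master segment.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class AttributeTable:
    table: str
    join_key: str  # key used to join the attribute table


@dataclass
class BehaviorTable:
    table: str
    column: str
    name: str  # user-friendly name for the column


@dataclass
class MasterSegmentPlan:
    """Hypothetical planning record mirroring the steps above."""
    name: str
    master_database: str
    master_table: str
    refresh_schedule: str = "daily"  # the text recommends a daily refresh
    timezone: Optional[str] = None   # optional in the steps above
    attribute_tables: List[AttributeTable] = field(default_factory=list)
    behavior_tables: List[BehaviorTable] = field(default_factory=list)

    def missing_required(self) -> List[str]:
        """Return the labels of required values that are still empty."""
        required = {
            "name": self.name,
            "master_database": self.master_database,
            "master_table": self.master_table,
            "refresh_schedule": self.refresh_schedule,
        }
        return [label for label, value in required.items() if not value.strip()]


plan = MasterSegmentPlan(
    name="",
    master_database="crm",
    master_table="customer_master",
    attribute_tables=[AttributeTable(table="loyalty", join_key="customer_id")],
    behavior_tables=[BehaviorTable(table="pageviews", column="url", name="Website Visits")],
)
print(plan.missing_required())
```

Attribute and behavior tables default to empty lists because the steps mark them as optional.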