Feed: Cloud Data Integration Software | Matillion.
Author: Dayna Shoemaker
;
Kalyan Arangam, Solution Architect
Matillion uses the Extract-Load-Transform (ELT) approach to use the power of your MPP database/platform to power your data-transformation. This helps us deliver quick results for a wide range of data processing purposes.
The Google AdWords Query component in Matillion ETL for Amazon Redshift presents an easy-to-use graphical interface, enabling you to pull data from a Google AdWords account (yours or of a 3rd party) directly into your Data Warehouse. Many of our customers are using this component to extract data from one or more Google AdWords account into their data warehouse to analyse their Spend, Performance of Campaigns/Ads and where appropriate, even compare with their investments on other marketing platforms.
The connector is completely self-contained: no additional software installation is required. It’s within the scope of an ordinary Matillion license, so there is no additional cost for using the component.
Data Extraction
The image above represents the data extraction workflow for the component.
- First, Matillion sends appropriate requests to the Google AdWords service.
- Then Google AdWords returns relevant data to Matillion.
- Next Matillion streams the data as it arrives to S3. Please note, the data is not persisted to disk.
- Once all data is in S3, Matillion issues a COPY command to Amazon Redshift; passing the names of the files and other relevant metadata.
- Lastly, Amazon Redshift accesses the data and loads it into appropriate target tables.
Matillion automatically deletes any files its created in S3 irrespective of the result of the operation – success/failure.
Prerequisites
- A Google Account with access to AdWords data. Google also requires that you authenticate via a Google account to access AdWords related information. You may not own the data in which case your client/customer needs to allow your Google account access to their AdWords related data.
- A Developer Token that has been authorised for use with production data. Please contact your Google support representative if you need help with your developer token – FAQ.
- A Customer ID for the AdWords account.
- ClientID and Secret key from https://console.developers.google.com. Please see next section on Authentication
Authentication
Matillion supports OAuth to authenticate with Google services. Please see the following link for more information on OAuth registration with Matillion – Setting up OAuth in Matillion. The process involves accessing the Google developer console and generating a ClientID and Secret Key so it can then be used in Matillion for OAuth registration.
Additionally, if required, you may register multiple Google accounts to access data for various google services these accounts may have access to.
On registering for OAuth, you can may use this with other Google related components like Google Analytics, Google AdWords, Google Sheets, etc. The component has a Authentication property which lists Google OAuth registrations. Choose the relevant OAuth registration to work with data from that account.
Configuring the component
To start configuring the component first create/Open an orchestration job and add a Google AdWords Component to the job panel. Next you will need to set relevant properties; and once green, test the component. Watch the video below on how to configure the component.
Data Source
Google AdWords component models Google AdWords entities as relational table. The Data Source property presents a list of tables that will give you access to your Google AdWords report data using the Google AdWords Reporting API. Choose the relevant table from the dropdown.
Please read the data model page for information on available tables and the relevant columns.
Data Selections
This property presents you with a list of metrics, segments, and attributes that are available for that table. Select the ones that suit your requirement.
Google AdWords, notably, doesn’t allow you to select every column in a single query because some data will conflict if selected together. Therefore, in the event that you select all columns, only the default metrics, segments, and attributes will be returned. In general, these defaults are the same fields that are exposed through the AdWords console.
Data Source Filter
You may specify some Filters here. By default, All filters are joined by the AND keyword as OR is not supported by the AdWords API.
Filtering for a Date Range
By default, Google AdWords component returns data for the last 7 days. You may alternatively specify an explicit range using the StartDate and EndDate PseudoColumns.
By default, however, StartDate and EndDate may be hidden. Please enable these by adding a Connection Option PseudoColumns with value *=*. See here for more information on this connection option.
Basic/Advanced Mode
Basic mode is the default and allows a user to choose the data source and columns using a point and click interface.
In contrast, Advanced mode hides the Data Source and Data Selection options and presents a SQL property where the user can write simple SELECT statements to select relevant data. Some examples below.
> Return all columns:
SELECT * FROM CampaignPerformance
> Rename a column:
SELECT [Device] AS MY_Device FROM CampaignPerformance
> Search data:
SELECT * FROM CampaignPerformance WHERE Device = 'Mobile devices with full browsers'
> Restrict a result set to the specified number of rows:
SELECT Clicks, Device FROM CampaignPerformance LIMIT 10
> Specify a date range
SELECT Clicks, Device FROM CampaignPerformance WHERE StartDate='2018-01-01' and EndDate='2018-02-01'
Running the Google AdWords Query component in Matillion ETL for Amazon Redshift
Once you have configured the minimum required properties (Source – Google AdWords, Target – Amazon Redshift) the component changes to Green at which point you may test the component by Right-Clicking on it and choose Run Component. This will execute just that component and load any available data into the specified target table in Amazon Redshift.
Transforming the Data
Once your data is in Amazon Redshift, you may also use a transformation job to embellish the data by applying various transforms which may include adding additional columns (load date?) or even combining it with other datasets to derive useful information.
Video
Here’s a video with a quick introduction to the component and covering some common scenarios.
Conclusion
I hope that gave you a good overview of the Google AdWords Query component in Matillion ETL for Amazon Redshift. If you need further further help, please refer to the links below or contact us at support@matillion.com
Useful Links
Google AdWords Query – Component documentation
Integration Information
Component Data Model
Setting up OAuth in Matillion
Video Tutorial
Google AdWords FAQ
Want to try the Google AdWords component in Matillion ETL for Amazon Redshift? Arrange a free demo, or start a free 14-day trial.
The post Using the Google AdWords Query Component in Matillion ETL for Amazon Redshift appeared first on Cloud Data Integration Software | Matillion.