Cloud Analytics Experience...4 4 Advanced Partner Status •Top strategic partner in APN •Multiple levels of collaboration •Heavy investment in training •>5,100 accreditations

1

© 2015 Teradata

Brian Wood, Director of Cloud Marketing, Teradata

Ray Newell, Learning Consultant, Teradata

Kevin Bogusch, Cloud Marketing, Teradata

Cloud Analytics ExperienceWelcome + Kickoff

© 2016 Teradata

2

2

Welcome!

• Today you will learn:

• How to spin up Teradata in the cloud

• Options for Do-it-yourself and As-a-service

• How to use Teradata Studio for (new) analytics

© 2016 Teradata

3

3

Thank You to Our Sponsors!

© 2016 Teradata

4

4

Advanced Partner Status

• Top strategic partner in APN

• Multiple levels of collaboration

• Heavy investment in training

• >5,100 accreditations

• >140 certifications

• 8 AWS Well-Architected reviews

• Data & Analytics Competency

© 2016 Teradata

5

5

Today’s Agenda

• 0-to-Launch in 5 minutes

• Do-it-yourself: Launch Teradata in the cloud

• Improve performance with software

– Break –

• As-a-service: See and learn about IntelliCloud

• Use Teradata Studio to run and visualize analytics

• See an nPath analytics demonstration

© 2016 Teradata

6

6

Before Jumping In…

© 2016 Teradata

7

7

Why Cloud?

• Accelerate time-to-value• Avoid lengthy procurement process

• Shift to OpEx• Often preferred over CapEx

• Reduce financial risk• Start small, grow only when needed

• Save money• Belief that cloud = cost savings

??

© 2016 Teradata

8

8

Cloud Consumption Option #1

Do-it-yourself Deployment

“I’ll deploy & manage in AWS myself.”

• Benefit: Broad system customization

• Use cases recommended:

• Proofs of Concept

• Test & Development

• Discovery Analytics / Data Labs

© 2016 Teradata

9

9

Standard Cloud Infrastructure

CPU

Memory

Teradata Software

Network Attached Storage

Software

Computeand

Storage

© 2016 Teradata

10

10

Do-it-yourself Footprint (AWS)

Americas International

Brazil, Canada, U.S. Australia, Germany, India, Ireland, Japan, Korea,

Singapore, U.K.

Teradata Software for AWS not yet available in the China, France, or US GovCloud regions.

© 2016 Teradata

11

11

And Now…Your Instructor

Ray Newell• Teradata Learning Consultant

• Teradata Master certification

• AWS Associate Architect certification

© 2016 Teradata

12

12

Cloud Analytics ExperienceDo-it-yourself Deployment of Teradata in AWS

13

13

DIY Deployment on AWS TopicsTeradata in the Cloud

Teradata Database in the AWS Marketplace

Launching the Teradata Database from the AWS Marketplace

Monitoring the Creation of the CloudFormation Stack

Teradata Database Security Group

Connecting to the Teradata Database via Teradata Studio™

Launching Viewpoint from the AWS Marketplace

Connecting and Configuring Viewpoint

Scaling Teradata Database Instances Up or Down

Scaling Teradata Database Instances In and Out

Expanding EBS Storage

Scaling Query Comparison

14

14

Teradata in the Cloud

15

15

DIY Deployment – Introduction

https://www.teradata.com/Products/Cloud/Do-it-yourself-Deployment/AWS

The web site https://www.teradata.com/Products/Cloud/Do-it-yourself-Deployment/AWS is available to provide additional information about deploying Teradata software on your own.




16

16

DIY – License Options There are two paths for using Teradata software on your own with

AWS infrastructure:

Scrolling down, the Intro menu, you will see that you have two options to deploying your own software. On-demand, in which you pay by the hour as you use Teradata software, or BYOL, in which you use a license that you have previously purchased from Teradata. BYOL provides discounts over the on-demand pricing and is the preferred option if you will be running Teradata software for long amounts to time.

17

17

DIY – Documentation

In the Documentation menu, you will have links to various product documentation. The Deployment and Administration Guide contains the detailed steps for deploying Teradata on AWS.

18

18

DIY – Resources

In the Resources links to various other resources such as videos and white papers.

19

19

DIY – Support

In the Support menu, you will have a link to register for Premier Support which is included with Teradata on AWS.

20

20

Teradata Database in the AWS Marketplace

21

21

AWS Marketplace – Teradata

https://aws.amazon.com/marketplace/search/results?searchTerms=“teradata database”

Searching for “teradata database” in the AWS Marketplace will display the various Teradata Database software tier and licensing options you can choose.

For our example, clicking Teradata Database Enterprise link will take you the Product Overview page.

https://aws.amazon.com/marketplace/search/results?searchTerms=%E2%80%9Cteradata

22

22

Product Overview

This is the Product Overview page for the Teradata Database Enterprise. This will give you information about the Teradata Database Enterprise product along with a short video reviewing the benefits.

23

23

Product Overview (cont.)

Scrolling down the Product Overview page will display additional overview information.

Note: the subscription for the Teradata Database Enterprise also includes additional software that can be launched with no additional software costs. They will incur additional AWS EC2 charges and any storage charges.

24

24

Monthly Pricing

Continuing to scroll down will display pricing information. These are the hourly prices for the Teradata software.

Note: AWS infrastructure costs will vary by Region.

25

25

Annual Pricing

Continuing to scroll down will display additional pricing information. These are the Annual prices for the Teradata software.

Note: AWS infrastructure costs will vary by Region.

26

26

Usage

Scrolling further down, after pricing information, usage information will be displayed.

From this page you also get access to documentation regarding the Teradata Database and more specifically Teradata on AWS.

Note that the Getting Started guide will give you information specifically related to the options you will be seeing and the different parameters you will be required to supply when you launch the software via the CloudFormation Template.

27

27

Reviews

The last information on this page will be reviews that have been submitted. After browsing the information from the Product Overview page, you will need to click the Continue to Subscribe button.

28

28

Subscribe

To be able to launch Teradata software in AWS you will first need to create a subscription. To create a subscription, you must first click the Accept Terms button.

29

29

Subscribe (cont.)

After clicking the Accept Terms button, you will need to wait a minute or so while your subscription request is processed.

30

30

Subscribe (cont.)

After a minute or so, you will have a subscription, and the Continue to Configuration button will be activated. You will need to click that button to continue on the Configuration screens.

31

31

Configure the Software

From the configuration screen, you will choose the Software Version and Region from the drop down menus. You can chose from the current 16.20 version or previous versions 16.10 or 15.10.

Note: There is only one Fulfillment Option available.

32

32

Annual License Option

If you choose, it is here that you have the option to purchase an annual license.

Note: When you click the Add button, your account will be charged the annual license price. There will not be any additional confirmation or “are you sure” screens.

33

33

Launching the Software

When you launch the software, you have two options. You can Copy to Service Catalog or Launch CloudFormation. If you choose the Launch CloudFormation option, you will be then taken to the CloudFormation service in the AWS console. This is the preferred method and will be covered in the next section.

34

34

Service Catalog Option

From the Launch screen, if you chose the Copy to Service Catalog option, you will be able to copy the CloudFormation template to the AWS Service Catalog.

35

35

Copying to the Service Catalog

After you copy the configuration to the AWS Service Catalog you will see a success message and you have the option to view the Service Catalog.

36

36

Launching the Teradata Database from the AWS Marketplace

37

37

Teradata CloudFormation Template

After clicking the Launch button from the AWS Marketplace, you will be logged into the AWS Console and taken to the CloudFormation service and the Teradata Database Enterprise template.

The first screen you will see is the Select Template. From this screen you need to make sure that the region you want to launch into is correctly selected. If not, select the Region you want from the drop down menu. After validating the correct Region, you will need to click the Next button.

38

38

Specify Details – Teradata Database

After clicking the Next button you will be taken to Specify Details screens. In this screen, you will specify the Teradata Database parameters.

You must specify the Stack name, System Name, and DBC Password. For the Japanese Support, NTP Servers and System Timezone parameters, you can elect to take the defaults.

The Stack name is what you will see in the CloudFormation service.

39

39

Teradata Database Parameters

Teradata Database Parameters (cont.)

Continuing down the Teradata Database Parameters screen, you will specify whether you want to enable Temporal, Secure Zones, Row-Level Security and Teradata Intelligent Memory.

If you choose to enable Teradata Intelligent Memory, you will specify a percentage number. By default, it is disabled.

40

40

Node Configuration Parameters

After the Teradata Database Parameters, the next set of parameters will be for the Node Configuration. Here you will specify the Number of Nodes, the Instance and Storage Type, and whether you will want to have the system Unfolded at launch time or if you want the system to have the ability to unfold the system after launch.

If you have the system unfolded or unfold ready, additional private IPv4 addresses will be assigned to the instances when they are created and entries will be made into the host file. If you choose none, you will be able to unfold the system, but must run additional commands to configure the additional private IPv4 addresses prior to unfolding the system.

Data Storage only applies if you choose an m4 instance type. With m4 instances, you will also have the ability to expand your storage after creation. The i2 and d2 instance types have fixed storage sizes and cannot be expanded.

The last two node configuration options have to do with encryption. EBS Encryption are for the data volumes attached to the nodes and Root Disk Encryption is for the local system disk.

41

41

VPC/Network Parameters

Required and must already exist

After the Node Configuration parameters, you will supply the VPC/Network parameters. You will choose which VPC and subnet to launch into, from existing VPCs and Subnets in the drop down menus.

Database Placement Group is a required parameter and you must also have created a Cluster Placement Group prior to launching the template. You will enter the name. If you do not remember the name, you can go to the EC2 Dashboard and select Placement Groups. Note: if you mistype the name, the template will fail after launch.

Elastic IPs are static public IP addresses that will be assigned to each of the DBS nodes. It is recommended for the Teradata Database to choose Yes for the Elastic IPs option.

42

42

Cluster Placement Group

In the CloudFormation template, Placement Group is a required parameter

You must also have created a Cluster Placement Group prior to launching the CloudFormation template. If you did not create a Placement Group, you can go to the EC2 Dashboard, select Placement Groups and then click Create Placement Group.

43

43

Cluster Placement Group (cont.)

A Cluster Placement Group clusters instances in a single Availability Zone to provide low latency and higher network throughput

A spread placement group spreads instances across distinct hardware to improve redundancy

In this screen, you will need to supply a Name and a Strategy. The Strategy is either Cluster or Spread. For Teradata we want Cluster.

A Cluster Placement Group clusters instances in a single Availability Zone to provide low latency and higher network throughput.

A spread placement group spreads instances across distinct hardware to improve redundancy.

44

44

Security Group and Key Pair

Typically this would be your VPN IPv4 address

Back to the template, the last set of parameters are Security Group and Key Pair. For the Remote Access From parameter, you will enter the CIDR block of the IPv4 addresses that will allow access to the Teradata Database instances. Rules will be added for ports 22 (SSH) and 1025 (Teradata Database). This means that only traffic from those IPv4 addresses will be allowed to access the Teradata Database instances. All other traffic will be denied. This is for security purposes. If you do not know which IPv4 addresses you want to allow access to at this time, you can supply the CIDR block of the subnet you are launching the instances into. You will then need to explicitly allow other traffic from outside the Subnet by editing the Security Group after launch.

After entering in all of the parameters, click the Next button.

45

45

Options – Tags and Permissions

The next step is to select Options. The first Options you can specify are Tags. Tags are key/value pairs that AWS assigned to the resources. This can be helpful when you need to track resource utilization and consumption at a more detailed level than just your AWS account. These Tags can be used as filters in monitoring and billing services.

For Permissions, you can choose an IAM Role from the drop down menu or enter a Role ARN (Amazon Resource Name). When you launch the template, it will create various resources such as EC2 instances, and in order to be able to do that, you need have explicit permissions. Roles are a collection of those permissions, and AWS administrators can create roles and assign those roles to selected IAM Users to better control who can create resources, since resources have associated costs.

This is optional and if you do not enter anything here, the permissions will be taken from the user credentials used at login. If the necessary permissions are not assigned, the template will fail with a permission error at the time the resource is created in the template.

46

46

Rollback Triggers

Rollback Triggers allow you to monitor the creation of the resources in the CloudFormation Template, and rollback the creation if the alarms set here are triggered.

47

47

Advanced Options

Further down the Options screen you will have additional options under Advanced. You can choose to send notifications using SNS (Simple Notification Service), set a Timeout value (in minutes) in which the CloudFormation will rollback, and assign a stack policy to define the resources that you want to protect from unintentional updates during a stack update.

Note: If you are trying to troubleshoot, it is recommended to set Rollback on failure to No.

After you set whatever options you want, click Next to go the Review screen.

48

48

Create Stack – Review Parameters

Before creating your stack, you have ability to review your selections. If you want to make any changes, click the Specify Details link or Options link to go back to those sections and make changes.

49

49

Review Parameters (cont.)

Scrolling down, you will review the Node Configuration, VPC/Network, Security Group and Key Pair parameters.

50

50

Review – Options

Scrolling further, you will review the Options you entered.

51

51

Create the Stack

At the bottom of the Review screen you must check the “I acknowledge…” box. After checking the box, you can click Create button to begin the creation process.

52

52

What we just

accomplished…

✓

Launched Teradata

in the cloud

© 2016 Teradata

53

53

Monitoring the Creation of the CloudFormation Stack

54

54

Monitor Stack Creation

After clicking the Create button, you will be taken to the CloudFormation service. In the upper right, click the Refresh button and you should see your Stack. Under Status, you should see CREATE_IN_PROGRESS.

Select the box next to the Stack Name and you will be able to monitor the events in the bottom half of the screen.

55

55

Stack Creation

Do NOT try to SSH into an instance during creation, it may cause the stack to fail

To monitor the stack creation progress:

➢ From AWS CloudFormation console, ensure CREATE_IN_PROGRESSdisplays in the Status column

➢ After the stack is created, ensure CREATE_COMPLETE is displayed in the Status column

➢ If the stack fails, scroll back through the Events to find the failure and reason (highlighted in RED)

➢ After the stack is created, the Teradata Database is ready and started

➢ No need to run tdc-init or tdc-start

During the creation, do NOT try and access any of the resources. You need to wait until the creation completes. You can follow the creation progress by scrolling through the events. If there is a failure, you will need to find the failure message (highlighted in red) for the reason for the failure.

56

56

Monitor Stack Completion

After the Stack successfully completes, you will see CREATE_COMPLETED in the Status column. If you go to the EC2 Dashboard, you will be able to view your instances in a running state.

57

57

What Causes a Stack Failure?

Exceeding your AWS account limits

Resources within the AWS region are not available

VPC CIDR Block specified overlaps with another VPC CIDR Block within your AWS account

Permissions not assigned to the IAM role assumed for your AWS account

If the stack fails, you will need to fix the error and launch the stack again

The template is very stable and the reason for failures are usually outside of the template itself; e.g. permissions, resources, or network setup.

58

58

What Causes a Stack Failure? (cont.)

You must launch into an existing VPC and Subnet

Make sure the following options are enabled:

➢ For the VPC: DNS Hostnames and DNS Resolution should be set to yes

➢ For the Subnet: Auto-assign public IPv4 address must be set to yes

There must be an Internet Gateway attached to the VPC

The VPC Route table needs to have a route to the Internet for the IGW

Since you must launch into an existing VPC and Subnet, you need to make sure the following options and rules are set.

Make sure the DNS Hostname and DNS Resolution for the VPC are set to yes. Make sure the Auto-assign public IPv4 address is set to yes for the subnet.

There must be an Internet Gateway attached to the VPC and the VPC’s route table must have n route to the Internet (0.0.0.0/0) for the IGW.

59

59

EC2 Dashboard

After the stack completes, from the EC2 Dashboard, you will see the database instances are running. If you select one of the instance, you will be able to see the instance details.

60

60

What we just

accomplished…

✓

Spun up Teradata

in the cloud

© 2016 Teradata

61

61

Teradata Database Security Group

62

62

Security Groups and EC2 Instances

Security Groups are used to firewall your instances to control incoming traffic

They operate at the instance level

Support ALLOW rules only

Are stateful (outgoing traffic is allowed)

Maximum 50 rules per Security Group

Security Group

EC2

Instance

EC2

Instance

EC2

Instance

EC2

Instance

In AWS, every instance must have a Security Group assigned. Security Groups are firewalls for your instances to determine what traffic is allowed.

63

63

Editing the Security Group

It may be necessary to edit the Security Group that was created for the Teradata Database instances. From the instance details, you can click on the Security Group link to view the Security Group rules that were created by the template.

64

64

Viewing the Security Group Rules

From the CloudFormation template

In the CloudFormation template, we specified in the Remote Access From parameter, the IPv4 address of our client. So when the Security Group was created in the CloudFormation template, access to the Teradata Database (1025) and Teradata Database instance (SSH 22) will only be allowed from that IPv4 address. All other traffic will not be allowed.

65

65

Editing the Security Group Rules

Before

After

If you want to edit the Security Group rules after the Security Group has been created, you can click the Edit button.

66

66

Connecting to the Teradata Database via Teradata Studio™

67

67

Setting Up Teradata Studio™

To create a new Teradata database connection, click on the new connection profile icon.

68

68

Setting Up Teradata Studio™ (cont.)

From the “New Connection Profile” dialog, choose Teradata and supply a name for the new connection, then click the Next button.

In the “New Teradata Connection Profile” dialog, supply the Public IPv4 address or the Public DNS (IPv4) for one of the database instances, the User Name dbc, and the dbc Password that you supplied in the CloudFormation template.

If you want to test the connection first, click the Test Connection button to make sure the information you entered is able to establish a connection with Teradata.

Finally, click the Finish button to be connected.

69

69

Connected Using Teradata Studio™

After connecting, you will be able to interact with the database. From the Query Development window, you can submit a query.

70

70

Launching Viewpoint from the AWS Marketplace

71

71

AWS Marketplace – Viewpoint

https://aws.amazon.com/marketplace/search/results?searchTerms=“viewpoint”

Searching for “viewpoint” in the AWS Marketplace will display the various Teradata Viewpoint options you can choose.

Clicking Teradata Viewpoint (Single Teradata System) link will take you the product overview page.

https://aws.amazon.com/marketplace/search/results?searchTerms=%E2%80%9Cviewpoint%E2%80%9D

72

72

Product Overview

This is the first page that will be displayed after clicking the Teradata Viewpoint link.

73

73

Pricing Information

Scrolling down will then display pricing information. Note that the software charges are $0. Viewpoint is included with the Teradata Database Enterprise software that you subscribed to earlier. But this version will only allow you to monitor one system, so it is more appropriate for a test/dev type of environment, not a production environment where you would need the multi-system version.

74

74

Usage Information

Continuing to scroll, you will see the Usage Information. From this page, you also have links to additional documentation.

75

75

Support Information

Finally, the last page will display the link for both Teradata Support and AWS Support.

76

76

Subscribe

1st

2nd

1. From the product overview page you will click on the Subscribe button. Clicking the subscribe button will take a few minutes to create the subscription.

2. After you have a subscription, you then click on the Configuration button.

77

77

Configure the Software

After clicking the Continue to Configuration button on the previous screen, you will need to make a few configuration choices. There is only one fulfillment option. For software version, you can choose 16.20 or 16.10 and you must choose the region to launch the software. After making you choices, click the Continue to Launch button.

78

78

Launch the Software

On the Launch page, you have a choice to Launch through EC2. This option will take you the EC2 Dashboard menus to launch the Viewpoint AMI. You will need to make the configuration choices using the AWS Console. If you choose Launch from Website, you will make you configuration choices from this web site. The last action is to copy this configuration to the AWS Service Catalog.

Choosing Launch from Website, the EC2 Instance Type is only m4.2xlarge.

79

79

Launch the Software (cont.)

Scrolling down the Launch the Software page, you will next choose the VPC and Subnet to launch the software into.

For Security Group you can choose an existing Security Group or choose to create a new Security Group.

80

80

Creating New Security Group

If you choose to create a new Security Group, you will give the Security Group a name and you can then choose the CIDR blocks in which you will be opening up the ports required by Viewpoint.

Note: port 61616 is need to ActiveMQ and port 5432 is required for PostresSQL.

After you click the Save button, you will then be able to see that the Security Group has been created if you go to the AWS console, EC2 Dashboard and then choose Security Groups menu option.

Note: If you launch into an existing Security Group you will need to make sure that rules for ports 22 (SSH), 80 (HTTP), 61616 (ActiveMQ), 443 (HTTPS) and 5432 (PostgreSQL) exist, or are added.

81

81

Successful Launch

After making your configuration choices, you will see that the software has been successfully deployed.

82

82

EC2 Dashboard – Viewpoint Details

After deploying the software, if you go the EC2 Dashboard, you will now see the Viewpoint instance along with the 2 node Teradata database instances launched previously.

Using the public IPv4 address, you can now connect to Viewpoint through a web browser.

83

83

Connecting and Configuring Viewpoint

84

84

Connecting to Viewpoint

In the address bar of your browser enter the Public IPv4 address

The admin password is the Viewpoint Instance ID

To connect to Viewpoint through a web browser, you will need to enter in the Public IPv4 address. Since this is currently open to all traffic it will not be considered secure and many browsers will display a warning message to that effect.

For AWS, the default admin password is the Viewpoint Instance ID. So you will need to go the EC2 Dashboard and copy and paste the Instance ID into the Password field. You can change the password for the admin user in the Administration portlets – User Manager.

85

85

Configuring Teradata System

After logging into the Viewpoint server, you will need to display the Administration portlets. To view them, in the upper right hand corner, click the Tools icon with the crossed wrench and screw driver.

Clicking Tools will display the various Administration portlets. Choose the Monitored Systems portlet.

86

86

Monitored Systems Portlet

After selecting the Monitored Systems portlet, click the + sign and choose Add Teradata System.

Note: since this is a single system version of Viewpoint, you will only be able to view one of the available systems. If you want to add more than one system, you would need to deploy the multi-system version of Viewpoint.

87

87

Adding Teradata System

After clicking Add Teradata System, you must setup the General parameters.

The first parameter defines a System Nickname for the system you are adding. You will also want to check the Enable system box. The next parameter is the TDPID. For this parameter enter the Private IPv4 address. Again, you can get this from the EC2 Dashboard display for the instance. Do not choose the Public IPv4 address. This instance have been deployed with the VPC on AWS, not externally.

You then will need to supply the Password for tdwm user; the default password is tdwmadmin. After entering in the password, you can then test your connection to the database. If you do not get a successful connection, check that you entered in the correct Private IPv4 address. Another issue is that the Teradata database Security Group is not allowing traffic from the Viewpoint Security Group. We will cover that issue the next slides.

Under Collectors, check the Enable data collectors box. If you do not, you will need to enable the data collectors individually from the Data Collectors setup page that is found underneath General in the Setup menu to the left. Data Collectors are used to collect data that will be stored in the PostgresSQLdatabase on the Viewpoint server and is used to populate the various user portlets. If you do not enable data collectors, your user portlets will be empty when you open them.

‹#›

Lastly, if you have deployed the Enterprise version of Teradata, you must check the box for Enhanced TASM Functions to enable TASM Workload Management functionality.

© 2016 Teradata

88

88

Allow Traffic from Viewpoint

If your connection to the Teradata database fails, if could be because the Teradata database Security Group is not allowing traffic from the Viewpoint Security Group. In the display, you can see that the Viewpoint Security Group (called ViewpointSG) is not displaying as an allowed inbound source.

In this case we must Edit the Security Group to add the Viewpoint Security Group as an inbound source.

89

89

Edit Teradata Security Group

Click the Add Rule button. Then, from the Type drop down menu, choose All traffic and in the Source columns, choose Custom and enter the Viewpoint Security Group ID. You can get this ID from the EC2 Dashboard, Security Groups menu option. After making those entries, click the Save button.

90

90

Traffic Allowed from Viewpoint

You can now see all inbound traffic from the Viewpoint Security Group will be allowed into the Teradata database Security Group.

After making these additions, the Viewpoint server should be able to successfully connect to the Teradata database.

91

91

Viewpoint Dashboard

Closing the administration portlets, you can then click on the Dashboard icon to display the dashboard for the Teradata system.

92

92

Scaling Teradata Database Instances Up or Down

93

93

Scaling Instances Up or Down

You can only change the EC2 instance type to a larger or smaller instance type within the same instance family

Follow these steps to scale the Teradata instances Up or Down:

➢ Shutdown the Teradata Database

➢ Stop the Teradata database EC2 instances

➢ Change the EC2 instance type of each Teradata instance separately

➢ Start the Teradata database EC2 instances

After starting the Teradata EC2 instances, the Teradata database will be running on the new instance types

Scaling up or down is changing the EC2 instance types to a larger or smaller instance type within the same instance family. You cannot change from an m4 instance family to an m5 instance family.

To change the instance family you will do the following:

• Shutdown the Teradata database

• Stop the EC2 instances

• Change the instance type

• Start the EC2 instances

Note: after starting the EC2 instances, the database will be started automatically. This is the default setting.

94

94

Shutdown the Teradata Database

To shutdown the Teradata Database, you must SSH into one of the nodes. When you are logged onto the node, you will see information related to the Teradata database tier and EC2 instance information. In this case we are running the Enterprise tier on a m4.4xlarge instance type.

1. From the command line, you need to enter “sudo su –” to switch to the supervisor.

2. To display the Teradata Database state, enter “pdestate –a” and you will see that the database is running, logons are enabled and the system is currently quiesant.

3. To shutdown the database, you need to enter “tpareset –x shutdown” the –x option says shutdown and the next parameter is a required comment, in this case we just entered shutdown, but it could have been any comment. Note, if you want to restart the database, which is a common task, then enter –f.

4. After entering the tpareset command, you will be prompted to reply, in this case we enter “y”, to shutdown the database.

5. Entering “pdestate –a” again, you see that the database has been shutdown.

95

95

Stopping the EC2 Instances

After the database has been shutdown, from the EC2 Dashboard, select each of the Teradata database nodes and from the Actions drop down, select Instance State and Stop.

When prompted to stop the instances, click the Yes, Stop button.

96

96

Change the Instance Type

To change the instance type, you must do it individually. So select one Teradata database instance and from the Actions drop down, select Instance Settings and Change Instance Type.

97

97


Even though you will see all of the different instance types, you can only change to another supported instance type in the family

If you choose an unsupported instance type you will get an error that it is not supported

In this case, we chose to change from an m4.4xlarge to m4.10xlarge.

Note: Even though you will see all of the different instance types, you can only change to another supported instance type in the same instance family. If you choose an unsupported instance type you will get a prompt that it is not supported.

98

98


You can now see that the instance type is now m4.10xlarge. The next step is to start the Teradata database.

99

99

Starting the Teradata Database

Select the Teradata database instances and from the Actions drop down, select Instance State and Start.

You will be prompted to start the instances, click the Yes, Start button.

100

100

Starting the Teradata Database

After starting the instances, which can take about 3-5 minutes, you will see that the Teradata database is now running on m4.10xlarge instances.

101

101

Validate Teradata has Started

After the instances are started, the default setting is to also start the Teradata database. You do not need to enter any additional start commands to bring the database up.

You can now see that the database is now running on the new instance types, in this case m4.10xlarge. And when you enter “pdestate –a”, the database is running.

102

102

Scaling Teradata Database Instances In and Out

103

103

Scaling Instances In or Out

Scaling instances in or out is also known as folding or unfolding

Unfolding adds more Compute, Memory, I/O Bandwidth and Parsing Engines (PEs) to the configuration

Follow these steps to scale the Teradata instances In or Out:

➢ Shutdown the Teradata Database

➢ Enter the commands to start the folding or unfolding process

After folding or unfolding process completes, the Teradata Database will be running on the new configuration

The number of AMPs and disk storage remains the same

Scaling is also know as unfolding or folding. You can unfold or fold with a factor of 2x or 4x. Unfolding adds additional, CPU, memory, I/O bandwidth and PEs. It does not add more AMPs or storage.

To unfold a system, you need to shutdown the database and enter the commands to start the unfolding or folding process. After the process completes, the database will be running on the new configuration.

104

104

What Happens During an Unfold?

If you want to unfold a 2 node system 2x, to a 4 node system, the following steps will be performed during the unfold process:

➢ 2 more nodes will be provisioned within the same cluster placement group

➢ Half the EBS storage disks will be detached from the existing nodes

➢ The detached EBS storage disks will be attached to the new nodes

➢ The new nodes will be clustered with the existing nodes

You can unfold a system 2x or 4x. So for example, you want to unfold a 2 node system 2x to a 4 node system, the following steps will be performed during the unfold process:

• 2 new nodes will be provisioned within the same cluster placement group

• Half of the EBS storage disks will be detached from the existing nodes

• Attached the EBS storage disks to the new provisioned nodes

• Cluster the nodes for fallback protection

105

105

What Happens During an Unfold?

Before

2 Nodes

After

4 Nodes

This example shows a 2 node system being unfolded to a 4 node system.

106

106

Displaying the Current Configuration

The tdc-unfold –d command will display the current configuration and what unfold options you have. We can unfold this system using either the 2x or 4x options.

107

107

Before Starting the Unfold Process

Before starting the unfold process, we have 2 nodes

Before starting the unfolding, you can see from the EC2 Dashboard, there are 2 database instances.

108

108


Before starting the unfold process, we have to shutdown the database

Before starting the unfold process, we need to shutdown the Teradata database.

To shutdown the database: 1. From the command line, you need to enter “sudo su –” to switch to the

supervisor.2. To shutdown the database, you need to enter “tpareset –x shutdown” the

–x option says shutdown and the next parameter is a required comment, in this case we just entered shutdown, but it could have been any comment. Note, if you want to restart the database, which is a common task, then enter –f.



109

109

Starting the Unfold Process

In this example we issued the command to unfold the system 2x. The current configuration is displayed along with what the configuration will be after it is unfolded.

Note: From the notes, the unfolded system will consume additional resources and add additional costs. It will increase performance but it will not give you additional storage capacity. If you want additional storage capacity you can do that using the tdc-expand-storage command.

Entering “yes” to the continue command, starts the unfolding process.

110

110

Completing the Unfold Process

After entering “yes” to start the unfold process, you can see that in this case, unfolding a system from 2 nodes to 4 nodes, took approximately 10 minutes. The estimated time for an unfold 2x is 20 minutes.

After the unfold process completes successfully, the current configuration is now 4 nodes from 2 nodes. The number of PEs went from 4 to 8 total, the number of AMPs remained at 40 total, but the number of AMPs per node went from 20 to 10. Also, the total CPUs went from 32 to 64 and the total memory went from 130GB to 260GB.

Also note: that after the unfold process is completed, the database is up and running.

111

111

After the Unfold Process Completes

After the unfold process completes, we have 4 nodes

After the unfolding, you can see from the EC2 Dashboard, there are 4 database instances.

112

112


113

113


Allows the customer to start small and grow when needed, avoiding

paying for higher non-utilized storage

EBS Storage can be expanded after Database instances are

launched without reconfiguring the system

Customers can now select any storage value from 5TB to 60TB, per

node, and have the ability to expand their storage in 1TB increments

Expanding storage increases the size of each pdisk EBS volume on

the system up to a maximum of 3TB

Expanding storage allows customers to expand existing EBS storage volumes.

Total EBS storage per node can be specified from a minimum of 5TB to 60TB, and you have the ability to expand the storage in 1TB increments up to the 60TB total.

The maximum EBS storage per disk is 3TB.

114

114

Expanding EBS Storage (cont.)

Does not increase the number of AMPs per node

Requirements to expand storage:

• System can not be in an unfolded state

• Must have 20 AMPs/EBS volumes per node before expansion

• Can only be done for m4 instance types

To be able to expand EBS storage, the system must be in an unfolded state and is only available on the m4 instances which support EBS storage. The i2 and d2 instances support local storage, not EBS storage.

115

115


Before starting the storage expansion, we have to shutdown the database

Before starting the storage expansion, we need to shutdown the Teradata database.

To shutdown the database: 1. From the command line, you need to enter “sudo su –” to switch to

supervisor privileges.2. To shutdown the database, you need to enter “tpareset –x shutdown” the

–x option says shutdown and the next parameter is a required comment, in this case we just entered shutdown, but it could have been any comment. Note, if you want to restart the database, which is a common task, then enter –f.



116

116

Starting the Storage Expansion

In this example, we are going to increase the total storage by 5TB. This will increase each EBS volume 125GB.

Respond “yes” to the prompt to start the expansion process.

117

117

Completing the Storage Expansion

At the completion of the storage expansion, the database will be started with the additional storage available.

118

118

Scaling Query Comparison

119

119

AWS Sample Query Comparisons

Scale Out

Sample query aggregates sales data from the Sales Transaction Line table and joins it to the Item and Brand tables to report quarterly sales by Brand

# Nodes

(m4.4xlarge)

Database

Physical Design

Elapsed

Time

2 (40 AMPs) PI only 1:04

2 (40 AMPs) Row Partitioning 0:03

4 (40 AMPs)

Unfolded

PI only 0:32

4 (40 AMPs)

Unfolded

Row Partitioning 0:01

The table above compares a sample query which does an aggregation from a large sales transition line table and reports it by brand and item. The top rows compare the elapsed times on a 2 node m4.4xlarge reading data from a sales transaction line table with only a primary index and from a sales transaction line table with row partitioning implemented. The bottom rows compare the same query after unfolding the 2 node system to a 4 node system. Unfolding the system added more CPU, memory, and I/O bandwidth. The numbers of AMPs and storage space remained the same. By adding more hardware, we can see about a 50% improvement in performance over row partitioning alone.

© 2016 Teradata

120

120

AWS Sample Query Comparisons

Scale Up

Sample query aggregates sales data from the Sales Transaction Line table and joins it to the Item and Brand tables to report quarterly sales by Brand

Instance Size

(2 nodes)

Database

Physical Design

Elapsed

Time

Instance

CPUs

Instance

Cost

m4.4xlarge PI only 1:04 16 $0.80

m4.4xlarge Row Partitioning 0:03

m4.10xlarge PI only 0:29 40 $2.00

m4.10xlarge Row Partitioning 0:01.7

m4.16xlarge PI only 0:12.5 64 $3.20

m4.16xlarge Row Partitioning 0:00.9

In this test, we compared the performance of the same query on the 2 node m4.4xlarge against a 2 node m4.10xlarge and 2 node m4.16xlarge. By scaling up, we are adding faster CPUs and more memory. The m4.10xlarge comparably, has 2.5 times more CPUs than the m4.4xlarge and the m4.16xlarge has 4 times more CPUs along with more memory.

© 2016 Teradata

121

121

Partitioning Explanation

Why does the query perform better with row partitioning?

Sales Transaction Detail table table defined without Row Partitioning across 4 AMPs

Rows are ordered by Row Hash

SELECT … WHERE TxnDate = DATE '2018-01-03’;

Full Table Scan is executed

'01' 1028 2018/01/03

'12' 1031 2018/01/03

'28' 1032 2018/01/03

'03' 1016 2018/01/02

'17' 1013 2018/01/02

'48' 1023 2018/01/02

'14' 1001 2018/01/01

'35' 1007 2018/01/01

'39' 1011 2018/01/01

'23' 1040 2018/01/04

'30' 1038 2018/01/04

'42' 1047 2018/01/04

RH Txn# TxnDate

'10' 1034 2018/01/03

'29' 1033 2018/01/03

'34' 1029 2018/01/03

'07' 1017 2018/01/02

'16' 1021 2018/01/02

'45' 1015 2018/01/02

'06' 1009 2018/01/01

'26' 1002 2018/01/01

'36' 1012 2018/01/01

'13' 1037 2018/01/04

'21' 1045 2018/01/04

'36' 1043 2018/01/04

RH Txn# TxnDate

'19' 1025 2018/01/03

'40' 1035 2018/01/03

'47' 1027 2018/01/03

'09' 1018 2018/01/02

'27' 1014 2018/01/02

'44' 1022 2018/01/02

'04' 1008 2018/01/01

'24' 1004 2018/01/01

'32' 1003 2018/01/01

'05' 1048 2018/01/04

'15' 1042 2018/01/04

'33' 1039 2018/01/04

RH Txn# TxnDate

'25' 1036 2018/01/03

'31' 1026 2018/01/03

'46' 1030 2018/01/03

'02' 1024 2018/01/02

'11' 1019 2018/01/02

'22' 1020 2018/01/02

'08' 1006 2018/01/01

'20' 1005 2018/01/01

'43' 1010 2018/01/01

'18' 1041 2018/01/04

'38' 1046 2018/01/04

'41' 1044 2018/01/04

RH Txn# TxnDate

✓

✓

✓

✓

✓

✓

✓

✓

✓

✓

✓

✓

So why does the query perform dramatically better when we implemented row partitioning. In the table without row partitioning, the rows are uniformly distributed across the available AMPs based on the Row Hash of the primary index column vale and then each AMP will order the rows by the Row Hash of the primary index value. In this case, when we need to extract rows for a particular date, all of the rows on each AMP will need to be read.

© 2016 Teradata

122

122

Partitioning Explanation (cont.)

Why does the query perform better with row partitioning?

Sales Transaction Detail table table defined with Row Partitioning across 4 AMPs

Rows are ordered by Partition (TxnDate) and Row Hash

SELECT … WHERE TxnDate = DATE '2018-01-03’;

Single Partition Scan is executed

'14’ 1001 2018/01/01

'39’ 1011 2018/01/01

'01’ 1028 2018/01/03

'35’ 1007 2018/01/01

'17' 1013 2018/01/02

'42’ 1047 2018/01/04

'03’ 1016 2018/01/02

'28’ 1032 2018/01/03

'23’ 1040 2018/01/04

'48’ 1023 2018/01/02

'12' 1031 2018/01/03

'30’ 1038 2018/01/04

RH Txn# TxnDate

'36’ 1012 2018/01/01

'29' 1033 2018/01/03

'34' 1029 2018/01/03

'26’ 1002 2018/01/01

'16' 1021 2018/01/02

'36’ 1043 2018/01/04

'06' 1009 2018/01/01

'10’ 1034 2018/01/03

'13’ 1037 2018/01/04

'07’ 1017 2018/01/02

'45’ 1015 2018/01/02

'21' 1045 2018/01/04

RH Txn# TxnDate

'27’ 1014 2018/01/02

'05’ 1048 2018/01/04

'33’ 1039 2018/01/04

'32’ 1003 2018/01/01

'19’ 1025 2018/01/03

'15’ 1042 2018/01/04

'04' 1008 2018/01/01

'44’ 1022 2018/01/02

'40’ 1035 2018/01/03

'24’ 1004 2018/01/01

'09’ 1018 2018/01/02

'47’ 1037 2018/01/03

RH Txn# TxnDate

'25' 1036 2018/01/03

'31' 1026 2018/01/03

'41’ 1044 2018/01/04

'08’ 1006 2018/01/01

'43' 1010 2018/01/01

'22' 1020 2018/01/02

'20' 1005 2018/01/01

'11’ 1019 2018/01/02

'38’ 1036 2018/01/04

'02’ 1024 2018/01/02

'46’ 1030 2018/01/03

'18' 1041 2018/01/04

RH Txn# TxnDate

✓

✓

✓ ✓

✓

✓

✓

✓

✓

✓

✓

✓

When the rows are partitioned by date, the rows are still uniformly distributed by the primary index value Row Hash, but now the rows are first ordered by the partitioned column, date, and then within the partition by Row Hash. This will group the rows for the same date into the same partition. Now instead of reading all of the rows, we only need to read the rows for a single partition, greatly reducing the number of I/O’s and resulting in dramatic performance improvement. This is one of many physical design options available with the Teradata Database that can came result in performance improvement without having to add additional costly hardware. In the case of partitioning, the cost is minimal (2 additional bytes per row for the partition number).

© 2016 Teradata

123

123

Implementing Partitioning

To Implement Partitioning simply add a PARTITION BY clause specifying the partitioning interval

CREATE SET TABLE SALES_TRANSACTION_LINE_PPI ,

( Sales_Tran_Id INTEGER TITLE 'Sales Tran Id' NOT NULL,

Sales_Tran_Line_Num SMALLINT TITLE 'Sales Tran Line Num' NOT NULL,

Item_Id DECIMAL(15,0) TITLE 'Item Id' NOT NULL,

Item_Qty SMALLINT TITLE 'Item Qty' NOT NULL,

Unit_Selling_Price_Amt DECIMAL(8,4) TITLE 'Unit Selling Price Amt' NOT NULL,

Unit_Cost_Amt DECIMAL(8,4) TITLE 'Unit Cost Amt',

Tran_Line_Status_Cd CHAR(1) TITLE 'Tran Line Status Cd',

Sales_Tran_Line_Start_Dttm TIMESTAMP(6) TITLE 'Sales Tran Line Start Dttm',

Tran_Line_Sales_Type_Cd CHAR(2) TITLE 'Tran Line Sales Type Cd',

Sales_Tran_Line_End_Dttm TIMESTAMP(6) TITLE 'Sales Tran Line End Dttm',

Tran_Line_Date DATE FORMAT 'yyyy-mm-dd',

location_id INTEGER )

PRIMARY INDEX ( Sales_Tran_Id )

PARTITION BY RANGE_N(Tran_Line_Date BETWEEN DATE ‘2015-01-01' AND DATE ‘2019-12-31'

EACH INTERVAL '1' DAY );

To implement partitioning, you simply need add a PARTITION BY clause. In this example, we are using the keyword RANGE_N to partition the data by an ascending range of values. In this case, by the partitioning column Tran_Line_Date. We are going to create daily partitions between the dates of 2015-01-01 and 2019-12-31.

© 2016 Teradata

124

124

Summary

125

125

DIY Deployment on AWS Summary

In this presentation, we covered:

➢ The Teradata Database software in the AWS Marketplace

➢ How to Launch the Teradata Database from the AWS Marketplace

➢ How to monitor the creation of the CloudFormation stack

➢ The Teradata Database Security Group that is created

➢ How to connect to the Teradata Database via Teradata Studio™

➢ How to launch Viewpoint from the AWS Marketplace

➢ How to connect to and configure Viewpoint

➢ How to scale the Teradata Database instances up or down

➢ How to scale the Teradata Database instances in and out

➢ How to expand EBS Storage

126

126

Cloud Consumption Options

© 2016 Teradata

127

127


Do-it-yourself Deployment

“I’ll deploy & manage in AWS myself.”

• Benefit: Broad system customization




• Discovery Analytics / Data Labs

© 2016 Teradata

128

128


IntelliCloud as-a-service

“Deploy in AWS & manage it for me.”

• Benefit: No babysitting required!


• Production Analytics

• Disaster Recovery

• Test / Dev / QA / Data Labs

© 2016 Teradata

129

129

IntelliCloud for AWS Attributes

• Predictable cost• Flexible pricing, no hidden fees

• Convenient subscription options• Start small and grow as needed

• “Just bring your data and a DBA”• We manage the environment for you

Service Feature Included

Software ✓

Infrastructure ✓

Monitoring ✓

Encryption ✓

Compliance ✓

Web Console ✓

Onboarding ✓

Up to 99% SLA ✓

Backups ✓

© 2016 Teradata

130

130

IntelliCloud Compliance Audits

• Audited for compliance• GDPR

• PCI

• HIPAA

• SOC 1 and 2

• ISO 27001

• Designed to comply• EU Model Clauses

• EU-US Privacy Shield

• Additional regulations

We invest ~$1M each year in recurring third-party audits

© 2016 Teradata

131

131

Elasticity and Cost Optimization

•View site details

•View utilization

• Scale compute

• Stop/Start database

•Modify backup schedule

• Set firewall rules (coming)

© 2016 Teradata

132

132

IntelliCloud Management Console

© 2016 Teradata

133

133

System Resources

This is the IntelliCloud Management Console (IMC) dashboard. The top pane displays the various systems managed by IntelliCloud and the bottom pane displays the details about those systems.

In the Dashboard, you can view specific details for a database site by selecting it in the Database Sites pane and scrolling down the page.

The Database Sites pane includes CPU and disk space utilization based on the average of all nodes for the last 24 hours.

In this example, the AWS system in us-west-2 region, is at 21% disk utilization and 73% CPU utilization over the last 24 hours.

This system is a development system running Teradata Database version 15.10. It is deployed on 16 m4.10xlarge instances with 20TB per node, 320TB total. The current AWS configuration is 20 AMPs per node, so each AMP is managing 1TB of data.

134

134

Resource Utilization

Scrolling down you will see the Utilization report. Utilization displays a graph of the CPU load and amount of storage used over time. You can configure the this report to display by day, week or month intervals.

135

135

Separate Compute and Storage

Moving your cursor along the high or low peaks will display a popup showing the CPU and Disk metrics.

136

136

Backups, IPs, Support Tickets

Continuing to scroll down the Dashboard, the last 3 displays are Backups, Network Settings and Support Tickets.

By default, IntelliCloud sites are backed up daily. The two most recent full backups are retained at the primary data center. Additional backup features depend on the deployment platform.

For Backup Jobs, you edit the backup job by clicking the Edit Backup Job link. This will open a dialog box allowing you to change the backup job schedule.

• Additional backup services are available by submitting a support ticket: Enable auto-restart to re-run a failed backup job.

• Enable weekly or monthly backups.• Suspend execution of backup jobs.• Request NOSYNC option for backup jobs.

For support tickets you can see previously submitted tickets and you can also open a new ticket if you want. The IntelliCloud Management Console provides a convenient interface to the IntelliCloud support portal. Creating a ticket from the IntelliCloud Management Console generates an IntelliCloud support ticket.

137

137

Additional Capabilities

In the upper left corner, if you click the menu icon, you will be displayed with other menu choices. The following slides will show the examples of User Management, Metering and Availability Reports.

138

138

User Management

© 2016 Teradata

139

139

Instance-Specific Metering

The first display in Metering allows you to look at the instance hours consumed on the various instance types over time. You can filter the display by Platform (AWS), Databases and the interval (hourly, daily or monthly) across a specified data range.

140

140

Availability Reports

Availability reports show availability and outage statistics for each IntelliCloud site, including database downtime and network connectivity disruptions.

• Metrics are available on a calendar month basis, for the last 12 months.• Date ranges are in UTC.• Reports are available by the 10th of each month for the previous month.• Click an availability percentage to view the detailed Availability Report

for the target interval, including a list of incidents. • In an Availability Report, click an incident ID to view all details in the

Cloud Service Portal.

Outage exceptions excluded from Availability Reports: Customer's misuse of Cloud Services

• Customer's internet connectivity failure• Non-Teradata internet traffic issues• Scheduled downtime

141

141

Scale Up/Down

© 2016 Teradata

142

142

Current Size = Medium

With IntelliCloud you can scale the system within the same instance family. This is referred to as scaling up or down. From the IMC dashboard, you click on the “SCALE UP/DOWN” link.

143

143

Scale Up from Medium to Large

After clicking on the “SCALE UP/DOWN” link, you will get a popup screen with the other instance sizes within the same instance family that you can choose from. In this example, we are choosing to scale up from a m4.10xlarge instance to a larger m4.16xlarge.

Note: that you there will be additional AWS CPU charges and also the rate of TCore consumption will be higher.

144

144

Just a Restart, No Redistribution

After about 5 to 10 minutes, the scale up will be completed and you will see that the system is now running on m4.16xlarge instances instead of m4.10xlarge instances.

145

145

Scale Out/In

© 2016 Teradata

146

146

Current Config = 16 Instances

With IntelliCloud you can scale the system out or in to increase or decrease the number of nodes. This is referred to as scaling out or in. This will allow you to increase the CPU, memory and I/O bandwidth capacity. It will also increase the number of Parsing Engines (each node will have 2 PEs ), but will not increase the number of AMPs or storage.

From the IMC Dashboard, click the SCALE OUT/IN link.

147

147

Scale Out Compute By 2X or 4X

After clicking the SCALE OUT/IN link, you will get a pop-up screen that will allow you to choose the scaling options you want. 1x, 2x or 4x. In this example, we are going to scale out 2x, from 16 nodes to 32 nodes.

Note that there will be additional AWS CPU charges and also the rate of TCore consumption will be higher.

148

148

Just a Restart, No Redistribution

After about 20 minutes, the scale out will be completed and you will see that the system is now running on 32 m4.16xlarge instances instead of 16 m4.16xlarge instances.

149

149

Stop/Start

© 2016 Teradata

150

150

Optimize Spend When Not Using

With IntelliCloud you can scale the system out or in to increase or decrease the number of nodes. This is referred to as scaling out or in. This will allow you to increase the CPU, memory and I/O bandwidth capacity. It will also increase the number of Parsing Engines (each node will have 2 PEs ), but will not increase the number of AMPs or storage.

From the IMC Dashboard, click the SCALE OUT/IN link.

151

151

Turn Off Compute (Keep Storage)

© 2016 Teradata

152

152

IntelliCloud Management Console

• Provision sites

• View system config, status info

• View compute, storage utilization

• Manage elasticity

• Monitor, manage backups

• Open, monitor support tickets

• Manage user accounts

153

153

What we just

accomplished…

✓

Learned about

As-a-service options

© 2016 Teradata

154

154

Connecting with TableauCloud Analytics Experience

Kevin Bogusch, Cloud Marketing, Teradata

© 2016 Teradata

155

Agenda

155

• Connect with Teradata Studio

• Load our data set

• Connect with Tableau

• Investigate our data set

Follow Me

Twitter

© 2016 Teradata

156

156

Let’s Load Our Data

© 2016 Teradata

157

157

Start Teradata Studio

• Double-click the Teradata Studio

icon on your remote desktop

• Studio will take about a minute to

get ready

• Studio is available free

for Teradata customers from

https://downloads.teradata.com

© 2016 Teradata

https://downloads.teradata.com

158

158

Create a Connection

• The New Connection Profile

window will appear. If not, on the top right, click Query

Development, then on the left, right-click

Database Connections and select New.

• Click Teradata in the list

• Give it a name, like “CAE”

• Click Next

© 2016 Teradata

159

159

Finishing the Connection

• Enter the IP address of your

database for the server name

• Enter DBC for the user name

• Enter the DBC password that you

used in your stack

• Click Save Password

• Click the Add… button

© 2016 Teradata

160

160


• Select “TCP” from the JDBC Property drop-down

• The TCP Properties window appears

• Select “KEEPALIVE” from the drop-

down

• Click on OK, and then OK again

© 2016 Teradata

161

161


• Click the Finish button

• You will see the connection expand

on the left side

• Click the > (arrow) to expand it to

ensure that you are connected

© 2016 Teradata

162

162

Create a Database

• In the upper right, Click Query

Development

• From the File menu,

choose Open File…

• Select

CAE_attendee_data_demo.sqlfrom the desktop

© 2016 Teradata

163

163

Create a Database

• Click anywhere inside the SQL, just

to make sure the cursor is there

• Press Ctrl+A to select all of the SQL

• Press Ctrl+C to copy it

• Click on the SQL Editor tab

• Press Ctrl+V to paste it

© 2016 Teradata

164

164

Create a Database

• Click the green arrow at the upper

right

• You will know that it is successful if

there are no errors (no red marks) in the bottom window.

© 2016 Teradata

165

165

Load the Data

• Click Data Transfer in the upper

right

• Expand the database connection

on the left:

DBC

→ Databases

→ CAE

→ Tables

→ PARKING_VIOLATION

© 2016 Teradata

166

166

Load the Data

• Right-click on (1)

PARKING_VIOLATION

• (2) Select Data

• (3) Then select Load Data

• Change the source type to

External File

• Click the Launch button

1

2

3

© 2016 Teradata

167

167

Load the Data

• If needed, drag the right edge of

the window a little bit to the right

• (1) Change the file type to

Excel (.xlsx), if needed

• (2) Click the Browse button and

find CAE_parking_violation.xlsx

• (3) Click Column Labels in First Row

• Click the Finish button

• Data will load in about 30 seconds

• You will see Rows Transferred

1

2

3

© 2016 Teradata

168

168

Let’s Visualize Our Data

© 2016 Teradata

169

169

Start Tableau

• Click the Tableau Desktop icon on

your remote desktop

• Tableau should start in less than a

minute

• Thank you to Tableau as a

corporate sponsor!

© 2016 Teradata

170

170

Connect with Tableau

• Click the Teradata Studio icon on

your remote desktop

• On the left, under To a Server,

select More…

• Click on Teradata

© 2016 Teradata

171

171

Connect with Tableau

• Enter the IP address of your

database for the server

• Enter DBC for the user name

• Enter the DBC password that you

used in your stack

• Click the Sign In button

© 2016 Teradata

172

172

Choose Your Database & Table

• In the left-hand menu, click the

down arrow for database

• Enter CAE

• Click the blue plus sign

• Enter parking_violation

• Click the blue plus sign

© 2016 Teradata

173

173

Validate Your Table

• The table appears in the right, near the top

• Click the Update Now button to see

the data

• The data appears below in a

spreadsheet

© 2016 Teradata

174

174

View Your Data

• At the bottom left, click on Sheet 1

• Scroll down the Dimensions box to

the bottom

• Click and drag Violation time up to

the Columns area

• At the bottom left, in the Measures

box, click and drag Number of

Records to the Rows area

© 2016 Teradata

175

175

View Your Data

• In Dimensions, click and drag

Vehicle color on to the “Color” boxin Marks

• To the far right, click Show Me to hide the menu

• In the Vehicle color list, click on the

down arrow (it will appear)

• Click on Sort…

© 2016 Teradata

176

176

View Your Data

• Click the Descending option

• Click on the Field option, and leave

it at “Number of Records”

• Click the OK button

© 2016 Teradata

177

177

View Your Data

• You now have a breakdown of the

number of tickets written at each

hour for the 2018 fiscal year (June

to July), categorized by color

• You can drag Vehicle color off of

the Marks panel and put a different

category there

© 2016 Teradata

178

178

Questions You Can Answer

• Do boroughs differ significantly

when they write their tickets?

• Which agencies write the most

tickets

• What states (based on car

registration) are ticketed the most?

• Does it make a difference what

color your car is?

• What code violation is given the

most and when?

• Are certain vehicle makes targeted

more than others?

© 2016 Teradata

179

179

What we just

accomplished…

✓

Created an

analytic visualization

© 2016 Teradata

180

180

Closing Remarks

© 2016 Teradata

181

181

What Did We Do Today?

• Today you learned:

• How to spin up Teradata in the cloud

• Options for Do-it-yourself and As-a-service

• How to use Teradata Studio for (new) analytics

© 2016 Teradata

182

182

Cloud Consumption Categories

Do-it-yourself“I’ll deploy & manage myself.”

•Benefit: System customization

•Use cases recommended:



• Discovery Analytics / Labs

As-a-service“Deploy & manage it for me.”

•Benefit: No babysitting required!

•Use cases recommended:

• Production Analytics

• Disaster Recovery

• Test / Dev / QA / Data Labs

© 2016 Teradata

183

183

Consistency and Portability

Pricing Freedom

Teradata Hardware

Commodity Hardware

Public CloudBYOL

Teradata Cloud

Subscription-based Teradata licenses are portable

© 2016 Teradata

184

184

Strong Services Organization

`ADVISE & ARCHITECT

Advise on Business Outcomes

Helps customers set an analytic vision that aligns business and IT, assesses current state capabilities, and evaluates technology & approaches

DESIGN & IMPLEMENT

Enable an Ecosystem Architecture

Design and implement analytic ecosystems, the underlying data, and platforms

OPTIMIZE & MANAGE

Ensure Value Delivery

Support customers’ analytic ecosystems and drive continuous improvement; includes Support & Managed Services

Artificial Intelligence

Strategy

Business & Analytics

Capability Review


Roadmap


Governance

Ecosystem Architecture

Definition

Rapid Analytic Consulting

Engagement (RACE)

AnalyticOps &

Optimization

Business Continuity Operations

Managed Database & Application Operations

Managed Ecosystem

Administration

Elastic Provisioning

Ecosystem Upgrades & Migrations

Support Services

Advanced Analytics &

Data Science

AnalyticOps Foundation

BI & Data Visualization

Data, Acquisition & Integration

Data LakeData Platform & Foundation

Data Security & Privacy

Teradata Applications

Data Design & Access

© 2016 Teradata

185

185

Let’s Keep Learning…

• Connect with cloudy colleagues

• Expand your network

• View and read the info on our website

• Many videos, whitepapers, blogs

• Tap into Teradata Education Network resources

• Dozens of online courses

© 2016 Teradata

186

186

http://analyticsexperience.com/

© 2016 Teradata

http://analyticsexperience.com/

187

187

Spin Up on Your Own

© 2016 Teradata

188

188

Thank You to Our Sponsors!

© 2016 Teradata

189

189

Take the Survey!

Your input helps us

improve how we serve you

© 2016 Teradata

Documents

Cloud Analytics Experience...4 4 Advanced Partner Status •Top strategic partner in APN •Multiple levels of collaboration •Heavy investment in training •>5,100 accreditations