3
hvr-software.com | [email protected] | 135 Main Street, Suite 850, San Francisco, CA 94105 © Copyright 2018. All Rights Reserved. HVR Software. Virtually all organizations are either considering a (full or partial) move to the cloud or have already started it, with some running the bulk of their IT infrastructure and services in the cloud. Amazon Web Services (AWS) is a commonly selected cloud provider. This datasheet describes HVR’s continuous data integration support for AWS technologies. Why HVR for AWS HVR delivers efficient continuous data integration for the cloud: on-premises to cloud, cloud to on-premise and cloud to cloud environments. With HVR, you can set up once and deploy often, creating efficiencies across your team. Additionally the HVR platform enables you to feed multiple destinations continuously and at once. Starting with tables in an existing database or application, HVR enables you to: Start with table creation by automatically mapping your source system data types to compatible loss-less data types in the AWS destination. Perform an initial load integrated with continuous log-based change data capture (CDC). Move data in real-time with log-based CDC. You can achieve optimized continuous delivery onto your AWS technologies. Check your data with data validation and repair. This is a unique function that enables you to ensure the data you have moved is correct and alerts you if it is not. Monitor your data statistics management console in which you can setup the data flows and any automated alerts, also giving access to rich data movement insights. Installing HVR in Your Environment HVR’s modular architecture is designed to work well on high latency, low bandwidth networks. Installing the HVR architecture in your environment is low impact and includes the use of agents and a hub. One of the installations in an HVR setup plays the role as the coordinator and is referred to as the hub. The hub always initiates communication and hence determines in which direction the firewall must be opened. You can choose where to run the hub. (e.g. many on-premises to AWS setups run the hub on-premises and only open the firewall to AWS) Data Integration for AWS DATASHEET

DATASHEET Data Integration for AWS · Amazon Web Services (AWS) is a commonly selected cloud provider. This datasheet describes HVR’s continuous data integration support for AWS

  • Upload
    others

  • View
    9

  • Download
    0

Embed Size (px)

Citation preview

Page 1: DATASHEET Data Integration for AWS · Amazon Web Services (AWS) is a commonly selected cloud provider. This datasheet describes HVR’s continuous data integration support for AWS

hvr-software.com | [email protected] | 135 Main Street, Suite 850, San Francisco, CA 94105

© Copyright 2018. All Rights Reserved. HVR Software.

Virtually all organizations are either considering a (full or partial) move to the cloud or have already started it, with some running the bulk of

their IT infrastructure and services in the cloud. Amazon Web Services (AWS) is a commonly selected cloud provider.

This datasheet describes HVR’s continuous data integration support for AWS technologies.

Why HVR for AWSHVR delivers efficient continuous data integration for the cloud: on-premises to cloud, cloud to on-premise and cloud to cloud environments.

With HVR, you can set up once and deploy often, creating efficiencies across your team. Additionally the HVR platform enables you to feed

multiple destinations continuously and at once.

Starting with tables in an existing database or application, HVR enables you to:

Start with table creation by automatically mapping your source system data types to compatible loss-less data types in the AWS destination.

Perform an initial load integrated with continuous log-based change data capture (CDC).

Move data in real-time with log-based CDC. You can achieve optimized continuous delivery onto your AWS technologies.

Check your data with data validation and repair. This is a unique function that enables you to ensure the data you have moved is correct and alerts you if it is not.

Monitor your data statistics management console in which you can setup the data flows and any automated alerts, also giving access to rich data movement insights.

Installing HVR in Your Environment  HVR’s modular architecture is designed to work well on high latency, low bandwidth networks. Installing the HVR architecture in your

environment is low impact and includes the use of agents and a hub.

One of the installations in an HVR setup plays the role as the coordinator and is referred to as the hub. The hub always initiates communication

and hence determines in which direction the firewall must be opened.  You can choose where to run the hub.

(e.g. many on-premises to AWS setups run the hub on-premises and only open the firewall to AWS)

Data Integration for AWSDATASHEET

Page 2: DATASHEET Data Integration for AWS · Amazon Web Services (AWS) is a commonly selected cloud provider. This datasheet describes HVR’s continuous data integration support for AWS

hvr-software.com | [email protected] | 135 Main Street, Suite 850, San Francisco, CA 94105

© Copyright 2018. All Rights Reserved. HVR Software.

An agent, which is an additional installation of software on or near the source or target, facilitates the

movement of changes that occur between systems. Agents don’t have to be installed on database servers, but

agents should be used when communicating across a Wide Area Network (WAN), either between on-premises and the

cloud, or when moving data between availability zones or different providers in the cloud. 

When moving data using HVR, the following features ensure performance and security:

• Proprietary compression combined with large data block transfer ensures fast performance maximizing bandwidth utilization

irrespective of latency.

• Network communication is secured using SSL/TLS encryption with the use of explicit certificates.

• Certificates can also be used for more secure two-factor authentication on top of username/password validation.

• Use of a proxy is supported so that individual systems or database access don’t have to be exposed through the firewall but instead data

is securely routed through a proxy. (e.g. in a DMZ (de-militarized zone))

Supported ServicesThe majority of data storage on AWS services are available through Infrastructure as a Service (IaaS) and Platform as a Service (PaaS). Some

technology vendors such as Snowflake and Teradata use the AWS infrastructure to run their branded cloud services.

We strongly recommend that you use an HVR installation in the availiability zone of the source and/or destination end point in order to achieve

optimal efficiency and performance. The HVR for AWS image in AWS Marketplace is a good starting point with many database drivers to

various services pre-installed on the image. Agent sizing is dependent on the end point technology but can be very modest.

HVR is compatible with the infrastructure service Elastic Load Balancer (ELB) with customers using this service in their production

environments.

Page 3: DATASHEET Data Integration for AWS · Amazon Web Services (AWS) is a commonly selected cloud provider. This datasheet describes HVR’s continuous data integration support for AWS

hvr-software.com | [email protected] | 135 Main Street, Suite 850, San Francisco, CA 94105

© Copyright 2018. All Rights Reserved. HVR Software.

IaaS SupportHVR supports all technologies installed on Amazon EC2 servers in

AWS as if these were on-premises installations. For example HVR

customers use Oracle Databases directly running on Amazon EC2

servers, Greenplum running on AWS, and various other databases

and technologies.

The list of supported technologies continue to expand. Refer to

the HVR platform support page on the HVR website for a current

overview of supported technologies.

PaaS SupportAWS services can be used as a source or a target for continuous

data integration. Any of HVR’s supported source technologies and

platforms, used in the cloud or on-premises, are supported as a

source for AWS services as a target. The following AWS services are

supported natively:

• All flavors of the Amazon Relational Database Service (RDS)

are supported as a target, including Amazon Aurora. As a

source for log-based CDC, HVR supports the Amazon RDS

flavors Oracle, PostgreSQL, MySQL and MySQL-compatible

Amazon Aurora.

• The AWS data warehouse solution, Amazon Redshift, is

supported as a target. Changes are integrated most efficiently

using compressed data files staged in Amazon S3 and running

integration through micro-batches so that on aggregate

Amazon Redshift can keep up with the number of row changes

arriving from one or more busy transactional systems.

• Amazon S3 is supported both as a destination and as a

source. A popular use case of Amazon S3 is as a data lake

destination, with HVR supporting the management of files

(e.g store files, etc.) organized by table, per year/month/day,

and data publication through the use of manifests. Supported

data formats include JSON, Parquet, Avro, CSV and XML with

many options to fine-tune the format and control compression.

Both client-side and server-side encryption through the AWS

ABOUT HVR

We accelerate data movement so that you can revolutionize your business. HVR is designed to move large volumes of data FAST and efficiently in modern environments for real-time updates.

Key Management Service

(KMS) are supported.

• HVR supports the use of Amazon EMR

as external tables on top of the data files

in Amazon S3, both to enable direct access, and

to support compare between the source database

and target data in Amazon S3. Amazon S3 is also used as

the staging area for pushing changes into other services like

Snowflake and Amazon Redshift.

• Amazon Kinesis is supported as a destination for changes

using JSON as the default data format.

• Amazon Elastic File System (EFS) is supported as a source and

destination implicitly through HVR’s support for file systems.

In addition to the natively supported PaaS technologies, additional

destinations can be utilized through the agent plugin framework.

The agent plugin is a program or script that runs in a pre-defined

environment with data available in files. Python is commonly used

to write an agent plugin script but other programming languages can

be used as well. Plugin examples are included in the HVR installation

directory $HVR_HOME/lib/agent.

HVR in AWS MarketplaceTo get started using HVR for AWS contact us,

or look for HVR in AWS Marketplace.

INTERESTED TO LEARN MORE?Sign up for a LIVE DEMO Webinar

Sign Up