
SnapLogic Enhancements Support iPaaS for Hadoop 2.0 Environments


IMPACT BRIEF | 1 ©2014 Enterprise Management Associates, Inc. All Rights Reserved. | www.enterprisemanagement.com

Abstract

In September 2014 SnapLogic announced the Fall 2014 release of its integration solution with new data wrangling capabilities and major SnapReduce enhancements for Hadoop 2.0 deployments. This ENTERPRISE MANAGEMENT ASSOCIATES® (EMA™) impact brief covers the announcement, identifying significant new capabilities and direction that the new release provides.

SnapLogic Announces Fall 2014 Release

On September 30, 2014, SnapLogic (http://www.snaplogic.com/) announced the Fall 2014 release of its SnapLogic Elastic Integration Platform. The new release offers an expanded set of capabilities for big data acquisition, preparation, and delivery to provide true integration platform as a service (iPaaS) capability. The release includes the new Hadooplex option, by which users can schedule and trigger a multi-source pipeline to run natively as a YARN application. The new release also supports new Hadoop file formats for transforming workflows into MapReduce jobs. Additionally, the Fall 2014 release includes significant upgrades in the areas of usability, deployability, security and performance.

SnapLogic provides data and application integration tools for connecting cloud data sources, SaaS applications and on-premise business applications. SnapLogic describes its approach as Elastic Integration, helping companies connect enterprise applications and data in the cloud and on-premise for improved business agility and faster decision-making. With the SnapLogic Elastic Integration Platform, organizations can migrate enterprise IT to the cloud more quickly and affordably using a fast, multi-point, modern integration platform as a service. SnapLogic was founded in 2006 and is headed by co-founder and ex-CEO of Informatica, Gaurav Dhillon. In October 2014 the company announced the initial closing of its $20 million Series D financing. SnapLogic is headquartered in San Mateo, CA.

Key Observations

• Hadooplex: With the Fall 2014 release, SnapLogic users can now set, schedule and trigger a multi-source pipeline to run natively as a YARN application. Users can configure the elastic execution grid by selecting the Hadooplex option when running in a Hadoop cluster. The new option is certified by Cloudera and Hortonworks.

• Hadoop-enabled pipelines: Via the multi-tenant, cloud-based SnapLogic Designer, users can select SnapReduce to transform pipelines, which are SnapLogic-configured data flows, into MapReduce jobs that run on Hadoop. The new release introduces support for parsing and formatting additional Hadoop file formats (SequenceFile and RCFile) and document (JSON) processing for MapReduce jobs.

• Productivity and performance: The Fall 2014 release includes significant productivity and performance improvements to better support full enterprise iPaaS. Users can now preview a subset of data and its structure before and after it has been transformed. Hierarchical SmartLinking has been enhanced to respond to context from JSON and XML documents. Additionally, data flow designers can now easily see previous versions of a pipeline and replace an existing pipeline with a newer one or roll back to a previous version.

• Details: For more detailed information on the Fall 2014 SnapLogic release, visit: http://www.snaplogic.com/fall2014


EMA Perspective

Enterprise integration requirements are becoming ever more complex and demanding as the number of mobile, cloud, and social apps that enterprises are called upon to integrate continues to grow. SnapLogic’s cloud-based iPaaS significantly reduces the amount of time and resources required to incorporate new integration points. The SnapLogic drag-and-drop development tool “snaps” together integration objects visually, supporting complex multi-point integration scenarios in addition to basic point-to-point integration. With the Fall 2014 release, SnapLogic solidifies its position in the market as the provider of simplified and accelerated integration solutions for big data environments.

At the core of the SnapLogic offering are “Snaps.” These are modular collections of integration components built for a specific application or data source. Some Snaps provide connectivity to databases, cloud and on-premise applications, while others enable data transformation such as document filtering or adding, removing, or modifying fields. Snaps are also used to sort, join and aggregate data. Snaps are designed to support a wide variety of use cases, including big data analytics, identity management, social media, online storage, and ERP. Snaps are used to bring together such disparate formats and protocols as XML, JSON, OAuth, SOAP and REST. SnapLogic pipelines are workflows built from Snaps. Users construct pipelines via the SnapLogic Designer, with each Snap encapsulating a specific application or technology functionality.
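To make the idea of a pipeline built from Snaps more concrete, the following is a minimal Python sketch of the general pattern: small, single-purpose steps chained over a stream of documents. It is not SnapLogic code and does not use any SnapLogic API; the names `filter_snap`, `add_field_snap`, `sort_snap`, and `run_pipeline`, as well as the sample order data, are hypothetical and chosen only for illustration.

```python
from typing import Callable, Iterable, Iterator

# Hypothetical model: a "document" is a dict, and a "snap" is a function
# that consumes a stream of documents and produces a new stream.
Document = dict
Snap = Callable[[Iterable[Document]], Iterator[Document]]


def filter_snap(predicate: Callable[[Document], bool]) -> Snap:
    """Keep only the documents that satisfy the predicate."""
    def run(docs: Iterable[Document]) -> Iterator[Document]:
        return (d for d in docs if predicate(d))
    return run


def add_field_snap(name: str, fn: Callable[[Document], object]) -> Snap:
    """Add a derived field to every document without mutating the input."""
    def run(docs: Iterable[Document]) -> Iterator[Document]:
        for d in docs:
            yield {**d, name: fn(d)}
    return run


def sort_snap(key: str) -> Snap:
    """Sort documents by one field (materializes the stream in memory)."""
    def run(docs: Iterable[Document]) -> Iterator[Document]:
        return iter(sorted(docs, key=lambda d: d[key]))
    return run


def run_pipeline(snaps: list, docs: Iterable[Document]) -> list:
    """Feed the output of each step into the next, in order."""
    stream = iter(docs)
    for snap in snaps:
        stream = snap(stream)
    return list(stream)


# Illustrative sample data, not taken from the brief.
orders = [
    {"id": 1, "region": "west", "amount": 120.0},
    {"id": 2, "region": "east", "amount": 40.0},
    {"id": 3, "region": "west", "amount": 75.0},
]

pipeline = [
    filter_snap(lambda d: d["region"] == "west"),
    add_field_snap("amount_cents", lambda d: int(d["amount"] * 100)),
    sort_snap("amount"),
]

result = run_pipeline(pipeline, orders)
```

In this sketch each step knows nothing about its neighbors, which is the property that lets a visual designer rearrange or swap steps freely.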

With the enhancements provided in the Fall 2014 release, SnapLogic takes significant steps in the direction of providing true iPaaS capability for big data. The Hadooplex option enables SnapLogic users to designate Hadoop as an execution target for SnapLogic pipelines. Hadooplex leverages YARN to schedule execution of pipelines (workflows) on Hadoop nodes. SnapLogic’s SnapReduce 2.0 enables a Hadooplex to compile SnapLogic pipelines into MapReduce jobs that execute on very large data sets within HDFS. Additionally, the productivity and performance enhancements all support a more flexible and robust overall integration environment. These include the ability to preview a subset of data and its structure before and after it has been transformed; hierarchical SmartLinking, which understands context from JSON and XML documents; and the ability to view previous versions of a pipeline and replace an existing pipeline with a newer one (or roll back to a previous version).
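As a rough illustration of what translating a pipeline into a MapReduce job involves, the sketch below expresses a hypothetical "filter, then sum by key" pipeline as a map phase, a shuffle, and a reduce phase. It runs in memory in plain Python and is only a stand-in for the programming model; it does not reflect how SnapReduce actually compiles pipelines, and the function names, the threshold of 50, and the sample records are all assumptions made for the example.

```python
from collections import defaultdict


def map_phase(records, mapper):
    """Apply the mapper to every record, emitting (key, value) pairs."""
    for record in records:
        yield from mapper(record)


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups, reducer):
    """Apply the reducer once per key to that key's grouped values."""
    return {key: reducer(key, values) for key, values in groups.items()}


# Hypothetical pipeline: keep orders of at least 50, then total by region.
# The filter step becomes part of the mapper; the aggregate becomes the reducer.
def mapper(record):
    if record["amount"] >= 50:
        yield record["region"], record["amount"]


def reducer(key, values):
    return sum(values)


# Illustrative sample data, not taken from the brief.
records = [
    {"region": "west", "amount": 120.0},
    {"region": "east", "amount": 40.0},
    {"region": "west", "amount": 75.0},
    {"region": "east", "amount": 60.0},
]

totals = reduce_phase(shuffle(map_phase(records, mapper)), reducer)
```

The point of the sketch is the division of labor: record-at-a-time steps such as filtering fit in the map phase, while steps that need all values for a key, such as aggregation, fit in the reduce phase.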

SnapLogic has recently announced upcoming support for Apache Spark as a target big data platform. Similar to SnapReduce, the new Spark functionality will enable users to compile SnapLogic pipelines into Spark code so that pipelines can run as Spark jobs. EMA has witnessed a significant uptake in Spark usage over the past few months, with more organizations using Spark and those that use it deploying increasingly mission-critical solutions on it. SnapLogic’s announcement is therefore well timed.

Given its recent technology developments and infusion of capital, SnapLogic is well positioned to make the case that the company and its solution are moving into a strong leadership position in the big data analytics and integration space. EMA considers SnapLogic an excellent strategic option for organizations adopting and implementing cloud-based data integration strategies.

About EMA

Founded in 1996, Enterprise Management Associates (EMA) is a leading industry analyst firm that provides deep insight across the full spectrum of IT and data management technologies. EMA analysts leverage a unique combination of practical experience, insight into industry best practices, and in-depth knowledge of current and planned vendor solutions to help EMA’s clients achieve their goals. Learn more about EMA research, analysis, and consulting services for enterprise line of business users, IT professionals and IT vendors at www.enterprisemanagement.com or blogs.enterprisemanagement.com. You can also follow EMA on Twitter, Facebook or LinkedIn.

3027.111214