Upload
mountaha
View
1.018
Download
0
Embed Size (px)
DESCRIPTION
EMC #1 Open XML Database (OEM)
Citation preview
1© Copyright 2008 EMC Corporation. All rights reserved.
EMC Documentum XML StoreOEM Edition
The Native XML Database
Prateek KathpalSr. Manager – OEM & Platform Marketing
2© Copyright 2008 EMC Corporation. All rights reserved.
Agenda
Need & Value of Native XML Database
Overview of XML Store OEM Edition
Developer & Support Programs
Success Stories
3© Copyright 2008 EMC Corporation. All rights reserved.
Need for a Native XML Database
4© Copyright 2008 EMC Corporation. All rights reserved.
Advantages of XML for Information Management
Universal Format for all types of information– XML can represent all types of information from Office Documents to an archival
format for database tables
Self Describing– XML and its associate standards (Schema, Semantic Web) allow for information to be
self-describing and richly typed
Platform and Application Neutral– XML is not dependent on a platform or an application to read it
Separation of Content from Presentation– Supports multi-channel delivery of information
Searching and Retrieval– XML, tagging and XML Query enable better searching, retrieval and dynamic
assembly of information
5© Copyright 2008 EMC Corporation. All rights reserved.
Options for Storing and Managing XML
Flat files No way to query and processDifficult to update XML fragments
Relational database No structured queries Difficult to update XML fragmentsDoes not support the hierarchal structure of XMLPerformance affected by need to process content
Native XML database Fast and flexible when working with XMLOptimized for XML contentHigh performance
6© Copyright 2008 EMC Corporation. All rights reserved.
Ask yourself these questions
These are the reasons for embedding native XML database in
your application!
How are you managing your XML content in your application today?Local or network file systems?(Z:\Shared Drive\Application_Files)Storing XML as BLOB in RBMS
How do you retrieve the information?Based on the file path or XML link?
How do you manage and process large volumes of complex XML content?
Are you using traditional SAX & DOM parsers?Is this affecting performance of your system?
How are you transforming XML Content?Are you writing code to create PDF or XHTML renditions?
How are you managing internationalization support?Language support & issues with Unicode characters
Can you track changes in the information?What changed and at what time?What about changes in different snippets of XML files?
How about searching the XML files?Are you trying to create or embed a search engine?Are you searching using XQUERY?
7© Copyright 2008 EMC Corporation. All rights reserved.
Bud
get
Res
ourc
es
Build or Buy?
Multiple technologies must be integrated to provide a full suite of content services
Your budget and resources get spread very thin.
Complexity and problems at every layer lead to:
Added costDelays in developmentMissed deadlines
Advances in technology are difficult to incorporate in future releases
Maintenance and support become increasingly complex
Where do you want to spend your limited R&D resources and budget
8© Copyright 2008 EMC Corporation. All rights reserved.
The Value of Native XML Database
Focus on the value of your application, not the infrastructure
Server
ContentMgmt. Services
Bud
get
Res
ourc
es
YourApplication
Management of XML is handled by Native XML Database
Your budget and resources are focused on creating business value and competitive differentiation
Less development, fewer problems leads to on time product delivery
Advances in technology are incorporated for you as part of the infrastructure
Maintenance and support are now focused on the application, not the infrastructure
9© Copyright 2008 EMC Corporation. All rights reserved.
EMC DocumentumXML Store
OEM Edition
10© Copyright 2008 EMC Corporation. All rights reserved.
EMC Documentum: The Leader in Content Management
16 years focusing on content management and related technologies
Successfully helping thousands of companies solve their technology challenges
Technology suited to solving a wide range of business problems
Flexible, standards-based, architecture designed from the beginning to work with any type of digital content
Mature product - proven in all types of applications and environments
EMC Documentum understands content management and offers a wide range of technologies, complemented by proven development and support
methodologies, to make our partners successful.
11© Copyright 2008 EMC Corporation. All rights reserved.
EMC Documentum XML Store
A native XML database, built from scratch and fully optimized for XML content
Stores, queries, retrieves, updates, transforms, and publishes XML data
Not built on a relational database
100% Java, runs on any supported JDK 1.5 platform
• Designed for software developers and designed to be embedded:– Extensive API– No UI– Minimal resource demands / light weight footprint
12© Copyright 2008 EMC Corporation. All rights reserved.
Rich XML ManagementFull versioning & tracking of differencesBuilt in XQuery engine for high performanceIndexing methods to optimize accessTransformation and publishing capabilitiesIntelligent linking Client side caching
Key features of XML Store
TransformationXQuery
Engine
IndexesMeta-data
Doc Type: P.O.Owner: Bob FreemanSubject: CapExTitle: Request for FundsCreated: Feb. 1, 2007Keywords: Q1, Budget
Versioning
1.0
1.1
2.0
Check in/out Rich Set
APIs
13© Copyright 2008 EMC Corporation. All rights reserved.
Fully standards based
The fastest and most scalable XML database powered by open standards
• XML 1.1 • XQuery • XML Schema • XPath• XPointer• XLink• XSL • XUpdate• DOM • WebDAV• J2EE • JSP • JAAS • SOAP • Unicode
Uses and supports all standards
14© Copyright 2008 EMC Corporation. All rights reserved.
Standards Based Architecture
Reduce risk by adopting an XML database that uses industry / technology standardsIntegrate faster by leveraging open standardsEMC participates actively in standard organizations
15© Copyright 2008 EMC Corporation. All rights reserved.
XML Store architecture
EMC Documentum XML Store
16© Copyright 2008 EMC Corporation. All rights reserved.
Architected for Scalability and Performance
High PerformanceLinear Load time of 14.9 MB/s; load time is independent of the size of the Database and / or size of the documents loaded
A document with 1 thousand elements queried in 0.063 ms
A document with 10 billion elements queried in 16.9 ms (on average)
Scalability Designed to scale in terms of concurrent users and database size. Supports load balancing of the database over multiple physical machines Built in fault-tolerance and replication capabilities
Loading 10Gb of data
0
500
1000
1500
2000
2500
3000
3500
4000
4500
1000
00 x
100k
b
1000
0 x
1 M
b
1000
x 1
0 M
b
100
x 10
0 M
b
10 x
1 G
b
1 x
10 G
b
Number of Documents
Tim
e [s
]
Indexed XQuery
0.088 0.088 0.089 0.093 0.099
3.3
8.06
13.1
0
2
4
6
8
10
12
14
1,00
0
10,0
0 0
100,
0 00
1,00
0 ,0 0
010
,00 0
,000
100,
0 00,
000
1,00
0 ,0 0
0,00
010
,00 0
,000
,000
Number of elements in the document
Que
ry ti
me
[mill
isec
onds
]
17© Copyright 2008 EMC Corporation. All rights reserved.
Designed for OEM Applications
Silent Installation
Low System Requirements - Needs min 256 Mb internal Memory and 300 Mb hard disk space with Sun JDK 1.5
Runs on any OS / hardware platform that has JDK 1.5
Unicode compliant
Can store both XML and Large Binary Objects.
Built in J2EE Resource Connector
Bridges to relational data through the SQL loader
Offers various index flavors such as Construct Indexes, Content Indexes, Context Conditioned Indexes and Full Text Indexes
Integrated for SAX based parsing , Xalan and FOP
Easily integrates with other formatting engines like XEP, Framemaker etc.
18© Copyright 2008 EMC Corporation. All rights reserved.
OEM, Developer, and Support Programs
19© Copyright 2008 EMC Corporation. All rights reserved.
The EMC Velocity2 ISV Program is for Application Vendor’s & ISVs that wish to take advantage of the power and range of EMC’s open software and network storage platforms.
EMC ISV Velocity Program enables you to: • Build and market integrations between your products and
EMC technology• Develop and sell applications that run on top of EMC
technology• Embed EMC components into your own products/
solutions through OEM agreement• Offer hosted on-demand or application service provider
(ASP) solutions based on EMC’s platform
Extending the Velocity Brand to ISVs
Velocity ISV Partner Program
20© Copyright 2008 EMC Corporation. All rights reserved.
Designed for EMC
Partner application accreditation service
Quickstart program to provide guidance on how to successfully build XML applications
Gives your customers a way to spot good quality applications
Foster interoperability between applications
Gives you a marketing vehicle to differentiate their offering
Designed for EMC Partner Accreditation Logos
21© Copyright 2008 EMC Corporation. All rights reserved.
Large established EMC developer community
EMC developer community– More than 30,000 Active users – Newsletter to 20,000 each month – Almost 40,000 users
Features – Application Roadmap – Component Exchange – Code Snippets – Sample Code – Forums – Podcasts – Articles/Tips
22© Copyright 2008 EMC Corporation. All rights reserved.
Customers &Success Stories
23© Copyright 2008 EMC Corporation. All rights reserved.
XML Store Customers
The fastest and most scalable XML database powered by open standards
Designed for software developers who require advanced XML data processing and storage
24© Copyright 2008 EMC Corporation. All rights reserved.
Renault Formula 1 Racing: split-second storage
Challenges– Storing existing and newly-acquired data from
Formula 1 race cars – Providing access to a global data repository via the
use of a secure web portal – Extracting various types of data (including audio
and video signals) from several proprietary formats – Real-time diagnosis of car’s performance, based on
information from car sensors and environmental factors captured in XML
Solution– Selected XML Store for its ability to quickly process
complex raw data into the hierarchical structure of XML.
Results– Storing complex data in an XML database helps the
team to quickly perform ‘what-if’ analyses and infer various scenarios as required
– ‘Split-second’ data storage provides the F1 team realtime data to perform complete diagnoses of a car’s performance for a competitive edge
– Use of open standards provides the ability to extract various types of data (including audio and video signals) from several proprietary formats
Business Overview:Renault F1 is the Renaultcompany’s Formula One racing team. Renault has competed in Formula One (originally via subsidiary Renault Sport), both as an engine supplier and as a constructor from the late 1970s to the present day, with several breaks. Renault F1 is coordinated from the team’s UK base at Enstone, Oxfordshire where the chassis are designed and built.
Industry:Automobile
25© Copyright 2008 EMC Corporation. All rights reserved.
Forest Technology Group
Challenges– Existing solution based on a relational database
(Oracle) resulting in difficult customizations– Needed a more flexible approach to managing
and extracting data. – Reduce the cost for adding new customized
features
Solution– Selected XML Store for its flexibility & scalability
Results– Flexible, native XML database makes it easy to
adapt applications to meet customer needs – Indexes and stores information from disparate
sources, and provides an array of querying and reporting capabilities
– Enables FTG to address change requests swiftly, to close more business
Business Overview:Forest Technology Group (FTG)provides information technology solutions to companies that manage forests and other natural resources
IndustryForestry and natural resources management
GeographiesHeadquarters: North Charleston, South Carolina
26© Copyright 2008 EMC Corporation. All rights reserved.
With XML Store
How are you managing your XML content in your application today?Local or network file systems?(Z:\Shared Drive\Application_Files)Storing XML as BLOB in RBMS
How do you retrieve the information?Based on the file path or XML link?
How do you manage and process large volumes of complex XML content?
Are you using traditional SAX & DOM parsers?Is this affecting performance of your system?
How are you transforming XML Content?Are you writing code to create PDF or XHTML renditions?
How are you managing internationalization support?Language support & issues with Unicode characters
Can you track changes in the information?What changed and at what time?What about changes in different snippets of XML files?
How about searching the XML files?Are you trying to create or embed a search engine?Are you searching using XQUERY?
Rich APIHigh PerformanceVersioningMeta data ManagementIndexingBuilt in XQueryTransformationScalabilityLow MaintenanceReduced Development CostsOpen Standards Support
27© Copyright 2008 EMC Corporation. All rights reserved.
Summary: Business Benefits for You
Reduced development and maintenance costs– Leverage existing services instead of developing them– Lower cost, less effort, easy installation– Develop on one standards based architecture for both Windows and Linux
Reduced risk– Industry leading & mature XML database with support from a leading platform company EMC.– Fully self-contained, no dependency on expensive third-party components such as Oracle– Scalable to fit any customer and application need
From hundreds to billions of documentsFrom tens to thousands of usersFrom gigabytes to terabytes
– Extensible, Scalable and Flexible– Open Standards based architecture easily adapts to changes in IT infrastructure– History of rapid adoption of new industry trends and technologies
28© Copyright 2008 EMC Corporation. All rights reserved.
Contacts
For further information visit us at:http://www.emc.comEmail us at: [email protected]