View
29
Download
0
Category
Tags:
Preview:
DESCRIPTION
.
Citation preview
Emerging Trends
Prof. Srikumar KrishnamoorthyComputer and Information Systems Group
Indian Institute of Management Ahmedabad
Some Emerging Technologies you
might have encountered ….
P2P : Peer-to-Peer Networks
Cloud Computing
Grid Computing
Utility Computing
SaaS – Software as a Service
2
P2P Networks
Peer-to-peer follows a distributed architecture
without the need for central coordination, with
participants being at the same time both
producers and consumers of resources (in
contrast to the traditional client–server model)
3Source: Wikipedia
P2P – Key Characteristics
4
• Peers have equal responsibility (server/client)
• Highly dynamic environments
• Increased autonomy
P2P File Sharing – Napster
• Program for sharing files over the Internet
• a “disruptive” application/technology?
• history:
– 5/99: Shawn Fanning (freshman, Northeasten U.) founds Napster Online music service
– 12/99: first lawsuit
– 3/00: 25% UWisc traffic Napster
– 2000: est. 60M users
– 2/01: US Circuit Court of
Appeals: Napster knew users
violating copyright laws
– 7/01: # simultaneous online users:
Napster 160K, Gnutella: 40K, Morpheus: 300K 5
Napster – How does it work?
Four Steps
• Connect to Napster server
• Upload your list of files (push) to server
• Give server keywords to search the full list
with
• Select “best” of correct answers
6
Napster
napster.com
users
File list is uploaded
1.
7
Napster
napster.com
user
Request
and
results
User requests search at server.
2.
8
Napster
napster.com
user
pingspings
User pings hosts that apparently have data.
Looks for best transfer rate.
3.
9
Napster
napster.com
user
Retrieves
file
User retrieves file
4.
10
Napster
• Central Napster server�Can ensure correct results
�Fast search
�Bottleneck for scalability
�Single point of failure
�Susceptible to denial of service• Malicious users
11
Napster – P2P File Sharing System
• File download directly between members’ computers, bypassing the central computer
• Central computer necessary for initial contact
• Legal issues concerning copyrighted material being freely distributed:
– Napster never stored any MP3 files!
– It provided means for people to exchange such files
• Napster paid music creators & owners $33million for copyright infringement
12
P2P File Sharing – Gnutella
• Decentralized P2P network
• Peers search files by sending requests to their
neighbors
I want song.mp3
She wants song.mp3
We havesong.mp3
Give mesong.mp3
Here it is!13
P2P File Distribution – BitTorrent
tracker: tracks peers
participating in torrent
torrent: group of peers exchanging chunks of a file
obtain list
of peers
trading
chunks
peer
❒ P2P file distribution
14
P2P – Key Benefits
• Cost sharing/reduction
• Improved scalability and reliability
• Resource aggregation
• Ad-hoc communication/collaboration
15
Grid Computing – Definitions
Grid computing is coordinated resource sharing and problem
solving in dynamic, multi-institutional virtual organizations
Grid computing (Foster and Kesselman, 1999) is a growing
technology that facilitates the executions of large-scale resource
intensive applications on geographically distributed computing
resources.
Facilitates flexible, secure, coordinated large scale resource
sharing among dynamic collections of individuals, institutions,
and resource16
Grid Computing – Characteristics
� Co-ordinated resource sharing
• Create an illusion of a simple, yet powerful
computer out of a large number of heterogeneous
systems
� Dynamic collection of institutions
� Used for problem solving
• Tasks are submitted and distributed on nodes in
the grid
17
Why Grids ?
• Solving grand challenge applications using
computer modeling, simulation and analysis
Life Sciences
CAD/CAM
Aerospace
Military ApplicationsDigital Biology Military ApplicationsMilitary Applications
Internet & Ecommerce
18
Grid Computing – Distributed Processing
19
The Grid
Master
Slaves
e.g. Hadoop, Hazelcast
A Typical Grid Computing Environment
Grid Resource Broker
Resource Broker
Application
Grid Information Service
Grid Resource Broker
databaseR2R3
RN
R1
R4
R5
R6
Grid Information Service
20
An Operational Grid
Grid Middleware
App1 App2 App3
21Grid
A B CD
Resource
Monitor
Resource
Discovery
Job
Manager
Job
Monitor
Workflow ExecutorResource Broker
Drug Design: Data Intensive Computing
on Grid
• It involves screening millions
of chemical compounds
(molecules) in the Chemical
DataBase (CDB) to identify
those having potential to
serve as drug candidates.
Protein
Molecules
Chemical Databases(legacy, in .MOL2 format)
22
DesignDrug@Home ArchitectureA Virtual Lab for “Molecular Modeling for Drug Design” on Grid
“Screen 2K molecules in 30min. for $10”
Grid Market Directory
ResourceBroker
Grid Info. Service
GTS
GTS
GTS
GTS
“Give me list PDBs sourcesOf type aldrich_300?”
(GTS - Grid Trade Server)
PDB2
Data Replica Catalogue
GTS
PDB1
“mol.5 please?”
(RB maps suitable Grid nodes and Protein DataBank)
23
MEG(MagnetoEncephaloGraphy) Data Analysis on the Grid: Brain Activity Analysis
Life-electronics laboratory,
AIST
Data Analysis
•Provision of expertise in
the analysis of brain function
•Provision of MEG analysis
Data Generation
Nimrod-G
64 sensors MEG
Results
Analysis All pairs (64x64) of MEG data by shifting the
temporal region of MEG data over time: 0 to 29750:
64x64x29750 jobs
World-Wide Grid
•deadline, budget,
•optimization preference
1
5
4
3
2
24
Search for Extra-terrestrial
Intelligence
~ 180K volunteers, 300K computers,
~ 600Tera FLOPS computing power 25
Grid – Financial Services Applications
• Portfolio risk analysis
• Value at Risk calculations
• Monte Carlo simulations
• Algorithmic trading
• Options Pricing calculations
26
Cloud Computing
• “Cloud computing is the use of computing resources
(hardware and software) that are delivered as a service over a
network (typically the Internet)” – Wikipedia
• “A model for delivering information technology services in
which resources are retrieved from the internet through web-
based tools and applications, rather than a direct connection
to a server.” – Investopedia
• “Cloud computing is a style of computing where massively
scalable IT-related capabilities are provided ‘as a service’
across the Internet to multiple external customers” – Gartner
27
Key Business Drivers for the Grid/Cloud
• Low Infrastructure utilization
• Management of (heterogeneous) resources
were quite challenging
• Handling performance and scalability of
business applications gracefully
– Run jobs processing petabytes (1015) of data
– Takes more than 10 days to read on 1 computer
28
Over and Under Provisioning of
Resources
29
• Real world server utilization
in data centers is 5 – 20 %
on an average
• Under utilized resources due
to over-provisioning
• Lost business due to under-
provisioning
Cloud Computing Advantages
�Illusion of infinite computing resources available on demand– no need to plan far ahead for provisioning
�Elastic infrastructureo Add or remove resources at fine grain(one server at a
time with EC2). Can be added/removed in minutes
�Allows companies to start small and expand as needed
�Applications/Services can be accessed anytime, anywhere on the Internet
30
Cloud Computing Benefits
• Lower IT application support costs
• Pay as you go Model– No upfront commitment on the Infrastructure
– Ability to pay for use of computing resources on short-term basis as needed (e.g., processors by the hour and storage by the day)
• Lower costs and higher revenue– Infrastructure utilization is no longer a concern for the
enterprise who consume the service
– Manage peak customer load effectively and avoid lost business opportunity
31
Real World Examples
�Animoto debut on Facebook
• experienced a demand surge that resulted in growing from 50 servers to 3500 servers in three days
• later traffic fell to a level well below peak
�Target.com (uses AWS) on “Black Friday”(Nov. 28)
• Other retailers had severe performance problems and intermittent unavailability
• Target’s and Amazon’s sites were just slower by about 50%.
32
33
Cloud Computing is:
a) A way to access applications hosted on the web through your web browser
b) A pay-as-you-go model for IT resources accessed over the Internet
c) Use of commodity computers, distributed throughout an internet, to perform parallel processing, distributed storage, indexing and mining of data
d) An IT buzzword that assures potential clients that your product is on the cutting edge of technology
e) All of the above
Software-as-a-Service
(SaaS)
Platform-as-a-Service
(PaaS)
Infrastructure-as-a-Service
(IaaS)
Provides services & applicationsEnd Users
Google Search, Google DocsSocial Networking Sites,Web Photo Albums, Google AppsCRM Apps by Salesforce.com
Provides a platform for application development Developers
Azure by Microsoft, Google App. Engine, EngineYard, OrangeScape
Provides storage, network and computing servicesSystem Administrators
Blue Cloud Data Center – IBMAkamai, 3Tera, RackspaceAmazon EC2, S3
Cloud Service Models
34
35
Cloud Services
Low
High
Source: Wikipedia
High
Low
Level of
Control
Level of
Abstraction
Cloud Deployment Models
• Internal (private) cloud. The cloud infrastructure is operated within
the consumer’s organization.
• Community cloud. The cloud infrastructure is jointly owned by
several organizations and supports a specific community that has
shared concerns (e.g., mission, security requirements, policy, and
compliance considerations).
• Public cloud. The cloud infrastructure is owned by an organization
selling cloud services to the general public or to a large industry
group.
• Hybrid cloud. The cloud infrastructure is a composition of two or
more clouds (internal, community, or public) that remain unique
entities but are bound together by standardized or proprietary
technology that enables data and application portability.
NIST working definitions
36
Key Considerations for Adoption
� Demand Patterns
� Level of Customization needed
� Integration Requirements
� Is Application Mission Critical?
� Performance Needs of Application
� Stage of Company – Start-up or established?
� Preferred Technology and Dev Platform? Vendor Lock in
Possible
� Stringent Compliance Needs?
� Security & Privacy issues
37
Cloud Adoption Issues
38
� ��'��� %��%�%���!����+���� ,• ���� �����!����9�������
– B� ���– � �� ���� �– � �����1 �������
����������– 3������� 12@1����� ����� �– �@�
• ����� ��������� �������������� ��������+� ��� �1���� �,
• ������������– � �������� ����� ����� �– �� ��%������– ��� �� � -8
�� ��%������
• ���� ����������� ������� ��'�������!������ ��� ��� ����� �����K��� ���+�������,
• 9������� �����!������ ��� �����!������ !���
/;
SaaS Maturity Levels
Level Configurability Multi-tenancy Scalability
1 No No No
2 Yes No No
3 Yes Yes No
4 Yes Yes Yes 41
Source: Microsoft MSDN
Architecture Centre
Grid Vs Cloud Computing
42
Gartner Hype Cycle
43
Source: Gartner
Source: Gartner44
45
Commercial Cloud Formation
Case Study – Email
� On-premise emailo All email services (mailboxes, filtering, etc.) run on
company-owned servers.
� Hosted emailo All email services are delivered by a hosted mailbox
service provider.
� Hosted ancillary services (hybrid)o Some ancillary services like filtering or archiving are
delivered by a cloud-based provider.
� Split-domain email (hybrid)o Some employees use the on-premise email and some use
a hosted mailbox service.
46
Source: Forrester Research
Architecture Description Benefits Challenges
On-premise email
All email services (mailboxes, filtering, etc.)run on company-owned servers.
•Traditional, hence comfortable• Easier integration with other applications and resources
•Expensive to maintain•Consumes IT staff time and resources
Hosted email All email services are delivered by a hosted mailbox service provider.
•Pay-as-you-go financing model•Always-current software and protection•Operated by someone else
•Integration with directory and business applications•Exposure to business failure by service provider
Hosted ancillaryservices (hybrid)
Some ancillary services like filtering or archiving are delivered by a cloud-based provider.
•Offload maintenance of specialty services•Often less expensive•Keeps core email on-premise
•Culture shift to trust provider•Potential of conflict in implementing archiving processes
Split-domainemail (hybrid)
Some employees use the on-premise email and some use a hosted mailbox service.
•Move occasional users or new users to a hosted service•Easier to provision new users or acquired companies
•Integration with directory and other business applications•Different experiences for workers using on-premise and hosted email
47
Advantages of Cloud Based e-Mail
� Rapidly provision new users
�Allocate valuable IT professionals to more business-centric projects
�Always run the latest software and configurations without upgrade hassles
� Shift the financial burden from Cap-Ex to ongoing Op-Ex
48
Evaluate cloud offerings
• Segment employees based on their e-mail
needs
• Identify cost elements and compute costs
• Compare e-mail options – both on-premise
and cloud based
49
Mobileexecutives
Informationworkers
Occasionalusers
Percentage of workforce (example)
10% 70% 20%
Mailbox size 2 gigabytes 1 gigabyte250
megabytes
Need email client software?
Yes Yes No
Archive mailbox? Yes Yes Yes
Need mobile email? Yes No No
Segment Employees Based On
Their Email Needs
50
Category Cost factors
Hardware Servers, operating system, data center costs, power
Server software Mail server software, client licenses, maintenance
Client software Client-installed software, maintenance
Storage Storage, redundancy, power
Message filtering Installed filtering hardware or filtering service
Message archiving Archiving monthly costs
Mobile messaging Mail delivery and administration
StaffingAdministration for hardware, software, storage, and
mobile
Financing Cost of financing servers, storage, and software
On-Premise Email Includes Hardware,
Staffing, And Financing Costs
51
Cost per user per month
On-premise
Cloud-based
MicrosoftExchange
OnlineGoogleApps*
Subscription $0.00 $9.78 $8.66 $4.17
Server hardware and OS
$0.56 $0.00 $0.00 $0.00
Server software $3.61 $0.00 $0.00 $0.00
Client software $3.49 $3.49 $3.49 $0.00
Storage $1.23 $0.00 $0.00 $0.00
Message filtering $2.99 $1.86 $0.00 $0.00
Message archiving
$8.89 $8.11 $6.33 $3.75
Staffing $4.41 $1.85 $1.85 $0.55
Total $25.18 $25.08 $20.32 $8.47
*Google doesn’t currently offer an offline email client.The pricing and features available in each architecture vary by provider. This is a scenario for 15,000 employees with email.
52
Cost comparison
Cost per user per month
5,000 seats
15,000 seats
25,000 seats
35,000 seats
45,000 seats
55,000 seats
On-premise email $28.22 $25.18 $22.52 $20.11 $17.84 $16.59
Cloud-based email
$27.24 $25.08 $23.05 $21.09 $19.18 $18.18
Microsoft Exchange Online
Standard$21.55 $20.32 $19.22 $18.19 $17.21 $16.68
Google Apps* $8.59 $8.47 $8.39 $8.32 $8.28 $8.24
*Google doesn’t currently offer an offline email client.Note: The pricing and features available in each architecture vary by provider.
53
Cost comparison (Cont’d…)
Summary
• P2P Computing
• Grid Computing
• Cloud Computing
– Service & Deployment Models
– SaaS Maturity Levels
– Considerations for Cloud Adoption
– Evaluation of on-premise Vs cloud offering
• Email case study
54
References and Additional Readings
• Above the Clouds: A Berkeley View of Cloud Computing by EECS Department, Feb 2009, University of California, Berkeley
• Cloud Computing – Key Considerations for Adoption, Apr 2009, Infosys White Paper
• Cloud Computing— Latest Buzzword or a Glimpse of the Future?, Feb 2009, Tech Republic White Paper
• The Total Cost of (Non) Ownership of Web Applications in the Cloud, Aug 2012, Amazon White Paper
• How AWS Pricing Works, Dec 2011, Amazon White Paper55
Thank You
Recommended