Large Systems: Design + Implementation:
➢ Google Search
Image (c) Facebook
Case Study: Google Evolution
Jeff Dean, “Building Software Systems at Google and Lessons Learned”, Stanford Computer Science Department Distinguished Computer Scientist Lecture, November 2010
Jeff Dean, “Evolution and future directions of large-scale storage and computation systems at Google”, SoCC '10: Proceedings of the 1st ACM Symposium on Cloud Computing, ACM, New York, NY, USA (2010), pp. 1-1
https://research.google.com/pubs/jeff.html
Leaf servers handle both index & doc requests from in-memory data structures
Coordinates index switching as new shards become available
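The leaf-server role described above can be sketched as a small in-memory server that answers both index lookups and document fetches. This is an illustrative sketch, not Google's implementation; all class and method names are assumptions.

```python
# Minimal sketch of a search leaf server that serves both index and
# document requests from in-memory structures (names are illustrative).

class LeafServer:
    def __init__(self, inverted_index, doc_store):
        # inverted_index: term -> sorted list of doc ids (posting list)
        # doc_store: doc id -> raw document text (for snippet generation)
        self.inverted_index = inverted_index
        self.doc_store = doc_store

    def index_request(self, terms):
        """Return doc ids matching all query terms (AND semantics)."""
        postings = [set(self.inverted_index.get(t, ())) for t in terms]
        if not postings:
            return []
        return sorted(set.intersection(*postings))

    def doc_request(self, doc_id):
        """Return the stored document for this doc id, or None."""
        return self.doc_store.get(doc_id)

leaf = LeafServer(
    inverted_index={"cloud": [1, 3], "storage": [3, 7]},
    doc_store={3: "Colossus: cluster-level storage ..."},
)
hits = leaf.index_request(["cloud", "storage"])  # -> [3]
doc = leaf.doc_request(hits[0])
```

Serving both request types from one process keeps the index shard and its documents co-located, which is why the same leaf can answer a hit list and then render snippets for it.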
New Problems
More collections to search besides Web
More structured: Maps
Need more real-time results
More Real-Time
Creating the index was a batch process via MapReduce:
Store all documents in GFS (== HDFS)
Run several MapReduce jobs to create the index
Upload the index to leaf servers
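The batch pipeline can be illustrated with a toy MapReduce-style index build: a map phase emits (term, doc_id) pairs and a reduce phase groups them into posting lists. This is a single-process sketch under assumed names, not the actual GFS/MapReduce stack.

```python
# Toy batch index build in the spirit of the slide: map over stored
# documents, emit (term, doc_id) pairs, reduce into posting lists.
from collections import defaultdict

def map_phase(doc_id, text):
    # Emit one (term, doc_id) pair per distinct term in the document.
    for term in set(text.lower().split()):
        yield term, doc_id

def reduce_phase(pairs):
    # Group doc ids by term into sorted posting lists.
    index = defaultdict(set)
    for term, doc_id in pairs:
        index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}

# Stand-in for documents stored in a GFS/HDFS-like store.
docs = {1: "large systems design", 2: "systems at google"}
pairs = [p for doc_id, text in docs.items() for p in map_phase(doc_id, text)]
index = reduce_phase(pairs)
# index["systems"] -> [1, 2]
```

The key property the slide criticizes is visible here: the whole corpus is reprocessed per run, so a newly crawled page is invisible until the next full batch completes.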
New documents would not show up in search results for 2-3 days [Peng and Dabek, 2010]
Needed lower “time from crawl-to-search-hit”
Solution:
New data storage system: Colossus / BigTable
Event-driven, incremental processing: Caffeine / Percolator
BigTable:
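BigTable is commonly described as a sparse, distributed, persistent, multi-dimensional sorted map keyed by (row key, column, timestamp). A minimal in-memory sketch of that data model, with illustrative names and no distribution or persistence:

```python
# Tiny in-memory sketch of the BigTable data model:
# cell = (row key, column) -> versioned values ordered by timestamp.
class TinyBigTable:
    def __init__(self):
        # (row, column) -> list of (timestamp, value), newest first
        self.cells = {}

    def put(self, row, column, value, timestamp):
        versions = self.cells.setdefault((row, column), [])
        versions.append((timestamp, value))
        versions.sort(key=lambda tv: tv[0], reverse=True)

    def get(self, row, column):
        """Return the most recent value for this cell, or None."""
        versions = self.cells.get((row, column))
        return versions[0][1] if versions else None

t = TinyBigTable()
t.put("com.example/index.html", "contents:", "<html>v1</html>", timestamp=1)
t.put("com.example/index.html", "contents:", "<html>v2</html>", timestamp=2)
t.get("com.example/index.html", "contents:")  # -> "<html>v2</html>"
```

Keeping multiple timestamped versions per cell is what lets a crawler store successive fetches of the same page under one row key.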
Caffeine / Percolator
Crawler uploads new version of a page into BigTable
Updates to BigTable can trigger code
E.g. code to update the index
Push index update to leafs
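The trigger mechanism above can be sketched as a table with registered "observer" callbacks: a write fires the observers, which re-index only the changed page and hand the result to a (stand-in) leaf push. This is a toy single-process model of the Percolator idea; all names are assumptions.

```python
# Sketch of event-driven, incremental processing: writes to a table
# trigger observer code that re-indexes only the changed page.

class ObservedTable:
    def __init__(self):
        self.rows = {}
        self.observers = []  # callables invoked on every write

    def register_observer(self, fn):
        self.observers.append(fn)

    def put(self, row, value):
        self.rows[row] = value
        for fn in self.observers:  # write triggers incremental work
            fn(row, value)

leaf_updates = []  # stand-in for "push index update to leafs"

def index_observer(row, page_text):
    # Incremental indexing: process only the page that changed.
    terms = sorted(set(page_text.lower().split()))
    leaf_updates.append((row, terms))

table = ObservedTable()
table.register_observer(index_observer)
table.put("com.example/a", "fresh crawl content")
# leaf_updates now holds the incremental index entry for the new page
```

Compared with the batch pipeline, the cost per new document is proportional to that document alone, which is what shrinks the crawl-to-search-hit latency from days to something far smaller.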