Introduction to Map-Reduce
Vincent Leroy
Sources
• Apache Hadoop
• Yahoo! Developer Network
• Hortonworks
• Cloudera
• Practical Problem Solving with Hadoop and Pig
«Big Data»
• Google, 2008
– 20 PB/day
– 180 GB/job (variable)
• Web index
– 50B pages
– 15 PB
• Large Hadron Collider (LHC) @ CERN: produces 15 PB/year
Capacity of a (large) server
• RAM: 256 GB
• Hard drive capacity: 24 TB
• Hard drive throughput: 100 MB/s
Solution: Parallelism
• 1 server
– 8 disks
– Read the Web: 230 days
• Hadoop cluster @ Yahoo
– 4,000 servers
– 8 disks/server
– Read the Web in parallel: 1h20
Google datacenter
[Photo: a Google datacenter]
Pitfalls in parallelism
• Synchronization
– Mutexes, semaphores…
• Difficulties
– Deadlocks
– Optimization
– Costly (requires experts)
– Not reusable
Programming models
• Shared memory (multicore)
• Message passing (MPI)
Fault tolerance
• A server fails every few months
• With 1,000 servers…
– MTBF (mean time between failures) < 1 day
• A big job may take several days
– There will be failures; this is normal
– Computations should still finish within a reasonable time
→ You cannot start over in case of failures
• Checkpointing, replication
– Hard to implement correctly
Big Data platform
• Let everyone write programs for massive datasets
– Encapsulate parallelism
• Programming model
• Deployment
– Encapsulate fault tolerance
• Detect and handle failures
→ Code once (experts), benefit to all
MAP-REDUCE MODEL
What are Map and Reduce?
• 2 simple functions inspired by functional programming
– Transformation: map
map(f, [x1, …, xn]) = [f(x1), …, f(xn)]
Ex: map(*2, [1,2,3]) = [(*2 1), (*2 2), (*2 3)] = [2,4,6]
– Aggregation: reduce
reduce(f, [x1, …, xn]) = f(x1, f(x2, f(x3, … f(xn-1, xn) …)))
Ex: reduce(+, [2,4,6]) = (+ 2 (+ 4 6)) = 12
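As a minimal illustration of the shape of these two primitives in plain Java (using java.util.stream, not Hadoop; the class name is made up):

import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class MapReducePrimitives {
    public static void main(String[] args) {
        List<Integer> xs = Arrays.asList(1, 2, 3);
        // map(*2, [1,2,3]) = [2,4,6]
        List<Integer> doubled = xs.stream().map(x -> x * 2).collect(Collectors.toList());
        // reduce(+, [2,4,6]) = 12
        int sum = doubled.stream().reduce(0, Integer::sum);
        System.out.println(doubled + " -> " + sum); // prints [2, 4, 6] -> 12
    }
}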
What are Map and Reduce?
• Generic
– Take a function as a parameter
• Can be instantiated and combined to solve many different problems
– map(toUpperCase, [“hello”, “data”]) = [“HELLO”, “DATA”]
– reduce(max, [87, 12, 91]) = 91
• The developer provides the function applied
Data as key/value pairs
• MapReduce does not manipulate atomic pieces of data
– Everything is a (Key, Value) pair
– Key and value can be of any type
• Ex: (Hello, 17)
– Key = Hello, type text
– Value = 17, type int
• When the initial data is not key/value, interpret it as key/value
– An input text file becomes [(#line, line_content), …]
Map-Reduce on key/value pairs
• Map and Reduce adjusted to key/value pairs
– In map, f is applied independently to every key/value pair:
f(key, value) → list(key, value)
– In reduce, f is applied to all values associated with the same key:
f(key, list(value)) → list(key, value)
– The types of the input keys and values do not have to be the same as those of the output
Example: Counting frequency of words
• Input: A file of 2 lines
– 1, "a b c aa b c"
– 2, "a bb cc a cc b"
• Output
– a, 3
– b, 3
– c, 2
– aa, 1
– bb, 1
– cc, 2
Word frequency: Mapper
• Map processes a portion (line) of text
– Split words
– For each word, count one occurrence
– Key (line number) not used in this example
• map(Int lineNumber, Text line, Output output) {
    foreach word in line.split(space) {
        output.write(word, 1)
    }
}
Word frequency: Reducer
• For each key, reduce processes all the corresponding values
– Add the numbers of occurrences
• reduce(String word, List<Int> occurrences, Output output) {
    int count = 0
    foreach int occ in occurrences {
        count += occ
    }
    output.write(word, count)
}
Execution flow (word frequency example)
Input: 1, "a b c aa b c"    2, "a bb cc a cc b"
Map output: a,1 b,1 c,1 aa,1 b,1 c,1    a,1 bb,1 cc,1 a,1 cc,1 b,1
Shuffle & Sort (group by key): a,[1,1,1]  b,[1,1,1]  c,[1,1]  aa,[1]  bb,[1]  cc,[1,1]
Reduce output: a,3  b,3  c,2  aa,1  bb,1  cc,2
How to build a Web index?
• Initial data: (URL, web_page_content)
• Goal: build an inverted index (word → list of URLs), e.g.:
Grenoble →
https://fr.wikipedia.org/wiki/Grenoble
http://www.grenoble.fr/
http://www.grenoble-tourisme.com/
http://wikitravel.org/en/Grenoble
UNIL →
http://www.unil.ch/
https://fr.wikipedia.org/wiki/Universit%C3%A9_de_Lausanne
https://twitter.com/unil
http://www.formation-continue-unil-epfl.ch/
How to build a Web index?
• map(URL pageURL, Text pageContent, Output output) {
    foreach word in pageContent.parse() {
        output.write(word, pageURL)
    }
}
How to build a Web index?
• reduce(Text word, List<URL> webPages, Output output) {
    postingList = initPostingList()
    foreach url in webPages {
        postingList.add(url)
    }
    output.write(word, postingList)
}
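The same pseudocode in Hadoop Java might look as follows. This is a sketch under assumptions not on the slides: the input format delivers (URL, content) as Text pairs (e.g., KeyValueTextInputFormat), tokenization is a simple whitespace split, and the posting list is emitted as comma-separated Text; the class names are hypothetical.

import java.io.IOException;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

// IndexMapper.java: key = URL of the page, value = page content; emits (word, URL)
public class IndexMapper extends Mapper<Text, Text, Text, Text> {
    @Override
    protected void map(Text url, Text content, Context context)
            throws IOException, InterruptedException {
        for (String word : content.toString().split("\\s+")) {
            context.write(new Text(word), url);
        }
    }
}

// IndexReducer.java: concatenates all URLs containing a word into one posting list
public class IndexReducer extends Reducer<Text, Text, Text, Text> {
    @Override
    protected void reduce(Text word, Iterable<Text> urls, Context context)
            throws IOException, InterruptedException {
        StringBuilder postings = new StringBuilder();
        for (Text url : urls) {
            if (postings.length() > 0) postings.append(',');
            postings.append(url.toString());
        }
        context.write(word, new Text(postings.toString()));
    }
}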
APACHE HADOOP: MAPREDUCE FRAMEWORK
Objective of Hadoop MapReduce
• Provide a simple and generic programming model: map and reduce
• Deploy execution automatically
• Provide fault tolerance
• Scale to thousands of machines
• Performance is important but not the priority
– What matters is that jobs finish within a reasonable time
– If it's too slow, add servers! Kill It With Iron (KIWI principle)
Architecture
• From a monolithic architecture to composable layers
Execution steps
Shuffle & Sort: group by key and transfer to the reducers
Shuffle & Sort
• Barrier in the execution
– All map tasks must complete before reduce starts
• A partitioner assigns keys to the servers executing reduce
– Ex: hash(key) % nbServers
– Deals with load balancing
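The default behaviour corresponds to Hadoop's HashPartitioner. A custom partitioner is a small class like the following sketch (the word-count key/value types are an assumption):

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Partitioner;

// Assigns each map output key to one of the reducers: hash(key) % nbServers
public class WordPartitioner extends Partitioner<Text, IntWritable> {
    @Override
    public int getPartition(Text key, IntWritable value, int numPartitions) {
        // mask the sign bit so the result is non-negative
        return (key.hashCode() & Integer.MAX_VALUE) % numPartitions;
    }
}

// Enabled in the driver with: job.setPartitionerClass(WordPartitioner.class);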
Combiner
• Potential problem of a map function: many key/value pairs in the output
– Materialized to disk, sent to the reducer over the network
– Costly step of the execution
• Add an operator: Combiner
– Mini-reducer executed on the data produced by map on a single machine, to start aggregating it
• The combiner may be used by Hadoop (optional)
– The correctness of the program must not depend on it
Combiner: key/value types

          Input (key, value)    Output (key, value)
Map       (MKI, MVI)            (MKO, MVO)
Combine   (CKI, CVI)            (CKO, CVO)
Reduce    (RKI, RVI)            (RKO, RVO)

Since the combiner is optional and sits between map and reduce, its input and output types must both match the map output types: (CKI, CVI) = (CKO, CVO) = (MKO, MVO) = (RKI, RVI).
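Given these type constraints, a word-count combiner for the Java code shown later could be a sketch like this (WordCountCombiner is a hypothetical name; note that the WordCountReducer shown later outputs LongWritable while the map outputs IntWritable, so the reducer class cannot be reused as the combiner directly):

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

// Input AND output types must both match the map output types (Text, IntWritable)
public class WordCountCombiner extends Reducer<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        int sum = 0;
        for (IntWritable v : values) {
            sum += v.get();
        }
        // emit a partial count; the real reducer finishes the aggregation
        context.write(key, new IntWritable(sum));
    }
}

It would be enabled in the driver with job.setCombinerClass(WordCountCombiner.class); as the slides note, correctness must not depend on whether (or how many times) Hadoop actually runs it.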
Combiner: execution flow (word frequency example)
Input: 1, "a b c aa b c"    2, "a bb cc a cc b"
Map output: a,1 b,1 c,1 aa,1 b,1 c,1    a,1 bb,1 cc,1 a,1 cc,1 b,1
Combiner output (per machine): a,1 b,2 c,2 aa,1    a,2 bb,1 cc,2 b,1
Shuffle & Sort: a,[1,2]  b,[2,1]  c,[2]  aa,[1]  bb,[1]  cc,[2]
Reduce output: a,3  b,3  c,2  aa,1  bb,1  cc,2
Combiner
• Same API as reduce: (key, List<value>)
– Not the same contract! For one key, you get SOME of the values
• Often the same aggregation as reduce
– E.g., word count
• Different when using global properties
– E.g., keep words present at least 5 times (see the sketch below)
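A sketch of that last point (the ≥ 5 threshold is the slide's example; the class name is hypothetical). A combiner may still pre-sum counts, but only the reducer, which sees ALL the values of a key, may apply the global filter:

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

// Emit only words seen at least 5 times overall. A combiner must NOT apply
// this filter: it only sees some of the values for a key, so it could
// wrongly discard a word whose partial count is below the threshold.
public class FrequentWordReducer extends Reducer<Text, IntWritable, Text, LongWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        long sum = 0;
        for (IntWritable v : values) {
            sum += v.get();
        }
        if (sum >= 5) {
            context.write(key, new LongWritable(sum));
        }
    }
}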
Hadoop MapReduce as a developer
• Provide the functions performed by Map and Reduce (Java, C++)
– Application dependent
• Define the data types (keys/values)
– If not standard (Text, IntWritable…)
– Functions for serialization
• That's all.
Imports
import java.io.IOException;
import java.util.*;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.JobContext;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.Job;
// also needed by the Main class below:
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.util.GenericOptionsParser;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

Do not use the old mapred API!
Mapper
// input key type, input value type, output key type, output value type
public class WordCountMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        // key = byte offset of the line (TextInputFormat), value = line content
        for (String word : value.toString().split("\\s+")) {
            context.write(new Text(word), new IntWritable(1));
        }
    }
}
Reducer
// input key type, input value type, output key type, output value type
public class WordCountReducer extends Reducer<Text, IntWritable, Text, LongWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        long sum = 0;
        for (IntWritable value : values) {
            sum += value.get();
        }
        context.write(key, new LongWritable(sum));
    }
}
Main
public class WordCountMain {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        String[] otherArgs = new GenericOptionsParser(conf, args).getRemainingArgs();
        Job job = Job.getInstance(conf, "word count");
        job.setJarByClass(WordCountMain.class);
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(IntWritable.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(LongWritable.class);
        job.setMapperClass(WordCountMapper.class);
        job.setReducerClass(WordCountReducer.class);
        job.setInputFormatClass(TextInputFormat.class);
        job.setOutputFormatClass(TextOutputFormat.class);
        FileInputFormat.addInputPath(job, new Path(otherArgs[0]));
        FileOutputFormat.setOutputPath(job, new Path(otherArgs[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
Writable example
public class StringAndInt implements WritableComparable<StringAndInt> {
    private IntWritable iw = new IntWritable();
    private Text t = new Text();

    public StringAndInt() {}

    public StringAndInt(String s, int i) {
        this.iw.set(i);
        this.t.set(s);
    }

    @Override
    public void write(DataOutput out) throws IOException {
        // serialize both fields
        this.iw.write(out);
        this.t.write(out);
    }

    @Override
    public void readFields(DataInput in) throws IOException {
        // deserialize in the same order as write()
        this.iw.readFields(in);
        this.t.readFields(in);
    }

    @Override
    public int compareTo(StringAndInt o) {
        int c1 = this.t.compareTo(o.t);
        if (c1 != 0) { return c1; }
        return this.iw.compareTo(o.iw);
    }
}
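A usage note (an assumption, not shown on the slide): the type would be registered in the driver with job.setMapOutputKeyClass(StringAndInt.class); and, if it is used as a key with the default HashPartitioner, it should also override hashCode() consistently so that equal keys land on the same reducer.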
Terminology
• MapReduce program = job
• Jobs are submitted to the JobTracker
• A job is divided into several tasks
– A Map is a task
– A Reduce is a task
• Tasks are monitored by TaskTrackers
– A slow task is called a straggler
Job execution
• $ hadoop jar wordcount.jar org.myorg.WordCount inputPath (HDFS) outputPath (HDFS)
• Check parameters
– Is there an output directory?
– Does it already exist?
– Is there an input directory?
• Compute splits
• The job (MapReduce code), its configuration, and the splits are copied with a high replication factor
• An object to follow the progress of the tasks is created by the JobTracker
• For each split, create a Map task
• Create the default number of Reduce tasks
TaskTracker
• The TaskTracker sends a periodic signal to the JobTracker
– Shows that the node still functions
– Tells whether the TaskTracker is ready to accept a new task
• A TaskTracker is responsible for a node
– Fixed number of slots for map tasks
– Fixed number of slots for reduce tasks
– Tasks can be from different jobs
• Each task runs in its own JVM
– Prevents a task crash from crashing the TaskTracker as well
Job progress
• A Map task reports its progress, i.e., the fraction of the split processed
• For a Reduce task, 3 states
– copy
– sort
– reduce
• Reports are sent to the TaskTracker
• Every 5 seconds, the report is forwarded to the JobTracker
• The user can see the JobTracker state through a Web interface
Progress
[Screenshot: job progress in the JobTracker Web interface]
End of job
• The output of each reducer is written to a file
• The JobTracker notifies the client and writes a report for the job:

14/10/28 11:54:25 INFO mapreduce.Job: Job job_1413131666506_0070 completed successfully
Job Counters
    Launched map tasks=392
    Launched reduce tasks=88
    Data-local map tasks=392
    [...]
Map-Reduce Framework
    Map input records=622976332
    Map output records=622952022
    Reduce input groups=54858244
    Reduce input records=622952022
    Reduce output records=546559709
    [...]
Server failure during a job
• Bug in a task
– The task JVM crashes → the TaskTracker JVM is notified
– The task is removed from its slot
• Task becomes unresponsive
– Timeout after 10 minutes
– The task is removed from its slot
• Each task may be re-run up to N times (default 7) in case of crashes
HDFS: DISTRIBUTED FILE SYSTEM
Random vs sequential disk access
• Example
– DB of 100M users
– 100 B/user
– Alter 1% of the records
• Random access
– Seek, read, write: 30 ms
– 1M users → 8h20
• Sequential access
– Read ALL, write ALL
– 2 × 10 GB @ 100 MB/s → ≈ 3 minutes
→ It is often faster to read everything and write everything sequentially
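Checking the arithmetic (a quick sanity check, not on the slide): random access touches 1% of 100M = 1M records at 30 ms each, i.e., 1M × 30 ms = 30,000 s ≈ 8h20. Sequentially, the whole table is 100M × 100 B = 10 GB; reading it and writing it back is 20 GB at 100 MB/s = 200 s ≈ 3.3 minutes.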
Distributed File System (HDFS)
• Goals
– Fault tolerance (redundancy)
– Performance (parallel access)
• Large files
– Sequential reads
– Sequential writes
• "In place" data processing
– Data is stored on the machines that process it
• Better usage of machines (no dedicated filer)
• Fewer network bottlenecks (better performance)
HDFS model
• Data organized in files and directories
→ mimics a standard file system
• Files divided into blocks (default: 64 MB) spread over the servers
• HDFS reports the data layout to the Map-Reduce framework
→ If possible, process data on the machines where it is already stored
Fault tolerance
• File blocks are replicated (default: 3×) to tolerate failures
• Placement according to different parameters
– Power supply
– Network equipment
– Diverse servers, to increase the probability of having a "close" copy
• Checksum of the data to detect corrupted blocks (also available in modern file systems)
Master/Worker architecture
• One master, the NameNode
– Manages the file name space
– Manages access rights
– Supervises operations on files, blocks…
– Supervises the health of the file system (failures, load balance…)
• Many (1000s of) workers, the DataNodes
– Store the data (blocks)
– Perform read and write operations
– Perform copies (replication, ordered by the NameNode)
NameNode
• Stores the metadata of each file and block (inode)
– File name, directory, associated blocks, position of these blocks, number of replicas…
• Keeps everything in main memory (RAM)
– Limiting factor = number of files
– 60M objects in 16 GB
DataNode
• Manages and monitors the state of the blocks stored on the host file system (often Linux)
• Directly accessed by the clients
→ data never transits through the NameNode
• Sends heartbeats to the NameNode to show that the server has not failed
• Reports to the NameNode if blocks are corrupted
Writing a file
• The client sends a query to the NameNode to create a new file
• The NameNode checks
– Client authorizations
– File system conflicts (existing file…)
• The NameNode chooses DataNodes to store the file and its replicas
– DataNodes are "pipelined"
• Blocks are allocated on these DataNodes
• The stream of data is sent to the first DataNode of the pipeline
• Each DataNode forwards the data it receives to the next DataNode in the pipeline
Reading a file
• The client sends a request to the NameNode to read a file
• The NameNode checks that the file exists and builds a list of DataNodes containing the first blocks
• For each block, the NameNode sends the addresses of the DataNodes hosting it
– List ordered w.r.t. proximity to the client
• The client connects to the closest DataNode containing the 1st block of the file
• When a block read ends:
– Close the connection to the DataNode
– Open a new connection to the DataNode containing the next block
• When all these blocks are read:
– Query the NameNode to retrieve the next batch of blocks
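From a client program, this whole exchange is hidden behind the HDFS Java API. A minimal sketch of writing then reading a file (the path and contents are made up for illustration; FileSystem, Path, and the stream classes are the standard org.apache.hadoop.fs API):

import java.nio.charset.StandardCharsets;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

public class HdfsReadWrite {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(conf); // talks to the NameNode
        Path path = new Path("/dir/toto.txt"); // hypothetical path

        // Write: the client streams data to a pipeline of DataNodes
        try (FSDataOutputStream out = fs.create(path)) {
            out.write("hello hdfs\n".getBytes(StandardCharsets.UTF_8));
        }

        // Read: the client fetches blocks directly from the DataNodes
        try (FSDataInputStream in = fs.open(path)) {
            IOUtils.copyBytes(in, System.out, 4096, false);
        }
    }
}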
HDFS structure
[Figure: HDFS structure, with the blocks (1, 2, 3, 4) of a file replicated across several DataNodes]
HDFS commands (directories)
• Create directory dir
$ hadoop dfs -mkdir /dir
• List HDFS content
$ hadoop dfs -ls
• Remove directory dir
$ hadoop dfs -rmr /dir
HDFS commands (files)
• Copy local file toto.txt to HDFS dir/
$ hadoop dfs -put toto.txt dir/toto.txt
• Copy HDFS file to local disk
$ hadoop dfs -get dir/toto.txt ./
• Read file /dir/toto.txt
$ hadoop dfs -cat /dir/toto.txt
• Remove file /dir/toto.txt
$ hadoop dfs -rm /dir/toto.txt