2
MongoDB and Hadoop can replace relational database management systems (RDBMS), such as Oracle, SQL Server, DB2 and MySQL. This has been branded as “NoSQL” as they do not generally use SQL. Relational Database Systems were designed when disk space and CPU were more pre- cious commodities than they are today. MongoDB and Hadoop exploit these highly avail- able commodities and scale with less effort. MongoDB and Hadoop distribute the load across multiple machines more efficiently. NoSQL/BigData systems require less work to scale to large concurrency or datasets While NoSQL solutions do not typically support RDBMS style transactions, they support other methods of data integrity. Hadoop has been installed in larger systems than MongoDB. MongoDB is more efficient than Hadoop. Generally, MongoDB is optimal for typical websites and CRUD (Create, Retrieve, Update, Delete) or “transactional” applications. Generally, Hadoop replaces OLAP or reporting systems. Hadoop does not provide high availability, but other vendors have derived solutions which do. MongoDB provides high availability, this may make expensive solutions such as Oracle RAC or SQL Server clustering unnecessary. In some cases Hadoop and MongoDB are used together in the same system. NoSQL, BigData: MongoDB vs Hadoop Technical Technology Hadoop MongoDB Largest System Typical Usage Business Structure Best Feature Biggest Limitation Maturity Complexity Java C ++ native code 4000 nodes, 100 TB 100 nodes, 5 TB reporting analytics, “map-reduce” on large datasets high volume systems, moderately sized data sets Non-profit (Apache) backed by multiple vendors For-profit (10gen) Flexibility, map-reduce implementation light weight, better performance single point of failure each node is single- threaded Widely deployed, but less productized Newer, less widely deployed, but more productized High Moderate MongoDB vs Hadoop Big Solutions for Big Problems Executive Summary

Hadoop vs MongoDB

Embed Size (px)

DESCRIPTION

Hadoop vs MongoDB

Citation preview

Page 1: Hadoop vs MongoDB

• MongoDBandHadoopcanreplacerelationaldatabasemanagementsystems(RDBMS),suchasOracle,SQLServer,DB2andMySQL.Thishasbeenbrandedas“NoSQL”astheydonotgenerallyuseSQL.

• RelationalDatabaseSystemsweredesignedwhendiskspaceandCPUweremorepre-ciouscommoditiesthantheyaretoday.MongoDBandHadoopexploitthesehighlyavail-ablecommoditiesandscalewithlesseffort.MongoDBandHadoopdistributetheloadacrossmultiplemachinesmoreefficiently.

• NoSQL/BigDatasystemsrequirelessworktoscaletolargeconcurrencyordatasets• WhileNoSQLsolutionsdonottypicallysupportRDBMSstyletransactions,theysupport

othermethodsofdataintegrity.• HadoophasbeeninstalledinlargersystemsthanMongoDB.• MongoDBismoreefficientthanHadoop.• Generally,MongoDBisoptimalfortypicalwebsitesandCRUD(Create,Retrieve,Update,

Delete)or“transactional”applications.• Generally,HadoopreplacesOLAPorreportingsystems.• Hadoopdoesnotprovidehighavailability,butothervendorshavederivedsolutions

whichdo.• MongoDBprovideshighavailability,thismaymakeexpensivesolutionssuchasOracle

RACorSQLServerclusteringunnecessary.• InsomecasesHadoopandMongoDBareusedtogetherinthesamesystem.

NoSQL,BigData:MongoDBvsHadoop

Technical

Technology

Hadoop MongoDB

Largest System

Typical Usage

Business Structure

Best Feature

Biggest Limitation

Maturity

Complexity

Java C ++ native code

4000 nodes, 100 TB 100 nodes, 5 TB

reporting analytics, “map-reduce” on large datasets

high volume systems, moderately sized data sets

Non-profit (Apache) backed by multiple vendors

For-profit (10gen)

Flexibility, map-reduce implementation

light weight, better performance

single point of failureeach node is single-threaded

Widely deployed, but less productized

Newer, less widely deployed, but more productized

High Moderate

MongoDBvsHadoopBig Solutions for Big Problems

ExecutiveSummary

Page 2: Hadoop vs MongoDB

Organizational

MongoDB

Hadoop

• Adobedobe• EBay• Facebook• FOXAudienceNetwork• Hulu• LinkedIn• NAVTEQ• TheNewYorkTimes

KeyInstallations

• 10gen• SAP• RedHat• FlybridgeCapitalPartners• SequoiaCapital• UnionSquareVentures

• Craigslist• UKGovernment(NationalArchives,UK.gov)• Shutterfly• Forbes• TheNewYorkTimes• Inuit• FourSquare• LexisNexis

KeyInstallationsKeyBackers

KeyBackers

• Cloudera(vendor)• HortonWorks(vendor)• IBM• Yahoo!

BottomLineNewsystemswhichrequirehigherendscalabilitycanutilizeMongoDBand/orHadooptoscalewithlesseffort.MongoDBcanprovidehighavailabilityatafractionofthecostofOracleRACandsimilarsolutions.Lackofstandardizationinthisareawillrequirewritingspecificallytotheirinterface.ThetechnologyisnotasmatureasRDBMSsoftware,butalreadywidelydeployedandreadyforusebymainstreambusi-nesses.Differentcapabilitiesineachoftheseimplementationsmakethemmorereadilyappropriatetoreplacedifferenttypesofsystems(OLAPvsOTAP).AwiderecosystemaroundApacheHadoopispositivebutsinglevendorsupportofMongoDBmaymakeabusiness/supportarrangementmorestraightforward.BothonsiteinstallationsandCloudPlatformasaService(PaaS)usesarepossiblewithbothMongoDBandHadoop.Seeourdetailedreportathttp://osintegrators.com/MongoAndHadoop

www.osintegrators.com345W.MainSt.Suite201Durham,NC27701(919)[email protected]

Contact