Copyright © 2016 Splunk Inc. Raffael Eiler, Senior Engineer, Bosch; Juergen Magiera, ITSI Lead Architect EMEA, Splunk

Correlated Monitoring of an Enterprise ALM Environment at Bosch


Page 1: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Copyright © 2016 Splunk Inc.

Raffael Eiler, Senior Engineer, Bosch

Correlated Monitoring of an Enterprise ALM Environment at Bosch

Juergen Magiera, ITSI Lead Architect EMEA, Splunk

Page 2: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Disclaimer

During the course of this presentation, we may make forward-looking statements regarding future events or the expected performance of the company. We caution you that such statements reflect our current expectations and estimates based on factors currently known to us and that actual events or results could differ materially. For important factors that may cause actual results to differ from those contained in our forward-looking statements, please review our filings with the SEC. The forward-looking statements made in this presentation are being made as of the time and date of its live presentation. If reviewed after its live presentation, this presentation may not contain current or accurate information. We do not assume any obligation to update any forward-looking statements we may make. In addition, any information about our roadmap outlines our general product direction and is subject to change at any time without notice. It is for informational purposes only and shall not be incorporated into any contract or other commitment. Splunk undertakes no obligation either to develop the features or functionality described or to include any such feature or functionality in a future release.

Page 3: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Who We Are

What We Do

Page 4: Correlated Monitoring of an Enterprise ALM Environment at Bosch

About us

Raffael Eiler ([email protected])
– Robert Bosch GmbH, Stuttgart, Germany
– ClearCase and Rational Team Concert Deployment Expert

Juergen Magiera ([email protected])
– Splunk, Munich, Germany
– ITSI Architect and Lead EMEA

Page 5: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Overview of the Bosch Group

Page 6: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Bosch – technology to enhance quality of life

• Some 56,000 researchers and developers work at Bosch: at 118 locations worldwide, in a single network.

• Bosch is one of the world’s leading international providers of technology and services.

• Over the past five years, Bosch has invested more than 24 billion euros in research and development.

• Our objective: to develop innovative, useful, and exciting products and solutions to enhance quality of life – technology that is “Invented for life.”

Page 7: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Electronics & Software Development Platforms: Products & Services

• Electronics development platforms
• Software development platforms: ClearCase, Doors, ClearQuest
• WTS/VDI

Page 8: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Bosch CLM infrastructure

Facts and figures

• IBM CLM is the preferred tool for ALM*1 within the Bosch Group
• IBM CLM is a set of web apps hosted in WebSphere 8.5, running on virtual Windows servers
• Oracle 11 (RAC) is used as the database
• CLM is in a ramp-up phase in most product lines
• Many software developers use the system (about 3000 concurrent sessions)
• The CLM system is essential for steps in the software development process
• Unplanned system outages have to be minimized

*1: ALM = Application Lifecycle Management

Page 9: Correlated Monitoring of an Enterprise ALM Environment at Bosch

CLM System Types at Bosch

Type | DB | Count | Availability | Version | Purpose
P-Systems | Oracle | 18 | Highest | 5.0.2 | Productive servers
Q-Systems | Oracle | 9 | High | 5.0.2 | Staging servers, for acceptance testing
Test-Servers | DB2 | ~10 | Low | 5.0.2 | Used by PL tools teams for process developments, playgrounds
Development-Servers | DB2, Oracle | 6 | Lowest | 5.0.2 and 6.0.1 | For PL plugin development. Customers have the JazzAdmin role.
Beta-Servers | DB2 | 2 | Low | 6.0.1 Mx | To host and show the upcoming pre-release versions (Mx/RCx)
Demo-Servers | DB2 | 2 | Low | 5.0.2 and 6.0.1 | General playground and product showcase for anyone interested. Stable version.
Training-Server | DB2 | 1 | Low | 5.0.2 | For user training
Proxy-Servers | Squid | 13 | Highest | 3.1.10 | For remote access at each location, based on customer request

Page 10: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Splunk Topology in 2015

Diagram components: indexer; search head; forwarders installed on many servers; End-2-End performance clients

• We started with one indexer and one search head running on Windows

• All Splunk servers were managed by the team itself

• Over time, Splunk usage rose very fast

• Performance issues (slow response, concurrent searches, …) came up during daily usage

Page 11: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Splunk Topology today

Diagram components: universal forwarder; heavy forwarder; various other inputs; End-2-End performance clients; all-in-one Splunk server (indexer, search head, deployment server)

• Every product (e.g. Subversion or ALM) has its own Splunk server

• All Splunk servers (basic operation) are managed by a Bosch internal service provider

• Splunk configuration (inputs.conf, scripts, alerts, dashboards, …) is under our control

• Splunk servers are running on Linux

• If we recognize performance issues, we will split indexer and search head

Page 12: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Our use cases

• Windows: registry, event logs, file system, WMI, PerfMon, log files, …
• Linux/Unix: configurations, syslogs, file system, ps, iostat, netstat, top, …, log files, …
• Virtualization & Cloud: hypervisor, guest OS, apps, cloud, …
• Applications: web logs, Log4J, JMX, scripts, …
• Databases: configurations, audit/query logs, tables, size, …
• Networking: network interface, configurations, SNMP, …

Legend: implemented

Page 13: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Example: Heap Increase

• Before heap increase
  • Heap usage constantly above 80%
  • Very frequent garbage collection cycles
  • Lots of hung threads

• After heap increase
  • GC less stressed
  • Fewer hung threads
  • -> Less impact for the user
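The "constantly above 80%" symptom on this slide can be turned into a simple check. The following is a minimal sketch, not the logic used at Bosch: the function name, the 80% default, the three-sample window, and the sample readings are all assumptions for illustration.

```python
from typing import Sequence


def sustained_above(samples: Sequence[float], threshold: float = 80.0,
                    min_consecutive: int = 3) -> bool:
    """Return True if `min_consecutive` samples in a row exceed `threshold`."""
    run = 0
    for value in samples:
        # Extend the current run, or reset it when a sample drops back.
        run = run + 1 if value > threshold else 0
        if run >= min_consecutive:
            return True
    return False


# Hypothetical heap usage readings in percent, one per polling interval.
before_increase = [82.5, 85.0, 91.2, 88.4, 84.1]
after_increase = [55.0, 61.3, 83.0, 58.7, 60.2]

print(sustained_above(before_increase))  # True
print(sustained_above(after_increase))   # False
```

Requiring several consecutive samples avoids flagging a single short spike, such as the one 83.0 reading in the second series.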

Page 14: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Example: System Details - RAM

• Optimise usage of RM index in RAM
• Calculate real „RAM left“

Page 15: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Highlights From the End-user's Perspective

• „All-in-one“ solution
• Scales well
• No need to consult monitoring solutions from other teams
• All necessary log files accessible from one place
• Find root causes „on-the-fly“
• „Management-friendly“ reporting / dashboards

Page 16: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Feedback From My Colleagues

Splunk provides early warnings if certain parameters of the system start to leave the safe boundaries, e.g. free disk space, heap usage, CPU usage.

Stefan O.

Splunk is my first stop in case of problems. I can quickly check what errors have been logged, and where. It is also really useful to track the system load and resource consumption. We have graphs with matching timelines that allow you to easily detect patterns across different data sources, or even different servers.

Volker G.

Splunk informs me when heap usage is high so I can consider increasing the heap long before users complain about performance issues.

Danny M.

Page 17: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Current Splunk activities (NetIQ phase-out)

In the past we used NetIQ as our monitoring system, provided by another department within Bosch.

With Splunk we now have a system that is/has …
• … the possibility of implementing any kind of change very quickly
• … a stable and well-performing solution
• … easy to learn and very useful in daily work as a sysadmin

Page 18: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Current Splunk activities (ITSI)

Page 19: Correlated Monitoring of an Enterprise ALM Environment at Bosch

ClearQuest Glass Table Details

ITSI glass table for the Rational ClearQuest (RQ1) service

KPIs:
• Selenium end-to-end transaction time
• Overall health of the IHS application
• Detailed performance metrics on WAS
• Response time for the DB

Value:
• Get notified early about poor response times for customers
• At-a-glance view of current and historic performance metrics along the whole service chain
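The end-to-end transaction time KPI needs a client that times each scripted step and writes the result somewhere Splunk can read it. The sketch below shows one possible shape under stated assumptions: `timed_event`, the field names, and `fake_login` are hypothetical, and the stand-in step sleeps instead of driving a real browser through Selenium.

```python
import time
from typing import Callable


def timed_event(name: str, transaction: Callable[[], None]) -> str:
    """Run `transaction` and return a key=value line with its duration."""
    start = time.perf_counter()
    status = "ok"
    try:
        transaction()
    except Exception:
        # A failed step is still reported, so outages show up as events.
        status = "failed"
    duration_ms = (time.perf_counter() - start) * 1000.0
    return f'transaction="{name}" status={status} duration_ms={duration_ms:.1f}'


def fake_login() -> None:
    # Stand-in for a real browser-driven step (e.g. a Selenium script).
    time.sleep(0.01)


print(timed_event("login", fake_login))
```

Writing the measurement as key=value pairs is a deliberate choice here: Splunk can extract such pairs as fields at search time without extra parsing rules.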

Page 20: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Current Splunk Activities (ITSI)

Page 21: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Planned Implementation (additional info)

Planned:
• Reporting
• Management view (e.g. dashboard with traffic lights)
• Long-term monitoring (trend analysis)
• Historical, cumulated data
• Different dashboards for different interests (managers, technical staff, problem analysis, quick overview, …)
• E2E test results (Selenium)
• Number of HTTP requests

Implemented:
• License statistics
• Log files
• Monitoring WAS with JMX
• Most system resources (PerfMon)
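A management view with traffic lights comes down to mapping each KPI value onto three states. The following sketch illustrates the idea only; the function, the KPI names, and every threshold are assumptions rather than values from the Bosch dashboards.

```python
def traffic_light(value: float, warn: float, critical: float) -> str:
    """Map a KPI value to green/yellow/red; higher values are worse."""
    if value >= critical:
        return "red"
    if value >= warn:
        return "yellow"
    return "green"


# Hypothetical KPI readings as (value, warn threshold, critical threshold).
kpis = {
    "heap_usage_pct": (72.0, 80.0, 90.0),
    "cpu_usage_pct": (93.0, 75.0, 90.0),
    "e2e_response_s": (4.2, 3.0, 6.0),
}
for name, (value, warn, critical) in kpis.items():
    print(f"{name}: {traffic_light(value, warn, critical)}")
```

With these sample values, heap usage maps to green, CPU usage to red, and the end-to-end response time to yellow.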

Page 22: Correlated Monitoring of an Enterprise ALM Environment at Bosch

Conclusion – next steps

To be evaluated:
• SSL (certificate expiration)
• Monitoring caching proxies, for example:
  – How much data is provided through the cache?
• CSM (CLM Server Monitoring) integration
  – Get application data to correlate with system resources, e.g. heap size:
    • How many users are working?
    • How many work items were created today?
• ESX monitoring
• Network monitoring (whole route, not just the network interface)
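For the certificate expiration item, one possible building block is a helper that converts a certificate's `notAfter` timestamp into days remaining. This is a sketch under assumptions: the function name and the sample dates are illustrative, and the slides do not say how the check would be implemented.

```python
import ssl
from datetime import datetime, timezone


def days_until_expiry(not_after: str, now: datetime) -> int:
    """Days between `now` and a certificate's notAfter timestamp."""
    # cert_time_to_seconds parses the "Mon DD HH:MM:SS YYYY GMT" format
    # used in certificate dictionaries and returns seconds since the epoch.
    expires = datetime.fromtimestamp(
        ssl.cert_time_to_seconds(not_after), tz=timezone.utc
    )
    return (expires - now).days


# Hypothetical expiry date and reference time.
not_after = "Jan  1 00:00:00 2030 GMT"
now = datetime(2029, 12, 2, tzinfo=timezone.utc)
print(days_until_expiry(not_after, now))  # 30
```

On a live connection, the `notAfter` string can be read from the dictionary returned by `ssl.SSLSocket.getpeercert()`; the resulting number could then be logged and alerted on like any other metric.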

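The question of how much data is served through the cache can be approximated from proxy access logs. The sketch below assumes Squid's native access.log layout, where the fourth whitespace-separated field is the result code and status (for example `TCP_HIT/200`) and the fifth is the reply size in bytes. The function name, the sample lines, and the rule that any result code containing "HIT" counts as a cache hit are simplifying assumptions.

```python
from typing import Iterable


def byte_hit_ratio(lines: Iterable[str]) -> float:
    """Fraction of bytes delivered with a cache-hit result code."""
    hit_bytes = 0
    total_bytes = 0
    for line in lines:
        fields = line.split()
        if len(fields) < 5:
            continue  # skip blank or truncated lines
        result_code = fields[3].split("/")[0]
        size = int(fields[4])
        total_bytes += size
        if "HIT" in result_code:
            hit_bytes += size
    return hit_bytes / total_bytes if total_bytes else 0.0


# Hypothetical log lines in Squid's native format.
sample = [
    "1466000000.123     12 10.0.0.5 TCP_HIT/200 3000 GET http://clm.example/a.js - NONE/- application/javascript",
    "1466000001.456    340 10.0.0.6 TCP_MISS/200 1000 GET http://clm.example/b - DIRECT/10.0.0.9 text/html",
]
print(byte_hit_ratio(sample))  # 0.75
```

A byte-based ratio is used rather than a request-based one because the slide asks about the amount of data, not the number of requests.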

Page 23: Correlated Monitoring of an Enterprise ALM Environment at Bosch

THANK YOU