Deploying Splunk. Arquitetura e dimensionamento do Splunk

Simeon Yep Sales Engineering Manager, Splunk

ArchitecAng and Sizing your Splunk Deployment

#splunkconf

Legal NoAces During the course of this presentaAon, we may make forward-‐looking statements regarding future events or the expected performance of the company. We cauAon you that such statements reflect our current expectaAons and esAmates based on factors currently known to us and that actual events or results could differ materially. For important factors that may cause actual results to differ from those contained in our forward-‐looking statements, please review our filings with the SEC. The forward-‐looking statements made in this presentaAon are being made as of the Ame and date of its live presentaAon. If reviewed aTer its live presentaAon, this presentaAon may not contain current or accurate informaAon. We do not assume any obligaAon to update any forward-‐looking statements we may make. In addiAon, any informaAon about our roadmap outlines our general product direcAon and is subject to change at any Ame without noAce. It is for informaAonal purposes only and shall not, be incorporated into any contract or other commitment. Splunk undertakes no obligaAon either to develop the features or funcAonality described or to include any such feature or funcAonality in a future release.

Splunk, Splunk>, Splunk Storm, Listen to Your Data, SPL and The Engine for Machine Data are trademarks and registered trademarks of Splunk Inc. in the United States and other countries. All other brand names, product names, or trademarks belong to their respecCve

owners.

IntroducAon

About Me !   5+ years @ Splunk !   Experience:

–  SupporAng, administering, and architecAng large scale deployments –  OEM – technical sales –  Strategic accounts – technical sales

!   Based in HQ (San Francisco office) !   Currently: Business Development, Technical Synergies

Agenda !   Sizing Fundamentals !   ArchitecAng Fundamentals !   Deployment Topologies

Sizing Fundamentals

Sizing Fundamentals !   Understand the sizing factors !   Data volume !   Search volume !   Sizing sheet

Sizing Factors !   How much data (raw sizes)?

–  Daily volume –  Peak volume –  Retained volume (archive size) –  Future volume?

!   How much searching? –  Use cases –  How many people? How oTen?

!   Jobs –  SummarizaAon, alerAng, reporAng

Data Volumes

!   EsAmate input volume –  Verify raw log sizes –  Leverage _internal metrics to get actual input volumes

!   Confirm esAmates with actual data –  Create a baseline with real or simulated data –  Find compression rates (range from 30%-‐120%, typically 50%) –  Determine retenAon needs

!   Document use cases –  Use case determines search needs –  Plan for expansion as adopAon grows (search and volume)

Data Sizing Exercise !   Via Filesystem !   Use the Splunk log files: metrics.log or license_usage.log !   OpAonally:

–  License report view in Splunk Enterprise 6 –  S.o.S app in 5.x

Search Volumes !   Gather use case informaAon

–  How much ad-‐hoc searching? –  How much background searching?

!   Ad-‐hoc searching –  Evaluate the data being searched –  Evaluate the Ame duraAon (real-‐Ame vs historic) –  Real-‐Ame searches are typically less overhead

!   Background searching –  AlerAng and monitoring –  General reports –  Summary indexing

Final Sizing Numbers !   Data capacity

–  Daily and peak !   User capacity

–  Concurrent and total !   Search capacity

–  Concurrent and total *Document the use cases!!

Architecture

Architecture !   Splunk server roles: distributed/clustered deployments !   Reference server !   Rules of thumb !   Hardware factors

Splunk Distributed Roles

Indexer

Search Head (regular and job server)

Forwarder (universal)

indexer

forwarder

search head

Splunk Distributed Roles

Cluster Master (clustering/replicaAon requirement)

License Master

Deployment Server

Recommended ConfiguraAons Stand-‐ alone

Indexer (distributed)

Search head (distributed)

Indexer (clustered)

Search head

(clustered)

Cluster master

(clustered)

Forwarding * * * * *

Searching √ √ * √ Indexing √ √ * √

Deployment server

License master

* √ * *

Cluster master √

√ -‐ common * -‐ uncommon

What’s a “Reference” Server? !   Sizing based on commodity x86 servers !   Dual quad-‐core CPUs at 3.0 GHz (dual six core is common) !   8 GB of RAM – (16 GB is common) !   64-‐bit OS !   4x10k RPM local SAS drives in RAID 1+0 (800+ IOPs) !   VariaAons cause corresponding changes in performance/requirements

Rules of Thumb !   These all have excepAons and qualificaAons !   1 reference indexer per 100 GB/day !   1 reference search head per 8 to 12 users !   1 reference job server per 20 concurrent jobs !   1 deployment server per 3000 polls/min !   ReplicaAon later…

How Many Indexers? !   Rule of thumb says: 1 per 100 GB/day !   Leaves room for:

–  Daily peaks –  Light searching and reporAng for about 3 concurrent users

!   Need more indexers for: –  Heavy reporAng –  More users –  Slower disks, slower CPUs, fewer CPUs

How Many Search Heads?

!   Rule of thumb says: 1 per 8 to 12 concurrent users !   Limit is concurrent queries !   30-‐50 web sessions !   1:1 raAo of search query to CPU core !   Only add first search head if ≥3 indexers !   Don’t add search heads; add indexers: indexers do most work !   But you need more if:

–  Running a lot of scheduled jobs on the search head

Search Head vs. Job Server !   Search Head Pooling (SHP): uses NFS to manage user profiles/configuraAons and job queue

!   Search head and job server are equivalent with SHP !   Use job servers for scheduled searches (summaries, alerts, and reports)

!   Use search heads for ad-‐hoc searching

How Many Deployment Servers? !   Rule of thumb says: 1 per 3000 polls/minute !   Just use one deployment server, and adjust the polling period !   Small deployments can share the same splunkd !   Low requirement for disk performance (good candidate for virtualizaAon)

!   Windows OS – 1 per 500 polls/minute !   Or use something other than deployment server

–  Puppet, SCCM, cfengine, chef…

More is Bexer? !   CPUs

–  Search process uAlizes up to 1 CPU core (1:1) –  Indexers sAll need to do the heavy liTing (search exists on indexer AND

search head) –  Limited benefit for indexing (up to 2 CPU cores for indexing)

!   Memory –  Good for search heads and indexers (16+ GB)

!   Disks –  Faster is bexer (15k rpm) –  More disks in RAID 1+0 = faster

System change Search Speed Indexing Speed

Faster disks ++ ++

Add an indexer ++ ++

Add a search head + Report acceleraAon/

summaries ++

Performance and Sizing Tips

System change Search Speed Indexing Speed

OpAmize searches +++

OpAmize field extracAon +

OpAmize input parsing + Faster CPU + +

Performance and Sizing Tips

Capacity à Architecture !   Sizing recipe

–  Capacity –  Rules of thumb determines number of servers

!   Building blocks for architecture

Architecture Factors !   What are my sizing requirements? !   Where is the data? !   Where are the users? !   What is the security policy? !   What are the retenAon and compliance policies? !   What is the availability requirement? !   What about the cloud?

Architecture Factors !   What are my sizing requirements?

–  Data capacity –  Search capacity –  User capacity

!   Obtained from the sizing process

Architecture Factors !   Where is the data?

–  Local or remote to the indexing machine –  If remote – use forwarders when possible –  Index in local data center (zone) or index centrally –  Persist network data to disk as a best pracAce –  Use intermediate forwarders to distribute data

!   Where are the users? –  User experience affected by search head locaAon

ê  Time zone tuning (5.x +) ê  Distributed search over LAN vs WAN

Architecture Factors !   What is the Security Policy?

–  Apply user security policies ê  Auth method ê  Roles ê  Filters

–  Apply physical security policies ê  Index locaAon

Architecture Factors !   RetenAon, compliance, governance

–  Where is the data allowed to be? –  Where is the data not allowed to go? –  Where must the data go?

!   Availability –  Local failover, fault-‐tolerance, clustering –  Geographic disaster recovery/fault-‐tolerance –  Index replicaAon!

Architecture Factors !   Same old story !   Cloud consideraAons

–  AuthenAcaAon restricAons –  Data transfer costs –  Security – SSL tunnel –  Zones

Topologies

Topology Examples

Centralized Decentralized Hybrid

Architecture Factors à Topology

Centralized Topology

Indexers

Forwarders

Syslog Devices

Intermediate Forwarder Forwarders

Search Head Pooling

Decentralized Topology

Search Head Pooling

Hybrid Topology

Scaling and Expansion !   Add to your indexer pool for more performance or capacity

–  Mixed pla{orm and hardware is okay

!   Use search head pooling for more UI capacity –  Requires NFS

!   Create new indexes for new data types –  Follows best pracAces

Index ReplicaAon (aka Clustering) !   What is it?

–  Indexes are replicated to 1 or more indexers (tunable) –  Splunk controlled

!   Basics –  Master node (manages indexing and searching locaAon) –  Distributed deployment –  NOT = “index and forward “

!   HA license VS. index replicaAon –  HAL – Separate fully funcAoning Splunk deployments –  IR – Data is made available on 1 or more indexers

Index ReplicaAon !   Rule of thumb: 1 per 50 GB/day

–  Assume simple replicaAon (2 in existence) –  Increase in I/O, CPU, and disk requirement

!   Need more indexers if: –  Increase in replicaAon factor –  Performance or capacity needs (search and index)

Index ReplicaAon (aka – Clustering) Search Head Pooling

Peer Nodes

Cluster Master

Forwarders

Index ReplicaAon !   Data is replicated and made available !   WAN configuraAon is not recommended !   Careful consideraAon when inserted into standard topologies !   Increases

–  Storage requirement –  I/O requirement (disk, network) –  Total indexer requirement

!   Disaster recovery and high availability .conf session

Final Thoughts !   Sizing is more than data volume – it’s also search load !   Centralized architecture is the baseline !   VariaAons on architecture are driven by

–  Sizing –  Data locaAon –  User locaAon –  RetenAon/access/governance –  Availability requirements

Next Steps

Download the .conf2013 Mobile App If not iPhone, iPad or Android, use the Web App

Take the survey & WIN A PASS FOR .CONF2014… Or one of these bags! View the sessions listed on the next slide Available on the Mobile App

More InformaAon !   Contact: syep@splunk.com !   DocumenaAon: hxp://docs.splunk.com !   Answers: hxp://answers.splunk.com !   Other presentaAons

–  Best PracAces: Deploying Splunk on Physical, Virtual, and Cloud Environments –  ArchitecAng Splunk for High Availability and Disaster Recovery

THANK YOU

Deploying Splunk. Arquitetura e dimensionamento do Splunk

Technology

Splunk For IT Operational Intelligenceuploads.integrity360.com/1425903243-Splunk-for-IT-Operations.pdf · • Splunk Apps –Splunk App for VMware –Splunk Apps for Citrix & Hyper

Splunk Review - University of Birmingham · Ref: Splunk Review, Dec 2014 Mohammad Rameez Shafsad itinnovation@contacts.bham.ac.uk Indexing 2.2.1 Searching using splunk Splunk uses

Referent / Redner Benjamin Tiggemann · 2015-07-08 · Splunk for Exchange Splunk for Enterprise Security* Splunk for Vmware Splunk for PCI Compliance* Splunk for Windows Splunk for

Splunk Cloud Product Briefwinncom.com.ua/wp-content/uploads/2018/07/splunk-Log_Flow.pdf · Extend Benefits With Splunk Apps Splunk Cloud includes support for premium solutions such

stoQ’ing your Splunk - SANS · PDF filestoQ’ingyour Splunk Ryan Kovar, Splunk Marcus LaFerrera, PUNCH SANS DFIR 2016

Splunk Spark Integration - GitHub Pageslitaotao.github.io/files/4. splunk_spark.pdf · Splunk Spark Integration ... • Splunk"products:""! Splunk"Enterprise"! ... splunk_spark Created

Industrial Data Internet of Things splunk (REST API, SDKs) splunk . TESCO splunk 500 Y—Jt1-3— 41328 . SHANON 1/5 splunk . htt . (500MB/B) s lunk splunk . Created Date:

Introduction and purpose · cisco Introduction and purpose: This document provides details steps for deploying Splunk version 4.2 and 4.3 release, Configuring and Implementing Splunk

SPLUNK® ENTERPRISE - CDWwebobjects.cdw.com/.../Splunk/Product-Brief-Splunk-Enterprise-The... · Splunk Enterprise collects data from any source, including logs, clickstreams, sensors,

Splunk conf2014 - Splunk for Data Science

Splunk のご紹介 - hitachi.co.jp · Splunkの歴史. 5 . 2006 . Splunk 1 . Splunk 2 . 2007 . Splunk 3 . 2009 . Splunk 4 . 2010 . Splunk 4.1 . 2011 Splunk 4.2 . App for Enterprise

Splunk Developer & Admin Certification Training...This Splunk course also includes various topics of Splunk, such as installation and configuration, Splunk Syslog, Syslog Server, log

TimeACerTime*–** ComparingTimeRangesinSplunk* · AboutMe! SplunkSeniorInstructorsince2009! FrequentcontributortoSplunkAnswers! LoveSplunksearchlanguagepuzzles*

Splunk Conf2010: Corporate Express presents Splunk with SAP

FireEye Splunk InterFireEye + Splunk: Intermediate Guidemediate Guide

DeployingSplunkEnterprise*On** Microso=AzureIaaS* SplunkAzureDeployment@Microso=Oﬃce Provisioning&&&AutomaNon& AzureBestPracCces Splunk&AzureIntegraons 49 Provisioning&Automaon*

データ分析プラットフォーム「Splunk」eww.gp-club.panasonic.co.jp/data/splunk/splunk.pdfTitle データ分析プラットフォーム「Splunk」 Author パナソニック

FLASHSTACK CI FOR SPLUNK REFERENCE ARCHITECTURE … · FLASHSTACK CI FOR SPLUNK REFERENCE ARCHITECTURE A Framework for Deploying Splunk® Enterprise at Scale June 2017

Rivium Splunk Windows · o Splunk Enterprise Security * o uberAgent* o Splunk App for Web Analytics Common Splunk Apps & Add-ons 15 SplunkingWIndows o Splunk Add-on for Microsoft

SplunkingYour Mobile Apps - .conf19 | Splunk · Introducing Splunk for Mobile Intelligence (Splunk MINT) Deploying Splunk MINT – Intro to SDKs – Getting Started Using Splunk MINT