Elasticsearch for Logs & Metrics - a deep dive

Elasticsearch for logs and metrics (a deep dive)

Rafał Kuć and Radu GheorgheSematext Group, Inc.

About us

LogseneSPM

ES API

metrics

Products Services

Agenda

Index layout

Cluster layout

Per-index tuning of settings and mappings

Hardware+OS options

Pipeline patterns

Daily indices are a good start

...indexing, most searches

Indexing is faster in smaller indices

Cheap deletes

Search only needed indices

“Static” indices can be cached

The Black Friday problem*

* for logs. Metrics usually don’t suffer from this

Typical indexing performance graph for one shard*

* throttled so search performance remains decent

At this point it’s better to index in a new shard

Typically 5-10GB, YMMV

Y U NO AS FAST

more merges

more expensive

(+uncached) searches

Mostly because

Rotate by size*

* use Field Stats for queries or rely on query cache:https://github.com/elastic/kibana/issues/6644

Aliases; Rollover Index API*

* 5.0 feature

Slicing data by time

For spiky ingestion, use size-based indices

Make sure you rotate before the performance drop(test on one node to get that limit)

Multi tier architecture (aka hot/cold)

Client

Master

We can optimize data nodes layer

Ingest

logs_2016.11.07

indexing

es_hot_1 es_cold_1 es_cold_2

logs_2016.11.07logs_2016.11.08

indexing

curl -XPUT localhost:9200/logs_2016.11.07/_settings -d '{

"index.routing.allocation.exclude.tag" : "hot",

"index.routing.allocation.include.tag": "cold"

logs_2016.11.08 logs_2016.11.07

indexing

logs_2016.11.11 logs_2016.11.07logs_2016.11.09

logs_2016.11.08logs_2016.11.10

indexing, most searches long running searches

good CPU, best possible IO heap, IO for backup/replication and stats

SSD or RAID0 for spinning

Hot - cold architecture summary

Costs optimization - different hardware for different tier

Performance - above + fewer shards, less overhead

Isolation - long running searches don't affect indexing

Elasticsearch high availability & fault tolerance

Dedicated masters is a mustdiscovery.zen.minimum_master_nodes = N/2 + 1

Keep your indices balancednot balanced cluster can lead to instability

Balanced primaries are also goodhelps with backups, moving to cold tier, etc

total_shards_per_node is your friend

Elasticsearch high availability & fault tolerance

When in AWS - spread between availability zonesbin/elasticsearch -Enode.attr.zone=zoneAcluster.routing.allocation.awareness.attributes: zone

We need headroom for spikesleave at least 20 - 30% for indexing & search spikes

Large machines with many shards?look out for GC - many clusters died because of thatconsider running smaller ES instances but more

Which settings to tune

Merges → most indexing time

Refreshes → check refresh_interval

Flushes → normally OK with ES defaults

Relaxing the merge policyLess merges ⇒ faster indexing/lower CPU while indexingSlower searches, but:

- there’s more spare CPU- aggregations aren’t as affected, and they are typically the bottleneck

especially for metricsMore open files (keep an eye on them!)

Increase index.merge.policy.segments_per_tier ⇒ more segments, less mergesIncrease max_merge_at_once, too, but not as much ⇒ reduced spikesReduce max_merged_segment ⇒ no more huge merges, but more small ones

And even more settingsRefresh interval (index.refresh_interval)*

- 1s -> baseline indexing throughput- 5s -> +25% to baseline throughput- 30s -> +75% to baseline throughput

Higher indices.memory.index_buffer_size higher throughput

Lower indices.queries.cache.size for high velocity data to free up heap

Omit norms (frequencies and positions, too?)

Don't store fields if _source is used

Don't store catch-all (i.e. _all) field - data copied from other fields

* https://sematext.com/blog/2013/07/08/elasticsearch-refresh-interval-vs-indexing-performance/

Let’s dive deeper into storage

Not searches on a field, just aggregations ⇒ index=false

Not sorting/aggregating on a field ⇒ doc_values=false

Doc values can be used for retrieving (see docvalue_fields), so:

● Logs: use doc values for retrieving, exclude them from _source*

● Metrics: short fields normally ⇒ disable _source, rely on doc values

Long retention for logs? For “old” indices:

● set index.codec=best_compression

● force merge to few segments

* though you’ll lose highlighting, update API, reindex API...

Metrics: working around sparse dataIdeally, you’d have one index per metric type (what you can fetch with one call)

Combining them into one (sparse) index will impact performance (see LUCENE-7253)

One doc per metric: you’ll pay with space

Nested documents: you’ll pay with heap (bitset used for joins) and query latency

What about the OS?

Say no to swap

Disk scheduler: CFQ for HDD, deadline for SSD

Mount options: noatime, nodiratime, data=writeback, nobarrier

because strict ordering is for the weak

And hardware?

Hot tier. Typical bottlenecks: CPU and IO throughputindexing is CPU-intensiveflushes and merges write (and read) lots of data

Cold tier: Memory (heap) and IO latencymore data here ⇒ more indices&shards ⇒ more heap

⇒ searches hit more filesmany stats calls are per shard ⇒ potentially choke IO when cluster is idle

Generally:network storage needs to be really good (esp. for cold tier)network needs to be low latency (pings, cluster state replication)network throughput is needed for replication/backup

AWS specifics

c3 instances work, but there’s not enough local SSD ⇒ EBS gp2 SSD*c4 + EBS give similar performance, but cheaper

i2s are good, but expensived2s are better value, but can’t deal with many shards (spinning disk latency)m4 + gp2 EBS are a good balance

gp2 → PIOPS is expensive, spinning is slow3 IOPS/GB, but caps at 160MB/s or 10K IOPS (of up to 256kb) per driveperformance isn’t guaranteed (for gp2) ⇒ one slow drive slows RAID0

Enhanced Networking (and EBS Optimized if applicable) are a must

* And used local SSD as cache. With --cachemode writeback for async writing: https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Logical_Volume_Manager_Administration/lvm_cache_volume_creation.html

block size?

The pipeline

read buffer deliver

The pipeline

read buffer deliver

Log shipper reason #1

The pipeline

read buffer deliver

Log shipper reason #1

Files? Sockets? Network?What if buffer fills up?

Processing before/after buffer?

Others besides Elasticsearch?How to buffer if $destination is down?

Overview of 6 log shippers: sematext.com/blog/2016/09/13/logstash-alternatives/

Types of buffers

buffer

application.log Log file can act as a buffer

Memory and/or disk of the log shipperor a dedicated tool for buffering

Where to do processing

Logstash(or Filebeat or…)

Buffer(Kafka/Redis)

Logstash Elasticsearch

Logstash Buffer(Kafka/Redis)

something else

Logstash Buffer(Kafka/Redis)

something elseOutputs

need to be in sync

Logstash Kafka Logstash Elasticsearch

something elseLogstashElasticsearch

offsetotheroffset

here,too

Where to do processing (syslog-ng, fluentd…)

hereElasticsearch

something else

Where to do processing (rsyslogd…)

herehere

Zoom into processing

Ideally, log in JSON

Otherwise, parse

For performance and maintenance(i.e. no need to update parsing rules)

Regex-based (e.g. grok)Easy to build rules

Rules are flexible

Slow & O(n) on # of rules

Tricks:

Move matching patterns to the top of the list

Move broad patterns to the bottom

Skip patterns including others that didn’t match

Grammar-based(e.g. liblognorm, PatternDB)

Faster. O(1) on # of rules. References:

Logagent

Logstash

rsyslog syslog-ng

sematext.com/blog/2015/05/18/tuning-elasticsearch-indexing-pipeline-for-logs/www.fernuni-hagen.de/imperia/md/content/rechnerarchitektur/rainer_gerhards.pdf

Back to buffers: check what happens if when they fill up

Local files: when are they rotated/archived/deleted?

TCP: what happens when connection breaks/times out?

UNIX sockets: what happens when socket blocks writes?

UDP: network buffers should handle spiky load

Check/increasenet.core.rmem_max net.core.rmem_default

Unlike UDP&TCP,both DGRAM and STREAM

local socketsare reliable/blocking

Let’s talk protocols now

UDP: cool for the app, but not reliable

TCP: more reliable, but not completely

Application-level ACKs may be needed:

No failure/backpressure handling needed

App gets ACK when OS buffer gets it ⇒ no retransmit if buffer is lost*

* more at blog.gerhards.net/2008/05/why-you-cant-build-reliable-tcp.html

sender receiverACKs

Protocol Example shippers

HTTP Logstash, rsyslog, syslog-ng, Fluentd, Logagent

RELP rsyslog, Logstash

Beats Filebeat, Logstash

Kafka Fluentd, Filebeat, rsyslog, syslog-ng, Logstash

Wrapping up: where to log?

critical?

UDP. Increase network buffers on destination, so it can handle spiky

traffic

Paying with RAM or IO?

UNIX socket. Local shipper with memory

buffers, that can drop data if needed

Local files. Make sure rotation is in place or you’ll run out of disk!

IO RAM

Flow patterns (1 of 5)

application.log

Logstash

Elasticsearch

Easy&flexible

Overhead

application.log

Filebeat

Elasticsearch(with Ingest)

Light&simple

Harder to scale processing

sematext.com/blog/2016/04/25/elasticsearch-ingest-node-vs-logstash-performance/

Elasticsearch

files,sockets (syslog?),localhost TCP/UDP

LogagentFluentdrsyslog

syslog-ng

Light, scales

No central control

sematext.com/blog/2016/09/13/logstash-alternatives/

ElasticsearchKafka

FilebeatLogagentFluentdrsyslog

syslog-ng

Good for multiple destinations

More complex

somethingelse

Logstash,custom consumer

Thank you!

Rafał Kućrafal.kuc@sematext.com@kucrafal

Radu Gheorgheradu.gheorghe@sematext.com@radu0gheorghe

Sematextinfo@sematext.comhttp://sematext.com@sematext

Join Us! We are hiring!

http://sematext.com/jobs

Pictureshttps://pixabay.com/get/e831b60920f71c22d2524518a33219c8b66ae3d11eb611429df9c77f/scuba-diving-147683_1280.png

https://pixabay.com/static/uploads/photo/2012/04/18/12/17/firewood-36866_640.png

http://i3.kym-cdn.com/entries/icons/original/000/004/006/y-u-no-guy.jpg

http://memepress.wpgoods.com/wp-content/uploads/2013/06/neutral-feel-like-a-sir-clean-l1.png

Elasticsearch for Logs & Metrics - a deep dive

Technology

Kubernetes Cluster Securing A Multitenant · Fluentd: gathers logs and sends to Elasticsearch Kibana: A web UI for Elasticsearch. Access control Cluster administrators can view all

OSMC 2014: Processing millions of logs with Logstash and integrating with Elasticsearch, Hadoop and Cassandra | Valentin Fischer-Mitoiu

Automated Debugging of Bad Deployments - USENIX · •Collect logs with Kafka and Elasticsearch+Logstash+Kibana (ELK Stack) ... Match stack trace line numbers with lines changed in

Deep Dive Into Elasticsearch

DETECTING ADVANCED THREATS WITH SYSMON, …...DETECTING ADVANCED THREATS WITH SYSMON, WEF, AND ELASTICSEARCH WHY EVENT LOGS? From an advanced threat detection perspective, most analysts

COMO MONTAR UMA INFRAESTRUTURA PARA MENOS DE 40 … · Elasticsearch logs Collect and parse logs created by HAProxy metrics Fetch from the HAProxy server. Kibana metrics Apache logs

Wrangling Logs With Logstash and ElasticSearch Presentation

Tuning Elasticsearch Indexing Pipeline for Logs

Got Logs? Get Answers with Elasticsearch ELK - PuppetConf 2014

50 shades of ElasticSearch · 50 shades of ElasticSearch Denis Ćutić @ Infobip. USE CASE. Event logs. Distributed processing. Multiple events per message. Additional information

Elasticsearch cluster deep dive

Using Kibana4 to read logs at Wikimedia...2016/11/14 · Using Kibana4 to read logs at Wikimedia Wikimedia Tech Talk, 2016-11-14 Elasticsearch Document oriented full text search engine

Meetup ElasticSearch : « Booster votre Magento avec Elasticsearch »

Indexing and Searching Logs with Elasticsearch/Solr by Radu Gheorghe from Sematext

Amazon Elasticsearch Service Security Deep Dive - AWS Online Tech Talks

DB2 „Deep dive into DB2“ - exstor.de · „Deep dive into DB2 ... Tuning for performance, scalability Memory and virtual memory manager Database reorganization DB2 logs db2diaglog

Dell EMC Search · View Elasticsearch logs.....123 View or change the Elasticsearch configuration.....124 Monitor the health of the Elasticsearch cluster..... 124 Insufficient memory

Elasticsearch como gerenciar seus logs com logstash e kibana

ELASTICSEARCH INTRODUCTION › content › download › 6739 › 115193 › file...ELASTICSEARCH INTRODUCTION Zagreb, 27.03.2015. Kristijan Duvnjak & Mladen Maravi ć Elasticsearch

WSDOT Underwater Inspection Form...Dive logs for each diver will be maintained at the site. Pre-dive briefing and checklist, equipment procedures and checklist, emergency procedures,