Log, monitoring and QoS platform of a large
scale servicePhan Huy Hoàng – Lead Engineer
Web Mobile – Zing
Agenda
• 1/ Quality of Service Platform: Why we need pay attention to QoS
• 2/ Logging: Handling lots of data
• 3/ Monitoring
• 4/ Questions & Answers
Quality of Service PlatformWhy we need to pay attention to QoS ?
Why we need QoS?
• How your app actually works in real-life ?
• Users are using which functions ? Chat, social, search nearby… ?
• Do those functions even work at all ?
• If it does work, how good, in which environment ?
A few numbers
• Lots of data
• 3M users
• 28M messages chat/day
• 600M million events/day
Too much data
Logging Handling lots of data
Logging Flow v1
Logging Flow v2
Monitoring
Charting
• Live data
• Trends data
Dashboard
• Simplified your life
• Concentrate on drastic change
Anomaly data points detection
• How to deal with stuff like this ?• Too many data point deviated from normal deviation
should trigger an alert
• You will get a lot of false positive
What’s next ?
• Sending alert• By Zalo, SMS, email
• Happy life
Questions?