Beyond Line Charts: Why Some Diversity in Telemetry Visualization Is Long Overdue

Summary

Disclaimer: This summary has been generated by AI. It is experimental, and feedback is welcomed. Please reach out to info@qconsf.com with any comments or concerns.

Beyond Line Charts: Why Some Diversity in Telemetry Visualization Is Long Overdue

The presentation by Yao Yue advocates for diversifying telemetry visualization techniques beyond the traditional line charts, which have dominated the industry due to their simplicity in visualizing service metrics. Yue, a seasoned platform engineer, argues that while line charts are effective for certain data types, they fall short when dealing with complex or noisy datasets, which are crucial for decision-making in real-time telemetry systems.

Key Points Discussed:

  • Limitations of Line Charts: Line charts are criticized for their inadequacy in handling busy or noisy data. They often rely on extrapolation, which can obscure the true variability of the data, creating misleading conclusions.
  • Importance of Data Shape: Understanding the data shape is crucial for choosing the right visualization. Busy datasets necessitate techniques to reduce noise, such as window operations, which can transform data into more comprehensible formats.
  • Telemetry Visualization Diversity: Yue emphasizes the need for trying different visualization styles tailored to the data's properties rather than defaulting to line charts, which often do not provide clear insights for operational questions.
  • Telemetry Data Typing: The presentation outlines three main types of telemetry data—counters, gauges, and histograms—and their respective characteristics that influence visualization choices.

Solution Approach:

  • Visualization Flexibility: Encourages experimenting with multiple visualization forms to find ones that best highlight the underlying data structure and trends.
  • Spotlight on Data Rawness: Suggests using plots like dot plots that emphasize raw data points over extrapolated data to provide a more honest representation of datasets.

The session highlights a broad systemic issue in telemetry visualization where reliance on line charts can lead to critical misinterpretations, proposing a shift towards a more varied and data-sensitive approach.

This is the end of the AI-generated content.


Abstract

For decades, visualization of service metrics overwhelmingly converges to line charts. The time-centric nature of real-time telemetry further cemented this phenomenon via storage layouts and domain-specific query languages.

But what questions do we, as service operators, ask or should ask? Can we find all the answers in line charts? Obviously my answer to that last question is no. I want to show you how some of the most valuable operational questions are much better served by presenting, and perhaps processing, telemetry data and metadata in alternative formats. 

It is easy for software developers to treat operations as an afterthought. But running software well is a serious engineering challenge in its own right. By taking a fresh and principled look at service telemetry, we can better appreciate both the breadth and depth of this fascinating problem that transcends programming languages, software/hardware abstractions, and application domains.
 


Speaker

Yao Yue

Founder & Chief Executive Officer @IOP Systems, Platform Engineer, Distributed System Aficionado, Cache Expert

Yao Yue is a platform engineer with specialties in caching, distributed systems, and performance engineering. She worked at Twitter for 12 years, first led the Cache Team and later created the Performance Team. After November 2022, she co-founded IOP Systems, a company that is working to improve software efficiency and reliability via smart performance engineering, with her teammates from Twitter.

Read more
Find Yao Yue at:

From the same track

Session Platform Engineering

Continuous Delivery for Foundational Platforms

Monday Nov 17 / 10:35AM PST

Platform teams frequently inherit systems that were never architected for their current scale, yet are so foundational that downtime can halt the business.

Speaker image - Ian Nowland

Ian Nowland

CEO @Junction Labs, Author of O'Reilly's Platform Engineering, Previously SVP Core Engineering at Datadog and Leader of AWS Nitro

Session

Microservices Platforms: When Team Topologies Meets Microservices Patterns

Monday Nov 17 / 01:35PM PST

When many teams work on a large, complex application, the microservice architecture potentially enables them to work independently and deliver a continuous stream of changes.

Speaker image - Chris Richardson

Chris Richardson

Creator of microservices.io, Java Champion, & Core Microservices Thoughtleader

Session Resilience

Enhancing Reliability Using Service-Level Prioritized Load Shedding at Netflix

Monday Nov 17 / 05:05PM PST

How does Netflix maintain a seamless viewing experience for millions of users, especially during traffic spikes or when backend datastores are overloaded? Autoscaling can help during traffic spikes, but it costs money, takes a few minutes to kick in, and capacity may not always be available.

Speaker image - Anirudh Mendiratta

Anirudh Mendiratta

Staff Software Engineer, Playback Lifecycle @Netflix, Previously @Amazon Prime Video and @fuboTV

Speaker image - Benjamin Fedorka

Benjamin Fedorka

Staff Software Engineer, Productivity Engineering @Netflix

Session

Platform Engineering: Lessons from the Rise and Fall of eBay Velocity

Monday Nov 17 / 03:55PM PST

Once a stock market darling and a pioneering hyperscaler in the 1990s and early 2000s, eBay has been in steady decline since the 2010s. A household name with a flat business, eBay has been unable to make substantive strides in its market reach or its engineering outcomes in the last 15 years.

Speaker image - Randy Shoup

Randy Shoup

SVP Engineering @Thrive Market, Previously @eBay, @Google, @Stitch Fix

Session

Unconference: Modern Platform Engineering and Dev Enablement

Monday Nov 17 / 02:45PM PST