I'm of a similar mind. However, somewhat recently I came across this article which helped provide a framework to think about all these things. Turns out that it's not just a flat space of competing tools:
- Some of those are not in-tree, like LTTng and SystemTap.
- Tracepoints, kprobes, events, and uprobes are all event libraries used by perf or ftrace, just like DTrace had multiple providers (fbt, pid, etc).
The real fragmentation is perf and ftrace, since both are in-tree front ends. That's not too bad, and they both have different strengths.
eBPF is weird in that it's neither an event library nor a front end; it provides programmatic capabilities. We're mostly using an out-of-tree project, bcc, to run it.
What's fragmented about it? Almost everything he showed there was a script that uses ftrace, kprobes, or BPF to measure something specific. Since those are all available in the kernel at the same time, you can certainly think of them as a single API.
I think lttng has kernel tracing. I don't know why the fragmentation you describe is "bad", though; it really depends on the tools themselves.
If there's just a variety of tools for the same task, then that's healthy competition and how you get better software.
If no single tool can fulfill all your tracing needs, that's still not necessarily a condemnation of the tools. It's entirely possible that each tool can complete a subset of tasks, but is significantly simpler to use as a result, so SUM(effort to learn tools you need) may still be comparable to the effort of a theoretical omni-tool.
He didn't mention this in this snippet, but the BCC (BPF Compiler Collection) intends to make this much simpler[1]. In particular it lets you write a tracer in Python (with the BPF program written in C) that attaches the BPF program to whatever types of probe points you like. So while internally there might be all this fragmentation a user shouldn't have to deal with it as much.
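To make that concrete, here's a minimal sketch of what a BCC-style tracer looks like: the BPF program is written in C inside a Python string, and the Python side compiles and attaches it. The example (counting `openat` syscalls per PID) is my own, not from the talk, and actually running it assumes a Linux box with bcc installed and root privileges:

```python
# Sketch of a BCC tracer: BPF program in C, driver in Python.
# Running attach_and_trace() requires bcc installed and root on Linux.
bpf_text = r"""
#include <uapi/linux/ptrace.h>

BPF_HASH(counts, u64);

int count_open(struct pt_regs *ctx) {
    u64 pid = bpf_get_current_pid_tgid() >> 32;
    counts.increment(pid);   // bcc map helper: per-PID counter
    return 0;
}
"""

def attach_and_trace(seconds=10):
    from bcc import BPF  # imported here so the sketch parses without bcc
    import time
    b = BPF(text=bpf_text)
    # Attach the C function above to a kprobe; uprobes and tracepoints
    # are attached the same way from the same program.
    b.attach_kprobe(event=b.get_syscall_fnname("openat"),
                    fn_name="count_open")
    time.sleep(seconds)
    for pid, count in b["counts"].items():
        print(pid.value, count.value)
```

The point is that the user writes one small script and lets bcc deal with which kernel mechanism (kprobe, uprobe, tracepoint) sits underneath.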
Brendan used to be Mr. DTrace User. (Not Mr. DTrace -- that was bmc, ahl, and mws.) But the world isn't using Solaris or FreeBSD, so I guess he moved on like most of the rest of us Solaris diaspora. Still, every time I see one of Brendan's blogs I know, deep down, he must miss DTrace; I sure do. This video doesn't help me feel at home with Linux, but it's a resource for when I need to trace something. Mostly though, when I have to debug something on Linux, I do it the pre-DTrace way, which is to say: the hard way.
The good (and bad) thing about all of the technologies that make up "containers" on Linux is that they can be used by separate projects. Chromium uses seccomp, systemd uses namespaces and cgroups, a bunch of tools use AppArmor/SELinux.
But ultimately the reason that this is the current state is because of how Linux is developed. Trying to push something like Jails or Zones is an exercise in futility because the patchset would be too large, would touch everything, and the infrastructure would likely not be reusable by other people.
It's like playing a video game, I'd imagine. The sound effects are the bane of everyone's existence but your own. Seriously though, you don't play your _game boy_ in public without headphones; let alone present while playing one! I'm amazed -- so, so tempted to turn off the preso even though I was fascinated by the content.
BPF really seems nice. The question for me, though, is: if I were willing to pay a few percent overhead on all my production instances, what would I be able to monitor 24/7 that would return the investment? I haven't found much writing in that area.
Seems like there could be a lot of opportunity, hopefully I'll get a chance to dive in and find out myself.
It should be something like: look at your bottlenecks and utilization, look at your costs, look at (cost effective) ways to reduce or remove those bottlenecks or that utilization. Pick the cheapest place to have a bottleneck. Using SSD at an extra $30 a month lets you use half the CPU and RAM, saving $60 a month? Go for it.
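The arithmetic in that trade-off can be sketched in a couple of lines. The $30/$60 figures come from the comment above; the baseline numbers are hypothetical placeholders:

```python
# Toy cost comparison: pay $30/month extra for SSD, halve the CPU/RAM bill.
def monthly_cost(cpu_ram, storage):
    return cpu_ram + storage

baseline = monthly_cost(cpu_ram=120, storage=10)      # hypothetical baseline
with_ssd = monthly_cost(cpu_ram=60, storage=10 + 30)  # half CPU/RAM, +$30 SSD

print(baseline - with_ssd)  # net savings per month: 30
```

The point is just to compare total monthly cost per configuration, not per component: a more expensive part is fine if it makes the whole cheaper.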
I feel like despite all this progress, sysdig is still the most accessible solution at the moment. It even includes a slowish, but super simple way of tracing user space (you can even write traces from bash scripts). I wish there was a built-in Linux equivalent.
perf with source annotation is pretty nice if you're profiling for individual hotspots. But I have not found any solution that lets me spot Amdahl bottlenecks, which get drowned out in raw cycles spent by the parallel parts. In Java this is trivial with thread utilization timelines that incorporate sampling.
Maybe this could be solved by weighting samples by the inverse of number of running threads at the time
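That weighting idea can be sketched in a few lines. The sample data here is invented; a real implementation would take stacks and runnable-thread counts from the profiler:

```python
# Sketch: weight each stack sample by 1 / (runnable threads at sample time),
# so serial phases stand out even when parallel phases dominate raw counts.
from collections import defaultdict

samples = [
    # (stack frame, runnable threads at the moment of the sample)
    ("parallel_work", 8),
    ("parallel_work", 8),
    ("parallel_work", 8),
    ("parallel_work", 8),
    ("serial_merge", 1),
]

raw = defaultdict(int)
weighted = defaultdict(float)
for frame, runnable in samples:
    raw[frame] += 1
    weighted[frame] += 1.0 / runnable

print(dict(raw))       # parallel_work dominates by raw sample count (4 vs 1)
print(dict(weighted))  # serial_merge dominates once weighted (1.0 vs 0.5)
```

A weighted profile like this approximates "wall-clock time attributable to this frame when it was on the critical path", which is exactly what raw cycle counts hide.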
BPF was used to do the aggregation and calculation in-kernel. You still need ftrace to actually run the BPF program in that context. You can read the cover page for the patch that added this in 2015[1].
Right; plus there's some capabilities where ftrace is (and maybe always will be) better. E.g., function counting: ftrace can count all kernel functions instantly (try my perf-tools funccount tool), whereas the BPF method involves setting a kprobe on everything, which takes much longer (setup and tear down). And function graph tracing from ftrace will likely be better than anything we can do in BPF (as it also relies on tracing all functions).
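For reference, the ftrace function profiler mentioned here is driven entirely through tracefs knobs. A rough sketch of the knob sequence (requires root; the tracefs mount point and the `tcp_*` filter are assumptions for illustration):

```shell
# Count kernel function calls with ftrace's built-in function profiler.
TRACEFS=/sys/kernel/tracing               # tracefs mount point (assumed)
echo nop > $TRACEFS/current_tracer        # reset any active tracer
echo 'tcp_*' > $TRACEFS/set_ftrace_filter # limit to functions of interest
echo 1 > $TRACEFS/function_profile_enabled
sleep 10                                  # let it count for a while
echo 0 > $TRACEFS/function_profile_enabled
head $TRACEFS/trace_stat/function0        # per-CPU hit counts and timings
```

Because ftrace instruments every filtered function through its static mcount hooks, enabling this is near-instant; there is no per-function kprobe registration step.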
Five years ago I tried to make some sense of this by researching all of the existing technologies. In the kernel I found:
Now apparently we can add more to that list, and in user-space we have more still: this talk seems to add a bunch of other fragmented user-space tools. I don't mean to put down anybody's work, but this stuff will never be user-friendly as long as it remains so fragmented, IMHO.