Measuring Function Duration with FTrace
Contents
Author
Tim Bird
Sony Corporation of America
tim dot bird (at) am dot sony dot com
Abstract
FTrace is a relatively new kernel tool for tracing function execution in the Linux kernel. Recently, FTrace supports the ability to trace function exit in addition to function entry. This allows for measurement of function duration, which adds an incredibly powerful tool for finding time-consuming areas of kernel execution. In this paper, the current state of the art for measuring function duration with FTrace is described. This includes recent work to add a new capability to filter the trace data by function duration, and tools for analyzing kernel function call graphs and visualizing kernel boot time execution.
Introduction
Overview of FTrace operation
instrumentation at compile time
- instrumentation of the code using -pg
- insertion of calls in each function prologue to 'mcount'
- architecture-specific mcount routine
- new mcount routine (_gnu_mcount) for ARM??
- must use frame-pointers on ARM (can't use -fomit-frame-pointers)
- Not sure it works with EABI (but assembly analysis indicates that all mcount callers have well-defined frames)
- ability to manually instrument with markers??
- out of scope
trace setup at run-time
- pseudo file system
- selecting a tracer
- trace functions
- stub, others
- tracing parameters
- trace functions
- selecting a tracer
- collecting trace data
- cat
tracing operation
- ring buffer
- generic mechanism for buffering time-stamped events, per-cpu
- no other details here.
- trace functions
- ring operations
- reservation, data fillout, commit?
- ring operations
function duration tracer
- Measuring function duration
- requires instrumenting function exit
- option to use -finstrument-functions, which instruments both function entry and function exit
- conflicts with mechanisms for dynamic ftrace instrumentation
- option to use -finstrument-functions, which instruments both function entry and function exit
- hooking function exit using only mcount
- implement a trampoline
- details of the ARM trampoline
- requires instrumenting function exit
measuring startup
Using FTrace
- example of use
interpreting the results
- cat ftrace
- trace sample
== performance issues
- overhead of tracing
- mention of dynamic tracing (not detailed here)
- benchmark with and without function tracing?
Things to do/add
- ARM exit tracing - done (need to test, see below)
- better clock on ARM
- duration filtering
- kd for analysis (port kd)
- use in early boot
- send patch for ARM exit tracing
- find out if all trace data is collected correctly
- ftrace data on ARM seems to be missing a lot of functions
- how to determine if functions are missing?
- port KFT to 2.6.29, and use to check ftrace data collection?
- how to determine if functions are missing?
- ftrace data on ARM seems to be missing a lot of functions
- get a better clock routine.
- Current on has max resolution of 31 microseconds.
- what is resolution of KFT clock on ARM?
- measure early startup
- determine how early the clock is available
- use duration filtering
- get latest ftrace code, and apply patches for filtering
- see if duration filtering is available