Lighthouse is a code coverage plugin for IDA Pro. The plugin leverages IDA as a platform to map, explore, and visualize externally collected code coverage data when symbols or source may not be available for a given binary.
This plugin is labeled only as a prototype and IDA / Qt code example for the community.
Batch load can quickly aggregate hundreds, possibly thousands, of collected coverage files into a single composite at load time.
Coverage Painting
Lighthouse 'paints' the active coverage data across the three major IDA views as applicable: the Disassembly, Graph, and Pseudocode views.
Coverage Overview
The Coverage Overview is a dockable widget that provides a function level view of the active coverage data for the database.
This table can be sorted by column, and entries can be double-clicked to jump to their corresponding disassembly.
Coverage Composition
Building relationships between multiple sets of coverage data often distills deeper meaning than the individual sets provide on their own. The shell at the bottom of the Coverage Overview provides an interactive means of constructing these relationships.
Pressing Enter in the shell will evaluate and save a user-constructed composition.
Composition Syntax
Coverage composition, or composing, is achieved through a simple expression grammar and 'shorthand' coverage symbols (A to Z) on the composing shell.
Grammar Tokens
Logical Operators: |, &, ^, -
Coverage Symbol: A, B, C, ..., Z
Coverage Range: A,C, Q,Z, ...
Parentheses: (...)
Example Compositions
A & B
(A & B) | C
(C & (A - B)) | (F,H & Q)
The evaluation of the composition may occur right to left, so parentheses are suggested for potentially ambiguous expressions.
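To make the grammar concrete, here is a minimal sketch of how such a composition could be evaluated with Python sets. It is not Lighthouse's actual parser: the `tokenize` and `evaluate` names are hypothetical, each coverage set is assumed to be a plain set of covered addresses, a range such as F,H is assumed to mean the union of every loaded symbol from F through H, and all operators are assumed to share one precedence level and evaluate right to left.

```python
import re
from typing import Dict, List, Optional, Set

# Map each operator token to the equivalent Python set operation.
OPERATORS = {
    "|": set.union,
    "&": set.intersection,
    "^": set.symmetric_difference,
    "-": set.difference,
}

def tokenize(text: str) -> List[str]:
    """Split a composition into symbol, operator, comma, and parenthesis tokens."""
    compact = re.sub(r"\s+", "", text.upper())
    tokens = re.findall(r"[A-Z]|[|&^\-(),]", compact)
    if "".join(tokens) != compact:
        raise ValueError(f"unexpected character in {text!r}")
    return tokens

def evaluate(text: str, coverage: Dict[str, Set[int]]) -> Set[int]:
    """Evaluate a composition against coverage sets keyed by symbol (A to Z)."""
    tokens = tokenize(text)
    pos = 0

    def peek() -> Optional[str]:
        return tokens[pos] if pos < len(tokens) else None

    def take() -> str:
        nonlocal pos
        token = peek()
        if token is None:
            raise ValueError("unexpected end of composition")
        pos += 1
        return token

    def parse_term() -> Set[int]:
        token = take()
        if token == "(":
            result = parse_expr()
            if take() != ")":
                raise ValueError("expected ')'")
            return result
        if not token.isalpha():
            raise ValueError(f"expected a coverage symbol, got {token!r}")
        if peek() == ",":
            take()
            end = take()
            if not end.isalpha():
                raise ValueError(f"expected a coverage symbol, got {end!r}")
            # Assumption: a range is the union of all loaded symbols it spans.
            merged: Set[int] = set()
            for code in range(ord(token), ord(end) + 1):
                merged |= coverage.get(chr(code), set())
            return merged
        return set(coverage[token])

    def parse_expr() -> Set[int]:
        left = parse_term()
        if peek() in OPERATORS:
            operation = OPERATORS[take()]
            # Right to left: everything to the right is evaluated first.
            return operation(left, parse_expr())
        return left

    result = parse_expr()
    if peek() is not None:
        raise ValueError(f"unexpected token {peek()!r}")
    return result

# Hypothetical coverage sets, keyed by their shorthand symbol.
coverage = {
    "A": {1, 2, 3}, "B": {2, 3, 4}, "C": {1, 5},
    "F": {10}, "G": {11}, "H": {12}, "Q": {11, 12, 13},
}
print(sorted(evaluate("(C & (A - B)) | (F,H & Q)", coverage)))
```

Under these assumptions `A - B - A` is read as `A - (B - A)`, which differs from `(A - B) - A`. That is the kind of ambiguity the parentheses are meant to remove.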
Hot Shell
Additionally, there is a 'Hot Shell' mode that asynchronously evaluates and caches user compositions in real-time.
The hot shell serves as a natural gateway into the unguided exploration of composed relationships.
Search
Using the shell, one can search and filter the functions listed in the coverage table by prefixing their query with /.
The head of the shell will show an updated coverage % computed only from the remaining functions. This is useful when analyzing coverage for specific function families.
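The idea can be sketched as follows. This is an illustration rather than Lighthouse's implementation: the `FunctionCoverage` record, the `search` helper, the case-insensitive substring match, and the choice to weight the percentage by instruction count are all assumptions.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class FunctionCoverage:
    name: str
    instructions: int  # total instructions in the function
    covered: int       # instructions hit by the active coverage

def search(query: str, table: List[FunctionCoverage]) -> Tuple[List[FunctionCoverage], float]:
    """Filter the table by a '/'-prefixed query and recompute the coverage %."""
    if not query.startswith("/"):
        raise ValueError("search queries start with '/'")
    needle = query[1:].strip().lower()
    remaining = [f for f in table if needle in f.name.lower()]
    total = sum(f.instructions for f in remaining)
    hit = sum(f.covered for f in remaining)
    # Assumption: the percentage is weighted by instruction count.
    percent = 100.0 * hit / total if total else 0.0
    return remaining, percent

# Hypothetical table entries.
table = [
    FunctionCoverage("png_read_row", 200, 150),
    FunctionCoverage("png_write_row", 100, 0),
    FunctionCoverage("main", 50, 50),
]
remaining, percent = search("/png", table)
print([f.name for f in remaining], percent)
```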
Jump
Entering an address or function name into the shell jumps to the corresponding function entry in the table.
Coverage ComboBox
Loaded coverage data and user-constructed compositions can be selected or deleted through the coverage combobox.
Collecting Coverage
Before using Lighthouse, one will need to collect code coverage data for their target binary / application.
The examples below demonstrate how one can use DynamoRIO or Intel Pin to collect Lighthouse-compatible coverage against a target. The .log files produced by these instrumentation tools can be loaded directly into Lighthouse.
DynamoRIO
Code coverage data can be collected via DynamoRIO's drcov code coverage module.
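As a rough sketch, a drcov run can be scripted from Python by launching `drrun` with the drcov tool selected. The `drcov_command` and `run_drcov` helpers are hypothetical, and the location of `drrun.exe` depends on where DynamoRIO is installed; `boombox.exe` is reused from the example in this document as the target.

```python
import subprocess
from typing import List

def drcov_command(drrun: str, target: List[str]) -> List[str]:
    """Build a drrun command line that runs the target under the drcov tool."""
    return [drrun, "-t", "drcov", "--", *target]

def run_drcov(drrun: str, target: List[str]) -> None:
    """Run the target under drcov; requires DynamoRIO on the machine."""
    subprocess.run(drcov_command(drrun, target), check=True)

# Assumption: drrun.exe is reachable on PATH or given as a full path.
cmd = drcov_command("drrun.exe", ["boombox.exe"])
print(" ".join(cmd))
```

The printed command line is what one would otherwise type into a terminal by hand.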
Intel Pin
Using a custom pintool contributed by Agustin Gianni, the Intel Pin DBI can also be used to collect coverage data.
Example usage:
pin.exe -t CodeCoverage64.dll -- boombox.exe
For convenience, binaries for the Windows pintool can be found on the releases page. macOS and Linux users need to compile the pintool themselves, following the instructions included with the pintool for their respective platforms.
Future Work
Time and motivation permitting, future work may include:
Profiling based heatmaps/painting
Coverage & Profiling Treemaps
Additional coverage sources, trace formats, etc.
Improved Pseudocode painting
I welcome external contributions, issues, and feature requests.