1 files changed, 159 insertions, 0 deletions
diff --git a/docs/safetynet.md b/docs/safetynet.md
new file mode 100644
index 0000000000..540f7db2eb
--- /dev/null
+++ b/docs/safetynet.md
@@ -0,0 +1,159 @@
+# SafetyNet - Performance regression detection for PDFium
+
+[TOC]
+
+This document explains how to use SafetyNet to detect performance regressions
+in PDFium.
+
+## Comparing performance of two versions of PDFium
+
+safetynet_compare.py is a script that compares the performance between two
+versions of pdfium. This can be used to verify if a given change has caused
+or will cause any positive or negative changes in performance for a set of test
+cases.
+
+The supported profilers are exclusive to Linux, so for now this can only be run
+on Linux.
+
+An illustrative example is below, comparing the local code version to an older
+version. Positive % changes mean an increase in time/instructions to run the
+test - a regression, while negative % changes mean a decrease in
+time/instructions, therefore an improvement.
+
+```
+$ testing/tools/safetynet_compare.py ~/test_pdfs --branch-before beef5e4
+================================================================================
+   % Change      Time after  Test case
+--------------------------------------------------------------------------------
+   -0.1980%  45,703,820,326  ~/test_pdfs/PDF Reference 1-7.pdf
+   -0.5678%      42,038,814  ~/test_pdfs/Page 24 - PDF Reference 1-7.pdf
+   +0.2666%  10,983,158,809  ~/test_pdfs/Rival.pdf
+   +0.0447%  10,413,890,748  ~/test_pdfs/dynamic.pdf
+   -7.7228%      26,161,171  ~/test_pdfs/encrypted1234.pdf
+   -0.2763%     102,084,398  ~/test_pdfs/ghost.pdf
+   -3.7005%  10,800,642,262  ~/test_pdfs/musician.pdf
+   -0.2266%  45,691,618,789  ~/test_pdfs/no_metadata.pdf
+   +1.4440%  38,442,606,162  ~/test_pdfs/test7.pdf
+   +0.0335%       9,286,083  ~/test_pdfs/testbulletpoint.pdf
+================================================================================
+Test cases run: 10
+Failed to measure: 0
+Regressions: 0
+Improvements: 2
+```
+
+### Usage
+
+Run the safetynet_compare.py script in testing/tools to perform a comparison.
+Pass one or more paths with test cases - each path can be either a .pdf file or
+a directory containing .pdf files. Other files in those directories are
+ignored.
+
+The following comparison modes are supported:
+
+1. Compare uncommitted changes against clean branch:
+```shell
+$ testing/tools/safetynet_compare.py path/to/pdfs
+```
+
+2. Compare current branch with another branch or commit:
+```shell
+$ testing/tools/safetynet_compare.py path/to/pdfs --branch-before another_branch
+$ testing/tools/safetynet_compare.py path/to/pdfs --branch-before 1a3c5e7
+```
+
+3. Compare two other branches or commits:
+```shell
+$ testing/tools/safetynet_compare.py path/to/pdfs --branch-after another_branch --branch-before yet_another_branch
+$ testing/tools/safetynet_compare.py path/to/pdfs --branch-after 1a3c5e7 --branch-before 0b2d4f6
+$ testing/tools/safetynet_compare.py path/to/pdfs --branch-after another_branch --branch-before 0b2d4f6
+```
+
+4. Compare two build flag configurations:
+```shell
+$ gn args out/BuildConfig1
+$ gn args out/BuildConfig2
+$ testing/tools/safetynet_compare.py path/to/pdfs --build-dir out/BuildConfig2 --build-dir-before out/BuildConfig1
+```
+
+safetynet_compare.py takes care of checking out the appropriate branch, building
+it, running the test cases and comparing results.
+
+### Profilers
+
+safetynet_compare.py uses callgrind as a profiler by default. Use --profiler
+to specify another one. The supported ones are:
+
+#### perfstat
+
+Only works on Linux.
+Make sure you have perf by typing in the terminal:
+```shell
+$ perf
+```
+
+This is a fast profiler, but uses sampling so it's slightly inaccurate.
+Expect variations of up to 1%, which is below the cutoff to consider a
+change significant.
+
+Use this when running over large test sets to get good enough results.
+
+#### callgrind
+
+Only works on Linux.
+Make sure valgrind is installed:
+```shell
+$ valgrind
+```
+
+This is a slow and accurate profiler. Expect variations of around 100
+instructions. However, this takes about 50 times longer to run than perf stat.
+
+Use this when looking for small variations (< 1%).
+
+One advantage is that callgrind can generate `callgrind.out` files (by passing
+--output-dir to safetynet_compare.py), which contain profiling information that
+can be analyzed to find the cause of a regression. KCachegrind is a good
+visualizer for these files.
+
+### Common Options
+
+Arguments commonly passed to safetynet_compare.py.
+
+* --profiler: described above.
+* --build-dir: this specified the build config with a relative path from the
+pdfium src directory to the build directory. Defaults to out/Release.
+* --output-dir: where to place the profiling output files. These are
+callgrind.out.[test_case] files for callgrind, perfstat does not produce them.
+By default they are not written.
+* --case-order: sort test case results according to this metric. Can be "after",
+"before", "ratio" and "rating". If not specified, sort by path.
+* --this-repo: use the repository where the script is instead of checking out a
+temporary one. This is faster and does not require downloads. Although it
+restores the state of the local repo, if the script is killed or crashes the
+uncommitted changes can remain stashed and you may be on another branch.
+
+### Other Options
+
+Most of the time these don't need to be used.
+
+* --build-dir-before: if comparing different build dirs (say, to test what a
+flag flip does), specify the build dir for the “before” branch here and the
+build dir for the “after” branch with --build-dir.
+* --interesting-section: only the interesting section should be measured instead
+of all the execution of the test harness. This only works in debug, since in
+release the delimiters are stripped out. This does not work to compare branches
+that don’t have the callgrind delimiters, as it would otherwise be unfair to
+compare a whole run vs the interesting section of another run.
+* --machine-readable: output a json with the results that is easier to read by
+code.
+* --num-workers: how many workers to use to parallelize test case runs. Defaults
+to # of CPUs in the machine.
+* --threshold-significant: highlight differences that exceed this value.
+Defaults to 0.02.
+* --tmp-dir: directory in which temporary repos will be cloned and downloads
+will be cached, if --this-repo is not enabled. Defaults to /tmp.
+
+## Setup a nightly job
+
+TODO: Complete with safetynet_job.py setup and usage.