Documentation/dev-tools/ktap.rst

1638Srgrimes.. SPDX-License-Identifier: GPL-2.0
1638Srgrimes
1638Srgrimes===================================================
1638SrgrimesThe Kernel Test Anything Protocol (KTAP), version 1
1638Srgrimes===================================================
1638Srgrimes
1638SrgrimesTAP, or the Test Anything Protocol is a format for specifying test results used
1638Srgrimesby a number of projects. It's website and specification are found at this `link
1638Srgrimes<https://testanything.org/>`_. The Linux Kernel largely uses TAP output for test
1638Srgrimesresults. However, Kernel testing frameworks have special needs for test results
1638Srgrimeswhich don't align with the original TAP specification. Thus, a "Kernel TAP"
1638Srgrimes(KTAP) format is specified to extend and alter TAP to support these use-cases.
1638SrgrimesThis specification describes the generally accepted format of KTAP as it is
1638Srgrimescurrently used in the kernel.
1638Srgrimes
1638SrgrimesKTAP test results describe a series of tests (which may be nested: i.e., test
1638Srgrimescan have subtests), each of which can contain both diagnostic data -- e.g., log
1638Srgrimeslines -- and a final result. The test structure and results are
1638Srgrimesmachine-readable, whereas the diagnostic data is unstructured and is there to
1638Srgrimesaid human debugging.
1638Srgrimes
1638SrgrimesKTAP output is built from four different types of lines:
1638Srgrimes- Version lines
1638Srgrimes- Plan lines
1638Srgrimes- Test case result lines
1638Srgrimes- Diagnostic lines
1638Srgrimes
1638SrgrimesIn general, valid KTAP output should also form valid TAP output, but some
1638Srgrimesinformation, in particular nested test results, may be lost. Also note that
1638Srgrimesthere is a stagnant draft specification for TAP14, KTAP diverges from this in
1638Srgrimesa couple of places (notably the "Subtest" header), which are described where
1638Srgrimesrelevant later in this document.
50476Speter
1638SrgrimesVersion lines
1638Srgrimes-------------
1638Srgrimes
79538SruAll KTAP-formatted results begin with a "version line" which specifies which
1638Srgrimesversion of the (K)TAP standard the result is compliant with.
1638Srgrimes
1638SrgrimesFor example:
1638Srgrimes- "KTAP version 1"
1638Srgrimes- "TAP version 13"
84306Sru- "TAP version 14"
1638Srgrimes
1638SrgrimesNote that, in KTAP, subtests also begin with a version line, which denotes the
1638Srgrimesstart of the nested test results. This differs from TAP14, which uses a
1638Srgrimesseparate "Subtest" line.
1638Srgrimes
1638SrgrimesWhile, going forward, "KTAP version 1" should be used by compliant tests, it
1638Srgrimesis expected that most parsers and other tooling will accept the other versions
1638Srgrimeslisted here for compatibility with existing tests and frameworks.
1638Srgrimes
1638SrgrimesPlan lines
13744Smpp----------
79727Sschweikh
1638SrgrimesA test plan provides the number of tests (or subtests) in the KTAP output.
107788Sru
1638SrgrimesPlan lines must follow the format of "1..N" where N is the number of tests or subtests.
1638SrgrimesPlan lines follow version lines to indicate the number of nested tests.
1638Srgrimes
1638SrgrimesWhile there are cases where the number of tests is not known in advance -- in
1638Srgrimeswhich case the test plan may be omitted -- it is strongly recommended one is
1638Srgrimespresent where possible.
1638Srgrimes
70466SruTest case result lines
1638Srgrimes----------------------
1638Srgrimes
1638SrgrimesTest case result lines indicate the final status of a test.
1638SrgrimesThey are required and must have the format:
1638Srgrimes
1638Srgrimes.. code-block:: none
1638Srgrimes
1638Srgrimes	<result> <number> [<description>][ # [<directive>] [<diagnostic data>]]
1638Srgrimes
107788SruThe result can be either "ok", which indicates the test case passed,
1638Srgrimesor "not ok", which indicates that the test case failed.
1638Srgrimes
15082Smpp<number> represents the number of the test being performed. The first test must
1638Srgrimeshave the number 1 and the number then must increase by 1 for each additional
1638Srgrimessubtest within the same test at the same nesting level.
1638Srgrimes
1638SrgrimesThe description is a description of the test, generally the name of
119964Sruthe test, and can be any string of characters other than # or a
33780Sbdenewline.  The description is optional, but recommended.
1638Srgrimes
33780SbdeThe directive and any diagnostic data is optional. If either are present, they
33780Sbdemust follow a hash sign, "#".
1638Srgrimes
55466SbdeA directive is a keyword that indicates a different outcome for a test other
55466Sbdethan passed and failed. The directive is optional, and consists of a single
1638Srgrimeskeyword preceding the diagnostic data. In the event that a parser encounters
33780Sbdea directive it doesn't support, it should fall back to the "ok" / "not ok"
33780Sbderesult.
33780Sbde
33780SbdeCurrently accepted directives are:
33780Sbde
33780Sbde- "SKIP", which indicates a test was skipped (note the result of the test case
33780Sbde  result line can be either "ok" or "not ok" if the SKIP directive is used)
33780Sbde- "TODO", which indicates that a test is not expected to pass at the moment,
33780Sbde  e.g. because the feature it is testing is known to be broken. While this
1638Srgrimes  directive is inherited from TAP, its use in the kernel is discouraged.
1638Srgrimes- "XFAIL", which indicates that a test is expected to fail. This is similar
55466Sbde  to "TODO", above, and is used by some kselftest tests.
55466Sbde- ���TIMEOUT���, which indicates a test has timed out (note the result of the test
55466Sbde  case result line should be ���not ok��� if the TIMEOUT directive is used)
55466Sbde- ���ERROR���, which indicates that the execution of a test has failed due to a
1638Srgrimes  specific error that is included in the diagnostic data. (note the result of
33780Sbde  the test case result line should be ���not ok��� if the ERROR directive is used)
1638Srgrimes
33780SbdeThe diagnostic data is a plain-text field which contains any additional details
33780Sbdeabout why this result was produced. This is typically an error message for ERROR
1638Srgrimesor failed tests, or a description of missing dependencies for a SKIP result.
1638Srgrimes
1638SrgrimesThe diagnostic data field is optional, and results which have neither a
22056Smppdirective nor any diagnostic data do not need to include the "#" field
22056Smppseparator.
22056Smpp
33780SbdeExample result lines include::
33780Sbde
33780Sbde	ok 1 test_case_name
33780Sbde
33780SbdeThe test "test_case_name" passed.
33780Sbde
33780Sbde::
33780Sbde
33780Sbde	not ok 1 test_case_name
22056Smpp
33780SbdeThe test "test_case_name" failed.
33780Sbde
33780Sbde::
33780Sbde
33780Sbde	ok 1 test # SKIP necessary dependency unavailable
1638Srgrimes
33780SbdeThe test "test" was SKIPPED with the diagnostic message "necessary dependency
33780Sbdeunavailable".
33780Sbde
33780Sbde::
33780Sbde
33780Sbde	not ok 1 test # TIMEOUT 30 seconds
33780Sbde
33780SbdeThe test "test" timed out, with diagnostic data "30 seconds".
1638Srgrimes
55466Sbde::
33780Sbde
1638Srgrimes	ok 5 check return code # rcode=0
1638Srgrimes
33780SbdeThe test "check return code" passed, with additional diagnostic data ���rcode=0���
1638Srgrimes
1638Srgrimes
18480SwoschDiagnostic lines
1638Srgrimes----------------
1638Srgrimes
1638SrgrimesIf tests wish to output any further information, they should do so using
1638Srgrimes"diagnostic lines". Diagnostic lines are optional, freeform text, and are
1638Srgrimesoften used to describe what is being tested and any intermediate results in
1638Srgrimesmore detail than the final result and diagnostic data line provides.
140561Sru
140561SruDiagnostic lines are formatted as "# <diagnostic_description>", where the
140561Srudescription can be any string.  Diagnostic lines can be anywhere in the test
140561Sruoutput. As a rule, diagnostic lines regarding a test are directly before the
test result line for that test.

Note that most tools will treat unknown lines (see below) as diagnostic lines,
even if they do not start with a "#": this is to capture any other useful
kernel output which may help debug the test. It is nevertheless recommended
that tests always prefix any diagnostic output they have with a "#" character.

Unknown lines
-------------

There may be lines within KTAP output that do not follow the format of one of
the four formats for lines described above. This is allowed, however, they will
not influence the status of the tests.

This is an important difference from TAP.  Kernel tests may print messages
to the system console or a log file.  Both of these destinations may contain
messages either from unrelated kernel or userspace activity, or kernel
messages from non-test code that is invoked by the test.  The kernel code
invoked by the test likely is not aware that a test is in progress and
thus can not print the message as a diagnostic message.

Nested tests
------------

In KTAP, tests can be nested. This is done by having a test include within its
output an entire set of KTAP-formatted results. This can be used to categorize
and group related tests, or to split out different results from the same test.

The "parent" test's result should consist of all of its subtests' results,
starting with another KTAP version line and test plan, and end with the overall
result. If one of the subtests fail, for example, the parent test should also
fail.

Additionally, all lines in a subtest should be indented. One level of
indentation is two spaces: "  ". The indentation should begin at the version
line and should end before the parent test's result line.

"Unknown lines" are not considered to be lines in a subtest and thus are
allowed to be either indented or not indented.

An example of a test with two nested subtests:

::

	KTAP version 1
	1..1
	  KTAP version 1
	  1..2
	  ok 1 test_1
	  not ok 2 test_2
	# example failed
	not ok 1 example

An example format with multiple levels of nested testing:

::

	KTAP version 1
	1..2
	  KTAP version 1
	  1..2
	    KTAP version 1
	    1..2
	    not ok 1 test_1
	    ok 2 test_2
	  not ok 1 test_3
	  ok 2 test_4 # SKIP
	not ok 1 example_test_1
	ok 2 example_test_2


Major differences between TAP and KTAP
--------------------------------------

==================================================   =========  ===============
Feature                                              TAP        KTAP
==================================================   =========  ===============
yaml and json in diagnosic message                   ok         not recommended
TODO directive                                       ok         not recognized
allows an arbitrary number of tests to be nested     no         yes
"Unknown lines" are in category of "Anything else"   yes        no
"Unknown lines" are                                  incorrect  allowed
==================================================   =========  ===============

The TAP14 specification does permit nested tests, but instead of using another
nested version line, uses a line of the form
"Subtest: <name>" where <name> is the name of the parent test.

Example KTAP output
--------------------
::

	KTAP version 1
	1..1
	  KTAP version 1
	  1..3
	    KTAP version 1
	    1..1
	    # test_1: initializing test_1
	    ok 1 test_1
	  ok 1 example_test_1
	    KTAP version 1
	    1..2
	    ok 1 test_1 # SKIP test_1 skipped
	    ok 2 test_2
	  ok 2 example_test_2
	    KTAP version 1
	    1..3
	    ok 1 test_1
	    # test_2: FAIL
	    not ok 2 test_2
	    ok 3 test_3 # SKIP test_3 skipped
	  not ok 3 example_test_3
	not ok 1 main_test

This output defines the following hierarchy:

A single test called "main_test", which fails, and has three subtests:
- "example_test_1", which passes, and has one subtest:

   - "test_1", which passes, and outputs the diagnostic message "test_1: initializing test_1"

- "example_test_2", which passes, and has two subtests:

   - "test_1", which is skipped, with the explanation "test_1 skipped"
   - "test_2", which passes

- "example_test_3", which fails, and has three subtests

   - "test_1", which passes
   - "test_2", which outputs the diagnostic line "test_2: FAIL", and fails.
   - "test_3", which is skipped with the explanation "test_3 skipped"

Note that the individual subtests with the same names do not conflict, as they
are found in different parent tests. This output also exhibits some sensible
rules for "bubbling up" test results: a test fails if any of its subtests fail.
Skipped tests do not affect the result of the parent test (though it often
makes sense for a test to be marked skipped if _all_ of its subtests have been
skipped).

See also:
---------

- The TAP specification:
  https://testanything.org/tap-version-13-specification.html
- The (stagnant) TAP version 14 specification:
  https://github.com/TestAnything/Specification/blob/tap-14-specification/specification.md
- The kselftest documentation:
  Documentation/dev-tools/kselftest.rst
- The KUnit documentation:
  Documentation/dev-tools/kunit/index.rst