Network Working Group                                          J. Postel
Request for Comments: 1025                                           ISI
                                                          September 1987

TCP AND IP BAKE OFF

Status of This Memo

   This memo describes some of the procedures, scoring, and tests used
   in the TCP and IP bake offs held in the early development of these
   protocols.  These procedures and tests may still be of use in testing
   newly implemented TCP and IP modules.  Distribution of this memo is
   unlimited.

Introduction

   In the early days of the development of TCP and IP, when there were
   very few implementations and the specifications were still evolving,
   the only way to determine if an implementation was "correct" was to
   test it against other implementations and argue that the results
   showed your own implementation to have done the right thing.  These
   tests and discussions could, in those early days, as likely change
   the specification as change the implementation.

   There were a few times when this testing was focused, bringing
   together all known implementations and running through a set of tests
   in hopes of demonstrating the N squared connectivity and correct
   implementation of the various tricky cases.  These events were called
   "Bake Offs".

   An early version of the list of tests included here appears in IEN-69
   of October 1978.  A demonstration of four TCP implementations was
   held at the Defense Communication Engineering Center in Reston,
   Virginia on 4 December 1978, and reported in IEN-70 of December 1978.
   A bake off of six implementations was held 27-28 January 1979 at
   USC-Information Sciences Institute in Marina del Rey, California and
   reported in IEN-77 of February 1979.  And a distributed bake off was
   held in April 1980 over the network and reported in IEN-145 of May
   1980.

   The following section reproduces (with very slight editing) the
   procedure, tests, and scoring of the April 1980 Bake Off.

Procedure

   This is the procedure for the TCP and IP Bake Off.  Each implementor
   of a TCP and IP is to perform the following tests and to report the
   results.  In general, this is done by using a test program or user
   Telnet program to open connections to your own or other TCP
   implementations.

   Some test are made more interesting by the use of a "flakeway".  A
   flakeway is a purposely flakey gateway.  It should have control
   parameters that can be adjusted while it is running to specify a
   percentage of datagrams to be dropped, a percentage of datagrams to
   be corrupted and passed on, and a percentage of datagrams to be
   reordered so that they arrive in a different order than sent.

   Many of the following apply for each distinct TCP contacted (for
   example, in the Middleweight Division there is a possibility of 20
   points for each other TCP in the Bake Off).

   Note Bene: Checksums must be enforced.  No points will be awarded if
   the checksum test is disabled.

      Featherweight Division

1 point for talking to yourself (opening a connection).

         1 point for saying something to yourself (sending and receiving
         data).

         1 point for gracefully ending the conversation (closing the
         connection without crashing).

2 points for repeating the above without reinitializing the
TCP.

5 points for a complete conversation via the testing gateway.

      Middleweight Division

         2 points for talking to someone else (opening a connection).

         2 points for saying something to someone else (sending and
         receiving data).

         2 points for gracefully ending the conversation (closing the
         connection without crashing).

4 points for repeating the above without reinitializing the
TCP.

10 points for a complete conversation via the testing gateway.

      Heavyweight Division

         10 points for being able to talk to more than one other TCP at
         the same time (multiple connections open and active
         simultaneously with different TCPs).

         10 points for correctly handling urgent data.

         10 points for correctly handling sequence number wraparound.

         10 points for correctly being able to process a "Kamikaze"
         packet (AKA nastygram, christmas tree packet, lamp test
         segment, et al.).  That is, correctly handle a segment with the
         maximum combination of features at once (e.g., a SYN URG PUSH
         FIN segment with options and data).

         30 points for KOing your opponent with legal blows.  (That is,
         operate a connection until one TCP or the other crashes, the
         surviving TCP has KOed the other.  Legal blows are segments
         that meet the requirements of the specification.)

         20 points for KOing your opponent with dirty blows.  (Dirty
         blows are segments that violate the requirements of the
         specification.)

         10 points for showing your opponents checksum test is faulty or
         disabled.

      Host & Gateway IP Division

25 points for doing fragmentation and reassembly.

15 points for doing loose source route option.

         15 points for doing strict source route option.

         10 points for doing return route option.

         10 points for using source quench messages.

         10 points for using routing advice messages.

         5 points for doing something with the type of service.

         5 points for doing something with the security option.

         5 points for doing something with the timestamp option.

         5 points for showing that a gateway forwards datagrams without
         decreasing the time to live (showing a gateway is faulty).

         5 points for showing that a gateway forwards datagrams with the
         time to live equal zero (showing a gateway is faulty).

         10 points for showing that a gateway or hosts checksum test is
         faulty or disabled (showing a gateway is faulty).

      Bonus Points

         10 points for the best excuse.

         20 points for the fewest excuses.

30 points for the longest conversation.

40 points for the most simultaneous connections.

50 points for the most simultaneous connections with distinct
TCPs.

   Tests

      The following tests have been identified for checking the
      capabilities of a TCP implementation.  These may be useful in
      attempting to KO an opponent.

         1.  Single connection.  Open & close a single connection many
             times.

         2.  Multi connections.  Open several connections
             simultaneously.  Two connections to the same socket
             (i.e., a-b and a-c) check proper separation of data.

3. Half Open Connection. Open a connection, crash local TCP
and attempt to open same connection again.

         4.  Piggy-back Loop.  Open connections via Telnet.

         user telnet--->TCP--->IP--->net--->IP--->TCP--->server telnet
                                                               |
                                                               V
         server telnet<---TCP<---IP<---net<---IP<---TCP<---user telnet
              |
              V
         user telnet--->...

         5.  Maximum connections.  Open connections between a pair of
             TCP until refused or worse.

6. Refused connection. Open a connection to a non-accepting
socket, does it get refused?

7. Zero Window. Try to send data to a TCP that is presenting
a zero window.

8. Fire Hose. Make many connections to data source ports, or
connections to a data sink and send as fast as you can.

9. Urgent Test. Try to send data to a user program that only
receives data when in urgent mode.

         10. Kamikazi Segment.  Send and receive nastygrams.  A
             nastygram is a segment with SYN, EOL, URG, and FIN on and
             carrying one octet of data.

         11. Sequence Wraparound.  Test proper functioning when sequence
             numbers (a) pass 2**31 (i.e., go from plus to "minus") and
             (b) pass 2**32 (i.e., go from 2**32-1 to 0).

12. Buffer size. With buffer size not equal to one, send data
in segments of various sizes, use urgent occasionally.

13. Send a nastygram into a half open connection when the
sequence number is about to wrap around.

New Ideas

   The above tests check for basic operation and handling of some of the
   tricky cases.  They do not consider performance in any way, or check
   to see if some of the recently developed ideas have been implemented.

      New Mechanisms

         1.  The John Nagel Procedures (RFC-896).

         2.  The Van Jacobson Procedures (slow start, RTT measurements,
             etc).

         3.  The SQuID Procedures (RFC-1016).

      Performance Tests

         Performance tests are difficult to specify because the results
         depend so much on the state of the environment of the test.
         Here are a few possibilities:

         1.  FTP Throughput:  Send a 1 megabyte file to a locally nearby
             machine on an Ethernet measuring the elapsed time.

         2.  FTP Throughput:  Send a 1 megabyte file to a locally nearby
             machine on an ARPANET measuring the elapsed time.

         3.  NETBLT Throughput:  Send a 1 megabyte file to a locally
             nearby machine on an Ethernet measuring the elapsed time.

         4.  NETBLT Throughput:  Send a 1 megabyte file to a locally
             nearby machine on an ARPANET measuring the elapsed time.

         5.  Character Test:  Use a test program to send a character via
             TCP to the Echo Server (RFC-862), time the round trip (from
             the time the character is sent until the echo is returned
             to the test program).

Appendix

   For History Buffs Only:

      The following item was in the original 1980 tests, but has been
      moved to this appendix since it no longer applies.

         10 points for correctly handling rubber baby buffer bumpers in
         both directions (End of Letter sequence number adjustments).

Status of This Memo

1 point for talking to yourself (opening a connection).

2 points for repeating the above without reinitializing the TCP.

5 points for a complete conversation via the testing gateway.

4 points for repeating the above without reinitializing the TCP.

10 points for a complete conversation via the testing gateway.

20 points for KOing your opponent with dirty blows. (Dirty blows are segments that violate the requirements of the specification.)

25 points for doing fragmentation and reassembly.

15 points for doing loose source route option.

30 points for the longest conversation.

40 points for the most simultaneous connections.

50 points for the most simultaneous connections with distinct TCPs.

3. Half Open Connection. Open a connection, crash local TCP and attempt to open same connection again.

6. Refused connection. Open a connection to a non-accepting socket, does it get refused?

7. Zero Window. Try to send data to a TCP that is presenting a zero window.

8. Fire Hose. Make many connections to data source ports, or connections to a data sink and send as fast as you can.

9. Urgent Test. Try to send data to a user program that only receives data when in urgent mode.

12. Buffer size. With buffer size not equal to one, send data in segments of various sizes, use urgent occasionally.

13. Send a nastygram into a half open connection when the sequence number is about to wrap around.