Statsd can be viewed as a blackbox with network inputs and outputs.
Implementing a test harness that could be run against any implementation would be very helpful for the implementations and for the goal of coordinating those implementations.
(For example, the Promises/A test suite has worked well to highlight bugs in various implementations of the Promises/A specification.)