Can you please summarize the options proposed? I don’t think they are written down anywhere, which makes it hard for readers to form a well-informed opinion. If I understood correctly, the options were:
- Do nothing and keep a millisecond granularity on hold times reported in attributable failures
- Change the encoding of the hold time in attributable failures to have a granularity of X ms (with X to be defined). What are the advantages and drawbacks of this option?
- Keep the millisecond granularity but ask nodes to subtract a hard-coded threshold value. We’ve discussed several variations of this, and to be honest I’m not sure exactly how it would work, so I would like to see it written down for analysis