I measure latency on a client, on service and on using some very expensive, cut-through switches. This shotgun method lets me lazily detect where the latency comes from with a single look. I understand not everybody has access to cut-through switching, but for most uses port mirroring is available on a lot of hardware
- was it over a LAN or WAN?
- how were the latency improvements measured?
- was there a UI on the front end or was this strictly a backend to backend scenario