[metrics-team] How to interpret written/read bytes per second in Relay Search?
David Fifield
david at bamsoftware.com
Tue Mar 24 21:39:42 UTC 2020
I'm looking at the graphs for the Snowflake bridge,
https://metrics.torproject.org/rs.html#details/5481936581E23D2D178105D44DB6915AB06BFB7F
For the past year and more, the "written bytes" and "read bytes" have
matched almost identically. On 2020-02-19, they start to diverge. That
date coincides with the limited release of a version of Snowflake that
is able to run for a longer time, and a rebuild and restart of the
bridge (https://bugs.torproject.org/33336#comment:8). In the past few
days, the "read bytes" has grown to be about 10 times the "written
bytes". I'm trying to interpret this information.
1. Where does the data in the graph come from? I didn't find it covered
at https://metrics.torproject.org/reproducible-metrics.html. I looked
at the most recent bridge-extra-info descriptor:
write-history 2020-03-24 14:50:48 (86400 s) 2308002816,3215030272,2062971904,3323116544,4469634048
read-history 2020-03-24 14:50:48 (86400 s) 5265647616,8424690688,5873989632,38813471744,7116800000
dirreq-write-history 2020-03-24 14:50:48 (86400 s) 95039488,88645632,70933504,96854016,57280512
dirreq-read-history 2020-03-24 14:50:48 (86400 s) 2764800,1468416,1036288,2764800,2027520
However if I just divide the write-history and read-history numbers
by 86400, the numbers I get don't match the graph.
write read
2020-03-20 26.7 kB/s 60.9 kB/s
2020-03-21 37.2 kB/s 97.5 kB/s
2020-03-22 23.9 kB/s 68.0 kB/s
2020-03-23 38.5 kB/s 449.2 kB/s
2020-03-24 51.7 kB/s 82.4 kB/s
2. When I look at the graphs of some default bridges, I see the written
and read number being almost equal always.
https://metrics.torproject.org/rs.html#details/5F161D2E5713C93F16FEEDD63178E37208AA78DF
https://metrics.torproject.org/rs.html#details/8F4541EEE3F2306B7B9FEF1795EC302F6B84DAE8
When I look at moria1, a directory authority, I see written being
much greater than read.
https://metrics.torproject.org/rs.html#details/9695DFC35FFEB861329B9F1AB04C46397020CE31
What accounts for the equality in some cases and the inequality in
others? What could explain the divergence in the case of the
Snowflake bridge?
3. Roger found a case where traffic tagged with a 0.0.0.0/8 address was
being ignored by some part of tor's internal bandwidth accounting
(https://bugs.torproject.org/33693). Until recently, the Snowflake
bridge had a bug where, for certain clients, it reported a client
address of 0.0.0.0 to the tor bridge's ExtORPort (https://bugs.torproject.org/33157).
The bug is only partially fixed--we now report no address at all for
the affected clients. The fix was not deployed until 2020-02-22, so
it doesn't explain the divergence of read/written on its own. Do you
know offhand whether an apparent client address of 0.0.0.0, or no
address at all, would cause problems with measuring usage?
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 20200324-flakey-bwhist-6m.svg.gz
Type: application/gzip
Size: 7640 bytes
Desc: not available
URL: <http://lists.torproject.org/pipermail/metrics-team/attachments/20200324/521e4398/attachment.gz>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 20200324-flakey-bwhist-1m.svg.gz
Type: application/gzip
Size: 1772 bytes
Desc: not available
URL: <http://lists.torproject.org/pipermail/metrics-team/attachments/20200324/521e4398/attachment-0001.gz>
More information about the metrics-team
mailing list