[tor-bugs] #29624 [Metrics/Exit Scanner]: New version of exit list format
Tor Bug Tracker & Wiki
blackhole at torproject.org
Fri Apr 12 10:05:12 UTC 2019
#29624: New version of exit list format
-------------------------------------+--------------------------------
Reporter: irl | Owner: irl
Type: task | Status: needs_revision
Priority: Medium | Milestone:
Component: Metrics/Exit Scanner | Version:
Severity: Normal | Resolution:
Keywords: metrics-roadmap-2019-q2 | Actual Points:
Parent ID: #29650 | Points:
Reviewer: irl | Sponsor:
-------------------------------------+--------------------------------
Changes (by karsten):
* status: needs_review => needs_revision
Comment:
Nice blog post! It's indeed interesting to see what fraction of IP
addresses is only available in exit lists. If I may nitpick:
- It's slightly confusing (to me) that the red and green line show single
set sizes whereas the blue line shows one set minus the other one. I
wonder if the lines would work better (for me) if they were: Found only in
consensus, Found in consensus and exit list, and Found only in exit list.
That way it would be possible to add up all three lines and obtain the
total number of addresses.
- Maybe this graph would work as a stacked area plot like
[https://metrics.torproject.org/bandwidth-flags.html our bandwidth flags
graph], just with three stacked areas rather than four. If you look at
that linked bandwidth flags graph, yellow would be "Found only in
consensus", green would be "Found in consensus and exit list" and blue
would be "Found only in exit list". That would show pretty quickly what
fraction someone is missing when only looking at consensuses: the blue
part at the top.
- If you like, I can help you with making that graph. Just give me the
raw data, and I'll make a quick graph out of it. Also as an experience to
learn more about the tidyverse. And for the next funding proposal, of
course.
Anyway, I think you wanted a review of the exit list spec! Here's what I
found:
- "[the identity-ed25519 line] MUST appear as the first or second element
in the router descriptor": Why the first? In version 2 or later the first
element is always "exit-list", so it can only be the second element. And
why router descriptor? You mean exit list?
- Sometimes we're using two spaces between sentences. Let's be consistent
and either use one or two.
- In the "location" line, should we write the AS number as AS12345 rather
than just 12345? Technically, it's correct to just use an integer there,
but if we think of humans reading the format, we might want to include the
"AS" part anyway.
- Should the "platform" line suggest a possible format for exit list
scanner software version, Tor software version, and operating system?
Something like "ExitScanner 55.5 using Tor 1.0.0 on Windows 10"?
- There are at least two places where you write "[XXX: Informational:
...]". Can we just include those informational parts without the XXX
stuff?
- A bit below that you refer to the "created" timestamp, which is now
called the "published" timestamp.
- I wonder if the "address" part of "exit-address4" and "exit-address6"
lines should be unique (with respect to the exit list entry, not the whole
exit list, of course). This has implications on how somebody parsing an
exit list entry can store and retrieve parsed addresses. Of course, if we
think it's useful to include an address several times with different scan
times, we'll have to say that the address is not necessarily unique. In
any case, we should say what people can expect there, or they'll make
assumptions.
- The "exit-scanner-sig-ed25519" should probably specify where to expect
the signature part. Is it going to be part of the line, separated by SP
from the keyword? Or is it going to be an object in the next line? In the
latter case the subsequent description would have to say "including the
newline character after" rather than "including the first space after".
Other than these comments, the plan to build it see if it works sounds
reasonable to me. Thanks!
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/29624#comment:18>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
More information about the tor-bugs
mailing list