[tor-bugs] #2680 [Metrics]: present bridge usage data so researchers can focus on the math
Tor Bug Tracker & Wiki
torproject-admin at torproject.org
Mon Mar 14 20:39:17 UTC 2011
#2680: present bridge usage data so researchers can focus on the math
---------------------+------------------------------------------------------
Reporter: arma | Owner: karsten
Type: task | Status: assigned
Priority: normal | Milestone:
Component: Metrics | Version:
Keywords: | Parent:
Points: | Actualpoints:
---------------------+------------------------------------------------------
Comment(by arma):
The "fingerprint" and "descriptor" in statuses.csv are always the same. I
think you're printing "fingerprint" for both of them?
I think the next step is to write a short overview of how to combine
these files to answer some research question. For example, say I want to
get a list of all the countries that a given bridge has seen over time. I
guess I want to iterate over all bridge fingerprints. Should I use the
list of all fingerprints I find in statuses.csv or the one in
descriptors.csv? Should they be the same?
So step zero, given a fingerprint, is to look it up in relays.csv and make
sure it's not there. If it is, either ignore it or, if we want to get
fancier, ignore only its data from around the time it's in the relay list.
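Here's a rough Python sketch of the simple version of step zero (ignore the fingerprint entirely). It assumes relays.csv has a header row with a "fingerprint" column; the real layout isn't spelled out in this ticket, and the sample rows and function names are made up for illustration.

```python
import csv
import io

# Hypothetical sample; the actual columns of relays.csv are an assumption.
RELAYS_CSV = """fingerprint,validafter
AAAA,2011-03-01 00:00:00
"""


def relay_fingerprints(relays_file):
    """Collect every fingerprint that appears anywhere in the relay list."""
    return {row["fingerprint"] for row in csv.DictReader(relays_file)}


def drop_relays(bridge_fingerprints, relay_fps):
    """Keep only the fingerprints that were never seen as a relay."""
    return [fp for fp in bridge_fingerprints if fp not in relay_fps]


relay_fps = relay_fingerprints(io.StringIO(RELAYS_CSV))
bridges_only = drop_relays(["AAAA", "BBBB"], relay_fps)
```

The fancier variant would also need the relay-list timestamps, so it's left out here.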
Step one is to look it up in statuses.csv, get a set of descriptor hashes,
discard all the ones whose third-to-last value is not TRUE, and skip
duplicate hashes.
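Step one could look roughly like this in Python. Only the "fingerprint" and "descriptor" column names and the TRUE/FALSE value in third-to-last position come from the description above; the other columns, the sample rows, and the function name are placeholders I made up.

```python
import csv
import io

# Hypothetical rows. flag_a sits in the third-to-last position; what that
# column really means in statuses.csv is not stated in this ticket.
STATUSES_CSV = """validafter,fingerprint,descriptor,flag_a,flag_b,flag_c
2011-03-01 00:00:00,BBBB,d1,TRUE,FALSE,FALSE
2011-03-01 01:00:00,BBBB,d1,TRUE,FALSE,FALSE
2011-03-01 02:00:00,BBBB,d2,FALSE,TRUE,FALSE
2011-03-01 03:00:00,CCCC,d3,TRUE,FALSE,FALSE
"""


def descriptor_hashes(statuses_file, fingerprint):
    """Return the unique descriptor hashes for one fingerprint, in file order."""
    reader = csv.reader(statuses_file)
    header = next(reader)
    fp_col = header.index("fingerprint")
    desc_col = header.index("descriptor")
    seen = set()
    hashes = []
    for row in reader:
        if row[fp_col] != fingerprint:
            continue  # some other bridge
        if row[-3] != "TRUE":
            continue  # third-to-last value is not TRUE, so discard
        if row[desc_col] in seen:
            continue  # skip duplicate hashes
        seen.add(row[desc_col])
        hashes.append(row[desc_col])
    return hashes


hashes = descriptor_hashes(io.StringIO(STATUSES_CSV), "BBBB")
```

With the sample rows, d1 is kept once and d2 is dropped because its third-to-last value is FALSE.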
Then step two is to take those remaining descriptor hashes and look them
up in descriptors.csv, at which point I can learn which countries they saw,
unless the countries are all NA, in which case we don't have data?
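A possible sketch of step two in Python, assuming descriptors.csv has a "descriptor" column followed by one column per country code, with NA where there is no data. That layout is a guess on my part, and the sample rows and function name are made up.

```python
import csv
import io

# Hypothetical layout: one column per country code, "NA" meaning no data.
DESCRIPTORS_CSV = """descriptor,cn,de,us
d1,8,NA,16
d4,NA,NA,NA
"""


def countries_seen(descriptors_file, hashes):
    """Map each wanted descriptor hash to the sorted country codes it has data for."""
    wanted = set(hashes)
    result = {}
    for row in csv.DictReader(descriptors_file):
        if row["descriptor"] not in wanted:
            continue
        result[row["descriptor"]] = sorted(
            cc for cc, value in row.items()
            if cc != "descriptor" and value != "NA"
        )
    return result


seen = countries_seen(io.StringIO(DESCRIPTORS_CSV), ["d1", "d4"])
```

An empty list for a hash is the "countries are all NA" case, which I'm reading as "no data" rather than "no users".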
And the optional step three is to take the timestamp from the status file
and look up the fingerprint in assignments.csv to decide if it's http,
email, or unassigned?
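For step three, one way to do the lookup is to pick the most recent assignment at or before the status timestamp. That matching rule is my assumption, as are the column names ("assignment", "fingerprint", "type"), the timestamp format, and the sample rows.

```python
import csv
import io
from datetime import datetime

# Hypothetical layout and timestamp format for assignments.csv.
ASSIGNMENTS_CSV = """assignment,fingerprint,type
2011-03-01 00:30:00,BBBB,http
2011-03-02 00:30:00,BBBB,email
"""

FMT = "%Y-%m-%d %H:%M:%S"


def pool_at(assignments_file, fingerprint, status_time):
    """Return the latest assignment type at or before status_time, else None."""
    when = datetime.strptime(status_time, FMT)
    best_time, best_type = None, None
    for row in csv.DictReader(assignments_file):
        if row["fingerprint"] != fingerprint:
            continue
        assigned = datetime.strptime(row["assignment"], FMT)
        if assigned <= when and (best_time is None or assigned > best_time):
            best_time, best_type = assigned, row["type"]
    return best_type


pool = pool_at(io.StringIO(ASSIGNMENTS_CSV), "BBBB", "2011-03-01 12:00:00")
```

A None result just means no assignment was found for that time; whether to treat that as "unassigned" is still an open question.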
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/2680#comment:3>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online