[metrics-bugs] #25383 [Metrics/Website]: Deprecate stats.html and stats/*.csv files
Tor Bug Tracker & Wiki
blackhole at torproject.org
Thu Mar 1 09:01:26 UTC 2018
#25383: Deprecate stats.html and stats/*.csv files
-----------------------------+------------------------------
Reporter: karsten | Owner: metrics-team
Type: enhancement | Status: new
Priority: Medium | Milestone:
Component: Metrics/Website | Version:
Severity: Normal | Resolution:
Keywords: | Actual Points:
Parent ID: | Points:
Reviewer: | Sponsor:
-----------------------------+------------------------------
Comment (by karsten):
Replying to [comment:3 irl]:
> #25240 is the relevant ticket, with the idea being to add some tweets to
> metrics-bot about the number of users. It sounds to me that the new graph
> based ones are better for this use case than the raw files under stats/.
>
> A few questions though:
>
> * Would we update stats.html to describe the format of the new CSV files
>   along with the API for retrieving them programmatically?
Ideally, the new CSV files won't need as much description as the old ones.
Column names are chosen in reference to graphs, units are the same as in
the graph, parameters are the same as for the graph, and so on. The
current plan is to include a short comment header in the CSV files, see
#25387. Ideally, we wouldn't have to write and maintain another HTML page
for that. It would be yet one more thing to write whenever we want to
provide a new graph, and we want to make it easier to add new graphs,
not harder.
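A minimal sketch of how a consumer might read a CSV file that carries such a comment header. It assumes the header consists of leading lines starting with `#`; the actual format is tracked in #25387 and may differ. The function name `read_graph_csv`, the column names, and the sample values are all hypothetical and not real data.

```python
import csv
import io

# Hypothetical sample file. The '#' comment prefix, the column names, and
# the values are assumptions for illustration only, not real metrics data.
SAMPLE = """\
# Hypothetical comment header describing the file.
# Columns mirror the graph: date, country, users.
date,country,users
2018-02-27,de,1000
2018-02-28,de,1100
"""


def read_graph_csv(text):
    """Separate '#' comment lines from the CSV body and parse the body."""
    comments = []
    data_lines = []
    for line in io.StringIO(text):
        if line.startswith("#"):
            # Keep the comment text without the leading marker.
            comments.append(line[1:].strip())
        elif line.strip():
            # Everything else that is not blank belongs to the CSV body.
            data_lines.append(line)
    # The first non-comment line is treated as the column header row.
    rows = list(csv.DictReader(data_lines))
    return comments, rows


comments, rows = read_graph_csv(SAMPLE)
```

Because the column names would match the graph, a reader could access values by name (for example `rows[0]["users"]`) without consulting a separate format description.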
> * Do we describe how we get from CollecTor to the new graph based CSV
>   files (which would include the steps necessary for getting to the
>   stats/*.csv files)?
Yes, this is still relevant, and it deserves an HTML page somewhere.
That could be stats.html or a new page.
> I'm not opposed to removing them from the website for the reason that I
> feel they are intermediate artefacts of the analysis/visualisation
> process, as long as we are not creating them with secret data that would
> not be accessible to others (afaik it's all in CollecTor) and as long as
> we describe our process.
>
> For most use cases I think the graph based CSV files are going to be
> easier for new researchers to get started with anyway.
Sounds good!
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/25383#comment:5>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online