[metrics-bugs] #25383 [Metrics/Website]: Deprecate stats.html and stats/*.csv files

Tor Bug Tracker & Wiki blackhole at torproject.org
Sat Jul 28 10:26:25 UTC 2018


#25383: Deprecate stats.html and stats/*.csv files
-----------------------------+------------------------------
 Reporter:  karsten          |          Owner:  metrics-team
     Type:  enhancement      |         Status:  needs_review
 Priority:  High             |      Milestone:
Component:  Metrics/Website  |        Version:
 Severity:  Normal           |     Resolution:
 Keywords:                   |  Actual Points:
Parent ID:                   |         Points:
 Reviewer:                   |        Sponsor:
-----------------------------+------------------------------
Changes (by karsten):

 * status:  assigned => needs_review
 * priority:  Medium => High


Comment:

 I made more progress on this ticket. Going through the remaining steps
 from comment 16 above:

 > Next steps after that, in no particular order:
 >  - Decide where to add the legend (Java or R).

 Maybe the CSV file header is not the right place for this legend after
 all. The specification of parameters and columns can be quite long, and if
 we also plan to include scheduled and past changes, the header section
 will be even longer. Oh, and whatever we write here won't change in the
 CSV file that somebody downloaded, until they decide to download a new CSV
 file from us.

 I tried out something else: extend our existing `stats.html` to also cover
 the per-graph CSV files. The CSV file header could then include a link to
 that page or possibly even a subsection on that page.

 I'll post a branch shortly.

 >  - Discuss whether we want to use wide/long format for these CSVs. Yes,
 we should have had this discussion a few weeks back, but it's better to
 have it next week than never.

 I made remarks in the extended `stats.html` page to change the format.
 This could be the first scheduled change that would become effective a
 couple weeks later.

 >  - Decide how we announce and make changes in the future, in particular
 backward-incompatible ones. For example, Onionoo has a
 `"next_major_version_scheduled"` field to announce backward-incompatible
 changes, and we need something like that, too.

 We could include remarks like the ones I made on `stats.html`, and we
 might even add a change log to the top of that page to summarize past and
 upcoming changes.

 >  - Add a note to stats.html saying when it's going to go away.

 In the page.

 >  - Add a note to CSV file header saying it's still BETA until the same
 date as mentioned on stats.html, maybe with 2 or 4 weeks overlap.

 I did not touch CSV file headers yet. Once we have a fixed deprecation
 date, let's include it there.

 Alright, please review [https://gitweb.torproject.org/karsten/metrics-
 web.git/commit/?h=task-25383-2&id=cc81eea95ea58767aecc414f5a45165e27bf9f3a
 commit cc81eea in my task-25383-2 branch]. Couple questions:
  - Does it make sense to specify our per-graph CSV files there, rather
 than in the CSV file header?
  - Is the format with two subsections Parameters and Columns okay? Is
 something missing?
  - Are specifications roughly correct/plausible?
  - Do the suggestions make sense? The rule of thumb for deciding which
 columns we need was: "it should require a code change to change columns,
 and neither the user should be able to control which columns exist by
 their choice of parameters, nor should the available data have any
 influence on that."

 Regarding timing, how about we deploy this page still in July, make
 suggested changes by August 15, take out pre-aggregated stats files by
 September 15, and handle any questions coming out of that in the two weeks
 before the Mexico City meeting?

 Changing priority back to high for the still-in-July bit. Thanks!

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/25383#comment:29>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the metrics-bugs mailing list