[metrics-bugs] #34191 [Metrics/Onionperf]: Combine multiple analysis files into single data set
Tor Bug Tracker & Wiki
blackhole at torproject.org
Mon Jun 1 07:31:20 UTC 2020
#34191: Combine multiple analysis files into single data set
-------------------------------+--------------------------------
Reporter: karsten | Owner: acute
Type: enhancement | Status: needs_revision
Priority: Medium | Milestone:
Component: Metrics/Onionperf | Version:
Severity: Normal | Resolution:
Keywords: | Actual Points: 0.5
Parent ID: #33321 | Points: 0.5
Reviewer: karsten | Sponsor: Sponsor59-must
-------------------------------+--------------------------------
Comment (by acute):
Replying to [comment:6 karsten]:
> Thanks for working on this and providing a branch! I read your comment
and the code but did not test anything yet. Two concerns:
Thank you very much for the review!
>
> - I'm not sure if we can make it work with a variable number of
arguments to the `-d` parameter. The usage string is really misleading by
saying `-d PATH [LABEL ...]`, indicating that it accepts a single `PATH`
and zero or more `LABEL`s. We would want it to say `-d PATH [PATH ...]
LABEL` there. Do you know how to override that string? I didn't find
something and stopped looking at options that felt like hacking argparse.
From the top of my head, I think this is what `METAVAR` does in
`argparse`, will revise.
>
> - Regarding the limitation that the `LABEL` cannot be a path anymore, I
think that's problematic. For one, it's turning this change into a
backward incompatible one. And it's also not intuitive for new users who
don't know how the parameter works today. Imagine a case where somebody
puts all files for OnionPerf instance op-hk into one directory `op-hk/`
and all files for op-ab into another one `op-ab/`. If they want to
visualize these files using OnionPerf instance names as labels, they'd
either have to pick different labels or rename the directories. Doable,
but for sure surprising.
>
That is a very good point! We could turn the error into a warning to
allow the user to run the command anyway. We could not show this warning
if the label is a path already included the list of inputs specified by
the user. What do you think?
> Maybe it's better to keep this simple by allowing just two arguments as
we do right now. Users will understand that they have to put all files for
one data set into a directory. No surprises, they'll get what they want.
What do you think?
What happens when we want to produce analyses comparing several months,
for each instance? Currently, with the data as it is downloaded from
collector, files from all the instances are in the same month folder,
which makes producing a graph like the one just described hard - you'd
have to create several directories, move files around and then move them
back if you want to look at the data aggregated in a different way.
With this, you can just do:
`onionperf visualize -d onionperf-2019-11/*/*nl* nov2019 -d
onionperf-2019-12/*/*nl* dec2019 -d onionperf-2020-01/*/*nl* jan2020`
The point is to avoid manually moving things around in the file system,
which is where mistakes happen.
>
> Apart from these concerns about the user interface, the patch looks good
to me. The decisions about using `*json*` as pattern (you might want to
use `*onionperf.analysis.json*` here, now that I think about it) or using
functionality from the reprocessing module look good to me. I'd say if we
can figure out the potential user interface issues, this will be a quick
review. Thanks!
I will move to using the pattern you suggested, thank you!
I will work on this patch as my next task.
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/34191#comment:7>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online
More information about the metrics-bugs
mailing list