[metrics-bugs] #29374 [Metrics/Onionperf]: Analysis files sometimes present negative numbers in the payload_progress field

Tor Bug Tracker & Wiki blackhole at torproject.org
Wed Apr 10 11:55:38 UTC 2019


#29374: Analysis files sometimes present negative numbers in the payload_progress
field
-----------------------------------+------------------------------
 Reporter:  irl                    |          Owner:  metrics-team
     Type:  defect                 |         Status:  new
 Priority:  Medium                 |      Milestone:
Component:  Metrics/Onionperf      |        Version:
 Severity:  Normal                 |     Resolution:
 Keywords:  acute-2019-q1-planned  |  Actual Points:
Parent ID:                         |         Points:
 Reviewer:                         |        Sponsor:
-----------------------------------+------------------------------

Comment (by acute):

 Replying to [comment:3 karsten]:
 > Some thoughts on this, since I have been looking at tgen logs for a
 while now:
 >
 >  - The tgen log line you mention does not contain anything to identify
 the transfer. Would it be easier to use the next tgen log line which at
 least contains the source port? `2019-02-11 00:13:00 1549843980.555482
 [info] [shd-tgen-transport.c:203] [_tgentransport_changeState] transport
 TCP,16,localhost:127.0.0.1:43810,localhost:127.0.0.1:41204,op-
 ab.onionperf.torproject.net:137.50.19.2:80,state=CONNECT,error=NONE moving
 from state CONNECT to state INIT`; and would it still be accurate enough?

 Have had another look at it, the OP parser only looks at lines containing
 [transfer-status], [transfer-complete] and [transfer-error]. It currently
 identifies transfers by the time and transfer number only, without
 including any ports.
 The problem is that the transfer number only starts appearing in lines
 onwards after 'moving from state COMMAND to state RESPONSE', and the
 parser gets all the other information for a transfer from the tgen process
 summary report...

 Not sure where to go from here...we could change the way we identify
 reports (I agree with you that we should use both ports + time) to get
 timestamp information from the previous lines, and we add rules for
 parsing these? This would mean a major change to the OP parser.



 >  - What's the reason for overwriting this timestamp with the calculated
 value for ''complete'' transfers? This log line has usec granularity, too
 (1554084910'''.250433'''). I think it would be easier to compare partial
 progress of complete and incomplete transfers if we'd use the same
 timestamps for calculating them.
 My fear was that if the values reported by the tgen process are the
 absoulute source of truth, by changing them we'd report inaccurate
 information. I agree that calculating everything the same way is more
 consistent - we should go with this after we figure out what to do for the
 previous step.
 >  - Do we know of any tools ''relying'' on `payload_progress` numbers
 being negative for incomplete downloads? I admit that this would be a
 weird use of the data. Still, maybe worth checking if we're using them
 anywhere ourselves and probably worth writing something in the next
 release notes.

 Excellent point, will have a look at the visualization module.

 >  - You'll find more tgen logs for testing at https://op-
 ab.onionperf.torproject.net/ .
 Thanks for this!

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/29374#comment:4>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the metrics-bugs mailing list