[network-health] Error 500 on metrics.torproject.org
Hiro
hiro at torproject.org
Tue Apr 2 12:48:09 UTC 2024
On 4/2/24 12:27, Roger Dingledine wrote:
> On Fri, Mar 29, 2024 at 03:09:37PM +0000, torix via network-health wrote:
>> https://metrics.torproject.org/userstats-bridge-table.html
>> Gives me the 500 error page.
>>
>> Hope this is the right place to let you know,
> Thanks. Hiro fixed it in a short-term way by restarting one of the
> back-end services, but I think there is an ongoing problem where it
> will continue to need restarts. Some sort of monitoring would probably
> be smart too imo, but it's easy to suggest more work for people :),
> and it is for Hiro to pick/manage the metrics roadmap in terms of which
> fires to put out when.
Hi,
we have had an issue with the R-server we run to produce the graphs on
metrics.torproject.org.
It seems there is a bug that makes the process consume a lot of memory
and the kernel kills it.
We never had an issue in the past with this, but it seems some specific
query or set of queries is causing it now.
We have some history of our services being targeted like this. Sometimes
it is because someone likes to have fun like this, and some other time
it is because someone decided to setup some tool that is making a lot of
requests.
The idea here is to fix the bug rather than setting up something that
just restarts the service, but we are also a bit stretched and it might
take us longer that we would have wanted to, so just restarting the
service might be an option.
Talk soon,
-hiro
>
> See also
> https://gitlab.torproject.org/tpo/network-health/metrics/website/-/issues/40112
>
> --Roger
>
> _______________________________________________
> network-health mailing list
> network-health at lists.torproject.org
> https://lists.torproject.org/cgi-bin/mailman/listinfo/network-health
More information about the network-health
mailing list