[metrics-team] server-descriptor not readable (2018-02-08-11-05-00-server-descriptors has >30k descriptors)
Karsten Loesing
karsten at torproject.org
Fri Feb 9 09:19:51 UTC 2018
Hi Kathi,
On 2018-02-09 09:41, Katharina Haselhorst wrote:
> Yes, the descriptor seems to be valid - but still the java metrics lib
> can't read it (maybe too big? or some bug in the implementation?).
> Can someone confirm this issue or is it a problem specific to my
> machine? Process runs with 100% cpu, I killed it after 10mins or so...
> Just for reference my testing code:
>
> public static void testReadServerDescriptor(String file) {
> DescriptorReader dr =
> DescriptorSourceFactory.createDescriptorReader();
> for (Descriptor d : dr.readDescriptors(new File(file))) {
> if (!(d instanceof RelayServerDescriptor)) {
> System.err.println("wrong descriptor type: "
> + d.getClass());
> continue;
> }
> RelayServerDescriptor descriptor = (RelayServerDescriptor) d;
> System.out.println("parsed descriptor for relay "
> + descriptor.getFingerprint());
> }
> }
I just gave this a try and was able to parse the file in a bit under 20
minutes. I set -Xmx6g, though it might also work with less heap space.
So, this is a known limitation of metrics-lib which cannot handle large
descriptor files very well. We're tracking this bug here, just in case
you want to follow along:
https://trac.torproject.org/projects/tor/ticket/20395
Until then, maybe just split the file into several smaller files, right
before lines starting with "@type".
Thanks for reporting this issue!
> Regards, Kathi
All the best,
Karsten
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 528 bytes
Desc: OpenPGP digital signature
URL: <http://lists.torproject.org/pipermail/metrics-team/attachments/20180209/0aac02ce/attachment.sig>
More information about the metrics-team
mailing list