[tor-bugs] #11788 [Metrics Data Processor]: Consider providing descriptor tarballs as .tar.xz rather than .tar.bz2
Tor Bug Tracker & Wiki
blackhole at torproject.org
Wed May 7 18:14:19 UTC 2014
#11788: Consider providing descriptor tarballs as .tar.xz rather than .tar.bz2
----------------------------------------+-----------------
Reporter: karsten | Owner:
Type: enhancement | Status: new
Priority: normal | Milestone:
Component: Metrics Data Processor | Version:
Resolution: | Keywords:
Actual Points: | Parent ID:
Points: |
----------------------------------------+-----------------
Comment (by karsten):
Sample 2:
{{{
$ ls -lh votes-2014-04.tar.bz2
-rw-r--r-- 1 metrics metrics 4.9G May 7 06:15 votes-2014-04.tar.bz2
$ bunzip2 votes-2014-04.tar.bz2
$ ls -lh votes-2014-04.tar
-rw-r--r-- 1 metrics metrics 13G May 7 14:14 votes-2014-04.tar
$ time xz -9 votes-2014-04.tar
real 123m8.199s
user 117m30.129s
sys 0m21.541s
$ ls -lh votes-2014-04.tar.xz
-rw-r--r-- 1 metrics metrics 172M May 7 14:14 votes-2014-04.tar.xz
}}}
That's an impressive reduction by a factor of 29 compared to the .tar.bz2
(4.9G down to 172M). I had no idea!
What will be funny is when people decompress a few votes tarballs (or even
all of them) on their hard disk and find that the extracted data occupies
77 times as much disk space as the compressed tarballs (13G versus 172M for
this one). Guess we should add a warning to data.html.
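
For anyone who wants to double-check the two factors quoted above, here is a
rough sketch of the arithmetic. It assumes that the G and M suffixes printed
by `ls -lh` are powers of 1024 and works from the rounded sizes shown in the
sample, so the results are approximate. The variable and function names are
only illustrative.

```shell
#!/bin/sh
# Sizes as printed by ls -lh in the sample above, converted to MiB.
# Assumption: ls -h uses powers of 1024, and the printed sizes are rounded.
bz2_mib=$(awk 'BEGIN { print 4.9 * 1024 }')   # votes-2014-04.tar.bz2 (4.9G)
tar_mib=$(awk 'BEGIN { print 13 * 1024 }')    # votes-2014-04.tar (13G)
xz_mib=172                                    # votes-2014-04.tar.xz (172M)

# ratio A B: print A divided by B, rounded to the nearest integer.
ratio() {
  awk -v a="$1" -v b="$2" 'BEGIN { printf "%.0f\n", a / b }'
}

bz2_vs_xz=$(ratio "$bz2_mib" "$xz_mib")   # how much smaller .tar.xz is than .tar.bz2
tar_vs_xz=$(ratio "$tar_mib" "$xz_mib")   # how much larger the plain .tar is than .tar.xz

echo "bz2 / xz: $bz2_vs_xz"
echo "tar / xz: $tar_vs_xz"
```

The first number is the saving from switching formats, the second is the
blow-up somebody sees after decompressing a .tar.xz to disk.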
--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/11788#comment:4>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online