
Optimize Piece's memory usage #35

Open

wants to merge 2 commits into master

Conversation

gzsombor

Remove a couple of unnecessary fields from the Piece class (a sketch of the derived replacements follows the list):

  • offset -> can be calculated as index * pieceLength
  • seeder -> torrent.isSeeder()
  • bucket -> torrent.getBucket()
  • hash -> equivalent to reading torrent.getPiecesHashes() at offset index * Torrent.PIECE_HASH_SIZE, so we don't have to duplicate this information.
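
A minimal sketch of what the slimmed-down class could look like, assuming accessors named after the bullet points above (getPieceLength(), isSeeder(), getPiecesHashes()); the bucket field follows the same delegation pattern. This is illustrative, not the actual PR diff:

    import java.nio.ByteBuffer;

    public class Piece {
        private final Torrent torrent;  // shared state lives on the torrent, not per piece
        private final int index;

        public Piece(Torrent torrent, int index) {
            this.torrent = torrent;
            this.index = index;
        }

        // replaces the stored 'offset' field
        public long getOffset() {
            return (long) this.index * this.torrent.getPieceLength();
        }

        // replaces the stored 'seeder' flag
        public boolean isSeeder() {
            return this.torrent.isSeeder();
        }

        // replaces the per-piece copy of the 20-byte SHA-1 hash
        public ByteBuffer getHash() {
            ByteBuffer hashes = this.torrent.getPiecesHashes().duplicate();
            hashes.position(this.index * Torrent.PIECE_HASH_SIZE);
            hashes.limit(hashes.position() + Torrent.PIECE_HASH_SIZE);
            return hashes.slice();  // zero-copy view into the shared buffer
        }
    }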

…entHash instead of separate byte buffer in every Piece object
@@ -418,6 +418,12 @@ public void save(OutputStream output) throws IOException {
        return md.digest();
    }

    public static byte[] hash(ByteBuffer data) throws NoSuchAlgorithmException {
        MessageDigest md = MessageDigest.getInstance("SHA-1");
        md.update(data);
        return md.digest();
    }

This will create the byte array internally.

Author

Only if it's a ByteBuffer that isn't backed by an accessible byte array, and even then it only creates a 4K buffer. That is a more memory-friendly solution than allocating a byte array of at least 64K just to accommodate a torrent block for hashing. I could run the torrent client more efficiently this way, YMMV.
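
For contrast, a sketch of the two approaches being discussed, inside a hash method that receives the piece payload as a ByteBuffer named data:

    // Copy-based approach: materializes the whole piece as a byte[]
    // before hashing, which allocates 64K or more per call.
    byte[] copy = new byte[data.remaining()];
    data.get(copy);
    md.update(copy);

    // Buffer-based approach: hand the buffer straight to the digest.
    // For a direct (non-array-backed) buffer, the JDK feeds it through
    // a small internal scratch array instead of a piece-sized allocation.
    md.update(data);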


The buffer being passed is the data read from the "output" file (this._read(...)), which is a direct buffer, i.e. not array-backed, no? The piecesHashes one is array-backed.

I couldn't understand the rest of your answer, sorry. Upon reading the source of MessageDigest I can see this optimization is there, reusing the internal byte array tempArray, but here md is created and discarded just the same. I think the solution lies more along the lines of #184: having a "resource store" where you pick it up, use it, and relinquish ownership...
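
A hypothetical sketch of that "resource store" idea (the names here are mine, not taken from #184): keep one MessageDigest per thread and reuse it, instead of creating and discarding an instance on every call:

    private static final ThreadLocal<MessageDigest> SHA1 =
        ThreadLocal.withInitial(() -> {
            try {
                return MessageDigest.getInstance("SHA-1");
            } catch (NoSuchAlgorithmException e) {
                throw new IllegalStateException(e);
            }
        });

    public static byte[] hash(ByteBuffer data) {
        MessageDigest md = SHA1.get();  // pick it up from the per-thread store
        md.reset();                     // drop any stale state before use
        md.update(data);
        return md.digest();             // digest() resets it, ready for reuse
    }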

Author

If you look here: https://github.com/mpetazzoni/ttorrent/pull/35/files#diff-454b627720e29935f25dab9a660670b1L166 you can see a big memory allocation. I don't know how big it is, whether 64K or 512K (I implemented this 3.5 years ago), but I'm fairly certain that one torrent piece can be pretty big, and it is much simpler to pass the ByteBuffer directly to the MessageDigest class than to allocate a new byte array, copy everything into it, and then calculate the hash.

Sorry for not being clear; English is not my native language :(

@zanella Nov 16, 2016

Hey, I think I can sum up the byte duplication:

  • each piece hash is duplicated from piecesHashes -> this one you got rid of by reading piecesHashes at the right offset;
  • each file piece is read into a ByteBuffer and then copied into a byte array -> this copy you got rid of by passing the ByteBuffer directly to the MessageDigest method;

I think what remains is limiting the number of existing ByteBuffers from file reading, like I tried in #195. I'll try to limit it and then profile it; with this done I think this patch is complete.
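
A rough sketch of what limiting the buffers could look like (purely illustrative, not the code from #195): a fixed-size pool that file-reading code borrows from and returns to, so the number of live direct buffers stays bounded:

    import java.nio.ByteBuffer;
    import java.util.concurrent.ArrayBlockingQueue;

    final class BufferPool {
        private final ArrayBlockingQueue<ByteBuffer> free;

        BufferPool(int buffers, int capacity) {
            this.free = new ArrayBlockingQueue<>(buffers);
            for (int i = 0; i < buffers; i++) {
                this.free.add(ByteBuffer.allocateDirect(capacity));
            }
        }

        ByteBuffer acquire() throws InterruptedException {
            ByteBuffer buf = this.free.take();  // blocks while all buffers are in use
            buf.clear();
            return buf;
        }

        void release(ByteBuffer buf) {
            this.free.offer(buf);  // hand ownership back to the pool
        }
    }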

Author

Yes, that happened :)


    int torrentHashPosition = getIndex() * Torrent.PIECE_HASH_SIZE;
    for (int i = 0; i < Torrent.PIECE_HASH_SIZE; i++) {
        byte value = this.torrentHash.get(torrentHashPosition + i);

Is fetching byte by byte from a direct buffer faster than just fetching the whole thing at once?
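
What the reviewer appears to be suggesting, as a sketch: a single bulk get() into a small array instead of PIECE_HASH_SIZE one-byte reads:

    byte[] expected = new byte[Torrent.PIECE_HASH_SIZE];
    ByteBuffer view = this.torrentHash.duplicate();  // leaves the shared buffer's position untouched
    view.position(getIndex() * Torrent.PIECE_HASH_SIZE);
    view.get(expected);  // one bulk read instead of a per-byte loop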
