summaryrefslogtreecommitdiff
path: root/doc/todo
diff options
context:
space:
mode:
authorGravatar Joey Hess <joeyh@joeyh.name>2015-08-05 09:45:26 -0400
committerGravatar Joey Hess <joeyh@joeyh.name>2015-08-05 09:45:26 -0400
commitc5c047822beb3c84b9ab881b1f634aeb0159d938 (patch)
tree87ffde02076a56c946d025dfd49c94886eb1ed9e /doc/todo
parent52077eaa9a53d985b09e64f5f1b3f080af4e08a1 (diff)
parent5b1fd6c805848e1abe73dd8264836d39ac614a39 (diff)
Merge branch 'master' of ssh://git-annex.branchable.com
Diffstat (limited to 'doc/todo')
-rw-r--r--doc/todo/speed_up_transfers_over_ssh+rsync_--_directly_reuse_the_same_connection__63__/comment_4_2669ce8914d75eda225607d4d0b45ab0._comment15
1 files changed, 15 insertions, 0 deletions
diff --git a/doc/todo/speed_up_transfers_over_ssh+rsync_--_directly_reuse_the_same_connection__63__/comment_4_2669ce8914d75eda225607d4d0b45ab0._comment b/doc/todo/speed_up_transfers_over_ssh+rsync_--_directly_reuse_the_same_connection__63__/comment_4_2669ce8914d75eda225607d4d0b45ab0._comment
new file mode 100644
index 000000000..e6c16db6d
--- /dev/null
+++ b/doc/todo/speed_up_transfers_over_ssh+rsync_--_directly_reuse_the_same_connection__63__/comment_4_2669ce8914d75eda225607d4d0b45ab0._comment
@@ -0,0 +1,15 @@
+[[!comment format=mdwn
+ username="https://me.yahoo.com/a/EbvxpTI_xP9Aod7Mg4cwGhgjrCrdM5s-#7c0f4"
+ subject="my bad"
+ date="2015-08-05T02:33:46Z"
+ content="""
+indeed my use of ts timestamp was somewhat incorrect ;), I have used ts \"%b %d %H:%M:%S.%s\"
+
+\" This is quite likely simply the overhead of git-annex needing to query git to work out what remote each file is located on\" -- unlikely since CPU utilization is close to none.
+
+\"coupled with the overhead of needing to start a new git-annex-shell and rsync processes\" -- that is the likely major part of the overhead here
+
+-J is indeed of notable help BUT overall disproportionate to mitigate the overhead at large. I seem can successfully raise it to -J 4. With -J 6 I already start getting \"E: channel 22: open failed: administratively prohibited: open failed\" from time to time (not sure if it is benign or results in that particular transfer not succeeding). Not quite sure on the exact reason for it, i.e. why server side refuses to open a new channel -- I guess because of the same issue of too quickly following requests for new connections (i.e. \"the overhead\").
+
+Why transfer requests could not be queued up and batched for execution within the same ssh invocation? Keys are unique, so could be received within the same tmp/ in a batch transfer and then sorted into their corresponding hash subdirs. Bundled with -J to possibly even split among multiple source remotes from which different content could be available from it could lead to greatly improved transfer rates. Or would it be a major undertaking?
+"""]]