summaryrefslogtreecommitdiff
path: root/doc/bugs/copy_doesn__39__t_scale
diff options
context:
space:
mode:
authorGravatar http://joey.kitenet.net/ <joey@web>2012-01-28 19:32:36 +0000
committerGravatar admin <admin@branchable.com>2012-01-28 19:32:36 +0000
commit2a6be431d4cc4d8056f45f32e91b1b72a65fb107 (patch)
treead0b0bdfd529dae41a9d01baa3d3d5cdfda2e6f5 /doc/bugs/copy_doesn__39__t_scale
parent141718da73173855ce08662b1131344e6c10ef46 (diff)
Added a comment
Diffstat (limited to 'doc/bugs/copy_doesn__39__t_scale')
-rw-r--r--doc/bugs/copy_doesn__39__t_scale/comment_2_f85d8023cdbc203bb439644cf7245d4e._comment15
1 files changed, 15 insertions, 0 deletions
diff --git a/doc/bugs/copy_doesn__39__t_scale/comment_2_f85d8023cdbc203bb439644cf7245d4e._comment b/doc/bugs/copy_doesn__39__t_scale/comment_2_f85d8023cdbc203bb439644cf7245d4e._comment
new file mode 100644
index 000000000..9a2bd92fa
--- /dev/null
+++ b/doc/bugs/copy_doesn__39__t_scale/comment_2_f85d8023cdbc203bb439644cf7245d4e._comment
@@ -0,0 +1,15 @@
+[[!comment format=mdwn
+ username="http://joey.kitenet.net/"
+ nickname="joey"
+ subject="comment 2"
+ date="2012-01-28T19:32:36Z"
+ content="""
+Ah, I see, I was not thinking about the location log update that's done on the remote side.
+
+For transfers over ssh, that's a separate git-annex-shell invoked per change. For local-local transfers, it's all done in a single process but it spins up a state to handle the remote and then immediately shuts it down, also generating a commit.
+
+In either case, I think there is a nice fix. Since git-annex *does* have a journal nowadays, and goes to all the bother to
+support recovery if a process was interrupted and journalled changes that did not get committed, there's really no reason in either of these cases for the remote end to do anything more than journal the change. The next time git-annex is actually run on the remote, and needs to look up location information, it will merge the journalled changes into the branch, in a single commit.
+
+My only real concern is that some remotes might *never* have git-annex run in them directly, and would just continue to accumulate journal files forever. Although due to the way the journal is structured, it can have, at a maximum, the number of files in the git-annex branch. However, the number of files in it is expected to be relatively smal and it might get a trifle innefficient, as it lacks directory hashing. These performance problems could certainly be dealt with if they do turn out to be a problem.
+"""]]