author | http://joey.kitenet.net/ <joey@web> | 2011-12-22 16:39:24 +0000 |
---|---|---|
committer | admin <admin@branchable.com> | 2011-12-22 16:39:24 +0000 |
commit | 30cf6ce81ca8ff99f5284c5b991e546eb1da72ae (patch) | |
tree | 482cdd9c54a92ec575c82d9d26d52cf51c35107e /doc/todo | |
parent | 6808b08c1ab096eb8c7c986379ec7a24d70434e3 (diff) |
Added a comment
Diffstat (limited to 'doc/todo')
-rw-r--r-- | doc/todo/wishlist:_Provide_a___34__git_annex__34___command_that_will_skip_duplicates/comment_6_f24541ada1c86d755acba7e9fa7cff24._comment | 14 |
1 files changed, 14 insertions, 0 deletions
diff --git a/doc/todo/wishlist:_Provide_a___34__git_annex__34___command_that_will_skip_duplicates/comment_6_f24541ada1c86d755acba7e9fa7cff24._comment b/doc/todo/wishlist:_Provide_a___34__git_annex__34___command_that_will_skip_duplicates/comment_6_f24541ada1c86d755acba7e9fa7cff24._comment
new file mode 100644
index 000000000..93d3d41f4
--- /dev/null
+++ b/doc/todo/wishlist:_Provide_a___34__git_annex__34___command_that_will_skip_duplicates/comment_6_f24541ada1c86d755acba7e9fa7cff24._comment
@@ -0,0 +1,14 @@
+[[!comment format=mdwn
+ username="http://joey.kitenet.net/"
+ nickname="joey"
+ subject="comment 6"
+ date="2011-12-22T16:39:24Z"
+ content="""
+My main concern with putting this in git-annex is that finding duplicates necessarily involves storing a list of every key and file in the repository, and git-annex is very carefully built to avoid things that require non-constant memory use, so that it can scale to very big repositories. (The only exception is the `unused` command, and reducing its memory usage is a continuing goal.)
+
+So I would rather come at this from a different angle.. like providing a way to output a list of files and their associated keys, which the user can then use in their own shell pipelines to find duplicate keys:
+
+    git annex find --include '*' --format=\"%f %k\n\" | sort foo --key 2 | uniq --all-repeated --skip-fields=1
+
+(Making that properly handle filenames with spaces is left as an exercise for the reader..)
+"""]]
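
The comment above leaves handling filenames with spaces as an exercise. One possible approach, sketched here under assumptions that are not part of the commit, is to emit the key first and the filename second, then group by key outside git-annex. The function name `find_duplicates` and the input layout (one `<key> <filename>` line per file, where the key contains no spaces and the filename contains no newlines) are hypothetical; no particular `git annex` output format is assumed to produce it.

```python
import sys
from collections import defaultdict


def find_duplicates(lines):
    """Return {key: [filenames]} for every key that has more than one file.

    Assumed input (hypothetical): one "<key> <filename>" line per file,
    with the key first so the filename may contain spaces.
    """
    files_by_key = defaultdict(list)
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # ignore blank lines
        # Split on the first space only; everything after it is the filename.
        key, sep, filename = line.partition(" ")
        if not sep:
            continue  # no separator found: skip the malformed line
        files_by_key[key].append(filename)
    # Keep only the keys shared by at least two files.
    return {key: names for key, names in files_by_key.items() if len(names) > 1}


if __name__ == "__main__":
    # Read the listing from standard input and print each duplicated key
    # followed by its files, indented.
    for key, names in sorted(find_duplicates(sys.stdin).items()):
        print(key)
        for name in names:
            print("    " + name)
```

This holds the whole listing in memory, which is the cost the comment wants to keep out of git-annex itself; running it as a separate filter at the end of a pipeline leaves that trade-off with the user.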