diff options
author | Xyem <Xyem@web> | 2014-07-29 15:38:40 +0000 |
---|---|---|
committer | admin <admin@branchable.com> | 2014-07-29 15:38:40 +0000 |
commit | 732368dc8dc3b5656c0be638e4187de132183247 (patch) | |
tree | 2f7c8c78f3394e213ce5cf9122bea50b050147cc | |
parent | 7255eae45de2fc6904a3d4e7e83a2eb35c9b226a (diff) |
-rw-r--r-- | doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn | 7 |
1 files changed, 7 insertions, 0 deletions
diff --git a/doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn b/doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn new file mode 100644 index 000000000..34c21ab01 --- /dev/null +++ b/doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn @@ -0,0 +1,7 @@ +I'm currently in the process of gutting old (some broken) git-annex's and cleaning out download directories from before I started using git-annex. + +To do this, I am running `git annex import --clean--duplicates $PATH` on the directories I want to clear out but sometimes, this takes a unnecessarily long time. + +For example, git-annex will calculate the digest for a huge file (30GB+) in $TARGET, even though there are no files in the annex of that size. + +It's a common shortcut to check for duplicate sizes first to eliminate definite non-matches really quickly. Can this be added to git-annex's `import` in some way or is this a no-go due to the constant memory constraint? |