author Xyem <Xyem@web> 2014-07-29 15:38:40 +0000
committer admin <admin@branchable.com> 2014-07-29 15:38:40 +0000
commit 732368dc8dc3b5656c0be638e4187de132183247
tree 2f7c8c78f3394e213ce5cf9122bea50b050147cc /doc/todo
parent 7255eae45de2fc6904a3d4e7e83a2eb35c9b226a
Diffstat (limited to 'doc/todo')
-rw-r--r-- doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn 7
1 files changed, 7 insertions, 0 deletions
diff --git a/doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn b/doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn
new file mode 100644
index 000000000..34c21ab01
--- /dev/null
+++ b/doc/todo/Speed_up___39__import_--clean-duplicates__39__.mdwn
@@ -0,0 +1,7 @@
+I'm currently gutting old (some broken) git-annex repositories and cleaning out download directories from before I started using git-annex.
+
+To do this, I am running `git annex import --clean-duplicates $TARGET` on the directories I want to clear out, but this sometimes takes an unnecessarily long time.
+
+For example, git-annex will calculate the digest for a huge file (30GB+) in $TARGET, even though there are no files in the annex of that size.
+
+It's a common shortcut to compare file sizes first, which eliminates definite non-matches very quickly. Can this be added to git-annex's `import` in some way, or is this a no-go due to the constant-memory constraint?
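
The size-first shortcut described in the todo can be sketched roughly as follows. This is an illustration in Python, not git-annex's implementation: `file_digest` and `find_duplicates` are hypothetical helper names, SHA-256 is assumed as the digest, and the "known" files are plain paths rather than keys looked up in an annex.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Hash a file in fixed-size chunks so memory use stays constant."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(candidates, known_files):
    """Return the candidate paths whose content matches a known file.

    Both arguments are iterables of paths (hypothetical inputs; this
    does not consult git-annex's real key database).
    """
    # Index known files by size; a stat() is cheap compared with hashing.
    known_by_size = defaultdict(list)
    for path in known_files:
        known_by_size[os.path.getsize(path)].append(path)

    known_digests = {}  # size -> set of digests, filled lazily
    duplicates = []
    for path in candidates:
        size = os.path.getsize(path)
        if size not in known_by_size:
            continue  # no known file has this size, so skip hashing
        if size not in known_digests:
            known_digests[size] = {file_digest(p) for p in known_by_size[size]}
        if file_digest(path) in known_digests[size]:
            duplicates.append(path)
    return duplicates
```

With this shape, a 30GB+ file whose size matches nothing already known is never read at all. The trade-off is the size index itself, which grows with the number of known files; whether that is compatible with the constant-memory constraint mentioned above is the open question.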