From 81f71e57b9ac68b8d79c30fb27a22c5e3941fcee Mon Sep 17 00:00:00 2001 From: Joey Hess Date: Mon, 25 Oct 2010 11:47:45 -0400 Subject: reorg --- doc/todo/add_a_git_backend.mdwn | 6 ++++++ doc/todo/backendSHA1.mdwn | 1 + doc/todo/branching.mdwn | 36 +++++++++++++++++++++++++++++++ doc/todo/done.mdwn | 4 ++++ doc/todo/file_copy_progress_bar.mdwn | 3 +++ doc/todo/fsck.mdwn | 1 + doc/todo/gitrm.mdwn | 2 ++ doc/todo/network_remotes.mdwn | 5 +++++ doc/todo/parallel_possibilities.mdwn | 13 +++++++++++ doc/todo/pushpull.mdwn | 2 ++ doc/todo/rsync.mdwn | 2 ++ doc/todo/symlink_farming_commit_hook.mdwn | 12 +++++++++++ doc/todo/using_url_backend.mdwn | 11 ++++++++++ 13 files changed, 98 insertions(+) create mode 100644 doc/todo/add_a_git_backend.mdwn create mode 100644 doc/todo/backendSHA1.mdwn create mode 100644 doc/todo/branching.mdwn create mode 100644 doc/todo/done.mdwn create mode 100644 doc/todo/file_copy_progress_bar.mdwn create mode 100644 doc/todo/fsck.mdwn create mode 100644 doc/todo/gitrm.mdwn create mode 100644 doc/todo/network_remotes.mdwn create mode 100644 doc/todo/parallel_possibilities.mdwn create mode 100644 doc/todo/pushpull.mdwn create mode 100644 doc/todo/rsync.mdwn create mode 100644 doc/todo/symlink_farming_commit_hook.mdwn create mode 100644 doc/todo/using_url_backend.mdwn (limited to 'doc/todo') diff --git a/doc/todo/add_a_git_backend.mdwn b/doc/todo/add_a_git_backend.mdwn new file mode 100644 index 000000000..91a5001cc --- /dev/null +++ b/doc/todo/add_a_git_backend.mdwn @@ -0,0 +1,6 @@ +There should be a backend where the file content is stored.. in a git +repository! + +This way, you know your annexed content is safe & versioned, but you only +have to deal with the pain of git with large files in one place, and can +use all of git-annex's features everywhere else. diff --git a/doc/todo/backendSHA1.mdwn b/doc/todo/backendSHA1.mdwn new file mode 100644 index 000000000..40ff868c2 --- /dev/null +++ b/doc/todo/backendSHA1.mdwn @@ -0,0 +1 @@ +This backend is not finished. diff --git a/doc/todo/branching.mdwn b/doc/todo/branching.mdwn new file mode 100644 index 000000000..21996ecc0 --- /dev/null +++ b/doc/todo/branching.mdwn @@ -0,0 +1,36 @@ +The use of `.git-annex` to store logs means that if a repo has branches +and the user switched between them, git-annex will see different logs in +the different branches, and so may miss info about what remotes have which +files (though it can re-learn). + +An alternative would be to store the log data directly in the git repo +as `pristine-tar` does. Problem with that approach is that git won't merge +conflicting changes to log files if they are not in the currently checked +out branch. + +It would be possible to use a branch with a tree like this, to avoid +conflicts: + +key/uuid/time/status + +As long as new files are only added, and old timestamped files deleted, +there would be no conflicts. + +A related problem though is the size of the tree objects git needs to +commit. Having the logs in a separate branch doesn't help with that. +As more keys are added, the tree object size will increase, and git will +take longer and longer to commit, and use more space. One way to deal with +this is simply by splitting the logs amoung subdirectories. Git then can +reuse trees for most directories. (Check: Does it still have to build +dup trees in memory?) + +Another approach would be to have git-annex *delete* old logs. Keep logs +for the currently available files, or something like that. If other log +info is needed, look back through history to find the first occurance of a +log. Maybe even look at other branches -- so if the logs were on master, +a new empty branch could be made and git-annex would still know where to +get keys in that branch. + +Would have to be careful about conflicts when deleting and bringing back +files with the same name. And would need to avoid expensive searching thru +all history to try to find an old log file. diff --git a/doc/todo/done.mdwn b/doc/todo/done.mdwn new file mode 100644 index 000000000..e7c98081b --- /dev/null +++ b/doc/todo/done.mdwn @@ -0,0 +1,4 @@ +recently fixed [[todo]] items. + +[[!inline pages="./* and link(./done) and !*/Discussion" sort=mtime show=10 +archive=yes]] diff --git a/doc/todo/file_copy_progress_bar.mdwn b/doc/todo/file_copy_progress_bar.mdwn new file mode 100644 index 000000000..cd4ea33b7 --- /dev/null +++ b/doc/todo/file_copy_progress_bar.mdwn @@ -0,0 +1,3 @@ +Find a way to copy a file with a progress bar, while still preserving +stat. Easiest way might be to use pv and fix up the permissions etc +after? diff --git a/doc/todo/fsck.mdwn b/doc/todo/fsck.mdwn new file mode 100644 index 000000000..308a1cb63 --- /dev/null +++ b/doc/todo/fsck.mdwn @@ -0,0 +1 @@ +add a git annex fsck that finds keys that have no referring file diff --git a/doc/todo/gitrm.mdwn b/doc/todo/gitrm.mdwn new file mode 100644 index 000000000..d771aa32a --- /dev/null +++ b/doc/todo/gitrm.mdwn @@ -0,0 +1,2 @@ +how to handle git rm file? (should try to drop keys that have no +referring file, if it seems safe..) diff --git a/doc/todo/network_remotes.mdwn b/doc/todo/network_remotes.mdwn new file mode 100644 index 000000000..42efa832f --- /dev/null +++ b/doc/todo/network_remotes.mdwn @@ -0,0 +1,5 @@ +Support for remote git repositories (ssh:// specifically can be made to +work, although the other end probably needs to have git-annex +installed..) + +[[done]], at least get and put work.. diff --git a/doc/todo/parallel_possibilities.mdwn b/doc/todo/parallel_possibilities.mdwn new file mode 100644 index 000000000..178f95021 --- /dev/null +++ b/doc/todo/parallel_possibilities.mdwn @@ -0,0 +1,13 @@ +One of my reasons for using haskell was that it provides the possibility of +some parallell processing. Although since git-annex hits the filesystem +heavily and mostly runs other git commands, maybe not a whole lot. + +Anyway, each git-annex command is broken down into a series of independant +actions, which has some potential for parallelism. + +Probably they would need to be split further. Each action currently has 3 +distinct phases, basically "check", "do", and "record". If the check action +returned a do action that returned a record action, then it could easily +make sense to parallelize the check actions and start on the do actions +(which probably won't parallelize well) while they are still being +generated, and possibly parallelize the record actions at the end. diff --git a/doc/todo/pushpull.mdwn b/doc/todo/pushpull.mdwn new file mode 100644 index 000000000..47da2107f --- /dev/null +++ b/doc/todo/pushpull.mdwn @@ -0,0 +1,2 @@ +--push/--pull should take a reponame and files, and push those files + to that repo; dropping them from the current repo diff --git a/doc/todo/rsync.mdwn b/doc/todo/rsync.mdwn new file mode 100644 index 000000000..75e0175c8 --- /dev/null +++ b/doc/todo/rsync.mdwn @@ -0,0 +1,2 @@ +Transferring a file from a ssh:// remote should use rsync to allow resuming +of a prior transfer. diff --git a/doc/todo/symlink_farming_commit_hook.mdwn b/doc/todo/symlink_farming_commit_hook.mdwn new file mode 100644 index 000000000..af03beb70 --- /dev/null +++ b/doc/todo/symlink_farming_commit_hook.mdwn @@ -0,0 +1,12 @@ +TODO: implement below + +git-annex does use a lot of symlinks. Specicially, relative symlinks, +that are checked into git. To allow you to move those around without +annoyance, git-annex can run as a post-commit hook. This way, you can `git mv` +a symlink to an annexed file, and as soon as you commit, it will be fixed +up. + +`git annex init` tries to set up a post-commit hook that is itself a symlink +back to git-annex. If you want to have your own shell script in the post-commit +hook, just make it call `git annex` with no parameters. git-annex will detect +when it's run from a git hook and do the necessary fixups. diff --git a/doc/todo/using_url_backend.mdwn b/doc/todo/using_url_backend.mdwn new file mode 100644 index 000000000..1f3cd5628 --- /dev/null +++ b/doc/todo/using_url_backend.mdwn @@ -0,0 +1,11 @@ +There is no way to `git annex add` a file using the URL [[backend|backends]]. + +For now, we have to manually make the symlink. Something like this: + + ln -s .git/annex/URL:http:%%www.example.com%foo.tar.gz + +Note the escaping of slashes. + +A `git annex register ` command could do this.. + +[[done]] -- cgit v1.2.3