git-annex-gpl - git-annex without the AGPL

	Commit message (Collapse)	Author	Age
*	sync: Show a nicer message if a user tries to sync to a special remote.	Joey Hess	2012-05-27
\|
*	Clean up handling of git directory and git worktree.	Joey Hess	2012-05-18
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Baked into the code was an assumption that a repository's git directory could be determined by adding ".git" to its work tree (or nothing for bare repos). That fails when core.worktree, or GIT_DIR and GIT_WORK_TREE are used to separate the two. This was attacked at the type level, by storing the gitdir and worktree separately, so Nothing for the worktree means a bare repo. A complication arose because we don't learn where a repository is bare until its configuration is read. So another Location type handles repositories that have not had their config read yet. I am not entirely happy with this being a Location type, rather than representing them entirely separate from the Git type. The new code is not worse than the old, but better types could enforce more safety. Added support for core.worktree. Overriding it with -c isn't supported because it's not really clear what to do if a git repo's config is read, is not bare, and is then overridden to bare. What is the right git directory in this case? I will worry about this if/when someone has a use case for overriding core.worktree with -c. (See Git.Config.updateLocation) Also removed and renamed some functions like gitDir and workTree that misused git's terminology. One minor regression is known: git annex add in a bare repository does not print a nice error message, but runs git ls-files in a way that fails earlier with a less nice error message. This is because before --work-tree was always passed to git commands, even in a bare repo, while now it's not.
*	Fix use of several config settings	Joey Hess	2012-05-05
\| \| \| \| \| \| \|	annex.ssh-options, annex.rsync-options, annex.bup-split-options. And adjust types to avoid the bugs that broke several config settings recently. Now "annex." prefixing is enforced at the type level.
*	addunused: New command, the opposite of dropunused, it relinks unused ↵	Joey Hess	2012-05-02
\| \| \| \|	content into the git repository.
*	dropunused: Allow specifying ranges to drop.	Joey Hess	2012-05-02
\| \| \| \| \|	Sort of by popular demand, but the last straw for not using seq was that it can run into command line length limits.
*	percentage library	Joey Hess	2012-04-29
\|
*	show percent the bloom filter is full	Joey Hess	2012-04-29
\|
*	show amount of reserved space	Joey Hess	2012-04-23
\|
*	Add annex.httpheaders and annex.httpheader-command config settings	Joey Hess	2012-04-22
\| \| \| \| \| \|	Allow custom headers to be sent with all HTTP requests. (Requested by the Internet Archive)
*	noop	Joey Hess	2012-04-21
\|
*	better file mode setting code	Joey Hess	2012-04-21
\|
*	Support git's core.sharedRepository configuration	Joey Hess	2012-04-21
\| \| \| \| \| \|	This is incomplete, it does not honor it yet for hash directories and other annex bookkeeping files. Some of that is not needed for a bare repo; some of it may be.
*	export a more generalized checkDiskSpace	Joey Hess	2012-04-20
\|
*	use unabbreviated size units in status	Joey Hess	2012-04-06
\|
*	Rewrote free disk space checking code	Joey Hess	2012-03-22
\| \| \| \| \|	Moving the portability handling into a small C library cleans up things a lot, avoiding the pain of unpacking structs from inside haskell code.
*	use new getConfig	Joey Hess	2012-03-22
\|
*	rationalize getConfig	Joey Hess	2012-03-22
\| \| \| \| \| \| \| \| \| \|	getConfig got a remote-specific config, and this confusing name caused it to be used a couple of places that only were interested in global configs. Rename to getRemoteConfig and make getConfig only get global configs. There are no behavior changes here, but remote.<name>.annex-web-options never actually worked (and per-remote web options is a very unlikely to be useful case so I didn't make it work), so fix the documentation for it.
*	tweak	Joey Hess	2012-03-22
\|
*	status: Prints available local disk space, or shows if git-annex doesn't know.	Joey Hess	2012-03-21
\|
*	fun with symbols	Joey Hess	2012-03-17
\| \| \| \| \| \|	Nothing at all on hackage is using <&&> or <\|\|>. (Also, <&&> should short-circuit on failure.)
*	optimize monadic \|\|	Joey Hess	2012-03-16
\| \| \| \| \|	(\|\|) used applicative style runs both conditions rather than short circuiting. Add an orM that properly short-circuits.
*	added ifM and nuked 11 lines of code	Joey Hess	2012-03-14
\| \| \| \|	no behavior changes
*	Merge branch 'master' into bloom	Joey Hess	2012-03-14
\|\ \| \| \| \| \| \| \| \| \| \|	Conflicts: Command/Commit.hs debian/changelog
\| *	ignore hook exit status	Joey Hess	2012-03-14
\| \|
\| *	git-annex-shell: Runs hooks/annex-content after content is received or dropped.	Joey Hess	2012-03-14
\| \|
* \|	git-annex-shell: Runs hooks/annex-content after content is received or dropped.	Joey Hess	2012-03-14
\| \|
* \|	Merge branch 'master' into bloom	Joey Hess	2012-03-12
\|\\| \| \| \| \| \| \| \| \|	Conflicts: debian/changelog
* \|	finish bloom filters	Joey Hess	2012-03-12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add tuning, docs, etc. Not sure if status is the right place to remote size.. perhaps unused should report the size and also warn if it sees more keys than the bloom filter allows?
* \|	added second stage bloom filter	Joey Hess	2012-03-12
\| \|
* \|	fixed bloom filter creation space leak	Joey Hess	2012-03-12
\| \| \| \| \| \| \| \|	it works!
* \|	try at using bloom filters	Joey Hess	2012-03-12
\| \| \| \| \| \| \| \|	leaks memory
\| *	status: More accurate display of sizes of tmp and bad keys.	Joey Hess	2012-03-12
\|/ \| \| \| \| \| \| \| \|	Can't trust the key size to be accurate for tmp and bad keys, so check actual file size. In the wild I saw the old code be wrong by a factor of about 100! If all tmp/bad keys are empty, they're not shown in status at all. Showing 0 bytes and suggesting to clean it up seemed weird..
*	prettify	Joey Hess	2012-03-11
\|
*	avoid needing to keep list of present keys	Joey Hess	2012-03-11
\| \| \| \| \| \|	Stale and bad files are rare, so it's more efficient to use inAnnex to see if they can be deleted, rather than keeping the list of all present keys around for them.
*	status: Fixed to run in nearly constant space.	Joey Hess	2012-03-11
\| \| \| \| \| \| \| \|	Before, it leaked space due to caching lists of keys. Now all necessary data about keys is calculated as they stream in. The "nearly constant" is due to getKeysPresent, which builds up a lot of [] thunks as it traverses .git/annex/objects/. Will deal with it later.
*	unused: Reduce memory usage significantly.	Joey Hess	2012-03-11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Much of the memory bloat turned out to be due to getKeysReferenced containing a mapM, which is strict and buffered the whole list rather than streaming it. The other half of the bloat was due to building a temporary Set in order to call S.difference. While that is more cpu efficient, I switched to successive S.delete, since with it, I can run a whole git annex unused in less than 8 mb of memory. The whole Set of keys with content available is still stored in memory, so running unused in a repo with a whole lot of file content will still use more memory. In a repo containing 6000 files, it needed 40 mb. Note that the status command still uses the bloatful getKeysReferenced.
*	sync: Sync to lower cost remotes first.	Joey Hess	2012-03-10
\| \| \| \| \| \| \| \| \|	This has two benefits. 1. When a lot of refs are going to be received, get them via lower cost connection when possible. 2. Allows ctrl-c of sync after the cheaper remotes have been pulled from (or pushed to).
*	fsck: Fix up any broken links and misplaced content caused by the directory ↵	Joey Hess	2012-03-10
\| \| \| \|	hash calculation bug fixed in the last release.
*	cleanup	Joey Hess	2012-03-06
\|
*	"here" can be used to refer to the current repository, which can read better ↵	Joey Hess	2012-03-01
\| \| \| \|	than the old "." (which still works too).
*	move --from, copy --from: 10 times faster scanning remote on local disk	Joey Hess	2012-02-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rather than go through the location log to see which files are present on the remote, it simply looks at the disk contents directly. I benchmarked this speeding up scanning 834 files, from an annex on my phone's SSD, from 11.39 seconds to 1.31 seconds. (No files actually moved.) Also benchmarked 8139 files, from an annex on spinning storage, speeding up from 103.17 to 13.39 seconds. Note that benchmarking with an encrypted annex on flash actually showed a minor slowdown with this optimisation -- from 13.93 to 14.50 seconds. Seems the overhead of doing the crypto needed to get the filenames to directly check can be higher than the overhead of looking up data in the location log. (Which says good things about how well the location log and git have been optimised!) It may make sense to make encrypted local remotes not have hasKeyCheap set; further benchmarking is called for.
*	add git-annex-shell commit	Joey Hess	2012-02-25
\| \| \| \| \| \| \| \| \|	Eventually, git-annex might try running this after making changes to a remote. I have not yet thought of a good way for it to tell which remotes it needs to run it on though. It can't just do it when shutting down a cached ssh connection, because ssh connection caching is optional, and that would not handle local remotes not accessed over ssh either.
*	improve alwayscommit=false mode	Joey Hess	2012-02-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now changes are staged into the branch's index, but not committed, which avoids growing a large journal. And sync and merge always explicitly commit, ensuring that even when they do nothing else, they commit the staged changes. Added a flag file to indicate that the branch's journal contains uncommitted changes. (Could use git ls-files, but don't want to run that every time.) In the future, this ability to have uncommitted changes staged in the journal might be used on remotes after a series of oneshot commands.
*	more robustness fixes	Joey Hess	2012-02-18
\|
*	don't fail with --pathdepth when file already exists	Joey Hess	2012-02-18
\|
*	don't error out entirely if an url cannot be downloaded	Joey Hess	2012-02-18
\|
*	variable name	Joey Hess	2012-02-17
\|
*	reorg	Joey Hess	2012-02-17
\|
*	reorder for clarity	Joey Hess	2012-02-16
\|
*	make Migrate use ReKey rather than the other way around	Joey Hess	2012-02-16
\| \| \| \|	as ReKey is plumbing, this makes sense