git-annex-gpl - git-annex without the AGPL

	Commit message	Author	Age
*	unused: Fix to actually detect unused keys when in direct mode.	Joey Hess	2014-02-20
*	add tip about metadata driven views (and more flexible view filtering)	Joey Hess	2014-02-19
	While writing this documentation, I realized that there needed to be a way to stay in a view like tag=* while adding a filter like tag=work that applies to the same field.
	So, there are really two ways a view can be refined. It can have a new "field=explicitvalue" filter added to it, which does not change the "shape" of the view, but narrows the files it shows. Or, it can have a new view added, which adds another level of subdirectories.
	So, added a vfilter command, which takes explicit values to add to the filter, and rejects changes that would change the shape of the view. And, made vadd only accept changes that change the shape of the view.
	And, changed the View data type slightly; now components that can match multiple metadata values can be visible, or not visible.
	This commit was sponsored by Stelian Iancu.
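	The two kinds of refinement described above can be sketched as follows. This is only an illustrative Python model, not git-annex's Haskell code: the `Component` class, the function signatures, and the rule that a wildcard pattern changes the shape of a view while an explicit value does not are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    """One field=pattern part of a view (hypothetical model)."""
    field: str
    pattern: str
    visible: bool  # True when the component adds a level of subdirectories


def is_glob(pattern):
    # Assumption: a pattern containing a wildcard can match multiple values.
    return any(ch in pattern for ch in "*?")


def vfilter(view, field, value):
    """Narrow the view with an explicit value; reject shape changes."""
    if is_glob(value):
        raise ValueError("vfilter would change the shape of the view")
    return view + [Component(field, value, visible=False)]


def vadd(view, field, pattern):
    """Add a level of subdirectories; reject changes that only filter."""
    if not is_glob(pattern):
        raise ValueError("vadd would not change the shape of the view")
    return view + [Component(field, pattern, visible=True)]


def depth(view):
    # Number of subdirectory levels the view produces.
    return sum(1 for component in view if component.visible)
```

	In this model, filtering a tag=* view by tag=work keeps the depth at 1, while adding author=* raises it to 2.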
*	pre-commit: Update metadata when committing changes to annexed files within a view.	Joey Hess	2014-02-19
	So the user can now switch to a view and then move files around within it to manage metadata. For example, moving a file into a new directory when in the tags=* view adds a tag to it.
	Implementation is fairly efficient. One diff-index, which is no more expensive than the first stage of a git commit, followed by possibly some cat-file --batch traffic to find the key (when deleting a file).
	Very similar to what's done in direct mode when committing. And like direct mode when updating the WC after a merge, it has to buffer the diff-tree values in order to make 2 passes over them.
	When not in a view, pre-commit now does one extra git symbolic-ref, which is tiny overhead.
	This commit was sponsored by Andrew Eskridge.
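	The idea of deriving metadata from where a file sits in a view can be sketched as below. This is a hypothetical Python model: the one-directory-level-per-field layout and both function names are assumptions for illustration, and git-annex's real view filenames and diff handling are more involved.

```python
from pathlib import PurePosixPath


def metadata_from_path(view_fields, path):
    """Map each directory level of a view path to its field (assumed layout)."""
    dirs = PurePosixPath(path).parts[:-1]
    return dict(zip(view_fields, dirs))


def metadata_changes(view_fields, old_path, new_path):
    """Return (added, removed) sets of (field, value) pairs for one change.

    old_path is None for a newly added file, new_path is None for a deletion.
    """
    old = set(metadata_from_path(view_fields, old_path).items()) if old_path else set()
    new = set(metadata_from_path(view_fields, new_path).items()) if new_path else set()
    return new - old, old - new
```

	With a single `tag` field, a move from `work/photo.jpg` to `home/photo.jpg` yields the pair `("tag", "home")` as added in this model.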
*	decruft	Joey Hess	2014-02-19
*	make view globs case-insensitive, memoized, and bring back TDFA	Joey Hess	2014-02-19
	I was careful to write the code so it's clear how laziness memoizes it, although it's likely that much less explicit currying would have had the same effect. Verified that the memoization works using a Debug.Trace.
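	Python has no lazy evaluation, but the effect described here (compile the glob once, then reuse the compiled matcher for every value) can be approximated with a closure. This is a minimal sketch, not the git-annex code; `glob_matcher` is a hypothetical name.

```python
import fnmatch
import re


def glob_matcher(pattern):
    """Compile the glob once and return a reusable, case-insensitive matcher."""
    regex = re.compile(fnmatch.translate(pattern), re.IGNORECASE)
    # The returned function closes over the compiled regex, so applying it
    # to many values never recompiles the pattern.
    return lambda value: regex.match(value) is not None


is_work_tag = glob_matcher("Work*")
```

	Here `is_work_tag` plays the role of the partially applied function: the expensive step happens when the pattern is supplied, not on each match.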
*	reject views with too many nested subdirs	Joey Hess	2014-02-19
*	remove Read instance for Ref	Joey Hess	2014-02-19
	Removed instance, got it all to build using fromRef. (With a few things that really need to show something using a ref for debugging stubbed out.)
	Then added back Read instance, and made Logs.View use it for serialization. This changes the view log format.
*	fix view changing when in subdir	Joey Hess	2014-02-18
	Failed reading some files with relative paths. This is a quick and dirty fix.
*	view: preserve toplevel dotfiles	Joey Hess	2014-02-18
*	improve view filenames	Joey Hess	2014-02-18
*	add git annex view command	Joey Hess	2014-02-18
	(And a vpop command, which is still a bit buggy.)
	Still need to do vadd and vrm, though this also adds their documentation.
	Currently not very happy with the view log data serialization. I had to lose the TDFA regexps temporarily, so I can have Read/Show instances of View. I expect the view log format will change in some incompatible way later, probably adding last known refs for the parent branch to View or something like that.
	Anyway, it basically works, although it's a bit slow looking up the metadata. The actual git branch construction is about as fast as it can be using the current git plumbing.
	This commit was sponsored by Peter Hogg.
*	better data types	Joey Hess	2014-02-17
*	split out types	Joey Hess	2014-02-17
*	tricky view refining code that keeps track of whether the view is widening or narrowing	Joey Hess	2014-02-16
*	support globs when built w/o TDFA, just slower	Joey Hess	2014-02-16
*	add another quickcheck property, and several edge cases handled	Joey Hess	2014-02-16
*	simplify type	Joey Hess	2014-02-16
*	filter branches (incomplete)	Joey Hess	2014-02-16
	Promising work toward metadata driven filter branches. A few methods to construct them are stubbed out; all the data types and pure code seem good.
	This commit was sponsored by Walter Somerville.
*	limiting files based on metadata	Joey Hess	2014-02-13
	Note that there is currently no caching, so --metadata foo=bar --metadata tag=blah will currently read the log 2x per file.
*	add metadata command to get/set metadata	Joey Hess	2014-02-12
	Adds metadata log, and command. Note that unsetting field values seems to currently be broken. And in general this has had all of 2 minutes worth of testing.
	This commit was sponsored by Julien Lefrique.
*	random hlint (to give the autobuilder something new to build)	Joey Hess	2014-02-11
*	remove workaround for old bug	Joey Hess	2014-02-06
	560b644a52971a7e4706c775982ec29e03ca3ab2 described a bug moving between remotes in direct mode; I can no longer reproduce it with this strange workaround removed. Also test suite still passes. Hope the broken code just got fixed in the meantime.
*	work around absNormPath not working on Windows	Joey Hess	2014-02-06
	When making git-annex links, we want unix-style paths in the link targets.
*	--in can now refer to files that were located in a repository at some past date. For example, --in="here@{yesterday}"	Joey Hess	2014-02-06
*	fix failing test case on Windows	Joey Hess	2014-02-03
	ensure file being modified is all read before it's opened for write
*	avoid using openFile when withFile can be used	Joey Hess	2014-02-03
	Potentially fixes some FD leak if an action on an opened file handle fails for some reason. There have been some hard to reproduce reports of git-annex leaking FDs, and this may solve them.
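	The same scoping idea exists outside Haskell. As a rough Python analogue (not the git-annex change itself), a `with` block closes the handle even when the action on it raises, which is the property withFile provides; `process_file` is a hypothetical helper name.

```python
def process_file(path, action):
    """Run action on an open handle; the handle is closed even if action raises."""
    with open(path) as handle:
        return action(handle)
```

	By contrast, opening the file and closing it by hand after the action leaves the descriptor open whenever the action fails before the close is reached.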
*	factor out utility function	Joey Hess	2014-02-03
*	remove some monkey faces	Joey Hess	2014-02-01
*	Windows: Avoid using unix-compat's rename, which refuses to rename directories.	Joey Hess	2014-01-29
	Opened a bug about this: https://github.com/jystic/unix-compat/issues/10
*	rework annexed object locking in direct mode & support Windows	Joey Hess	2014-01-28
	Seems that locking of annexed objects when they're being dropped was broken in direct mode:
	* When taking the lock before dropping, it created the .git/annex/objects file, as an empty file. It seems that the dropping code deleted that, but that is not right, and for all I know could in some situation cause a corrupted object to leak out.
	* When the lock was checked, it actually tried to open each direct mode file, and checked if it was locked. Not the same lock used above, and could also fail if some consumer of the file locked it.
	Fixed this, and added Windows support by switching direct mode to lock a .lck file.
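	Locking a separate lock file, rather than the object itself, can be sketched as below. This is a Unix-only Python illustration that uses `flock`; it is not the mechanism git-annex uses, and the function names and the way the lock path is chosen are assumptions.

```python
import fcntl
import os


def try_lock(lock_path):
    """Try to take an exclusive lock on a separate lock file.

    Returns an open file descriptor on success, or None if it is already locked.
    """
    fd = os.open(lock_path, os.O_RDWR | os.O_CREAT, 0o644)
    try:
        # Non-blocking: fail immediately instead of waiting for the holder.
        fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        os.close(fd)
        return None
    return fd


def unlock(fd):
    """Release the lock and close the descriptor."""
    fcntl.flock(fd, fcntl.LOCK_UN)
    os.close(fd)
```

	Because the lock lives in its own file, taking it never creates or touches the object file, and a consumer that has the object open cannot interfere with the lock check.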
*	use locking on Windows	Joey Hess	2014-01-28
	This is all the easy cases, where there was already a separate lock file.
*	remove debug print	Joey Hess	2014-01-26
	just saw it legitimately occur when 2 git-annex were running
*	reorg	Joey Hess	2014-01-26
*	added annex.secure-erase-command config option.	Joey Hess	2014-01-24
*	fix transfers of key with no associated file	Joey Hess	2014-01-23
	Several places assumed this would not happen, and when the AssociatedFile was Nothing, did nothing.
	As part of this, preferred content checks pass the Key around.
	Note that checkMatcher is sometimes now called with Just Key and Just File. It currently constructs a FileMatcher, ignoring the Key. However, if it constructed a FileKeyMatcher, which contained both, then it might be possible to speed up parts of Limit, which currently call the somewhat expensive lookupFileKey to get the Key.
	I have not made this optimisation yet, because I am not sure if the key is always the same. Will need some significant checking to satisfy myself that's the case.
*	add "unused" preferred content expression	Joey Hess	2014-01-22
	With a really nice optimisation that keeps it from having any overhead in normal operation!
	This commit was sponsored by Ulises Vitulli.
*	benchmarked numcopies .gitattributes in preferred content	Joey Hess	2014-01-21
	Checking .gitattributes adds a full minute to a git annex find looking for files that don't have enough copies. 2:25 increases to 3:27. I feel this is too much of a slowdown to justify making it the default.
	So, exposed two versions of the preferred content expression, a slow one and a fast but approximate one. I'm using the approximate one in the default preferred content expressions to avoid slowing down the assistant.
*	reorg	Joey Hess	2014-01-21
*	numcopies cleanup, part 2	Joey Hess	2014-01-21
	This includes several bug fixes.
*	reorganize numcopies code (no behavior changes)	Joey Hess	2014-01-21
	Move stuff into Logs.NumCopies. Add a NumCopies newtype. Better names for various serialization classes that are specific to one thing or another.
*	Add and use numcopiesneeded preferred content expression.	Joey Hess	2014-01-20
	* Add numcopiesneeded preferred content expression.
	* Client, transfer, incremental backup, and archive repositories now want to get content that does not yet have enough copies.
	This means the assistant will make copies of files that don't yet meet the configured numcopies, even to places that would not normally want the file.
	For example, if numcopies is 4, and there are 2 client repos and 2 transfer repos, and 2 removable backup drives, the file will be sent to both transfer repos in order to make 4 copies. Once a removable drive gets a copy of the file, it will be dropped from one transfer repo or the other (but not both).
	Another example, numcopies is 3 and there is a client that has a backup removable drive and two small archive repos. Normally once one of the small archives has a file, it will not be put into the other one. But, to satisfy numcopies, the assistant will duplicate it into the other small archive too, if the backup repo is not available to receive the file.
	I notice that these examples are fairly unlikely setups .. the old behavior was not too bad, but it's nice to finally have it really correct.
	.. Almost. I have skipped checking the annex.numcopies .gitattributes out of fear it will be too slow.
	This commit was sponsored by Florian Schlegel.
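	The arithmetic behind the first example can be worked through with a small Python sketch. The three function names are hypothetical and the model ignores everything about preferred content except the copy count; it is meant only to check the numbers in the example.

```python
def copies_needed(numcopies, current_copies):
    """How many more copies are required to satisfy numcopies."""
    return max(0, numcopies - current_copies)


def wants_content(normally_wants, numcopies, current_copies):
    """A repo wants a file if it normally would, or if copies are still needed."""
    return normally_wants or copies_needed(numcopies, current_copies) > 0


def can_drop(numcopies, current_copies):
    """A copy may be dropped only if enough copies remain afterwards."""
    return current_copies - 1 >= numcopies
```

	With numcopies 4 and the file held only by the 2 clients, both transfer repos want it (2 and then 1 copies are needed). After a removable drive takes a copy there are 5, so one transfer repo can drop it; at 4 copies a second drop is refused.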
*	global numcopies setting	Joey Hess	2014-01-20
	* numcopies: New command, sets global numcopies value that is seen by all clones of a repository.
	* The annex.numcopies git config setting is deprecated. Once the numcopies command is used to set the global number of copies, any annex.numcopies git configs will be ignored.
	* assistant: Make the prefs page set the global numcopies.
	This global numcopies setting is needed to let preferred content expressions operate on numcopies.
	It's also convenient, because typically if you want git-annex to preserve N copies of files in a repo, you want it to do that no matter which repo it's running in. Making it global avoids needing to warn the user about gotchas involving inconsistent annex.numcopies settings. (See changes to doc/numcopies.mdwn.)
	Added a new variety of git-annex branch log file, that holds only 1 value. Will probably be useful for other stuff later.
	This commit was sponsored by Nicolas Pouillard.
*	much better command action handling for sync --content	Joey Hess	2014-01-20
*	fix inversion of control in CommandSeek (no behavior changes)	Joey Hess	2014-01-20
	I've been disliking how the command seek actions were written for some time, with their inversion of control and ugly workarounds.
	The last straw to fix it was sync --content, which didn't fit the Annex [CommandStart] interface well at all. I have not yet made it take advantage of the changed interface though.
	The crucial change, and probably why I didn't do it this way from the beginning, is to make each CommandStart action be run with exceptions caught, and if it fails, increment a failure counter in annex state.
	So I finally remove the very first code I wrote for git-annex, which was before I had exception handling in the Annex monad, and so ran outside that monad, passing state explicitly as it ran each CommandStart action.
	This was a real slog from 1 to 5 am.
	Test suite passes.
	Memory usage is lower than before, sometimes by a couple of megabytes, and remains constant, even when running in a large repo, and even when repeatedly failing and incrementing the error counter. So no accidental laziness space leaks.
	Wall clock speed is identical, even in large repos.
	This commit was sponsored by an anonymous bitcoiner.
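	The control flow described as the crucial change (run each action with exceptions caught, and count failures in shared state) can be sketched in Python as follows. `State` and `run_actions` are hypothetical names; the real code lives in the Annex monad and is not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class State:
    """Minimal stand-in for the shared state that holds the failure counter."""
    errors: int = 0


def run_actions(state, actions):
    """Run every action, counting failures instead of aborting on the first one.

    Returns True when no action failed.
    """
    for action in actions:  # actions may be a generator, so memory stays flat
        try:
            action()
        except Exception:
            state.errors += 1
    return state.errors == 0
```

	Because each action runs to completion or failure before the next is fetched, a long stream of actions is processed one at a time rather than collected up front.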
*	sync --content: New option that makes the content of annexed files be transferred. Similar to the assistant, this honors any configured preferred content expressions.	Joey Hess	2014-01-19
	I am not entirely happy with the implementation. It would be nicer if the seek function returned a list of actions which included the individual file gets and copies and drops, rather than the current list of calls to syncContent. This would allow getting rid of the somewhat redundant display of "sync file [ok\|failed]" after the get/put display.
	But, to do that, withFilesInGit would need to somehow be able to construct such a mixed action list. And it would be less efficient than the current implementation, which is able to reuse several values between eg get and drop.
	Note that currently this does not try to satisfy numcopies when getting/putting files (numcopies are of course checked when dropping files!) This makes it like the assistant, and unlike get --auto and copy --auto, which do duplicate files when numcopies is not yet satisfied. I don't know if this is the right decision; it only seemed to make sense to have this parallel the assistant as far as possible to start with, since I know the assistant works.
	This commit was sponsored by Øyvind Andersen Holm.
*	improve matcher data type to allow matching Keys, instead of just files (no behavior changes)	Joey Hess	2014-01-18
*	avoid needing a build-dep on hxt for Data.AssocList	Joey Hess	2014-01-14
*	Fix a long-standing bug that could cause the wrong index file to be used when committing to the git-annex branch, if GIT_INDEX_FILE is set in the environment.	Joey Hess	2014-01-14
	This typically resulted in git-annex branch log files being committed to the master branch and later showing up in the work tree. (These log files can be safely removed.)
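	The commit message does not say how the fix works. One general way to avoid this class of bug is to build the child process environment explicitly, so an inherited GIT_INDEX_FILE can never win. The Python sketch below shows that idea only; the function name and the index path passed to it are assumptions.

```python
import os


def annex_branch_env(annex_index, base_env=None):
    """Return a child-process environment that always uses annex_index.

    Any GIT_INDEX_FILE inherited from the caller is overridden, and the
    caller's environment mapping is left unmodified.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["GIT_INDEX_FILE"] = annex_index
    return env
```

	The returned mapping would be passed as the `env` argument when spawning the git process that commits to the branch.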
*	also check diskreserve for quvi downloads	Joey Hess	2014-01-04
*	addurl, importfeed: Honor annex.diskreserve as long as the size of the url can be checked.	Joey Hess	2014-01-04
	This adds a http HEAD before the download is done. That was already the case when the assistant was running, and it seems worth it to avoid filling up the whole disk, like happened to my server today.
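	The check described here (ask for the size with a HEAD request, then compare it against free space minus the reserve) can be sketched in Python with the standard library. All function names are hypothetical, and treating an unknown size as allowed is an assumption drawn from the phrase "as long as the size of the url can be checked".

```python
import shutil
import urllib.request


def remote_size(url):
    """Return the Content-Length reported by a HEAD request, or None."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request) as response:
        length = response.headers.get("Content-Length")
    return int(length) if length is not None else None


def fits_with_reserve(size, free_bytes, reserve_bytes):
    """True when the download leaves at least reserve_bytes free.

    An unknown size (None) cannot be checked, so it is allowed.
    """
    if size is None:
        return True
    return free_bytes - size >= reserve_bytes


def may_download(url, directory, reserve_bytes):
    """Combine the HEAD request with the free space of the target directory."""
    free = shutil.disk_usage(directory).free
    return fits_with_reserve(remote_size(url), free, reserve_bytes)
```

	Keeping the comparison in `fits_with_reserve` separate from the network call makes the rule easy to check without a server.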