git-annex-gpl - git-annex without the AGPL

	Commit message (Collapse)	Author	Age
*	add webapp UI to manage unused files	Joey Hess	2014-01-23
\|
*	allow annex.expireunused to be set to false, as well as to a duration	Joey Hess	2014-01-22
\|
*	assistant unused file handling	Joey Hess	2014-01-22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Make sanity checker run git annex unused daily, and queue up transfers of unused files to any remotes that will have them. The transfer retrying code works for us here, so eg when a backup disk remote is plugged in, any transfers to it are done. Once the unused files reach a remote, they'll be removed locally as unwanted. If the setup does not cause unused files to go to a remote, they'll pile up, and the sanity checker detects this using some heuristics that are pretty good -- 1000 unused files, or 10% of disk used by unused files, or more disk wasted by unused files than is left free. Once it detects this, it pops up an alert in the webapp, with a button to take action. TODO: Webapp UI to configure this, and also the ability to launch an immediate cleanup of all unused files. This commit was sponsored by Simon Michael.
*	assistant: Run the periodic git gc in batch mode.	Joey Hess	2014-01-22
\|
*	reorganize numcopies code (no behavior changes)	Joey Hess	2014-01-21
\| \| \| \| \| \| \|	Move stuff into Logs.NumCopies. Add a NumCopies newtype. Better names for various serialization classes that are specific to one thing or another.
*	global numcopies setting	Joey Hess	2014-01-20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* numcopies: New command, sets global numcopies value that is seen by all clones of a repository. * The annex.numcopies git config setting is deprecated. Once the numcopies command is used to set the global number of copies, any annex.numcopies git configs will be ignored. * assistant: Make the prefs page set the global numcopies. This global numcopies setting is needed to let preferred content expressions operate on numcopies. It's also convenient, because typically if you want git-annex to preserve N copies of files in a repo, you want it to do that no matter which repo it's running in. Making it global avoids needing to warn the user about gotchas involving inconsistent annex.numcopies settings. (See changes to doc/numcopies.mdwn.) Added a new variety of git-annex branch log file, that holds only 1 value. Will probably be useful for other stuff later. This commit was sponsored by Nicolas Pouillard.
*	much better command action handling for sync --content	Joey Hess	2014-01-20
\|
*	sync --content: New option that makes the content of annexed files be ↵	Joey Hess	2014-01-19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	transferred. Similar to the assistant, this honors any configured preferred content expressions. I am not entirely happpy with the implementation. It would be nicer if the seek function returned a list of actions which included the individual file gets and copies and drops, rather than the current list of calls to syncContent. This would allow getting rid of the somewhat reundant display of "sync file [ok\|failed]" after the get/put display. But, do that, withFilesInGit would need to somehow be able to construct such a mixed action list. And it would be less efficient than the current implementation, which is able to reuse several values between eg get and drop. Note that currently this does not try to satisfy numcopies when getting/putting files (numcopies are of course checked when dropping files!) This makes it like the assistant, and unlike get --auto and copy --auto, which do duplicate files when numcopies is not yet satisfied. I don't know if this is the right decision; it only seemed to make sense to have this parallel the assistant as far as possible to start with, since I know the assistant works. This commit was sponsored by Øyvind Andersen Holm.
*	assistant: Detect if .git/annex/index is corrupt at startup, and recover.	Joey Hess	2014-01-14
\|
*	avoid needing a build-dep on hxt for Data.AssocList	Joey Hess	2014-01-14
\|
*	Fix a long-standing bug that could cause the wrong index file to be used ↵	Joey Hess	2014-01-14
\| \| \| \|	when committing to the git-annex branch, if GIT_INDEX_FILE is set in the environment. This typically resulted in git-annex branch log files being committed to the master branch and later showing up in the work tree. (These log files can be safely removed.)
*	add GETAVAILABILITY to external special remote protocol	Joey Hess	2014-01-13
\| \| \| \| \|	And some reworking of types, and added an annex-availability git config setting.
*	revert use of Data.Map.Strict	Joey Hess	2014-01-07
\| \| \| \| \|	memory profile shows this did not contribute to the memory leaks fixed in 4cf6d95c1a9d10cb59669eaceafce4c7a3155eb6
*	tested transferkeys restarting; fix some bugs	Joey Hess	2014-01-06
\|
*	use strict version of map	Joey Hess	2014-01-06
\|
*	assistant: Start a new git-annex transferkeys process after a network ↵	Joey Hess	2014-01-06
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	connection change So that remotes that use a persistent network connection are restarted. A remote might keep open a long duration network connection, and could fail to deal well with losing the connection. This is particularly a concern now that we have external special reotes. An external special remote that is implemented naively might open the connection only when PREPARE is sent, and if it loses connection, throw errors on each request that is made. (Note that the ssh connection caching should not have this problem; if the long-duration ssh process loses connection, the named pipe is disconnected and the next ssh attempt will reconnect. Also, XMPP already deals with disconnection robustly in its own way.) There's no way for git-annex to know if a lost network connection actually affects a given remote, which might have a transfer in process. It does not make sense to force kill the transferkeys process every time the NetWatcher detects a change. (Especially because the NetWatcher sometimes polls 1 change per hour.) In any case, the NetWatcher only detects connection to a network, not disconnection. So if a transfer is in progress over the network, and the network goes down, that will need to time out on its own. An alternate approch that was considered is to use a separate transferkeys process for each remote, and detect when a request fails, and assume that means that process is in a failing state and restart it. The problem with that approach is that if a resource is not available and a remote fails every time, it degrades to starting a new transferkeys process for every file transfer, which is too expensive. Instead, this commit only handles the network reconnection case, and restarts transferkeys only once the network has reconnected and another transfer needs to be made. So, a transferkeys process will be reused for 1 hour, or until the next network connection. ---- The NotificationBroadcaster was rewritten to use TMVars rather than MSampleVars, to allow checking without blocking if a notification has been received. ---- This commit was sponsored by Tobias Brunner.
*	assistant: Fixed several minor memory leaks that manifested when adding a ↵	Joey Hess	2014-01-05
\| \| \| \|	large number of files.
*	assistant: Ensure that .ssh/config and .ssh/authorized_keys are not group or ↵	Joey Hess	2014-01-03
\| \| \| \|	world writable when writing to those files, as that can make ssh refuse to use them, if it allows another user to write to them.
*	Remotes can now be made read-only, by setting remote.<name>.annex-readonly	Joey Hess	2014-01-02
\|
*	mirror: Support --all (and --unused).	Joey Hess	2014-01-01
\|
*	avoid empty env vars when setting up clean environment	Joey Hess	2013-12-31
\|
*	external special remotes mostly implemented (untested)	Joey Hess	2013-12-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This has not been tested at all. It compiles! The only known missing things are support for encryption, and for get/set of special remote configuration, and of key state. (The latter needs separate work to add a new per-key log file to store that state.) Only thing I don't much like is that initremote needs to be passed both type=external and externaltype=foo. It would be better to have just type=foo Most of this is quite straightforward code, that largely wrote itself given the types. The only tricky parts were: * Need to lock the remote when using it to eg make a request, because in theory git-annex could have multiple threads that each try to use a remote at the same time. I don't think that git-annex ever does that currently, but better safe than sorry. * Rather than starting up every external special remote program when git-annex starts, they are started only on demand, when first used. This will avoid slowdown, especially when running fast git-annex query commands. Once started, they keep running until git-annex stops, currently, which may not be ideal, but it's hard to know a better time to stop them. * Bit of a chicken and egg problem with caching the cost of the remote, because setting annex-cost in the git config needs the remote to already be set up. Managed to finesse that. This commit was sponsored by Lukas Anzinger.
*	assistant: Set StrictHostKeyChecking yes when creating ssh remotes, and add ↵	Joey Hess	2013-12-20
\| \| \| \|	it to the configuration for any ssh remotes previously created by the assistant. This avoids repeated prompts by ssh if the host key changes, instead syncing with such a remote will fail. Closes: #732602
*	Another round of s/amoung/among/	Richard Hartmann	2013-12-19
\|
*	flip for clarity	Joey Hess	2013-12-16
\|
*	assistant: Always batch changes found in startup scan.	Joey Hess	2013-12-16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Batch detection is heuristic, so can sometimes fail. I observed one such failure while starting up in a repository with 87000 files. After the first several batches of ~5000 files, it fell out of batch mode, and never re-entered it, and so made many more commits of a few files at a time than necessary. So, let's always use batch mode when in the startup scan. This avoids the heuristic there, at least. There is clearly also room to improve the heuristic. Possibly 10 files is too high a bar to be found during a commit, on a system that can commit quickly.
*	squash warning in OSX build	Joey Hess	2013-12-15
\|
*	typo	Joey Hess	2013-12-10
\|
*	different PID types for Unix and Windows	Joey Hess	2013-12-10
\| \| \| \| \| \| \| \|	Windows has a larger (unsigned) PID space, so cannot use the unix CInt there. Note that TransferInfo does not yet ever get the TransferPid populated, as there is missing locking.
*	port transferkeys to windows; make stopping in progress transfers work too ↵	Joey Hess	2013-12-10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(probably) transferkeys had used special FDs for communication, but that would be quite annoying to do in Windows. Instead, use stdin and stdout. But, to avoid commands like rsync stomping on them and messing up the communications channel, they're duplicated to a different handle; stdin is replaced with a null handle, and stdout is replaced with a copy of stderr. This should all work in windows too. Stopping in progress transfers may work on windows.. if the types unify anyway. ;) May need some more porting.
*	Improve repair of git-annex index file.	Joey Hess	2013-12-10
\| \| \| \| \| \| \| \| \| \|	Fixes a test case I received where a corrupted repo was repaired, but the git-annex branch was not. The root of the problem was that the MissingObject returned by the repair code was not necessarily a complete set of all objects that might have been deleted during the repair. So, stop trying to return that at all, and instead make the index file checking code explicitly verify that each object the index uses is present.
*	avoid needing --force on windows despite no lsof	Joey Hess	2013-12-09
\| \| \| \| \|	Note that I still need to think this through and make sure handling of open files is safe. This is just for testing purposes.
*	close tmp file handle	Joey Hess	2013-12-07
\| \| \| \|	May fix permission problem on windows
*	avoid trying to use lsof when it's not in path and --forced	Joey Hess	2013-12-04
\|
*	avoid repeatedly searching path to make batch command when running transferkeys	Joey Hess	2013-12-01
\|
*	assistant: Run transferkeys as batch jobs.	Joey Hess	2013-12-01
\|
*	Avoid using git commit in direct mode, since in some situations it will read ↵	Joey Hess	2013-12-01
\| \| \| \| \| \| \| \| \| \| \| \|	the full contents of files in the tree. The assistant's commit code also always avoids git commit, for simplicity. Indirect mode sync still does a git commit -a to catch unstaged changes. Note that this means that direct mode sync no longer runs the pre-commit hook or any other hooks git commit might call. The git annex pre-commit hook action for direct mode is however explicitly run. (The assistant already ran git commit with hooks disabled, so no change there.)
*	merge improved fsck types from git-repair and some associated changes	Joey Hess	2013-11-30
\|
*	tested multi-daemon upgrade	Joey Hess	2013-11-24
\|
*	cleanup on failed upgrade	Joey Hess	2013-11-24
\|
*	show version in upgrade alert	Joey Hess	2013-11-24
\|
*	add support for fully automatic upgrades	Joey Hess	2013-11-24
\| \| \| \| \| \| \| \| \|	The Upgrader avoids checking for upgrades on startup when it was just upgraded. This avoids an upgrade loop if something goes wrong. One example of something going wrong would be if the upgrade info file and the distribution file get out of sync (or the distribution file is cached in a proxy), so it thinks it has upgraded to a new version, but has really not.
*	linux upgrade code debugged and working	Joey Hess	2013-11-24
\|
*	completely untested linux upgrade code	Joey Hess	2013-11-23
\|
*	queue and start download of git-annex from web, using git-annex, when ↵	Joey Hess	2013-11-23
\| \| \| \|	upgrade is started
*	global webapp redirects, to finish upgrades	Joey Hess	2013-11-23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When an automatic upgrade completes, or when the user clicks on the upgrade button in one webapp, but also has it open in another browser window/tab, we have a problem: The current web server is going to stop running in minutes, but there is no way to send a redirect to the web browser to the new url. To solve this, used long polling, so the webapp is always listening for urls it should redirect to. This allows globally redirecting every open webapp. Works great! Tested with 2 web browsers with 2 tabs each. May be useful for other purposes later too, dunno. The overhead is 2 http requests per page load in the webapp. Due to yesod's speed, this does not seem to noticibly delay it. Only 1 of the requests could possibly block the page load, the other is async.
*	better UI flow through upgrade process	Joey Hess	2013-11-23
\| \| \| \| \|	Move button to enable automatic upgrades to an alert displayed after successful upgrade. Unclutters the UI and makes psychological sense.
*	restart on upgrade is working, including automatic restart	Joey Hess	2013-11-23
\| \| \| \| \| \| \| \| \| \|	Made alerts be able to have multiple buttons, so the alerts about upgrading can have a button that enables automatic upgrades. Implemented automatic upgrading when the program file has changed. Note that when an automatic upgrade happens, the webapp displays an alert about it for a few minutes, and then closes. This still needs work.
*	got assistant upgrade detection to notice when I build a new version with ↵	Joey Hess	2013-11-22
\| \| \| \|	cabal build!
*	assistant restart on upgrade	Joey Hess	2013-11-22
\|