git-annex-gpl - git-annex without the AGPL

	Commit message (Collapse)	Author	Age
*	fix chunk=0	Joey Hess	2014-08-01
\| \| \| \|	Found by testremote
*	testremote: New command to test uploads/downloads to a remote.	Joey Hess	2014-08-01
\| \| \| \| \| \| \| \| \|	This only performs some basic tests so far; no testing of chunking or resuming. Also, the existing encryption type of the remote is used; it would be good later to derive an encrypted and a non-encrypted version of the remote and test them both. This commit was sponsored by Joseph Liu.
*	add new section for testing commands	Joey Hess	2014-08-01
\|
*	chunk size advice	Joey Hess	2014-07-30
\|
*	WebDAV: Dropped support for DAV before 0.6.1.	Joey Hess	2014-07-30
\| \| \| \| \|	0.6.1 is in testing, and stable does not have DAV at all, so I can dispense with this compatability code
*	only chunk stable keys	Joey Hess	2014-07-30
\| \| \| \| \| \|	The content of unstable keys can potentially be different in different repos, so eg, resuming a chunked upload started by another repo would corrupt data.
*	update progress after each chunk, at least	Joey Hess	2014-07-29
\| \| \| \| \| \|	This way, when the remote implementation neglects to update progress, there will still be a somewhat useful progress display, as long as chunks are used.
*	fix cleanup of FileContents once done when them when retrieving	Joey Hess	2014-07-29
\|
*	optimise case of remote that retrieves FileContent, when chunks and ↵	Joey Hess	2014-07-29
\| \| \| \| \| \| \| \| \| \|	encryption are not being used No need to read whole FileContent only to write it back out to a file in this case. Can just rename! Yay. Also indidentially, fixed an attempt to open a file for write that was already opened for write, which caused a crash and deadlock.
*	support chunking for all external special remotes!	Joey Hess	2014-07-29
\| \| \| \| \| \| \|	Removing code and at the same time adding great features, including upload/download resuming. This commit was sponsored by Romain Lenglet.
*	better type for Retriever	Joey Hess	2014-07-29
\| \| \| \| \| \| \| \|	Putting a callback in the Retriever type allows for the callback to remove the retrieved file when it's done with it. I did not really want to make Retriever be fixed to Annex Bool, but when I tried to use Annex a, I got into some type of type mess.
*	allow Retriever action to update the progress meter	Joey Hess	2014-07-29
\| \| \| \| \| \| \| \|	Needed for eg, Remote.External. Generally, any Retriever that stores content in a file is responsible for updating the meter, while ones that procude a lazy bytestring cannot update the meter, so are not asked to.
*	lift types from IO to Annex	Joey Hess	2014-07-29
\| \| \| \| \| \| \| \| \| \| \|	Some remotes like External need to run store and retrieve actions in Annex, not IO. In order to do that lift, I had to dive pretty deep into the utilities, making Utility.Gpg and Utility.Tmp be partly converted to using MonadIO, and Control.Monad.Catch for exception handling. There should be no behavior changes in this commit. This commit was sponsored by Michael Barabanov.
*	add ContentSource type, for remotes that act on files rather than ByteStrings	Joey Hess	2014-07-29
\| \| \| \| \|	Note that currently nothing cleans up a ContentSource's file, when eg, retrieving chunks.
*	fix non-checked hasKeyChunks	Joey Hess	2014-07-29
\|
*	make explicit the implicit requirement that CHECKPRESENT not say a key is ↵	Joey Hess	2014-07-28
\| \| \| \|	present until it's all done being stored
*	resume interrupted chunked uploads	Joey Hess	2014-07-28
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Leverage the new chunked remotes to automatically resume uploads. Sort of like rsync, although of course not as efficient since this needs to start at a chunk boundry. But, unlike rsync, this method will work for S3, WebDAV, external special remotes, etc, etc. Only directory special remotes so far, but many more soon! This implementation will also allow starting an upload from one repository, interrupting it, and then resuming the upload to the same remote from an entirely different repository. Note that I added a comment that storeKey should atomically move the content into place once it's all received. This was already an undocumented requirement -- it's necessary for hasKey to work reliably. This resume code just uses hasKey to find the first chunk that's missing. Note that if there are two uploads of the same key to the same chunked remote, one might resume at the point the other had gotten to, but both will then redundantly upload. As before. In the non-resume case, this adds one hasKey call per storeKey, and only if the remote is configured to use chunks. Future work: Try to eliminate that hasKey. Notice that eg, `git annex copy --to` checks if the key is present before sending it, so is already running hasKey.. which could perhaps be cached and reused. However, this additional overhead is not very large compared with transferring an entire large file, and the ability to resume is certianly worth it. There is an optimisation in place for small files, that avoids trying to resume if the whole file fits within one chunk. This commit was sponsored by Georg Bauer.
*	fix handling of removal of keys that are not present	Joey Hess	2014-07-28
\|
*	add ChunkMethod type and make Logs.Chunk use it, rather than assuming fixed ↵	Joey Hess	2014-07-28
\| \| \| \| \| \| \| \|	size chunks (so eg, rolling hash chunks can be supported later) If a newer git-annex starts logging something else in the chunk log, it won't be used by this version, but it will be preserved when updating the log.
*	Merge branch 'master' of ssh://git-annex.branchable.com into newchunks	Joey Hess	2014-07-28
\|\
\| *	(no commit message)	divB	2014-07-27
\| \|
\| *	(no commit message)	divB	2014-07-27
\| \|
\| *	devblog	Joey Hess	2014-07-27
\| \|
* \|	resume interrupted chunked downloads	Joey Hess	2014-07-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Leverage the new chunked remotes to automatically resume downloads. Sort of like rsync, although of course not as efficient since this needs to start at a chunk boundry. But, unlike rsync, this method will work for S3, WebDAV, external special remotes, etc, etc. Only directory special remotes so far, but many more soon! This implementation will also properly handle starting a download from one remote, interrupting, and resuming from another one, and so on. (Resuming interrupted chunked uploads is similarly doable, although slightly more expensive.) This commit was sponsored by Thomas Djärv.
* \|	add key stability checking interface	Joey Hess	2014-07-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Needed for resuming from chunks. Url keys are considered not stable. I considered treating url keys with a known size as stable, but just don't feel that is enough information.
* \|	use map for faster backend name lookup	Joey Hess	2014-07-27
\| \|
* \|	Merge branch 'master' into newchunks	Joey Hess	2014-07-27
\|\\| \| \| \| \| \| \| \| \|	Conflicts: doc/design/assistant/chunks.mdwn
\| *	update	Joey Hess	2014-07-27
\| \|
* \|	use existing chunks even when chunk=0	Joey Hess	2014-07-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When chunk=0, always try the unchunked key first. This avoids the overhead of needing to read the git-annex branch to find the chunkcount. However, if the unchunked key is not present, go on and try the chunks. Also, when removing a chunked key, update the chunkcounts even when chunk=0.
* \|	reorg	Joey Hess	2014-07-27
\| \|
* \|	comment typo	Joey Hess	2014-07-27
\| \|
* \|	faster storeChunks	Joey Hess	2014-07-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	No need to process each L.ByteString chunk, instead ask it to split. Doesn't seem to have really sped things up much, but it also made the code simpler. Note that this does (and already did) buffer in memory. It seems that only the directory special remote could take advantage of streaming chunks to files w/o buffering, so probably won't add an interface to allow for that.
* \|	better Preparer interface	Joey Hess	2014-07-27
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This will allow things like WebDAV to opean a single persistent connection and reuse it for all the chunked data. The crazy types allow for some nice code reuse.
* \|	update does for chunking	Joey Hess	2014-07-26
\| \|
* \|	improve exception handling	Joey Hess	2014-07-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Push it down from needing to be done in every Storer, to being checked once inside ChunkedEncryptable. Also, catch exceptions from PrepareStorer and PrepareRetriever, just in case..
* \|	add some more exception handling primitives	Joey Hess	2014-07-26
\| \|
* \|	better exception display	Joey Hess	2014-07-26
\| \|
* \|	fix key checking when a directory special remote's directory is missing	Joey Hess	2014-07-26
\| \| \| \| \| \| \| \| \| \|	The best thing to do in this case is return Left, so that anything that tries to access it will fail.
* \|	fix another fallback bug	Joey Hess	2014-07-26
\| \|
* \|	allM has slightly better memory use	Joey Hess	2014-07-26
\| \|
* \|	fix fallback to other chunk size when first does not have it	Joey Hess	2014-07-26
\| \|
\| *	Merge branch 'master' of ssh://git-annex.branchable.com	Joey Hess	2014-07-26
\| \|\
\| * \|	devblog	Joey Hess	2014-07-26
\| \| \|
* \| \|	doc update for new chunking	Joey Hess	2014-07-26
\| \| \|
* \| \|	fix build	Joey Hess	2014-07-26
\| \| \|
* \| \|	fix build	Joey Hess	2014-07-26
\| \| \|
* \| \|	convert directory special remote to using ChunkedEncryptable	Joey Hess	2014-07-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	And clean up legacy chunking code, which is in its own module now. So much cleaner! This commit was sponsored by Henrik Ahlgren
* \| \|	Support for remotes that are chunkable and encryptable.	Joey Hess	2014-07-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I'd have liked to keep these two concepts entirely separate, but that are entagled: Storing a key in an encrypted and chunked remote need to generate chunk keys, encrypt the keys, chunk the data, encrypt the chunks, and send them to the remote. Similar for retrieval, etc. So, here's an implemnetation of all of that. The total win here is that every remote was implementing encrypted storage and retrival, and now it can move into this single place. I expect this to result in several hundred lines of code being removed from git-annex eventually! This commit was sponsored by Henrik Ahlgren.
* \| \|	finish up basic chunked remote groundwork	Joey Hess	2014-07-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Chunk retrieval and reassembly, removal, and checking if all necessary chunks are present. This commit was sponsored by Damien Raude-Morvan.
* \| \|	wording	Joey Hess	2014-07-26
\| \| \|