path: root/Remote/S3.hs
Commit message / Author / Age
* add filename to progress bar, and display ok/failed at end (Joey Hess, 2015-04-14)
|   This needed plumbing an AssociatedFile through retrieveKeyFileCheap.
* update my email address and homepage url (Joey Hess, 2015-01-21)
|
* add getFileSize, which can get the real size of a large file on Windows (Joey Hess, 2015-01-20)
|   Avoid using fileSize which maxes out at just 2 gb on Windows. Instead, use hFileSize, which doesn't have a bounded size. Fixes support for files > 2 gb on Windows.
|
|   Note that the InodeCache code only needs to compare a file size, so it doesn't matter if the file size wraps. So it has been left as-is. This was necessary both to avoid invalidating existing inode caches, and because the code passed FileStatus around and would have become more expensive if it called getFileSize.
|
|   This commit was sponsored by Christian Dietrich.
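The hFileSize approach described in the commit above can be sketched as follows. This is a minimal illustration and not the commit's actual code: the helper name `fileSizeViaHandle` is made up here, and the real getFileSize may be written differently.

```haskell
import System.IO (IOMode (ReadMode), hFileSize, withBinaryFile)

-- Hypothetical helper name; not taken from the git-annex source.
-- hFileSize returns an unbounded Integer, so the result does not
-- wrap for files larger than 2 gb.
fileSizeViaHandle :: FilePath -> IO Integer
fileSizeViaHandle path = withBinaryFile path ReadMode hFileSize
```

The trade-off noted in the commit message still applies: this opens the file, so it costs more than reading a size out of a FileStatus that is already in hand.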
* Fix build with -f-S3. (Joey Hess, 2014-12-19)
|
* reformat (Joey Hess, 2014-12-16)
|
* Expand checkurl to support recommended filename, and multi-file-urls (Joey Hess, 2014-12-11)
|   This commit was sponsored by an anonymous bitcoiner.
* Urls can now be claimed by remotes. This will allow creating, for example, an external special remote that handles magnet: and *.torrent urls. (Joey Hess, 2014-12-08)
|
* add stub claimUrl (Joey Hess, 2014-12-08)
|
* support S3 front-end used by globalways.net (Joey Hess, 2014-11-05)
|   This threw an unusual exception w/o an error message when probing to see if the bucket exists yet. So rather than relying on tryS3, catch all exceptions.
|
|   This does mean that it might get an exception for some transient network error, think this means the bucket does not exist (DNE) yet, and try to create it, and then fail when it already exists.
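A rough sketch of the "catch all exceptions" probe described above, using only Control.Exception from base. The names `tryAny` and `bucketSeemsToExist` are assumptions made for illustration, and `probe` stands in for whatever request the remote actually sends; none of this is the commit's code.

```haskell
import Control.Exception (SomeException, try)

-- Catch every exception, not only the ones an S3-specific wrapper knows about.
tryAny :: IO a -> IO (Either SomeException a)
tryAny = try

-- Hypothetical probe wrapper: any exception at all is read as
-- "the bucket does not exist yet".
bucketSeemsToExist :: IO a -> IO Bool
bucketSeemsToExist probe = either (const False) (const True) <$> tryAny probe
```

As the commit message says, the cost of this choice is that a transient network error is indistinguishable from a missing bucket, so a later create attempt can fail because the bucket already exists.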
* Revert "work around minimum part size problem" (Joey Hess, 2014-11-04)
|   This reverts commit 2ba5af49c94b97c586220c3553367988ef095934.
|
|   I misunderstood the cause of the problem.
* work around minimum part size problem (Joey Hess, 2014-11-04)
|   When uploading the last part of a file, which was 640229 bytes, S3 rejected that part: "Your proposed upload is smaller than the minimum allowed size"
|
|   I don't know what the minimum is, but the fix is just to include the last part into the previous part. Since this can result in a part that's double-sized, use half-sized parts normally.
* fix a couple type errors and the progress bar (Joey Hess, 2014-11-04)
|
* fix memory leak (Joey Hess, 2014-11-04)
|   Unfortunately, I don't fully understand why it was leaking using the old method of a lazy bytestring. I just know that it was leaking, despite neither hGetUntilMetered nor byteStringPopper seeming to leak by themselves.
|
|   The new method avoids the lazy bytestring, and simply reads chunks from the handle and streams them out to the http socket.
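The "read chunks from the handle and stream them out" idea can be sketched as below. This is only an illustration under stated assumptions: `streamChunks` and `copyChunked` are made-up names, the `send` callback stands in for writing to the http socket, and the 32768-byte chunk size is arbitrary.

```haskell
import qualified Data.ByteString as S
import System.IO (Handle, IOMode (ReadMode), withBinaryFile)

-- Read strict chunks of at most chunkSize bytes and hand each one to
-- 'send' straight away, so only one chunk is held in memory at a time.
streamChunks :: Int -> Handle -> (S.ByteString -> IO ()) -> IO ()
streamChunks chunkSize h send = loop
  where
    loop = do
        chunk <- S.hGetSome h chunkSize
        if S.null chunk
            then return ()          -- an empty chunk signals end of file
            else send chunk >> loop

-- Usage example: copy a file chunk by chunk, appending to the destination
-- in place of a network socket.
copyChunked :: FilePath -> FilePath -> IO ()
copyChunked src dst = do
    S.writeFile dst S.empty
    withBinaryFile src ReadMode $ \h ->
        streamChunks 32768 h (S.appendFile dst)
```

Using strict chunks rather than one lazy bytestring means no reference to the start of the stream can keep already-sent data alive, which is the general reason this style tends to avoid leaks.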
* combine 2 checks (Joey Hess, 2014-11-04)
|
* casts; now fully working.. but still leaking (Joey Hess, 2014-11-03)
|   Still seems to buffer the whole partsize in memory, but I'm pretty sure my code is not what's doing it. See https://github.com/aristidb/aws/issues/142
* this should avoid leaking memory (Joey Hess, 2014-11-03)
|
* logic error (Joey Hess, 2014-11-03)
|
* WIP 3 (Joey Hess, 2014-11-03)
|
* WIP 2 (Joey Hess, 2014-11-03)
|
* WIP try sending using RequestBodyStreamChunked (Joey Hess, 2014-11-03)
|   May not work; if it does this is gonna be the simplest way to get good memory size and progress reporting.
* link to memory leak bug (Joey Hess, 2014-11-03)
|
* improve info display for multipart (Joey Hess, 2014-11-03)
|
* fix build (Joey Hess, 2014-11-03)
|
* adjust version check (Joey Hess, 2014-11-03)
|   I assume 0.10.6 will have the fix for the bug I reported, which got fixed in master already..
* show multipart configuration in git annex info s3remote (Joey Hess, 2014-11-03)
|
* finish multipart support using unreleased update to aws lib to yield etags (Joey Hess, 2014-11-03)
|   Untested and not even compiled yet.
|
|   Testing should include checks that file content streams through without buffering in memory.
|
|   Note that CL.consume causes all the etags to be buffered in memory. This is probably nearly unavoidable, since a request has to be constructed that contains the list of etags in its body. (While it might be possible to stream generation of the body, that would entail making a http request that dribbles out parts of the body as the multipart uploads complete, which is not likely to work well.)
|
|   To limit this being a problem, it's best for partsize to be set to some suitably large value, like 1gb. Then a full terabyte file will need only 1024 etags to be stored, which will probably use around 1 mb of memory.
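The etag count quoted above follows from rounding the file size up to a whole number of parts. A small sketch of that arithmetic, with a made-up function name and assuming binary units (1 gb = 2^30 bytes, 1 terabyte = 2^40 bytes) and a part size greater than zero:

```haskell
-- Number of parts, and therefore etags, needed for a file.
-- Assumes partSize > 0; rounds up so a short final part still counts.
partCount :: Integer -> Integer -> Integer
partCount fileSize partSize = (fileSize + partSize - 1) `div` partSize

-- The example from the commit message: a terabyte file in 1 gb parts.
terabyteExample :: Integer
terabyteExample = partCount (2 ^ (40 :: Int)) (2 ^ (30 :: Int))
```

Here `terabyteExample` evaluates to 1024, matching the figure in the commit message. The memory estimate of around 1 mb is the author's own and is not derived by this code.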
* WIP multipart S3 upload (Joey Hess, 2014-10-28)
|   I'm a little stuck on getting the list of etags of the parts. This seems to require taking the md5 of each part locally, which doesn't get along well with lazily streaming in the part from the file. It would need to read the file twice, or lose laziness and buffer a whole part -- but parts might be quite large.
|
|   This seems to be a problem with the API provided; S3 is supposed to return an etag, but that is not exposed. I have filed a bug: https://github.com/aristidb/aws/issues/141
* fix build (Joey Hess, 2014-10-23)
|
* fix build (Joey Hess, 2014-10-23)
|
* update for aws 0.10's better handling of DNE for HEAD (Joey Hess, 2014-10-23)
|   Kept support for older aws, since Debian has 0.9.2 still.
* fix build (Joey Hess, 2014-10-23)
|
* one last build fix, yes it builds now (Joey Hess, 2014-10-23)
|
* needs type families (Joey Hess, 2014-10-23)
|
* fix build (Joey Hess, 2014-10-23)
|
* Merge branch 'master' into s3-aws (Joey Hess, 2014-10-22)
|\
| |   Conflicts: Remote/S3.hs
| * add internet archive item url to info (Joey Hess, 2014-10-21)
| |
| * include creds location in info (Joey Hess, 2014-10-21)
| |   This is intended to let the user easily tell if a remote's creds are coming from info embedded in the repository, or instead from the environment, or perhaps are locally stored in a creds file.
| |
| |   This commit was sponsored by Frédéric Schütz.
| * add per-remote-type info (Joey Hess, 2014-10-21)
| |   Now `git annex info $remote` shows info specific to the type of the remote, for example, it shows the rsync url.
| |
| |   Remote types that support encryption or chunking also include that in their info.
| |
| |   This commit was sponsored by Ævar Arnfjörð Bjarmason.
| * S3: Fix embedcreds=yes handling for the Internet Archive. (Joey Hess, 2014-10-12)
| |   Before, embedcreds=yes did not cause the creds to be stored in remote.log, but also prevented them being locally cached.
| * glacier, S3: Fix bug that caused embedded creds to not be encrypted using the remote's key. (Joey Hess, 2014-09-18)
| |   encryptionSetup must be called before setRemoteCredPair. Otherwise, the RemoteConfig doesn't have the cipher in it, and so no cipher is used to encrypt the embedded creds.
| |
| |   This is a security fix for non-shared encryption methods!
| |
| |   For encryption=shared, there's no security problem, just an inconsistency in whether the embedded creds are encrypted.
| |
| |   This is very important to get right, so used some types to help ensure that setRemoteCredPair is only run after encryptionSetup. Note that the external special remote bypasses the type safety, since creds can be set after the initial remote config, if the external special remote program requests it. Also note that IA remotes never use encryption, so encryptionSetup is not run for them at all, and again the type safety is bypassed.
| |
| |   This leaves two open questions:
| |
| |   1. What to do about S3 and glacier remotes that were set up using encryption=pubkey/hybrid with embedcreds? Such a git repo has a security hole embedded in it, and this needs to be communicated to the user. Is the changelog enough?
| |
| |   2. enableremote won't work in such a repo, because git-annex will try to decrypt the embedded creds, which are not encrypted, so fails. This needs to be dealt with, especially for encryption=shared repos, which are not really broken, just inconsistently configured.
| |
| |      Noticing that problem for encryption=shared is what led to commit cc54ff9e49260cd94f938e69e926a273e231ef4e, which tried to fix the problem by not decrypting the embedded creds.
| |
| |   This commit was sponsored by Josh Taylor.
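The "used some types" idea in the commit above can be illustrated with a proof token: the function that stores creds demands a value that only the encryption setup step produces. Everything below is a simplified, hypothetical sketch. The names `SetupProof`, `setupEncryptionSketch` and `storeCredsSketch`, the list-based config, and the placeholder "encryption" are all invented for illustration and are not the git-annex definitions.

```haskell
-- Toy remote config: a list of key/value settings.
type Config = [(String, String)]

-- Proof that encryption setup has run. In a real module the constructor
-- would not be exported, so only the setup function could create one.
data SetupProof = SetupProof

-- Hypothetical setup step: adds a cipher to the config and returns the proof.
setupEncryptionSketch :: Config -> (Config, SetupProof)
setupEncryptionSketch cfg = (("cipher", "example-cipher") : cfg, SetupProof)

-- Storing creds requires the proof, so it cannot be called before setup
-- without a type error. No real encryption happens here; the stored value
-- is just a placeholder showing which path was taken.
storeCredsSketch :: SetupProof -> Config -> String -> Config
storeCredsSketch SetupProof cfg creds = case lookup "cipher" cfg of
    Just c  -> ("creds", "<encrypted with " ++ c ++ ">") : cfg
    Nothing -> ("creds", creds) : cfg
```

Callers that cannot run the setup step, such as the external special remote and IA cases the commit mentions, would need some way to obtain the token anyway, which is where the type safety is bypassed.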
* | Merge branch 'master' into s3-aws (Joey Hess, 2014-08-15)
|\|
| |   Conflicts: git-annex.cabal
| * testremote: Add testing of behavior when remote is not available (Joey Hess, 2014-08-10)
| |   Added a mkUnavailable method, which a Remote can use to generate a version of itself that is not available. Implemented for several, but not yet all remotes.
| |
| |   This allows testing that checkPresent properly throws an exception when it cannot check if a key is present or not. It also allows testing that the other methods don't throw exceptions in these circumstances.
| |
| |   This immediately found several bugs, which this commit also fixes!
| |
| |   * git remotes using ssh accidentally had checkPresent return an exception, rather than throwing it
| |   * The chunking code accidentally returned False rather than propagating an exception when there were no chunks and checkPresent threw an exception for the non-chunked key.
| |
| |   This commit was sponsored by Carlo Matteo Capocasa.
* | auto-create IA buckets (Joey Hess, 2014-08-09)
| |   Needs my patch to aws which will hopefully be accepted soon.
* | better memoization (Joey Hess, 2014-08-09)
| |
* | fix meter update (Joey Hess, 2014-08-09)
| |
* | S3: finish converting to aws library (Joey Hess, 2014-08-09)
| |   Implemented the Retriever.
| |
| |   Unfortunately, it is a fileRetriever and not a byteRetriever. It should be possible to convert this to a byteRetriever, but I got stuck: The conduit sink needs to process individual chunks, but a byteRetriever needs to pass a single L.ByteString to its callback for processing. I looked into using unsafeInterleaveIO to build up the bytestring lazily, but the sink is already operating under conduit's inversion of control, and does not run directly in IO anyway.
| |
| |   On the plus side, no more memory leak..
* | pass metadata headers and storage class to S3 when putting objects (Joey Hess, 2014-08-09)
| |
* | remove dead code (Joey Hess, 2014-08-09)
| |
* | S3: convert to aws for store, remove, checkPresent (Joey Hess, 2014-08-09)
| |   Fixes the memory leak on store.. the second oldest open git-annex bug!
| |
| |   Only retrieve remains to be converted.
| |
| |   This commit was sponsored by Scott Robinson.
* | wording (Joey Hess, 2014-08-08)
| |