diff options
author | Joey Hess <joeyh@joeyh.name> | 2015-07-20 14:56:57 -0400 |
---|---|---|
committer | Joey Hess <joeyh@joeyh.name> | 2015-07-20 14:56:57 -0400 |
commit | 25f5ee379cebcc7ea4cb0b338f43f3c0e7477400 (patch) | |
tree | 9589c405aa007c24ebd51fb16362e455e93d3795 /doc/git-annex-schedule.mdwn | |
parent | 347d025c0930ac7994aa00e92fdfe8b54a2258e2 (diff) |
importfeed: Look at not only permalinks, but now also guids to identify previously downloaded files.
I've seen rss feeds that have no permalinks, only guids (which are
sometimes in the form of permalinks, argh/sigh).
I had previously avoided trusting guids to be globally unique, because my
survey of rss feeds that I subscribe to shows a lot of pretty bad
"guids" like "2 at http://serialpodcast.org" or even worse "oth20150401-hq".
Worry was that two podcasts that are generating guids so badly, that
there's no guarantee they're actually globally unique.
But, I'm seeing too many url changes that result in redundant files, so
let's try this. If feeds are so broken that guids overlap, they could just
as well incorrectly call them permalinks too.
Diffstat (limited to 'doc/git-annex-schedule.mdwn')
0 files changed, 0 insertions, 0 deletions