Commit message | Author | Age | ||
---|---|---|---|---|
... | ||||
* | Processor: Do rewriter before sanitizer for `entry.Content`. | 2018-07-06 | ||
| | | | | Addresses #163. | |||
* | Add support for protocol relative YouTube URLs | 2018-07-04 | ||
| | ||||
* | Sandbox iframes when sanitizing. | 2018-07-03 | ||
| | | | | | | Updated iframe unit tests. Refactored sanitizer.getExtraAttributes() to use `switch` instead of multiple `if` statements. | |||
* | Add specific 404 and 401 error messages | 2018-06-30 | ||
| | ||||
* | Refactor AddImageTitle rewriter. | 2018-06-26 | ||
| | | | | | | | | | | * Only processes images with `src` **and** `title` attributes (others are ignored). * Processes **all** images in the document (not just the first one). * Wraps the image and its title attribute in a `figure` tag with the title attribute's contents in a `figcaption` tag. Updated xkcd rewriter unit test. Added another xkcd rewriter unit test to check rendering of images without `title` attributes. | |||
* | Improve sanitizer to remove style tag contents. | 2018-06-24 | ||
| | | | | | | See #157. Refactored how blacklisted tags are handled so they're easier to manage in the future. | |||
* | Improve sanitizer to remove script and noscript contents | 2018-06-23 | ||
| | | | | | These tags were removed but the content was rendered as escaped HTML. See #157. | |||
* | Add new fields for feed username/password | 2018-06-19 | ||
| | ||||
* | Rewrite iframe YouTube URLs to https://www.youtube-nocookie.com | 2018-06-12 | ||
| | ||||
* | Handle feeds with dates formatted as Unix timestamp | 2018-05-08 | ||
| | ||||
* | Add API endpoint to import OPML file | 2018-04-29 | ||
| | ||||
* | Move HTTP client to its own package | 2018-04-28 | ||
| | ||||
* | Scrape parent element for iframe | 2018-04-27 | ||
| | | | | | | | | Current behavior: if you have an `iframe` scraper rule, `scrapContent` tries to return the inner HTML of the `iframe`, which turns up blank. New behavior: like `img` elements, if an `iframe` is matched by a scraper rule, the parent element's inner HTML (i.e. the `iframe`) is returned. | |||
* | Add soundcloud and bandcamp iframe sources | 2018-04-27 | ||
| | ||||
* | Add support for Dublin Core date in RDF feeds | 2018-04-10 | ||
| | ||||
* | Handle some non-English date formats | 2018-04-09 | ||
| | ||||
* | Rename RSS parser getters | 2018-04-09 | ||
| | ||||
* | Get the right comments URL when having multiple namespaces | 2018-04-09 | ||
| | ||||
* | Add unit test for comments URL and French translation | 2018-04-07 | ||
| | ||||
* | Add CommentsURL to entry | 2018-04-07 | ||
| | ||||
* | Handle RSS author elements with inner HTML | 2018-03-18 | ||
| | ||||
* | Convert enclosure size field to bigint | 2018-03-14 | ||
| | ||||
* | Fix broken OPML import with Go 1.10 | 2018-03-14 | ||
| | ||||
* | Improve parser error messages | 2018-02-27 | ||
| | ||||
* | Support localized feed errors generated by background workers | 2018-02-27 | ||
| | ||||
* | Handle Atom feeds with HTML title | 2018-02-17 | ||
| | ||||
* | Improve error handling for HTTP client | 2018-02-08 | ||
| | ||||
* | Strip invalid XML characters to avoid parsing errors | 2018-02-07 | ||
| | ||||
* | Remove period for feed errors | 2018-02-07 | ||
| | ||||
* | Improve error handling when the response is empty | 2018-02-07 | ||
| | ||||
* | Show API URL endpoints in user interface | 2018-01-31 | ||
| | ||||
* | Do not override existing entries when the crawler is enabled | 2018-01-20 | ||
| | ||||
* | Handle more encoding edge cases | 2018-01-20 | ||
| | | | | | | - Feeds with charset specified only in Content-Type header and not in XML document - Feeds with charset specified in both places - Feeds with charset specified only in XML document and not in HTTP header | |||
* | Do not crawl existing entry URLs | 2018-01-20 | ||
| | ||||
* | Add more comments (GoDoc) | 2018-01-11 | ||
| | ||||
* | Add scraper rule for darkreading.com | 2018-01-06 | ||
| | ||||
* | Add more scraper rules | 2018-01-04 | ||
| | ||||
* | Add content length check when refreshing feeds | 2018-01-04 | ||
| | ||||
* | Handle more date formats | 2018-01-03 | ||
| | ||||
* | If the website URL is empty, assign the feed URL | 2018-01-03 | ||
| | ||||
* | Rename helper packages | 2018-01-02 | ||
| | ||||
* | Make sure the scraper parses only HTML documents | 2018-01-02 | ||
| | ||||
* | Add scraper rules for version2.dk and ing.dk | 2017-12-27 | ||
| | ||||
* | Add more scraper rules | 2017-12-27 | ||
| | ||||
* | Add support for data URL favicons | 2017-12-22 | ||
| | ||||
* | Handle more date formats | 2017-12-22 | ||
| | ||||
* | Add logger | 2017-12-15 | ||
| | ||||
* | Improve content scraper | 2017-12-13 | ||
| | ||||
* | Make sure that item URLs are absolute | 2017-12-13 | ||
| | ||||
* | Rewrite imports | 2017-12-12 | ||
| |