aboutsummaryrefslogtreecommitdiffhomepage
path: root/reader
Commit message (Collapse)AuthorAge
* Update scraper rule for heise.deGravatar Frédéric Guillot2018-08-25
|
* Use canonical importsGravatar Frédéric Guillot2018-08-24
|
* Add support for published tag in Atom feedsGravatar neepl2018-07-17
|
* Add embedly.com to iframe whitelistGravatar Frédéric Guillot2018-07-10
|
* New `add_dynamic_image` rewriter for JavaScript-loaded images.Gravatar dzaikos2018-07-09
| | | | | | Searches tags for various `data-*` attributes and sets `img` tag `src` attribute appropriately. Falls back to searching `noscript` for `img` tags. Includes unit tests.
* Processor: Do rewriter before sanitizer for `entry.Content`.Gravatar dzaikos2018-07-06
| | | | Addresses #163.
* Add support for protocol relative YouTube URLsGravatar Frédéric Guillot2018-07-04
|
* Sandbox iframes when sanitizing.Gravatar dzaikos2018-07-03
| | | | | | Updated iframe unit tests. Refactored sanitizer.getExtraAttributes() to use `switch` instead of multiple `if` statements.
* Add specific 404 and 401 error messagesGravatar Frédéric Guillot2018-06-30
|
* Refactor AddImageTitle rewriter.Gravatar dzaikos2018-06-26
| | | | | | | | | | * Only processes images with `src` **and** `title` attributes (others are ignored). * Processes **all** images in the document (not just the first one). * Wraps the image and its title attribute in a `figure` tag with the title attribute's contents in a `figcaption` tag. Updated xkcd rewriter unit test. Added another xkcd rewriter unit test to check rendering of images without title tags.
* Improve sanitizer to remove style tag contents.Gravatar dzaikos2018-06-24
| | | | | | See #157. Refactored how blacklisted tags are handled so they're easier manage in the future.
* Improve sanitizer to remove script and noscript contentsGravatar Dave Z2018-06-23
| | | | | These tags where removed but the content was rendered as escaped HTML. See #157
* Add new fields for feed username/passwordGravatar Frédéric Guillot2018-06-19
|
* Rewrite iframe Youtube URLs to https://www.youtube-nocookie.comGravatar Frédéric Guillot2018-06-12
|
* Handle feeds with dates formatted as Unix timestampGravatar Frédéric Guillot2018-05-08
|
* Add API endpoint to import OPML fileGravatar Frédéric Guillot2018-04-29
|
* Move HTTP client to its own packageGravatar Frédéric Guillot2018-04-28
|
* Scrape parent element for iframeGravatar aniran2018-04-27
| | | | | | | | Current behavior: if you have an `iframe` scraper rule, `scrapContent` tries to return the inner HTML of the `iframe`, which turns up blank. New behavior: like `img` elements, if an `iframe` is matched by a scraper rule, the parent element's inner HTML (i.e. the `iframe` is returned).
* Add soundcloud and bandcamp iframe sourcesGravatar aniran2018-04-27
|
* Add support for Dublin Core date in RDF feedsGravatar Frédéric Guillot2018-04-10
|
* Handle some non-english date formatsGravatar Frédéric Guillot2018-04-09
|
* Rename RSS parser gettersGravatar Frédéric Guillot2018-04-09
|
* Get the right comments URL when having multiple namespacesGravatar Frédéric Guillot2018-04-09
|
* Add unit test for comments url and French translationGravatar Frédéric Guillot2018-04-07
|
* Add CommentsURL to entryGravatar Ben Brooks2018-04-07
|
* Handle RSS author elements with inner HTMLGravatar Frédéric Guillot2018-03-18
|
* Convert enclosure size field to bigintGravatar Frédéric Guillot2018-03-14
|
* Fix broken OPML import with Go 1.10Gravatar Frédéric Guillot2018-03-14
|
* Improve parser error messagesGravatar Frédéric Guillot2018-02-27
|
* Support localized feed errors generated by background workersGravatar Frédéric Guillot2018-02-27
|
* Handle Atom feeds with HTML titleGravatar Frédéric Guillot2018-02-17
|
* Improve error handling for HTTP clientGravatar Frédéric Guillot2018-02-08
|
* Strip invalid XML characters to avoid parsing errorsGravatar Frédéric Guillot2018-02-07
|
* Remove period for feed errorsGravatar Frédéric Guillot2018-02-07
|
* Improve error handling when the response is emptyGravatar Frédéric Guillot2018-02-07
|
* Show API URL endpoints in user interfaceGravatar Frédéric Guillot2018-01-31
|
* Do not override existing entries when the crawler is enabledGravatar Frédéric Guillot2018-01-20
|
* Handle more encoding edge casesGravatar Frédéric Guillot2018-01-20
| | | | | | - Feeds with charset specified only in Content-Type header and not in XML document - Feeds with charset specified in both places - Feeds with charset specified only in XML document and not in HTTP header
* Do not crawl existing entry URLsGravatar Frédéric Guillot2018-01-20
|
* Add more comments (GoDoc)Gravatar Frédéric Guillot2018-01-11
|
* Add scraper rule for darkreading.comGravatar Frédéric Guillot2018-01-06
|
* Add more scraper rulesGravatar Frédéric Guillot2018-01-04
|
* Add content length check when refreshing feedsGravatar Frédéric Guillot2018-01-04
|
* Handle more date formatsGravatar Frédéric Guillot2018-01-03
|
* If the website URL is empty, assign the feed URLGravatar Frédéric Guillot2018-01-03
|
* Rename helper packagesGravatar Frédéric Guillot2018-01-02
|
* Make sure the scraper parse only HTML documentsGravatar Frédéric Guillot2018-01-02
|
* Add scraper rules for version2.dk and ing.dkGravatar Frédéric Guillot2017-12-27
|
* Add more scraper rulesGravatar Frédéric Guillot2017-12-27
|
* Add support for data URL faviconsGravatar Frédéric Guillot2017-12-22
|