I have a feed from https://b3ta.com/ at https://b3ta.com/xml/combined.php?rdf=1 that is not up to date.
I can fetch the feed form this URL with
curl 'https://b3ta.com/xml/combined.php?rdf=1' and see that there are entries as new as 2020-11-02T19:18:00+00:00 but the latest entry in NewsBlur is 2020-08-14.
Instant Fetch does not cause it to update. Using “Site Settings” shows an OK parse as late as
2020-11-17 20:26:45 OK (200). “Statistics” says “This site has published zero stories in the past month or has averaged less than a single story a month.” but
curl 'https://b3ta.com/xml/combined.php?rdf=1' | grep '<dc:date>' shows two stories per month on average.
I’m new to NewsBlur so I may be misunderstanding something about how it works, but this seems odd.
I’m finding more of these. For example https://blog.perfectdayfoods.com/blog/rss.xml . In NewsBlur the last story is 2019-12-17 but
curl shows the last entry correctly as 2020-10-10.
By the way, I’m checking these feeds with https://validator.w3.org/feed/#validate_by_uri to make sure they’re not broken as well.
Some sort of cacheing issue? Why isn’t NewsBlur seeing the feeds?
This is quite a problem as it means I can’t trust NewsBlur on whether a blog is really dead.
By the way, I’d appreciate it if some other NewsBlur reader could try out these feeds and see if they have the same issue, just to make sure my quirky setup isn’t causing it. Thanks!
Looks like B3ta is invalid after all. https://validator.w3.org/feed/check.cgi?url=https%3A%2F%2Fb3ta.com%2Fxml%2Fcombined.php%3Frdf%3D1
I tried loading it and it has some bizarre things going on with it, including dating stories from 2001.
Oh, the second feed, https://newsblur.com/site/8025234/perfect-day-blog, looks exactly like the feed. The 2020-10-10 date you’re seeing if the date the RSS feed was last updated. It’s not on an
item but rather the
Many thanks for taking a look at these. Much appreciated. I need to read the RSS spec so that I can check them better myself.