Monday February 02, 2004

JavaBlogs and Roller's duplicate post problem

Charles has figured out why JavaBlogs gets duplicate posts from Roller-based blogs. And Dave proposes a solution:

I just now changed the JRoller "absolute URL to site" setting to force the domain name to jroller.com. The setting was blank before.

Without this setting, the JRoller feeds were using whatever hostname was requested at cache refresh time. So if the first request after the cache timeout was for freeroller.net, then the GUIDs in the RSS feed would read freeroller.net until the next cache timeout.
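To make the mechanism concrete, here is a minimal sketch in Java. It is not Roller's actual code: the class, the method names, and the entry path are all hypothetical, and it assumes the GUID is simply the base URL plus the entry's path, and that an aggregator treats every unseen GUID as a new post.

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Illustrative only: none of these names come from Roller or JavaBlogs.
class GuidSketch {

    // Assumed rule: the GUID is the base URL plus the entry path.
    // When the "absolute URL" setting is blank, the base falls back
    // to whatever host the incoming request used.
    static String guidFor(String absoluteUrl, String requestHost, String entryPath) {
        String base = (absoluteUrl == null || absoluteUrl.isEmpty())
                ? "http://" + requestHost
                : absoluteUrl;
        return base + entryPath;
    }

    // Counts how many distinct posts an aggregator keyed on GUID would
    // record for ONE entry, given the host seen at each cache refresh.
    static int distinctPosts(String absoluteUrl, List<String> hostsAtRefresh, String entryPath) {
        Set<String> seen = new LinkedHashSet<>();
        for (String host : hostsAtRefresh) {
            seen.add(guidFor(absoluteUrl, host, entryPath));
        }
        return seen.size();
    }

    public static void main(String[] args) {
        // Hosts that happened to trigger three successive cache refreshes.
        List<String> hosts = List.of("jroller.com", "freeroller.net", "jroller.com");
        String path = "/page/example/20040202"; // hypothetical entry path

        // Blank setting: the same entry shows up under two GUIDs.
        System.out.println(distinctPosts("", hosts, path)); // prints 2

        // Fixed setting: one GUID no matter which host was requested.
        System.out.println(distinctPosts("http://jroller.com", hosts, path)); // prints 1
    }
}
```

Under those assumptions, a single entry counts as two posts as soon as the feed has been regenerated under both hostnames, which matches the duplicates showing up in the aggregator.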

This site was missing the absolute URL to site setting as well, so I changed it to http://raibledesigns.com. I'll try adding myself back to the JavaBlogs aggregator and hope that works!

Posted in Java at Feb 02 2004, 08:46:25 AM MST 3 Comments

Comments:

After a little further thought, I'm guessing this problem started to happen only after the JRoller server was rebuilt following a disk failure. Before the disk failure, the JRoller guys had configured Apache to do URL rewrites so that the only URL Roller would ever see was the jroller.com URL. After the failure, rewrites were not enabled and so we started to see this problem. That's my theory at the moment.

I believe there are two ways to solve this problem: 1) use Apache to rewrite URLs or 2) set Roller's "absolute URL to site" parameter.

However, I'm not yet sure the problem has been resolved - I'm still watching JRoller and Javablogs.

- Dave

Posted by Dave Johnson on February 02, 2004 at 11:43 AM MST #

I could be mistaken, but Roller also seems to update the (pubDate) element. I know that this was the reason FeedDemon was duplicating posts. Though that was prior to your adding (guid).

Cheers, Koz

Posted by Koz on February 02, 2004 at 01:59 PM MST #

I see duplicate posts in nntp//rss as well. I just assumed it was an issue with the latter, though raibledesigns is the only feed affected. --johnt

Posted by John Tangney on February 03, 2004 at 08:46 AM MST #
