Sunday, February 14th, 2010
Just a quick update. The latest dictionary update job for ReadPath just completed.
835,029 feeds monitored.
299,113,314 content items.
28,914,230,869 words for the dictionary. This works out to 123,036,971 distinct two word pairs.
Entire job ran just over 7 hours on an 8 node hadoop/hbase cluster. The job read just under 400Gb of data ...
Posted in ReadPath | No Comments »
Friday, February 12th, 2010
It's finally here, you can now post from the stories that you read on ReadPath to Facebook. The permissions on who sees what on these posts is handled on the Facebook end. You can set it so that everyone sees the posts, just friends, or friends of friends.
As always, I'm ...
Posted in ReadPath | No Comments »
Friday, February 12th, 2010
Found the fix for thumbnail images not showing on posts for all browsers other than Firefox. That'll teach me not to be lazy and test only in Firefox on a Mac. Who knew that anyone used anything else?
Turns out that Firefox is very forgiving with a fat finger typo in ...
Posted in ReadPath, updates | No Comments »
Thursday, February 11th, 2010
Well, it's only over a year overdue. I think something might have happened in the last year that may have slowed down my development a bit, but Facebook integration is finally on its way. It's now possible to link your ReadPath account to your Facebook account.
If you're already logged in ...
Posted in ReadPath | 1 Comment »
Wednesday, January 20th, 2010
When creating the code to do the scanning of content for potential thumbnail images I needed to work with the FixedThreadPool to get the level of performance that I wanted. There are large amounts of code within ReadPath that extend a class Scanner. This class creates a Thread and makes ...
Posted in ReadPath | 1 Comment »
Wednesday, January 20th, 2010
I've now added the ability to subscribe to search results. If you're a logged in user, on the search results page and on Category pages, you should see a subscribe button. By clicking on this, you will be able to create a new feed that includes items that match the ...
Posted in ReadPath | No Comments »
Wednesday, January 20th, 2010
I've updated the code to start examining content items to see if there is a primary image associated with the post. If there is an image there, then get the URL and size of that image. This is so that when the content item is displayed in snippet form a ...
Posted in ReadPath | 1 Comment »
Wednesday, January 20th, 2010
There were all sorts of updates that went out last night. One of which was a fix for the order of Folders on the News page on Chrome. All of the other major browsers keep the default order of an array in JavaScript based on insert order. It appears that ...
Posted in ReadPath | No Comments »
Saturday, January 16th, 2010
Awhile ago, I added a feature that would try and add the favicon.ico for a site next to its feed name. Most sites have this image in a common location at http://www.site.com/favicon.ico. It's also possible to set another location with a tag in the head of the html page. Since ...
Posted in ReadPath | No Comments »
Wednesday, January 13th, 2010
You might have noticed that in the last week the related categories area has gotten a lot more relevant. I'm now using the OpenCalais api to scan content items for the semantically relevant bits. The system has a rate limit, so I can't scan every item that ReadPath pulls in. ...
Posted in ReadPath | No Comments »