Thanks for the tips! I’ve been playing with the Alchemy API for NLP (http://www.alchemyapi.com/) and an API called DayLife (http://developer.daylife.com/) for news sources, etc.
I’m doing my best to make it as un-spammy as possible, but how far I can get with that remains to be seen. My plan is to take advantage of the inverted pyramid story structure so common in news reporting, along with entity extraction at the paragraph level, to get something out of it that’s more or less readable. I’ll post an example when my prototype works.
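To make the idea concrete, here is a rough sketch of how that could look, not the actual prototype. It assumes that, because of the inverted pyramid, the earliest paragraphs mentioning an entity carry the most important facts about it. The function names are hypothetical, and naive_entities is a crude stand-in (capitalized words only) for a real entity extractor such as the Alchemy API, which is not called here.

```python
import re


def split_paragraphs(text):
    """Split an article into non-empty paragraphs on blank lines."""
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def naive_entities(paragraph):
    """Hypothetical stand-in for a real entity extractor.

    Treats every capitalized word as an entity, so it also picks up
    sentence-initial words. Good enough to illustrate the selection step.
    """
    return set(re.findall(r"\b[A-Z][a-z]+\b", paragraph))


def lead_paragraphs(text, target_entity, max_paragraphs=2):
    """Return the earliest paragraphs that mention target_entity.

    Inverted pyramid assumption: paragraphs nearer the top of a news
    story matter more, so we keep the first matches and stop early.
    """
    selected = []
    for paragraph in split_paragraphs(text):
        if target_entity in naive_entities(paragraph):
            selected.append(paragraph)
        if len(selected) == max_paragraphs:
            break
    return selected
```

Calling lead_paragraphs(article_text, "Acme") on a story would return up to two of its earliest paragraphs that mention "Acme", in their original order, which should read more coherently than sentences picked from arbitrary positions.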
Just FYI, the link above (http://pwnee.com/Sequences/list.html) currently 404s.