I’m trying to do my best to make it as un-spammy as possible, but how far I can get with that remains to be seen. I have a plan to take advantage of the inverted pyramid story structure so common in news reporting, along with entity extraction on the paragraph level, to get something out of it that’s more or less readable. I’ll post an example when my prototype works.
I would start with something like reuters API, http://wordnet.princeton.edu/ and some research on these guys http://pdos.csail.mit.edu/scigen/ this is a fairly well studied problem by spammers, so I’d also work there
Thanks for the tips! I’ve been playing with the Alchemy API for NLP (http://www.alchemyapi.com/) and an API called DayLife (http://developer.daylife.com/) for news sources, etc.
I’m trying to do my best to make it as un-spammy as possible, but how far I can get with that remains to be seen. I have a plan to take advantage of the inverted pyramid story structure so common in news reporting, along with entity extraction on the paragraph level, to get something out of it that’s more or less readable. I’ll post an example when my prototype works.