[Question] Is there a lesswrong archive of all public posts?
like, for example, the Wikimedia db dumps or the Stack Exchange db dumps.
i’d like to be able to browse lesswrong while offline. even better if i can process the data with a script while offline.
it would also be useful for backup purposes: if something ever happens to the site in the long term, some of its users might have an exact copy that can be cross-compared and used to restore/share its content.
related:
Why is lesswrong blocking wget and curl (scrape)?
Can I archive content from lesswrong.com on the wayback machine (internet archive, archive.org) ?
There used to be a downloadable archive of all posts on the old LW 1.0 site. I think there was a SiteMeta post about it, but I can’t find it, and in any case that link wouldn’t work anymore. I think such an archive is a nice thing to have, and it also avoids unnecessary scraping.
@gwern : is this something you’d be interested in having?
Not particularly now that GW exists. Its posts are easy to archive, and if I need something, I can always just ask saturn2.
thanks for answering. who/what is “saturn2” ?
https://www.lesswrong.com/posts/66DXhQJyPEJNsXgfw/an-alternative-way-to-browse-greaterwrong-2-0
@clone of saturn, aka “saturn2”, runs GW
does greaterwrong.com have archive files that i can download? or do i need to archive it myself (ex: wget -mk)?
You should probably ask saturn2 for permission before doing so.
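On the "process the data with a script while offline" part: here is a minimal sketch of what that could look like once a local copy exists (made with permission). It assumes the mirror is a directory tree of saved pages whose filenames end in .html (with wget, the -E / --adjust-extension option would give that); the names index_mirror and TitleParser are made up for this example, and nothing here is specific to how LessWrong or GreaterWrong structure their pages. It only walks the directory and pulls out each page's title using Python's standard library.

```python
from html.parser import HTMLParser
from pathlib import Path


class TitleParser(HTMLParser):
    """Collects the text inside the first <title> element of a page."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._done = False
        self._parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self._done:
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._done = True

    def handle_data(self, data):
        if self._in_title:
            self._parts.append(data)

    @property
    def title(self):
        return "".join(self._parts).strip()


def index_mirror(root):
    """Return {relative path: page title} for every .html file under root.

    Assumption: the mirror stores pages as files ending in .html.
    """
    root = Path(root)
    index = {}
    for path in sorted(root.rglob("*.html")):
        parser = TitleParser()
        # errors="replace" keeps the scan going if a file is not valid UTF-8.
        parser.feed(path.read_text(encoding="utf-8", errors="replace"))
        parser.close()
        index[path.relative_to(root).as_posix()] = parser.title
    return index
```

Calling index_mirror("path/to/mirror") gives a plain dictionary that can be searched, sorted, or written out as a table of contents, all without a network connection.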