I found this useful https://impact.economist.com/perspectives/technology-innovation/measuring-cost-cybercrime/article/what’s-number-estimating-cost-cybercrime
JaimeRV
Thinking About Propensity Evaluations
A Taxonomy Of AI System Evaluations
I have been using sider for a few weeks and found it pretty helpful:
Setup:
use gpt4o-mini which is basically free and faster than doing anything in Claude or ChatGPT
mostly for papers and LW/EAF articles
I have a shortcut to add “https://r.jina.ai/″ to the url before to convert to markdown and then I just ctrl+A the entire page and chat
For privacy reasons I have only allowed the extension in https://r.jina.ai/* and https://www.youtube.com/*
I use similar prompts than Jacques. Some additional ones: —Justify your previous answers citing the from original text —Challenge my knowledge (here I have a longer promt where it asks me to du stuff like draw a mindmap, answer questions,...)
I also have it with (external) whisper cause often I think better outloud
Pros:
Fast
Basically free
Way easier to digest and interact with dry papers/articles
Customazible prompts for the conversation which make workflow faster cause you only have to click
For youtube as a first filter
Cons:
gpt40-mini (at least) hallucinates a bunch so you often have to ask to justify the answers
(as with all the chatbots) you shall take the responses with a grain of salt, be very specific with your questions and reread the original relevant sections to double check.
Other:
IMO if you end up integrating something like this in LW I think it would be net positive. Specially if you can link it to @stampy or similar to ask for clarification questions about concepts, …
I used to use that one but I moved to Sider: https://sider.ai/pricing?trigger=ext_chrome_btm_upgrd it works in all the pages, including youtube. For Papers and articles I have shortcut to automatically modify the url (adding the prefix ”https://r.jina.ai/″) so you get the markdown and then do Sider on that. With gpt4o-mini it is almost free. Also nice is Sider is that you can write your own prompt templates
Cool idea! thanks for making this! Do you happen to have also a Telegram bot for it?
Review of METR’s public evaluation protocol
List of projects that seem impactful for AI Governance
I think at thumbs up/down with a field to enter feedback would be very helpful, but there is an open issue already for that https://github.com/StampyAI/stampy-chat/issues/35
https://chat.openai.com/g/g-O6KK4ERZz-qaisi is a customer GPT that uses the Q&A from aisafety.info. https://chat.aisafety.info/ shows the sources more accurately
Thanks for sharing this! Great to see the impact of ARENA!
According to the OpenPhil public grant[1] this iteration of Arena got £245,895, and with this you were able to achieve the points mentioned in this post right?
Also it is great to hear that there are 4 new people working in AIS thanks to the program! It would be nice to know how did you manage it (and what was the counterfactual). Getting 4 people through full hiring processes within 4 weeks seems impresive, did you manage because they got jobs at orgs who were also at LISA? or there were other networking effects or other factors that made this possible?
[1] https://www.openphilanthropy.org/grants/alignment-research-engineer-accelerator-ai-safety-technical-program-2024/