JaimeRV

Karma: 49

JaimeRV Dec 12, 2024, 8:38 AM
1 point
0
in reply to: James Fox’s comment on: ARENA 4.0 Impact Report
Thanks for sharing this! Great to see the impact of ARENA!

According to the OpenPhil public grant[1] this iteration of Arena got £245,895, and with this you were able to achieve the points mentioned in this post right?

Also it is great to hear that there are 4 new people working in AIS thanks to the program! It would be nice to know how did you manage it (and what was the counterfactual). Getting 4 people through full hiring processes within 4 weeks seems impresive, did you manage because they got jobs at orgs who were also at LISA? or there were other networking effects or other factors that made this possible?

[1] https://www.openphilanthropy.org/grants/alignment-research-engineer-accelerator-ai-safety-technical-program-2024/

JaimeRV Sep 27, 2024, 9:14 AM
10 points
0
on: Is cybercrime really costing trillions per year?
I found this useful https://impact.economist.com/perspectives/technology-innovation/measuring-cost-cybercrime/article/what’s-number-estimating-cost-cybercrime

Thinking About Propensity Evaluations

Maxime Riché, Harrison G, JaimeRV and Edoardo Pona

Aug 19, 2024, 9:23 AM

10 points

0 comments27 min readLW link

A Taxonomy Of AI System Evaluations

Maxime Riché, JaimeRV, Harrison G and Edoardo Pona

Aug 19, 2024, 9:07 AM

13 points

0 comments14 min readLW link

JaimeRV Aug 14, 2024, 10:22 AM
3 points
0
in reply to: Ruby’s comment on: jacquesthibs’s Shortform
I have been using sider for a few weeks and found it pretty helpful:

Setup:
- use gpt4o-mini which is basically free and faster than doing anything in Claude or ChatGPT
- mostly for papers and LW/EAF articles
- I have a shortcut to add “https://r.jina.ai/″ to the url before to convert to markdown and then I just ctrl+A the entire page and chat
- For privacy reasons I have only allowed the extension in https://r.jina.ai/* and https://www.youtube.com/*
- I use similar prompts than Jacques. Some additional ones: —Justify your previous answers citing the from original text —Challenge my knowledge (here I have a longer promt where it asks me to du stuff like draw a mindmap, answer questions,...)
- I also have it with (external) whisper cause often I think better outloud
Pros:
- Fast
- Basically free
- Way easier to digest and interact with dry papers/articles
- Customazible prompts for the conversation which make workflow faster cause you only have to click
- For youtube as a first filter
Cons:
- gpt40-mini (at least) hallucinates a bunch so you often have to ask to justify the answers
- (as with all the chatbots) you shall take the responses with a grain of salt, be very specific with your questions and reread the original relevant sections to double check.
Other:
- IMO if you end up integrating something like this in LW I think it would be net positive. Specially if you can link it to @stampy or similar to ask for clarification questions about concepts, …

JaimeRV Aug 11, 2024, 5:07 PM
5 points
0
in reply to: jacquesthibs’s comment on: jacquesthibs’s Shortform
I used to use that one but I moved to Sider: https://sider.ai/pricing?trigger=ext_chrome_btm_upgrd it works in all the pages, including youtube. For Papers and articles I have shortcut to automatically modify the url (adding the prefix ”https://r.jina.ai/″) so you get the markdown and then do Sider on that. With gpt4o-mini it is almost free. Also nice is Sider is that you can write your own prompt templates

JaimeRV Jul 10, 2024, 11:23 AM
3 points
0
on: Announcing the Double Crux Bot
Cool idea! thanks for making this! Do you happen to have also a Telegram bot for it?

Review of METR’s public evaluation protocol

nahoj and JaimeRV

Jun 30, 2024, 10:03 PM

10 points

0 comments5 min readLW link

List of projects that seem impactful for AI Governance

JaimeRV and Teun van der Weij

Jan 14, 2024, 4:53 PM

14 points

0 comments13 min readLW link

JaimeRV Jan 3, 2024, 4:23 PM
1 point
0
in reply to: Gurkenglas’s comment on: AI Safety Chatbot
I think at thumbs up/down with a field to enter feedback would be very helpful, but there is an open issue already for that https://github.com/StampyAI/stampy-chat/issues/35

JaimeRV Jan 3, 2024, 4:19 PM
1 point
0
in reply to: mruwnik’s comment on: AI Safety Chatbot
1. https://chat.openai.com/g/g-O6KK4ERZz-qaisi is a customer GPT that uses the Q&A from aisafety.info. https://chat.aisafety.info/ shows the sources more accurately

JaimeRV

Think­ing About Propen­sity Evaluations

A Tax­on­omy Of AI Sys­tem Evaluations

Re­view of METR’s pub­lic eval­u­a­tion protocol

List of pro­jects that seem im­pact­ful for AI Governance

Thinking About Propensity Evaluations

A Taxonomy Of AI System Evaluations

Review of METR’s public evaluation protocol

List of projects that seem impactful for AI Governance