AI researcher
Jessica Rumbelow
Not yet, but there’s no reason why it wouldn’t be possible. You can imagine microscope AI for language models. It’s on our to-do list.
Good to know. Thanks!
Yep: aside from running forward prop n times to generate an output of length n, we can just optimise the mean probability of the target tokens at each position in the output; it’s already implemented in the code. It does take way longer to find optimal completions, though.
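In case it helps, here’s a minimal sketch of what that looks like: optimising a continuous prompt so that a frozen GPT-2 (via HuggingFace transformers) assigns high mean log-probability to every token of a fixed multi-token target, scored in a single teacher-forced forward pass rather than n generation steps. The model choice, target string, prompt length, and hyperparameters are illustrative assumptions, not the actual repo code, and it skips the step of projecting the optimised embeddings back onto real tokens.

```python
# Illustrative sketch: optimise a continuous "soft prompt" so a frozen GPT-2
# assigns high mean log-probability to each token of a fixed target completion.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").to(device).eval()
for p in model.parameters():
    p.requires_grad_(False)  # the model stays frozen; only the prompt is trained

target = " the quick brown fox"  # assumed example target completion
target_ids = tok(target, return_tensors="pt").input_ids.to(device)  # (1, n)
n_prompt, n_target = 8, target_ids.shape[1]

# Learnable prompt embeddings, initialised near the embedding scale.
emb = model.get_input_embeddings()
prompt_embs = torch.nn.Parameter(
    0.02 * torch.randn(1, n_prompt, emb.weight.shape[1], device=device)
)
opt = torch.optim.Adam([prompt_embs], lr=0.1)
target_embs = emb(target_ids)  # (1, n, d), constant

for step in range(200):
    opt.zero_grad()
    # One forward pass over [prompt ; target]: the logits at position i
    # predict token i+1, so positions n_prompt-1 .. n_prompt+n-2 score the target.
    inputs = torch.cat([prompt_embs, target_embs], dim=1)
    logits = model(inputs_embeds=inputs).logits
    pred = logits[:, n_prompt - 1 : n_prompt - 1 + n_target, :]
    logp = torch.log_softmax(pred, dim=-1)
    token_logp = logp.gather(-1, target_ids.unsqueeze(-1)).squeeze(-1)
    loss = -token_logp.mean()  # maximise mean target log-probability
    loss.backward()
    opt.step()
```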
More detail on this phenomenon here: https://www.lesswrong.com/posts/aPeJE8bSo6rAFoLqg/solidgoldmagikarp-plus-prompt-generation
Yeah, I think it could be! I’m considering pursuing it after SERI-MATS. I’ll need a couple of cofounders.
Guardian AI (Misaligned systems are all around us.)
The Ground Truth Problem (Or, Why Evaluating Interpretability Methods Is Hard)
Why I’m Working On Model Agnostic Interpretability
“Being able to reorganise a question in the form of a model-appropriate game” seems like something we’ve already built a set of reasonable heuristics around: categorising different types of problems and their appropriate translations into ML-able tasks. There are well-established ML approaches to, e.g., image captioning, time-series prediction, audio segmentation, and so on. Is the bottleneck you’re concerned with the lack of breadth and granularity of these problem sets, OP? If so, can we mark progress (to some extent) by the number of problem sets we have robust ML translations for?
What’s an SCP?