I completely agree on the importance of strategic thinking. Personally, I like to hear what early AI pioneers had to say about modeling AI, for example Minsky’s Society of Mind. I believe the trajectory of AI must be informed by developments in epistemology, and I’ve basically bet my research on the idea that epistemological progress will shape AGI.
I think LLMs are even worse — not just with rare encodings, but also when it comes to reasoning with rare structures. Theory-of-mind tasks provide good evidence for this. LLMs aren’t good at inferring others’ mental states; rather, they tend to mimic reasoning when reasoning steps are present in the training data.
This is a highly intriguing research finding. It seems consistent with observations in multi-modal models, where different data types can effectively jailbreak each other.
At the same time, unlike visual inputs, code is processed entirely as text in the same modality as natural language. This suggests two possible approaches to analyzing the underlying cause.
1. Data type: Analyzing what makes code distinct from natural language data may help explain this phenomenon.
2. Representation: Examining which neurons change during fine-tuning and analyzing their correlations could provide a clearer causal explanation (a rough sketch of this kind of comparison is at the end of this comment).
Based on your experimental insights, which approach do you think is more effective for identifying the cause of this phenomenon?
Curious to hear your thoughts!
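To make approach 2 a bit more concrete, here is a minimal sketch, not tied to your setup, of how one might rank neurons by how much fine-tuning moved their weights using PyTorch and Hugging Face Transformers. The checkpoint names are placeholders for any base / fine-tuned pair with identical architecture, and the resulting "most-shifted units" would only be a starting point for the correlation analysis, not evidence by themselves.

```python
# Minimal sketch (placeholder checkpoints, not anyone's actual method):
# rank output units by the L2 norm of their weight change under fine-tuning.
import torch
from transformers import AutoModelForCausalLM

BASE_ID = "org/base-model"         # hypothetical base checkpoint
TUNED_ID = "org/fine-tuned-model"  # hypothetical fine-tuned checkpoint

base = AutoModelForCausalLM.from_pretrained(BASE_ID)
tuned = AutoModelForCausalLM.from_pretrained(TUNED_ID)

per_neuron_change = {}
with torch.no_grad():
    for (name, p_base), (_, p_tuned) in zip(
        base.named_parameters(), tuned.named_parameters()
    ):
        if p_base.ndim == 2:  # weight matrices; each row feeds one output unit
            # per-unit L2 norm of the weight delta
            per_neuron_change[name] = (p_tuned - p_base).norm(dim=1)

# Collect the most-shifted units across all layers; these are candidates
# for the correlation / ablation analysis in approach 2.
candidates = []
for name, deltas in per_neuron_change.items():
    k = min(5, deltas.numel())
    values, indices = torch.topk(deltas, k)
    candidates.extend((v.item(), name, i.item()) for v, i in zip(values, indices))

for magnitude, layer, unit in sorted(candidates, reverse=True)[:20]:
    print(f"{layer}[{unit}] moved by {magnitude:.4f}")
```

Whether weight-level deltas actually track the behavioral change is itself an open question; comparing activations on matched prompts would be a natural follow-up.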
I’m really interested in AI and want to build something amazing, so I’m always looking to expand my imagination! Sure, research papers are full of ideas, but I feel like insights into more universal knowledge spark a different kind of creativity. I found LessWrong through LLM-related topics, but the posts here give me the joy of exploring a much broader world!
I’m deeply interested in the good and bad of AI. While aligning AI with human values is important, alignment can be defined in many ways. One of my goals is to build up my thoughts on what’s right or wrong and what’s possible or impossible, and to write about them.
My use of “must” wasn’t just about technical necessity, but rather about a philosophical or strategic imperative: that we ought to inform AGI development not only through recent trends in deep learning (say, post-2014), but also by drawing on longer-standing academic traditions, like epistemic logic.