Jan Rzymkowski

Karma: 5

Jan Rzymkowski Aug 2, 2020, 11:40 PM
4 points
on: Sufficiently Advanced Language Models Can Do Reinforcement Learning
There a huge leap between a procedure allowing a predictive model to iteratively decrease False Positive Rate and having an AGI.

Jan Rzymkowski Aug 2, 2020, 11:34 PM
1 point
in reply to: TurnTrout’s comment on: What specific dangers arise when asking GPT-N to write an Alignment Forum post?
Upon reflection, you’re right that it won’t be maximizing response per se.
But as we get deeper it’s not so straightforward. GTP-3 models can be trained to minimize prediction loss (or, plainly speaking, to simply predict more accurately) on many different tasks, which usually are very simply stated (eg. choose a word that would fill the blank).
But we end up with people taking models trained thusly and use them to generate a long texts based on some primer. And yes, in most cases such abuse of the model will end up with text that is simply coherent. But I would expect humans to have a tendency to conflate coherence and persuasiveness.
I suppose one can fairly easily choose such prediction loss for GTP-3 models that the longer texts would have some desired characteristics. But also even standard tasks probably shape GTP-3 so that it would keep producing vague sentences that continue the primer and that give the reader a feel of “it making sense”. That would entail possibly producing fairly persuasive texts reinforcing primer thesis.

Jan Rzymkowski Jul 31, 2020, 12:45 PM
1 point
on: What specific dangers arise when asking GPT-N to write an Alignment Forum post?
As far as I understand GPT-N it’s not very agent-like (it doesn’t perform me vs environment abstraction and doesn’t look for ways to transform its perceived environment to satisfy some utility function). I wouldn’t expect it to “scheme” against people since it lacks any concept of “affecting its environment”.

However it seems likely that GTP-N can perfect the skill of crowd-pleasing (we already see that; we’re constantly amazed by it, despite little meaning of created texts). It can precisely modulate it’s tone and identify the talking points that get the most response.

So I expect the GTP-N generated texts to sound really persuasive, not because of novel ideas but because of superhuman ability to compose heard ideas into persuasive essay.

I would expect GTP-N to focus on presenting solutions for alignement (therefore making us overly optimistic about naive approaches), presenting novel risks (it’s easy to make something up by simple rehashing) and possibly venturing in philosophical muddling the water (humans prove to be very easily engaged by certain topics, like self-consciousness)

Jan Rzymkowski Aug 19, 2019, 2:13 PM
2 points
on: Being the (Pareto) Best in the World
This analysis seems to quietly assume that various important skills are independent variables and therefor many people in top of their field will neccesserly be average in various other skills (actually, the chart goes even further and assumes that there’s universal negative correlation between skills—I’m not even sure if that’s mathematically possible for more than 2 variables).
World’s greatest genontologist will probably be very good at statistics and even Ed Jaynes would probably be a above average generontologist just because he can effectively interpret generontology data.

Jan Rzymkowski Jun 15, 2019, 1:08 AM
2 points
on: What kind of thing is logic in an ontological sense?
Appliability of logic in physical world is sort of a theorem based on the laws of physics (mostly more metaphysical and less technical like the persistence of objects, that themselves as theorems of the basic laws of physics) and the laws governing the process of formulating atomic statements based on the observations.
At the same time we need to be careful as we can easily fall into the trap of unfalsifiability—when the predictions of logic fail, we’re used to say that the problem was with our atomic statements.
That’s just the sketch of the full explanation of the topic, which would require at least a chapter.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer