Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
MadHatter comments on
Real-Time Research Recording: Can a Transformer Re-Derive Positional Info?
MadHatter
4 Nov 2022 5:14 UTC
1
point
2
Yeah, just changing the max to a min produces this much smoother loss curve from your notebook..
Back to top
Yeah, just changing the max to a min produces this much smoother loss curve from your notebook..