Buck comments on Sam Marks’s Shortform
Buck
18 Dec 2024 4:55 UTC
If you have kv caching between inference calls, this shouldn’t be a big cost.
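As a rough illustration of why that might hold, here is a toy cost model. It assumes the repeated inference calls share a long common prefix and differ only in a short suffix, which the comment does not state explicitly. `ToyPrefixCache` and `tokens_computed` are hypothetical names made up for this sketch, not any real inference library's API, and counting tokens is only a proxy for cost (new tokens still attend over the cached ones).

```python
from typing import List, Sequence, Tuple


class ToyPrefixCache:
    """Hypothetical stand-in for a KV cache: remembers token sequences already processed."""

    def __init__(self) -> None:
        self._cached: List[Tuple[int, ...]] = []

    def _longest_cached_prefix(self, tokens: Sequence[int]) -> int:
        # Length of the longest prefix of `tokens` shared with any earlier call.
        best = 0
        for seq in self._cached:
            n = 0
            for a, b in zip(seq, tokens):
                if a != b:
                    break
                n += 1
            best = max(best, n)
        return best

    def run(self, tokens: Sequence[int]) -> int:
        """Return how many tokens need a fresh forward pass for this call."""
        reused = self._longest_cached_prefix(tokens)
        self._cached.append(tuple(tokens))
        return len(tokens) - reused


def tokens_computed(calls: Sequence[Sequence[int]], use_cache: bool) -> int:
    """Total tokens processed across all calls, with or without prefix reuse."""
    cache = ToyPrefixCache()
    total = 0
    for tokens in calls:
        total += cache.run(tokens) if use_cache else len(tokens)
    return total


# Assumed workload: 5 calls sharing a 1000-token prefix, each with a distinct 20-token suffix.
shared_prefix = list(range(1000))
calls = [shared_prefix + [10_000 + i] * 20 for i in range(5)]

print(tokens_computed(calls, use_cache=False))  # 5100
print(tokens_computed(calls, use_cache=True))   # 1100
```

Under these assumptions, only the first call pays for the shared prefix; each later call pays only for its own suffix.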