PhD student in reinforcement learning and interpretability at University of Bath.
Karolis Jucys
Karma: 18
Would “delta hedging” be useful here? It helps hedge long option exposure by shorting some amount of a stock.
For example, at the money calls generally have a delta of 0.5, so holding 100 at the money calls and shorting 50 shares makes you roughly neutral for small moves in the underlying asset.
Would probably require monthly rebalancing based on how many options you effectively hold and market moves. It also wouldn’t work well if AGI happens at GDM and Google stock goes exponential (“volatility smile” problem).
non pdf arxiv link: https://arxiv.org/abs/2305.15324
For the four examples of
24-16=12, 53-25=25, 34-16=13, 63-17=16
is this the pattern?ab-cd=ca
DreamerV3 is not a great example, as they use so many hacks to make the task easier that it barely counts as getting a diamond or Minecraft anymore. Action shaping, macro actions, instant block breaking, fake “bug fixing”, all to get a diamond in 0.4% of episodes.
More info here: https://x.com/Karolis_Ram/status/1785750372394348632