All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 242526 27 28 29 30 31

[Question] Would it be useful to collect the contexts, where various LLMs think the same?

Martin Vlach24 Aug 2023 22:01 UTC

6 points

1 comment1 min readLW link

[Question] Help Needed: Crafting a Better CFAR Follow-Up Survey

kotrfa24 Aug 2023 17:26 UTC

9 points

2 comments1 min readLW link

AI #26: Fine Tuning Time

Zvi24 Aug 2023 15:30 UTC

49 points

6 comments33 min readLW link

(thezvi.wordpress.com)

Is this the beginning of the end for LLMS [as the royal road to AGI, whatever that is]?

Bill Benzon24 Aug 2023 14:50 UTC

3 points

15 comments3 min readLW link

AI Safety Bounties

PatrickL24 Aug 2023 14:29 UTC

11 points

0 comments7 min readLW link

(rethinkpriorities.org)

AI Regulation May Be More Important Than AI Alignment For Existential Safety

otto.barten24 Aug 2023 11:41 UTC

65 points

39 comments5 min readLW link

AI Probability Trees—Katja Grace

Nathan Young24 Aug 2023 9:45 UTC

8 points

3 comments7 min readLW link

[Question] What wiki-editing features would make you use the LessWrong wiki more?

Nathan Young24 Aug 2023 9:22 UTC

21 points

27 comments1 min readLW link

On Lucidity

Leber24 Aug 2023 8:45 UTC

0 points

5 comments1 min readLW link

(leber.substack.com)

The God of Humanity, and the God of the Robot Utilitarians

Raemon24 Aug 2023 8:27 UTC

79 points

13 comments2 min readLW link 1 review

I measure Google’s MusicLM over 3 months as it appears to go from jaw-dropping to embarrassingly repeating itself

AttentionResearcher24 Aug 2023 4:20 UTC

19 points

4 comments4 min readLW link

Enhancing Corrigibility in AI Systems through Robust Feedback Loops

Justausername24 Aug 2023 3:53 UTC

1 point

0 comments6 min readLW link

The lost millennium

Ege Erdil24 Aug 2023 3:48 UTC

53 points

14 comments3 min readLW link

Regreasing a KitchenAid Mixer

jefftk24 Aug 2023 2:30 UTC

15 points

0 comments1 min readLW link

(www.jefftk.com)

Assessment of intelligence agency functionality is difficult yet important

trevor24 Aug 2023 1:42 UTC

47 points

5 comments9 min readLW link

China’s position on autonomous weapons

bhauth23 Aug 2023 22:20 UTC

17 points

2 comments1 min readLW link

(academic.oup.com)

Diet Experiment Preregistration: Long-term water fasting + seed oil removal

lc23 Aug 2023 22:08 UTC

56 points

18 comments1 min readLW link

The Low-Hanging Fruit Prior and sloped valleys in the loss landscape

Dmitry Vaintrob and Nina Panickssery

23 Aug 2023 21:12 UTC

82 points

1 comment13 min readLW link

Governing, Fast and Slow

Carson23 Aug 2023 20:01 UTC

3 points

0 comments3 min readLW link

A problem with the most recently published version of CEV

ThomasCederborg23 Aug 2023 18:05 UTC

10 points

7 comments8 min readLW link

[Question] Which paths to powerful AI should be boosted?

Zach Stein-Perlman23 Aug 2023 16:00 UTC

5 points

1 comment1 min readLW link

A Theory of Laughter

Steven Byrnes23 Aug 2023 15:05 UTC

102 points

14 comments28 min readLW link

Why Is No One Trying To Align Profit Incentives With Alignment Research?

Prometheus23 Aug 2023 13:16 UTC

51 points

11 comments4 min readLW link

Exploring the Responsible Path to AI Research in the Philippines

MiguelDev23 Aug 2023 8:44 UTC

6 points

0 comments6 min readLW link

[Question] Do agents with (mutually known) identical utility functions but irreconcilable knowledge sometimes fight?

mako yass23 Aug 2023 8:13 UTC

14 points

13 comments1 min readLW link

South Bay ACX/SSC Fall Meetups Everywhere

allisona23 Aug 2023 3:00 UTC

3 points

0 comments1 min readLW link

Separate the truth from your wishes

Jacob G-W23 Aug 2023 0:52 UTC

6 points

3 comments1 min readLW link

(jacobgw.com)

Implications of evidential cooperation in large worlds

Lukas Finnveden23 Aug 2023 0:43 UTC

39 points

4 comments17 min readLW link

(lukasfinnveden.substack.com)

South Bay Casual Group Walk

allisona22 Aug 2023 22:43 UTC

7 points

2 comments1 min readLW link

Walk while you talk: don’t balk at “no chalk”

dkl922 Aug 2023 21:27 UTC

41 points

10 comments2 min readLW link

(dkl9.net)

State of Generally Available Self-Driving

jefftk22 Aug 2023 18:50 UTC

66 points

6 comments2 min readLW link

(www.jefftk.com)

Seth Explains Consciousness

Jacob Falkovich22 Aug 2023 18:06 UTC

38 points

125 comments14 min readLW link

(putanumonit.com)

ChatGPT challenges the case for human irrationality

Kevin Dorst22 Aug 2023 12:46 UTC

3 points

10 comments7 min readLW link

(kevindorst.substack.com)

[Question] Does one have reason to believe the simulation hypothesis is probably true?

kuira22 Aug 2023 8:34 UTC

1 point

20 comments1 min readLW link

The Joan of Arc Challenge For Objective List Theory

omnizoid22 Aug 2023 8:01 UTC

−2 points

4 comments10 min readLW link

The Lopsided Lives Argument For Hedonism About Well-being

omnizoid22 Aug 2023 7:59 UTC

−2 points

8 comments22 min readLW link

Causality and a Cost Semantics for Neural Networks

scottviteri21 Aug 2023 21:02 UTC

22 points

1 comment1 min readLW link

Ideas for improving epistemics in AI safety outreach

mic21 Aug 2023 19:55 UTC

64 points

6 comments3 min readLW link

Rice’s Theorem says that AIs can’t determine much from studying AI source code

Michael Weiss-Malik21 Aug 2023 19:05 UTC

−12 points

4 comments1 min readLW link

Large Language Models will be Great for Censorship

Ethan Edwards21 Aug 2023 19:03 UTC

183 points

14 comments8 min readLW link

(ethanedwards.substack.com)

“Throwing Exceptions” Is A Strange Programming Pattern

Thoth Hermes21 Aug 2023 18:50 UTC

−2 points

13 comments6 min readLW link

(thothhermes.substack.com)

[Question] Which possible AI systems are relatively safe?

Zach Stein-Perlman21 Aug 2023 17:00 UTC

42 points

20 comments1 min readLW link

Self-shutdown AI

jan betley21 Aug 2023 16:48 UTC

13 points

2 comments2 min readLW link

Contextual Translations—Attempt 1

Varshul Gupta21 Aug 2023 14:30 UTC

−1 points

0 comments2 min readLW link

(dubverseblack.substack.com)

DIY Deliberate Practice

lynettebye21 Aug 2023 12:22 UTC

62 points

4 comments5 min readLW link

(lynettebye.com)

Downstairs Opening: 2br Apartment

jefftk21 Aug 2023 0:50 UTC

8 points

2 comments3 min readLW link

(www.jefftk.com)

Efficiency and resource use scaling parity

Ege Erdil21 Aug 2023 0:18 UTC

51 points

1 comment20 min readLW link 1 review

Ruining an expected-log-money maximizer

philh20 Aug 2023 21:20 UTC

31 points

33 comments1 min readLW link 1 review

(reasonableapproximation.net)

Steven Wolfram on AI Alignment

Bill Benzon20 Aug 2023 19:49 UTC

66 points

15 comments4 min readLW link

[Question] What value does personal prediction tracking have?

fx20 Aug 2023 18:43 UTC

7 points

3 comments1 min readLW link