Thanks!
The reason I asked you to write some-version-of-this is, I have in fact noticed myself veering towards a certain kind of melodrama about the whole x-risk thing, and I've found various flavors of your "have you considered just... not doing that?" to be helpful to me. "Oh, I can just choose to not be melodramatic about things."
(on net I am still relatively dramatic/narrative-shaped as rationalists go, but I've deliberately tuned the knob in the other direction periodically and think various little bits of writing of yours have helped me)
I liked the framing you gave at Solstice: a general prompt to treat it as a skill issue, without being about the exact recipe.
I read this as being premised on "going crazy about the world ending" meaning that you end up acting obviously stupid and crazy, with the response basically being "find a way to not do that".
My model of going crazy at the end of the world isn't so much that you do something that's obviously crazy in your own view, but that the world ending is so out-of-distribution for everything you've been doing so far that you no longer have any idea what a sane or rational response even is. For instance, if your basic sense of meaning has been anchored to the world persisting after you and your making some kind of mark on it, you won't know what to do with your life if there won't be anything to make a mark on.
So staying sane requires also knowing what to do, not just knowing what not to do. Is there anything you would say about that?
Base plan: Stay still, die quietly.
There, you now have a better plan than going crazy! If you think up an even better plan you can substitute that one. Meliorization!
The point is that "maintaining sanity" is a (much) higher bar than "Don't flail around like a drama queen". Maintaining sanity requires you to actually update on the situation you find yourself in, and continue to behave in ways that make sense given the reality as it looks after having updated on all the information available. Not matching obvious tropes of people losing their mind is a start, but it is no safe defense. Especially since not all repeated/noticeable failure modes are active and dramatic, and not all show up in fiction.
For example, if there's something to David Gross's comment that the wretched journalist was actually giving you an opening because they saw importance in what you had to say about the situation, blowing off a genuine opening to influence the discourse on AI safety while calling it "doing nothing" would not be sane. Preemptive contempt has a purpose in bounded rationality, but it's still a form of pushing away from the information the journalist has to offer. It can make sense within a grand plan that weights this journalist low, but that requires a grand plan.
How do you actually orient to the world, now that we are what we are? Are you still working to...
"How are you coping with the end of the world?" journalists sometimes ask me... The journalist is imagining a story that is about me, and about whether or not I am going insane...
Seems too cynical. I can imagine myself as a journalist asking you that question not because I'm hoping to write a throw-away cliche of an article, but because if I take seriously what you're saying about AGI risk, you're on the cutting edge of coping with that, and the rest of us will have to cope with that eventually, and we might have an easier time of it if we can learn from your path.
I would of course take the question very differently from a journalist who had dealt with that slight inconvenience of trying to get to grips with the idea and started to seem worried, rather than one who'd had the brilliant idea of writing a Relatable Character-Focused Story instead.
Perhaps I overestimate how much I can deduce from tone and context, but to me it seems like there's a visible departure from the norm for the person who becomes worried themselves and wonders "How will people handle it?" versus the kid visiting the zoo to look at the strange creatures who believe strange things.
[There's also a much more banal answer that I wouldn't be surprised to find is a major, deep underlying driver, with all the interesting psychology provided in the OP being some sort of half-conscious rationalization of our actual deep-rooted tendencies:] Not going insane is simply the very natural default outcome for humans, even in a situation that feels this dire:
While shallowly it might feel like it would, going insane actually appears to me to NOT AT ALL be the default human reaction to an anticipation of (even a quite high probability of) the world ending (even very soon). I haven't done any stats or research, but everything I've ever seen or heard of seems to suggest to me:
Makes sense. Surely there were many cases in which our ancestors' "family and/or friends and/or tribe were facing extinction," and going insane in those situations would've been really maladaptive! If anything, the people worried about AI x-risk have a more historically-normal amount of worry-about-death than most other people today.
They didn't need to deal with social media informing them that they need to be traumatized now, and form a conditional prediction of extreme and self-destructive behavior later.
A cynical theory of why someone might believe going insane is the default human reaction: weaponized incompetence, absolving them of responsibility for thinking clearly about the world, because they can't handle the truth, and they can't reasonably be expected to because no normal human can either.
This is why, in a much more real and also famous case, President Truman was validly angered and told "that son of a bitch", Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman. Oppenheimer was trying to have nuclear weapons be about Oppenheimer's remorse at having helped create nuclear weapons. This feels obviously icky to me; I would not be surprised if Truman felt very nearly the same.
I did sympathise with Truman, as that scene is portrayed in Nolan's movie, more than most seem to have (or even more than the movie intended). But I am not sure that wasn't just Truman making the bombs about himself instead - he made the call, after all; it was his burden to bear. Which again sort of shifts it away from being about, you know, the approximately 200k civilians they killed and stuff.
Truman only made the call for the first bomb; the second was dropped by the military without his input, as if they were conducting a normal firebombing or something. Afterward, he cancelled the planned bombings of Kokura and Niigata, establishing presidential control of nuclear weapons.
There is also recent debate about whether Truman was even well informed about the fact that Hiroshima was a city rather than a "purely military target", eg see the book The Most Awful Responsibility, well reviewed by many including Richard Rhodes, as well as the excellent interview with the author by Dan Carlin.
Dan Carlin recently did a Hardcore History Addendum show about Truman called Atomic Accountability. It was an interview with Alex Wellerstein, who brings into question how much Truman actually knew about the location where the first bomb was dropped. Truman (possibly) thought that ruling out Kyoto (which was number one on the list) meant he was ruling out cities as targets, and didn't know Hiroshima was a city. This seems wild, until you factor in how all the information was being fed to him, how long he'd known about the nuclear program, and what the competing military interests were. Worth a listen if you're into the topic, as it's a new perspective.
All of this is not to be confused with the Buddhist doctrine that every form of negative internal experience is your own fault for not being Buddhist enough.
Not really, but it's a long explanation and at this point I'm pretty sure some of the inference steps have to be confirmed by laborious trained processes. Nor is this process about reality (as many delusional Buddhists seem to insist), but more like choosing to run a different OS on one's hardware. The size of the task and the low probability of success make it not worth the squeeze for many, afaict. For the record, in case it is helpful to anyone at all, there are three types of dukkha, and painful sensations are explicitly the ones one can do nothing about (other than mundane skillful action). It is the dukkha of change (stuck priors) and the dukkha of fabrications (much more complicated) that Buddhist training eliminates.
But the thing I actually want to comment about is related to a point I've had a really hard time communicating to people about the deciding to be sane thing. It's a kind of scale-free mental move where people seem to have a really hard time with self-reference, thinking it's some sort of gotcha when it isn't....
Hey Cole! I also went through a period of feeling pretty worried about s-risks, and have recently come out the other side. If you'd like someone to talk to, or even any advice re: any materials you might find helpful for coming to accept/loosen the grip of fear and anxiety, my inbox is open (I'm a clinical psych PhD student and have lots of resources for existential/humanist therapy, compassion-focused therapy, CBT, DBT, etc.). I've probably read a lot of what you're worried about, so you don't need to worry about having any hazardous effect on me :)
Also, I'd love to learn more from you about your research! I like your posts.
Wow, this sure is a much clearer way to look at the self-pseudo-prediction/action-plan thingy than any I've seen laid out before.
The third way I stay sane is a fiat decision to stay sane.
My mental landscape contains that option; I take it.
This is the point I am even less expecting to be helpful, or to correspond to any actionable sort of plan for most readers.
Some years ago, I had a friend who told me she was still anorexic even though the reason she originally acquired anorexia no longer applied[1].
I responded "Have you considered not being anorexic?" She thought about it and replied something like "No, actually."
Two weeks later she thanked me for helping to cure her anorexia.
This is the type of advice that I expect to be profoundly unhelpful to >95% of people in that position (and indeed is rightfully lampooned approximately everywhere). Yet it was the exact thing this specific person needed to hear, and hopefully "you can just decide to stay sane" is the exact thing some small fraction of people reading your post needed to hear as well.
(censoring the exact reason)
In what sense are you using "sanity" here? You normally place the bar for sanity very high, like ~1% of the general population high. A big chunk of people I've met in the UK AI risk scene I would call . Does mean?
This is about "insane" in the sense of people ceasing to meet even their own low bars for sanity.
To be clear, I actually do this very rarely
Why do you only do it very rarely? Is there a non-obvious cost?
Curated. It helps to read accounts of how other people aren't wrecked by the current state of the world, especially from someone who has a good model of their mental world and who has been dealing with this longer than most people. And there are lots of interesting things here, about being genre-savvy, the asides on predictions vs plans, and the Irori motto and image (which I love).
I liked your account of:
looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen.
I do a similar sort of thing myself sometimes, and similarly do not think it is the same as predictive processing theory, which I don't believe in for several reasons.
I would call that "visualization", and I'd say that it's not hyperstition/woo because it's not believing in a prediction, it's forming a plan. (E...
Sanity has numerous indicators.
For example, when paranoid crazy people talk about the secret courts that control the spy machines, they don't provide links to wikipedia, but I do! This isn't exactly related, but if you actually have decent security mindset then describing real attacks and defenses SOUNDS crazy to normies, and for PR purposes I've found that it is useful to embrace some of that, but disclaim some of it, in a mixture.
I'm posting this on "Monday, December 8th" and I wrote that BEFORE looking it up to make sure I remembered it correctly and cr...
I teach a course at Smith College called the economics of future technology, in which I go over reasons to be pessimistic about AI. Students don't ask me how I stay sane, but why I don't devote myself to just having fun. My best response is that for a guy my age with my level of wealth, giving in to hedonism means going to Thailand for sex and drugs, an outcome my students (who are mostly women) find "icky".
...Even if they had almost destroyed the world, the story would still not properly be about their guilt or their regret, it would be about almost destroying the world. This is why, in a much more real and also famous case, President Truman was validly angered and told "that son of a bitch", Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman. Oppenheimer was trying to have nuclear weapons be about Oppenheimer's remorse at having helped create nuclear weapons. This feels obviously icky to me; I would not be surpr
The technique is older than the "active inference" malarky, but the way I wrote about it is influenced by my annoyance with "active inference" malarky.
Oh, absolutely not. Our incredibly badly designed bodies do insane shit like repurposing superoxide as a metabolic signaling molecule. Our incredibly badly designed brains have some subprocesses that take a bit of predictive machinery lying around and repurpose it to send a control signal, which is even crazier than the superoxide thing, which is pretty crazy. Prediction and planning remain incredibly distinct as structures of cognitive work, and the people who try to deeply tie them together by writing wacky equations that sum them both together plus throwing in an entropy term, are nuts. It's like the town which showed a sign with its elevation, population, and year founded, plus the total of those numbers. But one reason why the malarky rings true to the knowlessones is that the incredibly badly designed human brain actually is grabbing some bits of predictive machinery and repurposing them for control signals, just like the human metabolism has decided to treat insanely reactive molecular byproducts as control signals. The other reason of course is the general class of malarky which consists of telling a susceptible person that two different things are the same.
Prediction and planning remain incredibly distinct as structures of cognitive work,
I disagree. (Partially.) For a unitary agent who is working with a small number of possible hypotheses (e.g., 3), and a small number of possible actions, I agree with your quoted sentence.
But let’s say you’re dealing with a space of possible actions that’s much too large to let you consider each exhaustively, e.g. what blog post to write (considered concretely, as a long string of characters).
It’d be nice to have some way to consider recombinable pieces, e.g. “my blog post could include idea X”, “my blog post could open with joke J”, “my blog post could be aimed at a reader similar to Alice”.
Now consider the situation as seen by the line of thinking that is determining: “should my blog post be aimed mostly at readers similar to Alice, or at readers similar to Bob?”. For this line of thinking to do a good estimate of ExpectedUtility(post is aimed at Alice), it needs predictions about whether the post will contain idea X. However, for the line of thinking that is determining whether to include idea X (or the unified agent, at those moments when it is actively considering this), it’ll of course need go...
In this example, you're trying to make various planning decisions; those planning decisions call on predictions; and the predictions are about (other) planning decisions; and these form a loopy network. This is plausibly an intrinsic / essential problem for intelligences, because it involves the intelligence making predictions about its own actions--and those actions are currently under consideration--and those actions kinda depend on those same predictions. The difficulty of predicting "what will I do" grows in tandem with the intelligence, so any sort of problem that makes a call to the whole intelligence might unavoidably make it hard to separate predictions from decisions.
A further wrinkle / another example is that a question like "what should I think about (in particular, what to gather information about / update about)", during the design process, wants these predictions. For example, I run into problems like:
A further wrinkle / another example is that a question like "what should I think about (in particular, what to gather information about / update about)", during the design process, wants these predictions.
Yes; this (or something similar) is why I suspect that "'believing in' atoms" may involve the same cognitive structure as "'believing in' this bakery I am helping to create" or "'believing in' honesty" (and a different cognitive structure, at least for ideal minds, from predictions about outside events). The question of whether to "believe in" atoms can be a question of whether to invest in building out and maintaining/tuning an ontology that includes atoms.
Parts of that made me feel as if I understand my procrastination habit a bit better. That’s more mundane than sanity but still.
I want to say something about how this post lands for people like me -- not the coping strategies themselves, but the premise that makes them necessary.
I would label myself as a "member of the public who, perhaps rightly or wrongly, isn't frightened-enough yet". I do have a bachelor's degree in CS, but I'm otherwise a layperson. (So yes, I'm using my ignorance as a sort of badge to post about things that might seem elementary to others here, but I'm sincere in wanting answers, because I've made several efforts this year to be helpful in the "communication,...
Re
I'm not convinced, though, that ASI will bother to kill us, or that it would do so immediately if it did.
I don't think we're certainly doomed (and have shallower models than Eliezer and some others here), but for me the strongest arguments for why things might go very badly:
These arguments are related to each other, and not independe...
I was doing do-nothing meditation maybe a month ago, managed to switch to a frame (for a few hours) where I felt planning as predicting my actions, and acting as perceiving my actions. IIRC, I exited when my brother-in-law asked me a programming question, 'cause maintaining that state took too much brainpower.
I think a lot of human action is simply "given good things happen, what will I do right now?", which obviously leads to many kinds of problems. (Most obviously:)
One of the ways you can get up in the morning, if you are me, is by looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen. (To be clear, I actually do this very rarely; it is just a fun fact that this is a way I can defeat bed inertia.)
I do this, or something very much like this.
For me, it's like the motion of setting a TAP, but to fire imminently instead of at some future trigger, by doing cycles of multi-sensory visualization of the behavior in question.
Besides being a thing I can just decide, my decision to stay sane is also something that I implement by not writing an expectation of future insanity into my internal script / pseudo-predictive sort-of-world-model that instead connects to motor output.
Does implementing a trigger action plan by simulating observing the trigger and then taking the action, which needs to call up your visual, kinaesthetic and other senses, route through similar machinery to what you're describing here? Because it sounds vaguely similar, but: A) I wouldn't describe what I do th...
One way I could write a computer program that e.g. lands a rocket ship is to simulate many landings that could happen after possible control inputs, pick the simulated landing that has properties I like (such as not exploding and staying far from actuator limits), and then run a low latency loop that locally makes reality track that simulation, counting on the simulation to reach a globally pleasing end. (A toy sketch of this scheme is below.)
Is this what you mean by loading something into your pseudo prediction?
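For concreteness, here is a minimal, hypothetical sketch of that simulate-pick-track scheme in Python. The 1-D dynamics, the cost function, and the tracking gains are all made-up illustrative assumptions, not any real landing controller and not anything from the post.

```python
import random

DT = 0.1           # timestep (seconds) -- illustrative assumption
G = -9.8           # gravity (m/s^2)
MAX_THRUST = 20.0  # max upward acceleration from the engine (m/s^2) -- made up

def simulate(h0, v0, thrust_schedule):
    """Roll toy 1-D dynamics forward under a fixed thrust schedule."""
    h, v, traj = h0, v0, [(h0, v0)]
    for u in thrust_schedule:
        v += (G + min(max(u, 0.0), MAX_THRUST)) * DT
        h += v * DT
        traj.append((h, v))
        if h <= 0.0:
            break
    return traj

def landing_cost(traj):
    """Penalize 'exploding' (touching down too fast) and never reaching the ground."""
    h_end, v_end = traj[-1]
    exploded = h_end <= 0.0 and v_end < -2.0
    return (1e6 if exploded else 0.0) + abs(v_end) + abs(h_end)

def plan(h0, v0, horizon=100, candidates=2000):
    """The 'simulate many landings, pick the one I like' step."""
    best_traj, best_cost = None, float("inf")
    for _ in range(candidates):
        schedule = [random.uniform(0.0, MAX_THRUST) for _ in range(horizon)]
        traj = simulate(h0, v0, schedule)
        cost = landing_cost(traj)
        if cost < best_cost:
            best_traj, best_cost = traj, cost
    return best_traj

def track(h0, v0, reference, kh=2.0, kv=4.0):
    """The low-latency loop: nudge reality toward the chosen simulated trajectory."""
    h, v = h0, v0
    for h_ref, v_ref in reference:
        u = -G + kh * (h_ref - h) + kv * (v_ref - v)  # gravity feedforward + PD tracking
        v += (G + min(max(u, 0.0), MAX_THRUST)) * DT
        h += v * DT
        if h <= 0.0:
            break
    return h, v

reference = plan(h0=100.0, v0=0.0)
print("touchdown (height, velocity):", track(100.0, 0.0, reference))
```

The point of the split is that `plan` only ever runs predictions forward, while `track` is the only part that issues control: the "prediction" is used as a reference for action rather than believed as a forecast.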
This is just straight-up planning and doesn't require doing weird gymnastics to deal with a biological brain's broken type system.
Even if they had almost destroyed the world, the story would still not properly be about their guilt or their regret, it would be about almost destroying the world
It is possible to not be the story's subject and still be the protagonist of one strand of it. After all, that's the only truth most people know for ~certain. It's also possible to not dramatize yourself as the Epicentre of the Immanent World-Tragedy (Woe is me! Woe is me!) and still feel like crap in a way that needs some form of processing/growth to learn to live with. Similarly, you can...
I would of course have a different response to someone who asked the incredibly different question, "Any learnable tricks for not feeling like crap while the world ends?"
(This could be seen as the theme of a couple of other brief talks at the Solstice. I don't have a 30-second answer that doesn't rely on context, and don't consider myself much of an expert on that question versus the part of the problem constraint that is maintaining epistemic health while you do whatever. That said, being less completely unwilling to spend small or even medium amounts of money made a difference to my life, and so did beginning a romantic relationship in the frame of mind that we might all be dead soon and therefore I ought to do more fun things and worry less about preserving the relationship, which led to a much stronger relationship relative to the wrong things I otherwise do by default.)
This vocalized some thoughts I had about our current culture. Stories can be training for how to act and bad melodramatic tropes are way too common. Every sad song about someone not getting over their ex or a dark hero movie where the protagonist is perpetually depressed about something that happened in the past conditions people the wrong way.
There is an annoying character in the recent Nuremberg film. He's based off a real person but I don't know how accurate that portrayal is.
He’s a psychiatrist manipulated by Goering. He's suppos
Thank you! Datapoint: I think at least some parts of this can be useful for me personally.
Somewhat connected to the first part, one of the most "internal-memetic" moments from "Project: Lawful" for me is this short exchange between Keltham and Maillol:
"For that Matter, what is the Governance budget?"
"Don't panic. Nobody knows."
"Why exactly should I not panic?"
"Because it won't actually help."
"Very sensible."
If an evil and not very smart bureaucrat understands it, I can too :)
Third part is the most interesting. It makes perfect sense, but I have no easy-to-acce...
It makes perfect sense, but I have no easy-to-access perception of this thing. Will try to do something with this skill issue.
As someone who believes myself to have had some related experiences, this is very easy to Goodhart on and very easy to screw up badly if you try to go straight for it without [a kind of prepwork that my safety systems say I shouldn't try to describe] first, and the part where you're tossing that sentence out without obvious hesitation feels like an immediate bad sign. See also this paragraph from that very section (to be clear, it's my interpretation that treats it as supporting here, and I don't directly claim Eliezer would agree with me):
...(Frankly I expect almost nobody to correctly identify those words of mine as internally visible mental phenomena after reading them; and I'm worried about what happens if somebody insists on interpreting it anyway. Seriously, if you don't see phenomena inside you that obviously looks like what I'm describing, it means, you aren't looking at the stuff I'm talking about. Do not insist on interpreting the words anyway. If you don't see an elephant, don't look under every corner of the room until you find something tha
This is why, in a much more real and also famous case, President Truman was validly angered and told "that son of a bitch", Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman.
For anyone else who didn't remember the details of what this was referencing:
Claude Opus 4.5's explanation of the reference
This refers to a meeting between J. Robert Oppenheimer and President Harry Truman in October 1945, about two months after the atomic bombings of Hiroshima and Nagasaki.
The meeting itself
Oppenheimer was invited to the Oval Offic
After reading this article by a human historian (Bill Black), I think there are a number of inaccuracies in Claude's account above, but the key point I wanted to verify is that Truman's reaction happened after just that one sentence by Oppenheimer (which in my mind seems like an appropriate expression of reflection/remorse, not being a drama queen, if he didn't do or say anything else "dramatic"), and that does seem to be true.
The author's conclusions, which seem right to me:
He, the president, dropped the bomb, not Oppenheimer. How dare this scientist — this government employee — assume the guilt for the greatest weapon ever used in human history? How dare he make himself the hero, albeit a tragic one?
I think Nolan got this right — this was what really annoyed Truman about Oppenheimer’s comment. By assuming guilt for the bomb, Oppenheimer was taking credit for it. And Truman resented this. He wanted the credit for dropping the bomb and saving American lives, whatever bloodguilt that may have entailed.
My understanding is that there's a larger pattern of behavior here by Oppenheimer, which Truman might not've known about but which influences my guess about Oppenheimer's tone that day and the surrounding context. Was Truman particularly famous for wanting sole credit on other occasions?
I kind of had a hard time not taking this as an ironic, veiled self-satire: the author using a first-person perspective to deliver, between the lines, a critique of the character portrayed in that first person. It hit me at some point that it -could- be, depending on how clever the author was or not. I don't try to be sharp or ironic, as I find it distasteful most of the time; although running into the concept of benevolent irony gave me moral food for thought, irony has largely just looked to me like another clever way to wound people, especially by projecting superior ability against the inferior. In this case it makes for effective satire, however, just because the cleverness (if I'm not misperceiving the author's intent) is quite brilliant.
That being said, if I were to try and interpret this writing as ironically satirizing the character's perspective by the author, the identifying tokens would be: to find strength in disconnecting one's self from enculturation via tropes that allow one to own one's mistakes so as to make half-hearted fixes after-the-fact which did not require hindsight to avoid causing, maintaining a covert ego in relation to them, and es...
Just remember that we are just evolved monkeys and the world is very complex. We may very well not have the capacity to see the reason why the world ending because of AI is actually implausible. I have been wrong too many times to go crazy or get too upset about a thing I positively know we cannot foresee -- even if the possibility space we are able to see is overwhelmingly bad.
Thanks for sharing this.
I don't expect that my methods of sanity will be reproducible by nearly anyone.
I think you're mistaken here. I've long used all three of your methods, broadly speaking, and I know several others for whom that is true.
Somewhat worrying, the extent to which reading Planecrash really does help with understanding this.
Does LW have spoiler tags?
Edit: moderate Planecrash spoiler that comes perhaps 70 hours of listening in.
Rough approximation: After a while, Keltham wonders if he's in a story and begins discussing "the tropes" - the aspects of his reality which he thinks seem more story-like - and whether they should play into or away from those aspects. Yudkowsky seems to be referencing the same concept here. Do we wish to play towards or away from the tropes we might expect to see?
I would respond to that question with: "How are you coping with the certainty that you, and everyone you ever knew or cared about or who cares about you, will be dead in a hundred years or so?" (And before many people's estimate of AI doom.) The simple answer is that we did not evolve to be able to truly feel that kind of thing, and for good reason.
You really get asked that? Wow.
I also have always found the "the world might end tonight/tomorrow/next week" stories, with people running around madly doing all the things they never would have otherwise, a bit stretched. But then mob mentalities are not rational, so I don't really try to make too much sense of them.
I suppose that would be my first approach to coping with the world ending -- just keep my eye open to external madness and perhaps put some space between me and large population or something.
Since I generally don't believe anyone has ever pro...
I reflected on why I didn’t feel overwhelming debilitating sadness due to x-risk and realized that “there’s no rule that says you should be sad if you aren’t feeling sad.”
Even a recent widow in a previously happy marriage shouldn’t feel bad about not feeling sad if they find themselves not being sad.
Why can’t this too be a trope: having had the thought “I’m a writer and can write myself; I can write internal scripts for what I do and how I react,” the character believes he has near-perfect agency over how he feels, thinks, and acts, until one day a particular stress test (in an accelerating series of increasingly rigorous stress tests) suggests that he doesn’t.
One could incorrectly summarize all this as "I have decided not to expect to go insane," but that would violate the epistemic-instrumental firewall and therefore be insane.
would a saner alternative then go along the lines of:
"I have decided to entertain thoughts and actions under the expectation that I will not go insane, because that's the most adaptive and constructive way to face this situation, even though I can't be certain"?
if so, I see a good dynamic for sanity;
- choose (non egocentric & constructive) narrative;
- guide thoughts to fit chosen narrative.
slightly tangential question: how do you maintain coherence/continuity of narrative across contexts?
Nope. Breaks the firewall. Exactly as insane.
Beliefs are for being true. Use them for nothing else.
If you need a good thing to happen, use a plan for that.
"I was rolling my eyes about how they'd now found a new way of being the story's subject"
That reads to me like it's still rolling eyes at a status overreach, just a slightly different one than the one most people would roll their eyes at
For those wondering about Raistlin Majere, this is from Wikipedia:
« Born to a mother prone to trance-like fits and a woodcutter father, Raistlin inherited his mother's aptitude for magic. He undertook and passed the arduous Test of High Sorcery, but in the process, he acquired white hair and golden skin and was cursed with hourglass eyes which saw the effects of time on all things. His health, while never robust, was ruined further, leaving him weak and subject to frequent bouts of coughing blood. Initially wearing the white robes of good, as the first series progresses Raistlin's powers increase while his mood and actions darken, he goes to neutral red robes for the majority of the "War of the Lance" series until he adopts the black robes of evil while under the tutelage of "Fistandantilus" during the War of the Lance.
Raistlin, although physically very weak, is extremely intelligent, and possesses uncommonly powerful magical abilities. While ruthless in his pursuit of power, he holds to a code of conduct which repays all debts and protects those disadvantaged through no fault of their own. His relationship with his much stronger, better-liked, and good-natured twin brother Caramon is fraught with tensions as Caramon seeks to protect and shelter his weaker brother while denying his cruelty and penchant for hurting any others while in pursuit of his goals. »
Being crazy is unpleasant. Even when it produces feelings of intense euphoria or meaning, it destroys agency, both at the level of being able to move and at the more Kantian level of being able to understand. One alternative solution, then, would be like the "making your kid smoke an entire carton of cigarettes" solution. Lots of people do LSD, mushrooms, or DMT, once or with sporadic frequency, and this turns into some sort of secularized come-to-Jesus moment for them, where the scales fall off their eyes and they learn about the complex, limited, real virtues of being insane, become experts in them, in fact maybe the only experts, and go around trying to curate insanity in other people, judging it, cultivating it. The opportunity cost of making a person be continuously crazy for a year or more is extremely high, but I suspect it is capable of producing empirical refutation of some perspectives.
And a fiat decision to stay sane, implemented by not instructing myself that any particular stupidity or failure will be my reaction to future stress.
I have not implemented the other two, but this decision I made during HPPD-like psychosis; yes, it is for some a learnable skill.
I think you are severely underestimating how relatable and common your thoughts on this topic are (also to many journalists). In short, you underestimate people's capacity to get this (probably because they are out-of-distribution for your way of structured reasoning in general, to borrow LW 2.0 lingo).
If I would make a guess, I think that (self-aware) people outside of LW and similar circles may be even more likely to relate to several of these points than people inside of LW. For example, "a sentence about snow is words, is made of words, but it is about...
Eliezer, on number three: I give it a 5% chance that I'm talking about the same thing as you, and that's before applying my overconfidence factor of 0.6. You're talking about injecting instructions into your motor plan. I'm visualising doing the thing really hard. It seems to work? It's like I'm deliberately making a few predictions about the next few seconds, and just continuing to visualise those things rather than thinking about something else, then I just start moving. Is this the same thing you're talking about? Or am I just doing some form of "Yud said...
I think the journalistic conceit behind the "how are you coping" question in this context amounts to treacle, and I see value in the frame of eschewing genre. Where I get stuck is that I think the trope/response that the question is intended to elicit would, under the indulged journalistic narrative, play more along the lines of a rational restatement of the Serenity Prayer. In other words, in the script as put, the Eliezer Yudkowsky "character" is being prompted not to give vent to emotive self-concern, but to articulate a more grounded, calm and focused ...
Thanks for the interesting peek into your brain. I have a couple of thoughts to share on how my own approaches relate.
The first is related to watching plenty of sci-fi apocalyptic future movies. While it's exciting to see the hero's adventures, I'd like to think that I'd be one of the scrappy people trying to hold some semblance of civilization together. Or the survivor trying to barter and trade with folks instead of fighting over stuff. In general, even in the face of doom, just trying to help minimize suffering unto the end. So the 'death with dignity' eth...
I think I know of the trick you are talking about, in that there does seem to be an obvious pseudoprediction place in my mind that interfaces with motor output, and it's obviously different from actually believing, or trying to believe. However I mostly can't manage more than twitches or smaller motor movements, and it gets harder the more resistant I am to doing it (thus, less useful the more I would need use of it). If I'm thinking of the right thing, then the failure of me to sometimes send the pseudoprediction to my muscles seems to be the cause of som...
Oh come on, Eliezer. These strategies aren't that alien.
I remember a time in my early years, feeling apprehensive about entering adolescence and inevitably transforming into a stereotypical rebellious teenager. It would have been not only boring and cliche but also an affront to every good thing I thought about myself. I didn't want to become a rebellious teenager, and so I decided, before I was overwhelmed with teenage hormones, that I wouldn't become one. And it turns out that intentional steering of one's self-narrative can (sometimes) be quite effectiv...
"There exists a place in your cognition that feels like an expectation but actually stores an action plan that your body will follow, and you can load plans into it." is a valuable insight and I'm not sure I've seen it stated quite in that form elsewhere.
Do you have more you could say about how cognition works, or reliable references to point at?
Everything I've read is either true but too specific or low level to be useful (on the science end) or mixed with nonsense (on the meditation end), and my own mind is too muddled to easily distinguish true facts about how it works from almost-true facts about how it works. This makes building up a reliable model really hard.
The human brain is just a wacky biological tangle, the same way that human metabolism repurposes the insanely reactive chemical byproduct of superoxide as a key signaling molecule.
It sounds like you read Petro Dobromylsky's Hyperlipid and Brad Marshall's Fire in a Bottle!
Translating this to the mental script that works for me:
If I picture myself in the role of the astronauts on the Columbia as it was falling apart, or a football team in the last few minutes of a game where they're twenty points behind, I know the script calls for just keeping up your best effort (as you know it) until after the shuttle explodes or the buzzer sounds. So I can just do that.
Why is there an alternative script that calls to go insane? I think because there's a version that equates that with a heroic effort, that thinks that if I dramatize and j...
My method of staying sane is way less complicated.
I am not unique or special. I am human. Ergo things that keep humans sane should work on me.
So I read up on mental health and then did those things. Sleep, nutrition, exercise, sunshine, make friends, community service, clean air.
It's likely I may still experience issues later in life. But all life is always temporary. It's about the now, appreciating this moment when I have a dog beside me and a snoring spouse and a wool blanket and a nice book.
I can only control what I can control.
I'm learning rock carving. Rocks are awesome and last through lots of disasters.
"How are you coping with the end of the world?" journalists sometimes ask me, and the true answer is something they have no hope of understanding and I have no hope of explaining in 30 seconds, so I usually answer something like, "By having a great distaste for drama, and remembering that it's not about me." The journalists don't understand that either, but at least I haven't wasted much time along the way.
Actual LessWrong readers also sometimes ask me how I deal emotionally with the end of the world.
I suspect a more precise answer may not help. But Raymond Arnold thinks I should say it, so I will say it.
I say again, I don't actually think my answer is going to help. Wisely did Ozy write, "Other People Might Just Not Have Your Problems." Also I don't have a bunch of other people's problems, and other people can't make internal function calls that I've practiced to the point of hardly noticing them. I don't expect that my methods of sanity will be reproducible by nearly anyone. I feel pessimistic that they will help to hear about. Raymond Arnold asked me to speak them anyways, so I will.
The first and oldest reason I stay sane is that I am an author, and above tropes. Going mad in the face of the oncoming end of the world is a trope.
I consciously see those culturally transmitted patterns that inhabit thought processes aka tropes, both in fiction, and in the narratives that people try to construct around their lives and force their lives into.
The trope of somebody going insane as the world ends, does not appeal to me as an author, including in my role as the author of my own life. It seems obvious, cliche, predictable, and contrary to the ideals of writing intelligent characters. Nothing about it seems fresh or interesting. It doesn't tempt me to write, and it doesn't tempt me to be.
It would not be in the interests of an intelligent protagonist to amplify their own distress about an apocalypse into more literarily dramatic ill-chosen behavior. It might serve the interests of a hack author but it would not help the character. Understanding that distinction is the first step toward writing more intelligent characters in fiction. I use a similar and older mental skill to decide which tropes to write into the character that is myself.
This sense -- which I might call, genre-savviness about the genre of real life -- is historically where I began; it is where I began, somewhere around age nine, to choose not to become the boringly obvious dramatic version of Eliezer Yudkowsky that a cliche author would instantly pattern-complete about a literary character facing my experiences. Specifically, though I expect this specific to mean nothing to a supermajority of you, I decided that as a relatively smart kid I would not become Raistlin Majere, nor ever exhibit a large collection of related tropes.
The same Way applies, decades later, to my not implementing the dramatic character a journalist dreams up -- a very boring and predictable pattern-completion of a character -- when they dream up a convenient easy-to-write-about Eliezer Yudkowsky who is a loudly tortured soul about his perception of the world's end approaching along its default course.
"How are you coping?" journalists sometimes ask me, and sometimes nowadays they have become worried themselves and want to know for themselves if there's a key to coping. But often today, and before ChatGPT almost always, they are planning a Character-Focused Story about how my Tortured Soul deals with an imaginary apocalypse, to exhibit to their readers like a parent takes their kids to the zoo to stare at a strange animal. I reply to them "I have a great distaste for drama", but the actual answer is "I am a better writer than you, and I decided not to write myself as that incredibly cliche person that would be easy and convenient for you to write about."
"Going insane because the world is ending" would be a boring trope and beneath my dignity to choose as my actual self's character.
"How are you coping with the end of the world?" journalists sometimes ask me, and I sometimes reply, "By remembering that it's not about me." They have no hope of understanding what I mean by this, I predict, because to them I am the subject of the story and it has not occurred to them that there's a whole planet out there too to be the story-subject. I think there's probably a sense in which the Earth itself is not a real thing to most modern journalists.
The journalist is imagining a story that is about me, and about whether or not I am going insane, not just because it is an easy cliche to write, but because personality is the only real thing to the journalist.
This is also a pattern that you can refuse, when you write the story that is yourself; it doesn't have to be a story that is ultimately about you. It can be about humanity, humane preferences, and galaxies. A sentence about snow is words, is made of words, but it is about snow. You are made of you, but you don't need to be all about yourself.
If I were to dwell on how it impacted me emotionally that the world was ending, I would be thinking about something which genuinely doesn't matter to me very much compared to how the world is ending. Having dramatic feelings is not mostly what I am about -- which is partly how I ended up being not much made of them, either; but either way, they're not what I'm about.
So long ago that you probably can't imagine what it was like back then, not just before ChatGPT but years before the age of deep learning at all, there was a person who thought they were like totally going to develop Artificial General Intelligence. Then they ran into me; and soon after, instead started agonizing about how they had almost destroyed the world. Had they actually been that close to success? Of course not. But I don't relate to status as most people do, so that part, the status-overreach, wasn't the part I was rolling my eyes about. It is not the sort of epistemic prediction error that I see as damnable in the way that a status-regulator sees it as the worst thing in the world; to underestimate oneself is no more virtuous than to overestimate oneself. Rather, I was rolling my eyes about the part that was a more blatant mistake, completely apart from the epistemic prediction error they probably couldn't help; the part that would have been a mistake even if they had almost destroyed the world. I was rolling my eyes about how they'd now found a new way of being the story's subject.
Even if they had almost destroyed the world, the story would still not properly be about their guilt or their regret, it would be about almost destroying the world. This is why, in a much more real and also famous case, President Truman was validly angered and told "that son of a bitch", Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman. Oppenheimer was trying to have nuclear weapons be about Oppenheimer's remorse at having helped create nuclear weapons. This feels obviously icky to me; I would not be surprised if Truman felt very nearly the same.
And so similarly I did not make a great show of regret about having spent my teenage years trying to accelerate the development of self-improving AI. Was it a mistake? Sure. Should I promote it to the center of my narrative in order to make the whole thing be about my dramatic regretful feelings? Nah. I had AGI concerns to work on instead.
I did not neglect to conduct a review of what I did wrong and update my policies; you know some of those updates as the Sequences. But that is different from re-identifying myself as a dramatic repentant sinner who had thereby been the story's subject matter.
In a broadly similar way: If at some point you decide that the narrative governing your ongoing experience will be about you going insane because the world is ending: Wow, congratulations at making the end of the world still be about you somehow.
The third way I stay sane is a fiat decision to stay sane.
My mental landscape contains that option; I take it.
This is the point I am even less expecting to be helpful, or to correspond to any actionable sort of plan for most readers.
I will nonetheless go into more detail that will probably not make any sense.
Besides being a thing I can just decide, my decision to stay sane is also something that I implement by not writing an expectation of future insanity into my internal script / pseudo-predictive sort-of-world-model that instead connects to motor output.
(Frankly I expect almost nobody to correctly identify those words of mine as internally visible mental phenomena after reading them; and I'm worried about what happens if somebody insists on interpreting it anyway. Seriously, if you don't see phenomena inside you that obviously looks like what I'm describing, it means, you aren't looking at the stuff I'm talking about. Do not insist on interpreting the words anyway. If you don't see an elephant, don't look under every corner of the room until you find something that could maybe be an elephant.)
One of the ways you can get up in the morning, if you are me, is by looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen. (To be clear, I actually do this very rarely; it is just a fun fact that this is a way I can defeat bed inertia.)
There are a lot of neighboring bad ideas to confuse this with. The trick I'm describing above does not feel like desperately hyping myself up and trying to believe I will get out of bed immediately, with a probability higher than past experience would suggest. It doesn't involve lying to myself about whether I'm likely to get up. It doesn't involve violating the epistemic-instrumental firewall (factual questions absolutely separated from the consequences of believing things), to give myself a useful self-fulfilling prophecy. It is not any of the absurd epistemic-self-harming bullshit that people are now flogging under brand names like "hyperstition", since older names like "chaos magick" or "lying to yourself" became less saleable. I still expect them to point to this and say, "Why, of course that is the same thing I am selling to you as 'hyperstition'!" because they would prefer not to look at my finger, never mind being able to see where I'm pointing.
With that said: The getting-out-of-bed trick involves looking into the part of my cognition where my action plan is stored, and loading an image into it; and because the human brain's type system is a mess, this has the native type-feeling of an expectation or prediction that in a few seconds I will execute the motor-plan and get out of bed.
That I am working with cognitive stuff with that type-feel, is not the same thing as lying to myself about what's likely to happen; no, not even as a self-fulfilling prophecy. I choose to regard the piece of myself whose things-that-feel-like-predictions get sent as default motor output, as having the character within my Way of a plan I am altering; rather than, you know, an actual mistaken prediction that I am believing. If that piece of myself gets to have me roll out of bed, I get to treat it as a plan rather than as a prediction. It feels internally like a prediction? Don't believe everything you feel. It's a pseudo-model that outputs a pseudo-prediction that does update in part from past experience, but its actual cognitive role is as a controller.
The key step is not meditating on some galaxy-brained bullshit about Lob's Theorem, until you've convinced yourself that things you believe become true. It's about being able to look at the internal place where your mind stores a pseudo-predictive image of staying in bed, and writing instead a pseudo-prediction about getting out of bed, and then letting that flow to motor output three seconds later.
It is perhaps an unfortunate or misleading fact about the world (but a fact, so I deal with it), that people telling themselves galaxy-brained bullshit about Lob's Theorem or "hyperstition" may end up expecting that to work for them; which overwrites the pseudo-predictive controlling output, and so it actually does work for them. That is allowed to be a thing that is true, for reality is reality. But you don't have to do it the scrub's way.
Perceiving my internal processes on that level, I choose:
I will not write internal scripts which say that I am supposed to / pseudo-predict that I will, do any particular stupid or dramatic thing in response to the end of the world approaching visibly nearer in any particular way.
I don't permit it as a narrative, I don't permit it as a self-indulgence, and I don't load it into my pseudo-predictive self-model as a pending image that gets sent by default to internal cognitive motor outputs.
If you go around repeating to yourself that it would be only natural to respond to some stressful situation by going insane -- if you think that some unhelpful internal response is the normal, the default, the supposed-to reaction to some unhelpful external stimulus -- that belief is liable to wire itself in as being also the pseudo-prediction of the pseudo-model that loads your default thoughts.
One could incorrectly summarize all this as "I have decided not to expect to go insane," but that would violate the epistemic-instrumental firewall and therefore be insane.
(All of this is not to be confused with the confused doctrine of active inference. That a brain subsystem sometimes repurposes a previously evolved piece of predictive machinery as a generalizing cache system that then sends its outputs as control signals, does not reveal some deep law about prediction and planning being the same thing. They're not. Deep Blue made no use of that idiom, purely separated prediction from planning, and worked just fine. The human brain is just a wacky biological tangle, the same way that human metabolism repurposes the insanely reactive chemical byproduct of superoxide as a key signaling molecule. It doesn't have to be that way for deep theoretical reasons; it's just biology being a tangle.)
(All of this is not to be confused with the Buddhist doctrine that every form of negative internal experience is your own fault for not being Buddhist enough. If you rest your hand on a hot stove, you will feel pain not because your self-pseudo-model pseudo-predicts this to be painful, but because there's direct nerves that go straight to brain areas and trigger pain. The internal mechanism for this does not depend on a controlling pseudo-prediction, it just falls downward like a stone under gravity. The same directness is allowed to be true about suffering and not just pain; if there's a clever way to overwrite pseudo-predictions of suffering and thereby achieve Buddhist indifference to bad things, I don't have it as a simple obvious surface lever to pull. I also haven't chosen to go looking for a more complicated or indirect version of it. I do not particularly trust that to end well.
But I do think there are various forms of drama, error, and insanity which are much more like "things people do because they expected themselves to do it"; and much less like the pain, or suffering, from burning your hand.)
There's an edition of Dungeons and Dragons that has a god of self-improvement, called Irori. My fanfictions sometimes include characters that worship Him (heresy), or seek what He sought (approved).
In my fictional reification, Irori's religion has mottos like, "You don't have problems, you have skill issues." Irorians can be a bit harsh.
But even if something is a skill issue, that doesn't mean you have the skill, nor know how to solve it.
When an Irorian calls something a skill issue, they're not instructing you to feel bad about having not solved it already.
They are trying to convey the hope that it is solvable.
Doing crazy things because your brain started underproducing a neurotransmitter is a problem. It wouldn't be very Irorian to tell you that you can't solve it just through even clearer thinking; but if there's a medication that directly fixes the problem, that is probably easier and faster and more effective. Also, this isn't Dungeons and Dragons, Irori isn't real, and possibly you genuinely can't solve a neurotransmitter problem by thinking at it.
Doing crazy things because the world is ending is a skill issue.
These then are Eliezer Yudkowsky's probably-irreproducible ways of staying sane as the world seems more visibly close to ending:
A distaste for the boringly obvious trope of a character being driven mad by impending doom;
Not making the story be all about me, including my dramatically struggling to retain my sanity;
And a fiat decision to stay sane, implemented by not instructing myself that any particular stupidity or failure will be my reaction to future stress.
Probably you cannot just go do those three things.
Then figure out your own ways of staying sane, whether they be reproducible or irreproducible; and follow those ways instead.
The reason that I tell you of my own three methods, is not to provide an actionable recipe for staying sane as the world begins to seem visibly closer to ending.
It is an example, a reminder, and maybe even an instruction to a part of yourself that produces self-pseudo-predictions that get loaded as your internal mental behavior:
Sanity is a skill issue.