Embedded vs. External Decision Problems

https://www.lesswrong.com/posts/br7KRSeNymwSvZnf5/embedded-vs-external-decision-problems

(I’ve discussed these ideas in the past, but this idea was lacking a canonical reference post) Suppose that you are an expected-utility maximising agent. This means that when you make a decision you should calculate the expected-utility for each counterfactual and choose the counterfactual with the highest expected utility. However, if you know yourself well enough to know that you are an expected utility maximiser, this knowledge can change your counterfactuals. For example, this can provide information about your likely future actions, the expected number of expected utility maximisers in the world or how clones of you might behave. In general, if you are in an embedded problem and you have reflective knowledge of your own algorithm, then your decision theory provides both a system of evaluation and a data-point about the world. We can call this the embedded view. In contrast, we could also imagine a utility maximiser outside the problem description, analysing the decisions of various agents within the problem according to how well they maximise utility, none of which need to be utility maximisers themselves. In general, when we use a decision theory in this way, we will call this the external view of the problem. Normally, we’ll be talking about applying one or both of these two things:

Comment

https://www.lesswrong.com/posts/br7KRSeNymwSvZnf5/embedded-vs-external-decision-problems?commentId=u6jBrsQg8sPixsDvg

For example, [for] an updateless agent that always ["]one-boxes["] in Newcomb’s, … it kind of doesn’t make sense to ask what that agent would have scored if that agent had two-boxed.An updateless agent could (always) two-box.

Comment

https://www.lesswrong.com/posts/br7KRSeNymwSvZnf5/embedded-vs-external-decision-problems?commentId=sfHCLN7ZkxkEFDSyN

The agent has code. It can only do what the code says. If the code will make it one box, there was a sense in which it never could have two-boxed.

Comment

But if the code will make it two box, there was a sense in which it never could have one-boxed.

Comment

That’s true also