Cross-posted from Putanumonit. Some of this post is relevant mostly to readers of my blog who aren’t LessWrongers, but you may still be interested in my general thoughts about the essays, the book as an artifact, and the state of the community.

Is there a better way to bid goodbye to 2020 than with a book set of the best Rationalist writing of 2018? I wouldn’t know — Ben Pace, who compiled and edited the set, sent me a review copy and so I spent Christmas day reading the Epistemology entry. So this post is a review, and of more than just the books.

A great thing you’ll notice right away about the books is that they smell exactly like Wiz, the Israeli video game magazine from the 90s that was the joy of my middle school years. A not-so-great thing about the books is that they’re small. The essays are printed in a very small font and the quotes within each essay are printed, for some reason, in an even smaller font. There are rumors that inside the quotes the secrets of the universe are rendered in the tiniest font of all, but I lack the visual acuity to discern if that is the case. The book set looks almost comical next to the hardcover SlateStarCodex collection on my shelf:

Ironically, this juxtaposition describes the state of the Rationality community when I discovered it in early 2014. That year was Scott Alexander’s unassailable annus mirabilis. In the span of 12 months he taught us about outgroups and the gray tribe, Moloch, fashions, layers and countersignaling, words and categories, toxoplasma, the psychology of EA and of social justice, drugs, other drugs, better drugs, scientific validity, and whale cancer. The same period for LessWrong is described by Ben Pace in the introduction to the book set as "a dark age from 2014-2017, with contributions declining and the community dispersing". This led to the LessWrong 2.0 team forming in 2018, and, as the book set can attest, ushering in a true renaissance of rationality writing and intellectual progress.

The Epistemology tome contains the following ten essays, which I’ll refer to by the bolded part:
- A **Sketch of Good** Communication by Ben Pace (significantly edited from the online version)
- **Babble** by Xiaoyu He (aka alkjash)
- **Local Validity** as a Key to Sanity and Civilization by Eliezer Yudkowsky
- The **Loudest Alarm** Is Probably False by Patrick LaVictoire (aka orthonormal)
- **Varieties** of Argumentative Experience by Scott Alexander
- **More Babble** by Xiaoyu
- Naming the **Nameless** by Sarah Constantin
- **Prune** by Xiaoyu
- **Toolbox**-thinking and **Law**-thinking by Eliezer
- Toward a **New Technical** Explanation of Technical Explanation by Abram Demski

I like to think of rationality in general and rationalist epistemology in particular as comprising three interdependent domains:
- Epistemology for **monkeys**, i.e. what happens in an individual’s mushy evolved brain as they’re trying (or not) to form true beliefs.
- Epistemology for **algorithms**, i.e. Bayes, logic, computation, and the rest of the rigorous mathematical foundation of truth-seeking AI.
- Epistemology for **groups**, i.e. how we arrive or fail to arrive at truth together.

Since humans are algorithmic monkeys in groups, all three branches are relevant to epistemology for humans. The essays, although selected by simple voting and without an eye for comprehensiveness, cover all three domains.

*Local Validity* is the underwhelming essay of the bunch, and feels like a throwback to 2008, when Eliezer himself was merely laying the groundwork for capital-R Rationality. It draws a parallel between valid steps in a mathematical proof, non-fallacious arguments, and a civilization’s laws. It points out that the "game", whether it’s deriving mathematical proofs, drawing true conclusions, or maintaining a functioning civilization, requires everyone to abide by rules of validity. This parallel rings true, but doesn’t seem very novel or productive by itself. Did citizens of erstwhile civilizations follow rules because they understood the game theory of defection equilibria? If they didn’t, would this essay have convinced them? The juice is in understanding where and why humans stray from validity.

That’s where Scott comes in. *Varieties* could function as the organizational essay for the entire tome. It addresses *Local Validity* by pointing out that people usually play games different from honest debate: manipulating the Overton window of acceptable debate, manipulating the norms of debate in particular groups, or vying for status supremacy directly. Scott’s essay drives home the point that actual truth-seeking debate is so rare in many spaces that people often forget it’s an actual thing. Some of the best Rationalist writing of 2020 is about statements-that-aren’t-truth-claims and fights-that-aren’t-debates, such as the writings about simulacra. Rationalists often need a reminder that we’re quite unusual as a group in finding ourselves regularly venturing to the top of the pyramid in search of truth.

*Sketch of Good* slots nicely into the pyramid with the idea that collaborative truth-seekers should share the gears of their models, and not just the models’ output. Aside from allowing you to make better updates from a conversation with one person, this technique also improves your ability to update on future evidence and to integrate the opinions of more people. It’s an important complement to the technique of double crux. I will try to communicate some of my own model with regards to a pressing new COVID-related development in this post.

At the top of Scott’s pyramid are generators of disagreement that cannot be resolved with a simple double crux, like heuristics built up from countless bits of evidence and aesthetic disagreements. This is where *Nameless* comes in. Sarah explains that style and aesthetics are not arbitrary matters of individual taste, but compress a huge amount of information about a culture and its norms. When you walk into a Sweetgreen cafe, every element of design tells you not just what sort of food you are about to eat but also what sort of people you will be eating it next to and what these people’s attitudes are about a variety of non-salad topics. Aesthetics are initially manufactured by a creative class that is almost always aligned with the political left. Negating the influence of aesthetics (as some conservatives try to do), ignoring it as a communitarian ploy (libertarians), or being simply blind to it (rationalists) is a bad move that cedes this important ground to a tribe you may otherwise dislike.
*Nameless* is, in my opinion, the most impressive essay in the book. It charges boldly into new territory and is dense with insight. I hope that rationalists continue to build on this idea, especially as I am personally becoming more and more convinced that you can’t really **argue** people into things like rationality, transhumanism, polyamory, Effective Altruism, or anything else I hold important. You have to inspire and seduce them, and that requires understanding beauty in addition to truth.

*Loudest Alarm* is brief and novel. It rang very true to my wife (who is both demure and constantly worried about imposing on people) but less so to me (my friends and I tend to agree on my main weaknesses).

*Toolbox and Law* is another essay that will be most useful to people who are new to rationality and confused about the proper role of Bayesian theory in epistemological practice. It seems to have been written as a reply to David Chapman, part of Eliezer’s tireless pwning of meta-, post-, and other too-cool-to-call-themselves-rationalists. Since a big part of rationality as a brand is supporting Eliezer’s caliphate, it is only fitting that he leads the war for the brand’s status. I am quite happy to throw my own memes into the fray as called for.

*New Technical* is a bit too technical for me, so at the book’s recommendation I read An Untrollable Mathematician Illustrated instead and got a cool lesson on the work done to bring together probability theory and logical induction. I’m in this weird spot where I know more math than the vast majority of people but vastly less math than e.g. the researchers at MIRI. And so when I read posts about MIRI’s research and the mathematics of AI alignment I’m either bored or hopelessly lost within two paragraphs.

The star of Epistemology is alkjash, with three essays making the cut (the remaining two in the sequence are also worthwhile). I was extremely excited about Babble and Prune when it came out and ran a meetup about it, and it is now one of my main models for thinking about creativity — itself an underexplored topic on LessWrong. His suggestion of leaving ambiguity in the text (as the Bible does) to let the readers prune their own meaning informs my approach to Twitter, although I’m still working on bringing the same Biblical spirit to Putanumonit.

This model is closely related to predictive processing (although PP being my recent all-encompassing obsession, I’m liable to think that everything is closely related to it). Babble and Prune mirrors the core structure of hierarchical prediction, in which predictions are propagated downward from the abstract and conscious levels to the detailed and subconscious ones, and only errors (prediction mismatches) propagate back up. Alkjash connects the model to AI and Google’s algorithms as if anticipating the breakthrough in babbling that is GPT-3 (I sketch the generate-then-filter loop in code below). Of course, I was inspired to connect GPT-3 and AI to predictive processing and the future of AI as well.

This is straying from Epistemology a bit, but I do think that it will be an enormously fruitful project to recontextualize rationality through the lens of predictive processing. The main reason I haven’t started on this project yet is how huge it is, expanding vastly before me with every step I take in understanding. It may require hundreds of hours of my life to do justice to (perhaps I should run a Kickstarter to gauge interest). I’ll probably start by building on this action-informed rethinking of confirmation bias and see how it goes.
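To make the babble-and-prune loop concrete, here is a minimal sketch in Python. This is my illustration, not alkjash’s: the vocabulary, the random generator, and the scoring rule are all made-up stand-ins. The structure is what matters: a weak, permissive generator proposes many candidates, and a strict filter keeps the few that pass.

```python
import random

# Babble: a weak, permissive generator that proposes many candidate phrases.
# (The vocabulary is an arbitrary stand-in.)
VOCAB = ["moloch", "prior", "signal", "tribe", "map", "territory", "crux"]

def babble(n_candidates: int, length: int = 3) -> list[str]:
    """Generate candidate phrases with no quality control at all."""
    return [" ".join(random.choices(VOCAB, k=length)) for _ in range(n_candidates)]

# Prune: a strict filter that discards most of the babble.
# (The scoring rule is a toy heuristic, not anything from the essays.)
def prune(candidates: list[str], keep: int) -> list[str]:
    """Keep only the candidates that score best under some taste function."""
    def score(phrase: str) -> int:
        return len(set(phrase.split()))  # toy taste: penalize repeated words
    return sorted(candidates, key=score, reverse=True)[:keep]

ideas = babble(n_candidates=100)  # generate loosely...
best = prune(ideas, keep=3)       # ...filter strictly
print(best)
```

The asymmetry is the point of the model: generation is cheap and unfiltered, and all the standards live in the filter, which is why alkjash warns against setting your personal prune to the level of already-curated work.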
In general, the essays are all worth reading. However, I’m not entirely sure they make sense as a (tiny) book. The constraint of picking essays from a single year and going by public vote wasn’t designed to create a collection that coheres together. I would much rather have had essays collected by topic and across years, which may happen eventually with the introduction of concept tags on LessWrong. I wish that Ben had edited a volume that made sense to him, instead of carefully abstaining from putting his finger on the scales.

But of course, Ben’s job is not to edit paper book collections; it’s to build up LessWrong as an online resource and vibrant community. And that job is being done fantastically, including with the project of annual reviews. The 2019 review, which is ongoing now, is getting thousands of people to re-read the best essays and discuss them. The real book was the comments we wrote along the way.

By the way, my post on Rationalist Self-Improvement has been nominated for the 2019 collection but not reviewed yet. Please consider writing a review if that post had an effect on you. It may seem slightly unfair to other nominees to ask for this on my own blog, but I am also significantly handicapping the karma and visibility of all my posts on LessWrong by cross-posting only after hundreds of people have already read them on Putanumonit. If LW were the only place to read my posts, more people would read and review them there, so hopefully this balances out.

Ideally, this book will serve as an encouragement for more people to write on LessWrong. Perhaps I am myself lucky in having found LessWrong during its dark age — I wrote a few bad posts that no one read and, in the absence of a flood of rationalist content to intimidate me, started Putanumonit in 2015. Now that the dark days are over, you may feel daunted by the quality of writing in the LessWrong collection when thinking about writing yourself. Alkjash reminds you not to set your personal prune to the standards of work that has already been filtered and curated. Unlike the other authors in Epistemology, he wrote his first post only in January 2018, and by the end of one year he had progressed to making contributions worthy of inclusion in books.

And finally, this book should be an encouragement for everyone to read LessWrong. I can log on daily to find new and interesting writing, and I can also log on whenever I want smart people’s opinions on something like the new COVID strain and find detailed discussion of everything from the epidemiology to the biology to the investment impacts. New features are being added all the time, like the recent launch of in-post predictions. 2018-2020 LessWrong is very different, in content and tone and structure, from both the early days of the Sequences and the dark ages of the mid-2010s. But the book is a testament to the fact that this new age is a golden one (in shades of green and gray), and you are very much invited to join it.
I liked the tiny books!
Predictive Processing strikes me as a poor framework; I’d like to try and discuss your enthusiasm vs my lack of enthusiasm. What insights do you think it gives? What, basically, does PP mean to you, so we’re not talking past each other?
Off the top of my head, here are some new things it adds:
FYI Jacobian, very high in the review request thread is a post on neural annealing. I think many people would be interested in reading your review of that post. (Thank you very much for this review as well :D )
PP is not one thing. This makes it very difficult for me to say what I don’t like about it, since no one element seems to be necessarily present in all the different versions. What follows are some remarks about specific ideas I’ve seen associated with PP, many of them contradictory. Do let me know which ideas you endorse / don’t endorse.

It is also possible that each of my points is based on a particular misconception about PP. While I’ve made some effort to be well-informed about PP, I have not spent so much time on it, so my understanding is definitely shallow.

The three main meanings of PP (each of which is a cluster, containing many many different sub-meanings, as you flesh out the details in different ways):
**A theory of perception.** If you look PP up on Wikipedia, the term primarily refers to a theory of perceptual processing in which prediction plays a central role, and observations interact with predictions to provide a feedback signal for learning. So, the theory is that perception is fundamentally about minimizing prediction error. I basically believe this theory. So let’s set it aside.
**A theory of action.** Some people took the idea "the brain minimizes prediction error" and tried to apply it to motor control, too—and to everything else in the brain. I think this kind of made sense as a thing to try (unifying these two things is a worthwhile goal!), but doesn’t go anywhere. I’ll have a lot to say about this. This theory is what I’ll mean when I say PP—it is, in my experience, what rationalists and rationalist-adjacent people primarily mean by "PP".
**A theory of everything.** Friston’s free-energy principle. This is not only supposed to apply to the human brain, but also evolution, and essentially any physical system. I have it on good authority that the math in Friston’s papers is full of errors, and no one who has been excited about this (that I’ve seen) has also claimed to understand it.
Standard bayesian learning theory does just fine learning about acting, along with learning about everything else.
Standard bayesian decision theory offers a theory of acting based on that information.
Granted, standard bayesian theory has the agent learning about everything, regardless of its usefulness, rather than learning specifically those things which help it act. This is because standard Bayesian theory assumes sufficient processing power to fully update beliefs. However, I am unaware of any PP theory which improves on this state of affairs. Free-energy-minimization models can help deal with limited processing power by variational bayesian inference, but this minimizes the error of all beliefs, rather than providing a tool to specifically focus on those beliefs which will be useful for action (again, to my knowledge). Practical bayesian inference has some tools for focusing inference on the most useful parts, but I have never seen those tools especially associated with PP theory.
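To make the standard story concrete, here is a toy worked example (my numbers, purely illustrative): a bayesian agent updates its prior on an observation, and standard decision theory then picks the action with the highest expected utility under the posterior.

```python
# Toy bayesian learning + decision theory. All numbers are made up.

# Two world states: it will rain or it won't.
prior = {"rain": 0.3, "dry": 0.7}

# Likelihood of the observation "dark clouds" in each state.
likelihood = {"rain": 0.8, "dry": 0.2}

# Learning: bayes' rule, posterior proportional to likelihood * prior.
unnorm = {s: likelihood[s] * prior[s] for s in prior}
z = sum(unnorm.values())
posterior = {s: p / z for s, p in unnorm.items()}
# posterior is roughly {"rain": 0.63, "dry": 0.37}

# Acting: utility of each action in each state (also made up).
utility = {
    "take umbrella":  {"rain":   5, "dry": -1},
    "leave umbrella": {"rain": -10, "dry":  2},
}

# Decision theory: choose the action that maximizes expected utility.
expected = {
    action: sum(posterior[s] * u for s, u in payoffs.items())
    for action, payoffs in utility.items()
}
print(max(expected, key=expected.get))  # -> "take umbrella"
```

Note that the agent computes the full posterior whether or not rain is relevant to any decision it faces, which is exactly the unbounded-processing assumption flagged above.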
There’s a whole lot to respond to here, and it may take the length of Surfing Uncertainty to do so. I’ll point instead to one key dimension. You’re discussing PP as a possible model for AI, whereas I posit PP as a model for animal brains. The main difference is that animal brains are **evolved** and occur inside bodies.

Evolution is the answer to the dark room problem. You come with prebuilt hardware that is adapted to a certain niche, which is equivalent to modeling it. Your legs are a model of the shape of the ground and the size of your evolutionary territory. Your color vision is a model of berries in a bush, and so are the fingers that pick them. Your evolved body is a hyperprior you can’t update away. In a sense, you’re predicting all the things that are adaptive: being full of good food, in the company of allies and mates, being vigorous and healthy, learning new things. Lying hungry in a dark room creates a persistent error in your highest-order predictive models (the evolved ones) that you can’t change.

Your evolved prior supposes that you have a body, and that the way you persist over time is by using that body. You are not a disembodied agent learning things for fun or getting scored on some limited test of prediction or matching. Everything your brain does is oriented towards acting on the world effectively. You can see that perception and action rely on the same mechanism in many ways, starting with the simple fact that when you look at something you don’t receive a static picture, but rather constantly saccade and shift your eyes, contract and expand your pupil and lens, move your head around, and also automatically compensate for all of this motion. None of this is relevant to an AI that processes images fed to it "out of the void", and whose main objective function is something other than maintaining homeostasis of a living, moving body.

Zooming out, Friston’s core idea is a direct consequence of thermodynamics: for any system (like an organism) to persist in a state of low entropy (e.g. 98°F) in an environment that is higher entropy but contains some exploitable order (e.g. calories aren’t uniformly spread in the universe but concentrated in bananas), it must exploit this order. Exploiting it is equivalent to minimizing surprise, since if you’re surprised there’s some pattern of the world that you failed to make use of (free energy). Now apply this basic principle to your genes persisting over an evolutionary time scale and to your body persisting over the time scale of decades, and this sets the stage for PP applied to animals.

For more, here’s a conversation between Clark, Friston, and an information theorist about the Dark Room problem.
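For reference, the textbook variational-inference identity behind "minimizing surprise" (standard material, not specific to this exchange): the surprisal of an observation o under a generative model p is −log p(o), and the variational free energy F is an upper bound on it.

```latex
% q(s): approximate posterior over hidden states s; p(s, o): generative model.
F = \mathbb{E}_{q(s)}\left[\log q(s) - \log p(s, o)\right]
  = D_{\mathrm{KL}}\left(q(s) \,\|\, p(s \mid o)\right) - \log p(o)
  \geq -\log p(o), \qquad \text{since } D_{\mathrm{KL}} \geq 0.
```

Driving F down therefore both fits beliefs to the world (shrinking the KL term) and, once actions can change which o arrives, bounds long-run surprise; that is the formal sense in which "persisting in a low-entropy state" and "minimizing prediction error" are the same objective.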
I suspect that for some of the things you want to use PP for, I would rather use my machine-learning model of meditation. The basic idea is that we are something like a model-based RL agent, but (pathologically) have some control over our attention mechanism. We can learn which kinds of attention patterns are more useful. But we can also get our attention patterns into self-reinforcing loops, where we attend to the things which reinforce those attention patterns, and not to things which punish them.

For example, when drinking too much, we might resist thinking about how we’ll hate ourselves tomorrow. This attention pattern is self-reinforcing, because it lets us drink more (yay!), while refusing to spend the necessary attention to propagate the negative consequences which might stop that behavior (and which would also harm the attention pattern). All our hurting tomorrow won’t de-enforce the pattern very effectively, because that pattern isn’t very active to be de-enforced tomorrow. (RL works by propagating expected pain/pleasure shortly after we do things—it can achieve things on long time horizons because the expected pain/pleasure includes expectations on long time horizons, but the actual learning which updates an action only happens soon after we take that action.)

Wishful thinking works by avoiding painful thoughts. This is a self-reinforcing attention pattern for the same reason: if we avoid painful thoughts, we in particular avoid propagating the negative consequences of avoiding painful thoughts. Avoiding painful thoughts feels useful in the moment, because pain is pain. But this causes us to leave that important paperwork in the desk drawer for months, building up the problem, making us avoid it all the more. The more successful we are at not noticing it, the less the negative consequences propagate to the attention pattern which is creating the whole problem.

I have a weaker story for confirmation bias. Naturally, confirming a theory feels good, and getting disconfirmation feels bad. (This is not because we experience the basic neural feedback of perceptual PP as pain/pleasure, which would make us seek predictability and avoid prediction error—I don’t think that’s true, as I’ve discussed at length. Rather, this is more of a social thing. It feels bad to be proven wrong, because that often has negative consequences, especially in the ancestral environment.) So attention patterns (and behavior patterns) which lead to being proven right will be reinforced. This is effectively one of those pathological self-reinforcing attention patterns, since it avoids its own disconfirmation, and hence avoids propagating the consequences which would de-enforce it.

I would predict confirmation bias is strongest when we have every social incentive to prove ourselves right. However, I doubt my story is the full story of confirmation bias. It doesn’t really explain performance in the task where you have to flip over cards to check whether "every vowel has an even number on the other side" or such things. In any case, my theory is very much a just-so story which I contrived. Take with a heap of salt.
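The parenthetical about RL learning dynamics can be made concrete with a standard tabular Q-learning update (generic RL offered as an illustration; the states, actions, and rewards are invented, and this is not a model of attention or meditation):

```python
ALPHA = 0.1   # learning rate
GAMMA = 0.99  # discount: how much bootstrapped future expectations count

def q_update(q: dict, s, a, reward: float, s_next, actions) -> None:
    """One temporal-difference update, applied right after taking (s, a).

    The update itself happens immediately, but the bootstrapped term
    max_a Q(s_next, a) carries expectations about long-horizon consequences.
    """
    best_future = max(q.get((s_next, a2), 0.0) for a2 in actions)
    target = reward + GAMMA * best_future      # immediate + expected future
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + ALPHA * (target - old)   # learning happens *now*

# Toy usage: "drink" yields immediate reward; the hangover pain arrives
# later, in a state where it mostly updates whatever action came just
# before it, not the drinking that caused it.
q = {}
actions = ["drink", "stop", "sleep"]
q_update(q, "party", "drink", reward=1.0, s_next="late_night", actions=actions)
q_update(q, "late_night", "sleep", reward=-2.0, s_next="hungover", actions=actions)
print(q)  # {('party', 'drink'): 0.1, ('late_night', 'sleep'): -0.2}
```

Only with repeated passes through the chain does the hangover’s negative value propagate back through the bootstrapped term to discourage "drink" at "party"; an attention pattern that keeps the chain from being revisited blocks exactly that propagation, which is the self-reinforcing loop described above.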
Just saying, if anyone ever forms a band out of LessWrongers, "algorithmic monkeys" is my suggested name.
I found this review of the SSC books on the Amazon website: