
Philip Tetlock: scourge of the experts
Thirty years ago at the University of Yale, psychologist Philip Tetlock witnessed an experiment which was to inspire his life’s work: an epic 20-year investigation, the results of which are highly surprising and hold important lessons for us all.
The set up of the experiment was not particularly novel: it involved a rat, a maze, some food and a group of students to observe the results. The rat was released into the maze and given a choice to go left or right at a T-junction. Food was placed to the left on 60% of occasions, and to the right the rest of the time. It didn’t take the rat long to work out that it was more likely to be found to the left than to the right, and it soon chose to go left each time, guaranteeing itself a snack on 60% of occasions.
But this wasn’t the real point of the experiment. At the same time the rat was working for its dinner, the students were also asked to guess on which side the food would be found (they weren’t aware of the set 60% ratio of lefts to rights). The students, in their search for a pattern, switched their predictions regularly between right and left, resulting in a success rate of 52%. The rat was thus declared to have greater predictive powers than a roomful of Yale students.
Tetlock wondered if these observations pointed to a wider problem in the human brain’s ability to predict the future, and whether this problem extended to predictions made by professional forecasters. He embarked on a 20 year odyssey to investigate the ability of these experts to anticipate future trends.
Tetlock studied the forecasts of 284 experts who made their living identifying political and economic trends, periodically asking them to make predictions relating to their fields of expertise. Rather than giving them yes/no questions, he asked them to assess probabilities, for example: ‘What are the chances of inflation in the UK rising, staying the same or falling in the next 12 months?’, or ‘Will casualties in the Israel-Palestine conflict fall, rise, or stay the same in the next year?’ He didn’t just ask for their opinions, but studied the basis of their decision and how they reacted when they were proven to be wrong. After 20 years on the project, Tetlock ended up with an enormous database of 82.361 expert predictions.
The project’s findings were astounding. Just like in the rat test, arbitrarily assigning equal probabilities to the ‘worse’, ’same’ and ‘better’ outcomes would have yielded more accurate predictions than those given by the ‘experts’. In Tetlock’s words, a dart-throwing chimp would have been as much help as the paid experts in predicting the future (although please don’t try this at home, kids).
Other studies into the reliability of economic forecasts have yielded similar results, indicating that basing your forecast on a repeat of the result in the previous period would be more accurate than professional forecasts. This is more than a little disconcerting. Even the most independent-minded of us relies on professional expertise to tell us what the future may bring.
What is it that prevents people who know a subject inside out from outperforming rats and chimps in predicting the future? Some clues can be found in Tetlock’s analysis of his results. He looked for what personal characteristics made his forecasters more or less likely to make good predictions. One trend he noticed was that there was a significant correlation with the fame of the person. The more famous they were, the worse their forecasts tended to be. He also found that, beyond a basic working knowledge, the more an expert knew on the subject they were making predictions about, the more likely their prediction was to be wrong.
What lies behind this highly counterintuitive finding? Tetlock suggests that the experts made similar mistakes to the Yale students. The more you think you know, the more you are likely to overanalyse a situation which is inherently unpredictable or subject to far simpler rules than those applied by the sophisticated forecaster. The more an expert has invested in their complicated theories and arcane knowledge, the more likely they are to unnecessarily apply it to their prediction.
Another contributing factor could be Black Swans. The concept comes originally from the work of philosopher John Stuart Mill, but was popularised by Karl Popper in his influential treatise “The Logic of Scientific Discovery”, in which he dealt with the problem of induction. Induction is a process of thinking whereby a universal conclusion in reached on the basis of a limited number of observations. For anyone living in Europe prior to the discovery of Australia and New Zealand, the statement “all swans are white” would have seemed reasonable. If all the swans you and everyone you know and have ever encountered are white, it seems natural to assume it to be a universal fact that all swans are white – until, that is, someone ruins everything by finding black swans in the Antipodes.
In his excellent book, The Black Swan, Nassim Nicholas Taleb expands the idea of the Black Swan to any phenomenon which comes out of the blue, has a major impact, and which, in hindsight, is analysed to the point where its occurrence seems to have been almost inevitable. Example cited by Taleb include 9/11, World War I, and the advent of the PC and internet. To that list could be added the current financial crisis. Taleb argues that such events have a much larger effect on the world than is anticipated by traditional forecasting methods and by our hard-wired perception of the future.
Take as an example economic forecasts for Canada’s growth in GDP in 2010. According to the Bank of Canada, a growth rate of 3.8% is expected next year, a figure which is so incongruous that it looks suspiciously politically motivated. In comparison, the IMF expects growth of 1.6%, which seems more plausible at first sight. But what do these figures actually mean? What assumptions are they based upon? No more bank failures, one more failure, or how many exactly? Is the collapse of GM factored into the figures or not? How would they be affected by another large terrorist attack on North American soil, or an earthquake in San Francisco? How about a tsunami hitting Tokyo, or a bird flu pandemic? They may not be are not very cheery thoughts, but these and a myriad of other inherently unpredictable events and knife-edge outcomes in aggregate make these predictions worth less than the paper they’re written on. Economists may argue that the law of averages cancels out ‘external factors’, but this is patently untrue in the case of events of this magnitude. The economists are either fooling themselves, or just trying to fool us to keep their jobs intact.
Tetlock also noted in his analysis that the approach of the more successful forecasters differed significantly from the least successful. To explain the difference, he used the analogy of foxes and hedgehogs. According to the ancient Greek poet Archilochus, “The fox knows many things, but the hedgehog knows one big thing”. In other words, for all the cunning a fox has, a hedgehog only needs one tactic (rolling into a ball) to frustrate it. The philosopher Isaiah Berlin used the distinction in his celebrated essay “The Hedgehog and the Fox” to categorise thinkers and writers according to whether their work was based on a unified, consistent view of the world, or whether they brought multiple perspectives into their work.
Tetlock, similarly, labeled as hedgehogs those experts whose predictions were based on the application of one big idea, and who did not waver in their beliefs when they made mistakes, but rather made excuses for their failures. Their in-depth specialization in their subject led them to try to force the square peg of their pet theories into the round holes of subjects where they didn’t fully apply, and thus negatively affected their forecasting capabilities.
Foxes, on the other hand, had no grand theory, but instead drew on a broad range of ideas to determine their prognosis, and were always eager to learn from their mistakes. Tetlock found that foxes were far better forecasters than hedgehogs on average. They may have still only had around a 60% success rate (being a fox does not help one predict black swans), but this was significantly better on average than the hedgehogs.
It should be noted, however, that hedgehogs tend to be far more controversial than foxes, and thus get more column inches and air-time. Television, in particular, loves to pit two hedgehogs with alternate world views against each other. It makes a far more dramatic spectacle than two foxes trying to understand each others’ opinions. This helps to explain the inverse relationship between fame and forecasting prowess.
So where does that leave the layperson in need of expert advice? Firstly, it is clear that one should be skeptical about the worth of any opinions coming from so-called experts. If you have a basic understanding of the subject, you can probably do just as good a job yourself. Tetlock recommends following the predictions of those with a proven track record of correct forecasting. As a caveat to this, however, I would say that this only counts if these predictions demonstrate the expert’s ability to be flexible in his or her outlook. For example, the media has lauded those few brave souls who predicted the current financial crisis. At least some of these, however will probably have been right just because they have an extreme pessimistic bias. Relying on such individuals to accurately predict the bottom of the slump almost certainly be very unwise.
The news that experts are little or no better at predicting the future than ourselves suggests that we don’t need them. Do not expect, however, for them to be thrown out on the street any time soon. Even if I had never heard of chaos theory, my observation of the unreliability of long-term weather forecasting would tell me that they are as good as useless beyond a window of about three days. Nevertheless, every day I check the seven day forecast.
It seems that our brains are so uncomfortable dealing with an undefined future that we will will gladly listen to anyone willing to predict it. For some it may be fortune tellers and tarot cards, for others it may be economists or political analysts. Either way, the reassurance of having some expected future to plan for seems far more important to us than the reliability of the forecast itself. Rather than mocking the experts, perhaps we should first look a little closer to home.


9 comments
Comments feed for this article
March 30, 2009 at 8:52 am
J-P Thiéblot
Hello!
I run numerous Google daily alerts, one of which I labelled “economic theory trends”, and it brought me to this post.
Not being trained in psychology, I must admit humbly that I did not know of Philip Tetlock until I read your post, but I’m familar with the black swan theory, and also to a lesser degree with chaos theory, largely because econophysicists refer to it.
I share your skepticism concerning the ability of so-called “experts” to make any valuable economic forecasts, and I’ve been researching this question out of personal interest for the last few months.
As regards short-term forecasts, I think that the black swan theory and the law of unexpected consequences can help explain the poor performance of these “experts”. However, from the viewpoint of economic theory, I also believe that their failure if largely attributable to the fact that 98%+ of the economists still base their opinions on a model – neo-classical economics – that is totally obsolete, and they stubbornly refuse to consider what other approaches could bring them.
For instance, besides econophysics, behavioural economics, ecological economics and evolutionary economics can all help better understand what is going on in the real world, and up to a point, make long-term forecasts.
In any event, because I liked your post, and because you seem to share my interest in the shortcomings of economic forecasting, I would like to know more about your background and what brought you to this topic.
Brgds,
J-P Thiéblot
Montreal
jpthieblot@zba.ca
March 30, 2009 at 6:47 pm
anewleif
Hi J-P,
Thanks for your interesting comment. My background is in economics, and I am very much an amateur (in both the English and French meanings of the word) when it comes to psychology. I was left very disillusioned by my studies in economics, and was surprised that so few of my fellow students shared my skepticism over how applicable neoclassical models, rational expectations and crude game theory, for example, were to the real world. By the end of my studies, I had the suspicion that most of what I was studying was good for nothing more than filling university libraries and tenured academics’ pockets.
While I feel that the complexities of the world will always be a barrier to accurate economic forecasting, I agree that most economists are too caught up in their own elegant, but irrelevant, mathematical models and theories to consider how alternative approaches could help them to develop more realistic models of economic behaviour. Even if economists had the humility to put confidence intervals on their forecasts it would be a start, but of course that is highly unlikely to happen because it would reveal exactly how useless the forecast actually is (95% chance of growth between -3% and 3% in Canada next year perhaps?).
By evolutionary economics, are your referring to the work of Thorstein Veblen? I didn’t even hear of Veblen until after I had finished my studies, which speaks volumes about the narrow neoclassical approach taught by many academic institutions, I think. Veblen would certainly have had a lot to say about the current financial situation, and his approach yields insights into the causes of the crisis that neoclassical economics could never hope to provide.
March 30, 2009 at 9:52 pm
J-P Thiéblot
You might find this laughable, but after I finished college, I decided to study economics, not because I wanted to become an economist, but because I naively thought studying economics was the best way to understand our world… This was in the early seventies, and I was 18…
Needless to say, I was exposed to the same neo-classical nonsense that later turned you off. So I left economics to study business administration, then finance. I even taught finance for a while (at Laval University in Quebec City, and later at McGill and Concordia in Montreal), but I left finance for the same reason I had left economics, i.e., the reference framework of academic finance is as nonsensical as that of economics, and at the time there was absolutely no room in the academic world for someone who did not fully subscribe to the dogma of mainstream economics and finance. Regrettably, I don’t think things have changed much since then…
Let’s jump, a quarter of a century later, to evolutionary economics…
Last month, I came across a post on an Australian blog by David Tow, author of the book “The Future of Life – Meta-Evolution: A Unified Theory of Evolution”. His book is freely downloadable at
http://dhtow01.googlepages.com/futureoflife.
As regards “evolutionary economics”, David referred me to the book “The Origin of Wealth” by Eric D. Beinhocker, from McKinsey. I think it was written in 2006 or 2007. I read very positive critique about it, but I haven’t had time to read the book yet.
Earlier this month, another post brought me to Professor John M. Gowdy from the Rensselaer Polytechnic Institute, in Troy, NY. The post was announcing a conference (that Dr. Gowdy actually gave last week), entitled “The Financial Meltdown and the Economics of Sustainability”. The description of the conference said “Dr. John Gowdy, a professor and ecological economist at RPI, will discuss the shift from a growth economy to a sustainable economy and how the world might then function. Topics covered will include the current revolution in economic theory, the ongoing financial meltdown, global climate change policy, and more.” Mr. Gowdy’s email address at RPI is: gowdyj@rpi.edu.
Mr. Gowdy sent me PDF copies of two interesting papers he wrote in the Journal of Economic Behavior & Organization, one in 2006 (“Evolutionary Theory and Economic Policy with Reference to Sustainability”, and another in 2008 (“Behavioral economics and climate change policy”). (I have copies, but because he sent them to me directly, I don’t have links to the Journal, which anyway is not available for free.)
If you’re not already a fan of it, you will love “The Real World Economic Review” (http://www.paecon.net/PAEReview/), totally dedicated to non-classical economic theory, and with top-notch contributors. I just wish I had discovered it much sooner!
Finally, in another direction, I have found the numerous recent posts of Umair Haque (”Edge Economy”) on http://blogs.harvardbusiness.org/haque/?cm_re=homepage-031909-_-body-middle-tert-_-voices somewhat disconcerting, but most interesting, and to the point. I have no idea what his academic training might be, but he’s really good!
Voilà!
Brgds,
J-P Thiéblot
Montreal
jpthieblot@zba.ca
April 1, 2009 at 8:30 pm
anewleif
That’s quite a reading list you’ve given me, J-P! Thanks for all the links. I’ve subscribed to the Haque blog, and I have read some of the Real World Economic Review before, in its ‘post-autistic’ days.
It’s great that the internet at least allows people to bypass the mainstream these days, and I am very grateful for this very interesting exchange of views and sources.
Keep tuned in to the blog. Although the wide range of influences for my posts means that economics is one of many subjects, I hope you may be interested by some of the other topics, and perhaps in particular those pertaining to Canada. Originally, I planned to write primarily in the context of a new Canadian, although the remit has become somewhat wider. I would be particularly interested in your perspectives as a French-Canadian. I am fascinated by the similarities and differences between French-Canada and its English counterpart.
Thanks again for the interesting dialogue.
April 7, 2009 at 10:16 am
transorz
Many years ago, I received two C grades for Micro and Macro undergrad. At the time, I thought it was just because I never went to class since I had scheduled it for 8 am. (And maybe also because I didn’t really do any of the reading.) Now I realize that I was incredibly insightful and saw Economics for the intellectually bankrupt and irrelevant discipline that it is.
Sorry to say, the economists don’t stand a chance, even when they aren’t handmaidens to powerful interests. Why? Because the fundamental economic indicators they rely on are themselves politicized, as anewleif alludes to WRT GDP growth projections.
http://online.wsj.com/article/SB123897612802791281.html
IMO, if you want to find out how the world really works, you’re better off getting a sales job in a car dealership for a year.
April 7, 2009 at 9:54 pm
anewleif
Funnily enough, I also never made a 9am lecture after my first week, and maybe it was during those lectures that the invincible arguments in favour of the relevance of econometrics were made, but I somehow doubt it.
You’re absolutely right about the politicisation of economic indicators, and politicians will inevitably talk up recovery in a downturn. Although I am skeptical about politicians’ power to determine economic destiny under normal circumstances, this current crisis has certainly been handcrafted by bankers and politicians.
April 8, 2009 at 6:38 am
transorz
BTW, enjoying your blog very much. Thanks for motivating me to finally read the excellent Michael Lewis article “The End.” I’m a fantasy baseball guy — and a Boston Red Sox fan — and read Moneyball a few years back. He really does have a knack for bringing out the drama in numbers.
April 8, 2009 at 5:05 pm
anewleif
Everything I have read by Michael Lewis I have found fascinating, particularly Moneyball. I am also a baseball nut and a Blue Jays fan (contrary to popular belief, the two are not mutually exclusive), so I can’t wish you too much of a good season, but as long as we share the division and the wild card slot, I’ll be happy. New York who???
May 1, 2009 at 11:12 pm
Hedgehogs and foxes at The Guardian « A New Leif
[...] May 1, 2009 in Uncategorized Two opinion pieces in the Guardian this week on the subject of swine flu influenza A (H1N1) provided an interesting example of Philip Tetlock’s classification of good and bad pundits as ‘foxes’ and ‘hedgehogs’, as I outlined in another recent post. [...]