A year ago, it was hardly unthinkable that a math wizard like David X. Li might someday earn a Nobel Prize. After all, financial economists—even Wall Street quants—have received the Nobel in economics before, and Li's work on measuring risk has had more impact, more quickly, than previous Nobel Prize-winning contributions to the field. Today, though, as dazed bankers, politicians, regulators, and investors survey the wreckage of the biggest financial meltdown since the Great Depression, Li is probably thankful he still has a job in finance at all. Not that his achievement should be dismissed. He took a notoriously tough nut—determining correlation, or how seemingly disparate events are related—and cracked it wide open with a simple and elegant mathematical formula, one that would become ubiquitous in finance worldwide.
For five years, Li's formula, known as a Gaussian copula function, looked like an unambiguously positive breakthrough, a piece of financial technology that allowed hugely complex risks to be modeled with more ease and accuracy than ever before. With his brilliant spark of mathematical legerdemain, Li made it possible for traders to sell vast quantities of new securities, expanding financial markets to unimaginable levels.
His method was adopted by everybody from bond investors and Wall Street banks to ratings agencies and regulators. And it became so deeply entrenched—and was making people so much money—that warnings about its limitations were largely ignored.
Then the model fell apart. Cracks started appearing early on, when financial markets began behaving in ways that users of Li's formula hadn't expected. The cracks became full-fledged canyons in 2008—when ruptures in the financial system's foundation swallowed up trillions of dollars and put the survival of the global banking system in serious peril.
David X. Li, it's safe to say, won't be getting that Nobel anytime soon. One result of the collapse has been the end of financial economics as something to be celebrated rather than feared. And Li's Gaussian copula formula will go down in history as instrumental in causing the unfathomable losses that brought the world financial system to its knees.
How could one formula pack such a devastating punch? The answer lies in the bond market, the multitrillion-dollar system that allows pension funds, insurance companies, and hedge funds to lend trillions of dollars to companies, countries, and home buyers.
A bond, of course, is just an IOU, a promise to pay back money with interest by certain dates. If a company—say, IBM—borrows money by issuing a bond, investors will look very closely over its accounts to make sure it has the wherewithal to repay them. The higher the perceived risk—and there's always some risk—the higher the interest rate the bond must carry.
Bond investors are very comfortable with the concept of probability. If there's a 1 percent chance of default but they get an extra two percentage points in interest, they're ahead of the game overall—like a casino, which is happy to lose big sums every so often in return for profits most of the time.
Bond investors also invest in pools of hundreds or even thousands of mortgages. The potential sums involved are staggering: Americans now owe more than $11 trillion on their homes. But mortgage pools are messier than most bonds. There's no guaranteed interest rate, since the amount of money homeowners collectively pay back every month is a function of how many have refinanced and how many have defaulted. There's certainly no fixed maturity date: Money shows up in irregular chunks as people pay down their mortgages at unpredictable times—for instance, when they decide to sell their house. And most problematic, there's no easy way to assign a single probability to the chance of default.
Wall Street solved many of these problems through a process called tranching, which divides a pool and allows for the creation of safe bonds with a risk-free triple-A credit rating. Investors in the first tranche, or slice, are first in line to be paid off. Those next in line might get only a double-A credit rating on their tranche of bonds but will be able to charge a higher interest rate for bearing the slightly higher chance of default. And so on.
The reason that ratings agencies and investors felt so safe with the triple-A tranches was that they believed there was no way hundreds of homeowners would all default on their loans at the same time. One person might lose his job, another might fall ill. But those are individual calamities that don't affect the mortgage pool much as a whole: Everybody else is still making their payments on time.
But not all calamities are individual, and tranching still hadn't solved all the problems of mortgage-pool risk. Some things, like falling house prices, affect a large number of people at once. If home values in your neighborhood decline and you lose some of your equity, there's a good chance your neighbors will lose theirs as well. If, as a result, you default on your mortgage, there's a higher probability they will default, too. That's called correlation—the degree to which one variable moves in line with another—and measuring it is an important part of determining how risky mortgage bonds are.
Investors like risk, as long as they can price it. What they hate is uncertainty—not knowing how big the risk is. As a result, bond investors and mortgage lenders desperately want to be able to measure, model, and price correlation. Before quantitative models came along, the only time investors were comfortable putting their money in mortgage pools was when there was no risk whatsoever—in other words, when the bonds were guaranteed implicitly by the federal government through Fannie Mae or Freddie Mac.
Yet during the '90s, as global markets expanded, there were trillions of new dollars waiting to be put to use lending to borrowers around the world—not just mortgage seekers but also corporations and car buyers and anybody running a balance on their credit card—if only investors could put a number on the correlations between them. The problem is excruciatingly hard, especially when you're talking about thousands of moving parts. Whoever solved it would earn the eternal gratitude of Wall Street and quite possibly the attention of the Nobel committee as well.
To understand the mathematics of correlation better, consider something simple, like a kid in an elementary school: Let's call her Alice. The probability that her parents will get divorced this year is about 5 percent, the risk of her getting head lice is about 5 percent, the chance of her seeing a teacher slip on a banana peel is about 5 percent, and the likelihood of her winning the class spelling bee is about 5 percent. If investors were trading securities based on the chances of those things happening only to Alice, they would all trade at more or less the same price.
But something important happens when we start looking at two kids rather than one—not just Alice but also the girl she sits next to, Britney. If Britney's parents get divorced, what are the chances that Alice's parents will get divorced, too? Still about 5 percent: The correlation there is close to zero. But if Britney gets head lice, the chance that Alice will get head lice is much higher, about 50 percent—which means the correlation is probably up in the 0.5 range. If Britney sees a teacher slip on a banana peel, what is the chance that Alice will see it, too? Very high indeed, since they sit next to each other: It could be as much as 95 percent, which means the correlation is close to 1. And if Britney wins the class spelling bee, the chance of Alice winning it is zero, which means the correlation is negative: -1.
If investors were trading securities based on the chances of these things happening to both Alice and Britney, the prices would be all over the place, because the correlations vary so much.
But it's a very inexact science. Just measuring those initial 5 percent probabilities involves collecting lots of disparate data points and subjecting them to all manner of statistical and error analysis. Trying to assess the conditional probabilities—the chance that Alice will get head lice if Britney gets head lice—is an order of magnitude harder, since those data points are much rarer. As a result of the scarcity of historical data, the errors there are likely to be much greater.
In the world of mortgages, it's harder still. What is the chance that any given home will decline in value? You can look at the past history of housing prices to give you an idea, but surely the nation's macroeconomic situation also plays an important role. And what is the chance that if a home in one state falls in value, a similar home in another state will fall in value as well?
Probability
Specifically, this is a joint default probability—the likelihood that any two members of the pool (A and B) will both default. It's what investors are looking for, and the rest of the formula provides the answer.
Survival times
The amount of time between now and when A and B can be expected to default. Li took the idea from a concept in actuarial science that charts what happens to someone's life expectancy when their spouse dies.
Equality
A dangerously precise concept, since it leaves no room for error. Clean equations help both quants and their managers forget that the real world contains a surprising amount of uncertainty, fuzziness, and precariousness.
Copula
This couples (hence the Latinate term copula) the individual probabilities associated with A and B to come up with a single number. Errors here massively increase the risk of the whole equation blowing up.
Distribution functions
The probabilities of how long A and B are likely to survive. Since these are not certainties, they can be dangerous: Small miscalculations may leave you facing much more risk than the formula indicates.
Gamma
The all-powerful correlation parameter, which reduces correlation to a single constant—something that should be highly improbable, if not impossible. This is the magic number that made Li's copula function irresistible.