A continuous variable with a long tail distribution

MR-dice-icon.png Download a pdf copy of this help file  here

See also: Splicing Distributions window

What do we mean by a long-tailed distribution? One distribution is said to have a longer tail than another if its probability density (or mass) function is (asymptotically) larger than the other distribution's for very large values of the variable, i.e. for two distributions A and B:

image63.gif

Many socioeconomic and other natural random variables take long-tailed distributions. Examples are city population sizes, occurrences of natural resources (e.g. size of reserves in a certain geological region), stock price fluctuations, size of companies, income.

MR_window_Splice_1.jpg
The Splicing Distributions window

The most commonly fitted distribution to the extreme of such data has been the Pareto. There is no decent theory to explain why the Pareto distribution tends to fit the tails of long-tailed variables, but most people accept that it works and use it anyway.

The Pareto is usually a poor fit for the main body of the variable, though. Thus, when modelling long-tailed distributions one usually does so using a splice of one distribution (like the Lognormal, or Gamma, for example), with a Pareto distribution to model the tail.

In ModelRisk you can use the Splicing Distributions window to splice two distributions together.