profile pic
⌘ '
raccourcis clavier

80% of the outcomes are due to 20% of causes, only Pareto distributions with shape value α=log45=1.16\alpha=\log_4 5=1.16 reflect this.

in machine learning, we can do feature ablation based on its Pareto distribution.

definition

if XX is a random variable with Pareto distribution (Type I), then the survival function is given by:

F(x)=Pr(X>x)={(xmx)α,xxm,1,x<xm.\overline{F(x)} = Pr(X > x) = \begin{cases} \displaystyle \left(\frac{x_m}{x}\right)^\alpha, & x \ge x_m,\\[1em] 1, & x < x_m. \end{cases}

where xmx_m is the (necessarily positive) minimum possible value of XX, and α\alpha is a positive parameter.

improvement

also: Pareto efficiency 1

when a change in allocation of good harms no one and benefits at least one person

a state is Pareto-optimal if there is no alternative state where at least one participant’s well-being is higher, and nobody else’s well-being is lower.

  • If a state change satisfies this, then the new state is Pareto improvement
  • When no Pareto improvement is possible, then it is Pareto optimum.

zero-sum game

every outcome is Pareto-efficient.

Remarque

  1. Pareto originally used the word “optimal”, but Pareto’s concept more closely aligns with an idea of “efficiency”.

    Because it does not identify a single “best” (optimal) outcome. Instead, it only identifies a set of outcomes that might be considered optimal, by at least one person.