A convex set is essentially a set that intersects every line in a line segment [1].
ELI5: the graph of a convex function is shaped like a cup ∪, whereas the graph of a concave function is shaped like a cap ∩.
Let $X$ be a convex subset of a real vector space and let $f: X \to \mathbb{R}$ be a function.
Then f is called convex iff either of the following equivalent conditions holds:
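- $\forall\, 0 \le t \le 1$ and $\forall\, x_1, x_2 \in X$: $f(t x_1 + (1-t) x_2) \le t f(x_1) + (1-t) f(x_2)$
- $\forall\, 0 < t < 1$ and $\forall\, x_1, x_2 \in X$ with $x_1 \ne x_2$: $f(t x_1 + (1-t) x_2) \le t f(x_1) + (1-t) f(x_2)$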
The inequality in this definition is also known as Jensen's inequality:
$$f(t x_1 + (1-t) x_2) \le t f(x_1) + (1-t) f(x_2)$$
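The defining inequality can be spot-checked numerically. The sketch below is only an illustration, not a proof: it samples random points and weights on an interval and reports whether any sample violates the inequality. The names `violates_convexity` and `looks_convex`, the interval [-10, 10], and the tolerance are assumptions made for this example.

```python
import random


def violates_convexity(f, x1, x2, t, tol=1e-9):
    """Return True if f breaks f(t*x1 + (1-t)*x2) <= t*f(x1) + (1-t)*f(x2).

    tol (an assumed value) absorbs floating-point rounding error.
    """
    lhs = f(t * x1 + (1 - t) * x2)
    rhs = t * f(x1) + (1 - t) * f(x2)
    return lhs > rhs + tol


def looks_convex(f, lo=-10.0, hi=10.0, trials=10_000, seed=0):
    """Random spot check on [lo, hi]; passing it is evidence, not a proof."""
    rng = random.Random(seed)
    for _ in range(trials):
        x1 = rng.uniform(lo, hi)
        x2 = rng.uniform(lo, hi)
        t = rng.random()  # t drawn from [0, 1)
        if violates_convexity(f, x1, x2, t):
            return False
    return True


def square(x):
    return x * x  # cup-shaped, convex


def neg_square(x):
    return -x * x  # cap-shaped, concave


print(looks_convex(square))
print(looks_convex(neg_square))
print(looks_convex(abs))
```

A function that passes is only "not refuted" on the sampled points, while a single violating triple (x1, x2, t) is enough to show that a function is not convex.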
In its probabilistic form, Jensen's inequality reads as follows:
Given a random variable $X$ and a convex function $\varphi$, then
$$\varphi(\mathbb{E}[X]) \le \mathbb{E}[\varphi(X)]$$
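As a small numerical illustration of the expectation form, the sketch below computes both sides for a discrete random variable. The helper names `expectation` and `jensen_gap` and the example distribution are made up for this illustration.

```python
import math


def expectation(values, probs, g=lambda x: x):
    """E[g(X)] for a discrete X taking values[i] with probability probs[i]."""
    return sum(p * g(x) for x, p in zip(values, probs))


def jensen_gap(phi, values, probs):
    """E[phi(X)] - phi(E[X]); non-negative whenever phi is convex."""
    return expectation(values, probs, phi) - phi(expectation(values, probs))


# Assumed example distribution: P(X=1)=0.5, P(X=2)=0.25, P(X=6)=0.25
values = [1.0, 2.0, 6.0]
probs = [0.5, 0.25, 0.25]

# For phi(x) = x^2 the gap equals the variance of X
gap_square = jensen_gap(lambda x: x * x, values, probs)
gap_exp = jensen_gap(math.exp, values, probs)

print(gap_square)  # 4.25, since E[X^2] = 10.5 and (E[X])^2 = 6.25
print(gap_exp > 0)
```

Choosing a two-point distribution with values x1, x2 and probabilities t, 1−t gives E[X] = tx1+(1−t)x2 and E[φ(X)] = tφ(x1)+(1−t)φ(x2), which recovers the two-point inequality from the definition of convexity.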