Suppose we have a complex vector space $V$.
A norm is a function $f : V \rightarrow \mathbb{R}$ which satisfies
(i) $f(x) \ge 0$ for all $x \in V$ (Positivity - Non-Negativity)
(ii) $f(x + y) \le f(x) + f(y)$ for all $x, y \in V$ (Subadditivity - Triangle Inequality)
(iii) $f(\lambda x) = |\lambda|f(x)$ for all $\lambda \in \mathbb{C}$ and $x \in V$ (Positive Homogeneity)
(iv) $f(x) = 0$ if and only if $x = 0$ (Definiteness)
From these properties, we define the $l_p-norm$
$$ ||x||_p = \left(\sum \limits_{i=1}^{n}|x_i|^p \right)^\frac{1}{p}$$
My questions relate to why these four properties were chosen for this definition:
(1) If norm is just a measure, why do we need all four of these properties?
(2) Do these properties allow for a nice geometric interpretation of norms?
(3) Were these properties chosen to nicely allow for a nesting of the various norms? For example
$$ ||x||_\infty \le ||x||_2 \le ||x||_1$$
(4) Were these properties chosen to satisfy pairwise inequalities? For example $$ ||x||_1 \le n||x||_\infty$$
(5) Were these properties chosen because they lend themselves in other variants? For example, in $l_2$ for bi-infinite sequences, vector-valued sequences or general sequences.
(6) Do these properties allow an extension into other function spaces where all of the items above still hold? For example, $L_2$ or $L_p$ function spaces.
I haven't looked at norms in quite a while, so please forgive me if I am missing something very obvious.