4

I wanted to make my own problem and solve it to learn some new stuff --

enter image description here

Let's say we have some type of standardized test, where a rough data curve depicts $P(x)=x^2e^{-\frac{x}{14}}$ which is the number of people who got that score.

Now, we have $\displaystyle \int_0^{100}P(x)\,dx = 5342.0$ approximately.

So, we scale our probabilities so we have $\displaystyle p(x)=\frac{1}{5342}x^2e^{-\frac{x}{14}}$.

Then, to get the mean, we evaluate $\displaystyle \int_0^{100}xp(x)\,dx$, right? I don't think we have to divide by $100$ because it's a weighted average and we have $p(x)$ being the weight of $x$.

This would give us $39.928$, which kind of makes sense from the graph!

Then, I think that variance would be $\displaystyle \int_0^{100}(x-39.928)^2p(x)\,dx=434.55$

Then, standard deviation would equal to $\sqrt{434.55}=\boxed{20.84}$ Are all my steps correct? What can I do with a standard deviation? Does the $68-95-99.7$ rule apply?

UPDATE

I now let $X$ range from $0$ to $+\infty$, and the results are really nice!

We have the density $p(x)=\displaystyle \frac{1}{5488}x^2e^{-\frac{x}{14}}$

We have mean $42$.

We have variance $588$.

We have standard deviation $\sqrt{588}$.

2 Answers2

3

Yes, you have computed the quantities correctly. (But you made a minor typo in the denominator of $p(x)$).

As for the 68-95-99.7 rule, that applies to the Gaussian distribution and you can't always apply it to other distributions. You can compute the quantiles of this distribution, though. If you want the $95$-th percentile, just solve $$ \int_0^xp(t)dt = 0.95 $$ for $x.$ As for whether or not it will be approximately two standard deviations from the mean, that depends how much the distribution looks like a Gaussian. In this case it comes pretty close, but the relation has to break down for three standard deviations... that would be over 100.

3

Your formulas are correct and your results should also be correct should you make all your computations correct.

68-95-99.7 rule does not apply here because it is defined for the Gaussian distribution. Here you have a single tail..