When detecting outliers with the IQR method, we compute a range and flag any point outside that range as an outlier (definitions below). Is it 'mathematically' acceptable to change the 1.5 to 2 so that fewer points are flagged as outliers for a particular dataset? Or does this break an established convention?
Additionally, does the data need to follow a normal distribution to use this method?
- IQR: Q3 – Q1
- Upper bound: Q3 + (1.5 * IQR)
- Lower bound: Q1 – (1.5 * IQR)
- Outlier = outside of range [Lower, Upper]
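
To make the question concrete, here is a minimal sketch of the rule with the multiplier exposed as a parameter `k`, taking the lower bound from Q1 as in the standard Tukey fences. The function name `iqr_outliers`, the sample data, and the choice of the "inclusive" quartile method are my own assumptions for illustration; other quartile conventions can give slightly different bounds.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return (lower, upper, outliers) for the IQR rule with multiplier k.

    Assumption: quartiles use the 'inclusive' interpolation method;
    other conventions may shift the bounds slightly.
    """
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower = q1 - k * iqr  # lower bound is measured from Q1
    upper = q3 + k * iqr  # upper bound is measured from Q3
    outliers = [v for v in values if v < lower or v > upper]
    return lower, upper, outliers

# Hypothetical sample data, chosen only to show the effect of k.
data = [2, 4, 5, 6, 7, 8, 9, 10, 18, 40]

print(iqr_outliers(data, k=1.5))  # (-1.5, 16.5, [18, 40])
print(iqr_outliers(data, k=2.0))  # (-3.75, 18.75, [40])
```

On this sample, raising the multiplier from 1.5 to 2 widens the range, so 18 is no longer flagged and only 40 remains.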