I'm doing some independent reading on the Lebesgue measure and I have the following questions:
1) on the definition of a Lebesgue measurable set: Let $E \subseteq \mathbb{R} $. On Wikipedia, $$\mu(E) = \mu^*(E) \iff \mu^*(A) = \mu^*(A\cap E) + \mu^*(A\cap E^c) \, \, \text{for every} \, \, A \subseteq \mathbb{R}.$$ However on another text, the definition of Lebesgue measurable is defined as $$ \mu(E) = \mu^*(E) \iff \mu^*(A) = \mu^*(A\cap E) + \mu^*(A ^c\cap E) \, \, \text{for every} \, \, A \subseteq \mathbb{R}.$$ The subtle, yet seemingly important, distinction is which intersection we're looking at. Is this distinction important? Are these two definitions equivalent? If so, could you please explain the compatibility?
2) on the motivation for such a definition: why is Lebesgue measurable defined in this way? Why not stop at the definition of $\mu^*(E)$? What is the motivation for using $E$ as a "partitioning" (for a lack of better words) set? And why do we require this for every $A \subseteq \mathbb{R}$?
3) on the existence of counter-examples: What are some common examples of non-Lebesgue measurable sets? Where does the importance of defining measurable in the above way(s) appear in these examples? I.e. what is an example of a set in which defining measurability differently fails to capture the "desired" property?