I've often seen as an explanation for shanon entropy that it represents "the average number of yes/no questions needed to find out which state a given system is in."
This works very well for a system having an even number of possible states, but if I try to do this with let's say, 3 possible states all equiprobable I don't find this to be true. Indeed, using the entropy, $ H = \sum_{n = 1}^{3}\frac{1}{3}\log_2(\frac{1}{3}) = 1.58$ I find an average number of 1.58 yes/no questions. If I encode the $1^{st}$ state as 0, the $2^{nd}$ state as 1 and the $3^{rd}$ as 00, the average number of bits (questions) that I need to characterize the system is $\frac{4}{3}=1.33$.
Did I make a mistake somewhere or does it mean that this explanation with the yes/no questions is just a way to simplify things ?