1

I'm unsure how the softmax function comes about. I understand that the sigmoid function is a result of phrasing the log-odds as a linear equation and then rearranging the formula to model for the probability of an event occurring, but I'm not sure how to do this for softmax. Can someone point me in the right direction?

  • Are you dissatisfied with the explanation in https://en.wikipedia.org/wiki/Softmax_function ? – Andreas Nov 10 '17 at 18:13
  • @Andreas the answer wasn't really contained in there as it just states "this is a composition of K linear functions", however, the explanation I was looking for was presented here: https://en.wikipedia.org/w/index.php?title=Logistic_regression&oldid=755697139#As_a_.22log-linear.22_model – tryingtolearn Nov 13 '17 at 09:59

1 Answers1

0

Answer on how logistic regression is extended to the multinomial case is presented here:

https://en.wikipedia.org/w/index.php?title=Logistic_regression&oldid=755697139#As_a_.22log-linear.22_model