This might be a "please do my homework" question, except I'm a humble developer trying to predict system behavior rather than a student...
I have N devices issuing potentially expensive HTTP requests; each request takes, say, 2 seconds to complete on average. Each device makes requests at a certain average rate, say one every 3 minutes.
My server infrastructure can only handle M simultaneous requests, and I need to understand how likely requests are to fail due to "too many concurrent requests".
Initially it's tempting to simplify this to "events are generated randomly at a certain average rate and last a certain average duration; what is the probability of a certain number of simultaneous events in a given period?" But that doesn't seem accurate. At a minimum, it misses the fact that the population of request-issuing agents is finite: as long as the number of devices N is less than or equal to the simultaneous-request threshold M, the probability of exceeding the threshold is clearly 0...
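To make the finite-population point concrete, here is a rough sketch rather than a definitive model. It assumes the devices act independently and that, at a random instant, each one is mid-request with probability equal to its busy fraction (taken here as 2 s out of every 180 s, which is my reading of "every 3 minutes"). Under those assumptions the number of concurrent requests is binomial, and the tail is exactly 0 whenever N <= M. The function name and the example values N = 1000, M = 20 are made up for illustration.

```python
from math import comb


def prob_more_than(n_devices, m_limit, busy_fraction):
    """P(more than m_limit of n_devices are mid-request at a random instant).

    Assumption: devices are independent and each is busy with
    probability busy_fraction (a binomial snapshot model).
    """
    if n_devices <= m_limit:
        # Fewer devices than slots: the limit can never be exceeded.
        return 0.0
    return sum(
        comb(n_devices, k)
        * busy_fraction**k
        * (1 - busy_fraction) ** (n_devices - k)
        for k in range(m_limit + 1, n_devices + 1)
    )


# Numbers from the question: 2 s per request, one request per 180 s.
# N = 1000 and M = 20 are hypothetical values chosen for illustration.
# For much larger N, a log-space or library implementation would be safer.
busy = 2 / 180
print(prob_more_than(1000, 20, busy))
```

Note that this is the chance that a snapshot of the system shows more than M requests in flight, which is related to, but not the same as, the chance that a given request is rejected.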
This seems like something that people must have figured out already, but I can't seem to find anything.
Is the best (only?) way to test this to actually run simulations?
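For reference, the kind of simulation I have in mind would look something like the sketch below. It is only an illustration under assumptions of my own: each device alternates between an exponentially distributed idle gap and an exponentially distributed request duration, requests over the limit are not rejected (the code just measures how much of the time more than M are in flight), and all names and parameters are hypothetical.

```python
import random


def simulate_overload_fraction(n_devices, m_limit, mean_gap_s,
                               mean_duration_s, horizon_s, seed=0):
    """Estimate the fraction of time with more than m_limit requests in flight.

    Assumption: idle gaps and request durations are exponential with the
    given means, and devices are independent of each other.
    """
    rng = random.Random(seed)
    events = []  # (time, +1) when a request starts, (time, -1) when it ends
    for _ in range(n_devices):
        t = 0.0
        while True:
            t += rng.expovariate(1.0 / mean_gap_s)  # idle until next request
            if t >= horizon_s:
                break
            end = min(t + rng.expovariate(1.0 / mean_duration_s), horizon_s)
            events.append((t, 1))
            events.append((end, -1))
            t = end

    events.sort()  # at equal times, ends (-1) sort before starts (+1)
    active = 0
    overloaded = 0.0
    last_t = 0.0
    for t, delta in events:
        if active > m_limit:
            overloaded += t - last_t  # time spent above the limit
        active += delta
        last_t = t
    return overloaded / horizon_s


# A 178 s idle gap plus a 2 s request gives one request per 180 s on average.
# N = 1000 and M = 20 are hypothetical values chosen for illustration.
print(simulate_overload_fraction(1000, 20, 178.0, 2.0, horizon_s=3600.0))
```

The seed is fixed so that runs are repeatable; a longer horizon or several seeds would be needed before trusting the estimate for rare overloads.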