As it's used on the first page, "utilization" is just the fraction of time that ...

_qwfv · on July 3, 2019

This throws me though. Because if a service center can process one unit per minute, and the average arrival rate is also one unit per minute, I'd assume the utilization was 1 (or 100%). In that scenario I wouldn't expect the queue to be infinitely long. What insight am I missing?

panda88888 · on July 3, 2019

One way to look at is that the utilization is capped at 1, while arrival rate is not.

Assume the processing center can process one unit per minute.

Let's start with a simple case--constant arrival rate at 1 per minute and constant processing rate at 1 per minute.

If the arrival is constant (ie no variation), then indeed the queue size would be 0, because the arrival coincides with completion of processing of previous arrival by the service center. The expected queue wait time is 0.

Now let's look at what happens if arrival rate is variable but average to 1 per minute.

We know the steady state utilization is 1. Because if the utilization is less than 1, the processing center is processing at less than 1 per minute, which means new arrivals would start to fill the queue until the utilization approaches and reaches 1. In other words, the processing center is always busy processing at steady state.

Now, how do we estimate the queue size? Intuitively, we can think about the inter-arrival time. If the next unit arrives at less than 1 minute after the previous, this means the queue grows (since the processing center needs 1 minute to process each unit). Conversely, if the unit arrives more than 1 minute after the previous, the queue shrinks. The number of units in the system (queue + processing center) = queue size + 1. The processing center is always busy, i.e. = 1, and the queue size is lower-bounded at 0 but not upper-bounded. Let p(i) be the probability that the system has i units, at steady state we can express it as,

if i == 0, p(i) = 0

if i > 0, p(i) = p(i-1) * p(inter-arrival time < 0.5) + p(i+1) * p(inter-arrival time > 0.5)

Basically, this says the system will always have at least 1 unit (processing center is always busy), and probability of having i in the system is dependent on probability of previous states i-1 and i+1, and the inter-arrival probability. If the queue size is not bounded, it turns out the p(i) is close to uniform (transition probability from i to i+1 or i-1 is the same at each i, each i feeds into adjacent states, and only boundary condition on i=0), especially for larger i because the boundary condition at i=0 has less effect. And the expected value of a (roughly) uniform distribution that spans (0, inf) is, well, infinite!

Feel free to correct me if I am wrong. I haven't touched probability and queueing theory in a long time.