It's more technical. Because the method call traffic is not plain in nature. A burst will come when a customer keeps sending requests for hours while other customer sends request maybe one per minute. As a result, most customers will encounter longer repsonse time during the burst.
We must regulate the traffic before we delivery the service. Right?
I think the main cause is that, the customer calls our service through
servlet, which is designed for www without Qos in mind. It only looks at requests, not the user. All clients are the same.
What we are going to do is add one more layer on top of servlet, to ensure the Qos or SLA for each user. Currently the SLA is very simple, only Transaction Per Second (TPS) will be specified. Later we will consider to add more.
Originally posted by Ilja Preuss:
I don't fully agree - if it's cheaper to impose the limit on users and the customer can live with it, it might well be the way to go. With other words, I think it's a business decision, not a technical decision.