We use Fail2Ban to protect your servers from malicious activity, and load testing servers often trigger security blocks. As such, before you start load testing, please SSH to your servers and disable Fail2Ban temporarily by issuing
fail2ban-client stop. Once your testing is done, you can start it again with
Your application performance is dependant on many factors, such as code efficiency, server performance, networking and so on. Answering these two questions helps us gauge your requirements and recommend a custom solution:
- What is the desired traffic level you want to be able to handle (e.g. users/sec)?
- What is the average response time of your application?
As an example, an application that takes 1 second to respond and which is running on a server with 1 worker (thread) would need 1,000 servers to handle 1,000 concurrent connections. If your servers have 10 workers, you would need 100 servers. The more workers you have, the more resources you require, and consequently the more concurrent requests you can handle.
Preparing for server load
There are a number of steps you can take when expecting an increase in server load. It's always best to scale before you need it - the relative cost for more servers for a few days is worth more than the potential lost traffic, and you can always scale down easily.
- Add a load balancer to your application, and point your DNS to it. It will distribute traffic across your web servers, and offers benefits such as maximizing throughoutput, minimizing response times and avoiding overload on any single server.
- Scale your application servers horizontally to the extent that you require.
- Monitor your servers to ensure that they are able to serve your users.