
Today I built a queue to spread the load of the 300,000 daily calculations, to prevent slow server response times when too many analysts are calculating at the same time.

On the first run on the server, I managed to push the server load to 120% and take us offline for 30 minutes.

The acceptance environment and production are on the same hardware.

Today was not a good day.

Comments
  • 2
    Learning experience. Any chance they'll separate out environments?
  • 1
    @dan-pud They are trying to save costs, so probably not.

    I still have to find out what happened, but I suspect several analysts were doing calculations at the same time I was doing a test run.

    Got everything back up and running; checking the logs will have to wait until tomorrow.
  • 0
    Yeah, not sure how the server did it, but using more memory than your RAM and swap combined is a problem. xD

    But everything is working perfectly now. The queue caps calculations at a maximum per minute, and the server is going strong at a stable 60% capacity.
  • 1
    @Letmecode Thank you!

    Fixed it and documented everything down to the last var. Wrote a user manual and told the client. Today was a really productive day for no apparent reason.

    Reading all of wk64 must have been a real positive boost :p
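
For anyone wondering what "balancing calculations to a max per minute" could look like, here is a minimal sketch. The thread never shows the real implementation or even the language, so everything here is an assumption: Python, the class name `ThrottledQueue`, the `max_per_minute` parameter, and the single-worker design are all hypothetical. It simply refuses to start more than a fixed number of jobs in any 60-second window.

```python
import collections
import time


class ThrottledQueue:
    """FIFO queue that starts at most `max_per_minute` jobs per 60-second window.

    Hypothetical sketch; not the implementation from the post.
    """

    def __init__(self, max_per_minute, clock=time.monotonic, sleep=time.sleep):
        self.max_per_minute = max_per_minute
        self.clock = clock    # injectable so the throttle can be tested
        self.sleep = sleep
        self.jobs = collections.deque()     # pending callables, oldest first
        self.started = collections.deque()  # start times inside the window

    def submit(self, job):
        """Queue a zero-argument callable for later execution."""
        self.jobs.append(job)

    def run(self):
        """Run all queued jobs in order, pausing whenever the cap is reached."""
        results = []
        while self.jobs:
            now = self.clock()
            # Forget starts that have left the 60-second window.
            while self.started and now - self.started[0] >= 60:
                self.started.popleft()
            if len(self.started) >= self.max_per_minute:
                # Window is full: wait until the oldest start expires.
                self.sleep(60 - (now - self.started[0]))
                continue
            self.started.append(now)
            results.append(self.jobs.popleft()())
        return results


if __name__ == "__main__":
    queue = ThrottledQueue(max_per_minute=100)
    for n in range(3):
        # Stand-in for a real calculation.
        queue.submit(lambda n=n: n * n)
    print(queue.run())
```

The cap trades throughput for headroom: the work takes longer to drain, but a test run can no longer pile on top of whatever the analysts are already doing. A real setup would probably also want a memory or load check, since the outage described above came from memory exhaustion rather than job count alone.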