Timeout connecting to lookup-domain-daemon.pl

#1 Thu, 10/27/2016 - 10:46
morpheus747


I have many problems with lookup-domain.pl and lookup-domain-daemon.pl. For some reason the server is under a very heavy, CPU-intensive load from this process.

I tried to solve the situation by disabling antispam and antivirus through the virtualmin command, but that did not fix the high load. When a mail reaches procmail-wrapper, which executes lookup-domain to find out whether the user has the resources to receive the mail, the server starts crashing...

I don't "really" receive that many mails, but when I search the procmail log I see the connection timeout messages.
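To tell a slow accept apart from a slow lookup, one option is to time a bare TCP connection to the daemon port. This is only a minimal sketch: it assumes the daemon listens on 127.0.0.1:11000 as in my netstat output, the function name `time_connect` is made up for illustration, and it does not speak the daemon's own protocol, so it measures the connect step and nothing else.

```python
import socket
import time


def time_connect(host="127.0.0.1", port=11000, timeout=5.0):
    """Return the seconds needed to open a TCP connection, or None on failure."""
    start = time.monotonic()
    try:
        # create_connection raises OSError on refusal and on timeout.
        with socket.create_connection((host, port), timeout=timeout):
            return time.monotonic() - start
    except OSError:
        return None
```

If the connect itself returns in a few milliseconds while procmail still logs timeouts, the delay is more likely inside the lookup than in reaching the port.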

From some investigation I see that if I kill the procmail-wrapper (lookup-domain.pl children), the mail starts to flush one message at a time. But sometimes the queue starts to grow and I have 200... 300... 500 mails in the active queue, and the clients start asking why one mail takes 2 or 3 hours to reach the account.

I tested lookup-domain both with the one-at-a-time and with the parallel option. I upgraded my webmin/virtualmin to the latest version, but I can't find the reason for the timeouts, or why all the mails that reach procmail-wrapper get stuck in the lookup-domain process.

Based on some forum posts, lookup-domain should not be a process-intensive task. Its only use is to get the virtualmin account information (is spam filtering activated? quota information, etc.), so I don't understand WHY it uses so much CPU and WHY I get those timeout messages.

netstat shows that port 11000 is open and "working???":

netstat -nap | grep 11000
tcp  0  0  127.0.0.1:11000  0.0.0.0:*        LISTEN       18283/lookup-domain
tcp  0  0  127.0.0.1:47288  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47377  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47327  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47257  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47408  127.0.0.1:11000  ESTABLISHED  18884/perl
tcp  0  0  127.0.0.1:47419  127.0.0.1:11000  ESTABLISHED  18937/perl
tcp  0  0  127.0.0.1:47407  127.0.0.1:11000  ESTABLISHED  18886/perl
tcp  0  0  127.0.0.1:47372  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47255  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47308  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47227  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47210  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:11000  127.0.0.1:47419  ESTABLISHED  18942/lookup-domain
tcp  0  0  127.0.0.1:47214  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47187  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:11000  127.0.0.1:47406  ESTABLISHED  18898/lookup-domain
tcp  0  0  127.0.0.1:47266  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47351  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47406  127.0.0.1:11000  ESTABLISHED  18889/perl
tcp  0  0  127.0.0.1:47437  127.0.0.1:11000  ESTABLISHED  19038/perl
tcp  0  0  127.0.0.1:47303  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:11000  127.0.0.1:47407  ESTABLISHED  18900/lookup-domain
tcp  0  0  127.0.0.1:11000  127.0.0.1:47437  ESTABLISHED  18283/lookup-domain
tcp  0  0  127.0.0.1:11000  127.0.0.1:47409  ESTABLISHED  18903/lookup-domain
tcp  0  0  127.0.0.1:47216  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47389  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47256  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47197  127.0.0.1:11000  TIME_WAIT    -
tcp  0  0  127.0.0.1:47409  127.0.0.1:11000  ESTABLISHED  18891/perl
tcp  0  0  127.0.0.1:11000  127.0.0.1:47410  ESTABLISHED  18910/lookup-domain
tcp  0  0  127.0.0.1:47410  127.0.0.1:11000  ESTABLISHED  18893/perl
tcp  0  0  127.0.0.1:11000  127.0.0.1:47408  ESTABLISHED  18902/lookup-domain
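Since the raw listing is hard to read, here is a small sketch that tallies the connection states for the daemon port. It assumes the seven-column layout netstat printed for me (proto, recv-q, send-q, local, remote, state, pid/program); the name `count_states` and the `SAMPLE` constant are only for illustration, and the sample holds four lines copied from my output.

```python
from collections import Counter

# Four lines copied from the netstat output, used only as sample input.
SAMPLE = """\
tcp 0 0 127.0.0.1:11000 0.0.0.0:* LISTEN 18283/lookup-domain
tcp 0 0 127.0.0.1:47288 127.0.0.1:11000 TIME_WAIT -
tcp 0 0 127.0.0.1:47408 127.0.0.1:11000 ESTABLISHED 18884/perl
tcp 0 0 127.0.0.1:11000 127.0.0.1:47408 ESTABLISHED 18902/lookup-domain
"""


def count_states(netstat_text, port=11000):
    """Count TCP states on lines whose local or remote address uses the port."""
    counts = Counter()
    suffix = ":%d" % port
    for line in netstat_text.splitlines():
        fields = line.split()
        # Skip anything that is not a tcp line with at least a state column.
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue
        local, remote, state = fields[3], fields[4], fields[5]
        if local.endswith(suffix) or remote.endswith(suffix):
            counts[state] += 1
    return counts


if __name__ == "__main__":
    for state, n in sorted(count_states(SAMPLE).items()):
        print(state, n)
```

Counting my full listing the same way gives 1 LISTEN, 14 ESTABLISHED and 18 TIME_WAIT. Each live connection shows up twice (once from the perl client side, once from the lookup-domain side), so that is 7 lookups open at the same moment.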

Based on the procmail logs I can deliver 2 or 3 mails every 20 seconds, which is SOOO SLOW. We have servers with similar resources running cpanel or zimbra that can deliver 30 or 40 mails per second, so there is no way this is normal.
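To put those numbers side by side, a rough back-of-the-envelope calculation of how long a backlog takes to drain at a given delivery rate. It is a simplification: it assumes no new mail arrives while the queue drains, and `drain_minutes` is just a helper name I made up.

```python
def drain_minutes(queued, mails, per_seconds):
    """Minutes needed to deliver `queued` mails at `mails` per `per_seconds`."""
    rate = mails / per_seconds  # mails per second
    return queued / rate / 60


if __name__ == "__main__":
    # 500 queued mails at the rates seen in my procmail logs.
    print(drain_minutes(500, 3, 20))
    print(drain_minutes(500, 2, 20))
    # The same queue at 30 mails per second, as on the other servers.
    print(drain_minutes(500, 30, 1))
```

At 2 or 3 mails every 20 seconds a 500-mail queue needs roughly 56 to 83 minutes to clear even with nothing new coming in, while at 30 mails per second it would be gone in well under a minute.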