Strange behaviour unbound server
Eduard Ahmatgareev
e.ahmatgareev at gmail.com
Thu Jul 11 08:34:10 UTC 2019
Hi everyone,
I faced with intersting issue with unbound server and couldn't figure out
without your help
We used unbound as primary dns resolver in our aws infrastructure, but from
time to time unbound server is not responding to queries from our clients
Also I found by tcpdump and wireshark a lot of retransmission DNS requests
from clients in the subnets.
But this issue present periodically, our clients get timeout issue during
the day.
from 100 queries, timeout can be get for 3-8 queries.
For debug I used command:
perf trace -p $(pidof unbound) --duration=10
and got next:
13.285 (599.741 ms): unbound/15943 epoll_pwait(epfd:
54<anon_inode:[eventpoll]>, events: 0x564955c6ae10, maxevents: 128,
timeout: -1, sigsetsize: 8) = -1 EINTR Interrupted system call
616.016 (94.403 ms): unbound/15943 epoll_pwait(epfd:
54<anon_inode:[eventpoll]>, events: 0x564955c6ae10, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
710.662 (130.206 ms): unbound/15943 epoll_pwait(epfd:
54<anon_inode:[eventpoll]>, events: 0x564955c6ae10, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
616.649 (224.502 ms): unbound/15952 epoll_pwait(epfd:
42<anon_inode:[eventpoll]>, events: 0x7faea89ea7f0, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
850.606 (112.947 ms): unbound/15952 epoll_pwait(epfd:
42<anon_inode:[eventpoll]>, events: 0x7faea89ea7f0, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
13.453 (1160.129 ms): unbound/15951 epoll_pwait(epfd:
37<anon_inode:[eventpoll]>, events: 0x7faea47ca3e0, maxevents: 64, timeout:
-1, sigsetsize: 8) = 1
840.904 (335.113 ms): unbound/15943 epoll_pwait(epfd:
54<anon_inode:[eventpoll]>, events: 0x564955c6ae10, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
710.891 (465.469 ms): unbound/15950 epoll_pwait(epfd:
36<anon_inode:[eventpoll]>, events: 0x7faeac8b2680, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
13.769 (1174.857 ms): unbound/15954 epoll_pwait(epfd:
48<anon_inode:[eventpoll]>, events: 0x7fae98747c20, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
1176.048 (17.121 ms): unbound/15943 epoll_pwait(epfd:
54<anon_inode:[eventpoll]>, events: 0x564955c6ae10, maxevents: 128,
timeout: -1, sigsetsize: 8) = -1 EINTR Interrupted system call
1175.740 (21.495 ms): unbound/15951 epoll_pwait(epfd:
37<anon_inode:[eventpoll]>, events: 0x7faea47ca3e0, maxevents: 64, timeout:
-1, sigsetsize: 8) = 1
1177.587 (19.955 ms): unbound/15950 epoll_pwait(epfd:
36<anon_inode:[eventpoll]>, events: 0x7faeac8b2680, maxevents: 128,
timeout: 264, sigsetsize: 8) = 1
1196.914 (11.097 ms): unbound/15954 epoll_pwait(epfd:
48<anon_inode:[eventpoll]>, events: 0x7fae98747c20, maxevents: 128,
timeout: -1, sigsetsize: 8) = 1
our infra:
ec2: c5.2xlarge (16gb mem, 8cores, 60gb gp2)
dist: amazon linux 2
unbound-libs-1.6.6-1.amzn2.0.2.x86_64
unbound-python-1.6.6-1.amzn2.0.2.x86_64
unbound-1.6.6-1.amzn2.0.2.x86_64
conf:
server:
verbosity: 1
num-threads: 8
statistics-interval: 0
extended-statistics: yes
statistics-cumulative: no
msg-cache-slabs: 4
rrset-cache-slabs: 4
infra-cache-slabs: 4
key-cache-slabs: 4
rrset-cache-size: 100m
msg-cache-size: 50m
so-rcvbuf: 4m
so-sndbuf: 4m
so-reuseport: yes
outgoing-range: 8192
num-queries-per-thread: 4096
do-daemonize: no
prefetch: yes
rrset-roundrobin: yes
logfile: ""
use-syslog: no
directory: "/etc/unbound"
chroot: ""
log-queries: no
access-control: 0.0.0.0/0 allow
interface: 0.0.0.0
interface-automatic: yes
port: 53
do-ip4: yes
do-ip6: no
do-udp: yes
do-tcp: yes
username: "unbound"
pidfile: "/var/run/unbound/unbound.pid"
root-hints: /etc/unbound/root.hints
key-cache-size: 32m
local-zone: "10.in-addr.arpa." nodefault
remote-control:
control-enable: yes
any ideas?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.nlnetlabs.nl/pipermail/unbound-users/attachments/20190711/33972baf/attachment.htm>
More information about the Unbound-users
mailing list