Message ID | 53B39DB8.2060306@huawei.com |
---|---|
State | New |
Headers | show |
From: Ding Tianhong <dingtianhong@huawei.com> Date: Wed, 2 Jul 2014 13:50:48 +0800 > The problem was triggered by these steps: > > 1) create socket, bind and then setsockopt for add mc group. > mreq.imr_multiaddr.s_addr = inet_addr("255.0.0.37"); > mreq.imr_interface.s_addr = inet_addr("192.168.1.2"); > setsockopt(sockfd, IPPROTO_IP, IP_ADD_MEMBERSHIP, &mreq, sizeof(mreq)); > > 2) drop the mc group for this socket. > mreq.imr_multiaddr.s_addr = inet_addr("255.0.0.37"); > mreq.imr_interface.s_addr = inet_addr("0.0.0.0"); > setsockopt(sockfd, IPPROTO_IP, IP_DROP_MEMBERSHIP, &mreq, sizeof(mreq)); > > 3) and then drop the socket, I found the mc group was still used by the dev: > > netstat -g > > Interface RefCnt Group > --------------- ------ --------------------- > eth2 1 255.0.0.37 > > Normally even though the IP_DROP_MEMBERSHIP return error, the mc group still need > to be released for the netdev when drop the socket, but this process was broken when > route default is NULL, the reason is that: > > The ip_mc_leave_group() will choose the in_dev by the imr_interface.s_addr, if input addr > is NULL, the default route dev will be chosen, then the ifindex is got from the dev, > then polling the inet->mc_list and return -ENODEV, but if the default route dev is NULL, > the in_dev and ifIndex is both NULL, when polling the inet->mc_list, the mc group will be > released from the mc_list, but the dev didn't dec the refcnt for this mc group, so > when dropping the socket, the mc_list is NULL and the dev still keep this group. > > v1->v2: According Hideaki's suggestion, we should align with IPv6 (RFC3493) and BSDs, > so I add the checking for the in_dev before polling the mc_list, make sure when > we remove the mc group, dec the refcnt to the real dev which was using the mc address. > The problem would never happened again. > > Signed-off-by: Ding Tianhong <dingtianhong@huawei.com> Applied, thank you. -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 2014/7/8 12:31, David Miller wrote: > From: Ding Tianhong <dingtianhong@huawei.com> > Date: Wed, 2 Jul 2014 13:50:48 +0800 > >> The problem was triggered by these steps: >> >> 1) create socket, bind and then setsockopt for add mc group. >> mreq.imr_multiaddr.s_addr = inet_addr("255.0.0.37"); >> mreq.imr_interface.s_addr = inet_addr("192.168.1.2"); >> setsockopt(sockfd, IPPROTO_IP, IP_ADD_MEMBERSHIP, &mreq, sizeof(mreq)); >> >> 2) drop the mc group for this socket. >> mreq.imr_multiaddr.s_addr = inet_addr("255.0.0.37"); >> mreq.imr_interface.s_addr = inet_addr("0.0.0.0"); >> setsockopt(sockfd, IPPROTO_IP, IP_DROP_MEMBERSHIP, &mreq, sizeof(mreq)); >> >> 3) and then drop the socket, I found the mc group was still used by the dev: >> >> netstat -g >> >> Interface RefCnt Group >> --------------- ------ --------------------- >> eth2 1 255.0.0.37 >> >> Normally even though the IP_DROP_MEMBERSHIP return error, the mc group still need >> to be released for the netdev when drop the socket, but this process was broken when >> route default is NULL, the reason is that: >> >> The ip_mc_leave_group() will choose the in_dev by the imr_interface.s_addr, if input addr >> is NULL, the default route dev will be chosen, then the ifindex is got from the dev, >> then polling the inet->mc_list and return -ENODEV, but if the default route dev is NULL, >> the in_dev and ifIndex is both NULL, when polling the inet->mc_list, the mc group will be >> released from the mc_list, but the dev didn't dec the refcnt for this mc group, so >> when dropping the socket, the mc_list is NULL and the dev still keep this group. >> >> v1->v2: According Hideaki's suggestion, we should align with IPv6 (RFC3493) and BSDs, >> so I add the checking for the in_dev before polling the mc_list, make sure when >> we remove the mc group, dec the refcnt to the real dev which was using the mc address. >> The problem would never happened again. >> >> Signed-off-by: Ding Tianhong <dingtianhong@huawei.com> > > Applied, thank you. > Hi david: Can you queue this patch to stable, this problem is existed in every earlier kernel version, and could be triggered easily, I think we should fix it for every stable tree. Ding > . > -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
From: Ding Tianhong <dingtianhong@huawei.com> Date: Tue, 8 Jul 2014 12:47:16 +0800 > Can you queue this patch to stable, this problem is existed in every > earlier kernel version, and could be triggered easily, I think we > should fix it for every stable tree. Done. -- To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/net/ipv4/igmp.c b/net/ipv4/igmp.c index 17d34e3..7fb6565 100644 --- a/net/ipv4/igmp.c +++ b/net/ipv4/igmp.c @@ -1944,6 +1944,10 @@ int ip_mc_leave_group(struct sock *sk, struct ip_mreqn *imr) rtnl_lock(); in_dev = ip_mc_find_dev(net, imr); + if (!in_dev) { + ret = -ENODEV; + goto out; + } ifindex = imr->imr_ifindex; for (imlp = &inet->mc_list; (iml = rtnl_dereference(*imlp)) != NULL; @@ -1961,16 +1965,14 @@ int ip_mc_leave_group(struct sock *sk, struct ip_mreqn *imr) *imlp = iml->next_rcu; - if (in_dev) - ip_mc_dec_group(in_dev, group); + ip_mc_dec_group(in_dev, group); rtnl_unlock(); /* decrease mem now to avoid the memleak warning */ atomic_sub(sizeof(*iml), &sk->sk_omem_alloc); kfree_rcu(iml, rcu); return 0; } - if (!in_dev) - ret = -ENODEV; +out: rtnl_unlock(); return ret; }
The problem was triggered by these steps: 1) create socket, bind and then setsockopt for add mc group. mreq.imr_multiaddr.s_addr = inet_addr("255.0.0.37"); mreq.imr_interface.s_addr = inet_addr("192.168.1.2"); setsockopt(sockfd, IPPROTO_IP, IP_ADD_MEMBERSHIP, &mreq, sizeof(mreq)); 2) drop the mc group for this socket. mreq.imr_multiaddr.s_addr = inet_addr("255.0.0.37"); mreq.imr_interface.s_addr = inet_addr("0.0.0.0"); setsockopt(sockfd, IPPROTO_IP, IP_DROP_MEMBERSHIP, &mreq, sizeof(mreq)); 3) and then drop the socket, I found the mc group was still used by the dev: netstat -g Interface RefCnt Group --------------- ------ --------------------- eth2 1 255.0.0.37 Normally even though the IP_DROP_MEMBERSHIP return error, the mc group still need to be released for the netdev when drop the socket, but this process was broken when route default is NULL, the reason is that: The ip_mc_leave_group() will choose the in_dev by the imr_interface.s_addr, if input addr is NULL, the default route dev will be chosen, then the ifindex is got from the dev, then polling the inet->mc_list and return -ENODEV, but if the default route dev is NULL, the in_dev and ifIndex is both NULL, when polling the inet->mc_list, the mc group will be released from the mc_list, but the dev didn't dec the refcnt for this mc group, so when dropping the socket, the mc_list is NULL and the dev still keep this group. v1->v2: According Hideaki's suggestion, we should align with IPv6 (RFC3493) and BSDs, so I add the checking for the in_dev before polling the mc_list, make sure when we remove the mc group, dec the refcnt to the real dev which was using the mc address. The problem would never happened again. Signed-off-by: Ding Tianhong <dingtianhong@huawei.com> --- net/ipv4/igmp.c | 10 ++++++---- 1 file changed, 6 insertions(+), 4 deletions(-)