OpenVZ Forum: Devel » [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace

Home » Mailing lists » Devel » [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace [message #21573 is a reply to message #21572]

Thu, 11 October 2007 07:57

ebiederm
Messages: 1354
Registered: February 2006

Senior Member

"Denis V. Lunev" <den@sw.ru> writes:

>> This patchset does need to get rebased on top of net-2.6.25 when it
>> opens and hopefully your patchset to remove the unnecessary work in
>> rtnl_unlock, and to really process netlink requests in process
>> context.  I see a need for the more fundamental change you seem to
>> be advocating.

Grr.  That last sentence should have been I do not see a need for the more
fundamental change you seem to be advocating.

> I see that current patchset of RTNL code is attached. I'll start the
> next piece of work next week after some reaction from people.

> I understand your position. But still have some points :)
>
> First. A real-life usecase we have recently fixed in the OpenVZ. I have
> described it in the previous post, but (may be) not in great details.
>
> UDP buffers management for outgoing traffic.
> The packet carries over a destructor, which is called in skb_orphan in
> the real networking device. This destructor allows to queue more
> packets. There is no problem in the current implementation. But there is
> one after namespace introduction in the following configuration:
>     [NS1] <-> [NS2] <-> [world]
> There are two namespaces on the host. One of them is connected to the
> outside world via another and the packet follows usual forwarding path
> in NS2.

Yes.  We can't quite do that today because of the missing ipv4 support,
but the basic infrastructure exists to describe this is already
merged.

> The problem: if we call skb_orphan in [NS1] outgoing device, there is a
> great possibility to have a really huge packet drop in [NS2] on queuing
> operation.

Yes.  Removing skb_orphan from veth to solve this looks like a
pretty substantial hack.  A clean solution is more likely to resemble
ethernet pause frames so we temporarily plug the virtual device.
Although there are issues with that as currently virtual devices
don't have queues.

> In OpenVZ we have added skb_orphan into receive path (tcp_v4_rcv,
> __udp4_lib_rcv etc) and removed one from our virtual devices. As
> unfortunate side effect we have packet in NS2 with a socket from NS1.

That also does some really nasty things to accounting.  Using
the macvlan devices solve all of this rather neatly and with
higher performance.

> So, we should have an architectural solution for this from a beginning.
> It will be too late to hack around and change namespace lookup scheme.

However what you are specifically concerned about seems to be
using skb->sk->sk_net.  Currently the only place I use this
is in the rtnetlink code because the functions that process packets
are not also passed a socket.  Those functions we could easily
pass in an explicit namespace or a socket parameter, and so those
functions would not care.

> I see that we should adopt a generic approach:
> - each concrete packet belongs to a concrete namespace
> - if function has a packet as a parameter, we should get namespace from
> packet

This approach when suggested in earlier review was pretty
substantially shot down.  Shrinking the size of struct sk_buff is
currently an ongoing todo for the linux networking stack.

> - if skb->dev is present we should get namespace as skb->dev->nd_net
> - otherwise we should get skb->sk->sk_net. If skb->sk is NULL -> this is
>  a bug

Currently killing skb->dev is a todo list item, so using it more is
probably the wrong approach.

I do agree the duplication information between fields on the skb
and fields passed to functions is a pain.  Moving more onto the
skb seems contrary to the how the rest of the networking stack would
like to go, so it is likely better to simply remove skb->dev and pass
an explicit dev parameter instead.  Please note that parameters passed
explicitly will live in registers and will be quite fast.

> So, for the case of all netfilter calls we'll have a pskb->dev->nd_net
> defined correctly.

That would certainly be a nice extra, but that really seems the wrong
way to go.  I would rather add an explicit struct net *net parameter
to nf_hookfn. 

Eric

Report message to a moderator

[Message index]

		[PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: ebiederm on Sat, 29 September 2007 01:00
		[PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: ebiederm on Sat, 29 September 2007 01:02
		[PATCH 3/5] net: Make the netlink methods in rtnetlink handle multiple network namespaces By: ebiederm on Sat, 29 September 2007 01:04
		[PATCH 4/5] net: Make AF_PACKET handle multiple network namespaces By: ebiederm on Sat, 29 September 2007 01:07
		[PATCH 5/5] net: Make AF_UNIX per network namespace safe. By: ebiederm on Sat, 29 September 2007 01:08
		Re: [PATCH 5/5] net: Make AF_UNIX per network namespace safe. By: Patrick McHardy on Sat, 29 September 2007 15:47
		Re: [PATCH 5/5] net: Make AF_UNIX per network namespace safe. By: ebiederm on Sat, 29 September 2007 17:03
		Re: [PATCH 5/5] net: Make AF_UNIX per network namespace safe. By: Patrick McHardy on Sat, 29 September 2007 17:50
		Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: Patrick McHardy on Sat, 29 September 2007 15:44
		Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: ebiederm on Sat, 29 September 2007 16:51
		Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: Patrick McHardy on Sat, 29 September 2007 17:48
		Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: ebiederm on Sat, 29 September 2007 21:00
		Re: Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: dlunev on Sun, 30 September 2007 13:13
		Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: Patrick McHardy on Sun, 30 September 2007 15:39
		Re: Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: den on Mon, 01 October 2007 08:26
		Re: Re: [PATCH 2/5] net: Make rtnetlink infrastructure network namespace aware By: ebiederm on Mon, 01 October 2007 08:45
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: den on Wed, 10 October 2007 12:33
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: Daniel Lezcano on Wed, 10 October 2007 14:05
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: den on Wed, 10 October 2007 14:29
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: ebiederm on Wed, 10 October 2007 19:37
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: den on Thu, 11 October 2007 06:35
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: ebiederm on Thu, 11 October 2007 07:57
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: den on Thu, 11 October 2007 12:17
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: ebiederm on Thu, 11 October 2007 17:08
		Re: [PATCH 1/5] net: Modify all rtnetlink methods to only work in the initial namespace By: dlunev on Thu, 11 October 2007 17:26

Previous Topic:	[PATCH][NETNS] Make ifindex generation per-namespace
Next Topic:	[PATCH RFC] cgroups: implement device whitelist cgroup+lsm

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

]

Current Time: Wed Sep 18 08:22:57 GMT 2024

Total time taken to generate the page: 0.04725 seconds