Lewis Donzis
12/20/2008 12:13 PM
post19100
|
We're trying to make sure we understand how threads are used in the io-pkt native driver model, especially as compared
to io-net.
In io-net, we had no drivers that sent packets upstream in multiple threads from the same interface. In other words,
when an upstream filter received a packet, it would normally not be executing simultaneously with another thread
processing a packet from the same interface. Our use of multiple CPUs was essentially limited to one thread per
physical network interface.
In io-pkt, there is a chunk of code devoted to dispatching packets to multiple threads simultaneously and deciding
which thread to use for an incoming packet, which looks advantageous but may also create some challenges.
It also appears that things become essentially single-threaded once they get into the TCP/IP stack in that, if we
understand the code correctly, there can be only one "stack thread" at a time. If that's right, it would seem to
eliminate the benefit of dispatching packets into multiple threads, but solves other problems.
In our application, we like to see packets from a given source IP address in the same order they come into the machine.
In contemplating methods of getting multiple CPUs active in io-net, we had considered allowing this, but always
dispatching packets from the same source IP to the same CPU. In io-pkt, we don't have much control over how packets are
dispatched to threads, and we would like to be able to take advantage of multiple CPUs, especially as we move toward
supporting 10Gbps. But at the same time, if locks have to be used heavily for protection, the benefit of multiple CPUs
is reduced and in some cases the flow of execution becomes essentially single-threaded.
Getting to the point, my questions are whether these basic understandings are correct, whether there is any fundamental
explanation of "how threads are used in io-pkt", and, more importantly, what can we expect in terms of threading model
when processing packets from a pfil hook.
Our goal is to optimize performance to take advantage of multiple CPUs, but without having to lock so heavily that those
benefits are lost.
Thanks,
lew
|
|
|
Sean Boudreau(deleted)
12/20/2008 4:40 PM
post19102
|
On Sat, Dec 20, 2008 at 12:13:52PM -0500, Lewis Donzis wrote:
> We're trying to make sure we understand how threads are used in the io-pkt native driver model, especially as compared
to io-net.
>
> In io-net, we had no drivers that sent packets upstream in multiple threads from the same interface. In other words,
when an upstream filter received a packet, it would normally not be executing simultaneously with another thread
processing a packet from the same interface. Our use of multiple CPUs was essentially limited to one thread per
physical network interface.
>
> In io-pkt, there is a chunk of code devoted to dispatching packets to multiple threads simultaneously and deciding
which thread to use for an incoming packet, which looks advantageous but may also create some challenges.
Multiple threads can be handling interrupts, but only one
thread will be servicing a particular interrupt (struct
_iopkt_inter) at a particular time. From a driver's point
of view this should be similar to io-net. Notice for
example that there's often no mutexing in the receive path
as opposed to the tx path which can be tickled by multiple
threads at once.
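A minimal illustration of that locking split (a made-up device structure, not real io-pkt driver code):

```c
#include <pthread.h>

/* Hypothetical device context illustrating the point above: the tx
 * path can be entered by several threads at once, so shared tx state
 * needs a mutex; the rx path is driven by the single thread currently
 * servicing this interface's struct _iopkt_inter, so it often needs
 * no lock of its own. */
struct dev_sim {
    pthread_mutex_t tx_mutex;
    int             tx_queued;  /* shared: guarded by tx_mutex */
    int             rx_done;    /* touched only by the one rx thread */
};

void dev_sim_transmit(struct dev_sim *d)
{
    pthread_mutex_lock(&d->tx_mutex);
    d->tx_queued++;             /* e.g. place packet on the hw ring */
    pthread_mutex_unlock(&d->tx_mutex);
}

void dev_sim_receive(struct dev_sim *d)
{
    d->rx_done++;               /* no lock: single servicing thread */
}
```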
>
> It also appears that things become essentially single-threaded once they get into the TCP/IP stack in that, if we
understand the code correctly, there can be only one "stack thread" at a time. If that's right, it would seem to
eliminate the benefit of dispatching packets into multiple threads, but solves other problems.
This again is similar to io-net; the difference being that
in io-net there was a dedicated thread for "stack"
processing while in io-pkt a thread may switch from an
"interrupt processing" context to "stack" context. In
both cases there is only ever one "stack" thread.
>
> In our application, we like to see packets from a given source IP address in the same order they come into the
machine. In contemplating methods of getting multiple CPUs active in io-net, we had considered allowing this,
Allowing what exactly?
> but always dispatching packets from the same source IP to the same CPU. In io-pkt, we don't have much control over
how packets are dispatched to threads, and we would like to be able to take advantage of multiple CPUs, especially as we
move toward supporting 10Gbps. But at the same time, if locks have to be used heavily for protection, the benefit of
multiple CPUs is reduced and in some cases the flow of execution becomes essentially single-threaded.
Currently it depends on packet flow. Layer-two processing
(e.g. bridging), as well as the IP forwarding "ipflow" layer,
is more heavily optimised for multithreaded concurrency.
Things currently funnel into the stack thread for upper-layer
processing, but again this is similar to io-net.
>
> Getting to the point, my questions are whether these basic understandings are correct, whether there is any
fundamental explanation of "how threads are used in io-pkt", and, more importantly, what can we expect in terms of
threading model when processing packets from a pfil hook.
Again it depends on where the hook is located. The ones
in ether_input() / ether_output() can be tickled by
multiple threads at once, but ones in ip_input() /
ip_output() (for example) are in the "stack" thread
context.
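The mechanics can be pictured as a simple per-attachment-point hook chain. This is a toy model with made-up names, not the real pfil(9) interface; it only shows the call-chain shape, and which threads may run a hook depends entirely on which head it hangs from.

```c
#include <stddef.h>

/* Toy model of a pfil-style hook chain (made-up names): each
 * attachment point keeps a list of hooks and runs them over every
 * packet.  A hook hung from an "ether" head may run on several
 * threads at once; one hung from an "ip" head runs only in the
 * stack thread. */
#define SIM_MAX_HOOKS 8

typedef int (*sim_hook_fn)(void *pkt);      /* nonzero => drop packet */

struct sim_pfil_head {
    sim_hook_fn hooks[SIM_MAX_HOOKS];
    int         nhooks;
};

int sim_hook_add(struct sim_pfil_head *h, sim_hook_fn fn)
{
    if (h->nhooks >= SIM_MAX_HOOKS)
        return -1;
    h->hooks[h->nhooks++] = fn;
    return 0;
}

/* Run the chain; the first hook that rejects the packet drops it. */
int sim_hooks_run(struct sim_pfil_head *h, void *pkt)
{
    for (int i = 0; i < h->nhooks; i++) {
        if (h->hooks[i](pkt) != 0)
            return 1;           /* dropped */
    }
    return 0;                   /* passed */
}

/* Two trivial example hooks. */
static int sim_pass(void *pkt) { (void)pkt; return 0; }
static int sim_drop(void *pkt) { (void)pkt; return 1; }
```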
>
> Our goal is to optimize performance to take advantage of multiple CPUs, but without having to lock so heavily that
those benefits are lost.
>
> Thanks,
> lew
>
> _______________________________________________
> io-net migration
> http://community.qnx.com/sf/go/post19100
>
|
|
|
Lewis Donzis
12/21/2008 10:10 AM
post19107
|
> Multiple threads can be handling interrupts, but only one
> thread will be servicing a particular interrupt (struct
> _iopkt_inter) at a particular time. From a driver's point
> of view this should be similar to io-net. Notice for
> example that there's often no mutexing in the receive path
> as opposed to the tx path which can be tickled by multiple
> threads at once.
This does sound similar to io-net... multiple interrupt threads, but a given physical card's packets are not processed
by multiple threads in parallel?
> Allowing what exactly?
Allowing packets from a single physical card to be processed by multiple threads simultaneously.
> Again it depends on where the hook is located. The ones
> in ether_input() / ether_output() can be tickled by
> multiple threads at once, but ones in ip_input() /
> ip_output() (for example) are in the "stack" thread
> context.
We're using the eth_input_pfil_hook that you added for Murf.
Thanks,
lew
|
|
|
Robert Craig
12/21/2008 2:40 PM
post19110
|
Just to hijack this topic a bit, would it be worthwhile adding a generic
eth_output_pfil_hook that lets you hook into the output routine for all
packets just before they're sent to the driver? Right now, the hook
that's in there allows interface-specific hooks to be called, but not a
"one-shot catch-all" hook.
Robert.
|
|
|
Lewis Donzis
01/11/2009 8:44 PM
post19817
|
I'd probably defer that to Murf, our real driver expert, but it seems like such a hook certainly couldn't hurt and might
be useful. Just the other day we were talking about a test case where we wanted to generate a packet at the socket
layer and be able to see the resulting packet at the driver, after it's been through the routing table and ARP and
so on. Such a hook might be just the thing for that.
Thanks!
lew
|
|
|
Sean Boudreau(deleted)
01/12/2009 9:28 AM
post19840
|
On Sun, Jan 11, 2009 at 08:44:30PM -0500, Lewis Donzis wrote:
> I'd probably defer that to Murf, our real driver expert, but it seems like such a hook certainly couldn't hurt and
might be useful. Just the other day we were talking about a test case where we wanted to generate a packet at the
socket layer and be able to see the resulting packet at the driver, after it's been through the routing table and ARP
and so on. Such a hook might be just the thing for that.
You can see the packet now via the bpf taps, e.g. 'tcpdump -i en0'.
The hook Robert's referring to would also let one modify the
packet.
-seanb
|
|
|
|