Upload
others
View
9
Download
0
Embed Size (px)
Citation preview
Last Update: v1.0 - January 2011
6WINDGate™
White Paper
Packet Processing Software for Wireless Infrastructure
6WINDGate White Paper
Packet Processing Software for Wireless Infrastructure
Performance Challenges for Wireless Networks As advanced services proliferate and video consumes an ever-increasing share of wireless network capacity, the requirements for high-performance processing of network traffic will continue to grow dramatically. Each piece of equipment in the network must achieve higher levels of packet processing performance. At the same time, the equipment must be designed to meet challenging power, cost and schedule requirements.
This white paper discusses how specialized software, designed for high-performance processing of network packets and optimized for multicore processors, enables system designers to meet the conflicting goals of high traffic rates, low system power and minimum system cost. The explanation will be illustrated using real-world examples of 4G equipment based on multicore Intel® Architecture (IA) platforms and will leave readers with a good understanding of how to use advanced multicore packet processing techniques effectively in next-generation networking equipment.
Designers of 4G telecom infrastructure products, whether LTE or WiMAX, face challenging performance requirements that cannot be addressed with the same techniques that worked for 2G and 3G equipment.
Driven by high-bandwidth Internet applications, the total traffic in the core network is growing at over 100% per year, so service providers expect individual network elements such as packet gateways to provide at least a corresponding increase in bandwidth.
At the same time, telecom equipment is increasingly deployed in commercial and outdoor environments without forced-air cooling, placing severe restrictions on the number of high-performance processor subsystems that can be used.
Finally, equipment suppliers operate under ever more challenging cost constraints. These apply both to CAPEX, since low product cost is essential to support worldwide deployments of 4G networks, and to OPEX, where electrical power, both to run the equipment and for cooling, is a major contributor to the calculation of overall Total Cost of Ownership (TCO).
To be successful, developers of 4G networking equipment must deliver solutions that achieve maximum throughput for tomorrow’s network traffic patterns (dominated by video and data), while minimizing system-level power consumption and cost.
Core network traffic doubles every year
6WINDGate White Paper
Packet Processing Software for Wireless Infrastructure
Packet Processing Software Solutions For 4G networks, 3GPP has specified a flat IP-based network architecture (SAE: System Architecture Evolution) with the goal of efficiently supporting massive usage of IP services. As a consequence, the network architecture is much simpler than existing architectures such as 3G. However, as all the services (data, voice, video…) use IP packets, processing these packets efficiently becomes critical to ensure LTE system performance.
On top of the IP protocol itself (actually the two IP protocols as the SAE architecture supports both IPv4 and IPv6 versions), a large number of individual protocols have to be implemented:
• Low-level protocols such as IPsec (Internet Protocol Security), ROHC (Robust Header Compression) and VLAN (Virtual LAN).
• Within an overall 4G network, a number of protocols support communication between individual subsystems. For example, GTP (GPRS Tunneling Protocol) carries user data via IP tunnels between a Signaling Gateway (SGW) and a base station (eNodeB). Similarly, SCTP (Stream Control Transmission Protocol) implements signaling between the Mobility Management Entity (MME), the SGW and the eNodeB. Likewise, IPinIP, GRE (Generic Routing Encapsulation) or GTPu provide tunnel connections from the SGW to the Packet Gateway (PGW). And there are many more protocols that are used throughout the network.
• Differentiating the services is also critical. IP QoS is required to prioritize real-time traffic over pure data traffic. Similarly, packet inspection implements the mechanisms to identify the user traffic to provide a better service to users and/or applications.
All these protocols are encapsulated in IP packets. Starting from layer 2 protocols, packet processing software has to analyze successive encapsulated headers as fast as possible.
The critical performance challenge for 4G networking equipment is to process these IP packets at the highest possible throughput. In general, the designer’s objective is to perform this processing fast enough that the throughput of the equipment is limited, not by the packet processing performance, but by the speed of the physical network connection, typically 10Gb/s, 40Gb/s or, soon, 100Gb/s. If the processing throughput matches the speed of the network, the system is said to be performing at “wire-speed”, maximizing the efficiency of the equipment.
Over the past few years, developers of high-end processors migrated to multicore architectures in order to meet never-ending needs for increased performance in networking equipment and a constant evolution of customized protocols. The traditional processor design approach of continually increasing clock frequencies in order to boost performance led to prohibitive processor power consumption, since power is proportional to the square of the clock frequency. The industry adopted multicore architectures in which the cores run at a clock frequency that leads to manageable power consumption for the processor as a whole.
Today, all processors used in high-performance networking products are based on multicore architectures. These platforms provide the ideal environment for implementing the high-performance packet processing that is required for 4G equipment.
For developers of networking equipment, selecting a multicore processor for their system is only one step in designing a high-performance system solution. Generally, the more complex question is how to architect the software which, as explained above, typically needs to process packets from multiple streams of network traffic at wire-speed.
Packet processing fundamentals
A standand is sthreads,passing throughsupport complexeight coGTPu-tothe pote
A supermulticorinto twoincominoverheacomplexnecessasignalingfunction
A multwell-suitthis karchiteccores crunningorder overall system,is requithe OSand theplane.
In pracvarious and fastsystem that, sindedicaterecent edynamicthe cont
SplittingapplicatExistingfaster benvironm
Pac
dard networkingsubject to sign, timers and lo
through the hput. Furthermo
multicore archx packet proceores may not o-GRE encapsuential packet pr
rior solution is re architectureo layers. The log packets out
ads that degrax processing aary manageg and c
ns.
ticore processted to implemkind of socture. Most ocan be dedica the fast pato maximizethroughput o
while only onired to run thS networking e application’s c
ctice, the desigsoftware elem
t path), decidinworkload. Unt
nce the cores red exclusively evolution towacally as traffic trol plane and
g the networktion software, applications d
because the ument.
ket Process
g stack uses snificant overheocking. These system, resu
ore, although hitectures, perfessing such as process packe
ulations. All in arocessing perfo
provided by ss. In a well-deower layer, typtside the OS ade overall peare forwarded ement, control
sor is menting oftware of the ted to
ath, in e the of the ne core he OS,
stack control
gner will analyments in the syng on the mosttil recently, therunning the fasto the fast p
rds a hybrid fapatterns chanthe fast path.
king stack in which interfac
do not need to nderlying pack
sing Softwa
services provideeads associated
processing ovulting in a masome improvemformance fails required by 4
ets significantlyall, a standard ormance of a m
specialized pacesigned implempically called thenvironment aerformance. Oto the OS n
yze the specifystem (applicat appropriate ale only restrictiost path were ru
path and not sast path modelge in order to
this way hasces to the sam
be rewritten oket processing
are for Wire
ed by the oped with functionverheads are imajor performanments can be to scale linearl
4G and a procey faster than OS stack does
multicore proce
cket processingmentation, the he fast path, pand without inOnly those raretworking stac
fic performanctions, control llocation of coron when configunning outsideshared with otl, the system c share the CPU
s no impact me OS networkor recertified, b
is accelerated
6WINWhit
eless Infrast
rating Operatinns such as premposed on eance penalty fmade to an Oy over multipleessor with, forone with two
s a poor job of essor.
g software optnetworking sta
processes the mncurring any ore packets thack, which per
ce requirementplane, networ
res to balance tguring the pla
e the OS, they ther software.can now be recU resources al
on the functiking stack as pbut they run sid through the
DGate te Paper tructure
ng System eemptions, ach packet for overall
OS stack to e cores for r example,
cores for exploiting
timized for ack is split majority of of the OS at require rforms the
ts for the king stack the overall tform was had to be With the configured located to
onality of previously. gnificantly fast path
The p
gold stanpacket pro
s
dard for ocessing software
In a typ(PGW) standardoptimizefast patthe proseven to
This masystem with thbased imeet syunachieusing abreakthdirectly cost.
The 6Warchitec
for a hiBy remomultiplenetwork
Architecstacks aapplicatmulticorproprietsystem
When configur
Pac
IntroducIA Platfo
pical 4G applicaor switching
d OS networed packet procth concept, thcessor subsysto ten times.
assive increaswill be able to e same hardwimplementationystem throughpevable on a sina standard Oroughs in systeinto improvem
WINDGate™ paccture describe
igh-performancoving the neede suppliers, 6Wking equipment
cturally, 6WINand is fully-comtions, such as Lre system usintary or third-pcost and time-
installed on ared at run-time
ket Process
ing 6WINrms
ation such as agateway (SG
king stacks acessing softwahe networking tems will typic
e in performamanage 7x to
ware. This typn can allow tput goals that ngle multicore OS stack. Them performanc
ments in energ
cket processinged above and
ce wireless infrd for developerWINDGate hast by up to twelv
DGate is a dmpatible with LTE or WiMAX ng 6WINDGateparty applicatioto-market adva
a multicore Ie to make the o
sing Softwa
NDGate™
packet gatewaGW), when thare replaced bre based on thperformance
cally increase b
ance means tho 10x more usepe of fast patthe designer may have beeprocessor whe
hese compellince also transla
gy efficiency an
g software prod has been
equipment comprehenscontrol planpath, 6WINsingle-vendo
rastructure plars to integrate been proven ve months.
rop-in replacestandard Linuxapplications, w
e. This allows ons while fullantages provid
ntel® Architeoptimum use o
are for Wire
™ Softwar
ay he by he of by
he ers th-to en en ng ate nd
oduct implemendeployed by
providers ive set of proe, the networ
NDGate providor solution for aatform based onetworking softo accelerate
ement for stanx application A
will run unchanOEMs to presey benefitting ed by 6WINDG
ecture platformf the number c
6WINWhit
eless Infrast
re for Mu
nts the type ofwireless infrworldwide.
otocols availabrking stack anddes developerall the protocolon multicore teftware compon the time-to-m
ndard Linux nAPIs. Any exisnged when migerve their invefrom the per
Gate.
m, 6WINDGatecores available.
DGate te Paper tructure
lticore
f fast path rastructure
With a le for the d the fast s with a ls required echnology. nents from market for
networking sting Linux grated to a estment in rformance,
e can be .
Scalm
eable, extulticore so
arch
tensible oftware itecture
In the efollows:
•
•
It’s impmulti-prrequired
The follconfigur
Pac
example show
One core is cowell as the 6W
The remainingmakes full useDevelopment K
portant to note rocessor architd level of perfo
lowing section ration.
ket Process
wn below, the
onfigured to ruWINDGate contr
g five cores aree of processor-Kit (Intel® DPD
that the 6WINtectures. Multi
ormance (both
explains the s
sing Softwa
six-core Intel®
un Linux and throl plane and th
e configured to-specific servicDK) software.
NDGate softwaiple processoron fast path an
system-level p
are for Wire
® Xeon® proc
he LTE or WiMhe 6WINDGate
o run the 6WINes provided by
are is fully extes can be connd Linux protoc
performance ac
6WINWhit
eless Infrast
cessor E5645 i
MAX applicatione networking st
NDGate fast pay the Intel® D
ensible, with snfigured to prcols).
chieved by this
DGate te Paper tructure
is used as
n stack, as tack;
ath, which Data Plane
upport for rovide the
s software
p
Industry-packet pro
performamulti
pro
-leading ocessing ance on icore IA
ocessors
As a stasoftwareforwardplatform
6WIND 6WINDGE5645. Forwardthat thethe fastconstrai
Clearly, functionideal foplatform
(Of couthe oveof the p
A typicatunneliFor thisprocess each cobandwidXeon® handle t
Pac
6WINDGInfrastruarting-point fore provides foring performan
m.
has recently dGate packet pr
On this plading performane performance t path (subjectints).
though, 4G wns far beyond or this becausems.
rse, processingrall performan
protocols used.)
al system incluling, flow acc workload, an around 2.5 M
ore is thereforedth per LTE usprocessor E56
the traffic of ap
ket Process
ate Perfocture Equr understandingr wireless infrance. This can
emonstrated 1rocessing softwatform, 6WINDnce on a singlescales linearly t, of course, to
wireless infrastbasic IP forwae it includes a
g a large numce measured i)
udes a fast patcounting and Intel® Xeon®
Mpps/core. Asse able to proceser is typically645 core runnipproximately 8
sing Softwa
ormance uipmentg the system-lastructure equ
be used to
10Gb/s Etherneware running oDGate deliver
e core, while thaccording to t
o any finite th
tructure equiprding. The 6Wwide selection
ber of protocon packets per
th implementatQoS conditio
® processor E56suming the ave
ess traffic at a y around 1.2 Ming the 6WIND,860 active use
are for Wire
in Wirele
evel performanipment, it’s inevaluate the
et IP forwardingon a 2.4 GHz Irs approximathe architecture the number ofhroughput limit
pment requiresWINDGate packn of protocols
ols requires mosecond decrea
tion of VLAN, ioner functions645 core runnierage size of arate of 10.6 G
Mbps, this implDGate packet pers.
6WINWhit
eless Infrast
ess
nce that the 6Wstructive to exraw capabiliti
g performanceIntel® Xeon® ely 13.9 Mppof the softwar
f cores configuts imposed by
s the implemeket processing
optimized for
ore processor cases with the c
IP forwardins, all using 6Wing at 2.4 GHzn IP packet is 5
Gbps. Since thies that a singprocessing sof
DGate te Paper tructure
WINDGate xamine IP es of the
e using the processor
ps of IP re ensures red to run hardware
ntation of solution is multicore
cycles and complexity
ing, GTPu WINDGate. z is able to 512 bytes,
he average gle Intel® ftware can
620
0,000 active LTE users
Based oprocessable to
With the62,000
Typical such asgatewayincludesblades, for syste
The mnumberservice users su
Using thblade-leis straperformequippp
Assuminprocessmaximuconfigurcalculatwith 10active process
Pac
on this analysor E5645 blademanage aroun
e new generatactive LTE use
telecom infras packet gatewys, comprise s multiple isupported by
em manageme
most relevant r from the pprovider is th
upported by a c
he analysis of evel performancightforward t
mance (or user ped chassis.
ng blades baseors running 6W
um available red to run theions show that0 blades willLTE users (ors).
ket Process
sis, a telecome with five cored 44,000 active
tion of eight-cors (assuming s
astructure sysways and swit
a chassis dentical proccommon reso
ent, power and
LTE performoint of view he total numbcomplete chass
processor-levece outlined aboto extrapolatecapacity) of a
ed on Intel® XWINDGate, witnumber of
e fast path, st a chassis equl handle 440six-core proce
sing Softwa
blade based es configured te LTE users.
ore processors,seven cores run
stems, tching
that cessor ources I/O.
mance of a
ber of sis.
el and ove, it e the fully-
Xeon® th the cores
simple uipped 0,000 essors) or 620
are for Wire
on a single to run the 6WI
, the performanning the fast p
0,000 active
6WINWhit
eless Infrast
six-core Intel®NDGate fast pa
ance per blade path).
LTE users (
DGate te Paper tructure
® Xeon® ath will be
will reach
(eight-core
6WINDGate White Paper
Packet Processing Software for Wireless Infrastructure
6WINDGate Benefits Summarized This white paper has illustrated some of the key benefits that the 6WINDGate packet processing software provides for developers of high-performance 4G wireless equipment. These benefits include:
• Optimized support for industry-leading multicore processor platforms such as the Intel® Xeon® processor E5645 running the Intel® Data Plane Development Kit (Intel® DPDK) software;
• Portable software architecture, eliminating any dependency on a single processor or CPU vendor;
• Best-in-class packet processing performance, delivering seven to ten times the performance of standard OS networking stacks, enabling the development of 4G networking equipment that meets challenging performance requirements;
• Comprehensive set of 40+ optimized networking protocols, ideally suited to 4G equipment, eliminates the need to integrate networking software components from multiple suppliers;
• Full compatibility with standard OS APIs simplifies software development, integration and migration;
• Built-in support for High-Availability frameworks enables Carrier Grade system reliability;
• Full compatibility with all commercial Linux distributions for maximum flexibility in software platform design;
• Award-winning technology with best-in-class technical support, already deployed in 4G networking equipment by tier-1 OEMs worldwide.
Conclusions 4G equipment needs to achieve a breakthrough level of packet processing performance in order to provide advanced services for high numbers of users. While multicore processor platforms are capable of delivering impressive raw performance, standard OS networking stacks cannot reach the necessary throughput. The 6WINDGate software achieves a 7x to 10x improvement in packet processing performance and enables OEMs to meet 4G performance requirements. Because 6WINDGate is compatible with standard APIs and includes a comprehensive suite of optimized networking protocols, developers can accelerate their time-to-market by up to twelve months while reusing their existing software.
For more information, please visit www.6wind.com.
Deployed by tier-1 OEMs worldwide in 4G networking equipment