
www.elsevier.com/locate/compeleceng

Computers and Electrical Engineering 31 (2005) 152–165

Performance improvement of dynamic buffered ATM switch

Abdulhamit Subasi a,*, Etem Koklukaya b

a Department of Electrical and Electronics Engineering, Kahramanmaraş Sütçü İmam University, 46100 Kahramanmaraş, Turkey

b Department of Electrical and Electronics Engineering, Sakarya University, 54187 Sakarya, Turkey

Received 16 April 2003; received in revised form 1 November 2004; accepted 26 January 2005

Available online 26 April 2005

Abstract

Performance of ATM networks depends on switch performance and architecture. This paper presents a simulation study of a new dynamic allocation of input buffer space in ATM switching elements. The switching elements are composed of input and output buffers, which are used to store received and forwarded cells, respectively. Efficient and fair use of buffer space in an ATM switch is essential to obtain high throughput and low cell loss from the network. In this paper, the input buffer space of each switching element is allocated dynamically as a function of the traffic load. A shared buffer pool is provided with a threshold-based virtual partition among input ports, which supplies the necessary input buffer space as required by each input port. The system behaviour under varying traffic loads has been investigated using a simulation program. A comparison with a static allocation scheme shows that the threshold-based dynamic buffer allocation scheme ensures increased network throughput and a fair share of the buffer space even under bursty loading conditions.

© 2005 Elsevier Ltd. All rights reserved.

Keywords: ATM networks; Dynamic buffered switch; Dynamic threshold updating; Performance evaluation

0045-7906/$ - see front matter © 2005 Elsevier Ltd. All rights reserved.

doi:10.1016/j.compeleceng.2005.01.001

* Corresponding author. Tel.: +90 344 251 23 15x282; fax: +90 344 251 23 42.

E-mail addresses: [email protected] (A. Subasi), [email protected] (E. Koklukaya).

A. Subasi, E. Koklukaya / Computers and Electrical Engineering 31 (2005) 152–165 153

1. Introduction

Telecommunication networks based on the Broadband Integrated Services Digital Network (B-ISDN) can support a variety of multimedia services such as voice, video and data. The Asynchronous Transfer Mode (ATM) is envisaged as the basic transfer mode for implementing the B-ISDN and is based on a cell switching technique. To guarantee the speed and the quality of service (QoS) of CBR services, the transported units, known as cells, have a fixed length of 53 octets. Since the switching technique is based on the cell switching principle, the events (arrival of cells and their onward transmission) occur only at regularly spaced points in time [1,2].

In the last decade or so, considerable research effort has gone into the development of ATM switches. Many architectures have been proposed over the years [3–11], whereby hardware implementation often turned out to be the major design constraint, due to the high speeds at which these switches must operate. Concurrently, a large number of articles have been devoted to the performance analysis of the proposed architectures [10–22]. In many instances, simplifying assumptions had to be made to make the analysis feasible and/or tractable. Mostly, these assumptions pertain not only to the switch itself but also to the traffic offered to the switch [15].

High-speed cell switches have been proposed that utilize buffers to prevent arriving cells from being lost in ATM networks. Buffers can be located at input ports or output ports, and can be dedicated to each input port or shared among all input ports. It has been concluded that completely shared buffering achieves optimal throughput-delay performance. However, input-queued shared-buffer switches suffer from a fairness problem: under overload, a single input port can take over most of the buffer, preventing cells destined for less utilized ports from gaining access [14]. To overcome this problem, we add an output buffer to the switch module in addition to the input buffer.

Two kinds of buffer-sharing restriction have been proposed for shared-buffer switches. The first, the threshold scheme, limits the buffer space available to any individual input port: a threshold is assigned to each input port, and an arriving cell is accepted only if the current queue length is smaller than the threshold. Only comparators and counters are needed, so the threshold scheme is simple to implement [4,14,15].

The second kind of buffer-sharing restriction is the pushout scheme. An arriving cell is allowed to enter the buffer as long as the shared buffer is not full. When the buffer is full, an incoming cell may still enter by taking the place of a cell that is already in the buffer; for fairness, the cell at the head of the longest queue among the input ports is selected to be discarded first. Note that the pushing and pushed cells may belong to different input ports. The pushout scheme performs better than the threshold scheme; indeed it is optimal, because no input queue is ever starved for space and no space is ever held idle while some queue needs more. Its drawback is that the buffer management is too complicated to implement in the switch: when the shared buffer is full, writing a cell into a queue involves complicated cell-sequence preservation [4,14,15].

Choudhury [4] proposed a dynamic threshold scheme, whose key idea is that the maximum permissible queue length of any individual queue at any instant is proportional to the unused buffer space. The dynamic threshold scheme thus combines aspects of the threshold scheme and of the pushout scheme.

In this paper, we propose dynamic input buffer allocation with a dynamic threshold scheme in a cell-switched ATM network. This scheme achieves higher throughput and fewer lost cells than the fixed input buffer suffers from at high loads. We have developed a simulation program of the proposed scheme and compared it to a static, fixed-size input buffer. The paper is organized as follows. Section 2 addresses some major issues of the ATM switch; Section 3 presents the dynamic buffer management scheme; Section 4 describes the simulation study; Section 5 provides results and discussion; finally, Section 6 draws our conclusions on this study.
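The dynamic threshold idea described above can be sketched in a few lines. The following is an illustrative sketch only, not the authors' implementation: the function name, the proportionality factor `alpha`, and the data layout are assumptions of this sketch.

```cpp
#include <cassert>
#include <numeric>
#include <vector>

// Choudhury-style dynamic threshold check (illustrative sketch).
// The permissible queue length T is proportional to the currently
// unused buffer space: T = alpha * (totalBuffer - sum of queue lengths).
bool acceptCell(const std::vector<int>& queueLens, int port,
                int totalBuffer, double alpha) {
    int used = std::accumulate(queueLens.begin(), queueLens.end(), 0);
    double threshold = alpha * (totalBuffer - used);
    // A cell is admitted only while its queue is below the dynamic threshold.
    return queueLens[port] < threshold;
}
```

Because the threshold shrinks as the pool fills, a heavily loaded port is throttled automatically while lightly loaded ports retain access to free space.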

2. Dynamic buffer model description

An ATM switch contains a set of input ports and output ports, through which it is interconnected to users, other switches, and other network elements. It might also have other interfaces to exchange control and management information with special-purpose networks. Conceptually, the switch is only assumed to perform cell relay and to support control and management functions. It is useful to adopt a functional block model to simplify the discussion of the various design alternatives.

cell transport, connection control and network management. An ATM switch must provide thesame functions of routing and buffering as traditional switches, but at a much higher speed. Stud-ies in recent years into the architecture of ATM switches have considered several bufferingschemes [8].Under bursty traffic, statistical multiplexing of ATM cells lead to severe cell loss. Large buffers

are costly, create excessive cell delay and increase switch cost. Thus an appropriate number of buf-fers must be allocated in a switch and an optimal utilization of them must be made. The goal is toaccommodate more incoming traffic cells from various sources and smooth out the burst arrivalrate while limiting the overhead of the switch to a predetermined size. A feasible solution is a dy-namic buffering strategy, which applies a threshold-based sharing policy to the buffers and a prior-ity-based cell push-out policy to either buffered or incoming cells [15,23,24].Choice of a threshold: A number of buffer allocation schemes have been investigated.

(1) Complete sharing accommodates all cells as long as the storage space is not full.

(2) Complete partitioning permanently divides the entire space among all input ports.

(3) Partial sharing reserves a certain number of buffers and partitions the rest among the input ports.

When traffic is bursty and the aggregated cell arrival rate exceeds the link transmission rate, the shared buffers in the switch fill quickly, making cell loss unavoidable. This raises the question of how to choose the cells to be dropped, while maintaining fairness of cell loss among all input ports, when the switch buffer is full. Under symmetric traffic (equal traffic on all ports), it has been proved that the optimal policy in shared-memory switches with two output ports is of the ''push-out with threshold'' type. In other words, whenever the buffer is not full, an incoming cell should be accepted. Once the buffer is full, a cell of type 1 (or of type 2) is allowed into the switch and a cell of type 2 (or of type 1) is pushed out, provided the number of type-1 (or type-2) cells is below some threshold T1 (or T2 for type 2), where buffer size = T1 + T2. Under asymmetric traffic, however, the optimal policy for a system with N ports is still an open question. We are therefore motivated to impose a threshold on the buffer allocation to the logical input queues, preventing one queue from draining all the space while throughput stays very low because other outgoing links are left with smaller buffers [15,25–27].
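The two-port push-out-with-threshold decision described above can be expressed as a small predicate. This is a sketch under the stated assumptions (buffer size = T1 + T2); the function name and signature are ours, not from the paper.

```cpp
#include <cassert>

// Push-out-with-threshold admission test for a two-port shared-memory
// switch (illustrative sketch). n1/n2 are the current counts of type-1
// and type-2 cells; the shared buffer holds T1 + T2 cells in total.
// Returns true if an arriving type-1 cell is admitted (pushing out a
// type-2 cell when the buffer is full).
bool admitType1(int n1, int n2, int T1, int T2) {
    int bufferSize = T1 + T2;
    if (n1 + n2 < bufferSize) return true;  // buffer not full: always accept
    return n1 < T1;                          // full: accept type 1 only below T1
}
```

The symmetric test for a type-2 arrival would compare `n2` against `T2` in the same way.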

The ATM network considered consists of n identical stages, each having a^(n-1) switching elements (SEs). Each SE is an a × a crossbar, with input buffer modules (of dynamic length) to store conflicting cells and output buffer modules (of length two) to store unacknowledged cells. Fig. 1 shows a sample model of the proposed a × a shared-buffer switch structure. As shown in the figure, an SE is composed of a input and a output ports. At each input port, a logical input buffer module (IBM) of length B is provided to hold cells forwarded from previous stages. Each output port includes an output buffer module (OBM) of length two [1]. The following assumptions are made in our model:

1. Cells are assumed to be of fixed length and are generated randomly at the beginning of a clock period. They are admitted at the IBM of stage 0 only if there is room in the corresponding IBM.

2. The switch uses input queuing and a shared buffer, i.e. each input port of the fabric has a logical queue, but these queues all share the same fabric memory. Each input port has a logical buffer of size B. An arriving cell destined for input port I joins input queue I from head to tail sequentially. When the queue length of input queue I reaches B (i.e. queue I is full), the control logic chooses a shortest-length input queue among the other a - 1 input queues, call it input queue J. Input queue J is then used to hold the new arriving cells destined for input port I: they borrow storage space from input queue J and join it from tail to head sequentially. The control logic sets up a connection between input queue I and input queue J. When no empty space is left in either input queue I or input queue J, an arriving cell destined for input port I is discarded, and an arriving cell destined for input port J takes the place of the last stored cell that is destined for input port I.

Fig. 1. Proposed shared-buffer ATM switch with input and output buffer.

3. The time needed to allocate space to an IBM is assumed negligible with respect to the cycle timing.

4. The switching network operates synchronously at rate s, equal to the sum of the times taken to select a cell and to forward it to the next-stage IBM.

5. At the beginning of each cycle, cells waiting to be forwarded in the IBMs are checked; if two cells are destined to the same OBM, the conflict is resolved randomly.

6. Cells generated by the cell sources are uniformly distributed over all SEs' IBMs. A source of cells is connected to each port of the stage-0 IBMs. The rates at which cells are generated at the source nodes determine the traffic load.

7. The output ports of the SEs in the final stage of the switching network are connected to sink nodes. Hence, any cell that arrives at those output ports is accepted by the sink nodes with probability one.

8. A cell is transferred from the OBM of stage k to the IBM of stage (k + 1) when buffer space is available at the stage-(k + 1) IBM and a cell is available for transmission at the stage-k OBM. A cell is available at the stage-k OBM when there are two cells in the OBM, or when there is one cell and it is not held up.

9. A cell is accepted by the (k + 1)th-stage IBM in a cycle if the IBM is not full, or if it is full but a cell in that IBM is able to move forward to an OBM.

10. The SEs are a × a switches capable of connecting an input to one of the output ports labelled 0 to a - 1, depending on the corresponding value of the destination address.

11. The operation of loading a cell from the IBM to the OBM, and its transmission to the next stage, takes place in parallel within the same cycle. If the cell is accepted in the next-stage IBM, this operation does not contribute to the delay experienced by the cell in the network.
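The borrow-queue selection in assumption 2 can be sketched as a small helper: when an input queue is full, the control logic picks the shortest of the other queues to hold overflow cells. This sketch is illustrative; the function name and data layout are our own.

```cpp
#include <cassert>
#include <vector>

// Pick the shortest of the other a-1 input queues, as the control logic
// does when input queue `fullPort` reaches its limit B (illustrative sketch).
int pickBorrowQueue(const std::vector<int>& queueLens, int fullPort) {
    int best = -1;
    for (int j = 0; j < static_cast<int>(queueLens.size()); ++j) {
        if (j == fullPort) continue;
        if (best < 0 || queueLens[j] < queueLens[best]) best = j;
    }
    return best;  // index of the shortest other queue
}
```

With queue lengths {6, 2, 4, 3} and queue 0 full, the helper selects index 1, matching the 4 × 4 example discussed in Section 3.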

Intuitively, in order to adapt to different mixes of incoming traffic, the threshold should change dynamically, unlike static partitioning. Some traffic types, such as video and audio, are very sensitive to delay and cell loss; under bursty conditions, the more of such traffic the switch can accommodate, the better the QoS will be. Yet the statistical nature of a significant part of the traffic, its burstiness, and its variability combine to make distributing fixed-size buffers difficult. Therefore, in a statistical multiplexing environment with heterogeneous traffic, before arriving at an optimal threshold it is quite important to define congestion-detection criteria for dynamic threshold updating. The complexity and the performance of the switch mechanism play a critical role in preventive congestion control. Our algorithm detects congestion and updates the corresponding threshold.


3. Design objectives of the proposed buffer allocation

Consider an a × a ATM switch with an input-queued shared buffer as shown in Fig. 1. The switch uses input queuing and a shared buffer, i.e. each input port of the fabric has a logical queue of length B, but these queues all share the same fabric memory. An arriving cell destined for input port I joins input queue I from head to tail sequentially. When the queue length of input queue I reaches B (i.e. queue I is full), the control logic chooses a shortest-length input queue among the other a - 1 input queues, call it input queue J. Input queue J is then used to hold the new arriving cells destined for input port I: they borrow storage space from input queue J and join it from tail to head sequentially. The control logic sets up a connection between input queue I and input queue J. When no empty space is left in either input queue I or input queue J, an arriving cell destined for input port I is discarded, and an arriving cell destined for input port J takes the place of the last cell stored in the queue that is destined for input port I. Because input port I has borrowed buffer space from input port J, the last cell destined for input port I that occupies storage in input queue J is discarded when a new cell arrives for input port J. When the queue length of input queue I decreases to B or the queue length of input queue J

increases to B, the control logic assigns two different shortest-queue-length buffers among the other a - 2 input ports to input ports I and J. In the proposed buffer management scheme, whenever the queue length of an input buffer increases to B or decreases to B, the control logic assigns an alternative shortest-queue-length buffer for storage.

Take a simple example for illustration. In a 4 × 4 ATM switch, the size of each queue is six.

There are six, two, four and three cells stored in input queues 1, 2, 3, and 4, respectively. When an arriving cell is destined for input port 1, the control logic selects a shortest-length input queue, here input queue 2, to store the arriving cell; the cell is placed at the tail of input queue 2. If four more cells then arrive destined for input port 2, these four cells are stored in input queue 2, and the cell belonging to input port 1 but stored in input queue 2 is discarded. Moreover, because the queue lengths of input ports 1 and 2 have both reached six, the control logic assigns two shortest-length queues (input queue 3 and input queue 4) to input port 1 and input port 2, respectively. Thus, the proposed buffer management scheme improves cell-loss performance because each input buffer can borrow storage space from the other input buffers [14].

In our model, the control and the data information are transmitted in parallel. In this model,

the control signals exchanged between consecutive stages are modified so that the control logic consists of three state machines, each working independently of the others and dealing with cell transmission, acknowledge reception, and acknowledge transmission for accepted cells. At the beginning of every cycle, the control logic determines which cell is to be forwarded to the next stage; that cell is forwarded to the next-stage input buffer regardless of its state. Therefore, in a cycle, a cell forwarded from the stage-k output buffer is accepted by the stage-(k + 1) SE if its IBM is not full, or if the stage-(k + 1) SE has forwarded a cell from its full IBM to the stage-(k + 2) SE. In this way, cells are forwarded within a cycle without using a control signal, providing higher throughput. However, when a forwarded cell cannot be accepted by stage (k + 1), it has to be dropped, which would result in cell loss if these cells were not retransmitted. Therefore,


each SE output port has an OBM of length two. As soon as a cell is transmitted to the next stage, the control logic saves the cell in the OBM [12]. The algorithm is shown below.

In each cycle, do in parallel:

a. Cell transmission:
   Check the status of the OBM.
   CASE full: select the unacknowledged cell in the OBM and forward it to the next stage.
   CASE not full: check the status of the IBMs.
      If a cell for the OBM is available, then forward the cell and save it in the OBM;
      else wait till the next cycle.
b. Ack transmission for accepted cells:
   If a cell is available at the input port and the IBM is not full, or the IBM is full but another buffer is available, then accept the cell and send an ack to the previous stage.
c. Ack reception:
   If an ack is received from the next stage, then clear the cell awaiting an ack in the previous OBM.
End parallel do

As mentioned, the OBM is two cells long. As long as the OBM is full, there is always an unacknowledged cell that needs to be retransmitted. Hence, instead of loading a new cell from the IBM, the waiting cell is retransmitted, thus eliminating lost cells. The proposed buffer management scheme prevents a single input port from taking over most of the buffer, which achieves fairness.
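The per-cycle cell-transmission and acknowledge rules above can be sketched as follows. The data layout (`Port`, integer cell ids) and function names are assumptions of this sketch, not the authors' implementation.

```cpp
#include <cassert>
#include <deque>
#include <optional>

// Per-port state: an IBM of dynamic length and an OBM of capacity two.
struct Port {
    std::deque<int> ibm;  // input buffer module
    std::deque<int> obm;  // output buffer module (holds unacknowledged cells)
};

// Cell transmission (step a): a full OBM retransmits its unacknowledged
// head; otherwise a cell is taken from the IBM, forwarded, and a copy is
// kept in the OBM until it is acknowledged.
std::optional<int> transmit(Port& p) {
    if (p.obm.size() == 2)       // OBM full: resend the unacknowledged cell
        return p.obm.front();
    if (!p.ibm.empty()) {        // otherwise forward a new cell from the IBM
        int cell = p.ibm.front();
        p.ibm.pop_front();
        p.obm.push_back(cell);   // keep a copy until the ack arrives
        return cell;
    }
    return std::nullopt;         // nothing to send this cycle
}

// Ack reception (step c): the ack clears the cell awaiting acknowledgement.
void ackReceived(Port& p) {
    if (!p.obm.empty()) p.obm.pop_front();
}
```

With three cells queued, two transmissions fill the OBM; the third cycle retransmits the oldest unacknowledged cell instead of loading a new one, exactly the behaviour that eliminates lost cells above.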

4. Simulation study

A simulation program was designed and developed in C++. A data structure was implemented to model the network connectivity and the simulated cells. A cell is characterized by four attributes: (a) destination address; (b) time admitted into the network; (c) time released from the network; and (d) time at which the unacknowledged-cell flag is to be cleared. An SE has the following attributes: (a) each IBM's current length; (b) the list of cells in each IBM; (c) the list of cells in each OBM; and (d) the next-stage input port number linked to each OBM.

We have made the following assumptions in the implementation of the simulator:

(1) The N processors generate cells at each stage cycle with probability q (the input load).

(2) The destination of each cell generated at each stage cycle is set randomly by a random number generator to simulate uniform traffic.

(3) If there is a conflict among cells within an SE, one of them is selected randomly.

(4) The throughput and the delay are measured at each output port of the network and averaged over the network size and the simulation time span to obtain the normalized throughput and the normalized delay of the network.

(5) Cell service at each buffer module follows a first-in-first-out (FIFO) policy.
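The cell and SE attributes listed above map naturally onto two records. This is a hedged sketch of the simulator's data model; the field names are ours, chosen to mirror attributes (a)-(d), and are not taken from the original code.

```cpp
#include <cassert>
#include <deque>
#include <vector>

// A simulated cell, mirroring attributes (a)-(d) above.
struct Cell {
    int destAddr;       // (a) destination address
    long admitTime;     // (b) time admitted into the network
    long releaseTime;   // (c) time released from the network
    long ackClearTime;  // (d) time to clear the unacknowledged-cell flag
};

// A switching element: per-port IBM/OBM cell lists and OBM wiring.
struct SwitchElement {
    std::vector<std::deque<Cell>> ibms;  // (a)+(b) length and cells of each IBM
    std::vector<std::deque<Cell>> obms;  // (c) cells in each OBM (at most two)
    std::vector<int> nextStagePort;      // (d) next-stage input port per OBM
};
```

The deque gives FIFO service per assumption (5), and each IBM's current length is simply the size of its deque.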


The simulation starts by initializing the IBMs and OBMs of each SE to null. The ATM switching topology is then initialized to implement the data structure of each SE. Other initializations include the simulation clock (equal to the clock cycle at which the switching network operates), the current pool size used (set to null), and the current number of simulated cells (set to zero). We changed the threshold-updating frequency according to the weight of each traffic factor; these weights are used to determine the new shares of the buffer space. After the initialization step, the simulation loop begins. Each simulation loop includes the following computations:

1. For each SE in the last stage, do the following: if a cell exists in either of the two OBMs, remove it from the OBM. This simulates the acceptance of cells by the sink node. Set the cell attribute (time released) to the current simulation time in order to calculate the network delay.

2. Check the current number of simulated cells. If this number has exceeded the number specified by the user, and no cells remain in any IBM or OBM in the network, then the simulation stops; calculate and display the performance measures (throughput and delay).

3. Check each SE's OBM. If a cell in either of the two OBMs is flagged to be cleared as a result of acknowledge reception from the next stage, clear this cell from that OBM and update the OBM length accordingly.

4. Check each SE's OBM in the network from the last stage to the first:
a. If the OBM of that SE is full, an unacknowledged cell exists in this OBM. Select this cell and forward it to the next-stage IBM if a place exists there. Set the flag of this cell to be cleared in case a place exists in the next-stage IBM (acknowledge reception).

b. If the OBM is not full and cells exist in that SE's IBMs: check whether two cells from two IBMs in that SE are destined to the same OBM. If they are, randomly select one and forward it to the destined OBM. If the next-stage IBM is full, clear the forwarded cell from its IBM and return its memory space to the pool. If the next-stage IBM is not full, admit this cell to the next-stage IBM (if there is room in the pool), clear the cell from the SE's IBM, and return its memory space to the pool. If no cells exist in the IBM, go to step 5.

5. After all SEs in the switching network are processed in step 4, randomly generate a cell with a random output destination address and admit it to the corresponding IBM if a place exists. Increase the number of simulated cells by one, and save the time the cell is admitted to the network for later analysis.

6. Thresholds of the logical queues for all input ports are equally assigned during the initialization phase.

7. Thresholds are updated every M time slots.

8. Any excess buffer space from any logical queue goes to the shared buffer pool.

9. A cell accommodation rule (CAR) is applied whenever a cell arrives at the switch. A CAR is needed to make use of the dynamic thresholds: when the buffer pool is not full, any incoming cell should be accepted; otherwise a selective cell admission/drop mechanism is enforced.

10. Increase the simulation clock.

11. Go to step 2.
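Steps 7 and 9 above can be sketched together. The update policy shown here (re-dividing the pool in proportion to current load every M slots) is an assumption of this sketch, as are the type and function names; the paper does not give its exact update formula.

```cpp
#include <cassert>
#include <numeric>
#include <vector>

// Shared pool with per-port logical queues and dynamic thresholds.
struct BufferPool {
    int capacity;
    std::vector<int> queueLen;   // logical queue length per input port
    std::vector<int> threshold;  // per-port dynamic threshold
};

// Cell accommodation rule (step 9): accept freely while the pool is not
// full; otherwise admit only if the port is below its threshold.
bool accommodate(const BufferPool& bp, int port) {
    int used = std::accumulate(bp.queueLen.begin(), bp.queueLen.end(), 0);
    if (used < bp.capacity) return true;
    return bp.queueLen[port] < bp.threshold[port];
}

// Threshold update (step 7), run every M slots: share the pool in
// proportion to each port's current load (equal shares when idle).
void updateThresholds(BufferPool& bp) {
    int used = std::accumulate(bp.queueLen.begin(), bp.queueLen.end(), 0);
    for (std::size_t i = 0; i < bp.threshold.size(); ++i)
        bp.threshold[i] = used ? bp.capacity * bp.queueLen[i] / used
                               : bp.capacity / static_cast<int>(bp.threshold.size());
}
```

Heavily loaded ports thus earn larger thresholds at the next update, while the CAR keeps admission unrestricted whenever free space remains.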


5. Results and discussion

The principal goal of this work is to find an ATM switch that improves network performance. A new model was introduced in an ATM environment: a shared buffer pool provided with a threshold-based virtual partition. The performance of the static and the dynamic buffered networks has also been compared. A simulation study of ATM networks has been performed, and the following terms are used for comparison with the existing models:

• Normalized throughput: the average number of cells passed by the switching network per output link per cycle.

• Normalized delay: the average number of cycles required for a cell to pass the network, per output link per cycle.
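As a small worked sketch, the two metrics can be computed as simple ratios; the function and parameter names are ours.

```cpp
#include <cassert>

// Normalized throughput: cells delivered per output link per cycle.
double normalizedThroughput(long cellsDelivered, int outputLinks, long cycles) {
    return static_cast<double>(cellsDelivered) / (static_cast<double>(outputLinks) * static_cast<double>(cycles));
}

// Normalized delay: average number of cycles a delivered cell spent
// in the network (total cell-cycles over delivered cells).
double normalizedDelay(long totalCellCycles, long cellsDelivered) {
    return static_cast<double>(totalCellCycles) / static_cast<double>(cellsDelivered);
}
```

For example, 800 cells delivered over 500 cycles on a 4-output network gives a normalized throughput of 0.4.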

Two major factors are important in determining the performance of ATM networks: the buffer length and the traffic load. As the buffer length is dynamically allocated, it contributes to an increase in throughput for various loads as compared to the fixed maximum IBM length. The simulation results are shown in Figs. 2–7. Each graph shows the variation of a given parameter as a function of the traffic load for various IBM lengths and for the dynamic scheme. The input parameters of the simulation program are the traffic load, the buffer length and the number of cells to be simulated. The performance measures are the normalized throughput and the normalized delay.

Fig. 2 shows the network's normalized throughput vs. input traffic load. As the network traffic load increases, the normalized throughput increases because of the increased number of cells passed by the network. However, this increase saturates at high loads because of the limited input buffer length and increased conflicts. As can be seen, the


Fig. 2. The normalized throughput vs. traffic load.


Fig. 3. The normalized delay vs. traffic load.


Fig. 4. The normalized throughput vs. buffer size for various IB lengths.


proposed scheme provides a higher throughput than the threshold scheme and the no-control scheme, because the flexible IBM length accommodates all cells admitted to the network. Also, as the maximum IBM length is increased beyond a certain value (i.e. 10), all three switch models provide approximately the same normalized throughput.

In Fig. 3, the normalized delay increases with traffic load due to the limited input buffer length and increased conflicts between cells destined to the same output port. In the proposed scheme, this delay is higher because more cells can be accommodated in the IBMs as the traffic increases, leading to longer waiting delays in the IBMs.

Fig. 4 shows the normalized throughput vs. buffer size for various IBM lengths, and Fig. 5 shows the normalized delay vs. buffer size for various IBM lengths. In the proposed scheme, normalized


Fig. 5. The normalized delay vs. buffer size for various IB lengths.


Fig. 6. The normalized throughput vs. network size.


throughput is better than in the threshold scheme and the no-control scheme. However, because of the novel architecture of the proposed scheme, the normalized delay is worse than in the other two schemes. The normalized throughput increases as the buffer size increases, and so does the normalized delay. The normalized throughput saturates quickly beyond a buffer size of six; the increase in normalized throughput is most significant up to a buffer size of 3–4. The normalized delay increases almost linearly with the buffer size. Thus, adding buffers to a cell-switching network can increase throughput.

Fig. 6 compares the normalized throughput vs. network size for the proposed scheme, the threshold scheme and the no-control scheme. When the network size N increases, the normalized

Fig. 7. The normalized delay vs. network size.


throughput decreases in all models. However, the performance of the proposed scheme is much better than that of the other two.

Fig. 7 compares the normalized delay vs. network size. In this case, the normalized delay of

each input port with the no control scheme is better than with the threshold scheme and the proposed scheme, because the cell arrival rate at each incoming port increases.

In summary, these results show that a performance improvement is achieved in the proposed model

as compared to the other two models. The network performance in our simulation is measured according to two metrics: normalized throughput and normalized delay. These results are not surprising, because the no control scheme constitutes a bottleneck that yields a reduced network throughput and, therefore, a degradation in network performance. The proposed scheme will always accept new cells into the IBMs as the network traffic increases, because the buffer space pertaining to the IBMs of the switching elements is automatically increased on demand. This is limited only by the maximum allowable buffer size allocated to the switch ports. We also note that cell loss in the ATM switch is prevented when an extra output buffer is added.
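The dynamic allocation behaviour described above can be illustrated with a minimal sketch. This is not the authors' implementation; the class name `SharedBufferPool` and all numeric values are hypothetical, and the sketch only shows the core admission rule: a cell is accepted into a port's IBM as long as the shared pool has free space and the port has not reached its maximum allowable share.

```python
# Hypothetical sketch of demand-driven input-buffer allocation from a
# shared pool with a per-port cap (names and values are illustrative).

class SharedBufferPool:
    def __init__(self, total_cells, max_per_port, num_ports):
        self.free = total_cells            # cells remaining in the shared pool
        self.max_per_port = max_per_port   # cap on any single port's share
        self.used = [0] * num_ports        # current occupancy per input port

    def admit(self, port):
        """Accept a cell into `port`'s IBM if pool space and the cap allow."""
        if self.free > 0 and self.used[port] < self.max_per_port:
            self.used[port] += 1
            self.free -= 1
            return True
        return False                       # otherwise the cell is lost

    def release(self, port):
        """A cell leaves `port`'s IBM (forwarded towards the output stage)."""
        if self.used[port] > 0:
            self.used[port] -= 1
            self.free += 1

pool = SharedBufferPool(total_cells=16, max_per_port=10, num_ports=4)
assert pool.admit(0)                       # first cell on port 0 is accepted
```

Under this rule a lightly loaded port never starves a heavily loaded one beyond the cap, which mirrors the fairness property claimed for the threshold-based virtual partition.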

6. Conclusion

In this paper, a novel shared-buffer switch with shortest-queue-length buffer management for ATM networks was proposed. In the proposed buffer structure, if the queue length of an input queue reaches B, the control logic chooses the shortest input queue among the other a − 1 input queues to store the newly arriving cell. This structure decreases the number of out-of-order and lost packets under high traffic loads. In addition, an increased network throughput and improved network performance are achieved. Thus, by using the proposed model, a superior and efficient ATM switch is obtained. However, the allocation time needed to allocate a packet space to an IBM is the only drawback, which increases the network delay. The allocation


time increases when multiple IB requests are initiated. However, it may be reduced by using a clustered buffer for each stage, or for the SEs at the same level. This allows parallel buffer allocation, which decreases the allocation time. Thus, the proposed buffer management scheme gives fairness to every input port. Also, when a buffer is added at the output port of the switch, the proposed scheme can prevent packets from being lost; packet loss in the switch is thus prevented by using output buffers. Moreover, compared to the pushout scheme, the proposed scheme is easy to implement in input-queued shared-buffer ATM switches.
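The shortest-queue-length rule stated above can be sketched as follows. This is an illustrative reading of the described control logic, not the authors' code; the function name `place_cell` is hypothetical, and queue lengths are represented simply as integers.

```python
# Illustrative sketch of shortest-queue-length buffer management:
# a cell arriving at an input queue that has reached length B is
# redirected to the shortest of the other a-1 input queues.

def place_cell(queues, port, B):
    """Return the index of the queue that should store the arriving cell,
    or None if every queue is full. `queues` maps port -> current length."""
    if queues[port] < B:
        return port                        # home queue still has room
    # Home queue is full: pick the shortest among the other a-1 queues.
    candidates = [i for i in range(len(queues)) if i != port]
    best = min(candidates, key=lambda i: queues[i])
    return best if queues[best] < B else None

lengths = [3, 3, 1, 2]                     # with B = 3, ports 0 and 1 are full
assert place_cell(lengths, 0, B=3) == 2    # redirected to the shortest queue
```

Redirecting rather than dropping is what reduces cell loss under bursty loads, at the cost of the allocation-time overhead and potential out-of-order delivery that the conclusion discusses.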


Abdulhamit Subasi graduated from Hacettepe University in 1990. He took his M.Sc. degree from

Middle East Technical University in 1993, and his Ph.D. degree from Sakarya University in 2001,

all in Electronics Engineering. He is an Assistant Professor at Kahramanmaraş Sütçü İmam University. His areas of interest are the application of neural networks, biomedical signal processing,

computer networks and security.

Etem Koklukaya received his B.Sc., M.Sc. and Ph.D. degrees from Istanbul Technical University

(1978), Yildiz Technical University (1982) and Istanbul Technical University (1990), respectively.

He is with the Department of Electrical and Electronics Engineering at Sakarya University,

Sakarya, Turkey. His research areas are intelligent and expert system applications in electronics

and biomedical engineering.