Beruflich Dokumente
Kultur Dokumente
Ciaran Toal, Dwayne Burns, Kieran McLaughlin, Sakir Sezer, Stephen O’Kane
Institute of Electronics, Communications and Information Technology (ECIT)
Queen’s University Belfast
Ciaran.Toal@ee.qub.ac.uk
200Mhz Domain
2.3 RLDRAM II
100Mhz Domain
The high speed, high capacity shared buffer design that Ingress Fifo
Dual clock
Egress Fifo
Dual clock
has been developed utilises a special type of DRAM
optimised for low latency access, called RLDRAM II.
This memory, developed by Micron, is targeted for
Packet Scheduler
networking applications, L3 cache, high-end commercial
graphics, etc. RLDRAM II utilizes an eight-bank
architecture that is optimized for high-speed operation and Figure 1: High-level diagram of packet buffer
a double data rate I/O for increased bandwidth. The architecture
eight-bank architecture enables RLDRAM II devices to
achieve peak bandwidth by decreasing the probability of The five RLDRAM II chips are shown at the top of the
random access conflicts. diagram. It can be seen that the system clock operates at
RLDRAM II architecture offers separate I/O (SIO) and half that of the memory interface logic, which itself is
common I/O (CIO) options. The SIO devices have composed of an interface to the RLDRAM II chips,
separate READ and WRITE ports to eliminate bus ingress and egress controllers and separate state machines
turnaround cycles and contention. Optimized for near- to operate the control memory for the linked list and for
term READ and WRITE balance, RLDRAM II SIO the packet memory itself.
devices are able to achieve full bus utilization.
Packet Memory 4 1
State Machine
6 2
10 3
Ingress Egress Egress
Nop Nop Nop 5 4
Read State Read State Write State
E 5
Control Memory
State Machine
7 6
8 7
9 8
12 clock cycle TDM
E 9
Figure 2: Packet and Control Memory State Machine
11 10
Timing Diagram
12 11
ingress and one read for the egress and also one write per
serviced packet to maintain the address linked-list. The no Figure 3: Address linked-list
operation (NOPs) are in place to satisfy the bank tRC
(Row Cycle times) timing requirements. The RLDRAM II Once a packet has been read out of memory, the
control memory only ever accesses bank 0, so each time it corresponding linked-list of cell addresses must be joined
on to the end of the empty address linked-list. The last
the last link in the empty address linked list. The last cell 14.0000
address of this packet is then stored locally in the memory 12.0000
Figure 4: Linked-list write back procedure Table 1: Post Layout Synthesis Results
20.0000
20.0000
B andw idth (Gbps)
15.0000
10.0000
10.0000
5.0000
5.0000
0.0000
0 100 200 300 400 500 0.0000
Payload size (bytes) 0 100 200 300 400 500
Actual rate Rate required for 20Gbps Payload size (bytes)
Figure 6: Throughput for 400MHz with 4-5-4-5 TDM Actual rate Rate required for 20Gbps
6. Conclusions
20.0000
7. References
[1] International Data Group (IDG), “Worldwide
Bandwidth End-User Forecast and Analysis, 2003-2007:
More is Still Not Enough,” 2003.
[2] http://www.micron.com/products/dram/rldram/
[3] H. Kuwahara, “A shared buffer memory switch for
an ATM exchange,” IEEE GLOBECOM, 1989.
[4] J.Y. Hui, E. Arthurs “A Broadband Packet Switch
for Integrated Transport" IEEE Journal Selected Areas in
Communications, Vol.5, No. 8, pp 1264-1273, Oct. 1988.
[5] C. Minkenberg, T. Engbersen, “A combined input
and output queued packet switched system based on
PRIZMA switch on a chip technology,” IEEE
Communications Magazine, Dec. 2000.
[6] M. Hluchyj and M. Karol, “Queuing in high-
performance packet switching,” IEEE Journal, SAC-5,
1987.
[7] M. Lau, S. Shieh, “Gigabit Ethernet switches using a
shared buffer architecture,” IEEE Communications
Magazine, 2003.
[8] K. J. Schultz and P. G. Gulak, “CAM-Based single-
chip shared buffer ATM switch”, IEEE ICC’94, 1994.
[9] A. Demers, S. Keshav, S. Shenker, “Analysis and
Simulation of a Fair Queuing Algorithm,” in Proc. ACM
SIGCOMM’89, pp. 3–12.
[10] A. Parekh and R. Gallager, “A Generalized
Processor Sharing Approach to Flow Control in Integrated
Services Networks: The Single Node Case,” ACM/IEEE
Trans. Networking, vol. 1, June 1993, pp. 344–357.
[11] J.W. Shim, G.J. Jeong, M.K. Lee, “FPGA
implementation of a scalable shared buffer ATM switch,”
ATM, June 1998.