4
A Nonvolatile Look-Up Table Using ReRAM for Reconfigurable Logic Wen-Pin Lin 1 , Shyh-Shyuan Sheu 2 , Chia-Chen Kuo 1 , Pei-Ling Tseng 1 , Meng-Fan Chang 3 , Keng-Li Su 1 , Chih-Sheng Lin 1,3 , Kan-Hsueh Tsai 1 , Sih-Han Lee 1 , Szu-Chieh Liu 1 ,Yu-Sheng Chen 1 , Heng-Yuan Lee 1 , Ching-Chih Hsu 1 , Frederick T. Chen 1 , Tzu-Kun Ku 1 , Ming-Jinn Tsai 1 , Ming-Jer Kao 1 1 Electronics and Optoelectronics Research Laboratories (EOL), ITRI, Hsinchu, Taiwan 2 National Taipei University, Taipei, Taiwan 3 National Tsing Hua University, Hsinchu, Taiwan 3 Ming Shin University of Science & Technology, Hsinchu, Taiwan E-mail : [email protected] Abstract—This study demonstrated a nonvolatile look-up table (nvLUT) that involves using resistive random access memory (ReRAM) cells with normally-off and instant-on functions for suppressing standby current. Compared with the conventional static random access memory (SRAM)- magnetoresistive random-access memory (MRAM)-hybrid LUTs the proposed ReRAM-based two-input nvLUT circuit decreases the number of transistors and the area of nvLUT by 79% and 90.4%, respectively. The areas of the two- and three-input ReRAM nvLUTs are 11.5% and 74.2% smaller than the other MRAM-based two-input and PCM-based three-input LUTs, respectively. Because of the low current switching and high R- ratio characteristics of ReRAM, the proposed ReRAM-based nvLUT achieves 24% less power consumption than that of SRAM-MRAM-hybrid LUTs. The functionality of the fabricated adder of the three-input ReRAM nvLUT was confirmed using an HfOx-based ReRAM and a 0.18-ȝm complementary metal-oxide semiconductor with a delay time of 900 ps. Keywords—LUT; Look-Up Table; FPGA; nonvolatile logic; resistive memory; ReRAM; RRAM I. INTRODUCTION Miniaturizing electronic products and integrated circuit (IC) chips is crucial in the developing of advanced semiconductor technology. Current configurable ICs have various internal connection structures and designs of an internal configurable logic block (CLB), which is equipped with an embedded memory that serves as a look-up table (LUT) [1] [8]. A conventional reconfigurable logic circuit uses the SRAM to store the configuration information. However, the SRAM may cost approximately 40% of the total area [8], resulting in higher cost and longer routing delay. To maintain the configuration data during the power-off mode, a nonvolatile memory is also required. Traditionally, the data is serially transferred from the SRAM to the nonvolatile memory unit, which consumes both time and energy. Because of recent developments in the emerging memory technology, these obstacles can be resolved by replacing the SRAM with a more compact nonvolatile memory unit. This paper proposes a novel ReRAM-based LUT for reconfigurable logic. An nonvolatile look-up table (nvLUT) can be implemented with various kind of emerging memory devices, however, ReRAM provides better CMOS process compatibility and a higher resistance ratio compared with other emerging memories such as phase-change random access memories (PCRAM), magnetoresistive random-access memory (MRAM), and ferroelectric random-access memory (FeRAM). In addition, ReRAM is a high-performance, nonvolatile memory with a write speed as fast as 5 ns [5], [9], [10]. The proposed nonvolatile LUT reduces the area and active-power consumption compare with SRAM-based LUT and published nvLUTs . II. RERAM-BASED, NONVOLATILE, LOOK-UP TABLE A. Technological Background Fig. 1 (a) shows the ReRAM devices comprising a TiN/TiOx/HfOx/TiN structure. The typical bipolar switching IV characteristics are displayed in Fig. 1 (b). The low- resistance state (LRS) in the SET operation is achieved by applying a positive voltage (1.5V) across the top electrode (TE) and the bottom electrode (BE). By contrast, the RESET operation requires a negative voltage (-1.1V) to achieve the high-resistance state (HRS). The write time of the HfOx-based ReRAM can be as fast as 5 ns [10]. (a) (b) (c) Fig. 1 1T2R memory unit (a) Cell Structure and Layout (b) IV characteristic of HfO x RRAM (c) Schematic IEEE Asian Solid-State Circuits Conference November 10 - 12, 2014/Kaohsiung, Taiwan 978-1-4799-4089-9/14/$31.00 © 2014 IEEE

IEEE Paper for ReRAM

Embed Size (px)

DESCRIPTION

ReRam is Volatile memory

Citation preview

Page 1: IEEE Paper for ReRAM

A Nonvolatile Look-Up Table Using ReRAM for Reconfigurable Logic

Wen-Pin Lin1, Shyh-Shyuan Sheu2, Chia-Chen Kuo1, Pei-Ling Tseng1, Meng-Fan Chang3, Keng-Li Su1, Chih-Sheng Lin1,3 , Kan-Hsueh Tsai1, Sih-Han Lee1, Szu-Chieh Liu1,Yu-Sheng Chen1, Heng-Yuan Lee1, Ching-Chih Hsu1,

Frederick T. Chen1, Tzu-Kun Ku1, Ming-Jinn Tsai1, Ming-Jer Kao1

1Electronics and Optoelectronics Research Laboratories (EOL), ITRI, Hsinchu, Taiwan 2National Taipei University, Taipei, Taiwan

3National Tsing Hua University, Hsinchu, Taiwan 3Ming Shin University of Science & Technology, Hsinchu, Taiwan

E-mail : [email protected]

Abstract—This study demonstrated a nonvolatile look-up

table (nvLUT) that involves using resistive random access memory (ReRAM) cells with normally-off and instant-on functions for suppressing standby current. Compared with the conventional static random access memory (SRAM)- magnetoresistive random-access memory (MRAM)-hybrid LUTs the proposed ReRAM-based two-input nvLUT circuit decreases the number of transistors and the area of nvLUT by 79% and 90.4%, respectively. The areas of the two- and three-input ReRAM nvLUTs are 11.5% and 74.2% smaller than the other MRAM-based two-input and PCM-based three-input LUTs, respectively. Because of the low current switching and high R-ratio characteristics of ReRAM, the proposed ReRAM-based nvLUT achieves 24% less power consumption than that of SRAM-MRAM-hybrid LUTs. The functionality of the fabricated adder of the three-input ReRAM nvLUT was confirmed using an HfOx-based ReRAM and a 0.18- m complementary metal-oxide semiconductor with a delay time of 900 ps.

Keywords—LUT; Look-Up Table; FPGA; nonvolatile logic; resistive memory; ReRAM; RRAM

I. INTRODUCTION Miniaturizing electronic products and integrated circuit (IC)

chips is crucial in the developing of advanced semiconductor technology. Current configurable ICs have various internal connection structures and designs of an internal configurable logic block (CLB), which is equipped with an embedded memory that serves as a look-up table (LUT) [1] [8]. A conventional reconfigurable logic circuit uses the SRAM to store the configuration information. However, the SRAM may cost approximately 40% of the total area [8], resulting in higher cost and longer routing delay. To maintain the configuration data during the power-off mode, a nonvolatile memory is also required. Traditionally, the data is serially transferred from the SRAM to the nonvolatile memory unit, which consumes both time and energy. Because of recent developments in the emerging memory technology, these obstacles can be resolved by replacing the SRAM with a more compact nonvolatile memory unit.

This paper proposes a novel ReRAM-based LUT for reconfigurable logic. An nonvolatile look-up table (nvLUT) can be implemented with various kind of emerging memory devices, however, ReRAM provides better CMOS process

compatibility and a higher resistance ratio compared with other emerging memories such as phase-change random access memories (PCRAM), magnetoresistive random-access memory (MRAM), and ferroelectric random-access memory (FeRAM). In addition, ReRAM is a high-performance, nonvolatile memory with a write speed as fast as 5 ns [5], [9], [10]. The proposed nonvolatile LUT reduces the area and active-power consumption compare with SRAM-based LUT and published nvLUTs .

II. RERAM-BASED, NONVOLATILE, LOOK-UP TABLE

A. Technological Background Fig. 1 (a) shows the ReRAM devices comprising a

TiN/TiOx/HfOx/TiN structure. The typical bipolar switching IV characteristics are displayed in Fig. 1 (b). The low-resistance state (LRS) in the SET operation is achieved by applying a positive voltage (1.5V) across the top electrode (TE) and the bottom electrode (BE). By contrast, the RESET operation requires a negative voltage (-1.1V) to achieve the high-resistance state (HRS). The write time of the HfOx-based ReRAM can be as fast as 5 ns [10].

(a)

(b) (c) Fig. 1 1T2R memory unit (a) Cell Structure and Layout (b) IV

characteristic of HfOx RRAM (c) Schematic

IEEE Asian Solid-State Circuits ConferenceNovember 10 - 12, 2014/Kaohsiung, Taiwan

978-1-4799-4089-9/14/$31.00 © 2014 IEEE

Page 2: IEEE Paper for ReRAM

B. Architechture of the Proposed 1T2R Memory Unit Fig. 1 (c) displays the architecture of the proposed 1T2R

memory unit, which consists of an NMOS switch transistor (MSEL) and two HfOx-based ReRAMs (RA and RB). The BEs of RA and RB are connected to the output of the memory unit terminal C. The select transistor MSEL provides stable write conditions by directly controlling the terminal C. Terminals S, A, and B provide the path for applying read and write bias conditions. The detailed operation would be descripted in the next section.

The ReRAM devices RA and RB work jointly as a complementary resistive switching (CRS) device [11]. The “0” state is defined with RA = HRS and RB = LRS. The “1” state is defined with RA = LRS and RB = HRS, as listed in Table 1. The CRS architecture guarantees that the combined resistance of RA and RB is higher than HRS, which thus minimizes the DC leakage in the read operation; this implies that the operating current can be reduced by increasing the HRS resistance. Moreover, compared with the 1T1R structure, the CRS structure further increases the speed of the read operation because the output is always driven by the LRS ReRAM device.

Table 1 Truth table of proposed 1T2R memory unitRA RB State

HRS HRS Forbidden HRS LRS “1” LRS HRS “0” LRS LRS Forbidden

C. Operation of the Proposed Memory Unit As shown in Table 2, when the logic value “1” is written

into the memory unit, the RA is reset to a high-resistance state (HRS, H) by applying reset voltage VR, applying VDD, grounding, and floating on nodes S, W, A, and B, respectively. The RB is then set to a low-resistance state (LRS, L) by grounding, applying clamping voltage VG_SET, floating, and applying SET voltage Vs on Nodes S, W, A, and B, respectively. When the logic value “0” is written into the memory unit, RA is set to LRS, and RB is reset to HRS. Fig. 2 displays the read operation conditions of the proposed memory unit. During the read operation, Nodes A and B are adopted to Vread and 0 V, respectively. Moreover, to prevent the ReRAM device from read disturbance, Vread should not be greater than the minimum disturb voltage of 0.5 V [12]. For standby operations, the nvLUT can be shut down to prevent extra power consumption. Because the configured data are stored in the ReRAM device, the memory unit can operate correctly and instantly after power restoration.

Fig. 2 Read principle of proposed memory unit

Table 2 Operation table of proposed 1T2R memory unit ReRAM A B W S

RESET

RA 0V F VDD VR RB F 0V VDD VR

SET

RA VS F VG SET 0V RB F Vs VG SET 0V

READ - 0V VREAD 0V 0V

D. Proposed Nonvolatile Look-Up Table The proposed nvLUT consists of plurality of the proposed

memory units, selecting circuit, and read/write control circuit. Fig. 3 depicts a two-input LUT with a two-input NAND configuration. Outputs of the memory units 0 3 correspond to XY = 00 11, respectively. The W terminal of each memory unit can be shared. Moreover, the RESET and SET operations are required to store and configure data into the memory unit. The SET operation can be performed by applying 0V to S, Vs to Ax or Bx (based on which ReRAM device requires the SET operation) and by applying a pulse with the amplitude of VG_SET on W. The RESET operation can be performed similarly. Using this approach, the operator may write any desired truth table into the memory units. Table 3 shows a common function with its corresponding memory unit resistance state configuration. For the read operation, the selecting circuit selects a particular memory unit based on the input logic value. The sensing amplifiers then determine the output logic values according to the reference voltage (Vref). The output is “1” if the C terminal of the memory unit is larger than Vref. Fig. 4 displays the simulation results of a two-input nvLUT for the NAND gate.

Fig. 3 2 input LUT based on memory unit

Fig. 5 shows another application of nvLUT, the full adder, which comprises two three-input LUTs, one for the sum S and another for the carry out (Cout). As observed in Fig. 6, because the targeted HRS of the device is > 1M ohm, power consumption of the two-input LUT can be reduced by 24%,

Page 3: IEEE Paper for ReRAM

36%, and 2.8% compared with the conventional SRAM-MRAM-hybrid LUTs [3], MRAM-based nvSRAMs [2] and nvLUTs [3], respectively. Fig. 7 depicts the analysis of the full-adder delay time against VDD. The proposed LUT can achieve a delay time of 900 ps at a power supply of 1.8 V.

Table 3 Operating ReRAM states of 2-input LUT

Function ReRAM State

RA1 RB1 RA2 RB2 RA3 RB3 RA4 RB4 NOR H L L H L H L H OR L H H L H L H L

NAND H L H L H L L H AND L H L H L H H L XOR L H H L H L L H

XNOR H L L H L H H L

Fig. 4 R/W operation of the 2-input LUT for NAND gate

Fig. 5 Two 3-input LUT based full adder

Fig. 6 Power consumption vs. R-ratio

Fig. 7 Delay time vs. VDD

III. MEASUREMENT, RESULTS, AND CONCLUSION Fig. 8 displays a photomicrograph of the fabricated chip

with the proposed LUTs using the HfOx-based ReRAM with a

0.18- m CMOS. In addition, Fig. 9 shows the measured waveforms of the NAND and NOR gates in the two-input LUT. The output indicates the correct result with an input sweep from 00 to 11. Fig. 10 shows the measured output waveforms of the full adder. These waveforms demonstrate a normal adder operation, followed by a power off and then a power on. The output bits, Cout and Sum, exhibit the operation of the full adder, as expected. Moreover, this indicates that the proposed nvLUT has the ability to perform normally-off and instant-on functions. Fig. 11 displays the measured propagation delay of the full adder. The pull-up and pull-down delay time values are 900 and 730 ps, respectively. Hence, the delay time can be as fast as 900 ps. Table 4 shows the performance comparison of the LUTs.

Compared with the conventional SRAM-MRAM-hybrid LUTs and the MRAM-based nvLUTs, the area of the proposed two-input LUT can reduce the area by 90.4% and 11.5%, respectively [3]. Moreover, compared with the PCM-based nvLUTs, the area of the three-input LUTs can be reduced by 74.2% [4]. Power consumption of the two-input LUT can be reduced by 24%, 36%, and 2.8% compared with the conventional SRAM-MRAM-hybrid LUTs [3], MRAM-based nvSRAMs [2], [3], and nvLUTs [3], [4], respectively. Therefore, the proposed nvLUT consisting of ReRAM as the memory unit can be successfully used to achieve lower cost, high operation speed, zero standby power. As the result, the proposed nvLUT can be expexted to realize a normally-off and instant-on reconfigurable logic

Fig. 8 Micro photograph of proposed LUT

Fig. 9 Measured waveforms of the 2-input LUT

Fig. 10 Measured waveforms of the full adder

Page 4: IEEE Paper for ReRAM

Table 4 Comparison Table

Fig. 11 Measured propagation delay of the full adder

IV. REFERENCE [1] Shoun Matsunaga, J. Hayakawa, S. Ikeda, K. Miura, H. Hasegawa, T.

Endoh, H. Ohno, and T. Hanyu, “Fabrication of a Nonvolatile Full Adder Based on Logic-in-Memory Architecture Using Magnetic Tunnel Junctions,” Applied Physics Express, 2008, pp. 091301_1-091301_3.

[2] Weisheng Zhao, E. Belhaire, C. Chappert, F. Jacquet, and P. Mazoyer, “New non-volatile logic based on spin-MTJ,” Physica Status SOLIDI a Application and Materials Science, 2009, pp. 1373-1377.

[3] Daisuke Suzuki, M. Natsui, S. Ikeda, H. Hasegawa, K. Miura, J. Hayakawa, T. Endoh, H. Ohno, T. Hanyu, “Fabrication of a nonvolatile lookup-table circuit chip using magneto/semiconductor-hybrid structure for an immediate-power-up field programmable gate array, ” VLSI Circuits, 2009 Symposium on, pp.80-81, 16-18 June 2009

[4] C.-Y. Wen, J. Li, S. Kim, M. Breitwisch, C. Lam, J. Paramesh, L. T. Pileggi, “A non-volatile look-up table design using PCM (phase-change memory) cells,” VLSI Circuits (VLSIC), 2011 Symposium on , pp.302-303, 15-17 June 2011

[5] Pi-Feng Chiu, M.-F. Chang, S.-S. Sheu, K.-F. Lin, P.-C. Chiang, C.-W. Wu, W.-P. Lin, C.-H. Lin, C.-C. Hsu, Frederick T. Chen, K.-L. Su, M.-J.

Kao, and M.-J. Tsai, “A low store energy, low VDDmin, nonvolatile 8T2R SRAM with 3D stacked RRAM devices for low power mobile applications,’’ VLSI Circuits (VLSIC), 2010 IEEE Symposium on , pp.229-230, 16-18 June 2010

[6] Young Yang Liauw, Z. Zhang, W. Kim, A. E. Gamal, S. S. Wong, “Nonvolatile 3D-FPGA with monolithically stacked RRAM-based configuration memory,” Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2012 IEEE International., pp.406-408, 19-23 Feb. 2012

[7] Kejie Huang, Ha, Y.; Zhao, R.; Kumar, A; Lian, Y., “A Low Active Leakage and High Reliability Phase Change Memory (PCM) Based Non-Volatile FPGA Storage Element, ” Circuits and Systems I: Regular Papers, IEEE Transactions on , vol.61, no.9, pp.2605,2613, Sept. 2014

[8] Pierre-Emmanuel Gaillardon, D. Sacchetto, G. B. Beneventi, M. H. B. Jamaa, L. Perniola, F. Clermidy, I. O’Connor, and G. De Micheli, “Design and Architectural Assessment of 3-D Resistive Memory Technologies in FPGAs,” Nanotechnology, IEEE Transactions on , vol.12, no.1, pp.40-50, Jan. 2013

[9] S.-S. Sheu, M.-F. Chang, K.-F. Lin, C.-W. Wu, Y.-S. Chen, P.-F. Chiu, C.-C. Kuo, Y.-S. Yang, P.-C. Chiang, W.-P. Lin, C.-H. Lin, H.-Y. Lee, P.-Y. Gu, S.-M. Wang, F. T. Chen, K.-L. Su, C.-H. L., K.-H. Cheng, H.-T. Wu, T.-K. Ku, M.-J. Kao, M.-J. Tsai, “A 4Mb embedded SLC resistive-RAM macro with 7.2ns read-write random-access time and 160ns MLC-access capability, ” Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2011 IEEE International, pp.200-202, 20-24 Feb. 2011

[10] Shyh-Shyuan Sheu, P.-C. Chiang, W.-P. Lin, H.-Y. Lee, P.-S. Chen, Y.-S. Chen, T.-Y. Wu, F.T. Chen, K.-L. Su, M.-J. Kao, K.-H. Cheng, M.-J. Tsai, "A 5ns fast write multi-level non-volatile 1 K bits RRAM memory with advance write scheme," VLSI Circuits, 2009 Symposium on , pp.82-83, 16-18 June 2009

[11] E. Linn, R. Rosezin, C. Kügeler, and R. Waser, "Complementary resistive switches for passive nanocrossbar memories," Nat. Mater., vol. 9, no. 5, pp. 403–406, May 2010.

[12] Heng-Yuan Lee, Y.-S. Chen, P.-S. Chen, P.-Y. Gu, Y.-Y. Hsu, W.-H. Liu, W.-S. Chen, C.-H. Tsai, F. Chen, C.-H. L,; M.-J. Tsai, "Comprehensively study of read disturb immunity and optimal read scheme for high speed HfOx based RRAM with a Ti layer," VLSI Technology Systems and Applications (VLSI-TSA), 2010 International Symposium on, pp.132-133, 26-28 April 2010