
A deep learning approach to noise prediction and circuit optimization for near-term quantum devices

Alexander Zlokapa
[email protected]

California Institute of Technology

Alexandru Gheorghiu (advisor)
[email protected]

California Institute of Technology

ABSTRACT

Noisy intermediate-scale quantum (NISQ) devices face challenges in achieving high-fidelity computations due to hardware-specific noise. As a basis for noise mitigation, we develop a convolutional neural network noise model to estimate the difference in noise between a given pair of equivalent quantum circuits. On a classically simulated dataset of 1.6 million pairs of quantum circuits with a simplified noise model calibrated to IBM Q hardware, the deep learning approach shows a significant improvement in noise prediction over linear gate count models. A greedy peephole optimization procedure is proposed to minimize noise using the deep learning model as an objective function, showing further improvement in noise mitigation compared to commonly used gate count minimization heuristics.

KEYWORDS

quantum computing, machine learning, artificial intelligence

ACM Reference Format:
Alexander Zlokapa and Alexandru Gheorghiu (advisor). 2019. A deep learning approach to noise prediction and circuit optimization for near-term quantum devices. In SC ’19: ACM/IEEE Supercomputing Conference, November 17–22, 2019, Denver, CO. ACM, New York, NY, USA, 2 pages. https://doi.org/10.1145/nnnnnnn.nnnnnnn

1 INTRODUCTION

In the early stages of quantum computing, NISQ devices are limited in usability and reliability by errors due to thermal relaxation, hardware imperfections, and interactions between adjacent qubits [8]. Given that quantum operations (gates) on these devices are subject to noise, one way to mitigate this is to produce an equivalent circuit that uses fewer gates than the original [2, 5, 7], or one that uses fewer gates of a specific type [1]. While these techniques are useful in reducing noise, they fail to completely capture all hardware-specific faults that can occur in a circuit. We address this with a deep learning approach that learns a noise model for a given architecture. We then use it to optimize quantum circuits for noise reduction.


2 METHODOLOGY

2.1 Simulated dataset

We start by creating a training set of equivalent quantum circuits. A quantum circuit is a sequence of gates, each represented as a unitary operation acting on either one or two qubits. We first perform an exhaustive search over all circuits acting on 2 qubits, with up to 4 gates. The circuits are divided into equivalence classes labeled by the circuit’s matrix representation. Equivalent circuits found in this way may be identified as sub-circuits of a larger circuit and substituted. A ground-truth dataset is generated with 668 equivalence classes of 50 circuits each. Each circuit acts on 8 qubits and contains up to 200 gates. This leads to 1.6 million unique pairs of equivalent circuits.
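As an illustration of how such an exhaustive search can be organized (a minimal Python sketch, not the authors' code; the example gate set, the use of Qiskit, and the rounding of matrix entries are assumptions made here for concreteness), one can enumerate all short 2-qubit circuits and bucket them by their unitary matrix:

from itertools import product
import numpy as np
from qiskit import QuantumCircuit
from qiskit.quantum_info import Operator

# Candidate gates on a 2-qubit register: (gate name, qubit arguments).
# This particular gate set is illustrative, not the one used in the paper.
GATE_POOL = [("h", (0,)), ("h", (1,)), ("t", (0,)), ("t", (1,)),
             ("cx", (0, 1)), ("cx", (1, 0))]

def build_circuit(gate_seq):
    """Construct a 2-qubit circuit from a sequence of (name, qubits) pairs."""
    qc = QuantumCircuit(2)
    for name, qubits in gate_seq:
        getattr(qc, name)(*qubits)
    return qc

def matrix_key(qc, decimals=8):
    """Hashable key for the circuit's unitary, rounded to suppress float noise."""
    unitary = Operator(qc).data
    return tuple(np.round(unitary, decimals).flatten().tolist())

# Enumerate every circuit with 1 to 4 gates and group by matrix representation.
equivalence_classes = {}
for depth in range(1, 5):
    for gate_seq in product(GATE_POOL, repeat=depth):
        qc = build_circuit(gate_seq)
        equivalence_classes.setdefault(matrix_key(qc), []).append(qc)

# Classes with more than one member provide interchangeable sub-circuits.
rewrites = {key: circuits for key, circuits in equivalence_classes.items()
            if len(circuits) > 1}

Sub-circuits drawn from the same class can then be swapped inside a larger circuit without changing its overall unitary.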

We then consider a simple noise model for a simulated version of the IBM Q Melbourne device [3]. This model accounts for readout errors, depolarization errors and thermal relaxation errors. From the generated dataset, we simulate each pair of equivalent circuits (with noise) and record the noise difference as measured by the energy distance between readout probability distributions of the two circuits and a noiseless simulation [9].
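For reference, the energy distance of [9] between distributions F and G, with independent samples X, X' ~ F and Y, Y' ~ G and ||.|| the Euclidean norm, is

\mathcal{E}(F, G) = 2\,\mathbb{E}\|X - Y\| - \mathbb{E}\|X - X'\| - \mathbb{E}\|Y - Y'\|

so, as we read the setup, each circuit in a pair is scored by the energy distance between its noisy readout distribution and the noiseless one, and the regression target is the difference between the two scores.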

2.2 Noise prediction model

Given the ability of the ResNet convolutional neural network architecture [4] to achieve state-of-the-art results on image regression problems [6], we propose an image encoding scheme for quantum circuits and train a ResNet model to learn the noise difference between a pair of equivalent circuits. To establish encoding consistency, we perform a lexicographic topological sort on a directed acyclic graph representation of the circuit. Each circuit is then represented by an 8-channel image (7 gate-type channels plus a control channel), allowing circuit pairs to be easily passed to a neural network. Because of its importance as a source of noise, the two-qubit gate count is concatenated before the fully-connected layer of the ResNet architecture (Figure 1).
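A minimal sketch of one way such an encoding could look (the axis layout, the particular 7-gate basis, and the binary one-hot convention below are assumptions introduced here for illustration; the paper fixes only the channel count):

import numpy as np

# Hypothetical gate basis: 7 gate-type channels plus one channel marking the
# control qubit of two-qubit gates (8 channels in total, as in the paper).
GATE_CHANNELS = {"id": 0, "x": 1, "h": 2, "t": 3, "s": 4, "rz": 5, "cx": 6}
CONTROL_CHANNEL = 7
NUM_QUBITS, MAX_GATES = 8, 200

def encode_circuit(ops):
    """ops: gates as (name, qubits) tuples in a fixed (topologically sorted) order.
    Returns an (8, NUM_QUBITS, MAX_GATES) array usable as CNN input."""
    image = np.zeros((8, NUM_QUBITS, MAX_GATES), dtype=np.float32)
    for step, (name, qubits) in enumerate(ops[:MAX_GATES]):
        channel = GATE_CHANNELS[name]
        for q in qubits:
            image[channel, q, step] = 1.0
        if name == "cx":  # mark the control qubit in the dedicated channel
            image[CONTROL_CHANNEL, qubits[0], step] = 1.0
    return image

The two images of a circuit pair can then be stacked along the channel axis and passed through a standard ResNet, with the two-qubit gate counts appended to the feature vector just before the final fully-connected layer, as described above.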

2.3 Circuit optimization

To apply the learned noise model, we use an iterative peephole optimization algorithm to greedily rewrite regions of the main circuit according to the found equivalence classes. After identifying equivalence classes of each possible region, the substitution that is expected to cause the most favorable noise reduction according to the ResNet prediction is selected. This process is repeated until no further substitutions can be made.
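A minimal sketch of the greedy loop, with enumerate_regions, equivalent_subcircuits, substitute, and predicted_noise_delta standing in for the paper's components (region scanner, equivalence-class lookup, circuit rewriter, and the ResNet noise-difference predictor); these names are placeholders introduced here, not APIs from the paper:

def optimize(circuit, enumerate_regions, equivalent_subcircuits,
             substitute, predicted_noise_delta):
    """Greedy peephole optimization: repeatedly apply the single substitution
    predicted to reduce noise the most, until no substitution helps."""
    while True:
        best_delta, best_circuit = 0.0, None
        for region in enumerate_regions(circuit):
            for replacement in equivalent_subcircuits(region):
                candidate = substitute(circuit, region, replacement)
                delta = predicted_noise_delta(candidate, circuit)  # ResNet estimate
                if delta < best_delta:  # negative delta = predicted noise reduction
                    best_delta, best_circuit = delta, candidate
        if best_circuit is None:
            return circuit
        circuit = best_circuit

Terminating when no candidate is predicted to reduce noise is our reading of "until no further substitutions can be made".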

3 RESULTS

We find significant improvement in noise optimization compared to prior work using gate count minimization [2, 5, 7], which we provide as a benchmark for both noise prediction and noise minimization.


Figure 1: Deep learning noise prediction model consisting of circuit generation, image encoding, and ResNet prediction.

A linear gate count model has a substantially worse R² prediction of noise difference between a pair of equivalent quantum circuits (Table 1) compared to the deep learning model (Figure 2) on the test set. Given the basic noise model used in this analysis, we expect the advantage of deep learning to increase with more complex noise models that include correlated phenomena such as crosstalk between qubits.

Greedy noise minimization with deep learning is 15% ± 3% more effective than gate count minimization (Table 1), thus significantly outperforming this heuristic method for noise reduction. To increase robustness, greedy optimization may be replaced with a scheme such as Monte Carlo tree search, increasing the number of circuits explored and averaging out errors in noise prediction over a larger number of comparisons.

Figure 2: Deep learning noise prediction model predictions on the test set (160,000 circuit pairs) with R² = 0.401 ± 0.003.

Table 1: Comparison of deep learning noise model and gate count heuristic performance. Errors indicate 1σ uncertainty determined by Poisson bootstrapping.

Metric      Deep learning noise model    Gate count noise model
R²          0.401 ± 0.003                0.314 ± 0.003
∆ noise     −0.023 ± 0.001               −0.020 ± 0.001

4 CONCLUSION

Compared to existing heuristics such as gate count minimization, we find that deep learning provides significant improvement in noise reduction through greedy circuit optimization. Our results suggest that deep learning may substantially improve noise mitigation on real-world quantum hardware, addressing a major obstacle in the applicability of NISQ devices with an adaptable and versatile methodology.

ACKNOWLEDGMENTS

We wish to thank Professor Thomas Vidick of Caltech, who provided helpful insight, comments and support for this project.

REFERENCES

[1] Nabila Abdessaied, Mathias Soeken, and Rolf Drechsler. 2014. Quantum Circuit Optimization by Hadamard Gate Reduction. In Reversible Computation, Shigeru Yamashita and Shin-ichi Minato (Eds.). Springer International Publishing, Cham, 149–162.
[2] Gerhard W. Dueck, Anirban Pathak, Md. Mazder Rahman, Abhishek Shukla, and Anindita Banerjee. 2018. Optimization of Circuits for IBM’s Five-Qubit Quantum Computers. 2018 21st Euromicro Conference on Digital System Design (DSD) (2018), 680–684.
[3] Gadi Aleksandrowicz et al. 2019. Qiskit: An Open-source Framework for Quantum Computing. https://doi.org/10.5281/zenodo.2562110
[4] K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770–778. https://doi.org/10.1109/CVPR.2016.90
[5] Vadym Kliuchnikov and Dmitri Maslov. 2013. Optimization of Clifford circuits. Phys. Rev. A 88 (Nov 2013), 052307. Issue 5. https://doi.org/10.1103/PhysRevA.88.052307
[6] S. Lathuilière, P. Mesejo, X. Alameda-Pineda, and R. Horaud. 2019. A Comprehensive Analysis of Deep Regression. IEEE Transactions on Pattern Analysis and Machine Intelligence (2019), 1–1. https://doi.org/10.1109/TPAMI.2019.2910523
[7] Yunseong Nam, Neil J. Ross, Yuan Su, Andrew M. Childs, and Dmitri Maslov. 2018. Automated optimization of large quantum circuits with continuous parameters. npj Quantum Information 4, 1 (2018), 23. https://doi.org/10.1038/s41534-018-0072-4
[8] John Preskill. 2018. Quantum Computing in the NISQ era and beyond. Quantum 2 (2018), 79.
[9] Maria L. Rizzo and Gábor J. Székely. 2016. Energy Distance. WIREs Comput. Stat. 8, 1 (Jan. 2016), 27–38. https://doi.org/10.1002/wics.1375
