17
Research Article Research on UUV Obstacle Avoiding Method Based on Recurrent Neural Networks Changjian Lin , 1 Hongjian Wang , 1 Jianya Yuan , 1 Dan Yu , 1 and Chengfeng Li 1,2 1 College of Automation, Harbin Engineering University, Harbin, China 2 College of Electrical Engineering, Suihua University, Suihua, China Correspondence should be addressed to Hongjian Wang; [email protected] Received 8 July 2018; Revised 14 November 2018; Accepted 18 December 2018; Published 10 January 2019 Guest Editor: Jing Na Copyright © 2019 Changjian Lin et al. is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. In this paper, we present an online obstacle avoidance planning method for unmanned underwater vehicle (UUV) based on clockwork recurrent neural network (CW-RNN) and long short-term memory (LSTM), respectively. In essence, UUV online obstacle avoidance planning is a spatiotemporal sequence planning problem with the spatiotemporal data sequence of sensors as input and control instruction to motion controller of UUV as output. And recurrent neural networks (RNNs) have proven to give state-of-the-art performance on many sequence labeling and sequence prediction tasks. In order to train the networks, a UUV obstacle avoidance dataset is generated and an offline training and testing is adopted in this paper. Finally, the proposed two types of RNN based online obstacle avoidance planners are compared in path cost, obstacle avoidance planning success rate, training time, time-consumption, learning, and generalization, respectively. And the good performance of the proposed methods is demonstrated with a series of simulation experiments in different environments. 1. Introduction e online obstacle avoidance planner is one of the impor- tant modules of UUV which reflects its intelligence level, which requires the UUV to plan a collision-free trajec- tory autonomously when it navigates in long range and unknown environment. At present, the main obstacle avoid- ance methods include traditional methods [1–6], bionics algorithm [7–11], and reinforcement learning methods [12– 15]. Traditionally, a bottleneck restricting the development of UUV obstacle avoidance technology is the uncertainty of underwater sensing equipment. And, the performance of obstacle avoidance in complex environment and even maze environment is not satisfactory. LSTM is a RNN architecture that employs three special gating schemes to address the vanishing and exploding gradient problems. It is able to process complex sequential information for learning features from long-term input data and has proven to give state-of-the-art performance on many challenging problems involving precipitation nowcasting, predicting water table depth, traffic forecasting, object track- ing, punctuation prediction, and so on. e aim of precipitation nowcasting is to provide a forecast of the rainfall intensity in a local region over a relatively short period of time (e.g., 0-6 hours). Shi et al. formulated precipitation nowcasting as a spatiotemporal sequence forecasting problem with the sequence of past radar maps as input and the sequence of a fixed number of future radar maps as output and proposed the convolutional LSTM model, in which the convolutional structures are used to extract features and LSTM is used to do forecasting problem [16]. Long-term predictions of water table depth in agricultural areas face enormous challenges because of their complex, het- erogeneous hydrogeological characteristics, boundary condi- tions, and human activities. In addition, there are nonlinear interactions among these factors. Zhang et al. proposed a time series model based on LSTM to alternate computationally expensive physical models, especially in areas where hydro- geological data are difficult to obtain [17]. Accurate and real-time traffic flow prediction especially short-term traffic flow information is an important part of intelligent transportation system. However, due to the stochastic and nonlinear nature of traffic flow, accurately Hindawi Complexity Volume 2019, Article ID 6320186, 16 pages https://doi.org/10.1155/2019/6320186

Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Research ArticleResearch on UUV Obstacle Avoiding Method Based onRecurrent Neural Networks

Changjian Lin 1 Hongjian Wang 1 Jianya Yuan 1 Dan Yu 1 and Chengfeng Li 12

1College of Automation Harbin Engineering University Harbin China2College of Electrical Engineering Suihua University Suihua China

Correspondence should be addressed to Hongjian Wang cctime99163com

Received 8 July 2018 Revised 14 November 2018 Accepted 18 December 2018 Published 10 January 2019

Guest Editor Jing Na

Copyright copy 2019 Changjian Lin et al This is an open access article distributed under the Creative Commons Attribution Licensewhich permits unrestricted use distribution and reproduction in any medium provided the original work is properly cited

In this paper we present an online obstacle avoidance planning method for unmanned underwater vehicle (UUV) based onclockwork recurrent neural network (CW-RNN) and long short-term memory (LSTM) respectively In essence UUV onlineobstacle avoidance planning is a spatiotemporal sequence planning problem with the spatiotemporal data sequence of sensorsas input and control instruction to motion controller of UUV as output And recurrent neural networks (RNNs) have proven togive state-of-the-art performance on many sequence labeling and sequence prediction tasks In order to train the networks a UUVobstacle avoidance dataset is generated and an offline training and testing is adopted in this paper Finally the proposed two types ofRNN based online obstacle avoidance planners are compared in path cost obstacle avoidance planning success rate training timetime-consumption learning and generalization respectively And the good performance of the proposedmethods is demonstratedwith a series of simulation experiments in different environments

1 Introduction

The online obstacle avoidance planner is one of the impor-tant modules of UUV which reflects its intelligence levelwhich requires the UUV to plan a collision-free trajec-tory autonomously when it navigates in long range andunknown environment At present the main obstacle avoid-ance methods include traditional methods [1ndash6] bionicsalgorithm [7ndash11] and reinforcement learning methods [12ndash15] Traditionally a bottleneck restricting the developmentof UUV obstacle avoidance technology is the uncertaintyof underwater sensing equipment And the performance ofobstacle avoidance in complex environment and even mazeenvironment is not satisfactory

LSTM is a RNN architecture that employs three specialgating schemes to address the vanishing and explodinggradient problems It is able to process complex sequentialinformation for learning features from long-term input dataand has proven to give state-of-the-art performance onmanychallenging problems involving precipitation nowcastingpredicting water table depth traffic forecasting object track-ing punctuation prediction and so on

The aim of precipitation nowcasting is to provide aforecast of the rainfall intensity in a local region over arelatively short period of time (eg 0-6 hours) Shi etal formulated precipitation nowcasting as a spatiotemporalsequence forecasting problemwith the sequence of past radarmaps as input and the sequence of a fixed number of futureradar maps as output and proposed the convolutional LSTMmodel in which the convolutional structures are used toextract features and LSTM is used to do forecasting problem[16]

Long-termpredictions of water table depth in agriculturalareas face enormous challenges because of their complex het-erogeneous hydrogeological characteristics boundary condi-tions and human activities In addition there are nonlinearinteractions among these factors Zhang et al proposed a timeseries model based on LSTM to alternate computationallyexpensive physical models especially in areas where hydro-geological data are difficult to obtain [17]

Accurate and real-time traffic flow prediction especiallyshort-term traffic flow information is an important partof intelligent transportation system However due to thestochastic and nonlinear nature of traffic flow accurately

HindawiComplexityVolume 2019 Article ID 6320186 16 pageshttpsdoiorg10115520196320186

2 Complexity

E

N

bx

by

u

v

O y

x

r

r

0∘

u

Figure 1 Global and local coordinate systems

predicting traffic state is a challenging task The LSTM isable to learn time series with long time dependency andautomatically determine the optimal time lags Ma et alfound this feature is especially desirable for traffic predic-tion problems where future traffic condition is commonlyrelevant to the previous events with long time spans andproposed a LSTM-based traffic flow prediction method tocapture nonlinear traffic dynamic in an effective manner [18]Duan et al explored a LSTM neural network model whichcan automatically reserve historical sequence information inits model structure for travel time prediction [19] Chen etal trained a LSTM model to learn the patterns in trafficcondition sequences which utilized the historical trafficconditions obtained fromAMAP a webmap service providerin China to predict traffic conditions in the future [20]

Object tracking is a fundamental problem in computervision with a wide range of applications The target of atracking system is to estimate the state sequence of the objectbased on observation sequence LSTM has been introducedin object tracking for object representations via sequencelearning Li et al employed LSTM units to directly learntemporally correlated representations of the objects in longsequences [21] Zhou et al introduced a bidirectional LSTM-based appearance model to learn the spatial contextualdependency [22] Wang et al proposed a 3D fish trackingmethod and multifish tracking method in which a LSTMnetwork is employed to model the fishrsquos motion process [2324]

In other fields Chen et al modeled and predicted Chinastock returns using LSTMand improved the accuracy of stockreturn prediction greatly [25] Sak et al demonstrated thestate-of-the-art performance of LSTM networks on speechrecognition tasks compared with RNN and deep neuralnetworks (DNNs) models [26] Wu et al utilized LSTM tosolve remaining useful life estimation problem and got goodremaining useful life prediction accuracy [27] Chherawalaet al presented a handwriting recognition model based onLSTM network which automatically learns features from theinput image in a supervised fashion [28]

In 2014 Koutnık et al introduced CW-RNN whichsimplifies theRNNarchitecture improves the performance ofnetwork and speeds up the network evaluation [29] Achanta

et al showed that CW-RNN is equivalent to the standardRNN architecture with a time-varying leaky integration [30]

Two end-to-end online obstacle avoidance plannersbased on LSTM and CW-RNN respectively are presented inthis paper The obstacle avoidance planners take the infor-mation obtained by multibeam forward looking sonar (FLS)as input and directly output control instruction to motioncontroller of UUV The RNN based obstacle avoidanceplanners remain robust performance even though the effectsof measurement noises are considered And due to the stronglearning ability of RNN the obstacle avoidance planners arecapable for obstacle avoidance in the environments which farmuch complex than those environments existed in trainingsamples

2 UUV System Modeling

The obstacle avoidance planning on the vertical plane isusually achieved through depth adjustment while the depthadjustment strategy often brings large pitch adjustmentwhich affects the attitude control of UUV Therefore thispaper adopts the strategy of horizontal plane obstacle avoid-ance regulation priority and defines a horizontal 3 degrees-offreedom (DPF) control model for UUV which can not onlyguarantee the safety of UUV collision avoidance planningbut also facilitate the UUVmotion control

The North East-fixed reference frame and body-fixedreference frame are shown in Figure 1 The 3 DOF controlmodel of UUV is described as follows [31]120578 = 119877 (120595)119881 (1)119872 + 119862 (119881)119881 + 119863 (119881)119881 = 120591 (2)

where 120578 = [119909 119910 120595]119879 is position vector correspond to theposition of UUV in North East-fixed reference frame andthe heading of UUV respectively 119877(120595) is the transformationmatrix from North East-fixed reference frame to body-fixedreference frame 119881 = [119906 V 119903]119879 denotes the velocity vectorincluding the surge sway and yaw of UUV in body-fixedreference frame the actuator input is denoted by 120591 =[120591119906 0 120591119903]119879 and119872 = 119872119877119861 + 119872119860 119862(119881) = 119862119877119861(119881) + 119862119860(119881)

Complexity 3

and 119863(119881) = 119863119897 + 119863119899119897(119881) denote the system inertia matrixcoriolis-centripetalmatrix and dampingmatrix respectivelySpecifically

119877 (120595) = [[[cos120595 minus sin120595 0sin120595 cos120595 00 0 1]]] (3)

119872119877119861 = [[[119898 0 00 119898 1198981199091198660 119898119909119866 119868119911 ]]]

119872119860 = [[[minus119883 0 00 minus119884 V minus119884 1199030 minus119884 119903 minus119873 119903]]]

(4)

119862119877119861 (119881) = [[[0 0 minus119898 (119909119866119903 + V)0 0 119898119906119898 (119909119866119903 + V) minus119898119906 0 ]]]

119862119860 (119881) = [[[0 0 119884 VV + 119884 1199031199030 0 minus119883119906minus119884 VV minus 119884 119903119903 119883119906 0 ]]]

(5)

119863119897 = [[[119883119906 0 00 119884V minus1198841199030 minus119873V 119873119903 ]]]

119863119899119897 (119881) = [[[119883|119906|119906 |119906| 0 00 119884|V|V |V| minus119884|V|119903 |V|0 minus119873|119903|V |119903| 119873|119903|119903 |119903| ]]]

(6)

A constant current is assumed in this paper which isexpressed as a vector [119906119888 V119888]119879 in body-fixed reference frameAnd then the kinematic and dynamic equations of UUV canbe described as 119903 = (minus11988911119906119903 + 120591119906)11989811

V119903 = (11986011989833 minus 11986111989823)(1198982211989833 minus 119898223)119903 = (11986111989822 minus 11986011989823)(1198982211989833 minus 119898223) = 119906 cos (120595) minus V sin (120595)119910 = 119906 sin (120595) + V cos (120595) = 119903

(7)

where119860 = minus11988922V119903 +(11988923minus11990611990311988823 minus119898119906119888)119903 119861 = (11988932 minus11990611990311988832)V119903 minus11988933119903 + 120591119903 and 11989811 = 119898 minus 11988311989822 = 119898 minus 119884 V

11989823 = minus119884 11990311989833 = 119868119911 minus 119873 11990311988823 = 119898 minus 11988311988832 = 119883 minus 119884 V11988911 = 119883119906119903

+ 119883|119906119903|119906119903

1003816100381610038161003816119906119903100381610038161003816100381611988922 = 119884V119903

+ 119884|V119903|V119903

1003816100381610038161003816V119903100381610038161003816100381611988923 = 11988411990311988932 = 119873V11990311988933 = 119873119903 + 119873|119903|119903 |119903|

(8)

Assume there are twopropellers distribute in the horizon-tal plane of UUV And the force vector 120591 is modeled as

[120591119906120591119903] = [ 1 1119889119901 minus119889119901][119879(119899119901)119879 (119899119904)] (9)

where 119899119901 and 119899119904 denote the speeds of propellers of UUVrespectively 119889119901 is the distance between the propeller andcentral axis of UUV and 119879(119899119901) and 119879(119899119904) denote propellercoefficients

3 Simulation Model of Sonar

The input data of obstacle avoidance planners proposed bythis paper are obtained by multibeam forward looking sonarA 2D simulation model of multibeam FLS based on SeaBat8125 is established in this section SeaBat 8125 is a state-of-the-art high-resolution multibeam echosounder [32] It has a120∘ field of view sector 80 beams with width of 15∘ andthe maximum scan radius of 120119898 To simplify the inputinformation of network define the distance vector 119863119905 =[120 minus 1198890119905 120 minus 1198891119905 sdot sdot sdot 120 minus 11988979119905 ] where 119889119894119905 is the distanceinformation detected by 119894th ray of sonar at time step 119905 andif 119889119894119905 gt 120 then set 119889119894119905 = 120 The precision of sonar isset as 5119898 And taking the uncertainty of sonar detection intoaccount this paper sets the false alarm rate as 10

4 The Structures of ObstacleAvoidance Planners

41 The Structure of CW-RNN The forward propagation ofstandard RNN is as follows119904119905 = 120590 (119882119909119904119909119905 +119882119904119904119904119905minus1 + 119887119904) (10)119900119905 = tanh (119882119904119900119904119905 + 119887119900) (11)

where 119882119909119904 and 119882119904119904 denote the weight matrices from inputlayer and hidden layer to hidden layer respectively119882119904119900 is theweight matrix between hidden layer and output layer 119909119905 119904119905and 119900119905 are the input vector hidden state vector and output

4 Complexity

0x

81 inputs

23 units

2 units

Full connectionFull connection

1x 2x 9x

Input layer

Hidden layer

Middle layer

Output layer41

42

43

44

41

42

43

44

41

42

43

44

Figure 2 The network structure of CW-RNN Bias units are omitted to simplify the visualization network

vector at time step 119905 respectively and 119887119904 and 119887119900 correspond tothe biases of hidden layer and output layer respectively

As shown in Figure 2 the neurons in hidden layer aregrouped into 119892modules of size 119896 in the forward propagationof CW-RNN Each module i is set an explicit clock 119879119894 = 2119894minus1to operate For every module j only if 119879119894 le 119879119895 the recurrentconnections from module 119895 to module 119894 are existed Andthe state of modules i will be updated only if the modulesi satisfy (119905MOD119879119894) = 0 at each time step 119905 The long-term memory is restored by the modules have long periodThe local information obtained from input data is solved bymodules with short period

Therefore 119882119909119904 and 119882119904119904 are partitioned into 119892 blocks-rows corresponding to 119892 modules and 119882119904119904 is a block-uppertriangular matrix

119882119909119904 = (1198821199091199041119882119909119904 g)119882119904119904 = (1198821199041199041119882119904119904 g)

(12)

where each block-row119882119904119904 119894 is partitioned into block-columns0 sdot sdot sdot 0119882119904119904 119894119894 sdot sdot sdot 119882119904119904 119894119892 and119882119904119904 119894 = 119882119904119904 119894 if (119905MOD119879119894) = 00 otherwise

(13)

42 The Structure of LSTM In LSTM the memory blocksare used to replace the hidden units in RNN As shown in

Figure 3 such amemory block consists of a cell an input gatean output gate and a forget gate The current state of hiddenlayer is restored in cell the three import gate units whichcontrol the input output and forget of cell respectively Theforward propagation of LSTM is as follows119894119905 = 120590 (119882119909119894119909119905 +119882ℎ119894ℎ119905minus1 + 119887119894) (14)119891119905 = 120590 (119882119909119891119909119905 +119882ℎ119891ℎ119905minus1 + 119887119891) (15)119888119905 = 119891119905 ∙ 119888119905minus1 + 119894119905 ∙ tanh (119882119909119888119909119905 +119882ℎ119888ℎ119905minus1 + 119887119888) (16)119900119905 = 120590 (119882119909119900119909119905 +119882ℎ119900ℎ119905minus1 + 119887119900) (17)ℎ119905 = 119900119905 ∙ tanh (119888119905) (18)

where 119894119905 119891119905 119888119905 119900119905 and ℎ119905 are outputs of input gate forgetgate cell output gate and memory block at time step trespectively 119909119905 is input vector of memory block at time step tℎ119905minus1 is the output vector ofmemory block at t-1 time step119882119909119894119882119909119891119882119909119888 and119882119909119900 are the weight matrices from input vectorto input gate forget gate cell and output gate respectively119882ℎ119894 119882ℎ119891 119882ℎ119888 and 119882ℎ119900 are the weight matrices from theoutput of memory block at previous time step to input gateforget gate cell and output gate respectively 119887119894 119887119891 119887119888 and119887119900 are biases of input gate forget gate cell and output gaterespectively 120590(sdot) is activation function of gate unit whichis set as logistic sigmoid function in this paper ∙ representselement-wise product

5 Construction of UUV Autonomous ObstacleAvoidance Planning Learning System

The principle framework of UUV autonomous obstacleavoidance planning learning system is shown in Figure 4At first the RNN based obstacle avoidance planners are

Complexity 5

tanh

tanh

+ +

tanh

tanh

tanh

tanh

tminus9x tminus8x

tf

Full connection

23 units

2 units

th

Hidden layer

Middle layer

Input layer

Output layer

Lb Blocks

81 inputs

Full connection

+ + + + + + + + + + + +ibfb cb ob ibfb cb ob ibfb cb ob

ti to

tx

1th minus

tanh fbHyperbolic tangent

Element wise addition concatenation

Element wise multiplication

BiasSigmoid

Forget gate vector Input gate vector Output gate vectortf ti toGate vectors

tx Input vector tminus1h Output of previous block tminus1c Memory from previous block

Operations

Nonlinearities

Inputs

Outputsth ct

ctctminus1

Output of current block Memory of current block

+

Figure 3 The network structure of LSTM

trained offline Then these fully trained planners are usedto do obstacle avoidance planning for UUV in real timeaccording to the environmental information obtained by FLSand some information of UUV obtained by motion and atti-tude sensor The motion controller controls the UUV basedon control commands output by online obstacle avoidanceplanners

The flowchart of RNN based online obstacle avoidanceplanning system is as follows

Step 1 Initialize the start position and target position ofUUVand deploy UUV in the start position

Step 2 Acquire data from sonar motion and attitude sen-sors

Step 3 The online RNN obstacle avoidance planner outputthe desired yaw and velocity of UUV according to sensorsdata

Step 4 UUV adjusts its heading and velocity according to theoutput instruction of onlineRNNobstacle avoidance planner

Step 5 Determine whether the UUV reach the target posi-tion and if so the obstacle avoidance planning algorithm isstopped Else jump to Step 2

6 Data Processing and Network Training

The input sequence 119909119905 of obstacle avoidance planners at timestep t consists of distance vector 119863119905 and the angle betweenUUV and target in North East-fixed reference frame 120593119905 Theoutput vector of obstacle avoidance planners at time step tis constituted by the adjustment of heading and the velocityof UUV The dataset consists of 120000 training samplesand 4810 test samples In the dataset the start point targetpoint and obstacles are generated randomly And MinndashMaxnormalization is used to preprocess input and output data

The only difference between the two types of obstacleavoidance planners is the structure hidden layers which arecomposed by CW-RNN and LSTM respectively This settingis convenient for comparison between the performance ofCW-RNN and LSTM on obstacle avoidance for UUV Thetwo types of obstacle avoidance planners consist of inputlayer hidden layer middle layer and output layer Thereare 81 neurons in input layer 23 neurons in middle layerand 2 neurons in output layer To overcome the problem ofoverfitting dropout with 06 keep probability is used in theprocess of train The loss function is mean squared error(MSE) the weights are updated using the backpropagationthrough timeminibatch gradient descent tominimizeMSEofwhich batch size is set as 10000 And the optimizer is Adam

6 Complexity

Fully Trained RNN

Online ObstacleAvoidance Planner

Motion Controller

Actuator

System Model ofUUV

Motion andAttitude Sensors

Environment

FLS

Online RNN based obstacle avoidance planning of UUV

79

79

79

Hidden Layer

Middle Layer

Output Layer

Modify the Weightof RNN by

Backpropagationthrough Time

-

+

Offline RNN based obstacle avoidance training and testing

Obtain data from training set or test

set

Label

Training

Input Layer

Output ErrorTesting

01

01

01

Figure 4 Principle framework of RNN based obstacle avoidance planning learning system

optimizer the maximum number of iterations is 20000 Allnetworks are trained at Core i3 CPU 200GHztimes4

The parameters of four networks are shown in Table 1And the MSE of the four networks on test dataset is shownin Figure 5 Table 1 and Figure 5 show that for the samenetwork the offline training time of the networks increasesand the convergence slows down as the number of parametersrises but the best MSE reduces And in the early stage oftraining the network with fewer parameters converges fasterwhile in the later stage the opposite happens Compared withCW-RNN LSTM converges faster and obtains better results

7 Results and Analysis

In this section a statistical experiment and several illustrativeexamples are present to validate the ability of obstacle avoid-ance algorithms The size of the map is set to 800119898 times 1200119898the velocity of UUV is set as a constant 8119896119899 And taking theenvironmental factors into consideration this paper added10 false alarm rate to sonar data in simulation test cases

71 Statistical Experiment In order to verify the obstacleavoidance planning effect of each network under differentenvironmental disturbances the statistical experiment isdesigned in this paper The experiment counted the perfor-mance of different networks on 100 random maps at the false

alarm rate of 5 10 and 15 respectively The experi-mental results are shown in Table 2 Table 2 shows that forthe same network the more parameters the higher planningsuccess rate the lower path cost but more time the algorithmtakes Compared with CW-RNN LSTM has advantages inpath cost success rate and stabilityThe reasons for the failureof each network planning are shown in Table 3 Among themlsquononarrivalrsquo means that UUV stops near the target point notat the target point lsquoDisorientationrsquo means that UUV driftsthrough the map after dodging obstacles rather than movingtoward the target The disorientation occurs when the obsta-cle avoidance planner cannot extract the target informationIt can be seen from Table 3 that the increase of false alarmrate makes the probability of collision and disorientationpath planned by CW-RNN increase but it has little effect onLSTM This indicates that LSTM is superior to CW-RNN inprocessing of long-term memory As shown in Table 3 CW-RNNs get a higher probability than LSTMs both in the termsof collision and lost which indicates that LSTM has betterability to learn and extract detailed features than CW-RNN

72 Simulation Test Case 1 For further analysis of the learn-ing ability of the proposed obstacle avoidance algorithms thissimulation test case tests the obstacle avoidance performanceof the four structures in two maps with the same complexityas maps in training set The tracks yaw and propeller

Complexity 7

Updates

0

05

1

15

2

25

3

35

The M

ean

Squa

red

Erro

r (M

SE)

CW-RNN96LSTM18CW-RNN180LSTM45

0

0002

0004

0006

0008

001

0012

0014

0016

0018

002

100500

0

05

1

15

2

25

3

35

0

1

2

3

2181614121080604020

15141312

218161412108060402

times10minus3

times104

times104

times104

Figure 5 The mean squared error of all structures on test set

Table 1 The performance of all structures in training

Structure CW-RNN96 CW-RNN180 LSTM18 LSTM45Number of units in hidden layer Rb=24 Rb=45 Lb=18 Lb=45Number of parameters in hidden layer 6177(mean) 17253(mean) 5832 19440Mean train time of each iterations 636 1185 644 1829Initial error 0454 0645 2800 4600Best MSE 174times10minus3 126times10minus3 152times10minus3 87times10minus4Convergence generation 15000 18000 9500 13500

Table 2 The performance of all structures in statistical experiment

False alarm rate () CW-RNN96 CW-RNN180 LSTM18 LSTM45

Success rate ()5 88 94 93 9910 86 93 94 9815 80 87 92 98

Mean path cost (m)5 13093 12535 12983 1167610 13228 12973 13033 1183415 13764 13289 13116 11797

Time-consumption (ms)5 5273 18536 4556 1989810 5244 186 84 4475 1992515 5302 18597 4533 19906

Table 3 The reasons for failure of obstacle avoidance planning in statistical experiment

False alarm rate () Reason CW-RNN96 CW-RNN180 LSTM18 LSTM45

5Collision () 2 1 0 0

Disorientation () 5 2 1 0Non-arrival () 5 3 6 1

10Collision () 3 2 0 0

Disorientation () 6 3 0 0Non-arrival () 5 2 6 2

15Collision () 5 4 0 0

Disorientation () 9 5 2 0Non-arrival () 6 4 6 2

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 2: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

2 Complexity

E

N

bx

by

u

v

O y

x

r

r

0∘

u

Figure 1 Global and local coordinate systems

predicting traffic state is a challenging task The LSTM isable to learn time series with long time dependency andautomatically determine the optimal time lags Ma et alfound this feature is especially desirable for traffic predic-tion problems where future traffic condition is commonlyrelevant to the previous events with long time spans andproposed a LSTM-based traffic flow prediction method tocapture nonlinear traffic dynamic in an effective manner [18]Duan et al explored a LSTM neural network model whichcan automatically reserve historical sequence information inits model structure for travel time prediction [19] Chen etal trained a LSTM model to learn the patterns in trafficcondition sequences which utilized the historical trafficconditions obtained fromAMAP a webmap service providerin China to predict traffic conditions in the future [20]

Object tracking is a fundamental problem in computervision with a wide range of applications The target of atracking system is to estimate the state sequence of the objectbased on observation sequence LSTM has been introducedin object tracking for object representations via sequencelearning Li et al employed LSTM units to directly learntemporally correlated representations of the objects in longsequences [21] Zhou et al introduced a bidirectional LSTM-based appearance model to learn the spatial contextualdependency [22] Wang et al proposed a 3D fish trackingmethod and multifish tracking method in which a LSTMnetwork is employed to model the fishrsquos motion process [2324]

In other fields Chen et al modeled and predicted Chinastock returns using LSTMand improved the accuracy of stockreturn prediction greatly [25] Sak et al demonstrated thestate-of-the-art performance of LSTM networks on speechrecognition tasks compared with RNN and deep neuralnetworks (DNNs) models [26] Wu et al utilized LSTM tosolve remaining useful life estimation problem and got goodremaining useful life prediction accuracy [27] Chherawalaet al presented a handwriting recognition model based onLSTM network which automatically learns features from theinput image in a supervised fashion [28]

In 2014 Koutnık et al introduced CW-RNN whichsimplifies theRNNarchitecture improves the performance ofnetwork and speeds up the network evaluation [29] Achanta

et al showed that CW-RNN is equivalent to the standardRNN architecture with a time-varying leaky integration [30]

Two end-to-end online obstacle avoidance plannersbased on LSTM and CW-RNN respectively are presented inthis paper The obstacle avoidance planners take the infor-mation obtained by multibeam forward looking sonar (FLS)as input and directly output control instruction to motioncontroller of UUV The RNN based obstacle avoidanceplanners remain robust performance even though the effectsof measurement noises are considered And due to the stronglearning ability of RNN the obstacle avoidance planners arecapable for obstacle avoidance in the environments which farmuch complex than those environments existed in trainingsamples

2 UUV System Modeling

The obstacle avoidance planning on the vertical plane isusually achieved through depth adjustment while the depthadjustment strategy often brings large pitch adjustmentwhich affects the attitude control of UUV Therefore thispaper adopts the strategy of horizontal plane obstacle avoid-ance regulation priority and defines a horizontal 3 degrees-offreedom (DPF) control model for UUV which can not onlyguarantee the safety of UUV collision avoidance planningbut also facilitate the UUVmotion control

The North East-fixed reference frame and body-fixedreference frame are shown in Figure 1 The 3 DOF controlmodel of UUV is described as follows [31]120578 = 119877 (120595)119881 (1)119872 + 119862 (119881)119881 + 119863 (119881)119881 = 120591 (2)

where 120578 = [119909 119910 120595]119879 is position vector correspond to theposition of UUV in North East-fixed reference frame andthe heading of UUV respectively 119877(120595) is the transformationmatrix from North East-fixed reference frame to body-fixedreference frame 119881 = [119906 V 119903]119879 denotes the velocity vectorincluding the surge sway and yaw of UUV in body-fixedreference frame the actuator input is denoted by 120591 =[120591119906 0 120591119903]119879 and119872 = 119872119877119861 + 119872119860 119862(119881) = 119862119877119861(119881) + 119862119860(119881)

Complexity 3

and 119863(119881) = 119863119897 + 119863119899119897(119881) denote the system inertia matrixcoriolis-centripetalmatrix and dampingmatrix respectivelySpecifically

119877 (120595) = [[[cos120595 minus sin120595 0sin120595 cos120595 00 0 1]]] (3)

119872119877119861 = [[[119898 0 00 119898 1198981199091198660 119898119909119866 119868119911 ]]]

119872119860 = [[[minus119883 0 00 minus119884 V minus119884 1199030 minus119884 119903 minus119873 119903]]]

(4)

119862119877119861 (119881) = [[[0 0 minus119898 (119909119866119903 + V)0 0 119898119906119898 (119909119866119903 + V) minus119898119906 0 ]]]

119862119860 (119881) = [[[0 0 119884 VV + 119884 1199031199030 0 minus119883119906minus119884 VV minus 119884 119903119903 119883119906 0 ]]]

(5)

119863119897 = [[[119883119906 0 00 119884V minus1198841199030 minus119873V 119873119903 ]]]

119863119899119897 (119881) = [[[119883|119906|119906 |119906| 0 00 119884|V|V |V| minus119884|V|119903 |V|0 minus119873|119903|V |119903| 119873|119903|119903 |119903| ]]]

(6)

A constant current is assumed in this paper which isexpressed as a vector [119906119888 V119888]119879 in body-fixed reference frameAnd then the kinematic and dynamic equations of UUV canbe described as 119903 = (minus11988911119906119903 + 120591119906)11989811

V119903 = (11986011989833 minus 11986111989823)(1198982211989833 minus 119898223)119903 = (11986111989822 minus 11986011989823)(1198982211989833 minus 119898223) = 119906 cos (120595) minus V sin (120595)119910 = 119906 sin (120595) + V cos (120595) = 119903

(7)

where119860 = minus11988922V119903 +(11988923minus11990611990311988823 minus119898119906119888)119903 119861 = (11988932 minus11990611990311988832)V119903 minus11988933119903 + 120591119903 and 11989811 = 119898 minus 11988311989822 = 119898 minus 119884 V

11989823 = minus119884 11990311989833 = 119868119911 minus 119873 11990311988823 = 119898 minus 11988311988832 = 119883 minus 119884 V11988911 = 119883119906119903

+ 119883|119906119903|119906119903

1003816100381610038161003816119906119903100381610038161003816100381611988922 = 119884V119903

+ 119884|V119903|V119903

1003816100381610038161003816V119903100381610038161003816100381611988923 = 11988411990311988932 = 119873V11990311988933 = 119873119903 + 119873|119903|119903 |119903|

(8)

Assume there are twopropellers distribute in the horizon-tal plane of UUV And the force vector 120591 is modeled as

[120591119906120591119903] = [ 1 1119889119901 minus119889119901][119879(119899119901)119879 (119899119904)] (9)

where 119899119901 and 119899119904 denote the speeds of propellers of UUVrespectively 119889119901 is the distance between the propeller andcentral axis of UUV and 119879(119899119901) and 119879(119899119904) denote propellercoefficients

3 Simulation Model of Sonar

The input data of obstacle avoidance planners proposed bythis paper are obtained by multibeam forward looking sonarA 2D simulation model of multibeam FLS based on SeaBat8125 is established in this section SeaBat 8125 is a state-of-the-art high-resolution multibeam echosounder [32] It has a120∘ field of view sector 80 beams with width of 15∘ andthe maximum scan radius of 120119898 To simplify the inputinformation of network define the distance vector 119863119905 =[120 minus 1198890119905 120 minus 1198891119905 sdot sdot sdot 120 minus 11988979119905 ] where 119889119894119905 is the distanceinformation detected by 119894th ray of sonar at time step 119905 andif 119889119894119905 gt 120 then set 119889119894119905 = 120 The precision of sonar isset as 5119898 And taking the uncertainty of sonar detection intoaccount this paper sets the false alarm rate as 10

4 The Structures of ObstacleAvoidance Planners

41 The Structure of CW-RNN The forward propagation ofstandard RNN is as follows119904119905 = 120590 (119882119909119904119909119905 +119882119904119904119904119905minus1 + 119887119904) (10)119900119905 = tanh (119882119904119900119904119905 + 119887119900) (11)

where 119882119909119904 and 119882119904119904 denote the weight matrices from inputlayer and hidden layer to hidden layer respectively119882119904119900 is theweight matrix between hidden layer and output layer 119909119905 119904119905and 119900119905 are the input vector hidden state vector and output

4 Complexity

0x

81 inputs

23 units

2 units

Full connectionFull connection

1x 2x 9x

Input layer

Hidden layer

Middle layer

Output layer41

42

43

44

41

42

43

44

41

42

43

44

Figure 2 The network structure of CW-RNN Bias units are omitted to simplify the visualization network

vector at time step 119905 respectively and 119887119904 and 119887119900 correspond tothe biases of hidden layer and output layer respectively

As shown in Figure 2 the neurons in hidden layer aregrouped into 119892modules of size 119896 in the forward propagationof CW-RNN Each module i is set an explicit clock 119879119894 = 2119894minus1to operate For every module j only if 119879119894 le 119879119895 the recurrentconnections from module 119895 to module 119894 are existed Andthe state of modules i will be updated only if the modulesi satisfy (119905MOD119879119894) = 0 at each time step 119905 The long-term memory is restored by the modules have long periodThe local information obtained from input data is solved bymodules with short period

Therefore 119882119909119904 and 119882119904119904 are partitioned into 119892 blocks-rows corresponding to 119892 modules and 119882119904119904 is a block-uppertriangular matrix

119882119909119904 = (1198821199091199041119882119909119904 g)119882119904119904 = (1198821199041199041119882119904119904 g)

(12)

where each block-row119882119904119904 119894 is partitioned into block-columns0 sdot sdot sdot 0119882119904119904 119894119894 sdot sdot sdot 119882119904119904 119894119892 and119882119904119904 119894 = 119882119904119904 119894 if (119905MOD119879119894) = 00 otherwise

(13)

42 The Structure of LSTM In LSTM the memory blocksare used to replace the hidden units in RNN As shown in

Figure 3 such amemory block consists of a cell an input gatean output gate and a forget gate The current state of hiddenlayer is restored in cell the three import gate units whichcontrol the input output and forget of cell respectively Theforward propagation of LSTM is as follows119894119905 = 120590 (119882119909119894119909119905 +119882ℎ119894ℎ119905minus1 + 119887119894) (14)119891119905 = 120590 (119882119909119891119909119905 +119882ℎ119891ℎ119905minus1 + 119887119891) (15)119888119905 = 119891119905 ∙ 119888119905minus1 + 119894119905 ∙ tanh (119882119909119888119909119905 +119882ℎ119888ℎ119905minus1 + 119887119888) (16)119900119905 = 120590 (119882119909119900119909119905 +119882ℎ119900ℎ119905minus1 + 119887119900) (17)ℎ119905 = 119900119905 ∙ tanh (119888119905) (18)

where 119894119905 119891119905 119888119905 119900119905 and ℎ119905 are outputs of input gate forgetgate cell output gate and memory block at time step trespectively 119909119905 is input vector of memory block at time step tℎ119905minus1 is the output vector ofmemory block at t-1 time step119882119909119894119882119909119891119882119909119888 and119882119909119900 are the weight matrices from input vectorto input gate forget gate cell and output gate respectively119882ℎ119894 119882ℎ119891 119882ℎ119888 and 119882ℎ119900 are the weight matrices from theoutput of memory block at previous time step to input gateforget gate cell and output gate respectively 119887119894 119887119891 119887119888 and119887119900 are biases of input gate forget gate cell and output gaterespectively 120590(sdot) is activation function of gate unit whichis set as logistic sigmoid function in this paper ∙ representselement-wise product

5 Construction of UUV Autonomous ObstacleAvoidance Planning Learning System

The principle framework of UUV autonomous obstacleavoidance planning learning system is shown in Figure 4At first the RNN based obstacle avoidance planners are

Complexity 5

tanh

tanh

+ +

tanh

tanh

tanh

tanh

tminus9x tminus8x

tf

Full connection

23 units

2 units

th

Hidden layer

Middle layer

Input layer

Output layer

Lb Blocks

81 inputs

Full connection

+ + + + + + + + + + + +ibfb cb ob ibfb cb ob ibfb cb ob

ti to

tx

1th minus

tanh fbHyperbolic tangent

Element wise addition concatenation

Element wise multiplication

BiasSigmoid

Forget gate vector Input gate vector Output gate vectortf ti toGate vectors

tx Input vector tminus1h Output of previous block tminus1c Memory from previous block

Operations

Nonlinearities

Inputs

Outputsth ct

ctctminus1

Output of current block Memory of current block

+

Figure 3 The network structure of LSTM

trained offline Then these fully trained planners are usedto do obstacle avoidance planning for UUV in real timeaccording to the environmental information obtained by FLSand some information of UUV obtained by motion and atti-tude sensor The motion controller controls the UUV basedon control commands output by online obstacle avoidanceplanners

The flowchart of RNN based online obstacle avoidanceplanning system is as follows

Step 1 Initialize the start position and target position ofUUVand deploy UUV in the start position

Step 2 Acquire data from sonar motion and attitude sen-sors

Step 3 The online RNN obstacle avoidance planner outputthe desired yaw and velocity of UUV according to sensorsdata

Step 4 UUV adjusts its heading and velocity according to theoutput instruction of onlineRNNobstacle avoidance planner

Step 5 Determine whether the UUV reach the target posi-tion and if so the obstacle avoidance planning algorithm isstopped Else jump to Step 2

6 Data Processing and Network Training

The input sequence 119909119905 of obstacle avoidance planners at timestep t consists of distance vector 119863119905 and the angle betweenUUV and target in North East-fixed reference frame 120593119905 Theoutput vector of obstacle avoidance planners at time step tis constituted by the adjustment of heading and the velocityof UUV The dataset consists of 120000 training samplesand 4810 test samples In the dataset the start point targetpoint and obstacles are generated randomly And MinndashMaxnormalization is used to preprocess input and output data

The only difference between the two types of obstacleavoidance planners is the structure hidden layers which arecomposed by CW-RNN and LSTM respectively This settingis convenient for comparison between the performance ofCW-RNN and LSTM on obstacle avoidance for UUV Thetwo types of obstacle avoidance planners consist of inputlayer hidden layer middle layer and output layer Thereare 81 neurons in input layer 23 neurons in middle layerand 2 neurons in output layer To overcome the problem ofoverfitting dropout with 06 keep probability is used in theprocess of train The loss function is mean squared error(MSE) the weights are updated using the backpropagationthrough timeminibatch gradient descent tominimizeMSEofwhich batch size is set as 10000 And the optimizer is Adam

6 Complexity

Fully Trained RNN

Online ObstacleAvoidance Planner

Motion Controller

Actuator

System Model ofUUV

Motion andAttitude Sensors

Environment

FLS

Online RNN based obstacle avoidance planning of UUV

79

79

79

Hidden Layer

Middle Layer

Output Layer

Modify the Weightof RNN by

Backpropagationthrough Time

-

+

Offline RNN based obstacle avoidance training and testing

Obtain data from training set or test

set

Label

Training

Input Layer

Output ErrorTesting

01

01

01

Figure 4 Principle framework of RNN based obstacle avoidance planning learning system

optimizer the maximum number of iterations is 20000 Allnetworks are trained at Core i3 CPU 200GHztimes4

The parameters of four networks are shown in Table 1And the MSE of the four networks on test dataset is shownin Figure 5 Table 1 and Figure 5 show that for the samenetwork the offline training time of the networks increasesand the convergence slows down as the number of parametersrises but the best MSE reduces And in the early stage oftraining the network with fewer parameters converges fasterwhile in the later stage the opposite happens Compared withCW-RNN LSTM converges faster and obtains better results

7 Results and Analysis

In this section a statistical experiment and several illustrativeexamples are present to validate the ability of obstacle avoid-ance algorithms The size of the map is set to 800119898 times 1200119898the velocity of UUV is set as a constant 8119896119899 And taking theenvironmental factors into consideration this paper added10 false alarm rate to sonar data in simulation test cases

71 Statistical Experiment In order to verify the obstacleavoidance planning effect of each network under differentenvironmental disturbances the statistical experiment isdesigned in this paper The experiment counted the perfor-mance of different networks on 100 random maps at the false

alarm rate of 5 10 and 15 respectively The experi-mental results are shown in Table 2 Table 2 shows that forthe same network the more parameters the higher planningsuccess rate the lower path cost but more time the algorithmtakes Compared with CW-RNN LSTM has advantages inpath cost success rate and stabilityThe reasons for the failureof each network planning are shown in Table 3 Among themlsquononarrivalrsquo means that UUV stops near the target point notat the target point lsquoDisorientationrsquo means that UUV driftsthrough the map after dodging obstacles rather than movingtoward the target The disorientation occurs when the obsta-cle avoidance planner cannot extract the target informationIt can be seen from Table 3 that the increase of false alarmrate makes the probability of collision and disorientationpath planned by CW-RNN increase but it has little effect onLSTM This indicates that LSTM is superior to CW-RNN inprocessing of long-term memory As shown in Table 3 CW-RNNs get a higher probability than LSTMs both in the termsof collision and lost which indicates that LSTM has betterability to learn and extract detailed features than CW-RNN

72 Simulation Test Case 1 For further analysis of the learn-ing ability of the proposed obstacle avoidance algorithms thissimulation test case tests the obstacle avoidance performanceof the four structures in two maps with the same complexityas maps in training set The tracks yaw and propeller

Complexity 7

Updates

0

05

1

15

2

25

3

35

The M

ean

Squa

red

Erro

r (M

SE)

CW-RNN96LSTM18CW-RNN180LSTM45

0

0002

0004

0006

0008

001

0012

0014

0016

0018

002

100500

0

05

1

15

2

25

3

35

0

1

2

3

2181614121080604020

15141312

218161412108060402

times10minus3

times104

times104

times104

Figure 5 The mean squared error of all structures on test set

Table 1 The performance of all structures in training

Structure CW-RNN96 CW-RNN180 LSTM18 LSTM45Number of units in hidden layer Rb=24 Rb=45 Lb=18 Lb=45Number of parameters in hidden layer 6177(mean) 17253(mean) 5832 19440Mean train time of each iterations 636 1185 644 1829Initial error 0454 0645 2800 4600Best MSE 174times10minus3 126times10minus3 152times10minus3 87times10minus4Convergence generation 15000 18000 9500 13500

Table 2 The performance of all structures in statistical experiment

False alarm rate () CW-RNN96 CW-RNN180 LSTM18 LSTM45

Success rate ()5 88 94 93 9910 86 93 94 9815 80 87 92 98

Mean path cost (m)5 13093 12535 12983 1167610 13228 12973 13033 1183415 13764 13289 13116 11797

Time-consumption (ms)5 5273 18536 4556 1989810 5244 186 84 4475 1992515 5302 18597 4533 19906

Table 3 The reasons for failure of obstacle avoidance planning in statistical experiment

False alarm rate () Reason CW-RNN96 CW-RNN180 LSTM18 LSTM45

5Collision () 2 1 0 0

Disorientation () 5 2 1 0Non-arrival () 5 3 6 1

10Collision () 3 2 0 0

Disorientation () 6 3 0 0Non-arrival () 5 2 6 2

15Collision () 5 4 0 0

Disorientation () 9 5 2 0Non-arrival () 6 4 6 2

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 3: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Complexity 3

and 119863(119881) = 119863119897 + 119863119899119897(119881) denote the system inertia matrixcoriolis-centripetalmatrix and dampingmatrix respectivelySpecifically

119877 (120595) = [[[cos120595 minus sin120595 0sin120595 cos120595 00 0 1]]] (3)

119872119877119861 = [[[119898 0 00 119898 1198981199091198660 119898119909119866 119868119911 ]]]

119872119860 = [[[minus119883 0 00 minus119884 V minus119884 1199030 minus119884 119903 minus119873 119903]]]

(4)

119862119877119861 (119881) = [[[0 0 minus119898 (119909119866119903 + V)0 0 119898119906119898 (119909119866119903 + V) minus119898119906 0 ]]]

119862119860 (119881) = [[[0 0 119884 VV + 119884 1199031199030 0 minus119883119906minus119884 VV minus 119884 119903119903 119883119906 0 ]]]

(5)

119863119897 = [[[119883119906 0 00 119884V minus1198841199030 minus119873V 119873119903 ]]]

119863119899119897 (119881) = [[[119883|119906|119906 |119906| 0 00 119884|V|V |V| minus119884|V|119903 |V|0 minus119873|119903|V |119903| 119873|119903|119903 |119903| ]]]

(6)

A constant current is assumed in this paper which isexpressed as a vector [119906119888 V119888]119879 in body-fixed reference frameAnd then the kinematic and dynamic equations of UUV canbe described as 119903 = (minus11988911119906119903 + 120591119906)11989811

V119903 = (11986011989833 minus 11986111989823)(1198982211989833 minus 119898223)119903 = (11986111989822 minus 11986011989823)(1198982211989833 minus 119898223) = 119906 cos (120595) minus V sin (120595)119910 = 119906 sin (120595) + V cos (120595) = 119903

(7)

where119860 = minus11988922V119903 +(11988923minus11990611990311988823 minus119898119906119888)119903 119861 = (11988932 minus11990611990311988832)V119903 minus11988933119903 + 120591119903 and 11989811 = 119898 minus 11988311989822 = 119898 minus 119884 V

11989823 = minus119884 11990311989833 = 119868119911 minus 119873 11990311988823 = 119898 minus 11988311988832 = 119883 minus 119884 V11988911 = 119883119906119903

+ 119883|119906119903|119906119903

1003816100381610038161003816119906119903100381610038161003816100381611988922 = 119884V119903

+ 119884|V119903|V119903

1003816100381610038161003816V119903100381610038161003816100381611988923 = 11988411990311988932 = 119873V11990311988933 = 119873119903 + 119873|119903|119903 |119903|

(8)

Assume there are twopropellers distribute in the horizon-tal plane of UUV And the force vector 120591 is modeled as

[120591119906120591119903] = [ 1 1119889119901 minus119889119901][119879(119899119901)119879 (119899119904)] (9)

where 119899119901 and 119899119904 denote the speeds of propellers of UUVrespectively 119889119901 is the distance between the propeller andcentral axis of UUV and 119879(119899119901) and 119879(119899119904) denote propellercoefficients

3 Simulation Model of Sonar

The input data of obstacle avoidance planners proposed bythis paper are obtained by multibeam forward looking sonarA 2D simulation model of multibeam FLS based on SeaBat8125 is established in this section SeaBat 8125 is a state-of-the-art high-resolution multibeam echosounder [32] It has a120∘ field of view sector 80 beams with width of 15∘ andthe maximum scan radius of 120119898 To simplify the inputinformation of network define the distance vector 119863119905 =[120 minus 1198890119905 120 minus 1198891119905 sdot sdot sdot 120 minus 11988979119905 ] where 119889119894119905 is the distanceinformation detected by 119894th ray of sonar at time step 119905 andif 119889119894119905 gt 120 then set 119889119894119905 = 120 The precision of sonar isset as 5119898 And taking the uncertainty of sonar detection intoaccount this paper sets the false alarm rate as 10

4 The Structures of ObstacleAvoidance Planners

41 The Structure of CW-RNN The forward propagation ofstandard RNN is as follows119904119905 = 120590 (119882119909119904119909119905 +119882119904119904119904119905minus1 + 119887119904) (10)119900119905 = tanh (119882119904119900119904119905 + 119887119900) (11)

where 119882119909119904 and 119882119904119904 denote the weight matrices from inputlayer and hidden layer to hidden layer respectively119882119904119900 is theweight matrix between hidden layer and output layer 119909119905 119904119905and 119900119905 are the input vector hidden state vector and output

4 Complexity

0x

81 inputs

23 units

2 units

Full connectionFull connection

1x 2x 9x

Input layer

Hidden layer

Middle layer

Output layer41

42

43

44

41

42

43

44

41

42

43

44

Figure 2 The network structure of CW-RNN Bias units are omitted to simplify the visualization network

vector at time step 119905 respectively and 119887119904 and 119887119900 correspond tothe biases of hidden layer and output layer respectively

As shown in Figure 2 the neurons in hidden layer aregrouped into 119892modules of size 119896 in the forward propagationof CW-RNN Each module i is set an explicit clock 119879119894 = 2119894minus1to operate For every module j only if 119879119894 le 119879119895 the recurrentconnections from module 119895 to module 119894 are existed Andthe state of modules i will be updated only if the modulesi satisfy (119905MOD119879119894) = 0 at each time step 119905 The long-term memory is restored by the modules have long periodThe local information obtained from input data is solved bymodules with short period

Therefore 119882119909119904 and 119882119904119904 are partitioned into 119892 blocks-rows corresponding to 119892 modules and 119882119904119904 is a block-uppertriangular matrix

119882119909119904 = (1198821199091199041119882119909119904 g)119882119904119904 = (1198821199041199041119882119904119904 g)

(12)

where each block-row119882119904119904 119894 is partitioned into block-columns0 sdot sdot sdot 0119882119904119904 119894119894 sdot sdot sdot 119882119904119904 119894119892 and119882119904119904 119894 = 119882119904119904 119894 if (119905MOD119879119894) = 00 otherwise

(13)

42 The Structure of LSTM In LSTM the memory blocksare used to replace the hidden units in RNN As shown in

Figure 3 such amemory block consists of a cell an input gatean output gate and a forget gate The current state of hiddenlayer is restored in cell the three import gate units whichcontrol the input output and forget of cell respectively Theforward propagation of LSTM is as follows119894119905 = 120590 (119882119909119894119909119905 +119882ℎ119894ℎ119905minus1 + 119887119894) (14)119891119905 = 120590 (119882119909119891119909119905 +119882ℎ119891ℎ119905minus1 + 119887119891) (15)119888119905 = 119891119905 ∙ 119888119905minus1 + 119894119905 ∙ tanh (119882119909119888119909119905 +119882ℎ119888ℎ119905minus1 + 119887119888) (16)119900119905 = 120590 (119882119909119900119909119905 +119882ℎ119900ℎ119905minus1 + 119887119900) (17)ℎ119905 = 119900119905 ∙ tanh (119888119905) (18)

where 119894119905 119891119905 119888119905 119900119905 and ℎ119905 are outputs of input gate forgetgate cell output gate and memory block at time step trespectively 119909119905 is input vector of memory block at time step tℎ119905minus1 is the output vector ofmemory block at t-1 time step119882119909119894119882119909119891119882119909119888 and119882119909119900 are the weight matrices from input vectorto input gate forget gate cell and output gate respectively119882ℎ119894 119882ℎ119891 119882ℎ119888 and 119882ℎ119900 are the weight matrices from theoutput of memory block at previous time step to input gateforget gate cell and output gate respectively 119887119894 119887119891 119887119888 and119887119900 are biases of input gate forget gate cell and output gaterespectively 120590(sdot) is activation function of gate unit whichis set as logistic sigmoid function in this paper ∙ representselement-wise product

5 Construction of UUV Autonomous ObstacleAvoidance Planning Learning System

The principle framework of UUV autonomous obstacleavoidance planning learning system is shown in Figure 4At first the RNN based obstacle avoidance planners are

Complexity 5

tanh

tanh

+ +

tanh

tanh

tanh

tanh

tminus9x tminus8x

tf

Full connection

23 units

2 units

th

Hidden layer

Middle layer

Input layer

Output layer

Lb Blocks

81 inputs

Full connection

+ + + + + + + + + + + +ibfb cb ob ibfb cb ob ibfb cb ob

ti to

tx

1th minus

tanh fbHyperbolic tangent

Element wise addition concatenation

Element wise multiplication

BiasSigmoid

Forget gate vector Input gate vector Output gate vectortf ti toGate vectors

tx Input vector tminus1h Output of previous block tminus1c Memory from previous block

Operations

Nonlinearities

Inputs

Outputsth ct

ctctminus1

Output of current block Memory of current block

+

Figure 3 The network structure of LSTM

trained offline Then these fully trained planners are usedto do obstacle avoidance planning for UUV in real timeaccording to the environmental information obtained by FLSand some information of UUV obtained by motion and atti-tude sensor The motion controller controls the UUV basedon control commands output by online obstacle avoidanceplanners

The flowchart of RNN based online obstacle avoidanceplanning system is as follows

Step 1 Initialize the start position and target position ofUUVand deploy UUV in the start position

Step 2 Acquire data from sonar motion and attitude sen-sors

Step 3 The online RNN obstacle avoidance planner outputthe desired yaw and velocity of UUV according to sensorsdata

Step 4 UUV adjusts its heading and velocity according to theoutput instruction of onlineRNNobstacle avoidance planner

Step 5 Determine whether the UUV reach the target posi-tion and if so the obstacle avoidance planning algorithm isstopped Else jump to Step 2

6 Data Processing and Network Training

The input sequence 119909119905 of obstacle avoidance planners at timestep t consists of distance vector 119863119905 and the angle betweenUUV and target in North East-fixed reference frame 120593119905 Theoutput vector of obstacle avoidance planners at time step tis constituted by the adjustment of heading and the velocityof UUV The dataset consists of 120000 training samplesand 4810 test samples In the dataset the start point targetpoint and obstacles are generated randomly And MinndashMaxnormalization is used to preprocess input and output data

The only difference between the two types of obstacleavoidance planners is the structure hidden layers which arecomposed by CW-RNN and LSTM respectively This settingis convenient for comparison between the performance ofCW-RNN and LSTM on obstacle avoidance for UUV Thetwo types of obstacle avoidance planners consist of inputlayer hidden layer middle layer and output layer Thereare 81 neurons in input layer 23 neurons in middle layerand 2 neurons in output layer To overcome the problem ofoverfitting dropout with 06 keep probability is used in theprocess of train The loss function is mean squared error(MSE) the weights are updated using the backpropagationthrough timeminibatch gradient descent tominimizeMSEofwhich batch size is set as 10000 And the optimizer is Adam

6 Complexity

Fully Trained RNN

Online ObstacleAvoidance Planner

Motion Controller

Actuator

System Model ofUUV

Motion andAttitude Sensors

Environment

FLS

Online RNN based obstacle avoidance planning of UUV

79

79

79

Hidden Layer

Middle Layer

Output Layer

Modify the Weightof RNN by

Backpropagationthrough Time

-

+

Offline RNN based obstacle avoidance training and testing

Obtain data from training set or test

set

Label

Training

Input Layer

Output ErrorTesting

01

01

01

Figure 4 Principle framework of RNN based obstacle avoidance planning learning system

optimizer the maximum number of iterations is 20000 Allnetworks are trained at Core i3 CPU 200GHztimes4

The parameters of four networks are shown in Table 1And the MSE of the four networks on test dataset is shownin Figure 5 Table 1 and Figure 5 show that for the samenetwork the offline training time of the networks increasesand the convergence slows down as the number of parametersrises but the best MSE reduces And in the early stage oftraining the network with fewer parameters converges fasterwhile in the later stage the opposite happens Compared withCW-RNN LSTM converges faster and obtains better results

7 Results and Analysis

In this section a statistical experiment and several illustrativeexamples are present to validate the ability of obstacle avoid-ance algorithms The size of the map is set to 800119898 times 1200119898the velocity of UUV is set as a constant 8119896119899 And taking theenvironmental factors into consideration this paper added10 false alarm rate to sonar data in simulation test cases

71 Statistical Experiment In order to verify the obstacleavoidance planning effect of each network under differentenvironmental disturbances the statistical experiment isdesigned in this paper The experiment counted the perfor-mance of different networks on 100 random maps at the false

alarm rate of 5 10 and 15 respectively The experi-mental results are shown in Table 2 Table 2 shows that forthe same network the more parameters the higher planningsuccess rate the lower path cost but more time the algorithmtakes Compared with CW-RNN LSTM has advantages inpath cost success rate and stabilityThe reasons for the failureof each network planning are shown in Table 3 Among themlsquononarrivalrsquo means that UUV stops near the target point notat the target point lsquoDisorientationrsquo means that UUV driftsthrough the map after dodging obstacles rather than movingtoward the target The disorientation occurs when the obsta-cle avoidance planner cannot extract the target informationIt can be seen from Table 3 that the increase of false alarmrate makes the probability of collision and disorientationpath planned by CW-RNN increase but it has little effect onLSTM This indicates that LSTM is superior to CW-RNN inprocessing of long-term memory As shown in Table 3 CW-RNNs get a higher probability than LSTMs both in the termsof collision and lost which indicates that LSTM has betterability to learn and extract detailed features than CW-RNN

72 Simulation Test Case 1 For further analysis of the learn-ing ability of the proposed obstacle avoidance algorithms thissimulation test case tests the obstacle avoidance performanceof the four structures in two maps with the same complexityas maps in training set The tracks yaw and propeller

Complexity 7

Updates

0

05

1

15

2

25

3

35

The M

ean

Squa

red

Erro

r (M

SE)

CW-RNN96LSTM18CW-RNN180LSTM45

0

0002

0004

0006

0008

001

0012

0014

0016

0018

002

100500

0

05

1

15

2

25

3

35

0

1

2

3

2181614121080604020

15141312

218161412108060402

times10minus3

times104

times104

times104

Figure 5 The mean squared error of all structures on test set

Table 1 The performance of all structures in training

Structure CW-RNN96 CW-RNN180 LSTM18 LSTM45Number of units in hidden layer Rb=24 Rb=45 Lb=18 Lb=45Number of parameters in hidden layer 6177(mean) 17253(mean) 5832 19440Mean train time of each iterations 636 1185 644 1829Initial error 0454 0645 2800 4600Best MSE 174times10minus3 126times10minus3 152times10minus3 87times10minus4Convergence generation 15000 18000 9500 13500

Table 2 The performance of all structures in statistical experiment

False alarm rate () CW-RNN96 CW-RNN180 LSTM18 LSTM45

Success rate ()5 88 94 93 9910 86 93 94 9815 80 87 92 98

Mean path cost (m)5 13093 12535 12983 1167610 13228 12973 13033 1183415 13764 13289 13116 11797

Time-consumption (ms)5 5273 18536 4556 1989810 5244 186 84 4475 1992515 5302 18597 4533 19906

Table 3 The reasons for failure of obstacle avoidance planning in statistical experiment

False alarm rate () Reason CW-RNN96 CW-RNN180 LSTM18 LSTM45

5Collision () 2 1 0 0

Disorientation () 5 2 1 0Non-arrival () 5 3 6 1

10Collision () 3 2 0 0

Disorientation () 6 3 0 0Non-arrival () 5 2 6 2

15Collision () 5 4 0 0

Disorientation () 9 5 2 0Non-arrival () 6 4 6 2

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 4: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

4 Complexity

0x

81 inputs

23 units

2 units

Full connectionFull connection

1x 2x 9x

Input layer

Hidden layer

Middle layer

Output layer41

42

43

44

41

42

43

44

41

42

43

44

Figure 2 The network structure of CW-RNN Bias units are omitted to simplify the visualization network

vector at time step 119905 respectively and 119887119904 and 119887119900 correspond tothe biases of hidden layer and output layer respectively

As shown in Figure 2 the neurons in hidden layer aregrouped into 119892modules of size 119896 in the forward propagationof CW-RNN Each module i is set an explicit clock 119879119894 = 2119894minus1to operate For every module j only if 119879119894 le 119879119895 the recurrentconnections from module 119895 to module 119894 are existed Andthe state of modules i will be updated only if the modulesi satisfy (119905MOD119879119894) = 0 at each time step 119905 The long-term memory is restored by the modules have long periodThe local information obtained from input data is solved bymodules with short period

Therefore 119882119909119904 and 119882119904119904 are partitioned into 119892 blocks-rows corresponding to 119892 modules and 119882119904119904 is a block-uppertriangular matrix

119882119909119904 = (1198821199091199041119882119909119904 g)119882119904119904 = (1198821199041199041119882119904119904 g)

(12)

where each block-row119882119904119904 119894 is partitioned into block-columns0 sdot sdot sdot 0119882119904119904 119894119894 sdot sdot sdot 119882119904119904 119894119892 and119882119904119904 119894 = 119882119904119904 119894 if (119905MOD119879119894) = 00 otherwise

(13)

42 The Structure of LSTM In LSTM the memory blocksare used to replace the hidden units in RNN As shown in

Figure 3 such amemory block consists of a cell an input gatean output gate and a forget gate The current state of hiddenlayer is restored in cell the three import gate units whichcontrol the input output and forget of cell respectively Theforward propagation of LSTM is as follows119894119905 = 120590 (119882119909119894119909119905 +119882ℎ119894ℎ119905minus1 + 119887119894) (14)119891119905 = 120590 (119882119909119891119909119905 +119882ℎ119891ℎ119905minus1 + 119887119891) (15)119888119905 = 119891119905 ∙ 119888119905minus1 + 119894119905 ∙ tanh (119882119909119888119909119905 +119882ℎ119888ℎ119905minus1 + 119887119888) (16)119900119905 = 120590 (119882119909119900119909119905 +119882ℎ119900ℎ119905minus1 + 119887119900) (17)ℎ119905 = 119900119905 ∙ tanh (119888119905) (18)

where 119894119905 119891119905 119888119905 119900119905 and ℎ119905 are outputs of input gate forgetgate cell output gate and memory block at time step trespectively 119909119905 is input vector of memory block at time step tℎ119905minus1 is the output vector ofmemory block at t-1 time step119882119909119894119882119909119891119882119909119888 and119882119909119900 are the weight matrices from input vectorto input gate forget gate cell and output gate respectively119882ℎ119894 119882ℎ119891 119882ℎ119888 and 119882ℎ119900 are the weight matrices from theoutput of memory block at previous time step to input gateforget gate cell and output gate respectively 119887119894 119887119891 119887119888 and119887119900 are biases of input gate forget gate cell and output gaterespectively 120590(sdot) is activation function of gate unit whichis set as logistic sigmoid function in this paper ∙ representselement-wise product

5 Construction of UUV Autonomous ObstacleAvoidance Planning Learning System

The principle framework of UUV autonomous obstacleavoidance planning learning system is shown in Figure 4At first the RNN based obstacle avoidance planners are

Complexity 5

tanh

tanh

+ +

tanh

tanh

tanh

tanh

tminus9x tminus8x

tf

Full connection

23 units

2 units

th

Hidden layer

Middle layer

Input layer

Output layer

Lb Blocks

81 inputs

Full connection

+ + + + + + + + + + + +ibfb cb ob ibfb cb ob ibfb cb ob

ti to

tx

1th minus

tanh fbHyperbolic tangent

Element wise addition concatenation

Element wise multiplication

BiasSigmoid

Forget gate vector Input gate vector Output gate vectortf ti toGate vectors

tx Input vector tminus1h Output of previous block tminus1c Memory from previous block

Operations

Nonlinearities

Inputs

Outputsth ct

ctctminus1

Output of current block Memory of current block

+

Figure 3 The network structure of LSTM

trained offline Then these fully trained planners are usedto do obstacle avoidance planning for UUV in real timeaccording to the environmental information obtained by FLSand some information of UUV obtained by motion and atti-tude sensor The motion controller controls the UUV basedon control commands output by online obstacle avoidanceplanners

The flowchart of RNN based online obstacle avoidanceplanning system is as follows

Step 1 Initialize the start position and target position ofUUVand deploy UUV in the start position

Step 2 Acquire data from sonar motion and attitude sen-sors

Step 3 The online RNN obstacle avoidance planner outputthe desired yaw and velocity of UUV according to sensorsdata

Step 4 UUV adjusts its heading and velocity according to theoutput instruction of onlineRNNobstacle avoidance planner

Step 5 Determine whether the UUV reach the target posi-tion and if so the obstacle avoidance planning algorithm isstopped Else jump to Step 2

6 Data Processing and Network Training

The input sequence 119909119905 of obstacle avoidance planners at timestep t consists of distance vector 119863119905 and the angle betweenUUV and target in North East-fixed reference frame 120593119905 Theoutput vector of obstacle avoidance planners at time step tis constituted by the adjustment of heading and the velocityof UUV The dataset consists of 120000 training samplesand 4810 test samples In the dataset the start point targetpoint and obstacles are generated randomly And MinndashMaxnormalization is used to preprocess input and output data

The only difference between the two types of obstacleavoidance planners is the structure hidden layers which arecomposed by CW-RNN and LSTM respectively This settingis convenient for comparison between the performance ofCW-RNN and LSTM on obstacle avoidance for UUV Thetwo types of obstacle avoidance planners consist of inputlayer hidden layer middle layer and output layer Thereare 81 neurons in input layer 23 neurons in middle layerand 2 neurons in output layer To overcome the problem ofoverfitting dropout with 06 keep probability is used in theprocess of train The loss function is mean squared error(MSE) the weights are updated using the backpropagationthrough timeminibatch gradient descent tominimizeMSEofwhich batch size is set as 10000 And the optimizer is Adam

6 Complexity

Fully Trained RNN

Online ObstacleAvoidance Planner

Motion Controller

Actuator

System Model ofUUV

Motion andAttitude Sensors

Environment

FLS

Online RNN based obstacle avoidance planning of UUV

79

79

79

Hidden Layer

Middle Layer

Output Layer

Modify the Weightof RNN by

Backpropagationthrough Time

-

+

Offline RNN based obstacle avoidance training and testing

Obtain data from training set or test

set

Label

Training

Input Layer

Output ErrorTesting

01

01

01

Figure 4 Principle framework of RNN based obstacle avoidance planning learning system

optimizer the maximum number of iterations is 20000 Allnetworks are trained at Core i3 CPU 200GHztimes4

The parameters of four networks are shown in Table 1And the MSE of the four networks on test dataset is shownin Figure 5 Table 1 and Figure 5 show that for the samenetwork the offline training time of the networks increasesand the convergence slows down as the number of parametersrises but the best MSE reduces And in the early stage oftraining the network with fewer parameters converges fasterwhile in the later stage the opposite happens Compared withCW-RNN LSTM converges faster and obtains better results

7 Results and Analysis

In this section a statistical experiment and several illustrativeexamples are present to validate the ability of obstacle avoid-ance algorithms The size of the map is set to 800119898 times 1200119898the velocity of UUV is set as a constant 8119896119899 And taking theenvironmental factors into consideration this paper added10 false alarm rate to sonar data in simulation test cases

71 Statistical Experiment In order to verify the obstacleavoidance planning effect of each network under differentenvironmental disturbances the statistical experiment isdesigned in this paper The experiment counted the perfor-mance of different networks on 100 random maps at the false

alarm rate of 5 10 and 15 respectively The experi-mental results are shown in Table 2 Table 2 shows that forthe same network the more parameters the higher planningsuccess rate the lower path cost but more time the algorithmtakes Compared with CW-RNN LSTM has advantages inpath cost success rate and stabilityThe reasons for the failureof each network planning are shown in Table 3 Among themlsquononarrivalrsquo means that UUV stops near the target point notat the target point lsquoDisorientationrsquo means that UUV driftsthrough the map after dodging obstacles rather than movingtoward the target The disorientation occurs when the obsta-cle avoidance planner cannot extract the target informationIt can be seen from Table 3 that the increase of false alarmrate makes the probability of collision and disorientationpath planned by CW-RNN increase but it has little effect onLSTM This indicates that LSTM is superior to CW-RNN inprocessing of long-term memory As shown in Table 3 CW-RNNs get a higher probability than LSTMs both in the termsof collision and lost which indicates that LSTM has betterability to learn and extract detailed features than CW-RNN

72 Simulation Test Case 1 For further analysis of the learn-ing ability of the proposed obstacle avoidance algorithms thissimulation test case tests the obstacle avoidance performanceof the four structures in two maps with the same complexityas maps in training set The tracks yaw and propeller

Complexity 7

Updates

0

05

1

15

2

25

3

35

The M

ean

Squa

red

Erro

r (M

SE)

CW-RNN96LSTM18CW-RNN180LSTM45

0

0002

0004

0006

0008

001

0012

0014

0016

0018

002

100500

0

05

1

15

2

25

3

35

0

1

2

3

2181614121080604020

15141312

218161412108060402

times10minus3

times104

times104

times104

Figure 5 The mean squared error of all structures on test set

Table 1 The performance of all structures in training

Structure CW-RNN96 CW-RNN180 LSTM18 LSTM45Number of units in hidden layer Rb=24 Rb=45 Lb=18 Lb=45Number of parameters in hidden layer 6177(mean) 17253(mean) 5832 19440Mean train time of each iterations 636 1185 644 1829Initial error 0454 0645 2800 4600Best MSE 174times10minus3 126times10minus3 152times10minus3 87times10minus4Convergence generation 15000 18000 9500 13500

Table 2 The performance of all structures in statistical experiment

False alarm rate () CW-RNN96 CW-RNN180 LSTM18 LSTM45

Success rate ()5 88 94 93 9910 86 93 94 9815 80 87 92 98

Mean path cost (m)5 13093 12535 12983 1167610 13228 12973 13033 1183415 13764 13289 13116 11797

Time-consumption (ms)5 5273 18536 4556 1989810 5244 186 84 4475 1992515 5302 18597 4533 19906

Table 3 The reasons for failure of obstacle avoidance planning in statistical experiment

False alarm rate () Reason CW-RNN96 CW-RNN180 LSTM18 LSTM45

5Collision () 2 1 0 0

Disorientation () 5 2 1 0Non-arrival () 5 3 6 1

10Collision () 3 2 0 0

Disorientation () 6 3 0 0Non-arrival () 5 2 6 2

15Collision () 5 4 0 0

Disorientation () 9 5 2 0Non-arrival () 6 4 6 2

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 5: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Complexity 5

tanh

tanh

+ +

tanh

tanh

tanh

tanh

tminus9x tminus8x

tf

Full connection

23 units

2 units

th

Hidden layer

Middle layer

Input layer

Output layer

Lb Blocks

81 inputs

Full connection

+ + + + + + + + + + + +ibfb cb ob ibfb cb ob ibfb cb ob

ti to

tx

1th minus

tanh fbHyperbolic tangent

Element wise addition concatenation

Element wise multiplication

BiasSigmoid

Forget gate vector Input gate vector Output gate vectortf ti toGate vectors

tx Input vector tminus1h Output of previous block tminus1c Memory from previous block

Operations

Nonlinearities

Inputs

Outputsth ct

ctctminus1

Output of current block Memory of current block

+

Figure 3 The network structure of LSTM

trained offline Then these fully trained planners are usedto do obstacle avoidance planning for UUV in real timeaccording to the environmental information obtained by FLSand some information of UUV obtained by motion and atti-tude sensor The motion controller controls the UUV basedon control commands output by online obstacle avoidanceplanners

The flowchart of RNN based online obstacle avoidanceplanning system is as follows

Step 1 Initialize the start position and target position ofUUVand deploy UUV in the start position

Step 2 Acquire data from sonar motion and attitude sen-sors

Step 3 The online RNN obstacle avoidance planner outputthe desired yaw and velocity of UUV according to sensorsdata

Step 4 UUV adjusts its heading and velocity according to theoutput instruction of onlineRNNobstacle avoidance planner

Step 5 Determine whether the UUV reach the target posi-tion and if so the obstacle avoidance planning algorithm isstopped Else jump to Step 2

6 Data Processing and Network Training

The input sequence 119909119905 of obstacle avoidance planners at timestep t consists of distance vector 119863119905 and the angle betweenUUV and target in North East-fixed reference frame 120593119905 Theoutput vector of obstacle avoidance planners at time step tis constituted by the adjustment of heading and the velocityof UUV The dataset consists of 120000 training samplesand 4810 test samples In the dataset the start point targetpoint and obstacles are generated randomly And MinndashMaxnormalization is used to preprocess input and output data

The only difference between the two types of obstacleavoidance planners is the structure hidden layers which arecomposed by CW-RNN and LSTM respectively This settingis convenient for comparison between the performance ofCW-RNN and LSTM on obstacle avoidance for UUV Thetwo types of obstacle avoidance planners consist of inputlayer hidden layer middle layer and output layer Thereare 81 neurons in input layer 23 neurons in middle layerand 2 neurons in output layer To overcome the problem ofoverfitting dropout with 06 keep probability is used in theprocess of train The loss function is mean squared error(MSE) the weights are updated using the backpropagationthrough timeminibatch gradient descent tominimizeMSEofwhich batch size is set as 10000 And the optimizer is Adam

6 Complexity

Fully Trained RNN

Online ObstacleAvoidance Planner

Motion Controller

Actuator

System Model ofUUV

Motion andAttitude Sensors

Environment

FLS

Online RNN based obstacle avoidance planning of UUV

79

79

79

Hidden Layer

Middle Layer

Output Layer

Modify the Weightof RNN by

Backpropagationthrough Time

-

+

Offline RNN based obstacle avoidance training and testing

Obtain data from training set or test

set

Label

Training

Input Layer

Output ErrorTesting

01

01

01

Figure 4 Principle framework of RNN based obstacle avoidance planning learning system

optimizer the maximum number of iterations is 20000 Allnetworks are trained at Core i3 CPU 200GHztimes4

The parameters of four networks are shown in Table 1And the MSE of the four networks on test dataset is shownin Figure 5 Table 1 and Figure 5 show that for the samenetwork the offline training time of the networks increasesand the convergence slows down as the number of parametersrises but the best MSE reduces And in the early stage oftraining the network with fewer parameters converges fasterwhile in the later stage the opposite happens Compared withCW-RNN LSTM converges faster and obtains better results

7 Results and Analysis

In this section a statistical experiment and several illustrativeexamples are present to validate the ability of obstacle avoid-ance algorithms The size of the map is set to 800119898 times 1200119898the velocity of UUV is set as a constant 8119896119899 And taking theenvironmental factors into consideration this paper added10 false alarm rate to sonar data in simulation test cases

71 Statistical Experiment In order to verify the obstacleavoidance planning effect of each network under differentenvironmental disturbances the statistical experiment isdesigned in this paper The experiment counted the perfor-mance of different networks on 100 random maps at the false

alarm rate of 5 10 and 15 respectively The experi-mental results are shown in Table 2 Table 2 shows that forthe same network the more parameters the higher planningsuccess rate the lower path cost but more time the algorithmtakes Compared with CW-RNN LSTM has advantages inpath cost success rate and stabilityThe reasons for the failureof each network planning are shown in Table 3 Among themlsquononarrivalrsquo means that UUV stops near the target point notat the target point lsquoDisorientationrsquo means that UUV driftsthrough the map after dodging obstacles rather than movingtoward the target The disorientation occurs when the obsta-cle avoidance planner cannot extract the target informationIt can be seen from Table 3 that the increase of false alarmrate makes the probability of collision and disorientationpath planned by CW-RNN increase but it has little effect onLSTM This indicates that LSTM is superior to CW-RNN inprocessing of long-term memory As shown in Table 3 CW-RNNs get a higher probability than LSTMs both in the termsof collision and lost which indicates that LSTM has betterability to learn and extract detailed features than CW-RNN

72 Simulation Test Case 1 For further analysis of the learn-ing ability of the proposed obstacle avoidance algorithms thissimulation test case tests the obstacle avoidance performanceof the four structures in two maps with the same complexityas maps in training set The tracks yaw and propeller

Complexity 7

Updates

0

05

1

15

2

25

3

35

The M

ean

Squa

red

Erro

r (M

SE)

CW-RNN96LSTM18CW-RNN180LSTM45

0

0002

0004

0006

0008

001

0012

0014

0016

0018

002

100500

0

05

1

15

2

25

3

35

0

1

2

3

2181614121080604020

15141312

218161412108060402

times10minus3

times104

times104

times104

Figure 5 The mean squared error of all structures on test set

Table 1 The performance of all structures in training

Structure CW-RNN96 CW-RNN180 LSTM18 LSTM45Number of units in hidden layer Rb=24 Rb=45 Lb=18 Lb=45Number of parameters in hidden layer 6177(mean) 17253(mean) 5832 19440Mean train time of each iterations 636 1185 644 1829Initial error 0454 0645 2800 4600Best MSE 174times10minus3 126times10minus3 152times10minus3 87times10minus4Convergence generation 15000 18000 9500 13500

Table 2 The performance of all structures in statistical experiment

False alarm rate () CW-RNN96 CW-RNN180 LSTM18 LSTM45

Success rate ()5 88 94 93 9910 86 93 94 9815 80 87 92 98

Mean path cost (m)5 13093 12535 12983 1167610 13228 12973 13033 1183415 13764 13289 13116 11797

Time-consumption (ms)5 5273 18536 4556 1989810 5244 186 84 4475 1992515 5302 18597 4533 19906

Table 3 The reasons for failure of obstacle avoidance planning in statistical experiment

False alarm rate () Reason CW-RNN96 CW-RNN180 LSTM18 LSTM45

5Collision () 2 1 0 0

Disorientation () 5 2 1 0Non-arrival () 5 3 6 1

10Collision () 3 2 0 0

Disorientation () 6 3 0 0Non-arrival () 5 2 6 2

15Collision () 5 4 0 0

Disorientation () 9 5 2 0Non-arrival () 6 4 6 2

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 6: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

6 Complexity

Fully Trained RNN

Online ObstacleAvoidance Planner

Motion Controller

Actuator

System Model ofUUV

Motion andAttitude Sensors

Environment

FLS

Online RNN based obstacle avoidance planning of UUV

79

79

79

Hidden Layer

Middle Layer

Output Layer

Modify the Weightof RNN by

Backpropagationthrough Time

-

+

Offline RNN based obstacle avoidance training and testing

Obtain data from training set or test

set

Label

Training

Input Layer

Output ErrorTesting

01

01

01

Figure 4 Principle framework of RNN based obstacle avoidance planning learning system

optimizer the maximum number of iterations is 20000 Allnetworks are trained at Core i3 CPU 200GHztimes4

The parameters of four networks are shown in Table 1And the MSE of the four networks on test dataset is shownin Figure 5 Table 1 and Figure 5 show that for the samenetwork the offline training time of the networks increasesand the convergence slows down as the number of parametersrises but the best MSE reduces And in the early stage oftraining the network with fewer parameters converges fasterwhile in the later stage the opposite happens Compared withCW-RNN LSTM converges faster and obtains better results

7 Results and Analysis

In this section a statistical experiment and several illustrativeexamples are present to validate the ability of obstacle avoid-ance algorithms The size of the map is set to 800119898 times 1200119898the velocity of UUV is set as a constant 8119896119899 And taking theenvironmental factors into consideration this paper added10 false alarm rate to sonar data in simulation test cases

71 Statistical Experiment In order to verify the obstacleavoidance planning effect of each network under differentenvironmental disturbances the statistical experiment isdesigned in this paper The experiment counted the perfor-mance of different networks on 100 random maps at the false

alarm rate of 5 10 and 15 respectively The experi-mental results are shown in Table 2 Table 2 shows that forthe same network the more parameters the higher planningsuccess rate the lower path cost but more time the algorithmtakes Compared with CW-RNN LSTM has advantages inpath cost success rate and stabilityThe reasons for the failureof each network planning are shown in Table 3 Among themlsquononarrivalrsquo means that UUV stops near the target point notat the target point lsquoDisorientationrsquo means that UUV driftsthrough the map after dodging obstacles rather than movingtoward the target The disorientation occurs when the obsta-cle avoidance planner cannot extract the target informationIt can be seen from Table 3 that the increase of false alarmrate makes the probability of collision and disorientationpath planned by CW-RNN increase but it has little effect onLSTM This indicates that LSTM is superior to CW-RNN inprocessing of long-term memory As shown in Table 3 CW-RNNs get a higher probability than LSTMs both in the termsof collision and lost which indicates that LSTM has betterability to learn and extract detailed features than CW-RNN

72 Simulation Test Case 1 For further analysis of the learn-ing ability of the proposed obstacle avoidance algorithms thissimulation test case tests the obstacle avoidance performanceof the four structures in two maps with the same complexityas maps in training set The tracks yaw and propeller

Complexity 7

Updates

0

05

1

15

2

25

3

35

The M

ean

Squa

red

Erro

r (M

SE)

CW-RNN96LSTM18CW-RNN180LSTM45

0

0002

0004

0006

0008

001

0012

0014

0016

0018

002

100500

0

05

1

15

2

25

3

35

0

1

2

3

2181614121080604020

15141312

218161412108060402

times10minus3

times104

times104

times104

Figure 5 The mean squared error of all structures on test set

Table 1 The performance of all structures in training

Structure CW-RNN96 CW-RNN180 LSTM18 LSTM45Number of units in hidden layer Rb=24 Rb=45 Lb=18 Lb=45Number of parameters in hidden layer 6177(mean) 17253(mean) 5832 19440Mean train time of each iterations 636 1185 644 1829Initial error 0454 0645 2800 4600Best MSE 174times10minus3 126times10minus3 152times10minus3 87times10minus4Convergence generation 15000 18000 9500 13500

Table 2 The performance of all structures in statistical experiment

False alarm rate () CW-RNN96 CW-RNN180 LSTM18 LSTM45

Success rate ()5 88 94 93 9910 86 93 94 9815 80 87 92 98

Mean path cost (m)5 13093 12535 12983 1167610 13228 12973 13033 1183415 13764 13289 13116 11797

Time-consumption (ms)5 5273 18536 4556 1989810 5244 186 84 4475 1992515 5302 18597 4533 19906

Table 3 The reasons for failure of obstacle avoidance planning in statistical experiment

False alarm rate () Reason CW-RNN96 CW-RNN180 LSTM18 LSTM45

5Collision () 2 1 0 0

Disorientation () 5 2 1 0Non-arrival () 5 3 6 1

10Collision () 3 2 0 0

Disorientation () 6 3 0 0Non-arrival () 5 2 6 2

15Collision () 5 4 0 0

Disorientation () 9 5 2 0Non-arrival () 6 4 6 2

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 7: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Complexity 7

Updates

0

05

1

15

2

25

3

35

The M

ean

Squa

red

Erro

r (M

SE)

CW-RNN96LSTM18CW-RNN180LSTM45

0

0002

0004

0006

0008

001

0012

0014

0016

0018

002

100500

0

05

1

15

2

25

3

35

0

1

2

3

2181614121080604020

15141312

218161412108060402

times10minus3

times104

times104

times104

Figure 5 The mean squared error of all structures on test set

Table 1 The performance of all structures in training

Structure CW-RNN96 CW-RNN180 LSTM18 LSTM45Number of units in hidden layer Rb=24 Rb=45 Lb=18 Lb=45Number of parameters in hidden layer 6177(mean) 17253(mean) 5832 19440Mean train time of each iterations 636 1185 644 1829Initial error 0454 0645 2800 4600Best MSE 174times10minus3 126times10minus3 152times10minus3 87times10minus4Convergence generation 15000 18000 9500 13500

Table 2 The performance of all structures in statistical experiment

False alarm rate () CW-RNN96 CW-RNN180 LSTM18 LSTM45

Success rate ()5 88 94 93 9910 86 93 94 9815 80 87 92 98

Mean path cost (m)5 13093 12535 12983 1167610 13228 12973 13033 1183415 13764 13289 13116 11797

Time-consumption (ms)5 5273 18536 4556 1989810 5244 186 84 4475 1992515 5302 18597 4533 19906

Table 3 The reasons for failure of obstacle avoidance planning in statistical experiment

False alarm rate () Reason CW-RNN96 CW-RNN180 LSTM18 LSTM45

5Collision () 2 1 0 0

Disorientation () 5 2 1 0Non-arrival () 5 3 6 1

10Collision () 3 2 0 0

Disorientation () 6 3 0 0Non-arrival () 5 2 6 2

15Collision () 5 4 0 0

Disorientation () 9 5 2 0Non-arrival () 6 4 6 2

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 8: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

8 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180S

T

(a)

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

(b)

Figure 6 Online planning results of four obstacle avoidance algorithms of UUV in simulation test cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(a)

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

(b)

Figure 7 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test cases 1(a) and (b)

speed of UUV are shown in Figures 6ndash9 respectively Asshown in the simulation results in the maps with the samecomplexity as the training environment the four proposedobstacle avoidance algorithms can quickly generate the pathwithout collision with obstacles and the planning resultssatisfy the UUV kinematics In this simulation test case allthe four obstacle avoidance algorithms show strong learningability And compared with other structures there are feweroscillations in the path planned by LSTM45

73 Simulation Test Case 2 Assume that the start point is(156 39) and the target position of UUV is (630 1070)Figure 10 shows the tracks of UUV planned by the fourobstacle avoidance algorithms As the simulation results showthat all methods are effectively controlling UUV to avoid

the obstacles and reach the target position And all RNNbased obstacle avoidance planners have learned the abilitythat adjusts UUVrsquos heading to navigate toward the targetposition quickly after avoiding obstacles As Figures 11 and12 show the yaw and propeller speed of UUV planned byRNN based obstacle avoidance planners are conformed tothe actual practice In the map with discrete distribution ofobstacles even though the environment complexity of themap is improved the four obstacle avoidance algorithms cangenerate noncollision paths The simulation results indicatethat all the four algorithms have a degree of generalizationability and adaptive capability

74 Simulation Test Case 3 For further analysis of the abilityof the proposed obstacle avoidance planners amuch complex

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 9: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Complexity 9

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 8 Curves of left propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 9 Curves of right propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms in simulationtest cases 1(a) and (b)

map than those maps is included in train and test dataset isadopted in this simulation test caseThe tracks yaw and pro-peller speed of UUV are shown in Figures 13 14 and 15respectively As shown in the simulation results in the com-plex environment with continuous distribution of obstaclesUUV is planned by CW-RNN96 to avoid obstacles with theroam mode This is because the CW-RNN96 cannot extractthe target point information which is more detailed thanthe obstacle information And LSTM18 CW-RNN180 andLSTM45 are still capable for obstacle avoidance which exhi-bit satisfactory abilities of learning and generalization in this

problem Although LSTM18 has fewer parameters than CW-RNN96 it has a better performance in complex environment

75 Simulation Test Case 4 In order to test the generalizationand exploration ability of various methods this simulationtest case adopts a maze map of continuous obstacles shownin Figure 16 In the training set the target is all set on theeast side of the map and UUV always moves on the westside of the target point whichmeans the angle between UUVand target in North East-fixed reference frame is 180∘ lt120593119905 lt 360∘ In this map the target point is in the middle of

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 10: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

10 Complexity

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 10 Online planning results of four obstacle avoidance algorithms of UUV in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

minus10

minus5

0

5

10

15

CW-RNN96LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 11 Curves of yaw adjustment output by the four obstacle avoidance algorithms in simulation test case 2

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Left

Prop

elle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

CW-RNN96LSTM18CW-RNN180LSTM45

(b)

Figure 12 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 11: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Complexity 11

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 13 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 3

0 50 100 150 200 250 300 350 400Time (s)

minus10

minus5

0

5

10

15

LSTM18CW-RNN180LSTM45

9Q

>DOMNGH

N(∘)

Figure 14 Curves of yaw adjustment output by the several obstacle avoidance algorithms in simulation test case 3

the map and UUV must move around the target and reachthe target which means 0∘ lt 120593119905 lt 360∘ The simulationresults are shown in Figures 16ndash18 It can be seen from thesimulation results that all the methods performed well inthe early stage of planning (180∘ lt 120593119905 lt 360∘) As UUVmoves the range of 120593119905 changes to (0∘ 180∘) collision existsin the path CW-RNN96 planning disorientations exist in thepath of CW-RNN180 planning disorientation and nonarrivalexists in the path of LSTM18 planning Only LSTM45 has thecapability of path generation in this maze environment andshows excellent performance The simulation results show astrong generalization and exploration ability of LSTM45

76 Simulation Test Case 5 The results of statistical experi-ment and simulation test 2-4 show that compared with thethree methods LSTM45 is the best method for UUV obstacleavoidance planning In this test case a series of simulations indynamic environments are used in to further test LSTM45rsquos

ability of obstacle avoidance Figures 19 and 20 show thesimulation results of LSTM45 in several dynamic environ-ments that the obstacle with different motions And Figures21 and 22 show the simulation results of LSTM45 in complexenvironment with many static andmoving obstacles Assumethat the dynamic obstacles always travel in straight lines withconstant velocity The directions of motion of obstacles areindicated by the arrow in obstaclesThe velocities of obstaclesare set as 8kn in Figure 19(a) and 4kn in other cases Thesimulation results show that LSTM45 drives UUV navigatestoward the target until a collision threat is found After obsta-cle avoidanceUUV is planned tomove toward the target con-tinue Although the training set does not contain anydynamic obstacles LSTM45 still explore the strategy to avoiddynamic obstacles

It can be seen from the above experiments that (1) whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training time

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 12: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

12 Complexity

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500Le

ft Pr

opel

ler S

peed

(rpm

)

LSTM18CW-RNN180LSTM45

(a)

0 50 100 150 200 250 300 350 400Time (s)

300

320

340

360

380

400

420

440

460

480

500

Righ

t Pro

pelle

r Spe

ed (r

pm)

LSTM18CW-RNN180LSTM45

(b)

Figure 15 Curves of propeller speed controlling feedback corresponding with the different obstacle avoidance algorithms

CW-RNN96LSTM18

LSTM45CW-RNN180

S

T

Figure 16 Online planning results of several obstacle avoidance algorithms of UUV in simulation test case 4

0 100 200 300 400 500 600 700 800Time (s)

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

Figure 17 Curves of yaw adjustment output by LSTM45 in simulation test case 4

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 13: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Complexity 13

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450Le

ft Pr

opel

ler S

peed

(rpm

)

(a)

0 100 200 300 400 500 600 700 800Time (s)

300

350

400

450

Righ

t Pro

pelle

r Spe

ed (r

pm)

(b)

Figure 18 Curves of propeller speed controlling feedback corresponding with LSTM45

S

S

S

T

T

T

(a) Obstacle moves relative to UUV

S

S

S

T

T

T

(b) Obstacle moves in the same direction with UUV

S

S

S

T

T

T

(c) Obstacle moves crossing with UUV

Figure 19 Online planning results of LSTM45 in several dynamic environments

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 14: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

14 Complexity

0 20 40 60 80 100Time (s)

0

100

200

300

400

500

600Th

e dist

ance

bet

wee

n U

UV

and

obst

acle

(m)

(a) The distance between UUV and obstacle in Figure 19(a)

1200

20

40

60

80

100

120

140

160

180

200

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(b) The distance between UUV and obstacle in Figure 19(b)

0

50

100

150

200

250

300

The d

istan

ce b

etw

een

UU

V an

d ob

stac

le (m

)

0 20 40 60 80 100Time (s)

(c) The distance between UUV and obstacle in Figure 19(c)

Figure 20 The minimum distance between UUV and obstacle corresponding to Figure 19

the best loss and time-consumption but LSTM shows betterpath cost success rate generalization ability and robustnesscompared with CW-RNN (2) It is worth noting that LSTM18and CW-RNN180 have considerable learning ability and gen-eralization ability while the number of parameters trainingtime and time-consumption of LSTM18 are about 13 frac12 and14 of CW-RNN180 respectively And LSTM18 has significantadvantages in learning capability generalization ability androbustness compared with CW-RNN96 with similar numberof parameters (3) For all the four algorithms LSTM45 hasthe best learning ability generalization and exploration abilityand robustness It is able to solve the problem of obstacleavoidance for UUV in a dynamic or even complex dynamicenvironment after being trained in simple and static environ-ments

8 Conclusion

Inspired by state-of-the-art performance of CW-RNN andLSTM on many sequence prediction tasks this paperpresented two types of obstacle avoidance algorithms based

on CW-RNN and LSTM respectively and compared the per-formance of CW-RNNand LSTMon obstacle avoidance taskThe proposed obstacle avoidance algorithms based on LSTMand CW-RNN achieved a very robust performance on theonline obstacle avoidance problem of UUV under unknownenvironment and remained robust performance even thoughthe effects of measurement noises are considered And dueto the strong learning ability and generalization abilitythe obstacle avoidance algorithms are capable for obstacleavoidance in the environments which are much complexthan those environments existing in training samples Whenthe number of parameters is similar CW-RNN and LSTMalso show similar performance in terms of training timethe best loss and time-consumption but in terms of pathcost obstacle avoidance planning success rate generalizationability and robustness LSTM has a better performanceFor all the proposed four methods LSTM45 obtained thebest performance in terms of learning ability generalizationand exploration ability and robustness The simulation indynamic environment verified further the excellent ability inobstacle avoidance planning

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 15: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Complexity 15

S

T

moving obstaclestatic obstacle

(a) t4=60s

S

T

(b) t4=120s

S

T

(c) t4=230s

S

T

(d) t4=348s

Figure 21 Online planning results of LSTM45 in complex dynamic environments

minus10

minus8

minus6

minus4

minus2

0

2

4

6

8

10

9Q

>DOMNGH

N(∘)

50 100 150 200 250 300 3500Time (s)

Figure 22 Curves of yaw adjustment output by LSTM45 in complex dynamic environment

Data Availability

The data used to support the findings of this study areavailable from the corresponding author upon request

Conflicts of Interest

The authors declare that they have no conflicts of interest

Acknowledgments

This research work is supported by China Natural ScienceFoundation (no 61633008) and Natural Science Foundationof Heilongjiang Province under Grant F2015035

References

[1] Y Li T Ma P Chen Y Jiang R Wang and Q Zhang ldquoAuto-nomous underwater vehicle optimal path planning method forseabed terrain matching navigationrdquo Ocean Engineering vol133 pp 107ndash115 2017 (Catalan)

[2] B Braginsky andHGuterman ldquoObstacle avoidance approachesfor autonomous underwater vehicle simulation and experi-mental resultsrdquo IEEE Journal of Oceanic Engineering vol 41 no4 pp 882ndash892 2016

[3] P G Min H J Jeon and C L Min ldquoObstacle avoidancefor mobile robots using artificial potential field approach withsimulated annealingrdquo in Proceedings of the IEEE International

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 16: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

16 Complexity

Symposium on Industrial Electronics vol 2001 pp 1530ndash1535IEEE Pusan South Korea 2001

[4] L Tang S Dian G Gu K Zhou SWang and X Feng ldquoA novelpotential fieldmethod for obstacle avoidance and path planningof mobile robotrdquo in Proceedings of the 3rd IEEE InternationalConference on Computer Science and Information Technology(ICCSIT rsquo10) pp 633ndash637 IEEE Chengdu China 2010

[5] M Faisal R Hedjar M Al Sulaiman and K Al-Mutib ldquoFuzzylogic navigation and obstacle avoidance by a mobile robot inan unknown dynamic environmentrdquo International Journal ofAdvanced Robotic Systems vol 10 no 1 pp 257ndash271 2013

[6] Z Yan J Li G Zhang andYWu ldquoA real-time reaction obstacleavoidance algorithm for autonomous underwater vehicles inunknown environmentsrdquo Sensors vol 18 no 2 p 438 2018

[7] D Zhu and B Sun ldquoThe bio-inspired model based hybridsliding-mode tracking control for unmanned underwater vehi-clesrdquo Engineering Applications of Artificial Intelligence vol 26no 10 pp 2260ndash2269 2013

[8] Z Zeng K Sammut A Lammas F He and Y Tang ldquoImperial-ist competitive algorithm for AUV path planning in a variableoceanrdquoApplied Artificial Intelligence vol 29 no 4 pp 402ndash4202015

[9] Y-H Lin S-MWang L-C Huang andM-C Fang ldquoApplyingthe stereo-vision detection technique to the development ofunderwater inspection task with PSO-based dynamic routingalgorithm for autonomous underwater vehiclesrdquo Ocean Engi-neering vol 139 pp 127ndash139 2017

[10] Y Zhuang S Sharma B Subudhi H Huang and J Wan ldquoEffi-cient collision-free path planning for autonomous underwatervehicles in dynamic environments with a hybrid optimizationalgorithmrdquo Ocean Engineering vol 127 pp 190ndash199 2016

[11] M R JabbarpourH Zarrabi J J Jung andP Kim ldquoA green ant-basedmethod for path planning of unmanned ground vehiclesrdquoIEEE Access vol 5 pp 1820ndash1832 2017

[12] M Duguleana and G Mogan ldquoNeural networks based rein-forcement learning for mobile robots obstacle avoidancerdquoExpert Systems with Applications vol 62 pp 104ndash115 2016

[13] Y Cheng andW Zhang ldquoConcise deep reinforcement learningobstacle avoidance for underactuated unmanned marine ves-selsrdquo Neurocomputing vol 272 pp 63ndash73 2018

[14] F Fathinezhad V Derhami and M Rezaeian ldquoSupervisedfuzzy reinforcement learning for robot navigationrdquoApplied SoftComputing vol 40 no C pp 33ndash41 2016

[15] S A Dabooni and D Wunsch ldquoHeuristic dynamic program-ming for mobile robot path planning based on Dyna approachrdquoin Proceedings of the 2016 International Joint Conference onNeural Networks (IJCNN rsquo16) pp 3723ndash3730 IEEE CanadaJuly 2016

[16] X Shi Z Chen H Wang D-Y Yeung W-K Wong and W-C Woo ldquoConvolutional LSTM network A machine learningapproach for precipitation nowcastingrdquo in Proceedings of the29th Annual Conference on Neural Information Processing Sys-tems (NIPS rsquo15) 2015

[17] J Zhang Y Zhu X Zhang M Ye and J Yang ldquoDeveloping aLong Short-TermMemory (LSTM) based model for predictingwater table depth in agricultural areasrdquo Journal of Hydrologyvol 561 pp 918ndash929 2018

[18] X Ma Z Tao Y Wang H Yu and Y Wang ldquoLong short-termmemory neural network for traffic speed prediction usingremotemicrowave sensor datardquoTransportation Research Part CEmerging Technologies vol 54 pp 187ndash197 2015

[19] Y Duan Y Lv and FWang ldquoTravel time prediction with LSTMneural networkrdquo in Proceedings of the IEEE 19th InternationalConference on Intelligent Transportation Systems (ITSC) pp1053ndash1058 Rio de Janeiro Brazil 2016

[20] Y Chen Y Lv Z Li et al ldquoLong Short-Term Memory Modelfor Traffic Congestion Prediction with Online Open Datardquo inProceedings of the IEEE International Conference on IntelligentTransportation Systems pp 132ndash137 IEEE 2016

[21] Q Li X Zhao and K Huang ldquoLearning temporally correlatedrepresentations using lstms for visual trackingrdquo in Proceedingsof the 2016 IEEE International Conference on Image Processing(ICIP) pp 2381ndash8549 IEEE Phoenix AZ USA 2016

[22] X Zhou L Xie P Zhang et al ldquoOnline object tracking basedon BLSTM-RNN with contextual-sequential labelingrdquo Journalof Ambient Intelligence amp Humanized Computing no 3-4 pp1ndash10 2017

[23] S H Wang J Zhao X Liu Z-M Qian Y Liu and Y QChen ldquo3D tracking swimming fish school with learned kine-matic model using LSTM networkrdquo in Proceedings of the 2017IEEE International Conference on Acoustics Speech and SignalProcessing ICASSP 2017 pp 1068ndash1072 IEEE 2017

[24] S HWang X E Cheng and Y Q Chen ldquoTracking undulatorybody motion of multiple fish based on midline dynamics mod-elingrdquo in Proceedings of the 2016 IEEE International Conferenceon Multimedia and Expo (ICME rsquo16) pp 1ndash6 IEEE 2016

[25] K Chen Y Zhou and F A Dai ldquoA LSTM-based method forstock returns prediction A case study of China stock marketrdquoin Proceedings of the IEEE International Conference on Big Datapp 2823-2824 IEEE 2015

[26] H Sak A Senior and F Beaufays ldquoLong short-term memorybased recurrent neural network architectures for large vocabu-lary speech recognitionrdquo Computer Science pp 338ndash342 2014

[27] Y Wu M Yuan S Dong L Lin and Y Liu ldquoRemaining usefullife estimation of engineered systems using vanilla LSTM neuralnetworksrdquo Neurocomputing vol 275 pp 167ndash179 2018

[28] Y Chherawala P P Roy and M Cheriet ldquoFeature design foroffline arabic handwriting recognition Handcrafted vs auto-matedrdquo in Proceedings of the 12th International Conference onDocument Analysis and Recognition (ICDAR rsquo13) pp 290ndash294IEEE 2013

[29] J Koutnık K Greff F Gomez et al ldquoA Clockwork RNNrdquoComputer Science pp 1863ndash1871 2014

[30] S Achanta T Godambe and S V Gangashetty ldquoAn investi-gation of recurrent neural network architectures for statisticalparametric speech synthesisrdquo in Proceedings of the 16th AnnualConference of the International Speech Communication Associa-tion (INTERSPEECH rsquo15) pp 859ndash863 Germany 2015

[31] T I Fossen Marine Control Systems Guidance Navigationand Control of Ships Rigs and Underwater Vehicles MarineCybernetics AS Trondheim Norway 2002

[32] V B Ernstsen R Noormets D Hebbeln A Bartholoma andBW Flemming ldquoPrecision of high-resolutionmultibeam echosounding coupled with high-accuracy positioning in a shallowwater coastal environmentrdquo Geo-Marine Letters vol 26 no 3pp 141ndash149 2006

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom

Page 17: Research on UUV Obstacle Avoiding Method Based on Recurrent …downloads.hindawi.com/journals/complexity/2019/6320186.pdf · At rst, the RNN based obstacle avoidance planners are

Hindawiwwwhindawicom Volume 2018

MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Mathematical Problems in Engineering

Applied MathematicsJournal of

Hindawiwwwhindawicom Volume 2018

Probability and StatisticsHindawiwwwhindawicom Volume 2018

Journal of

Hindawiwwwhindawicom Volume 2018

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawiwwwhindawicom Volume 2018

OptimizationJournal of

Hindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom Volume 2018

Engineering Mathematics

International Journal of

Hindawiwwwhindawicom Volume 2018

Operations ResearchAdvances in

Journal of

Hindawiwwwhindawicom Volume 2018

Function SpacesAbstract and Applied AnalysisHindawiwwwhindawicom Volume 2018

International Journal of Mathematics and Mathematical Sciences

Hindawiwwwhindawicom Volume 2018

Hindawi Publishing Corporation httpwwwhindawicom Volume 2013Hindawiwwwhindawicom

The Scientific World Journal

Volume 2018

Hindawiwwwhindawicom Volume 2018Volume 2018

Numerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisNumerical AnalysisAdvances inAdvances in Discrete Dynamics in

Nature and SocietyHindawiwwwhindawicom Volume 2018

Hindawiwwwhindawicom

Dierential EquationsInternational Journal of

Volume 2018

Hindawiwwwhindawicom Volume 2018

Decision SciencesAdvances in

Hindawiwwwhindawicom Volume 2018

AnalysisInternational Journal of

Hindawiwwwhindawicom Volume 2018

Stochastic AnalysisInternational Journal of

Submit your manuscripts atwwwhindawicom