Upload
independent
View
0
Download
0
Embed Size (px)
Citation preview
Cooperative Controls with Intermittent Communication
Dan Shena, Genshe Chen∗a, Jose B. Cruz, Jr.b, Khanh Phamc, Erik Blaschd, Robert Lynche
aDCM Research Resources, LLC, 14163 Furlong Way, Germantown, MD, USA 20874; bDept. of ECE, 2015 Neil Ave, the Ohio State University, Columbus, OH USA 43210;
cAir Force Research Laboratory, Space Vehicles Directorate, Kirtland AFB, NM, USA 87117; dAir Force Research Laboratory, Sensors Directorate, Wright-Patterson AFB, OH, USA 45433
eNaval Undersea Warfare Center, Newport, RI, USA 02841
ABSTRACT
In this paper, we propose a solution to the cooperative path planning with limited communication problem in two phases. In the first (offline) phase, a Pareto-optimal path problem is formulated to find a reference path and the graph cuts minimization method is used to speedily calculate the optimal solution. In the second (online) phase, a foraging algorithm is used to dynamically refine the reference path to meet the dynamic constraints of unmanned aerial vehicle (UAVs), during which an open-loop feedback optimal (OLFO) controller is used to estimate the states which may be unavailable due to infrequent battlefield information updates. Furthermore, an adaptive Markov decision process is proposed to deal with intermittent asynchronous information flow. The method is demonstrated in a simulation for a swarm of Unmanned Air Vehicle (UAV) teams with various communication ranges.
Keywords: Path planning, Pareto-optimal, open-loop feedback, Markov decision process, UAV, swarms
1. INTRODUCTION Much effort has been made to study cooperative control for autonomous, unmanned air vehicle (UAV) teams, as these teams are envisioned in near future to be capable of performing many key battlefield roles. UAV teams (or swarms) could perform (1) cooperative target search, acquisition, and tracking, (2) persistent Intelligence, Surveillance, and Reconnaissance (ISR), (3) persistent area denial, and (4) coordinated attack. In some applications, UAVs in the team may have to autonomously act in a coordinated fashion while only receiving information updates from other team members, ground operators, or remote sensing/command platforms intermittently. Current techniques for coordinated control often assume continuous and uninterrupted data transmission among the team members. However, UAV team members’ communication are often limited for variable and uncertain intervals, whether due to terrain masking, urban terrain, jamming, stealth requirements, antenna placement and vehicle orientation, bandwidth limitations, or other reasons.
Applications and scenarios of cooperative control with communication constraints have been receiving increasing attention during the past several years [1, 2]. A driving reason behind this attention is the fact that due to limited or intermittent communications, UAV team members may have varying degrees of information available at different times, or may only receive occasional updates from a remote platform or operator. The vehicles still need to be able to act cooperatively without full knowledge of the world state, or of one another’s actions. Innovative approaches are requested to deal with cooperative control issues arising from intermittent and asynchronous communications. Applications requiring UAV cooperative control include information observations related to time critical targets, moving targets, pop-up threats or targets, new team member information, or other factors.
The goal of this paper is to construct a mathematically rigorous framework for evaluation and refinement of path planning of UAV teams in the presence of intermittent and asynchronous communication. The initial phase of the research will involve the design of a framework inside which interactions among system tasks, feedback control, and dynamic communication algorithms can be analyzed and synthesized to achieve desired performance. ∗ [email protected]. This work was supported in part by the US Air Force under contracts FA9453-09-C-0175.
Space Missions and Technologies, edited by Joseph Lee Cox, Manfred G. Bester, Wolfgang Fink, Proc. of SPIE Vol. 7691, 76910D · © 2010 SPIE · CCC code: 0277-786X/10/$18 · doi: 10.1117/12.849786
Proc. of SPIE Vol. 7691 76910D-1
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
Our proposed Pareto Foraging Algorithm with open-loop feedback optimal (OLFO) controller is illustrated in Figure 1. We tackle cooperative path planning with limited communication problem in two phases: in the first (offline) phase, a Pareto-optimal path problem is formulated to find a reference path and the graph cuts minimization method is used to speedily calculate the optimal solution; in the second (online) phase, a foraging algorithm is used to dynamically refine the reference path to meet the dynamic constraints of UAVs, during which an OLFO controller is used to estimate the states which may be unavailable due to infrequent battlefield information update. Furthermore, an adaptive Markov decision process is proposed to deal with intermittent asynchronous information flow.
Figure 1. Flowchart of Pareto Foraging algorithm with OLFO controller.
The rest of the paper is organized as follows. In section 2, we present a foraging model of path planning with Pareto-optimal and OLFO. Section 3 describes the adaptive fuzzy Markov-chain approach for intermittent communication in the path planning foraging model. Numerical examples are simulated in Section 4. Finally we draw conclusions in Section 5.
2. PATH PLANNING WITH PARETO-OPTIMAL FORAGING AND OLFO 2.1 Pareto Optimal Algorithm for Offline Path Planning
The concept of a Pareto-optimal strategy is useful when we consider cooperative strategies in a team of M UAVs. A set of decision choices is said to be Pareto-optimal if for any other set of choices of decisions, it is impossible to make one team member better off without necessarily making someone else worse off. When we use the minimal-risk as the goal of path planning, the Pareto-optimal path means that the survival probability of any single UAV cannot be improved without decreasing the survivability of any other UAV in the same group. So the Pareto-optimal paths can be obtained by
*
1
arg min [ ]M
j jj
Jρ ρρ λ ρ
=
= ∑ (1)
where path ρ is a mapping from time to space [ ]: 0, nTρ →ℜ , [ ]jJ ρ is the objective function of the jth UAV, and
0 1jλ< < are the Pareto cooperation coefficients.
2.2 A Foraging Model of Online Path Planning with OLFO
The reference path obtained offline from Pareto-optimal algorithm may very likely contain line segments with turn angles exceeding the dynamic constraints of the UAVs. The reference paths that are not physically available can be eliminated by the foraging algorithms which dynamically refine the reference path to be within the bounds on the turning radius of the UAVs.
Proc. of SPIE Vol. 7691 76910D-2
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
Consider a swarm of M vehicles (UAVs) in an n-dimensional Euclidian space. We model the individuals as points and ignore their dimensions. Let i nx R∈ be the position of the ith UAV in the swarm, i nv R∈ be the velocity, and
i nu R∈ be the control input, then the dynamic model for the ith agent is i ix v=& (2)
( )( ) ( )( )
2
21,
1 ˆ ˆ2ˆ ˆ ˆ ˆexp i
i jM p p
i i i i i i i i j i i i ip v v v r p p f fxij j i s
e ev u k e k e k e e k J x d
r= ≠
⎛ ⎞− −⎜ ⎟= = − + + − − ∇ −⎜ ⎟
⎜ ⎟⎝ ⎠
∑& (3)
where i ipe x x= − is the position error,
1
1 Mi
ix x
M =
= ∑ ,
i ive v v= − is the velocity error,
1
1 Mi
i
v vM =
= ∑ ,
( )i iJ x is the cost function for the ith UAV,
( ), ( ), ( )i i if p vd t d t d t are the sensing errors of ( ), ,i
i i i ip vx
J x e e∇ respectively,
ˆi i ip p pe e d= − is the position error with sensing error,
ˆi i iv v ve e d= − is the velocity error with sensing error,
0, 0i ip vk k> > are the contraction gains,
the gain 0ifk > is proportional to that UAV’s desire to follow its profile,
0irk > is the repulsion gain, 0i
sr > is its repulsion range. The foraging algorithm with perfect communication has been discussed in [1], in which the interaction between UAVs is formulated as an “attraction-repel” method (as in potential fields analysis) in which each and every UAV seeks a “comfortable” position related to its neighboring platforms. In the case of limited communication capacity, the latest battlefield information may not be always available to calculate the cost functions of the UAVs. Limited communication makes it difficult to apply the “attraction-repel” algorithm to determine the refined path. We thus propose to use open-loop feedback optimal (OLFO) concept to estimate the states of UAVs based on a Dubin’s car model [5], in which UAVs are supposed to move only forward with bounds on the turning radius, as well as solving the foraging algorithm. The OLFO controller is indeed an adaptive model predictive controller, which uses the optimal open-loop initial decision at each stage, and re-computes the open-loop optimal deterministic control after re-evaluating the uncertainty system state at each and every time step [3, 4]. The basic structure of OLFO is shown in Figure 2, and the basic principle, the “moving horizon” strategy, is illustrated in Figure 3. We can see from Figure 2 that at the present time, the control behavior during the prediction horizon is considered. In more detail, if the present time is indexed by k , we assume that the control sequence 0, 1 , , … , has been applied to the system and that the observation sequence , 0, , 0, 1, … , is available. We would like to find a “future” control sequence as to optimize the cost-to-go over the prediction horizon conditioned on the total available information at the present time. Suppose an optimal control sequence , , , … , is calculated. Using only the first element of this optimal control sequence is feed forward control. At the next time step, for the prediction horizon which has now moved one time step forward (“moving horizon”) and whose length may have changed, a new optimal control sequence is calculated. The control loop is therefore closed in cycles (optimal feedback control).
Proc. of SPIE Vol. 7691 76910D-3
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
The cost-to-go function of the foraging problem at time step k , ikJ is now determined by the state estimate ˆ i
kx which
can be generated by a standard Kalman filter algorithm over the prediction horizon kt . The length of the prediction horizon will be predicted by a Markov-Chain model as we will explain in the next section. By defining the estimated
state sequence { }1ˆ ˆ ˆ ˆ( , ) , ,...,kk k k k tx k k t x x x+ ++ = , where 1ˆ ˆ ˆ ˆ{ , ,..., }i i M
k k k kx x x x+= , and the control sequence
{ }1( , ) , ,...,k
i i i ik k k k tU k k t u u u+ ++ , the cost-to-go function can be expressed as
( )ˆ( , ), ( , )i ik k kJ f x k k t U k k t= + + (4)
We would like to find a set of waypoints ˆ( , )kx k k t+ to minimize the cost-to-go function, i.e., , arg min , , , which may be solved by standard optimization methods as in gradient descent or the greedy method. At the next point of time, the prediction horizon 1kt + moves one time interval forward and the length will be predicted by the Markov-Chain
model. The OLFO algorithm will carry out state estimation over 1kt + again. The procedure will repeat until the destination is reached.
3. ADAPTIVE FUZZY MARKOV-CHAIN MODEL Due to various reasons such as terrain masking, urban terrain, jamming, stealth requirements, antenna placement and vehicle orientation and bandwidth limitations, UAV team members’ communication is limited for variable, uncertain intervals. Innovative approaches are requested to deal with cooperative control issues arising from intermittent and asynchronous communications. Meanwhile, as we have seen from the OLFO model, the length of prediction horizon plays an important role in determining the algorithm results. Especially in the presence of limited communication, the length of horizon determines the efficiency of the OLFO controller. In order to select a horizon which reflects the current battlefield environment at each and every time step, we present an adaptive fuzzy Markov-Chain model to estimate the arrival time of the next information update.
Figure 2. Basic structure of the OLFO controller.
Figure 3. "Moving horizon" strategy of the OLFO controller.
Proc. of SPIE Vol. 7691 76910D-4
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
Let the present time be indexed by k . Let t be the communication interval between the time steps of the current information update and the next information arrival. Since [0, )t∈ +∞ , we propose to define a random sequence
{ , 0, {1, 2,..., }}k QD d i k i Q N= = ≥ ∈ = to represent the communication intervals by the following fuzzy rules:
1
1
max
1 02 0
j j
Q
tt l
ij l t l
N t l
−
=⎧⎪ < ≤⎪⎪⎪= ⎨ < ≤⎪⎪⎪
≥⎪⎩
M M
M M
(5)
where 1 max{ ,... }l l is a set of typical interval values which may be determined arbitrarily or experimentally.
3.1 First-order Markov-Chain Model
If the next communication interval depends only on the previous update intervals, we model the predicted next communication intervals as a first-order Markov-chain such that, for any k , 1kd + is independent of 0 1 1, ,... kd d d − . Mathematically,
1 1 0 1Pr( | , ,..., ) Pr( | )k k k k kd j d i d m d s d j d i+ − += = = = = = = , ,i j Q∈ (6)
We also assume that the one-step transition probabilities ( ){ 1Pr |k kd j d i+ = = , },i j Q∈ depend only on the states i
and j and are independent of the index k , i.e.,
( )1Pr | k k ijd j d i p+ = = = (7)
for all ,i j Q∈ and for any k. The transition probabilities are stationary or homogeneous. We then may construct a one-step transition probability matrix of the first-order Markov-Chain,
11 12 1
21 22 2(1)
1 2
Q
Q
Q Q Q Q
N
N
N N N N
p p p
p p pP
p p p
⎡ ⎤⎢ ⎥⎢ ⎥
= ⎢ ⎥⎢ ⎥⎢ ⎥⎣ ⎦
L
L
L L L L
L (8)
where for any ,i j Q∈ , 0ijp ≥ and for each i Q∈ , 1ijj Q
p∈
=∑ .
3.2 Higher-order Markov Chain Model
The dependence structure of order one may not be sufficient to model some situations. For a more realistic model, it is assumed that future communication intervals might depend on more than one previous state, or possibly out-of order measurements that are delayed For example, the next information arrival could depend on the previous n communication intervals of information updates, which can be described as follows:
( )( )
1 1 1 2 1 0 1
1 1 1 2 1
Pr | , , , , ,
Pr | , , ,k k k k n n k
k k k k n n
d j d i d i d i d i
d j d i d i d i+ − − + +
+ − − +
= = = = =
= = = = =
L L
L (9)
where 1 1, , , ; 1ki i j Q k n+ ∈ ≥ −L . We explain a second-order Markov chain model and the higher order models can be developed similarly. In terms of the previous notations, the second-order dependence is characterized as
Proc. of SPIE Vol. 7691 76910D-5
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
( ) ( )1 1 0 0 1 1Pr | , , , Pr | , , , ,k k k k k kd l d j d i d i d l d j d i i j l Q+ − + −= = = = = = = = ∈L (10) The one-step transition probability matrix for the Markov chain of order two is given by
111 112 11
121 122 12(2)
1 2
Q
Q
Q Q Q Q Q Q Q
N
N
N N N N N N N
p p p
p p pP
p p p
⎡ ⎤⎢ ⎥⎢ ⎥
= ⎢ ⎥⎢ ⎥⎢ ⎥⎣ ⎦
L
L
L L L L
L
(11)
where for all , ,i j l Q∈ , we assume
( )1 1Pr , 0ijl k k kp d l d j d i+ −= = = = > (12)
and for each pair ( , )i j Q Q∈ × , 1ijll Q
p∈
=∑ .
The prediction accuracy of the next information update time critically depends on the transition probabilities, which in turn can be precise enough only when the order of the Markov-Chain model is appropriately chosen with respect to the available historical data. A hypothesis test algorithm proposed by Liu in [2] can be used to estimate the order of Markov chains with stationary transition probability matrices.
3.3 Estimates of Stationary Transition Probabilities
An adaptive approach to estimate the stationary transition probability matrix for the first-order Markov chain model was developed in [8-9]. We explain how to apply this adaptive algorithm for the first-order and second-order model, and the extension to a higher-order Markov chain model is straightforward. Let ( )ijn k represent the status of transition from the
communication interval state being i Q∈ at time 1k − to state being j Q∈ time k . That is,
11 if ,( )
0 otherwisek k
ij
d i d jn k − = =⎧
= ⎨⎩
For the first-order Markov chain, the updated stationary transition probability ˆ ( )ijp k of the next update interval being
categorized by kd j= given the previous communication interval being categorized by 1kd i− = for all ,i j Q∈ , at
discrete event index k can be estimated by
1
1 1
( )ˆ ( )
( )Q
k
ijh
ij N k
iss h
n hp k
n h
=
= =
=∑
∑∑ (13)
This update procedure increases the estimated stationary transition probability from being state i to state j and reduces it for the other states from state i . For a second-order Markov chain model, the transition probabilities ˆ ( )ijlp k can be estimated by
1
1 1
( )ˆ ( )
( )Q
k
ijlh
ijl N k
ijss h
n hp k
n h
=
= =
=∑
∑∑ (14)
Proc. of SPIE Vol. 7691 76910D-6
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
where ( )ijln k also stands for the transition status which is defined as
2 11 if , , ( )
0 otherwisek k k
ijl
d i d j d ln k − −= = =⎧
= ⎨⎩
(15)
3.4 Estimate of next information arrival
After estimating the transition probability matrices, we propose to consider the maximum likelihood strategy to determine the next communication interval. Given the previous communication interval state being 1
ikd − , an estimated
next communication interval state is selected as j
k kd d= with ˆarg max{ ( )}ijj Qj p k
∈= (16)
By examining the fuzzy rules, we then know that the corresponding predicted communication interval is 1j jl t l− < ≤ , and the worst case is that jt l= , which is to be used by the OLFO controller to estimate the current states.
4. NUMERICAL SIMULATIONS 4.1 Scenario
In the scenario shown in Figure 4, suppose there are some starting points and a final destination area in which there are some destination points. Between the starting area and final destination area there are some dangerous areas which are known to be hostile. Besides these, there are also some possible pop-up areas which are also dangerous to UAVs. The farther away from the center point of a pop-up area, the less possibility that a pop-up will appear there. For UAVs to get to the final destination points as safely as possible, usually UAVs should try to avoid the dangerous areas and the possible pop-up areas in their paths. This requires us to model the situation with some comprehensive risk-benefit functions. The simplest way to identify a path is to assign some way points for it. Moreover, there is a pop-up threat in the middle. It is supposed to be detected during the path execution stage rather than the static path planning stage. In this scenario, we assume that the communication range of each UAV is limited so that each UAV can only share information with other UAVs within the range. It is more realistic than our original centralized algorithm, whose disadvantages are dramatic: There has to be some super-node acting as the processing center which requires much higher energy, storage and computing capabilities. This will largely reduce the lifetime and survivability of the whole sensor network, especially in the case of autonomous sensor networks. The first step is to conduct static path planning to find the approximate and discrete path composed of way points. This part is based on Pareto-optimal and Voronio concept. The result is shown in Figure 5. We can see the whole x-y space is sampled according to the “importance” of each spot. (The spot, where the change objective function is larger, is deemed as a more important point.) Since the additional pop-up threat will be detected during the path refined stage (foraging algorithm), the result of static path planning is same as before. Note that the dense cost contours represent both the known and the estimated hostile areas. The blue dots are the path-planning solution that minimizes the distance from the
Figure 4. A scenario in a virtual battlefield
Proc. of SPIE Vol. 7691 76910D-7
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
start to the dused as refer
We carried ounlimited coalso investiga
estination; avoence points in
out different exommunication rated the effects
oiding the hostnext stage --- d
Fig
xperiments incrange (same as of the initial p
ile areas. Thosdynamic path e
gure 5. A scenar
luding (a) largs centralized cpositions of the
se blue dots (wexecution.
rio in a virtual ba
ge communicatcase in our orie UAVs.
waypoints of ap
attlefield
tion range, (b) iginal path plan
pproximate and
small communnning algorith
d discrete path
nication rangem). For each c
) will be
, and (c) case, we
Proc. of SPIE Vol. 7691 76910D-8
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
4.2 Larg
Figure 6: L
For UAVs wUAVs, the teThe UAVs wcommunicatidestination acloser duringteams mergecommunicati
4.3 Smal
Figure 8: S
For small codistances botsome UAVs communicati
ge Communica
arger communicposi
with close initiaeam of UAVs
will fly togetheion range, thealong the referg the journey to and become oion range (dTh
ll Communica
Small communicposi
ommunication th in Fig.8 andwill fly close
ion range.
ation Case
cation range and itions
al positions (Fcan share info
er to the destine 10 UAVs foence waypointo the destinatioone team. In bohreshold = 3.5)
ation Case
ation range and itions
range (dThresd Fig. 9, all Uenough to shar
close initial
Fig. 6), since thormation from ation from theorm three subts. Since all su
on. When the doth experimentcan avoid the
close initial
shold=1.0) expUAVs can onlyre information
Figure 7: La
he communicathe start, so a start position.
b-teams. Each ub-teams use tdistances are smts (Fig. 6 and Fpop-up threats
Figure 9: Sm
periments, becay share inform, but these kin
arger communicposit
ation range is lll the effects o. In Fig. 7, sinc
sub-team canthe same referemaller and withFig. 7), our revs detected along
mall communicaposit
ause the commation at the beds of sub-team
ation range and tions
larger than all of repels and ace some distann share informence points, th
hin the communvised algorithmg the reference
ation range and ations
munication raneginning. Duri
ms are not stabl
apart initial
the distances attraction are avnces are larger mation and flyhey will fly clonication range,
m with large bue way-points.
apart initial
nge is smaller ing the enroutele because of t
between vailable. than the
y to the oser and , all sub-
ut limited
than the e flights, the small
Proc. of SPIE Vol. 7691 76910D-9
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
It is not diffiformation in additional prfounded in F Both experim
Figure 10: S
In Fig. 11, thcollisions. In
We proposeddynamic, asy(OLFO) apporientated obbenefit of fodynamic condecision procachieve a coconducted towhich means
[1] Chen, Gplanning
[2] Liu, Y., on Aeros
[3] Dreyfus,120-134
[4] Tse, E. Control,
cult to find thaFig. 8 with clo
rocedure of flyig. 8.
ments show tha
Small communicposi
he highlighted an real applicatio
d a frameworkynchronous co
proach has sevbjective functioraging is neanstraints; 3) OLcess is used to onvergent, fasto demonstrate s less energy, s
G., Shen, D. Crg for multiple aCruz, Jr., J. B
space and Elec, S. E., “Some
4, 1962. and Athans, M
, vol. AC-17, p
at the formationose initial posiying apart, whi
at the revised al
cation range and itions
area in Fig. 10 on, this is dang
k of cooperativommunication
veral advantageons can guara
ar optimal but LFO acts as anpredict the next, risky minimour revised n
storage and com
ruz Jr., J.B., Kaerial platforms
B., and Schumactronic Systemse types of optim
M., “Adaptive pp. 38-51, no. 1
n of Fig. 9, wititions. This is bich causes mor
lgorithm can a
apart initial
was zoomed. Wgerous, so a rela
5. COve path plannin. Our Pareto-es of our coopantee goal arri
very efficientn identifier of sxt arrival times
mum and efficinon-centralizedmputing power
REF
Kwan, C., Ridds”, Infotech@Aacher, C. J., “Ps, vol. 43, no.2mal control of
stochastic con1, 1972.
th further apartbecause in the re of an overs
avoid pop-up th
Figure 11: Z
We can see thaatively larger c
ONCLUSIOng problem fo-optimal foragperative path pival and threatt, robust, convsystem model as of intermittenient trajectory
d path planningr.
FERENCES
dle, S. P, CoxAerospoace; APop-Up Threat 2 April 2007.
stochastic sys
ntrol for a cla
t initial positiofirst part of the
shot as found i
hreats during th
oomed view of t
at some UAVs communication
ONS r a team of UA
ging algorithmplanning approt/obstacle avoivergent, smootand an estimatont information
planning algog algorithms w
, S. and Matthrlington, VA; UModels for Pe
tems,” SIAM J
ass of linear sy
ons, is better ane journey in Fiin Figure 8. Th
he refinement o
the area highligh
fly very closeln range is recom
AVs in the prm with open-looaches, includiidance with mth and easy toor for system sinflows; and 5orithm. Numerwith limited c
hews, C., “A nUSA; 26-29 Seersistent Area
Journal of Con
ystems,” IEEE
nd more stable ig.8 the UAVshen more over
of the path.
hted in Fig. 9
ly but have nommended.
resence of interoop feedback ing: 1) Pareto
minimum costs;o consider thestates; 4) The ) The combinarical simulatio
communication
novel cooperatept. 2005. pp. 1Denial,” IEEE
ntrol, vol. 2, n
E Trans. on Au
than the have an rshot are
rmittent, optimal
-optimal ; 2) The
e vehicle Markov
ation can ons were n ranges,
tive path 1-10. E Trans.
no. 1, pp.
utomatic
Proc. of SPIE Vol. 7691 76910D-10
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms
[5] Dubins, L. E., “On curves of minimal length with a constraint on average curvature and with prescribed initial and terminal positions and tangents,” American J. of Mathematics, vol 79, pp 497-516, 1957.
[6] Shen, D., G. Chen, J Cruz, Jr., and E. Blasch, “A Game theoretic Data Fusion Aided Path planning Approach for Cooperative UAV ISR”, IEEE Aerospace Conference, 2008.
[7] Ahmadzadeh, A., B. Sayyar-Roudsari, and A. Homaifar, “A Hybrid projected gradient-evolutionary search algorithm for capacitated multi-source multi-UAVs scheduling with time windows,” Ch 1 in Recent Development s in Cooperative Control and Optimization, (eds) S. Butenko, R. Murphey, and P Pardalos, Kluwer, 2004.
[8] DeLima, P and D. Pack, “Maximizing Search Coverage Using Future Path Projection for Cooperative Multiple UAVs with Limited Communication Ranges,” Ch 6 in Optimization and Cooperative Control Strategies – Proc. Cooperative Control and Optimization,” (eds) M. J. Hirsch, C. W. Commander, P. M. Pardalos, and R. Murphey, Springer, 2009.
[9] K. Pham, “New Results in Stochastic Cooperative Games: Strategic Coordination for Multi-resolution Performance Robustness,” Ch 16 in Optimization and Cooperative Control Strategies – Proc. Cooperative Control and Optimization,” (eds) M. J. Hirsch, C. W. Commander, P. M. Pardalos, and R. Murphey, Springer, 2009.
[10] Chen, H, G. Chen, E. Blasch, and T. M. Schuck, “Robust Track Association and Fusion with Extended Feature Matching,” Ch 19 in Optimization and Cooperative Control Strategies – Proc. Cooperative Control and Optimization,” (eds) M. J. Hirsch, C. W. Commander, P. M. Pardalos, and R. Murphey, Springer, 2009.
Proc. of SPIE Vol. 7691 76910D-11
Downloaded From: http://spiedigitallibrary.org/ on 10/22/2013 Terms of Use: http://spiedl.org/terms