12
Trend Relational Analysis and Grey-Fuzzy Clustering Method * Zhijie Chen 1 , Weizhen Chen 2,3 , Qile Chen 4 and Mian-Yun Chen 2 Abstract In this paper, the latest researches on trend relational analysis and grey-fuzzy clustering method are presented. Grey systems and fuzzy systems are found everywhere. The researched results can open new prospects for the development and application of systems methodology to data mining. Keywords Trend relational analysis, grey-fuzzy clustering method, grey systems, fuzzy systems. 1 Introduction It is well known that the notion of a system is rather broad, and can be traced to antiquity. So to speak, any an object investigated, such as the motion of a macroscopic particle, or some socioeconomic phenomenon, may be qualified as a system. Systems such as social, economic, agricultural, industrial, ecological, and biological systems are usually those of great complexity. These complicated objects apparently have the following characteristic features: * This project was supported by the National Natural Science Foundation of China(69874018, 79970025), GF-PR-528. 1 School of Economics and Management, Wuhan University of Science and Engineering, Wuhan 430073, China. Email:[email protected] 2 Department of Control Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China. Email:[email protected]. Email:[email protected] 3 Department of Information and Electrical Engineering, Wuhan Polytechnic University, Wuhan 430023, China. Email:[email protected] 4 Department of Computer Science, Wuhan University of Science and Engineering, Wuhan 430073, China. Email:[email protected] - There is no physical prototype; - The operation mechanism is not clear; - The relationships between the inputs and the outputs are not obvious; - The indeterminateness is very strong; - Oft times, only a few of discrete data observed can be obtained. Thus, in the studies of such objects, how to build a system model, how to forecast, how to make decision and how to control rely to a great extent on the use of information from objective reality. From a practical point of view, however, it is very difficult, or impossible, to get the adequate or complete information from investigated object in many situations. The phenomenon with incomplete information, therefore, is usually encountered. “Incomplete information” is the fundamental meaning of being “grey”. The name of “grey system” is chosen based on the amount of known information. Consider a “black box” stands for an object such that its internal structure is totally unknown to the investigator. Here, the word “black” represents unknown information, “white” for completely known information, and “grey” for those information which are partially known and partially unknown. Accordingly, systems with completely known information are called as white systems. Systems with completely unknown information as black systems, and the systems with partially known and partially unknown information as grey systems, respectively [1,2]. Since 1982, there has been a quick development in grey systems theory[3,4] in China, and it is also very successful in the application of the theory to many real projects, such as agriculture, society, economics, engineering, IT, data mining, management, biological 234

Trend Relational Analysis and Grey Fuzzy Clustering Methods*

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

Trend Relational Analysis and Grey-Fuzzy Clustering Method*

Zhijie Chen1, Weizhen Chen2,3, Qile Chen4 and Mian-Yun Chen2

Abstract *1234

In this paper, the latest researches on trend relational analysis and grey-fuzzy clustering method are presented. Grey systems and fuzzy systems are found everywhere. The researched results can open new prospects for the development and application of systems methodology to data mining.

Keywords Trend relational analysis, grey-fuzzy clustering method, grey systems, fuzzy systems.

1 Introduction

It is well known that the notion of a system is rather broad, and can be traced to antiquity. So to speak, any an object investigated, such as the motion of a macroscopic particle, or some socioeconomic phenomenon, may be qualified as a system. Systems such as social, economic, agricultural, industrial, ecological, and biological systems are usually those of great complexity. These complicated objects apparently have the following characteristic features:

* This project was supported by the National Natural Science Foundation of China(69874018, 79970025), GF-PR-528. 1 School of Economics and Management, Wuhan University of Science and Engineering, Wuhan 430073, China. Email:[email protected] 2 Department of Control Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China. Email:[email protected]. Email:[email protected] 3 Department of Information and Electrical Engineering, Wuhan Polytechnic University, Wuhan 430023, China. Email:[email protected] 4 Department of Computer Science, Wuhan University of Science and Engineering, Wuhan 430073, China. Email:[email protected]

- There is no physical prototype; - The operation mechanism is not clear; - The relationships between the inputs and the

outputs are not obvious; - The indeterminateness is very strong; - Oft times, only a few of discrete data observed

can be obtained. Thus, in the studies of such objects, how to build a system model, how to forecast, how to make decision and how to control rely to a great extent on the use of information from objective reality. From a practical point of view, however, it is very difficult, or impossible, to get the adequate or complete information from investigated object in many situations. The phenomenon with incomplete information, therefore, is usually encountered. “Incomplete information” is the fundamental meaning of being “grey”. The name of “grey system” is chosen based on the amount of known information. Consider a “black box” stands for an object such that its internal structure is totally unknown to the investigator. Here, the word “black” represents unknown information, “white” for completely known information, and “grey” for those information which are partially known and partially unknown. Accordingly, systems with completely known information are called as white systems. Systems with completely unknown information as black systems, and the systems with partially known and partially unknown information as grey systems, respectively [1,2]. Since 1982, there has been a quick development in grey systems theory[3,4] in China, and it is also very successful in the application of the theory to many real projects, such as agriculture, society, economics, engineering, IT, data mining, management, biological

234

Page 2: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

protection, ecology, environmental studies, etc.. After over 20 years of rapid development, the theory of grey systems consists of the following main blocks of concepts and results:

- Foundation, consisting of grey numbers, grey elements and grey relations;

- Grey systems analysis, including grey incidence analysis, grey statistics, grey clustering, etc.;

- Grey systems modeling, through the use of generation of grey numbers or functions so that hidden patterns can be found;

- Grey prediction; - Grey decision making; - Grey control; - Grey process; - Integral generating transform; - Trend relational analysis; and - Systems clouds.

The reader who takes an interest in this subject should refer to Kybernetes, Vol.33, No.2, 2004.( 《 Grey Systems Theory and Applications》 ,Guest Editors: Mian-Yun Chen, Sifeng Liu, and Yi Lin). In this paper, the latest researches on trend relational analysis and grey-fuzzy clustering method are presented. The researched results here can open new prospects for the development and application of systems methodology to data mining.

2 On Grey Process

An excessively complex or complicated object, which generally shows a lack of completed model information, may be looked upon as a grey system. That is, with the aid of the grey systems approach we are able to solve the problems of the analysis and design of complicated systems, or excessively complex systems, including the data systems. Such a system might be looked upon as a data organizing framework according to which some data are considered to be relevant, others not. From a grey system’s point of view, all the indeterminate or random concepts can be regarded to be grey. In order to describe an investigated object with incomplete

information and to get a reasonably stable picture which can be communicated, some grey concepts are defined as follows:

- The most basic ingredient, which exists in a grey system, with incomplete information is called the grey element, denoted by ; ⊗

- An indeterminate variable whose amplitude varies over a suitable range is called a grey variable, denoted by X (⊗ );

- A function defined on the Cartesian product space ⊗ ∑╳T, denoted by X (⊗ ,t), is called a grey process, where ⊗ ∈⊗ ∑ is a grey element, and t∈T represents time;

- The observable output of an investigated object,

which is a function of t, denoted by , is

called a whitening function of grey process,

)()0( tX

()()0( XtX ∈ ⊗ ), t .

More often than not, only a few of the discrete data observed from the investigated object can be obtained, such as

{ }nkkXkXX ,,2,1,0)(|)( )0()0()0( K=≥=

which is called the original time series corresponding

to . We consider that, for an investigated

object, all behavioral information is contained in

)()0( tX

X ( ⊗ ,t), and all relevant information through

observation is contained in or .

Thus, or has provided the basis

for constructing systems model[5].

)()0( tX )()0( kX

)()0( tX )()0( kX

3 Trend Relational Analysis for Grey Systems

Let us consider dynamic relationships between h factors that are present in an investigated object. Naturally, we can get

hinkkX i ,,2,1,,,2,1),()0( KK == .

235

Page 3: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

Definition 3.1 We call the reference factor

(choose freely), the compared factor,

. Correspondingly,

)0(rX

)0(cX

{ hcr ,,2,1, K∈ } { })()0( kX r is

called the reference time series, { })()0( kX c the

compared time series. In order to express the approximateness and

similarity between { })()0( kX r and { })()0( kX c , we

proceed to processing of data as follows: Definition 3.2 All of the following transformations are called to be mapping of quantity:

)0()0()0(: rrrrr XXXM ∆→×

)0()0()0(: ccccc XXXM ∆→×

)0()0()0(: rccrrc XXXM ∆→×

where

{ })(,),2(),1( )0()0()0()0( nXXXX rrrr K=

{ })(,),2(),1( )0()0()0()0( nXXXX cccc K=

{ })(,),3(),2( )0()0()0()0( nXXXX rrrrrrrr ∆∆∆=∆ K

{ })(,),3(),2( )0()0()0()0( nXXXX cccccccc ∆∆∆=∆ K

{ })(,),2(),1( )0()0()0()0( nXXXX rcrcrcrc ∆∆∆=∆ K

)1()()( )0()0()0( −−=∆ kXkXkX rrrr

)1()()( )0()0()0( −−=∆ kXkXkX cccc

)()()( )0()0()0( kXkXkX crrc −=∆ .

Definition 3.3 Let Mrc be a mapping. If

Mrc : [ ]1,0)()0()0()0( ∈→∆×∆×∆ kXXX ccrrrc ξ ,

then we call )(kξ the trend relational function of both

{ })()0( kX r and { })()0( kX c .

Theorem 3.1 Based on{ })()0( kX r and{ })()0( kX c , if

{ } { }( ))(,)()( )0()0( kXkXk crrcrc ξξ =

[ +−∆+∆+= |)1()(|1 )0()0( kXkX rcrcβ

] 1)0()0( |)()(| −∆−∆ kXkX ccrrγ

nk ,,3,2 K= , [ ]1,0, ∈γβ ,

then )(krcξ is a kind of trend relational functions.

Proof. The minimal amount of information about

{ })()0( kX r and { })()0( kX c is involved in )(krcξ .

Thus, )(krcξ can express the dynamic relationship

between { })()0( kX r and { })()0( kX c sufficiently, and

meet the conditions in Definition 3.3.

similarity; )(| )0( kX rr∆ ⇒∆− |)()0( kX cc )(| )0( kX rc∆

+ ⇒−∆ |)1()0( kX rc approximateness on certain

similarity; { } { })()( )0()0( kXkX cr = ⇒ 1)( =krcξ

[6-11].

Corollary 3.1 If )(krcξ =const, ,

then

nk ,,3,2 K=

{ })()0( kX c is completely similar to { })()0( kX r .

Definition 3.4 )(krcξ is a trend relational function

between { })()0( kX r and { })()0( kX c , then

236

Page 4: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

∑ ∑= =

=Ξn

k

n

krcrc kmkmk

2 2)(/)()(ξ

is defined as the trend relational grade, where

as a measure. [ kkmkm ,1)( −= ]

Corollary 3.2 Taking , the trend

relational grade is

1)( =km

( )( )∑=

−=Ξn

krcrc kn

2)(1/1 ξ .

Consider that , we

can obtain the trend relational grade matrix as follows:

hcjr ,,2,1,,,2,1 KK ==

⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢

ΞΞΞ

ΞΞΞΞΞΞ

=

jhjj

h

h

RCM

L

MLMM

L

L

21

22221

11211

.

Theorem 3.2 The maximum element in is the

optimal with the system modeling criterion,

denoted by

RCM

rcΞ

{ }{ }rchcjromrc Ξ=Ξ== ,,2,1,,2,1. maxmax

KK.

Proof. Omitted. Theorem 3.3 Let

{ })(min)(min. kk rcrc ξξ =

{ })(max)(max. kk rcrc ξξ = .

If )(krcξ is satisfied with the criterion below

=optJ { −)(min max. krcξ })(min. krcξ ,

then it is called the optimal similar )(krcξ , denoted

by osrc.ξ , where , .

Correspondingly,

hc ,,2,1 K= nk ,,3,2 K=

osrc.Ξ is the optimal similar rcΞ .

Proof. Omitted.

omrc.Ξ and osrc.Ξ are the efficient tools of grey

dynamic modeling.

Consider that and stand for

knowledge sets, where is known about model,

and unknown about model. We have

)0(rX )0(

cX

)0(rX

)0(cX

Theorem 3.4 The minimum element in is the

optimal

RCM

rcΞ with the knowledge discovery criterion,

denoted by

{ }{ }rchcjrkdrc Ξ=Ξ== ,,2,1,,2,1. minmin

KK.

Proof. Omitted.

kdrc.Ξ is an efficient tool for discovering new

knowledge.

4 Application of Ξ to Grey Dynamic Modeling

Let us first consider a grey system with a single factor, and we can obtain a realization through observation,

say, a single time series , ),()()0( kXkX ⊗∈

nk ,,2,1 L= . Our task here is to construct the

system forecasting model, which is rather satisfactory,

based on { })()0( kX .

At first, we may transform { })()0( kX into

{ })()1( kX through the Integral Generating Transform

(IGT)

237

Page 5: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

{ } { })()( )1()0( kXkX IGT⎯⎯→⎯ ,

where

∑=

=k

mmXkX

2

)0()1( )()( ,

))()1((5.0)( )0()0()0( kXkXkX +−= ,

. nk ,,3,2 L=

The determinacy of { })()1( kX is stronger than

{ })()0( kX .

Assume that there is a set of known function,

denoted by , )( fS

{ })()( kffS j=, , . nk ,,2,1 L= mj ,,2,1 L=

Through the trend relational analysis between

{ })()1( kX and ,

respectively, we can get their trend relational grades as follows:

{ })(,),(),( 21 kfkfkf mL

.,,,21 mxfxfxf ΞΞΞ L

In order to find a satisfactorily comparing function, we have to analyze the trend relational functions and the trend relational grades. Thus answer is explicit. Suppose that

{ }jsc xfxf Ξ=Ξ max

, , mj ,,2,1 L=

then , which is sought out, may stand for a scf

{ })()1( kX ’s latent law approximately.

is a determinate function defined on time

domain, for example,

scf

cbekf kasc −= − )1()( ,

where Rcba ∈,, , . We may

recognize that implies

nk ,,2,1 L=

)(kf sc )()1( kX in the

sense of trend relation satisfactorily, i.e. is

a latency of

)(kf sc

)()1( kX . Hence, we may use data of

{ })()1( kX to fit like a glove, and find the

system forecasting model as follows:

)(kf sc

)1(ˆ)0( ˆˆ)(ˆ −= kaebakX ,

where

⎭⎬⎫

⎩⎨⎧

−−= ∑∑==

]))1((/[])()1([lnˆ3

2)0(

3

)0()0(n

k

n

kkXkXkXa ,

2

2

)1(ˆ

2

)1(ˆ2

2 2

)1(

2

)1(ˆ)1()1(ˆ

)1(

)()()1(ˆ

⎟⎠

⎞⎜⎝

⎛−−

⎟⎠

⎞⎜⎝

⎛⎟⎠

⎞⎜⎝

⎛−−

=

∑∑

∑ ∑∑

=

=

= ==

−−

n

k

kan

k

ka

n

k

n

k

n

k

kaka

een

kXekXenb ,

This is a system forecasting model by name

( )[12-14,20]. )1,1(SCGM bSCGM )1,1(

Now let us consider a grey system with factors, and we can get the original time

series

h

{ })()0( kX i , hi ,,2,1 L= , . Our

task is to construct the system model based

on

nk ,,2,1 L=

{ })()0( kX i . As stated above, we summarize the

following theorem.

Theorem 4.1 Let { })()0( kX i , , hi ,,2,1 L=

nk ,,2,1 L= , be the original time series.

Correspondently, there are the mean-value time series

{ })()0( kX i and its generating mean-value time series

238

Page 6: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

{ })()1( kXi . If

{ })()1( kXi ⎯→⎯Mtr { })(kf j ,

where , which is known, represents the

nonhomogeneous exponential function with respect to

discrete time, , , and

stands for a satisfactorily trend relation, then the

grey model of systems clouds, by name,

can be constructed as

)(kf j

nk ,,2,1 L= mj ,,2,1 L=

Mtr

bhSCGM ),1(

UtXAtX ˆ)(ˆˆ)(ˆ )1()1( +=&, continuously (4-1) 0≥t

and its solution is

UAUAXetX tA ˆˆ)ˆˆ)0(ˆ()(ˆ 11)1(ˆ)1( −− −+= (4-2)

or

UAUAXekX kA ˆˆ)ˆˆ)1(ˆ()(ˆ 11)1()1(ˆ)1( −−− −+= , (4-3)

where

])(ln[ˆ 111

−−−= K

TKK

TK XXXXA , (4-4)

⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢

−−−

=−

)1()1()1(

)3()3()3()2()2()2(

)0()0(2

)0(1

)0()0(2

)0(1

)0()0(2

)0(1

1

nXnXnX

XXXXXX

X

h

h

h

K

L

MMM

L

L

⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢

=

)()()(

)4()4()4()3()3()3(

)0()0(2

)0(1

)0()0(2

)0(1

)0()0(2

)0(1

nXnXnX

XXXXXX

X

h

h

h

K

L

MMM

L

L

))()1((5.0)( )0()0()0( kXkXkX iii +−= ,

, ; nk ,,3,2 L= hi ,,2,1 L=

1

2

)1(ˆ

2

)1(ˆ

2

)1(ˆ)1(ˆ)1(ˆ−

=

=

=

−−⎥⎦

⎤⎟⎠

⎞⎜⎝

⎛⎟⎠

⎞⎜⎝

⎛⎢⎣

⎡−−= ∑∑∑

n

k

kAn

k

kAn

k

kAkA eeeenBTT

⎥⎦

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛⎟⎠

⎞⎜⎝

⎛−−⋅ ∑ ∑∑

= ==

−−n

k

n

k

n

k

kAkA kXekXenTT

2 2

)1(

2

)1(ˆ)1()1(ˆ )()()1(

, (4-5)

⎥⎦

⎤⎢⎣

⎡−⎟

⎞⎜⎝

⎛−

= ∑∑==

−n

k

n

k

kA kXBen

C2

)1(

2

)1(ˆ )(ˆ1

1ˆ , (4-6)

CAU ˆˆˆ = ;

CBX ˆˆ)1()1( −= ;

[ ]Th kXkXkXkX )(ˆ,),(ˆ),(ˆ)( )1()1(2

)1(1

)1( L= ;

[ ]Th tXtXtXtX )(ˆ,),(ˆ),(ˆ)( )1()1(2

)1(1

)1( L= .

Proof. Selecting to be the integral factor, and

multiplying both sides of Eq. (4-1) by , we have

tAe ˆ

tAe ˆ−

UetXAtXe tAtA ˆ)](ˆˆ)(ˆ[ ˆ)1()1(ˆ −− =−&

[ ] UetXedtd tAtA ˆ)(ˆ ˆ)1(ˆ −− =

UdtetXet tAttA ˆ)(ˆ

0

ˆ0

)1(ˆ

∫ −− = .

After rearranging, Eq. (4-1) becomes Eq. (4-2) or Eq.(4-3), where Eq.(4-3) is a discrete solution of Eq.(4-1).

Let , and .

Considering the given conditions, we define

UAC ˆˆˆ 1−= CXB ˆ)1(ˆˆ )1( +=

)(ˆ)( )1()1( kXkX =

CBe kA ˆˆ)1(ˆ −= −

BeIekX AkA ˆ)()( ˆ)1(ˆ)0( −− −=

239

Page 7: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

)(ˆ)1( )0()0(ˆ kXkXeA =− .

When , , we can

construct the matrixes:

nk ,,3,2 L= hi ,,2,1 L=

1−KX and KX . If

11 −− KTK XX is nonsingular and rank

( )( ) hXXXX KTKK

TK =

−−−

1111 , then Eq.(4-4) holds.

Because { })()1( kX is satisfactorily related to the

fitting curves determined by Eq.(4-2) or Eq.(4-3), we can get

⎥⎥⎦

⎢⎢⎣

−−

−=⎥

⎤⎢⎣

⎡−−−−

−−

∑∑∑∑∑∑∑

∑∑∑∑122

121

122121121

122

121

1221211

)(0)(

ˆˆ

HHHHHHHHHHH

CB

, ⎥⎦

⎤⎢⎣

⎡ −⋅

−∑∑

2

2122121

YYHHY

where

∑=

−−=∑

n

k

kAkA eeHT

2

)1(ˆ)1(ˆ11

∑=

−−=∑

n

k

kAT

eH2

)1(ˆ12

∑=

−−=∑

n

k

kAeH2

)1(ˆ21

InH )1(22 −=∑

∑=

−=n

k

kA kXeYT

2

)1()1(ˆ1 )(

∑=

−=n

kkXY

2

)1(2 )( .

where I is the identity matrix. Eqs.(4-5) and (4-6)

hold. Thus, we have , and Eq.(4-3).

Considering Eq.(4-3)

)1(ˆ )1(X U

⇔ Eq.(4-2), Eq.(4-1) can be obtained by differentiating both sides of Eq.(4-2) with respect . t

Theorem 4.2 The forecasting model is bhSCGM ),1(

BAetX tA ˆˆ)(ˆ ˆ)0( =

or

BAekX kA ˆˆ)(ˆ )1(ˆ)0( −= ,

where and A B are given by Eq.(4-4) and Eq.(4-5),

respectively. Proof. Omitted. Corollary 4.1 When , Eq.(4-1) becomes

model.

1=h

bSCGM )1,1(

Remarks Parameters identification of

on line.

bhSCGM ),1(

As we know, the original time series which are

used to construct only reflect the past

and present behavior on investigated object. With the lapse of time, inside and outside may be changing, correspondingly, a number of new data can be obtained.

In order to make be able to reflect and

trace the change of behavior, we should improve the

original model, using the new data

observed, so that we can keep all the

time to be valid and precise. Considering the

parameters ,

bhSCGM ),1(

bhSCGM ),1(

bhSCGM ),1(

bhSCGM ),1(

A B and of model

here only, in which is essential part, we can derive

C bhSCGM ),1(

Ae ˆ

B and from it. The detail is in the following. C

Suppose that there is an original time series

240

Page 8: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

{ })(,),2(),1( )0()0()0()0( nXXXX iiii L=

which is satisfied for modeling.

According to Theorem 4.1, we have

bhSCGM ),1(

( ) 1111

ˆ −

−−−= KTKK

TK

A XXXXe .

Let and nAn ea ˆ= ( ) 1

11−

−−= KTKn XXP , then

nKTKn PXXa 1−= .

When a new data vector observed is obtained, we have

1

)0(

1

)0(

1

1

)()(

−−

+

⎟⎟⎟⎟

⎜⎜⎜⎜

⎥⎥⎥

⎢⎢⎢

⎡−−−

⎥⎥⎥

⎢⎢⎢

⎡−−−=

nX

X

nX

XP

T

KT

T

K

n

1)0()0(11 )]()([ −−− += nXnXXX T

KTK

[ ] [ ] )()0(111

111 nXXXXX K

TKK

TK

−−

−− −=

[ ]( ) 1)0(1

11)0( )()(1

−−

−−+⋅ nXXXnX KTK

T

[ ] 111

)0( )( −

−−⋅ KTK

T XXnX

1)0()0()0( ))()(1)(( −+−= nXPnXnXPP nT

nn

nT PnX )()0(⋅ . (4-7)

Let , then 11

++ = An

n ea

1)0(

1

)0(1

)()1(+

+

⎟⎟⎟⎟

⎜⎜⎜⎜

⎥⎥⎥

⎢⎢⎢

⎡−−−

⎥⎥⎥

⎢⎢⎢

+−−−= n

T

KT

T

K

n PnX

X

nX

Xa

[ ])()1( )0()0( nXanXa nn −++=

( ) )()()(1 )0(1)0()0( nXnXPnX Tn

T −+⋅

nP⋅ , (4-8)

where

[ ])(,),(),()( )0()0(2

)0(1

)0( nXnXnXnX hT L=

[ ])1(,),1(),1()1( )0()0(2

)0(1

)0( +++=+ nXnXnXnX hT L .

Thenceforth, we can get

⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛−= ∑∑

+

=

−+

=

−−+

+++

1

2

)1(ˆ1

2

)1(ˆ)1(ˆ1

111ˆn

k

kAn

k

kAkAn

Tnn

Tn eeenB

⎢⎣

⎡−⋅⎥

⎤⎟⎠

⎞⎜⎝

⎛⋅ ∑∑

+

=

−−

+

=

− ++

1

2

)1()1(ˆ11

2

)1(ˆ )(11

n

k

kAn

k

kA kXeneTnn

⎥⎦

⎤⎟⎠

⎞⎜⎝

⎛⎟⎠

⎞⎜⎝

⎛ ∑∑+

=

+

=

−+

1

2

)1(1

2

)1(ˆ )(1

n

k

n

k

kA kXeTn .

⎥⎦

⎤⎢⎣

⎡−⎟

⎞⎜⎝

⎛= ∑∑

+

=+

+

=

−+

+

1

2

)1(1

1

2

)1(ˆ1 )(ˆ1ˆ 1

n

kn

n

k

kAn kXBe

nC n .

Eqs.(4-7) and (4-8), which are two recurrence formulas, are sought out.

{ })()0( kXi may be arbitrary time series in the

presence of uncertainty. In many situations, an investigated object may be regarded as a generalized energy system, and it is suitable to represent such system behavior by exponential function. Thus, the principle of grey dynamic modeling and

model are able to provide the effective

tools for modeling the intricacies of the real world.

),1( hSCGM

5 Grey-Fuzzy Clustering Method

As is well known that the concept of a fuzzy set first arose from the study of problems related to pattern classification, since the recognition of patterns is an important aspect of human perception, which is a fuzzy process in nature. Fuzzy classification methods may be classified into the following four categories[15]:

241

Page 9: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

- Methods based on fuzzy relation; - Methods based on fuzzy pattern matching

procedures; - Methods based on fuzzy clustering procedures; and - Other methods.

On clustering methods, the primary objective of clustering techniques is to partition a given data set into so-called homogeneous clusters. The term “homogeneous” means that all points in the same group are close to each other and are not close to points in other groups. Clustering algorithms may be used to build pattern classes or to reduce the size of a set of data while retaining relevant information[16]. From a practical point of view, grey systems and fuzzy systems are found everywhere, and the representation of clusters by grey-fuzzy clustering method below may seem more appropriate in certain situations.

Suppose is a sample set,

in which each element is called a sample. In terms of practical observation, every sample has n indexes

{ hXXXX ,,, 21 K= }

{ })(,),2(),1( nXXXX iiii K= , .

Thus, based on Theorem 3.1 and Corollary 3.2, we are able to construct the trend relational grade matrix,

denoted by ,

hi ,,2,1 K=

),,( crRC XXM { }hcr ,,2,1, K∈

⎥⎥⎥⎥

⎢⎢⎢⎢

ΞΞΞ

ΞΞΞΞΞΞ

=

hhhh

h

h

crRC XXM

L

MLMM

L

L

21

22221

11211

),(

which describes the approximateness and similarity

among samples on sample set X . Obviously, the approximateness and similarity are fuzzy concepts, and the trend relational grade is a fuzzy quantity,

, and the trend relational grade matrix

is a fuzzy relation on set

10 ≤Ξ≤ rc

),( crRC XXM X .

Because satisfies ),( crRC XXM

,1),( =cr XXM ,XXX cr ∈, cr =

),(),( rccr XXMXXM = , XXX cr ∈,

),(),(2crcr XXMXXM ≤ , XXX cr ∈,

that is, it is reflexive, symmetric and transitive,

therefore, is a fuzzy equivalent

relation on

),( crRC XXM

X . In practice, we usually use

, that is, |)()(| )0()0( kXkX ccrr −

[ ] 1)0()0( |)()(|1)( −∆−∆+= kXkXk ccrrrc γξ ,

then is a “similarity” relation on ),( crRC XXM X .

In this case, there exists

),(),(ˆ1

crpRC

pcrRC XXMXXM

== U ,

and is a fuzzy equivalent relation on ),(ˆcrRC XXM

X .

Consider that the α -cut of

is an ordinary relation,

α),( crRC XXM

),( crRC XXM [ ]1,0∈α .

Thus, we can easily prove that is a

equivalent relation if is a equivalent

relation. Based ,

α),( crRC XXM

),( crRC XXM

α),( crRC XXM [ 1,0∈ ]α , we are

able to partition a given data set into so-called homogeneous clusters[17-19].

6 Demonstrating Examples

6.1 Forecasting the floodwater in Huaihe River Huaihe River is located in the south of Fuyang

242

Page 10: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

District of Anhui Province, China. This river was a place famous for its more flood in history. In 1990,

using the model, J.Sun et al [20]

successfully forecasted the catastrophic flood happened in Huaihe River in 1991 and other years. As a example,

if the discharge of river is over 7000 , the heavy

flood will happen in Huaihe River. Take abnormal

value

)1,1(SCGM

sM /3

ξ as 7000 , and write down the particular

years as follows:

sM /3

{ }83,82,75,68,60,56,54,50=X ,

in which the discharge of the Huaihe River is more

than 7000 . We may transform sM /3 X into )0(X ,

{ })8(,),2(),1( )0()0()0()0( XXXX L=

{ }34,33,26,19,11,7,5,1= .

Based on )0(X , we can construct the

forecasting model bSCGM )1,1(

)1(ˆ)0( ˆˆ)(ˆ −= kaebakX

= )1(24019.09437365.6 −ke

where

24019991.0ˆ =a , . 90936.28ˆ =b

Taking , we can obtain the forecasting result of the time when the floodwater will take place next time.

9=k

4.47)9(ˆ )0( =X that will happen in 1996.4.

On the basis of forecasting result above, the flood control strategy was taken in advance, so the heavy losses were reduced greatly in Fuyang District.

6.2 Dynamic grey-fuzzy clustering

Suppose { }54321 ,,,, XXXXXX = is the set of

five people, and a relation on X is “familiar”. According to grey-fuzzy clustering method mentioned above, we have the trend relational grade matrix

⎥⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢⎢

=

16.01.04.03.06.013.02.05.01.03.011.08.04.02.01.011.03.05.08.01.01

),( crRC XXM .

It is clear that when ,1),( =cr XXM cr = , and

),(),( rccr XXMXXM = . Such a matrix is a

fuzzy “similarity” relation on X . Thus,

),(),(ˆ1

crpRC

pcrRC XXMXXM

== U

),(4crRC XXM=

⎥⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢⎢

=

16.05.04.05.06.015.04.05.05.05.014.08.04.04.04.014.05.05.08.04.01

which is a fuzzy equivalent relation.

Based on , ),(ˆcrRC XXM X can be divided to

some clusters. Choose , α),(ˆcrRC XXM

5.0,6.0,8.0,1=α and 0.4, we have

⎥⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢⎢

=

1000001000001000001000001

),(ˆ1crRC XXM ,

and corresponding cluster:

243

Page 11: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

{ } { }{ ,, 21 XX { },3X ; { } { }54 , XX }

}

}

⎥⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢⎢

=

1000001000001010001000101

),(ˆ8.0crRC XXM ,

{ } { } { } { }{ 31542 ,,,, XXXXX ;

⎥⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢⎢

=

1100011000001010001000101

),(ˆ6.0crRC XXM ,

{ } { } { }{ 54312 ,,,, XXXXX ;

⎥⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢⎢

=

1110111101111010001011101

),(ˆ5.0crRC XXM ,

{ } { }{ }54312 ,,,, XXXXX ;

⎥⎥⎥⎥⎥⎥

⎢⎢⎢⎢⎢⎢

=

1111111111111111111111111

),(ˆ4.0crRC XXM ,

{ }54321 ,,,, XXXXX .

We can check that the clustering results obtained above are the same as usual fuzzy clustering analyses[15]. As seen by those mentioned above, the grey-fuzzy clustering method developed in this paper is a very useful tool of data mining.

References

[1]M.Y.Chen. Grey system and its dynamic modeling. Proceedings of the International AMSE Conference on “Signal, Data, Systems” Methodologies & Applications, Calcutta(India), Dec.7-9,1992,AMSE Press, Vol.2, PP.89-96.

[2]Y.Lin, M.Y.Chen and S.F.Liu. Theory of grey systems: capturing uncertainties of grey information. Kybernetes, Vol.33(2004), No.2, pp.196-218.

[3]J.L.Deng. Control problems of grey systems. Systems & Control Letters, Vol.1(1982), No.5, pp.288-294.

[4]M.Y.Chen. Grey dynamics of the system of a boring machine. Journal of Huazhong University of Science and Technology(in Chinese), Vol.10(1982), No.6, pp.7-11.

[5]M.Y.Chen. System cloud and its grey model. Proceedings of the Fourth Japanese-Sino Sapporo International Conference on Computer Applications. Hokkaido University, Sapporo 1990, pp.38-41.

[6]M.Y.Chen. Trend relational analysis and grey dynamic modeling. Proceedings of the Twelfth European Meeting on Cybernetics and Systems Research, University of Vienna, World Scientific, 1994, pp.19-24.

[7]M.Y.Chen, W.Z.Chen, Z.J.Chen. Grey state model and forecasting control of complicated systems. Proceedings of the 15th International Conference on Systems Science, Sept.7-10,2004, Wroclaw(Poland), pp.141-150.

[8]M.Y.Chen. Uncertainty analysis and grey modeling. Uncertainty Modeling and Analysis (ISUMA’90), IEEE Computer Society Press, 1990, pp.469-473.

[9]W.Z.Chen, Z.J.Chen, Z.H.Xie, L.Zhou and M.Y.Chen. Multifactor process analysis and

forecasting model. Journal of

Wuhan University of Technology(in Chinese), Vol.28(2004), No.2, pp.291-294.

mvhSCGM ),1(

244

Page 12: Trend Relational Analysis and Grey Fuzzy Clustering Methods*

[10]M.Y.Chen, H.J.Xiong and Z.J.Chen. An approach to modal control of grey systems. SAMS, 1998. Vol.33, pp.47-56.

[11]M.Y.Chen. Fuzzy control system with trend relational predictor. The Journal of Fuzzy Mathematics, Vol.2(1994), No.4. pp.685-694.

[12]M.Y.Chen. General systems studies and grey dynamic modeling. General Systems Studies and Applications, Huazhong University of Science and Technology Press, 1997, pp1-9.

[13]Z.J.Chen, W.Z.Chen, Q.L.Chen and M.Y.Chen. “Poor” signal processing and grey modeling technique. Proceedings of Eighteenth International Conference on Systems Engineering, August 16-18, 2005, Las Vegas, Nevada(USA), pp.251-256.

[14]W.Z.Chen, Z.J.Chen, Q.L.Chen and M.Y.Chen. Approaches to grey prediction and control of environmental systems. Processings of Eighteenth International Conference on Systems Engineering, August 16-18, 2005, Las Vegas, Nevada(USA), pp.152-157.

[15]M.Grabisch and F.Dispot. A comparison of some methods of fuzzy classification on real data. Proceedings of IIZUKA’92,1992,pp.659-662.

[16]D.Dubis and H.Prade. Fuzzy Sets and Systems-Theory and Application, New York, 1980.

[17]Z.J.Chen, Q.L.Chen, W.Z.Chen and Y.N.Wang. Grey linear programming. Kybernetes, Vol.33(2004), No.2,pp.238-246.

[18]H.J.Xiong, M.Y.Chen and Q.Tan. Application of trend relational grade to fuzzy clustering. Advances in Systems Science and Applications, Special Issue 1996,pp.238-243.

[19]Y.Y.Chen. Fuzzy Mathematics(in Chinese). Huazhong University of Science and Technology Press, 1986.

[20]J.Sun and J.Zhao. The application of the model of

about forecasting the floodwater in

the Huaihe branches and the storing floodwater

time in Mengwa area. Proceedings of the International Conference on “Systems, Control, Information” Methodologies & Applications (SCI’94), Huazhong University of Science and Technology Press, 1995, pp.298-301.

)1,1(SCGM

245