7
Journal of Control, Automation and Systems Engineering Vol. 10, No. 12, December 2004 1 I. ( ) [1-14]. (DNA microarray) (high throughput) (time-series) [3-14]. ( ) (reverse engineering) . . (transcription) mRNA mRNA (translation) (protein) . (metabolism) . . . . (Boolean networks) [3], (linear networks) [4,5,9,11], (Bayesian networks) [6], (differential equations) [15], (recurrent neural networks) [1,2,13,14] . . SA(simulated annealing) , (genetic algorithm; GA), (gradient descent methods), (linear regression methods) [4,5,8,9,11] . , . , (<20) (>500) . . . . Construction of Gene Interaction Networks from Gene Expression Data Based on Evolutionary Computation , * (Sung Hoon Jung and Kwang-Hyun Cho) Abstract : This paper investigates construction of gene (interaction) networks from gene expression time-series data based on evolutionary computation. To illustrate the proposed approach in a comprehensive way, we first assume an artificial gene network and then compare it with the reconstructed network from the gene expression time-series data generated by the artificial network. Next, we employ real gene expression time-series data (Spellman's yeast data) to construct a gene network by applying the proposed approach. From these experiments, we find that the proposed approach can be used as a useful tool for discovering the structure of a gene network as well as the corresponding relations among genes. The constructed gene network can further provide biologists with information to generate/test new hypotheses and ultimately to unravel the gene functions. Keywords : microarray, gene interaction networks, time-series gene expression data, evolutionary computation (Corresponding Author) ([email protected]) ([email protected])

Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

Journal of Control, Automation and Systems Engineering Vol. 10, No. 12, December 2004 1

I.

( )[1-14].

(DNA microarray) (highthroughput)(time-series)

[3-14].( )

(reverse engineering) .

.(transcription) mRNA

mRNA (translation) (protein).

(metabolism) .

.

.

.

(Boolean networks) [3],(linear networks) [4,5,9,11], (Bayesiannetworks) [6], (differential equations) [15],

(recurrent neural networks) [1,2,13,14].

.

SA(simulated annealing) , (geneticalgorithm; GA), (gradient descent methods),

(linear regression methods) [4,5,8,9,11].

,. ,

(<20)(>500) .

.

.

.

Construction of Gene Interaction Networks from GeneExpression Data Based on Evolutionary Computation

, *

(Sung Hoon Jung and Kwang-Hyun Cho)

Abstract : This paper investigates construction of gene (interaction) networks from gene expression time-series data based onevolutionary computation. To illustrate the proposed approach in a comprehensive way, we first assume an artificial genenetwork and then compare it with the reconstructed network from the gene expression time-series data generated by theartificial network. Next, we employ real gene expression time-series data (Spellman's yeast data) to construct a gene networkby applying the proposed approach. From these experiments, we find that the proposed approach can be used as a useful toolfor discovering the structure of a gene network as well as the corresponding relations among genes. The constructed genenetwork can further provide biologists with information to generate/test new hypotheses and ultimately to unravel the genefunctions.

Keywords : microarray, gene interaction networks, time-series gene expression data, evolutionary computation

(Corresponding Author)

([email protected])([email protected])

Page 2: Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

2 제어 자동화 시스템공학 논문지 제 권 제 호10 , 12 2004. 12․ ․

.

.

. ,

.

( GA [16-18](evolutionary programming; EP) [19]) . GA

( )

.EP

( )

.

..

Wahde [13,14].

Wahde

. EP GA. GA EP

EP.

. 2. 3 GA EP

. 4

. 5 .

II., ,

, ,.

..

(1)

(1)

(activation function)

(time constant) .

(Sigmoid).

(1) .

(2)

.

(3)

(3) (3) (2) .

(3) . (3)

.

. (3)

.

(3)

. (0,1)

0 1 .

III.SA

, , ,[4,5,8,9,11].

Page 3: Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

Journal of Control, Automation and Systems Engineering Vol. 10, No. 12, December 2004 3

GA. Wahde [8]

GA .

. Wahde

.

.

GA

.GA

. 4GA

. GA Wahde [8]. Wahde GA

GA.

GA.

.

.

.

. 12

.

1. GA .Table1. Parameters for the GA.

pc 0.6

40

pm 0.01

20

(24 * 30) bits

0.001

30 .(24

24 * 30 = 720 ) (crossover).

. GA1 .

.GA

(MSE:mean square errors). .

(4)

(4) , ,

GA

,

.

.

.(4) MSE .

(5)

GA EP .

. EPGaussian (perturbation)

. , (randomsearch)

.GA

GA . GA

EP

.

2. .Table2. Parameters for the EP.

10024

400.001

Page 4: Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

4 제어 자동화 시스템공학 논문지 제 권 제 호10 , 12 2004. 12․ ․

(4)MSE . MSE

. 2.

IV.

. Wahde[8]. 3.

31 .

GA

.

. Wahde 50.

4 3.

.

3. .Table3. Parameters of the artificial gene network.

20.0 5.0 0.0 0.0 0.0 10.025.0 -5.0 -17.0 0.0 -5.0 5.00.0 10.0 20.0 -20.0 -5.0 5.00.0 0.0 10.0 -5.0 0.0 15.0

1. .Fig. 1. Artificial gene expression time-series data.

4. (a) Wahde GA (b) GA (c) EP.Table4. Comparison of experiments (a) Wahde's GA (b) GA

(c) EP.

11(11) 7.8(12) 7.5(12) 0.8(13) 3.7(4.8) 11(1.2)

14(7.7) -16(7.3) 0.45(13) -2.4(14) 1.8(4.6) 7.5(1.8)

5.9(14) 11(11) 6.1(13) 3.2(14) 1.9(5.8) 9.1(0.7)

3.0(11) 1.5(13) 5.3(11) -16(7.0) 1.6(5.7) 16(5.3)

(a)

17(9.3) 11(13) 6.7(15) 4.0(16) 3.1(5.5) 10(1.1)

20(7.8) -13(5.3) -11(10) -9.4(12) 5.1(4.7) 5.2(0.5)

-12(9.3) 16(7.9) 12(9.5) 13(11) -4.8(4.6) 5.8(0.6)

-4.5(13) 11(12) 14(11) 1.4(15) -0.4(5.4) 17(1.8)

(b)

14(13) 9.1(15) 7.9(14) 7.1(17) 3.7(6.1) 10(1.1)

24(6.1) -15(4.2) -18(8.2) -8.2(10) 8.0(2.9) 5.1(0.3)

-8.7(8.4) 15(8.6) 12(10) 15.2(14) -8.3(2.7) 5.1(0.6)

6.3(15) 12(14) 11(14) 4.4(15) -1.3(7.3) 18(1.8)

(c)

.

.

.

(6)

(6) ,

GA.

. ,

0 . (6).

.

1 .

. 5 .

Page 5: Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

Journal of Control, Automation and Systems Engineering Vol. 10, No. 12, December 2004 5

5. .Table5. Comparison of the measure.

( )

Wahde's GA [8] 0.042GA 0.134EP 0.138

2. Spellman (511x18).Fig. 2. Yeast data profiles of Spellman data(511x18).

3. 4 .

Fig. 3. Data profiles of four clusters.

5 GA Wahde [8].

Wahde GA( )

. GA,

.

.

GA.

.Spellman[20]

. Spellman 6178 82511

18 . 2 511x18.

K-means4

. 511x18 4x18. 4

.

.

. 3. 4

0 1(normalize) . 4x18 4

.GA

. GA

. 4 4

(,

).100

. ,

-5 +5 4 21. 6 GA

.6

.. GA

...

Page 6: Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

6 제어 자동화 시스템공학 논문지 제 권 제 호10 , 12 2004. 12․ ․

4. 4 .

Fig. 4. Four cluster data profiles and generated data.

6. : (a) GA (b) EP.

Table6. Comparison of experiments : (a) GA (b) EP.

-3.3(1.7) 2.7(2.4) 4.4(1.2) -3.3(2.0) -1.2(1.8) 6.6(1.4)1.5(2.8) -3.5(1.6) -4.5(1.1) 4.0(1.7) 2.0(1.6) 7.3(2.1)-4.6(1.2) 3.9(1.2) -1.1(1.4) 0.1(2.8) 0.9(2.1) 4.3(0.7)4.7(0.7) -4.4(1.1) -0.4(2.0) -4.1(1.1) 2.3(1.4) 4.0(0.0)

(a)

-3.3(1.9) -0.8(3.6) 4.0(1.1) -4.6(0.9) 1.8(2.6) 4.6(0.8)-3.6(2.3) -3.4(1.9) -4.8(0.5) 4.8(0.4) 3.7(2.0) 4.2(0.5)-4.7(0.6) 4.7(0.5) -0.7(1.9) -1.8(3.0) 1.2(2.4) 4.2(0.4)4.7(0.6) -4.8(0.4) -2.0(2.2) -4.5(0.8) 3.7(1.4) 4.0(0.1)

(b)

.

. ,.

.

..

.0 ( , 0.8 )

100.

0.

0 .

0 0.

., .

V.

.

. GA

.

(

).. GA

.

.

.

[1] J. Reinitz and D. H. Sharp, ''Mechanism of eve stripeformation,'' Mechanisms of Development, no. 49, pp.133-158, 1995.

[2] M. B. Eisen, P. T. Spellman, P. O. Brown, andD.~Botstein, ''Cluster analysis and display of genome-wideexpression patterns,'' in Proceedings of the NationalAcademy of Sciences, pp. 14863-14868, Dec. 1998.

[3] T. Akutsu, S. Miyano, and S. Kuhara, ''Identification ofgenetic networks from a small number of gene expressionpatterns under the Boolean network model,'' in PacificSymposium on Biocomputing, pp. 17-28, Jan. 1999.

[4] D. Weaver, C. Workman, and G. Stormo, ''Modelingregulatory networks with weight matrices,'' in PacificSymposium on Biocomputing, pp. 112-123, Jan. 1999.

[5] P. D'haeseleer, X. Wen, S. Fuhrman, and R. Somogyi,''Linear modelling of mRNA expression levels duringCNS development and injury,'' in Pacific Symposium onBiocomputing, pp. 41-52, Jan. 1999.

[6] N. Friedman, M. Linial, I. Nachman, and D. Pe'er,''Using bayesian networks to analyze expression data,''Journal of Computational Biology, no. 7, pp. 601-620,2000.

Page 7: Construction of Gene Interaction Networks from Gene Expression …sbie.kaist.ac.kr/ftp/진화연산에 기반한 유전자... · 2004-11-26 · JournalofControl,AutomationandSystemsEngineeringVol.10,No.12,De

Journal of Control, Automation and Systems Engineering Vol. 10, No. 12, December 2004 7

[7] P. D'haeseleer, ''Reconstructing gene networks from largescale gene expression data''. PhD thesis, University of NewMexico, Albuquerque, 2000.

[8] M. Wahde and J. Hertz, ''Coarse-grained reverse engineeringof genetic regulatory networks,'' BioSystems, no. 55, pp.129-136, 2000.

[9] M. Wahde and J. Hertz, ''Modeling genetic regulatorydynamics in neural development,'' Journal of ComputationalBiology, no. 4, pp. 429-442, 2001.

[10] E. van Someren, L. Wessels, and M. Reinders, ''Linearmodelling of genetic networks from experimental data,''in Proceedings of the Eighth International Conference onIntelligent Systems for Molecular Biology, pp. 355-366,Aug. 2000.

[11] E. van Someren, L. Wessels, and M. Reinders, ''Geneticnetwork models: A comparative study,'' in Proceedings ofSPIE, Micro-arrays: Optical Technologies and Informatics,pp. 236-247, Jan. 2001.

[12] L. Wessels, E. van Someren, and M. Reinders, ''Acomparison of genetic network models,'' in PacificSymposium on Biocomputing, pp. 508-519, Jan. 2001.

[13] M. Wahde and J. Hertz, ''Modeling genetic regulatorydynamics in neural development,'' Journal of Computa-tional Biology, vol. 8, pp. 429-442, Sep 2001.

1988 .(1991),

(1995) (1996). 1996,

., ,

.

[14] M. Takane, ''Inference of Gene Regulatory Networks fromLarge Scale Gene Expression Data,'' Master's thesis,McGill University, 2003.

[15] T. Chen and G. C. H. He, ''Modeling gene expression withdifferential equations,'' in Pacific Symposium on Biocompu-ting, pp. 29-40, Jan. 1999.

[16] J. Holland, Adaptation in natural and artificial systems.1st ed.: University of Michigan Press, Ann Arbor; 2nded.: 1992, MIT Press, Cambridge, 1975.

[17] D. Goldberg, Genetic Algorithms in Search, Optimizationand Machine Learning. Reading, MA: Addison-Wesley,1989.

[18] L. Davis, Handbook of Genetic Algorithms. New York:Van Nostrand Reinhold, 1991. L. Davis (Ed.).

[19] D. B. Fogel, ''An Introduction to Simulated EvolutionaryOptimization,'' IEEE Trans. on Neural Networks, vol. 5,pp. 3-14, Jan. 1994.

[20] P. T. Spellman, G.Sherlock, M.Q. Zhang, V.R. Iyer,K.Anders, M.B. Eisen, P. O. Brown, D.Botstein, andB.Futcher, ''Comprehensive Identification of Cell Cycle-regulated Genes of the Yeast Saccharomyces cerevisiaeby Microarray Hibridization},'' Molecular Biology of theCell, vol. 9, pp. 3273-3297, 1998.

1993 KAIST .(1995),

(1998) (1998). 19992004 8

.2002 6 2003 8 UMIST

. 2003 12 2004 3Royal Institute of Technology .

2004 6 2004 8 Hamilton Institute. 2004 9 (

) (KoreaBio-MAX Institute, )

. 2004 2005 ‘Systems Biology(IEE, )’ Editor-in-Chief .

( ).