15
New approaches to New approaches to variable variable stars stars data data processing and processing and interpretation interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University, Brno, Czech Republic

New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

Embed Size (px)

Citation preview

Page 1: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

New approaches to New approaches to variablevariable  starsstars  data processing data processing

and interpretation and interpretation

Zdeněk MikulášekInstitute for Theoretical Physics and Astrophysics,

Masaryk University, Brno, Czech Republic

Page 2: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

IntroductionDevelopment from Tsessevich’s times in the field of

variable stars research is large. It has arisen:

• the number of VS itself - by one or two orders, as well as the number of their observers and interpreters.

• the volume and common access to high-quality VS observing data and computational techniques.

• the number of new efficient statistical techniques and methods that are available for everybody thanks to wide spread PCs.

Nevertheless, the methods used for processing of data mostly have remained the same as those used in Vladimir Platonovich’s era.

Page 3: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

• Every astrophysicist likes large quantities and better quality of modern observational data, new methods of processing are not so popular. Majority of them needs a good knowledge of matrix calculus.

• A frequent syndrome of VS observers – MMatrixphobiaatrixphobia.

• There are exceptions: few of mathematically erudite theoreticians love new methods and matrices so much that they do not use them for real observational data.

• Both extremes in the data processing are bad – we should find our golden mean.

• The contemporary statistics shares inexhaustible quantity of methods. It is necessary to select several of the most versatile and diverse methods, master them and to learn to combine them.

• The method of processing must not be unique, bur always must be made-to-measure of the set problem.

Page 4: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

AAdvanced PPrincipal CComponent Analysis

The majority of VS data processing tasks are solved using LSM, strictly speaking linear regressionlinear regression (polynomials, harmonic polynomials).

There are many other methods which are able to give us the same or better results. The example: APCA.

APCA – a combination of LR and standard PCA – optimal for solving a lot astrophysics problems:

realistic fitting of realistic fitting of mmultiulticcolour olour llight ight ccurvesurves the determination of the moments of extrema of McLCthe determination of the moments of extrema of McLC modeling of light multicolour curves necessary for modeling of light multicolour curves necessary for

improvement of ephemeridesimprovement of ephemerides

diagnostics of LC secular changes. Classification of LCsdiagnostics of LC secular changes. Classification of LCs

Page 5: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

HD 90044 – rotating magnetic CP star

Page 6: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

Supersylva – extrema of multicolor symmetric LC

Page 7: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

Least square methodLeast square method– the most popular method among astronomers:

minimalization of the sum of quadrates of deflections of y in respect of the before established model of observed dependence SS. The solution of LSM – the vector of free parameters of the model + their uncertainties

• The invention of the scientist – an adequate modeling of the reality. Consequent steps – only the technique of solution.

• The finding of real solution is quick if one knows a good estimate of the real solution – then substitution of the S in the space of free parameters +1 by a paraboloid

• Then conversion to linear regression – solution of the systems of k equations with k unknown parameters

• Linear regression – the model is the linear combinations of k functions – favorite – polynomial regression, hpr

Page 8: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

Benefits of orthogonal modelsBenefits of orthogonal models• Linear (linearized) LSM: uncertainties of parameters.

• Is valid:

No!!!!No!!!! • What use is to assign errors of parameters???

1 1 2 1 1

1 2 2 2 2 T 1

1 2

f ( ) f ( ) f ( )

f ( ) f ( ) f ( ), , ,

f ( ) f ( ) f ( )

.

k

k

N N k N

j j j

x x x

x x x

x x x

a s w

X V X X H V

Η

2p

1

[ f ( )] ?k

j jj

y a x

Page 9: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,
Page 10: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

• How to estimate the uncertainty of the prediction?

T1 2 p( ) (f ( ),f ( ),...f ( ));kx x x x y s w g g H g

You must know H. You can transform functions fi so that form an orthogonal basis e.g. by Gram-Schmidt orthogonalization procedure. Then H will be diagonal and the meaning of parameter uncertainty will have their awaited sense.

Orthogonal polynomials:

20 1 1 1 2 2 3

3 32 2

1 2 1 2 2 32 2

3 23 4 5 6 3 1 3 2 3

25 2 4 3 3 2

4 2 34 2 3 2

2 2 24 4 2 5 3 3 2

5 2 34 2 3 2

25 2 4

6

1; ; ; ;

0. , ;

; 0.

assuming: 0.

,

,

2

T T x b b x T x b x b

x xT T TT b b x x

x x

T x b x b x b T TT T T

x

x x x x x xb

x x x x

x x x x x x xb

x x x x

x x x xb

33 2 3

2 34 2 3 2

;x x

x x x x

Page 11: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

OrthogonalOrthogonal model model of cubic polynomial of cubic polynomial::

0 1 1 2 2 3 3

2

3 2

(0.022 0.002) (0.166 0.004)

(0.520 0.018) ( 0.143 0.160)

(0.28 0.10) ( 0.294 0.192 0.024).

y a aT a T a T

x

x x

x x x

Page 12: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

True weights in LSMTrue weights in LSM

2 2p

1

( ( )) .N

i i i ii

S y y x w w

Canonical weights of VS observers: • visual – 1, photographic – 3, photoelectric – 10 (20)True weightsTrue weights for TW Dra (before 1942)

• faintening – 1; visual I – 4, vis. II 28; PEP+photoseries – 266!True weights should not be stated in advance! It should be the result of a preliminary iterative analysis.The weight is not given only by inner accuracy of a particular observational method, but also the adequacy of the model. function. If the model is wrong, the weights of all type of measurements might be nearly equal!

Page 13: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,
Page 14: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

Robust regressionRobust regression• Practically all real (untrimmed) astrophysical data contain

rough errors – outliersoutliers. They devastate LSM method – their results are a vagary of outliers’ number and distribution.

• Second problem: Observers intending to clean their data of outliers occasionally erase also non-outliers.

• Both problems can be treated properly by a suited robust regressionrobust regression.

• We prefer RR which modifies weights of particular measurements by a special function of deflection of measured quantity from predicted values. Our favorite:

4 2

0 rr

1.06 exp ; 1.11 .2.5

ii i

y y w Nw w

N gw

Page 15: New approaches to variable stars data processing and interpretation Zdeněk Mikulášek Institute for Theoretical Physics and Astrophysics, Masaryk University,

ConclusionsConclusions

• New methods of variable stars data processing enable us better exploit information hidden in their observations. Endeavor connected with mastering of them will return in new subtle discoveries and revealing.

• Matrix calculus, true using of weights, advanced principal component analysis, factor analysis, robust regression, creation and usage of orthogonal models and several other processing techniques should appertain to compulsory outfit of each variable stars’ observer of the 21st century