1
Fakultät für Chemie Infrared Spectrometric Purity Control of Chemical Substances using R Fayaz Kondagula a and Karl Molt b 1 Introduction A typical Infrared Spectrum contains about 4000 data points and computer soft- ware is needed for processing the spectral data. Instrument manufacturers nor- mally provide proprietary software for this purpose. But this is often limited to a restricted number of common applications. To enable a more flexible and uni- versal numerical and statistical evaluation of spectral data, we have developed methods to directly read the spectra into R. If the spectrum of its pure reference is known, there are two ways for determining the purity of a potentially contam- inated substance. The first way is to calculate the correlation coefficient between the spectrum of the substance whose purity has to be controlled and the corre- sponding reference spectrum and the second one is using difference spectroscopy. 2 Spectral Purity by spectral comparison with linear regression 2.1 Simulated example (Fig.1) A simple way of calculating the spectral purity of a substance is regressing its spectrum on the spectrum of the corresponding pure reference substance [2] us- ing the R function lm(substance~reference). The resulting correlation co- efficient is used as a measure for the spectral purity: SPR 1 = 100 × R (1) The correlation coefficient however has the disadvantage that with increasing impurity it only slowly moves away from one. This “convex” behaviour is shown in Fig. 2(a). 600 800 1000 1200 1400 0.0 0.5 1.0 1.5 Wavenumbers A A C B BFigure 1: Simulated example of a spectral impurity. A is the spectrum of the pure component (av- erage from 9 spectra replicates) and B the spectrum of the contaminating substance. B is added to the pure component A stepwise from 0.001 - 0.01 concentration units (spectra B 0 ). C shows the spectra of the corresponding mixtures, on which some noise (amplitude 0.001) was superim- posed. Compared to R the z coefficient due to Fisher [3] reacts more sensitive to im- purities. The z-transformation was performed using the R function fisherz included in the package survcomp: z = 1 2 × ln 1+ R 1 - R (2) Fig. 2(b) demonstrates that z shows a “concave” behavior which is favorable for purity control because even small impurities have a large effect. 0.000 0.002 0.004 0.006 0.008 0.010 0.999970 0.999980 0.999990 1.000000 Impurity R (a) Dependency of the correlation coeffi- cent (R) from the degree of contamination. 0.000 0.002 0.004 0.006 0.008 0.010 0.80 0.85 0.90 0.95 1.00 Impurity (b) z 0 values calculated by transforming the R values in the left figure. Figure 2: Results obtained by regressing the spectra of the mixtures C in Fig. 1 on the reference spectrum A. The impurity concentration belonging to z c resp. z c 0 corresponds to the limit of detection (LOD) resp. the limit of capture (LOC). Repetitively measuring the spectra of the pure reference substance and then comparing the single spectra with the mean spectrum allows the calculation of a one-sided confidence interval for z . Its lower limit is as follows: z c z - ts z with t -1 t (1 - α) and α =0.01. z c determines the limit of detection (LOD) and z c 0 z - 2ts z the limit of capture (LOC). A normalization of z is then performed a Fayaz Kondagula,M.Sc., Universität Duisburg-Essen, Fachgebiet Instrumentelle Analyik, 47048 Duisburg, [email protected] b Prof. Dr. Karl Molt, Universität Duisburg-Essen, Fachgebiet Instrumentelle Analyik, 47048 Duisburg, [email protected] as follows z 0 = z z c (3) The corresponding spectral purity parameter is calculated as SPR 2 = 100 × z 0 (4) 3 Spectral Purity based on Difference Spectroscopy Another way of determining the spectral purity is by calculating the difference between the sample spectrum (C ) and the reference spectrum (A): D = C - f × A (5) f should be chosen in such a way that an optimal compensation of the reference spectrum is achieved. A special algorithm was developed for determining the op- timal difference factor f opt . This works by calculating a whole series of difference spectra with decreasing difference factors and determining the integral of the pos- itive part of the difference spectra (χ = R D + ). From this the so called “Difference Operating Characteristic” (DOC) is calculated by applying a lagged difference of second order on χ using the R function diff(Chi,differences=2). Fig 3(a) shows a plot of the DOC against the factor f . The optimal factor is where the DOC exceeds a certain threshold set by the user. -1.0 -0.5 0.0 0.5 1.0 1.5 2.0 0.0000 0.0005 0.0010 0.0015 0.0020 PANMEAN Vs PP_6B3 Shift = 0 /Sensitivity =6.5 /Factor = 0.968 Factor Diff2 (a) Difference Operating Characteristic Plot 3200 3100 3000 2900 2800 40 60 80 100 PANMEAN vs PP_6B3 Shift = 0 /Sensitivity =6.5 /Factor = 0.968 x dif (b) Difference Spectrum Figure 3: Purity control based on Difference spectroscopy χ and f opt are used as spectral purity parameters based on following equations: SPR 3 = 100 × f opt (6) SPR 4 = 100 × (1 - R ¯ ν 2 ¯ ν 1 D + R ¯ ν 2 ¯ ν 1 A ) (7) 4 Purity control of Palatinol N With the example of a certain plasticizer (Palatinol N) it is shown that a con- tamination originating from another plasticizer (Palatinol 911P) can be detected down to about a concentration of 0.1 g/100g of this impurity by Diamond ATR Spectroscopy and down to 0.9 g/100g with Near Infrared Spectroscopy. 0 2 4 6 8 10 99.86 99.92 99.98 Conc [g/100g] SPR_1 0 2 4 6 8 10 60 80 100 Conc [g/100g] SPR_2 LOD=0.1 g/100g LOC=0.3 g/100g 0 2 4 6 8 10 93 95 97 99 C [g/100g] SPR_3 0 2 4 6 8 10 94 96 98 100 C [g/100g] SPR_4 (a) Spectral Purity based on Correlation coefficient and Dif- ference Spectroscopy (MIR) 0 2 4 6 8 10 99.980 99.990 100.000 C [g/100g] SPR_1 0 2 4 6 8 10 70 90 110 C [g/100g] SPR_2 LOD=0.9 g/100g LOC=2.5 g/100g 0 2 4 6 8 10 97.0 98.0 99.0 100.0 C [g/100g] SPR_3 0 2 4 6 8 10 97.0 98.0 99.0 100.0 C [g/100g] SPR_4 (b) Spectral Purity based on Correlation coefficient and Dif- ference Spectroscopy (NIR) Figure 4: Dependency of Spectral Purity grades of Palatinol N as function of the concentration of an impurity (Palatinol 911P) 5 Summary The purity of chemical substances can be calculated either by difference spec- troscopy or spectral comparison via the correlation coefficient. The necessary statistical computations can easily be performed with R. References [1]Karl Molt und Alfred Schlachter, VDI-Berichte Nr. 1959, 3-26, 2006 [2]Horst Weitkamp und Dieter Wortig, Mikrochimica Acta 1983 II, 31-57 [3] Jacob Cohen, Statistical Power Analysis for the Behavioral Sciences, Second Edition, S. 110 f., Lawrence Erlbaum Associates, Inc. , Hove and London (1988)

Infrared Spectrometric Purity Control of Chemical ... · PDF fileInfrared Spectrometric Purity Control of Chemical Substances using R Fayaz Kondagulaa and Karl Moltb 1 Introduction

  • Upload
    buitruc

  • View
    222

  • Download
    3

Embed Size (px)

Citation preview

Fakultät für Chemie

Infrared Spectrometric Purity Control of Chemical Substances using R

Fayaz Kondagulaa and Karl Moltb

1 Introduction

A typical Infrared Spectrum contains about 4000 data points and computer soft-ware is needed for processing the spectral data. Instrument manufacturers nor-mally provide proprietary software for this purpose. But this is often limited toa restricted number of common applications. To enable a more flexible and uni-versal numerical and statistical evaluation of spectral data, we have developedmethods to directly read the spectra into R. If the spectrum of its pure referenceis known, there are two ways for determining the purity of a potentially contam-inated substance. The first way is to calculate the correlation coefficient betweenthe spectrum of the substance whose purity has to be controlled and the corre-sponding reference spectrum and the second one is using difference spectroscopy.

2 Spectral Purity by spectral comparison with linear regression

2.1 Simulated example (Fig.1)

A simple way of calculating the spectral purity of a substance is regressing itsspectrum on the spectrum of the corresponding pure reference substance [2] us-ing the R function lm(substance~reference). The resulting correlation co-efficient is used as a measure for the spectral purity:

SPR1 = 100×R (1)

The correlation coefficient however has the disadvantage that with increasingimpurity it only slowly moves away from one. This “convex” behaviour is shownin Fig. 2(a).

600 800 1000 1200 1400

0.0

0.5

1.0

1.5

Wavenumbers

A

A

C

B

B′′

Figure 1: Simulated example of a spectral impurity. A is the spectrum of the pure component (av-erage from 9 spectra replicates) and B the spectrum of the contaminating substance. B is addedto the pure component A stepwise from 0.001 - 0.01 concentration units (spectra B′). C showsthe spectra of the corresponding mixtures, on which some noise (amplitude 0.001) was superim-posed.

Compared to R the z coefficient due to Fisher [3] reacts more sensitive to im-purities. The z-transformation was performed using the R function fisherzincluded in the package survcomp:

z =1

2× ln

1 + R

1−R(2)

Fig. 2(b) demonstrates that z shows a “concave” behavior which is favorable forpurity control because even small impurities have a large effect.

●●●●●●●●● ●●●●●●●●● ●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

0.000 0.002 0.004 0.006 0.008 0.010

0.99

9970

0.99

9980

0.99

9990

1.00

0000

Impurity

R

(a) Dependency of the correlation coeffi-cent (R) from the degree of contamination.

●●●●●●●●● ●●

●●●●●●●

●●

●●●●●●●

●●●●●●●

●●

●●●

●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

0.000 0.002 0.004 0.006 0.008 0.010

0.80

0.85

0.90

0.95

1.00

Impurity

(b) z′ values calculated by transformingthe R values in the left figure.

Figure 2: Results obtained by regressing the spectra of the mixtures C in Fig. 1 on the referencespectrum A. The impurity concentration belonging to zc resp. zc′ corresponds to the limit ofdetection (LOD) resp. the limit of capture (LOC).

Repetitively measuring the spectra of the pure reference substance and thencomparing the single spectra with the mean spectrum allows the calculation ofa one-sided confidence interval for z. Its lower limit is as follows:

zc = z̄ − tsz

with t = Φ−1t (1 − α) and α = 0.01. zc determines the limit of detection (LOD) and

zc′ = z̄ − 2tsz the limit of capture (LOC). A normalization of z is then performedaFayaz Kondagula,M.Sc., Universität Duisburg-Essen, Fachgebiet Instrumentelle Analyik, 47048 Duisburg, [email protected]. Dr. Karl Molt, Universität Duisburg-Essen, Fachgebiet Instrumentelle Analyik, 47048 Duisburg, [email protected]

as followsz′ =

z

zc(3)

The corresponding spectral purity parameter is calculated as

SPR2 = 100× z′ (4)

3 Spectral Purity based on Difference Spectroscopy

Another way of determining the spectral purity is by calculating the differencebetween the sample spectrum (C) and the reference spectrum (A):

D = C − f × A (5)

f should be chosen in such a way that an optimal compensation of the referencespectrum is achieved. A special algorithm was developed for determining the op-timal difference factor fopt. This works by calculating a whole series of differencespectra with decreasing difference factors and determining the integral of the pos-itive part of the difference spectra (χ =

∫D+). From this the so called “Difference

Operating Characteristic” (DOC) is calculated by applying a lagged difference ofsecond order on χ using the R function diff(Chi,differences=2). Fig 3(a)shows a plot of the DOC against the factor f . The optimal factor is where theDOC exceeds a certain threshold set by the user.

−1.0 −0.5 0.0 0.5 1.0 1.5 2.0

0.00

000.

0005

0.00

100.

0015

0.00

20

PANMEAN Vs PP_6B3

Shift = 0 /Sensitivity =6.5 /Factor = 0.968Factor

Diff

2

(a) Difference Operating Characteristic Plot

3200 3100 3000 2900 2800

4060

8010

0

PANMEAN vs PP_6B3

Shift = 0 /Sensitivity =6.5 /Factor = 0.968x

dif

(b) Difference Spectrum

Figure 3: Purity control based on Difference spectroscopy

χ and fopt are used as spectral purity parameters based on following equations:

SPR3 = 100× fopt (6)

SPR4 = 100× (1−∫ ν̄2

ν̄1D+∫ ν̄2

ν̄1A

) (7)

4 Purity control of Palatinol N

With the example of a certain plasticizer (Palatinol N) it is shown that a con-tamination originating from another plasticizer (Palatinol 911P) can be detecteddown to about a concentration of 0.1 g/100g of this impurity by Diamond ATRSpectroscopy and down to 0.9 g/100g with Near Infrared Spectroscopy.

●●●●●●●●●●●●●●●●●●

●●●●●●●●● ●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●

●●●

●●●●●●●●●

●●●

●●●●●●

●●●●●●●●●

0 2 4 6 8 10

99.8

699

.92

99.9

8

Conc [g/100g]

SP

R_1

●●●●●●

●●●

●●●●

●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●● ●●●●●●●●● ●●●●●●●●● ●●●●●●●●●

0 2 4 6 8 10

6080

100

Conc [g/100g]

SP

R_2

LOD=0.1 g/100gLOC=0.3 g/100g

●●●●●●●●● ●●●●●●●●● ●●●

●●●●●●

●●●●●●●●●

●●

●●

●●●● ●●●

●●●●●● ●

●●●●●

● ●●●●●●●●

●●

●●●●●●

●●●

●●●●●●

●●●

●●●●●●

0 2 4 6 8 10

9395

9799

C [g/100g]

SP

R_3

●●●●●●●●●

●●●●●●

●●●

●●●●●●●●●

●●●●●●●●●

●●●●

●●●●●

●●●●●●

●●●

●●●

● ●●●●●●●●

●●●●●●●●

●●●●●●●●●

●●

●●●●●●

0 2 4 6 8 10

9496

9810

0

C [g/100g]

SP

R_4

(a) Spectral Purity based on Correlation coefficient and Dif-ference Spectroscopy (MIR)

●●●●●●●●● ●●●●●●●●● ●●●●●●●●●●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

0 2 4 6 8 10

99.9

8099

.990

100.

000

C [g/100g]

SP

R_1

●●

●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●● ●●●●●●●●● ●●●●●●●●● ●●●●●●●●●

0 2 4 6 8 10

7090

110

C [g/100g]

SP

R_2

LOD=0.9 g/100gLOC=2.5 g/100g

●●●

●●●

●●● ●●●●●●●●●

●●●●●●●●● ●

●●●●●●●●

●●●●●●●●●

●●●

●●●●●●

●●●●●●

●●●●●●●●●●●●

●●●●●●●●●

●●●

●●●

●●●

●●●

●●●

●●●

0 2 4 6 8 10

97.0

98.0

99.0

100.

0

C [g/100g]

SP

R_3

●●●●●●

● ●●●●●●●●●

●●●●●●●●●●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

●●●●●●●●●

0 2 4 6 8 10

97.0

98.0

99.0

100.

0

C [g/100g]

SP

R_4

(b) Spectral Purity based on Correlation coefficient and Dif-ference Spectroscopy (NIR)

Figure 4: Dependency of Spectral Purity grades of Palatinol N as function of the concentration ofan impurity (Palatinol 911P)

5 Summary

The purity of chemical substances can be calculated either by difference spec-troscopy or spectral comparison via the correlation coefficient. The necessarystatistical computations can easily be performed with R.

References

[1] Karl Molt und Alfred Schlachter, VDI-Berichte Nr. 1959, 3-26, 2006[2] Horst Weitkamp und Dieter Wortig, Mikrochimica Acta 1983 II, 31-57[3] Jacob Cohen, Statistical Power Analysis for the Behavioral Sciences, Second

Edition, S. 110 f., Lawrence Erlbaum Associates, Inc. , Hove and London (1988)