25
Q. DEHAINE * , L. O. FILIPPOV * PhD student at Université de Lorraine, A MULTIVARIATE APPROACH FOR PROCESS VARIOGRAMS WCSB7 – Bordeaux, 12 th June 2015 Work performed under FP7, STOICISM project

A multivariate approach for process variorums

Embed Size (px)

Citation preview

Page 1: A multivariate approach for process variorums

Q. DEHAINE*, L. O. FILIPPOV*PhD student at Université de Lorraine,

A MULTIVARIATE APPROACH FOR PROCESS VARIOGRAMS

WCSB7 – Bordeaux, 12th June 2015

Work performed under FP7, STOICISM project

Page 2: A multivariate approach for process variorums

2

INTRODUCTIONForeword• In every mining project, economic improvements pass through metallurgical

assessment based on series of tests aiming to improve the process,

• The effectiveness of these improvements will depend on the representativeness of the samples initially collected for the tests (Abzalov, 2013),

• Process samples are typically collected to obtain representative samples regarding one property, i.e. grade, mineralogy or physical characteristics,

• But response to one test usually depends not only on one property but on a certain range of p properties of interest.

How to assess the global representativeness of the samples for all the properties of interest ?

Page 3: A multivariate approach for process variorums

• Process streams can be seen as elongated objects:1D model,

• The preferred method for sampling 1D lots is the increment sampling,

• The choice of the sampling mode is very important as it changes the variance of lots mean,

3

INTRODUCTIONTheory of sampling

Page 4: A multivariate approach for process variorums

4

INTRODUCTIONThe variographic approachTOS introduced the variogram as a tool which provide critical information on (Gy, 2004; Petersen and Esbensen, 2005):

the process variability over time, the lot mean and the uncertainty of a single measurement, the optimal design and scheme for the sampling protocol.

Relative heterogeneity:

(Semi-)Variogram:

Constitutional heterogeneity:

Page 5: A multivariate approach for process variorums

TOS introduced the variogram as a tool which provide critical information on (Gy, 2004; Petersen and Esbensen, 2005):

the process variability over time, the lot mean and the uncertainty of a single measurement, the optimal design and scheme for the sampling protocol.

5

INTRODUCTIONThe variographic approach

Random effects (sampling, preparation, analysis)

Page 6: A multivariate approach for process variorums

6

MULTIVARIATE VARIOGRAPHYPrevious work

• The process samples need to be representative not only for one property but for a certain range of properties.

• A solution was proposed in spatial-data analysis by Oliver and Webster (1989)4 who suggested to study the variogram of Principal Component Analysis (PCA) scores of the first few principal components.

• Minkkinen and Esbensen (2014) have illustrated the advantages of ‘variograms of PCA’ as a way to perform a combined multivariate chemometric-variogram,

• It is also worth mentioning a reverse approach, introduced by Kardanpour et al. (2014), which consists in applying a PCA analysis on the individual variograms.

Page 7: A multivariate approach for process variorums

7

MULTIVARIATE VARIOGRAPHYApplication of multivariogram to process sampling• We now assume that the heterogeneity contribution is a multivariate measure:

Multivariate heterogeneity:

• Bourgault and Marcotte (1991) were the first to formalise the principle of a multivariate variogram which is define as:

Multivariogram:

where M is a positive definite p x p matrix which defines the metric for calculating the "distances" between the units.

Examples: - Euclidean metric: M=Ipxp or

- Mahalanobis metric: M=[Cov(H)]-1

Multivariate constitutional heterogeneity:

Page 8: A multivariate approach for process variorums

8

MULTIVARIATE VARIOGRAPHYSampling and analysesThe studied material is a residue stream from a kaolin mining plant (St Austell, UK), considered as a potential source of metals (LREE, Sn, Nb) as by-products (Dehaine and Filippov, 2015).

This stream must be sampled feasibility studies of pre-concentration by gravity concentration,

• Sampling: 50 increments extracted over 2 h (1 shift), with a 2 min frequency, for variographic study.

• Analyses: Chemical analysis: Sn, Nb and LREE (La, Ce, Nd), Moisture content: Pulp density Particle size analysis: D10, D50, D90 and Rosin-Rammler slope (RRslope)

8 important properties

Page 9: A multivariate approach for process variorums

Analytical results

9

Multivariate approach

CONTENTSComparing univariate and multivariate approaches

Univariate approach

PCA

PCA

Vario

gram

Multivariogram

Classical variograms

Multivariogram on raw data

Vario

gram

Variograms of PCA Multivariogram of PCA

Variogram Multivariogram

Page 10: A multivariate approach for process variorums

Analytical results

10

Multivariate approach

CONTENTSComparing univariate and multivariate approaches

Univariate approach

PCA

PCA

Vario

gram

Multivariogram

Classical variograms

Multivariogram on raw data

Vario

gram

Variograms of PCA Multivariogram of PCA

Variogram Multivariogram

Page 11: A multivariate approach for process variorums

CLASSIC APPROACHExperimental individual variograms

• The individual variograms allow distinguishing two main groups:– A high-sill variables group including LREE (+D90 and Sn),– a low-sill variables group (Pulp density, D10, D50, RRSlope and Nb).

• At first, all variograms look flat except LREE which display a local minimum,

• But the other high-sill variograms also display local minimums, indicating possible periodic phenomenon.

11Experimental variograms Vj of the 8

variables of interest. Experimental variograms Vj of 6 of the variables of

interest without LREE

Page 12: A multivariate approach for process variorums

CLASSIC APPROACHExperimental individual variograms

• The classical conclusion will be to focus on the highest sill variogram, i.e. LREE grade,

• But the sampling protocol will not consider the different periodic phenomena,

This approach doesn’t account for the multivariate nature of heterogeneity, which can lead to an underestimation of global sampling variance.

12 Std. deviation of sampling error for LREE grade

Page 13: A multivariate approach for process variorums

Analytical results

13

Multivariate approach

CONTENTSComparing univariate and multivariate approaches

Univariate approach

PCA

PCA

Vario

gram

Multivariogram

Classical variograms

Multivariogram on raw data

Vario

gram

Variograms of PCA Multivariogram of PCA

Variogram Multivariogram

Page 14: A multivariate approach for process variorums

PCA is a variable reduction procedure which simplify the data in a smaller number of more relevant components.• PC1 is mostly loaded by size distribution properties (D50, D90 and RRslope),

• PC2 is mostly loaded by pulp density, LREE, and to a lesser extent by D10,

14

MULTIVARIATE APPROACHVariograms of PCA

PCA 3D biplot

• PC3 is mostly loaded by LREE, as well as pulp density and RRslope.

How many PCs should be kept for variographic analysis ?

Page 15: A multivariate approach for process variorums

15

MULTIVARIATE APPROACHVariograms of PCA

• PC3 is mostly loaded by LREE, as well as pulp density and RRslope.

How many PCs should be kept for variographic analysis ?

𝐹 𝑖=𝑠2𝑖

(𝑉 ¿¿1)𝑖=(𝐶𝐻𝐿)𝑖

(𝑉 ¿¿1)𝑖¿¿

• Explained variability• F-test (randomness)

PC #

Explained variability

(%)

Cumulated explained

variability (%)

Eigenvalues SpectrumF-test*

1 50.13 50.13

 

2.63­2 18.82 68.94 2.813 11.47 80.41 1.354 7.94 88.35 1.695 6.61 94.96 0.896 4.08 99.04 1.537 0.92 99.96 1.248 0.05 100.00 1.42

*Critical values for the F-test are: P(F=0.90)=1.22, P(F=0.95)=1.30 and P(F=0.99)=1.47 (Minkkinen and Esbensen, 2014).

PCA is a variable reduction procedure which simplify the data in a smaller number of more relevant components.• PC1 is mostly loaded by size distribution properties (D50, D90 and RRslope),

• PC2 is mostly loaded by pulp density, LREE, and to a lesser extent by D10,

Page 16: A multivariate approach for process variorums

• V(PC1) reflect particle size variability within time, reaching a sill for the last lags, reflecting a long range variation.

• V(PC2) display a short range with a sill reached quickly, in-line with the short range variation displayed by the variograms of pulp density and LREE both loading PC2.

• V(PC3) is related to a cyclic variation with a short period (approx. 5 lags).

16

MULTIVARIATE APPROACHVariograms of PCA

Experimental variograms Vj , auxiliary functions wj and w’j of the 4 first PCs.

Page 17: A multivariate approach for process variorums

Analytical results

17

Multivariate approach

CONTENTSComparing univariate and multivariate approaches

Univariate approach

PCA

PCA

Vario

gram

Multivariogram

Classical variograms

Multivariogram on raw data

Vario

gram

Variograms of PCA Multivariogram of PCA

Variogram Multivariogram

Page 18: A multivariate approach for process variorums

• Size distribution and metal grade multivariograms both display high sills,• 3 distinct ranges: long (size distribution), medium (metal grade) and short (pulp density),• Each class-multivariogram is best modelled by a bounded linear model (blinear).

18

MULTIVARIATE APPROACHMultivariogram

Multivariograms for size distribution properties (D10, D50, D90 and RRslope), metal grades (Nb, Sn and LREE) and pulp density and the global multivariogram.

• Global multivariogram is best fitted by pentaspherical model,

Consequence of small, to long range displayed by the class-multivariograms,

Size Distribution has the highest sill with a possible periodic phenomenon.

Page 19: A multivariate approach for process variorums

• The global multivariogram displays a relatively high sill, as a consequence of the metric used for computation,

• Hence, the sampling variance is much more important comparing to the univariate case,• The std. deviation stay very high even for high number of increments collected to make the

final sample,

19

MULTIVARIATE APPROACHMultivariogram

Global relative standard deviation of the sampling error

Need to reduce global sampling variance.

Page 20: A multivariate approach for process variorums

Analytical results

20

Multivariate approach

CONTENTSComparing univariate and multivariate approaches

Univariate approach

PCA

PCA

Vario

gram

Multivariogram

Classical variograms

Multivariogram on raw data

Vario

gram

Variograms of PCA Multivariogram of PCA

Variogram Multivariogram

Page 21: A multivariate approach for process variorums

• The std. deviation of sampling error increase with the number of PCs retained,

• Even with the first fours PCs retained, the associated std. deviation, is lower than for the global multivariogram of the raw data.

• This is due to the fact that only the significant PCs were retained to compute this last multivariogram,

• PCA act as a noise-filter on the raw data to keep only relevant information on global variability of the dataset.

21

MULTIVARIATE APPROACHMultivariogram of PCA

Influence of the number of principal components on the multivariogram shape and std. deviation of sampling error.

Global shape of the variogram is already reached with the two first PCs.

Page 22: A multivariate approach for process variorums

Univariate variogram can lead to a misinterpretation and underestimation of global sampling error if interpretation is based on the highest variance property only,

Variograms of PCA scores highlight distinct spatial patterns thorough variable grouping in a reduced number of variograms,

Multivariogram summarise time variation of multiple properties and highlights the multivariate time auto-correlation aspects of these variables which however results in a high sampling variance,

An alternative approach is proposed, multivariogram of PCA scores, which filter noise from the data, and keep only the relevant data information,

Most of the variability of the St Austell residue stream is due to LREE according to the univariate approach, whereas multivariate approach point size distribution as the most important contribution.

22

CONCLUSIONS

Page 23: A multivariate approach for process variorums

References1. Abzalov, M., 2013. Representativiness of bulk samples, in: Beniscelli, J., Felipe Costa, J., Domínguez, O., Duggan, S., Esbensen,

K., Lyman, G., Sanfurgo, B. (Eds.), Proceedings of the 6th World Conference on Sampling and Blending. Lima, pp. 257–264.

2. Gy, P., 2004. Sampling of discrete materials III. Quantitative approach—sampling of one-dimensional objects. Chemometrics and Intelligent Laboratory Systems 74, 39–47.

3. Petersen, L., Esbensen, K.H., 2005. Representative process sampling for reliable data analysis—a tutorial. Journal of Chemometrics 19, 625–647.

4. Oliver, M.A., Webster, R., 1989. A geostatistical basis for spatial weighting in multivariate classification. Mathematical Geology 21, 15–35.

5. Minkkinen, P., Esbensen, K.H., 2014. Multivariate variographic versus bilinear data modeling. Journal of Chemometrics 28, 395–410.

6. Kardanpour, Z., Jacobsen, O.S., Esbensen, K.H., 2014. Soil heterogeneity characterization using PCA (Xvariogram) - Multivariate analysis of spatial signatures for optimal sampling purposes. Chemometrics and Intelligent Laboratory Systems 136, 24–35.

7. Bourgault, G., Marcotte, D., 1991. Multivariable variogram and its application to the linear model of coregionalization. Mathematical Geology 23, 899–928.

8. Dehaine, Q., Filippov, L.O., 2015. Rare earth (La, Ce, Nd) and rare metals (Sn, Nb, W) as by-product of kaolin production, Cornwall: Part1: Selection and characterisation of the valuable stream. Minerals Engineering 76, 141–153.

AcknowledgmentsThe authors wish to thank Imerys Ltd., UK, and especially S. Moradi, P. Chauhan and A. Coe for their help during the sampling exercise. We are also grateful to S. Lightfoot and P. Budge for their advices and technical support. We thank C. Gauthier for its help in the sample preparation process. This work has been financially supported by the European FP7 project “Sustainable Technologies for Calcined Industrial Minerals in Europe” (STOICISM), grant NMP2-LA-2012-310645.

23

REFERENCES & AKNOWLEDGMENTS

Page 24: A multivariate approach for process variorums

24

ANNEXComparing approaches

Univariate variogram Variograms of PCA MultivariogramNumber of variograms

p ≤ min(n-1, p) 1Sampling error(s) estimation

p sampling errors

Good estimation of the singular sampling errors

± As many sampling errors as significant PC selected

One sampling error only

Estimation of the overall sampling error sampling error depends on the metric used for calculation

Representativeness

Representative of the singular variability contribution

Underestimation of the global variability

Representative of (part of) the overall multivariate variability

– Depends on the cumulative variability explained by the selected principal components 

Representative of the overall multivariate variability

Sampling protocol design

– Choosing the variable with highest sampling error lead to underestimation of the real sampling variance

+ Allow to design the optimum sampling protocol regarding all the variables

The number of increments required can be very large

Allow to design the optimum sampling protocol regarding all the variables

The number of increments required can be very large

Objectives

To estimate the singular sampling errors To perform variable grouping (PCA) and explain distinct spatial patterns

To summarize the overall variability and representativeness of a sample

Page 25: A multivariate approach for process variorums

Thank you for your attention

Quentin DEHAINEPhD Student at Université de Lorraine

Email: [email protected]