7
Atlas Journal of Medical & Biological Sciences 1 (1): 814, 2011 doi: 10.5147/ajmbs.2011.0044 Atlas Journal of Medical & Biological Sciences - ISSN 1949-1352. Published By Atlas Publishing, LP (www.atlas-publishing.org) Mammographic Breast Density Assessment: A Methods Study Lusine Yaghjyan 1 , Susan M. Pinney 1* , Mary C. Mahoney 2 , Arthur R. Morton 3 , Jeanette Buckholz 4 1 University of Cincinnati College of Medicine, Department of Environmental Health, Cincinnati, OH, USA; 2 University of Cincinnati College of Medicine, Department of Radiology, Cincinnati, OH, USA; 3 University of Cincinnati University Hospital, Department of Radiology, Cincinnati, OH, USA; 4 University of Cincinnati Col- lege of Medicine, Department of Internal Medicine, Cincinnati, OH, USA Received: April 12, 2010 / Accepted: March 24, 2011 __________________________________________________ * Corresponding author: [email protected] Introduction Mammographic breast density is a strong predictor of breast cancer (Vachon et al., 2007, Harvey & Bovbjerg, 2004, Boy et al., 2006, Haiman et al., 2002, Tamimi et al., 2007, Ursin et al., 2003, Ginsburg et al., 2008). Light (non-radiolucent) ar- eas on the mammogram represent the fibrous and glandular tissues in the breast, whereas, the dark (radiolucent) areas are primarily fat. In clinical practice and research, breast density has been traditionally classified using the Wolfe’s classification with four breast parenchymal patterns (N1, P1, P2, and DY) or the six category classification (SCC) system based on the vi- sual estimation of the percent density (1 [0%], 2 [1–10%], 3 [11–24%], 4 [ 25–49%], 5 [50–74%], 6 [ 75–100%]) (Brisson et al., 2003, Petroudi et al., 2003, Highnam et al., 2007, Jef- frey set al., 2008, Lee-Han et al., 1995, Jamal et al., 2006). The American College of Radiology proposed a modified version of the Wolfe classification known as the Breast Imaging-Reporting and Data System (BI-RADS) classification (1-predominantly fat; 2-fat with some fibroglandular tissue; 3-heterogeneously dense; 4-extremely dense) (American College of Radiology, 1998). In addition, several computer algorithms have been developed and used to estimate breast density (Zhou et al., 2001, Muhim- mah et al., 2006). In planning a retrospective study of breast density determi- nants in a large cohort of women (Fernald Medical Monitoring Program [FMMP]), we conducted a preliminary study to find the best way to characterize mammographic breast density. There were two potential approaches. We could assign a level of den- sity for each mammogram using the radiologist’s report. Alter- nately, we could digitize the mammograms and evaluate breast 8 Abstract This pilot study was designed to find the best way to charac- terize mammographic breast density for future epidemiologic studies. Fifty mammograms (24 high, 6 low, and 20 inter- mediate density mammograms, as defined by the original radiologist reading) were selected from women participating in a community medical surveillance program. Women were required to be Caucasian, 40-85 years old and have a Body Mass Index ≤30 at the time of the mammogram; and to have no history of breast cancer. Mammograms were selected by stratified random sampling on the year of mammogram and degree of density. The agreement between breast density category assignment using different methods was assessed using the kappa statistic. Substantial to perfect agreement was observed between density categories as determined from two radiologists’ readings (kappa statistic [K]=0.73). Perfect agreement was noted between two radiologists for high vs. intermediate (K=0.81) and high vs. low density combinations (K=0.86). When grouped by the year of the mammogram, substantial agreement was observed for all year intervals (K=0.77, 0.69, and 0.70). The agreement be- tween density group assignment using information from dig- ital images and using codes applied to the radiologist read- ing showed fair to moderate agreement (K 0.19-0.36). Using the radiologist reading for characterization of breast density from earlier conventional mammography films is better than using density quantification from digitized mammograms. Key words: Epidemiologic methods, breast density, mammog- raphy.

2-AJMBS-Yaghiyan_et_al_2011.pdf

  • Upload
    ogy123

  • View
    6

  • Download
    1

Embed Size (px)

Citation preview

Page 1: 2-AJMBS-Yaghiyan_et_al_2011.pdf

Atlas Journal of Medical & Biological Sciences 1 (1): 8–14, 2011doi: 10.5147/ajmbs.2011.0044

Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

Mammographic Breast Density Assessment: A Methods Study

Lusine Yaghjyan1, Susan M. Pinney1*, Mary C. Mahoney2, Arthur R. Morton3, Jeanette Buckholz4

1 University of Cincinnati College of Medicine, Department of Environmental Health, Cincinnati, OH, USA; 2

University of Cincinnati College of Medicine, Department of Radiology, Cincinnati, OH, USA; 3 University of Cincinnati University Hospital, Department of Radiology, Cincinnati, OH, USA; 4 University of Cincinnati Col-lege of Medicine, Department of Internal Medicine, Cincinnati, OH, USA

Received: April 12, 2010 / Accepted: March 24, 2011

__________________________________________________* Corresponding author: [email protected]

Introduction

Mammographic breast density is a strong predictor of breast cancer (Vachon et al., 2007, Harvey & Bovbjerg, 2004, Boy et al., 2006, Haiman et al., 2002, Tamimi et al., 2007, Ursin et al., 2003, Ginsburg et al., 2008). Light (non-radiolucent) ar-eas on the mammogram represent the fibrous and glandular tissues in the breast, whereas, the dark (radiolucent) areas are primarily fat. In clinical practice and research, breast density has been traditionally classified using the Wolfe’s classification with four breast parenchymal patterns (N1, P1, P2, and DY) or the six category classification (SCC) system based on the vi-sual estimation of the percent density (1 [0%], 2 [1–10%], 3 [11–24%], 4 [ 25–49%], 5 [50–74%], 6 [ 75–100%]) (Brisson et al., 2003, Petroudi et al., 2003, Highnam et al., 2007, Jef-frey set al., 2008, Lee-Han et al., 1995, Jamal et al., 2006). The American College of Radiology proposed a modified version of the Wolfe classification known as the Breast Imaging-Reporting and Data System (BI-RADS) classification (1-predominantly fat; 2-fat with some fibroglandular tissue; 3-heterogeneously dense; 4-extremely dense) (American College of Radiology, 1998). In addition, several computer algorithms have been developed and used to estimate breast density (Zhou et al., 2001, Muhim-mah et al., 2006).

In planning a retrospective study of breast density determi-nants in a large cohort of women (Fernald Medical Monitoring Program [FMMP]), we conducted a preliminary study to find the best way to characterize mammographic breast density. There were two potential approaches. We could assign a level of den-sity for each mammogram using the radiologist’s report. Alter-nately, we could digitize the mammograms and evaluate breast

8

Abstract

This pilot study was designed to find the best way to charac-terize mammographic breast density for future epidemiologic studies. Fifty mammograms (24 high, 6 low, and 20 inter-mediate density mammograms, as defined by the original radiologist reading) were selected from women participating in a community medical surveillance program. Women were required to be Caucasian, 40-85 years old and have a Body Mass Index ≤30 at the time of the mammogram; and to have no history of breast cancer. Mammograms were selected by stratified random sampling on the year of mammogram and degree of density. The agreement between breast density category assignment using different methods was assessed using the kappa statistic. Substantial to perfect agreement was observed between density categories as determined from two radiologists’ readings (kappa statistic [K]=0.73). Perfect agreement was noted between two radiologists for high vs. intermediate (K=0.81) and high vs. low density combinations (K=0.86). When grouped by the year of the mammogram, substantial agreement was observed for all year intervals (K=0.77, 0.69, and 0.70). The agreement be-tween density group assignment using information from dig-ital images and using codes applied to the radiologist read-ing showed fair to moderate agreement (K 0.19-0.36). Using the radiologist reading for characterization of breast density from earlier conventional mammography films is better than using density quantification from digitized mammograms.

Key words: Epidemiologic methods, breast density, mammog-raphy.

Page 2: 2-AJMBS-Yaghiyan_et_al_2011.pdf

density using computer-assisted techniques. A few studies reported a good agreement between breast

density estimation by a radiologist and that from the digitized mammogram films (Jeffreys et al., 2008). However, the breast density estimation method used in previous studies differed from the more detailed approach proposed for breast density es-timation from the FMMP mammograms. This preliminary study aimed 1) to determine whether the classification of participants into density categories using information coded from the mam-mography report is as informative as using computer-aided den-sity estimation from digitized mammograms; and 2) to find how FMMP mammography codes could be used to improve definition of low and high density for the planned retrospective study.

Materials and Methods

Mammogram Selection

Mammograms used in the present study were available from the FMMP mammography database. The FMMP was a commu-nity-based medical surveillance program for residents living within five miles of the perimeter of the former uranium pro-cessing plant in Fernald, Ohio. All mammograms were read by Board-certified radiologists. Many mammography studies were performed prior to 1995 and thus the BI-RADS classification was not included in those mammography reports. Mammographic re-ports were assigned codes from a set of over 230 standardized mammography codes that represented the inclusion of specific descriptors in the radiologist’s report. For example, the code 704 (Breasts high in density/markedly dense) was assigned to the following description “The breast tissue is markedly dense…” The text of the report and the codes were entered into an ex-

tensive database. For inclusion in the methods study, women were required to

be Caucasian (99% of FMMP population), 40-85 years old and have a Body Mass Index (BMI) ≤30 at the time of the mammo-gram of interest, to never have had breast cancer, and to be eli-gible for the main study. Using the mammogram codes assigned to the radiologist’s report, all eligible women were classified into one of the three breast density categories: low, intermedi-ate, and high. Women in high density category were required to have at least one mammogram with the high density code. The intermediate density group consisted of women who never had the high density code on any mammogram, but had one or more intermediate density codes on at least one mammogram. Women with low density were required to have never had any of the high or intermediate density codes. The selection of mam-mograms was restricted to those which were done as a part of the FMMP annual examinations (97.8%). As the radiologist (MCM) designated to do the second reading for this study was the original reader for some of the FMMP mammograms, those mammograms were excluded from the study (3.5%).

Only a small amount of funding was available and the study was restricted to 50 mammograms. We a priori decided to se-lect a larger numbers of high and intermediate density films (24 and 20, respectively) and a smaller number of low density films (6 films). We also stratified our sampling on the year of mam-mogram to account for changes in technology over time (Haus, 2002). The mediolateral oblique (MLO) views of 50 mammo-grams were evaluated in two phases of the study: 1) the second radiologist assessment of breast density and 2) quantification of breast density from the digital image. The breast density assign-ment from two phases was compared with the level of density from the original mammography report (Figure 1).

9Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

FMMP mammograms: Original BD Assignment

Second radiologist reading

2nd Direct Q 2nd Descriptors

Digitization of mammograms

BD category assignment

BD category assignment

Digital images analysis

BD category assignment

Original vs. 2nd Descriptors

Original vs. Digital

Original vs. 2nd Direct Q

Fig. 1. Methods Study Design.

Page 3: 2-AJMBS-Yaghiyan_et_al_2011.pdf

10

Inter-rater Reliability Assessment

All 50 mammograms were re-read by a second Board-certi-fied radiologist from the Department of Radiology (MCM) at the University of Cincinnati (UC). The radiologist reported findings in a standardized form that included both a direct question about the degree of breast density and a list of different mammogram descriptors (linked to the FMMP mammography codes). The ra-diologist provided a Yes/No response to each of the descrip-tors. The second radiologist estimated density for right and left breast separately.

For comparison of the two radiologists’ readings, we com-bined information from both breasts to be consistent with the coding of the original mammogram report. The overall density for the mammogram corresponded to the highest degree of den-sity assigned by the second radiologist to either of the breasts. We compared the original density category and 1) the density level directly assigned by the second radiologist (2nd-direct Q) and 2) the density level corresponding to the descriptors select-ed by the second radiologist (2nd-descriptors). The inter-rater reliability analysis was first conducted for all 50 mammograms. For statistical analysis, we created subsets of two density cat-egories at a time (high and intermediate, high and low, low and intermediate), and subsets of mammograms by calendar year (≤1995, 1996-2000, >2000).

Quantification of Density from Digital Images

All mammograms were scanned with a standard equipment (DiagnosticPRO Advantage, VIDAR Systems Corporation, VA, USA) available through the UC Department of Radiology, using one of four resolutions (1st: 870 microns/8 bit precision; 2nd: 670 microns/8 bit precision; 3rd: 220 microns/12 bit precision; and 4th: 170 microns/12 bit precision, respectively). Using a single resolution would result in loss of important information as films used in the study were done over a 16 year period and had different technical specifications. Manipulation of the digi-tal images and density estimation were performed using OSIRIS 4 Software (Geneva, Switzerland).

First, the medical physicist (ARM) manually excluded the pec-toral muscle area on the digital image; then, the rest of the breast area was divided into three irregularly shaped areas (lower, middle, and upper triads) of approximately equal size (based on the number of pixels). Density within each of the tri-ads was characterized by Pixel Average Grayness (PAG)-an average of pixel grayness levels weighted by the number of pixels.

Statistical analysis of digital images aimed to find a single density estimate for each mammogram that could be used for assignment to a density category. To account for the poten-tial effect of differences in resolution, PAG values for the tri-ads were normalized to resolution 1 (the largest group). The maximum PAG value across all triads for the given mammo-gram (both breasts) was used as a critical density estimate. The critical density estimates for all 50 mammograms were ranked from low to high, and the mammograms were assigned density categories based on their ranks and according to the distribu-tion of the original density categories (6 low, 20 intermediate, and 24 high). The density group assignment from the digital image was compared against the original breast density cat-egory (Figure 1). To explore the effects of resolution adjustment on the analysis results, a restricted analysis was conducted on the subset of mammograms scanned with resolution 1 (baseline group, PAG values never adjusted) and on the subset of mam-mograms scanned with resolution 3 (using unadjusted PAG val-ues). An additional analysis was conducted on the subset of 39 mammograms that had a perfect agreement between the two radiologists’ readings.

Statistical Analysis

The data were analyzed using SAS (version 9.2, SAS Institute, Cary, NC). The agreement from all comparisons was assessed with the kappa statistic (simple or weighted) (< 0 - less than chance agreement; 0.01–0.20 - slight agreement; 0.21– 0.40 - fair agreement; 0.41–0.60 - moderate agreement; 0.61–0.80 - substantial agreement; 0.81–0.99 - almost perfect agreement) (Viera, 2005). PAG values were adjusted for the 1st resolution

Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

Characteristic High (N=24) Intermediate (N=20)

Low (N=6)

Age at the time of mammogramMean (SD) 48.4 (7.3) 55.2 (11.3) 58.3 (8.3)BMI at the time of mammogram Mean (SD) 24.0 (3.3) 25.2 (2.9) 22.4 (1.9)Premenopausal women, N (%) 20 (83.0 %) 3 (50.0 %) 11 (55.0 %)Year of mammogram, N (%)

≤19951996-2000>2000

8 (33.3 %)8 (33.3 %)8 (33.3 %)

5 (25.0 %)8 (40.0 %)7 (35.0 %)

2 (33.3 %)2 (33.3 %)2 (33.3 %)

Table 1. Distribution of participant characteristics by density category.

Page 4: 2-AJMBS-Yaghiyan_et_al_2011.pdf

Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

using Analysis of Variance (ANOVA) for the nested hierarchical design.

The study was conducted under the Institutional Review Board approval from the University of Cincinnati (last approved on January 14, 2009).

Results

The distribution of participant characteristics by density cate-gory is presented in Table 1. The mean age at the mammogram date for the original high density group (48.4 years) differed significantly from both the intermediate (55.2 years, p=0.03) and the low density (58.3 years, p=0.01) categories (Table 2). The high density group contained significantly larger propor-tion (p=0.01) of premenopausal women (20/24, or 83%). The mean BMI (kg/m2) for the low density group (22.4) differed significantly from BMI for the intermediate density group (25.2, t-test p=0.04).

Inter-rater Reliability Assessment

Using the direct question on overall density (2nd direct Q), the second radiologist rated 23 mammograms as high [46.00%], 15 mammograms as intermediate [30%], and 12 mammograms as low density films [24%]. When using the information from the descriptors (2nd descriptors), the density category distribution was as follows: 25 [50%] high, 5 [10%] intermediate, and 20 [40%] low density films.

Substantial agreement (weighted kappa statistic [wK] 0.73) was observed between the original density category assignment and the overall density assignment from the second radiologist reading (Table 2). Almost perfect agreement was noted be-tween two radiologists for subsets of high vs. intermediate and high vs. low density groups (K=0.86 and 0.81, respectively).

11

When grouped by the calendar year of mammogram, inter-rat-er agreement was substantial for all year intervals (wK= 0.77, 0.69, and 0.70 for ≤1995, 1996-2000, >2000, respectively). Only moderate agreement was observed when codes applied to the second radiologist reading were used to determine the density category (2nd-descriptors), compared to the original density category assignment (wK=0.55). In both comparisons (2nd-direct and 2nd-descriptors), the largest disagreement was observed between assignments to intermediate vs. low density category with the second radiologist being more likely to assign the low density category. Cross tabulation of the original breast density category assignment and that from 2nd-direct Q and 2nd-descriptors method indicated the exact agreement of 78% (39 films) and 62% (31 films), respectively (Table 2).

Quantification of Breast Density from Digital Image

Of 50 mammograms, 23 mammograms were scanned with resolution 1, 2 mammograms were scanned with resolution 2, 22 mammograms were scanned with resolution 3, and 3 mam-mograms were scanned with resolution 4. The middle triad was consistently denser (higher PAG value) compared to the lower and the upper triad across all original density categories.Cross tabulation of the breast density group assignment from digital data (adjusted for resolution) and the original breast density category indicated the exact agreement of 52% (26 films) between two methods (wK=0.27). Better agreement was seen when the analysis was limited to the films originally catego-rized as high or intermediate density (Kappa 0.36). None of the 24 originally high density mammograms was categorized as low density using the breast density estimation from the digital image; however, 8 out of 24 mammograms (33.33%) were as-signed intermediate density.

The results of the restricted analysis on resolution 1 group

Table 2. Agreement between density category as determined using the original FMMP mammography codes and den-sity category assignment from the second radiologist’s reading.

Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

Study group/subseta

Original density category vs. 2nd-Directb Original density category vs. 2nd-Descriptorsc

N Exact Agreement

Kappa coefficient [95% CI]

N Exact Agreement

Kappa coefficient [95% CI]

Whole study group

50 78 % 0.73 [0.59-0.86]d 50 62 % 0.55 [0.39-0.70]d

High and intermediate density only

44 93 % 0.86 [0.71-1.00] 44 89 % 0.77 [0.58-0.96]

Intermediate and low density only

26 65 % 0.31 [-0.0005-0.62] 26 35 % 0.02 [-0.17-0.20]

High and low density only

30 93 % 0.81[0.57-1.00] 30 93 % 0.81 [0.57-1.00]

a Selected using original density category assignment from FMMP mammography codes. b Agreement between density category, as-signed using original FMMP mammography codes and density category selected by second radiologist. c Agreement between density category, assigned using FMMP mammography codes and assigned using FMMP codes selected by second radiologist. d Weighted kappa statistic.

Page 5: 2-AJMBS-Yaghiyan_et_al_2011.pdf

12Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

only (baseline, never adjusted) and the restricted analysis on the resolution 3 group only (using unadjusted PAG values) are presented in Table 3. When the analysis was restricted to the 39 mammograms with agreement in the density category as-signment from the original mammography codes and the overall density assessment from the second radiologist’s reading, the best improvement was noticed for the subset of high and inter-mediate density groups. However, the kappa statistic remained within the fair to moderate agreement range (Table 3).

Discussion

This study was designed to find the best method to character-ize mammographic breast density for a planned retrospective study. Our findings indicate that using the FMMP mammography codes for characterization of breast density is better than using information from digitized radiography films.

This study involved mammograms from a large cohort of women with multiple mammograms obtained over time. The

Results adjusted forresolution 1

Resolution 1only

Resolution 3-unadjusted PAG

Results adjusted forresolution 1

N=39Study group/subset

N Kappa coefficient[95% CI]

N Kappa coefficient[95% CI]

N Kappa coefficient[95% CI]

N Kappa coefficient[95% CI]

Whole study group 50

0.27 a

[0.06-0.47]23

0.38 a

[0.09-0.38]22

0.38 a

[0.07-0.69]39

0.30 a

[0.08-0.53]High and intermediate density only

440.36

[0.09-0.64]20

0.39[-0.01-0.80]

190.45

[0.04-0.85]34

0.45[0.15-0.75]

Intermediate and low density only 26

-0.03 [-0.35-0.28]

14-0.09

[-0.53-0.36]10

0.07 [-0.43-0.58]

17-0.05

[-0.43-0.33]High and low density only 30

0.24 [-0.08-0.57]

120.5

[0.08-0.92]15

0.33 [-0.16-0.83]

270.25

[-0.12-0.62]

Table 3. Agreement between different breast density groups assignments from radiologist readings with those from PAG (adjusted) ranks.

a Weighted kappa statistic.

Page 6: 2-AJMBS-Yaghiyan_et_al_2011.pdf

13Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

FMMP mammograms were performed over a 16 year period, which allowed a stratified selection of mammograms by both the density category and the calendar year of the mammo-gram. The retrospective study of breast density determinants will use only the extreme breast density phenotypes (low and high). However, we needed to include the intermediate density category in this study to detect disagreement in the assignment of originally low or high density mammograms to the intermedi-ate density category by either of the investigated methods. The number of mammograms in the study was limited by funding and time allocated for this pilot project. However, the significant results suggest that this sample size was adequate to detect the existing differences.

The original density assignment was based on the combined information from both craniocaudal (CC) and MLO views, and some information on density might have been lost by selecting only MLO views for this study. However, previous studies have reported high correlation between density levels estimated from CC and MLO views by either radiologists or computer-aided techniques (Martin et al., 2006, Jeffreys et al., 2003). The FMMP mammograms differed with respect to their techni-cal specifications and quality. The use of four resolutions in this study was necessary to obtain the best information possible from the image. Comparison of the kappa statistics from the analy-ses of resolution 1 group only (never adjusted) and unadjusted PAG values for resolution 3 group only, with the results from all four resolutions combined suggests that the findings are similar, though still slightly affected by the adjustment.

Density categories from digital output were assigned by ranking all mammograms by the PAG values and assigning the number of mammograms to each category corresponding to the original distribution of the density categories in the sample. Other methods for assigning density categories from the digi-tal output were also explored. We considered using overlap areas between 95% Confidence Intervals for the PAG values across three density groups to assign films with the PAG values within the overlap ranges into the other closest density category. However, this method could not be applied as the confidence intervals for intermediate (100.5-197.0) and low (101.4-191.9) density completely overlap the much more narrow confidence interval for high density (130.9-195.6), which did not allow us to clearly define PAG ranges for each of the density categories. We also considered using a cumulative distribution plot for PAG values to detect any natural cut points for separating images into the density categories, but there was no distinct point for change in slope that would define the intermediate and high density categories.

For all three original density categories, the images that were assigned the same density categories from the original radiologist’s reading and the digital output (concordant) were compared against the images that were assigned different cat-egories by two methods (discordant). Concordant mammograms tended to have dense pixels more concentrated in certain ar-eas of the image, while discordant mammograms consistently showed more diffuse distribution of dense pixels throughout the image (Figure 2). It is possible that using PAG does not take into account the distribution pattern of the dense pixels within the

image. Previous studies have noted that fatty breasts tend to display

more diffuse breast tissue patterns (Klifa et al., 2004). In our study, the diffuse pattern made density misclassification by the digital output more likely and we expected to see a difference in BMI between women with concordant vs. discordant mammo-grams. For the high density category with the best agreement between two methods, there was no difference in BMI between women with concordant and discordant mammograms (p=1.00). There was a statistically significant difference in BMI between women with concordant and discordant mammograms in the original intermediate density category (23.7 and 26.4, respec-tively; p=0.04), but no significant difference was found for the original low density category, which might be a result of a small number of films. However, BMI in women with concordant mam-mograms was lower (19.4 and 23.0, respectively).

The middle triad was found to be denser compared to the lower and the upper triads across all three original density cat-egories. These findings were consistent with previously reported results suggesting that the denser area corresponds to the cen-tral regions of the breast (Li et al., 2004).

Our findings on agreement between breast density esti-mation by different radiologists are consistent with the previ-ously reported results (Ooms et al., 2007, Ciatto et al., 2005, Roubidoux et al., 2003, Berg et al., 2000, Jong et al., 1996). Previous studies demonstrated that the overall classification of density from digitalization of old mammography films tended to be good, but the breast density assignment from the digital images tended to be low compared with the expert (Muhim-mah & Zwiggelaar, 2006, Lee-Han et al., 1995, Jeffreys et al., 2003). Although studies suggested that computer-assisted tech-niques could be used for categorization of breast density, the classification of breast densities in those studies differs from the approach used to categorize breast density on the FMMP mam-mograms (Jeffreys et al., 2003, Lee-Han et al., 1995, Muhim-mah et al., 2006). The computer-assisted methods in previous studies were also different from the method used in our study (PAG values).

Based on these results, we will add additional codes to im-prove definition of the density phenotype. The results of this methods study are important for researchers conducting retro-spective studies of breast density using radiography films. Our findings indicate that a dual reading of the mammograms by two radiologists specifically asked to assess the level of breast density provides a better estimation of breast density and may be much less expensive.

Acknowledgements

Support for this project was provided by the Fernald Medical Monitoring Program and the UC Research Council as a Gradu-ate Student Summer Research Fellowship awarded to Yaghjyan.

References

American College of Radiology (ACR) (1998), Illustrated Breast Imag-ing Reporting and Data System (BIRADS TM). 3rd ed. Reston, VA:

Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)

Page 7: 2-AJMBS-Yaghiyan_et_al_2011.pdf

American College of Radiology. Berg WA, Campassi C, Langenberg P, and Sexton MJ (2000). Breast

Imaging Reporting and Data System Inter- and Intraobserver Vari-ability in Feature Analysis and Final Assessment. AJR 174:1769-1777.

Boyd NF, Martin LJ, Yaffe MJ, and Minkin S (2006). Mammographic density: a hormonally responsive risk factor for breast cancer. Jour-nal of the British Menopause Society 12:186-193.

Brisson J, Diorio C, and Mâsse B (2003). Wolfe’s Parenchymal Pattern and Percentage of the Breast with Mammographic Densities: Re-dundant or Complementary Classifications? Cancer Epidemiology Biomarkers & Prevention 12:728-732.

Ciatto S, Houssami N, Apruzzese A, Bassetti E, Brancato B, Carozzi F, Catarzia S, Lamberinia ML, Marcellia G, Pellizzonia R, Pescea B, Rissoa G, Russoa F, and Scorsolini A (2005). Categorizing breast mammographic density: intra- and interobserver reproducibility of BI-RADS density categories. The Breast; 14:269-75.

Ginsburg OM, Martin LJ, and Boyd NF (2008). Mammographic density, lobular involution, and risk of breast cancer. British Journal of Can-cer 99: 1369-1374.

Haiman CA, Bernstein L, Berg D, Ingles SA, Salane M, and Ursin G (2002). Genetic determinants of mammographic density. Breast Cancer Res 4:R5.

Harvey J, Bovbjerg V (2004). Quantitative assessment of mammo-graphic breast density: relationship with breast cancer risk. Radiol-ogy 230:29-41.

Haus AG (2002). Historical Technical Developments in Mammography Technology. Cancer Research & Treatment 1:119-126.

Highnam R, Jeffreys M, McCormack V, Warren R, Smith GD, and Brady M (2007). Comparing measurements of breast density. Phys Med Biol 52:5881–5895.

Jamal N, Ng K-H, Looi L-M, McLean D, Zulfiqar A, Tan S-P, Liew W-F, Shantini A, and Ranganathan S (2006). Quantitative assessment of breast density from digitized mammograms into Tabar’s patterns. Phys Med Biol 51:5843-5857.

Jeffreys M, Warren R, Smith GD, and Gunnell D (2003). Breast den-sity: agreement of measures from film and digital image. The British Journal of Radiology 76:561–3.

Jeffreys M, Warren R, Highnam R, and Smith GD (2008). Breast cancer risk factors and a novel measure of volumetric breast density: cross-sectional study. British Journal of Cancer 98:210-216.

Jong R, Fishell E, Little L, Lockwood G, and Boyd NF (1996). Mam-mographic signs of potential relevance to breast cancer risk: the agreement of radiologists’ classification. Eur J Cancer Prev 5:281-6.

Klifa C, Carballido-Gamio J, Wilmes L, Laprie L, Lobo A, DeMicco C, Watkins M, Shepherd J, Gibbs J, and Hylton N (2004). Quantifica-tion of Breast Tissue Index from MR data using Fuzzy Clustering. In Proceedings of the 26th Annual International Conference of the IEEE EMBS San Francisco, CA, USA. September 1-5.

Lee-Han H, Cooke G, and Boyd NF (1995). Quantitative evaluation of mammographic densities: a comparison of method of assessment. European Journal of Cancer Prevention 4:285-292.

Li H, Giger ML, Huo Z, Olopade OI, Lan L, Weber BL, and Bonta I (2004). Computerized analysis of mammographic parenchymal patterns for assessing breast cancer risk: Effect of ROI size and location. Med Phys 31:549-55.

Martin KE, Helvie MA, Zhou C, Roubidoux MA, Bailey JE, Paramagul C, Blane CE, Klein KA, Sonnad SS, and Chan H-P (2006). Mam-mographic Density Measured with Quantitative Computer-aided Method: Comparison with Radiologists’ Estimates and BI-RADS Cat-egories. Radiology 240:656-665.

Muhimmah I and Zwiggelaar R (2006). Mammographic Density Classifi-cation using Multiresolution Histogram Information. In Proceedings of the International Special Topic Conference on Information Technol-ogy in Biomedicine (ITAB ’06), Ioannina, Greece.

Ooms EA, Zonderland HM, Eijkemans MJ, Kriege M, Delavary BM, Burg-er C, Ansink AC (2007). Mammography: Interobserver variability in breast density assessment. The Breast 16:568–576.

Petroudi S, Kadir T, and Brady M (2003). Automatic Classification of Mammographic Parenchymal Patterns: A Statistical Approach. In: International Conference of the IEEE Engineering in Medicine and Biology Society 2:416–423.

Prevrhal S, Shepherd JA, Smith-Bindman R, Cummings SR, and Ker-likowske K (2002). Accuracy of Mammographic Breast Density Analysis: Results of Formal Operator Training. Cancer Epidemiology Biomarkers & Prevention 11:1389-1393.

Roubidoux MA, Kaur JS, Griffith KA, Sloan J, Wilson C, Novotny P, and Lobell M. Correlates of Mammogram Density in Southwestern Na-tive-American Women. Cancer Epidemiology Biomarkers & Preven-tion 2003; 12:552-8.

Tamimi RM , Byrne C, Colditz GA , and Hankinson SE (2007). Endog-enous Hormone Levels, Mammographic Density, and Subsequent Risk of Breast Cancer in Postmenopausal Women. J Natl Cancer Inst 99:1178 – 87.

Ursin G, Ma H, Wu AH, Bernstein L, Salane M, Parisky YR, Astrahan M, Siozon CC, and Pike MC (2003). Mammographic Density and Breast Cancer in Three Ethnic Groups. Cancer Epidemiology, Biomarkers & Prevention 12:332–338.

Vachon CM, Brandt KR, Ghosh K, Scott CG, Maloney SD, Carston MJ, Pankratz VS, and Sellers TA (2007). Mammographic Breast Density as a General Marker of Breast Cancer Risk. Cancer Epidemiology Biomarkers & Prevention 16: 43–9.

Viera AJ and Garrett JM (2005). Understanding Interobserver Agree-ment: The Kappa Statistic. Fam Med 37:360-3.

Zhou C, Chan HP, Petrick N, Helvie MA, Goodsitt MM, Sahiner B, Had-jiiski LM (2001). Computerized image analysis: Estimation of breast density on mammograms. Med Phys 28:1056-1069.

14Atla

s Jo

urna

l of

Med

ical

& B

iolo

gica

l Sci

ence

s -

ISSN

194

9-13

52. P

ublis

hed

By A

tlas

Publ

ishin

g, L

P (w

ww

.atla

s-pu

blish

ing.

org)