Upload
others
View
18
Download
0
Embed Size (px)
Citation preview
AUTOMATIC SEGMENTATION OF THE LOWER LIMB ANATOMY
Lena Paelinck Student number: 01307508 Promotor: Prof. Dr. Emmanuel Audenaert Copromotor: Prof. Dr. Christophe Pattyn A dissertation submitted to Ghent University in partial fulfilment of the requirements for the degree of Master of Medicine in Medicine Academic year: 2016 – 2018
AUTOMATIC SEGMENTATION OF THE LOWER LIMB ANATOMY
Lena Paelinck Student number: 01307508 Promotor: Prof. Dr. Emmanuel Audenaert Copromotor: Prof. Dr. Christophe Pattyn A dissertation submitted to Ghent University in partial fulfilment of the requirements for the degree of Master of Medicine in Medicine Academic year: 2016 – 2018
III
ACKNOWLEDGEMENTS
After one and a half years of hard work accompanied with laughter and tears, the process of
writing this dissertation comes to an end. Many people have played a crucial role in the
accomplishment of this work and to them I want to express my gratitude.
First of all, I would like to thank the University of Ghent and the department of Orthopaedics
and Traumatology of the UZ Ghent for giving me the chance to become part of a group of
professionals that has been investigating automatic segmentation for years and for giving me
the permission to help them in this research and to write this dissertation.
Secondly, I would like to express my gratitude to my supervisors Prof. Dr. Emmanuel
Audenaert and Prof. Dr. Christophe Pattyn for their guidance. Their insightful advices and
support through these eighteen months played an essential part in accomplishing this work.
Thank you for finding the time to help me in this process of researching and writing.
A special word of thanks to my mentor Dr. Jan Van Houcke is in order. His comments and
directives were essential to establish this work. He encouraged me to put my own ideas into
this dissertation but guided me when I was heading in the wrong direction. I appreciate the
time given into answering my questions, the reading over and making improvements to my
work.
Finally, a profound expression of gratitude towards my parents, sister, family and friends. Their
everlasting support and encouragement not only in this dissertation, but through all the years
of studying medicine, mean a lot to me. Thank you for finding the time to read through this
work and improving it. Without, I would not have been able to accomplish this work. Also, I
would like to say thanks, to my boyfriend, Brent Van Riet, for giving me the strength to complete
this work, for helping me through difficult times and for always being there whenever I needed
it.
A heartfelt thank you to all!
IV
TABLE OF CONTENTS
ACKNOWLEDGEMENTS ......................................................................................................... III
TABLE OF CONTENTS ...........................................................................................................IV
LIST OF FIGURES ....................................................................................................................VI
LIST OF TABLES .....................................................................................................................VI
LIST OF ABBREVIATIONS.....................................................................................................VII
ABSTRACT ................................................................................................................................ 1
NEDERLANDSE SAMENVATTING .......................................................................................... 3
1 INTRODUCTION................................................................................................................. 5
1.1 Anatomy of the lower limb ....................................................................................... 6
1.1.1 The Pelvis or the os coxae ........................................................................................... 6
1.1.2 The Femur or the thighbone ........................................................................................ 7
1.1.3 The Tibia or the shinbone ............................................................................................ 8
1.1.4 The Fibula or the splint bone ....................................................................................... 8
1.1.5 The Patella or the knee cap ......................................................................................... 9
1.1.6 The Talus or the tarsal bone ........................................................................................ 9
1.1.7 The Calcaneus or the heel bone ................................................................................. 9
1.1.8 The Lumbar Vertebrae................................................................................................ 10
1.1.9 The sacrum ................................................................................................................... 10
1.2 Imaging technology ................................................................................................. 11
1.2.1 Introduction to Computed Tomography ................................................................... 11
1.2.2 Pixels and voxels ......................................................................................................... 13
1.3 Introduction to segmentation................................................................................. 13
1.4 (Semi-) automatic segmentation methods ........................................................... 14
1.4.1 Thresholding (global vs local thresholding) ............................................................. 15
1.4.2 Gradient based method .............................................................................................. 18
1.4.3 Shape models .............................................................................................................. 19
1.4.4 Free deformation ......................................................................................................... 21
1.5 Practical applications of image segmentation ..................................................... 23
1.6 Validation metrics .................................................................................................... 24
1.6.1 Mean error distance (ASD & RMS) .......................................................................... 24
1.6.2 Maximum fault (HD) .................................................................................................... 24
V
1.6.3 Dice Similarity Index.................................................................................................... 25
2 MATERIALS AND METHODOLOGY .............................................................................. 27
2.1 Manual segmentation using Mimics...................................................................... 27
2.2 Construction of the SSM......................................................................................... 28
2.3 The pipeline of the automatic segmentation of the full lower limb ................... 29
2.3.1 Initialisation and segmentation .................................................................................. 29
2.3.2 Supervised training of the shape models and continuous updating of the
segmentation pipeline ................................................................................................................. 31
2.3.3 Solids and appearance ............................................................................................... 31
2.4 Used validation metrics .......................................................................................... 33
2.4.1 Mean error distance (ASD & RMS) .......................................................................... 33
2.4.2 Maximum fault (HD) .................................................................................................... 34
2.5 Inter- and intra-observer differences .................................................................... 34
3 RESULTS.......................................................................................................................... 35
4 DISCUSSION .................................................................................................................... 38
4.1 General ...................................................................................................................... 38
4.2 Comparison with previously developed programs ............................................. 39
4.3 Limitations ................................................................................................................ 44
4.4 Suggestions for further research .......................................................................... 44
5 CONCLUSION .................................................................................................................. 46
6 REFERENCES.................................................................................................................. 47
VI
LIST OF FIGURES
Figure 1: Anatomy of the pelvis ............................................................................................................................... 7 Figure 2: A) Anatomy of the Femur B) Anatomy of the Tibia C) Anatomy of the Fibula ......................................... 8 Figure 3: Anatomy of the Patella .............................................................................................................................. 9 Figure 4: A) Anatomy of the Talus B) Anatomy of the Calcaneus ............................................................................ 9 Figure 5: A) Anatomy of the Lumbar Vertebra B) Anatomy of the Sacrum ........................................................... 10 Figure 6: The Hounsfield scale................................................................................................................................ 12 Figure 7: Illustration of pixels ................................................................................................................................. 13 Figure 8: Common drawbacks on manual and automatic segmentation .............................................................. 14 Figure 9: Illustration of thresholding ...................................................................................................................... 15 Figure 10: Histogram based threshold ................................................................................................................... 16 Figure 11: Histogram of CT values .......................................................................................................................... 17 Figure 12: Illustration of gradient based method .................................................................................................. 18 Figure 13: Illustration of the SSM and the Eigenmodes ......................................................................................... 21 Figure 14: Illustration of the free deformation phase ........................................................................................... 22 Figure 15: Illustration of the Dice Similarity Index (DSI) ........................................................................................ 25 Figure 16: Average shape and appearance of the low- and high-resolution SSM ................................................. 28 Figure 17: Overview of the automatic segmentation pipeline .............................................................................. 31 Figure 18: Illustration of the correspondence feature ........................................................................................... 32 Figure 19: Illustration of solid cortical, cartilage and marrow volumes ................................................................ 32 Figure 20: Illustration of the Hausdorff Distance ................................................................................................... 34
LIST OF TABLES
Table 1: Accuracy and observer variation in segmentation of the respective segments ....... 36
Table 2: Comparison with previously developed automatic segmentation techniques .......... 40
VII
LIST OF ABBREVIATIONS
2D Two-dimensional
3D Three-dimensional
AD Average Distance
ASD Average Surface Distance
ASM Active Shape Model
CAOS Computer Aided Orthopaedic Surgeries
CAT Computed Axial Tomography
CT Computed Tomography
CTA Computed Tomography Angiography
DSI Dice Similarity Index
HD Hausdorff Distance
HT Higher Threshold
HU Hounsfield Unit
LAM Local Appearance Model
LT Lower Threshold
MR(I) Magnetic Resonance (Imaging)
OE Overlap Error
PCA Principal Component Analysis
RMS Root Mean Square
ROI Region of Interest
SD Standard Deviation
SSM Statistical Shape Model
1
ABSTRACT
Background: Image segmentation allows transforming a 2D image, obtained from CT or MRI,
into a 3D image. Hence, image segmentation has a growing interest, especially in diagnosis
and planning of surgery. It gains traction in orthopaedics as well as other specialities like
cardiology and neurology. However, image segmentation is mostly done manual - a laborious
and time-consuming task to be performed by trained professionals. The increase in the number
of segmentations results in a need for automatic segmentation techniques. The aim of this
study is to validate the accuracy of our proposed developed segmentation protocol compared
with the golden standard of manual segmentations. Secondly, the inter- and intra-observer
variability of the manual segmentation will be evaluated.
Materials and methodology: The pipeline of the proposed method for automatic
segmentation is presented. The fully automatic segmentation is divided into three phases.
First, there is an initialization and segmentation phase, separated in four steps. Afterwards,
the training of the shape model is supervised and the segmentation pipeline is continuously
updated. At last the appearance and solids phase occurs. Additionally, the method of validation
as well as the validation metrics and the inter- and intra-observer differences are explained.
Average surface distance (ASD) and Hausdorff distance (HD) are used as parameters for
validation. Inter- and intra-observer differences are reported.
Results: The mean error or the ASD, measured in 10 cases, ranges from 0,53 mm to 0,76
mm, with an average of 0,65 mm and the maximum error or the Hausdorff distance ranges
from 2,02 mm to 7,84 mm, with an average of 4,05 mm. The inter-observer variability,
measured in three cases, has the following results: an average error ranging from 0,39 mm to
0,61 mm, with an average of 0,44 mm and a maximum error ranging from 1,67 mm up to 3,74
mm, with an average of 2,29 mm. The intra-observer variability in one case was also measured
and the following results are found: an average error ranging from 0,17 mm to 0,32 mm and a
maximum error ranging from 0,78 mm till 2,29 mm.
Discussion: Several previously developed automatic segmentation techniques are compared
to ours. The results of these techniques range from 0,5 till 5,4 mm ASD. Automatic
segmentation techniques which only implement the segmentation of a part of a bone or a joint
are not included in the comparison. The limitations of our study are mentioned, such as the
small number of CT data sets included in the beginning of our statistical shape models, the
small number of manually segmented CT data sets as limitation for the validation of our
2
program and the small number of studies found to compare our results to. Finally, suggestions
for further research are described: a possibility to make an extension of our developed
automatic segmentation program to MRI, the full skeleton and other disciplines like cardiology
and neurosurgery.
Conclusion: It is important to note that this study is the first of its kind to develop an automatic
segmentation technique for the full lower limb. After comparing with previously developed
techniques, our automatic segmentation technique seems to generate highly competitive
results and, by consequence, was found to be the most accurate technique.
3
NEDERLANDSE SAMENVATTING
Achtergrond: Segmentatie laat toe om een 2D beeld, verkregen via CT of MRI, te
transformeren tot een 3D beeld. Bijgevolg is er een groeiende interesse in segmentatie, vooral
bij het stellen van diagnoses en plannen van chirurgie. Het is van toepassing in orthopedie,
maar ook in andere specialisaties zoals cardiologie en neurologie. Echter, heden ten dage
wordt segmentatie meestal manueel gedaan, wat een intensief en tijdrovend werk is, dat alleen
is weggelegd voor experten. Voorgenoemde zaken duiden op de nood aan automatische
segmentatie technieken. Het doel van deze studie is om ons nieuw ontwikkeld automatische
segmentatie programma te valideren door dit te vergelijken met de gouden standaard, in casu
de manuele segmentatie. Bijkomend worden ook de inter- en intra-observator variabiliteit van
de manuele segmentatie bestudeerd.
Materialen en methodologie: Het werkingsmechanisme van de voorgestelde methode om
automatisch te segmenteren wordt voorgesteld. Het voorgestelde automatische segmentatie
programma is onderverdeeld in drie fases. Eerst vindt de initialisatie en segmentatie plaats.
Deze eerste fase is dan nog eens onderverdeeld in 4 stappen. Daarna wordt de training van
het statistische vormmodel gesuperviseerd en de werking ervan voortdurend geüpdatet. De
laatste fase is diegene waarbij uiterlijke kenmerken aan het 3D model gegeven worden. Verder
wordt zowel de methode van valideren als de parameters die gebruikt worden voor de validatie
en de inter- en intra-observator verschillen bekeken. The gemiddelde oppervlakte afstand en
de maximale afstand worden gebruikt als parameters voor validatie. De inter- en intra-
observator verschillen worden beschreven.
Resultaten: De gemiddelde fout of de gemiddelde oppervlakte afstand, bestudeerd bij 10
patiënten, varieert van 0,53 mm tot 0,76 mm met een gemiddelde van 0,65 mm. De maximum
fout of de HD varieert van 2,02 mm tot 7,84 mm met een gemiddelde van 4,05 mm. De inter-
observator variabiliteit werd berekend bij 3 patiënten en gaf volgende resultaten: een
gemiddelde fout die varieert van 0,39 mm tot 0,61 mm, met een gemiddelde van 0,44 mm en
een maximum fout die varieert van 1,67 mm tot 3,74 mm met een gemiddelde van 2,29 mm.
De intra-observator variabiliteit in 1 CT data set werd berekend en volgende resultaten werden
bekomen: een gemiddelde fout variërend van 0,17 mm tot 0,32 mm en een maximum fout die
varieert van 0,78 mm tot 2,29 mm.
Discussie: Verschillende automatische segmentatie technieken, ontwikkeld door andere
onderzoeksgroepen, werden vergeleken met de onze. Het resultaat van deze technieken
4
varieerde van 0,5 tot 5,4 mm ASD. Studies waarbij de automatische segmentatie techniek zich
toelegde op een deel van een bot of op een gewricht werden uitgesloten uit deze vergelijking.
Verder worden ook de beperkingen van onze studie besproken, zoals o.a. het kleine aantal
CT data sets die opgenomen zijn in het begin van ons statistisch vormmodel, het kleine aantal
manuele segmentaties, die een limitatie vormen voor de validatie en het kleine aantal
gevonden studies waarvan de resultaten vergeleken kunnen worden met de onze. Als laatste
werden de mogelijkheden voor verder onderzoek beschreven. Zo is er onder andere een
mogelijkheid tot uitbreiding van deze techniek tot MRI, het volledige skelet en andere
specialisaties zoals cardiologie en neurochirurgie.
Conclusie: Het is belangrijk te vermelden dat deze studie de eerste is die een automatische
segmentatie techniek ontwikkelt voor het volledige onderste lidmaat. Na de vergelijking met
eerder ontwikkelde technieken, bleek deze studie sterk concurrerende resultaten te vertonen
en daarbij dus ook over de meest accurate automatische segmentatie techniek te beschikken.
5
1 INTRODUCTION
Ever since the Ancient World, several physicians studied the human body in multiple ways.
Hippocrates and Vesalius are just a few examples of famous physicians who had an enormous
impact on medicine from then and now (1,2). The skeleton, the organs and the details of the
human body have been long known, studied and illustrated before modern medicine existed.
An important difference between medicine from then and now is the way we study the human
body. Before, the human body was studied on corpses or during an operation. Nowadays with
the help of medical imaging, the human body can be studied without the need for invasive
actions.
Even though modern technology made it much easier to study the human body, Computed
Tomography (CT) and Magnetic Resonance (MR) scanners still depict it in two-dimensional
(2D) images. In order to allow for three-dimensional (3D) evaluation of the anatomical
structures, segmentation is required.
Image segmentation is the process of segmenting or partitioning a digital image into multiple
segments or in a series of pixels with the aim to change the representation of the image into
something that gives more information and is easier to scrutinize (3). It converts a complex
image to a simple image that is easy to interpret (4). In specific, image segmentation is the
process of labelling every pixel in the image so that pixels with the same label share the same
characteristics (5). In plain words, segmentation as described in this paper is the extraction of
the part one wants to see in 3D, e.g. the skin, the heart, bones, etc., from a set of 2D slices. It
is a mainly manual technique that involves selecting the region of interest in each 2D image
followed by computing the 3D shape.
However, manual segmentation is a laborious and time-consuming task that must be
performed by trained professionals with the right anatomical knowledge. It is estimated that
the segmentation of a full skeletal lower limb takes around 100 man hours (6). If 3D evaluation
of anatomy wants to reach more physicians, then automatic segmentation techniques need to
be developed.
Segmentation of individual anatomy is used to study the anatomic variation of structures.
Furthermore, it allows for running personalized computational models in mechanical simulation
studies.
6
Up till now, various semi- to automatic segmentation routines have been described in literature.
The average surface distance (ASD) ranges from 0,5 mm up to 5,4 mm. Hence, there is room
for improvement (7–13).
The aim of this thesis is to validate the accuracy of our newly developed segmentation protocol
and to compare this to the golden standard, manual segmentations, and previously developed
automatic segmentation techniques. Additionally, the inter- and intra-observer variability of the
manual segmentation will be evaluated. The current protocol is hypothesized to be superior to
previous techniques. Furthermore, the average of the inter- and intra-observer manual
segmentations are expected to be in the same range.
In light of this study, I performed the majority of the manual segmentations and performed the
statistical analysis comparing manual to automatic segmentation as well as the intra and inter
observer variability.
1.1 Anatomy of the lower limb
The bones of the human skeleton included in this study are the pelvic bones, the femur, the
tibia, the fibula, the patella, the calcaneus, the talus, lumbar vertebrae and the sacrum. In this
section, the anatomy of these bones is described extensively and the joint they take part in is
mentioned (14–18).
1.1.1 The Pelvis or the os coxae (Figure 1)
The pelvis consists of three parts: the os pubis or the pubic bone, the os ilium and the os ischii
or the ischial bone. In childhood, these three bones are connected by cartilage and by the age
of 15-17 the fusion begins. From the age of 20 to 25 the fusion is completed and together they
form an acetabulum, which is located at the lateral side of the hip bone. This is the cavity where
the femur (the thighbone) will fit in the pelvis. The acetabulum and the femur head form the hip
joint. The biggest bone in the pelvis is the os ilium. It is located at the top and the backside of
the pelvis and consists of two parts: the ala ossis ilii and the corpus ossis illii. The latter is the
part of the ilium that fusions with the other two bones. The second bone is the os pubis,
consisting of the corpus ossis pubis, the ramus superior ossis pubis and the ramus inferior
ossis pubis. The two latter form the foramen obturatum and again the corpus part of the os
pubis is the one that fuses with the os ilium and the os ischii. At last, we have the os ischii,
which consists of the corpus ossis ischii (the part that fusions) and the ramus ossis ischii. The
7
ramus forms the lower part of the foramen obturatum. Formed by the os pubis and the os ischii,
the foramen obturatum is the space where the obturator nerve, vein and artery pass.
As mentioned above, these three bones form the hip bone and each hip consists of two hip
bones which are connected by fibrocartilage at the symphysis pubica.
1.1.2 The Femur or the thighbone (Figure 2)
The femur consists of a proximal and a distal extremity, called the proximal and distal
epiphysis, and in between the corpus femoris or the diaphysis is located. At the upper or
proximal epiphysis, we find the caput femoris, which is the part that will rotate in the acetabulum
of the hip bone. The caput contains a depression which is called the fovea capitis femoris. This
fovea does not take part in the joint but serves as an attachment place for the ligamentum
teres. Between the caput and corpus femoris we find the collum femoris, which is also called
the neck of the femur. At the sides, we find two bulgings, the trochanter major at the lateral
side, and the trochanter minor at the medial-back side. These trochanters major and minor are
the insertion point of multiple muscles. At the lower or distal epiphysis is a surface, the facies
patellaris, where the patella touches the femur. At the lower end, we find 2 surfaces, the
epicondylus medialis and the epicondylus lateralis, where the tibia touches the femur. The
femur, the tibia and the patella form the knee joint.
Figure 1: Anatomy of the pelvis (18) (from Sobotta Atlas of Human Anatomy)
8
1.1.3 The Tibia or the shinbone (Figure 2)
The tibia has the same structure as the femur, it consists of a proximal and distal epiphysis
and a diaphysis or corpus tibiae which has a triangular structure. The proximal epiphysis has
a condylus medialis and a condylus lateralis, where the tibia touches the femur. It also has a
tuberositas tibia, which is a bulging for the insertion of the ligamentum patellae. The distal
epiphysis has an incisura fibularis, where the fibula touches the tibia, and a malleolus medialis,
which is also called the inner ankle.
1.1.4 The Fibula or the splint bone (Figure 2)
The fibula has the same structure as the other long bones like the femur and the tibia and
consists of a diaphysis and a proximal and distal epiphysis. The proximal epiphysis is the caput
fibulae, which has a protrusion called the apex capitis fibulae. This caput fibulae is connected
to the diaphysis by the collum fibulae. Like the tibia, the corpus of the fibula has a triangular
structure. At the distal epiphysis, there is again a bulging, called the malleolus lateralis or the
outer ankle.
Figure 2: A) Anatomy of the Femur B) Anatomy of the Tibia C) Anatomy of the Fibula (18)
(from Sobotta Atlas of Human Anatomy)
9
1.1.5 The Patella or the knee cap (Figure 3)
The patella is a little bone that has a role in the knee
joint. The other bones that form the knee joint are the
femur and the tibia. The patella has a heart shaped
structure and is located with the point (the apex
patella) downwards or distal. The basis patella is
located upwards or proximal. The dorsal side
articulates with the femur and the anterior side (the
facies anterior) is touchable through the skin.
1.1.6 The Talus or the tarsal bone (Figure 4)
The talus is the most important bone of the foot, since it carries the weight of the whole body.
It has a caput tali (the head) and a corpus tali (the body) are connected by a collum tali ( the
neck). On the corpus tali, there is a trochlea tali with a facies superior, a facies malleolaris
lateralis and a facies malleolaris medialis. The facies superior is where the talus touches the
tibia, the facies malleolaris lateralis touches the fibula and the facies malleolaris medialis
touches the tibia. Underneath, there is a facies articularis calcanea anterior and posterior,
where the talus touches the calcaneus.
1.1.7 The Calcaneus or the heel bone (Figure 4)
The calcaneus is the biggest bone of the foot. At the back, there is a tuber calcanei, also called
the heel and is the place where the Achilles tendon inserts. At the top (the cranial side), there
are three facies articularis, namely the facies articularis talaris anterior, media and posterior.
These three are the connection between the talus and the calcaneus.
Figure 3: Anatomy of the Patella (18) (from
Sobotta Atlas of Human Anatomy)
Figure 4: A) Anatomy of the Talus B) Anatomy of the Calcaneus (18) (from Sobotta Atlas of
Human Anatomy)
10
1.1.8 The Lumbar Vertebrae (Figure 5)
The lumbar vertebrae are a part of the vertebral column which consists of seven cervical
vertebrae, twelve thoracic vertebrae, five lumbar vertebrae, five sacral vertebrae (the sacrum)
and four or five coccyx vertebrae (the coccyx). The corpus of the lumbar vertebra is much
bigger than the corpora of the other vertebrae. The processus spinosus points to caudal. The
processi costales are the protrusions on the side and are typical for the lumbar vertebrae. After
the processus costales, the processus accessorius is located and together with the processus
mamillaris they are a remnant of the processus transversus. The pediculus and lamina arcus
vertebrae form the arcus of the lumbar vertebrae. In the central of this arcus there is a foramen
vertebrale for the spinal cord.
1.1.9 The sacrum (Figure 5)
The sacrum is the consequence of the fusion between the five sacral vertebrae and it consists
of 2 sides, the frontal concave side or the facies pelvina, and the dorsal convex side or the
facies dorsalis. The fifth lumbar vertebra leans on the only visible corpus vertebrae of the upper
sacral vertebra, the basis ossis sacri and on the ventral side also called the promontorium.
The apex ossis sacri touches the os coccyx. In the facies pelvina there are 4 geminated
foramina sacralia anterior (foramina sacralia posterior on the dorsal side). These foramina
sacralia are the apertures for the nervi spinales. Between the pairs of foramina sacralia anterior
there are lineae transversae, originated from the fusion of the sacral vertebrae.
Figure 5: A) Anatomy of the Lumbar Vertebra B) Anatomy of the Sacrum (18) (from Sobotta Atlas of Human Anatomy)
11
1.2 Imaging technology
1.2.1 Introduction to Computed Tomography
To study these bones without using an invasive procedure, there is a need for imaging
technology of which the purpose is to make a scan or copy of the human body in 2D. On this
scan, dependent on the type of scanning machine, one can see different structures of the body
like bones, muscles and/or organs. The two well-known scanning machines are the Computed
Tomography (CT) and the Magnetic Resonance Imaging (MRI). The CT scan is the only one
included in this study and will be explained shortly.
The CT (Computed Tomography) or CAT (Computed Axial Tomography) uses, just like the X-
ray radiography, X-radiation for making a copy of the human body. The X ray tube emits the X
rays and the detector, located on the other side of the patient and the X ray tube, receives the
radiation that goes through the body. Hence, the part of the body that is to be visualized, needs
to be located between the X ray tube and the detector. But instead of using one stationary X-
ray tube like X-ray radiography, the CT uses an x-ray tube that circles around the patient.
Equal to X-ray radiography, the CT scan makes a reconstruction of the density of structures in
the human body. The principle is that the radiation can go through some structures and while
others stop the radiation. It is comparable with visual light: visual light is also an
electromagnetic radiation, but contrary to X-rays it is visible and it cannot pass any tissue of
the human body. When you hold your hand towards the light, there is shadow on the floor. The
same applies for X-rays. Dense or radio-opaque structures like bone and metal (e.g.
pacemaker, artificial knee, etc.) stop the radiation, so the detector does not receive any
radiation. Bones and metal create a shadow on the detector and that is why bones and metal
appear white on the 2D image. Other low density or radiolucent structures such as air let the
radiation pass through the body what makes it appear black on the 2D image. Structures like
muscles, blood and fat are a shade of grey as they let a small amount of radiation pass through.
In contrast to the X-ray radiography, the CT scan detector gets more information from the
reception of X-rays. When the X-ray tube circles around the patient, it gives 3D information
(information in the depth) to the detector. Then it is the task of the computer algorithm to
transform this 3D information into a 2D image.
12
The attenuation coefficient is the measurement of how much a tissue can be penetrated by an
X-ray beam, i.e. it measures how much strength the beam has lost while passing through the
body. The intensity of the X-ray beam that went through the patient, is lower in dense or radio-
opaque structures. The Hounsfield scale is an alternative for the attenuation coefficient and is
expressed in Hounsfield Units (HU), where water is given a HU of 0 and air a HU of -1000.
The HU of water and air serve as reference values to which all other tissues are compared.
Bone has a HU of + 400 up to +3000. All the other tissues have a HU between – 1000 HU and
+ 1000 (19) (Figure 6).
When the X-ray tube finishes the circle, a single 2D slice is made by the information sent from
the detector to the computer. For the next 2D slice, the bed where the patient lies on, is moved
a little bit in the direction in which one wants to scan the human body. Then, the CT scan can
make a new full circle around the patient to create the next 2D image. In plain words, the
principle of a CT scan is to slice the body into multiple 2D images (20–24).
Just like the X-ray radiography, the CT has the disadvantage of exposing patients to radiation,
which can cause mutations in the DNA when the patient is frequently exposed. Compared to
the X-ray radiography, the CT scan gives a lot more information about soft tissues and bones.
The advantage compared to the MRI is the shorter waiting time as well as the time that is
needed for the patient to be lying in the CT scan to create a full copy of his body (21).
In this paper, as mentioned before, we only include CT imaging. MRI and X-ray radiography
are excluded. However, in addition to the conventional CT scan, the CTA (Computed
Tomography Angiography) is included as well. The principle of CTA is equal to a conventional
CT scan, but in addition a contrast agent is administered intravenously and this enables the
Figure 6: The Hounsfield scale (19) (Available from:
https://web.archive.org/web/20070926231241/http://www.intl.elsevierhealth.com/e-books/pdf/940.pdf)
13
visualisation of the blood vessels, which are normally difficult to see. The contrast agent (e.g.
iodine) enhances the visualisation of blood vessels by inhibiting the X-rays to pass through
(21).
The purpose of this study is to go from a set of 2D images obtained from a CT scan, to a 3D
image, which gives a lot more information than the individual 2D image. This can only be
achieved by using segmentation.
1.2.2 Pixels and voxels
A CT scan produces 2D axial pictures, consisting of pixels. A pixel is just one point on the
image, with certain coordinates (x,y), having a certain brightness or colour. A lot of adjacent
pixels toghether can form an image (Figure 7). To create a 3D image, there is a need of voxels.
The term voxel originates from the contamination of pixel and volume. A voxel is a pixel but in
3D, so it is not a square anymore but a cube with a certain coordinate (x,y,z) and a certain
brightness or colour. Due to the use of voxels instead of pixels, we can see the 3D image from
different angles (23,25).
1.3 Introduction to segmentation
As mentioned above, segmentation is a method that helps one to extract the bony structures
from a CT image. It used to be done, and often is still done, manually, asking a lot of time from
professionals, considering that an outsider who is not comfortable with the technique of
segmenting is unable to perform this. Equally, there are a couple of disadvantages of manual
segmentation, for example the long time it takes to segment, the subjective evaluation and the
low reproducibility (26). Low reproducibility means it is impossible to do the exact same
segmentation all over again, even by the same person. To minimalize these disadvantages,
there is a clear need of a (semi-) automatic segmentation method which does not include, or
only requires a minimal of human intervention.
Figure 7: Illustration of pixels (25) (from Matej “Retro”)
14
Manual segmentation is based on the contrast between the surrounding soft tissue and bone
tissue. The aim is to mark the bone, so the computer can extract it from the image. When
making a segmentation, a line must be drawn where the bone touches the soft tissue meaning
when the contrast is high, it is easy to draw a line between the two. For example, the density
of the long bones such as femur, tibia and fibula is high and the density of the surrounding
tissue is low which establishes a high contrast and makes segmentation effortless. But
segmentation, manual as well as automatic, has its difficulties in the joint epiphysis areas
where the contrast between the cancellous bone and the soft tissue is much lower, since the
cortical layer is thinner and thereby has a lower density (27). The cortical layer is the outer part
of the bone that makes the contrast with the surrounding tissue this high. The thinner the
cortical layer, the harder it is to see the line where the bone is in touch with the surrounding
tissue. Also, both manual as automatic segmentation deal with problems like low quality CT
scans and narrow space between the different bones taking part in the joints (10) (Figure 8).
A B C
1.4 (Semi-) automatic segmentation methods
In this section, the different methods of automatic segmentation are briefly explained. The
easiest method is thresholding, but because of its many disadvantages this method is no
longer the first-choice method of segmenting.
In manual segmentation as well as automatic segmentation, the first thing to do is select the
region of interest (ROI). The ROI is the region where one is interested in. For example, in a
CT scan of the full lower limb, where one is only interested in the femur, then the region of
interest is the region around the femur, including the femur. The image is transformed so that
only the femur is fully visible (28).
Figure 8: Common drawbacks on manual and automatic segmentation. A) Normal femur with high contrast
resulting in effortless segmentation B) Narrow spaces between the pelvic bones and the femoral head in
the hip joint C) A thin cortical layer in the knee joint (femur and tibia) that involves in less contrast
15
1.4.1 Thresholding (global vs local thresholding) (29)
Thresholding as a segmentation method uses the difference in
density between bone and the surrounding tissue (4,30–33). The
Hounsfield Units (HU), which are used to express the density of
any tissue in a CT scan, are used to define the type of tissue.
Bone has a HU between 700 (for trabecular bone) and 3000 (for
cortical bone) and soft tissue has a HU between 100-300. (19)
This allows the establishment of a threshold, a kind of boundary,
between 300-700 HU, so any tissue above this threshold will be
defined as bone and any tissue beneath will be defined as soft
tissue (34,35). For this reason, thresholding is a region based
segmentation method and is sometimes called binarizing.
The most important thing to do, is to choose the threshold value.
You can choose it manually or by calculating the mean CT value of all the pixels and taking
this CT value as a threshold (30,36). But a more appropriate way of choosing the threshold
value is based on creating a histogram of the image pixel densities. The most common used
technique of histogram based thresholding is Otsu’s method (37). The principle of the
technique proposed by Otsu is to search thresholds that maximize the interclass variation and
minimize the intraclass variation. For example in Figure 9, the histogram of a hip joint region
is calculated and two peaks will be found. The histogram curve shows us the number of voxels
that have a certain CT value or HU unit. In the ideal case the histogram curve shows us two
peaks with in between a deep valley. This valley should be used as a threshold so all the
voxels with a HU above the threshold will be defined as bone and all the voxels beneath will
be defined as soft tissue. But in most of the medical images, the two peaks are not that clear.
The lower peak correlates with the densities of the femoral head and acetabulum and the
higher peak correlates with the surrounding soft tissue. Afterwards two Gaussian curves,
representing the distribution of the bone on one hand and the soft tissue on the other hand,
are fitted to the highest values of those two peaks of the histogram curve. Finally, the
intersection between those two Gaussian curves is taken as the threshold value (35,37,38)
(Figure 10).
Figure 9: Illustration of thresholding.
A) Transversal slice of the diaphysis of
the femur B) Graphical illustration of
the variation in density of this picture
(the principle of thresholding)
A
B
16
Figure 10: Histogram based threshold: Two Gaussian curves representing the distribution of the bone on one hand
and soft tissue on the other hand are fitted in the histogram curve and the intersection of the two curves is taken as
the threshold value. (35) (from Cheng Y et. al.)
As mentioned above, thresholding is no longer the first-choice method of segmenting, since it
has many disadvantages like over-thresholding, segmenting bones where implants are
involved and diffused boundaries (35).
Over-thresholding means that not only bone will be defined as bone, but also a part of the
surrounding soft tissue is defined as bone. When the threshold of HU is taken a bit lower, e.g.
250 HU, a part of the surrounding tissue will be segmented too. Sometimes the threshold must
be placed at lower HU e.g. in the epiphyses where the bone density is lower and when the
threshold is higher, the bone will be defined as soft tissue.
When implants are present in the patient, which are often made of metal, it is impossible to
use a segmentation method based on thresholding because bone and metal have an overlap
in density and the implant could be defined as bone.
Narrowness in a joint and weak boundaries are also common problems of thresholding. For
example, in the hip joint where the femur and acetabulum of the pelvis almost touch each other
due to thin and often worn out cartilage layers. Thresholding is not able to make the distinction
between the femur and pelvis and both these bones will be defined as bone tissue. This could
be a problem if the aim is only to segment the femur and not the pelvis (35).
Local or adaptive thresholding can be a solution for the over-thresholding issue. Adaptive
thresholding is the situation where different threshold values are used for different regions in
the image (38,39). Therefore the image is divided into separated blocks or sections and each
section has a different threshold value, calculated by using for example the histogram method
explained above (32,33,40).
17
Another method, which is a variant of local or adaptive thresholding is calculating 2 threshold
values: LT (Lower Threshold) and HT (Higher Threshold) (Figure 11). Every voxel with a HU
below the LT is defined as soft tissue and every voxel with a HU higher than HT is defined as
bone. Every voxel within the LT and the HT will be assigned by a local adaptive formula. This
formula is based on the mean CT value (m) and the standard deviation (SD) of the 26
neighbour voxels of the voxel under examination (v)(41):
𝐶𝑇𝑣 = {< 𝑚 − 𝛼 ∙ 𝑆𝐷 => 𝑙𝑎𝑏𝑒𝑙 𝑜𝑓 𝑠𝑜𝑓𝑡 𝑡𝑖𝑠𝑠𝑢≥ 𝑚 − 𝛼 ∙ 𝑆𝐷 => 𝑙𝑎𝑏𝑒𝑙 𝑜𝑓 𝑏𝑜𝑛𝑒
(1)
with D typically equal to one (41).
At last, there is one more way to use thresholding as a segmentation method. In case of a lot
of background, you can define a second thresholding value so instead of binarizing the image,
you divide it in three regions which differ in density. The first threshold makes the difference
between the background (air = - 1000 HU) and the rest of the image. The second threshold
makes the difference between the rest of the image, so the soft tissue and the bone. This last
way of thresholding is used in our proposed automatic segmentation program.
Figure 11: Histogram of CT values. All voxels with CT values above the HT are classified as bone and all the
voxels with CT values below the LT are classified as soft tissue. In between LT and HT local adaptive thresholds
are used. (41) (from Kang Y et. al.)
. (
18
1.4.2 Gradient based method (29)
The image gradient is a parameter for directional change in an image. On one hand the
magnitude of the gradient represents the rapidity of the change and on the other hand, the
direction of the gradient represents in which direction the image is changing (42). The principle
of the gradient based method is based on the first derivative of the image which represents
changes in density in the image. The method only works if there is a clear change in density
between two pixels. These clear changes in density are mostly found in the edges between
surrounding soft tissue and bone and therefore can be used as a segmentation method (Figure
12) (32,33).
The magnitude of the gradient tells us more about the rapidity of
the change in density of the image. For example, when there is a
transition zone from soft tissue to cortical bone, there is a big
change in density. Therefore, the gradient will have a high peak
in that pixel. But when there is a transition zone between soft
tissue and trabecular bone whereas the HU difference is not that
big, the peak of the gradient will be much lower (3,42).
The direction of the gradient tells us more about the direction in
which the change in density occurs in the image. The direction
can be measured in the x-axis or in the y-axis (3,42). In Figure
12, the direction of the gradient is represented in the x-axis. In C)
the gradient has not included the direction, but just the change in
density and in D) the gradient has included the direction. To
calculate the gradient, we start from the first pixel on the left side.
The next pixel has the same density so there is no change and
the gradient stays the same. When we come across the first pixel
that includes the femur, there is a big change in density. The HU
increase from one pixel to the other, so the direction goes
upwards and the magnitude is high. When we are in the cortical
bone of the femur, the density does not change so the gradient
ascends to this isoline. But afterwards, when we come across the first pixel that does not
belong to the cortical bone of the femur anymore and therefore a decrease in HU, the gradient
direction goes downwards. After this change, the density stays the same for a long time, so
the gradient goes back to his isoline.
Figure 12: Illustration of gradient
based method. A) Transversal slice of
the diaphysis of the femur B) Graphical
illustration of the variation in density of
this femur (the principle of thresholding
C) The principle of gradient based
segmentation D) Gradient based
method considering direction
B
C
D
A
19
From the information above, we can conclude that the gradient based method is a
segmentation method based on edge detection, as the edges are most of the time the zones
with the biggest difference in density (25,26).
The advantage of gradient compared with thresholding is that it is way more sensitive. For
example, with the presence of a sesamoid bone in an image, having a HU below the threshold,
it will be noticed by the gradient but not by the thresholding method. In the thresholding method,
the threshold is often placed high so in order to not wrongly define soft tissue as bone. Because
of this high threshold value, the sesamoid bone will wrongly be defined as soft tissue. With the
same image segmented by the gradient based method, the gradient will recognize the abrupt
change in the density, even for the sesamoid bone, since the surrounding soft tissue has a
much lower HU. So, we can conclude that the gradient based segmentation method is more
sensitive as it will recognize such minimal difference in density.
This also explains the disadvantage of gradient based segmentation. Since it recognizes
minimal differences in density, it will also recognize noise as an edge and thus the gradient
will show us a difference in density. The edge detection method requires an equilibrium
between detection of the real edges and not detecting noise as an edge. So, if the level of
detection of the minimal of change is too high, noise can be wrongly defined as an edge. We
can conclude that gradient based method is suited for images with sharp edges and minimal
noise, because in noisy images it can create extra edges or, when the level of detecting density
changes is too low, miss edges (32,43).
Common used operators of the gradient based method are Sobel’s operator, Laplacian
operator, Laplacian of Gaussian operator and Canny edge detector. Sobel’s operator is based
on the first derivative of the image and therefore is not too sensitive for noise, but the other
three are based on the second derivative and are therefore very sensitive to noise as
derivatives amplify the noise (32,33,43–45).
1.4.3 Shape models (46–49)
In our proposed method of automatic segmentation, we use Statistical Shape Models (SSM)
for the automatic segmentation of the bones in the lower limb and therefore it is briefly
explained.
A statistical shape model describes the average shape distribution within a population as well
as the main modes of variation of that shape. The SSM is based on a training dataset.
20
First of all, there is the need to represent the shape of a bone. In this paper, it will be explained
with the concept of landmarks. Landmarks are multiple points distributed across the surface
creating a point density model (50). They need to be placed in exactly the same anatomical
location on every bone, in other words there is a need of corresponding positioning. This is
often done manually, where the landmarks are placed in the periphery of the bones in the
training dataset. To facilitate this time-consuming task, there is an easier way to place
landmarks: once there are a few landmarks placed, new ones can be placed between the
already placed ones by interpolation. However, this is one of the most difficult tasks of
constructing a statistical shape model since there is a need of correspondence. When there is
a lack of correspondence, the SSM will not be consistent. In the equation below 𝑥𝑖 stands for
the coordinates of multiple landmarks or in other words the shape of a bone.
With these landmarks, the average shape can be calculated as:
�̅� =
1𝑁 ∑ 𝑥𝑖
𝑁
𝑖=1
(2)
Where �̅� stands for the average shape and N for the number of samples included in the training
dataset.
Afterwards, a Principal Component Analysis (PCA) is performed. PCA calculates the most
important modes of variation of the average shape, also called the Eigen modes. The most
important mode of variation is mentioned first because it contains the biggest percentage of
shape variation (Figure 13). After PCA, the main modes of variation are known. Then each
bone of the training dataset, could be described as:
𝑥 = �̅� + 𝑎𝑏
(3)
Where x is the bone from the population that is described, �̅� the mean shape, a the contribution
(the score) of the mode of variation and b the mode of variation. The formula above considers
one mode of variation, but of course it can be applied to multiple modes of variation:
21
With c the total number of modes of variation.
As more modes of variation are included, the accuracy of the
description of the bone will increase. Furthermore, including
more data in the training dataset, will improve the relevance of
the SSM.
The next step is to fit those SSM into a new set of images (the
fitting process) and Active Shape Models (ASM) are one way
of doing this (47). The ASM is a statistical shape model which
iterative deform to fit the model obtained from the training data
set to the new image (49). Once these SSM/ASM are made
based on a training set, they can be used for a lot of medical
procedures (e.g. design of prosthesis/implants, correlation
between morphology and diseases (femur neck –
osteoarthritis), prediction of fracture risk, surgery planning,
computer aided surgery, …).
But segmentation using the SSM fitting alone does not allow to
segment aberrant shape data if the patient has anatomical
abnormalities (e.g. a tumour) or when too many differences
between the bone of the patient and the bones of the training
dataset occur. The SSM simply would not fit properly (46).
1.4.4 Free deformation
Free deformation is an example of a snake or an active contour model. Such snakes move
within the image and search for boundaries that are not included in the statistical shape model
(32).
In this paragraph, the process prior to the free deformation phase is explained. To begin, the
region of interest is selected, as mentioned above. Then, the first step is thresholding, which
𝑥 = �̅� + ∑ 𝑎𝑧𝑏𝑧
𝑐
𝑧=1
(4)
Figure 13: Illustration of the SSM and
the Eigenmodes. A) The percentage of
variation in appearance explained by
different shape modes in a 2D statistical
appearance model of the femur. B-D)
The representation of the first three
principal modes of variation of the full
SSM at the mean and at plus or minus
three standard deviations from the mean
(46) (from Sarkalkan N et. al.)
22
marks the bone and therefore the first step in segmentation is made. The second step is to fit
the ASM on the image. This way, the biggest part of the boundaries will be found and these
will be secured. Supposing for example that after application of the ASM, 95% of the founded
boundaries will be correct, but 5% cannot be found, because they are located outside the
restriction that the ASM dictates. The reason being is that the ASM is able to move within the
picture, but unable to move too far away from its average model. Thus, when the boundaries
are located over a couple standard deviations of the mean, the SSM will not allow the ASM to
go that far outside its average model. Therefore, we can conclude that the ASM has certain
restrictions.
Finally, the free deformation phase takes place, which is an active contour model that searches
the boundaries that cannot be found by the ASM. On the bases of an iterative process, the
free deformation searches in which direction it has to move to find the boundary, considering
its neighbours. This is an iterative process because also, the free deformation has its
restrictions. The active contour model allows a movement outside the ASM restriction, but it is
not allowed to move more than a few millimetres. So, when the real boundary is far away from
the boundary the ASM suggested, the free deformation needs more than one processing cycle
to reach the real boundary. It is clear that with each cycle, the found boundary is bit by bit
closer to the real boundary (Figure 14) (50–54).
The best way to visualize an active contour model is to see it as a rubber band that has the
possibility to deform or move during time. The aim of this deformation is to get as close as
possible to the target contour. During the iterative process, the active contour model is
attracted toward the target contour. The snake works with an energy functional, to measure
the suitability of the contour. When there is a good result, the energy functional is minimal. The
Figure 14: Illustration of the free deformation phase. Assuming that the circles represent the cortex of the femur.
The green circle shows us the real cortex of the femur and the blue circle shows us the model made by the ASM.
Because of its restrictions, the ASM is not able to reach the real green cortex. The free deformation phase, makes is
possible to reach the real cortex by an iterative process. The red circles show us that the free deformation phase
takes place in multiple steps. Each step, the real boundary is closer.
23
intention is to strive for a minimal energy functional. The energy functional is based on multiple
forces that control the location and the shape of the active contour model. These forces can
be divided into internal and external forces. The internal forces consist of the guarantying of
continuity of the contour (Econt: this energy term forces the snake to be continuous) and the
smoothness of the contour (Ecurv: this energy term forces the snake to make a smooth contour).
Together they control the alteration of the snake and keeps the model close to its original
shape. Interesting is that weighting the internal energy terms needs to be done very carefully,
since too much weight leads to a rigid model, that does not want to alter and too little weight
leads to a model that is too flexible. The external energy term is the attraction component. It
guaranties that the contour is attracted to the closest image edge and thereby controls the
fitting of the contour into the image (Eimage). The Eimage is based on image intensity or gradient
magnitude and will become very little when the snake is getting close to an edge. With this
information, the formula for the energy functional is the following: (50,52–54)
𝐸𝑠𝑛𝑎𝑘𝑒 = 𝐸𝑖𝑛𝑡𝑒𝑟𝑛𝑎𝑙 + 𝐸𝑒𝑥𝑡𝑒𝑟𝑛𝑎𝑙 = 𝐸𝑐𝑜𝑛𝑡 + 𝐸𝑐𝑢𝑟𝑣 + 𝐸𝑖𝑚𝑎𝑔𝑒
(5)
In the old model of Kass (52) there was one extra energy term. Econstraint is the constraint energy
and is based on interaction with the user. The user can give the snake restrictions by using
this energy term and push the snake towards or away from certain positions (50,51). Because
of the aim of this study to make segmentation automatic, the user interaction will not occur.
Using active contours has a couple of disadvantages. The initial contour must be placed close
to the real boundary because if not, the snake will not be able to find this boundary (55).
In our proposed program, the free deformation is used as final step after initializing the contour
to the image, near the real boundaries.
1.5 Practical applications of image segmentation
In this paragraph, a summary of applications in the context of image segmentation is
described. Image segmentation is or could be used in a lot of medical procedures such as
orthopaedic surgery: e.g. computer aided orthopaedic surgeries (CAOS), planning of surgical
procedures, design optimisation and the placing of prosthesis or implants. It can also be used
for diagnosis of e.g. anatomical abnormalities or pathogenic cellular growth, tumour
localisation and be of use for treatment planning. Furthermore, since segmentation is used to
study the anatomy of structures, it makes comparison between organs or bones from different
24
humans possible (e.g. male vs female bones) (10,32,34,35). The ability to see a 3D anatomical
structure instead of a 2D slice, gives us more information and allows us to calculate the risk
factors for certain diseases (e.g. osteoporosis and osteoarthritis) (46). Image segmentation is
not only used in the orthopaedic field, but also in other medical disciplines. For example, in
cardiology, where image segmentation makes heart surgery a lot easier. The same applies to
neurology and thereby brain surgery. Also in the diagnosis of brain tumours image
segmentation can be useful. In fact, image segmentation is capable of extracting every organ
in the human body. Thus, every single organ can be extensively studied for its normal variation
and its abnormalities (48).
Segmentation techniques are not only used in medicine, but also in crime investigation, e.g.
face recognition, finger print recognition and iris recognition (32).
The examples above are only a few of the applications of segmentation in the medical field. It
shows us that segmentation is, or is soon to become a huge tool in diagnosing procedures and
even in operation planning.
1.6 Validation metrics
For the validation of automatic segmentation programs multiple measurements are used. In
this paper three different metrics are explained and two of them will be used to compare our
program to other bone segmentation methods. The mean error distance (Average Surface
Distance (ASD) & Root Mean Square (RMS)) and the maximum fault (= Hausdorff Distance
(HD)) will be used in this study and are explained in section 2.4. The Dice Similarity Index
(DSI) and its derivative, the Overlap Error (OE), will not be used. As they are frequently used
as a validation metric in other, previously developed, automatic segmentation techniques they
are still explained below.
1.6.1 Mean error distance (ASD & RMS)
See section 2.4 for the definition of this validation metric.
1.6.2 Maximum fault (HD)
See section 2.4 for the definition of this validation metric.
25
1.6.3 Dice Similarity Index
The Dice Similarity Index (DSI) or the Dice coefficient is a measurement for the similarity
between the segmentations from the manual technique and the proposed automatic
segmentation technique. That is why it is also called the overlap index. The formula for the
DICE coefficient is:
𝐷𝑆𝐼 =2(¨𝐴 ∩ 𝐵¨)¨𝐴¨ + ¨𝐵¨
(6)
Where ¨𝐴¨is the manual segmentation and ¨𝐵¨ the automatic segmentation. 𝐴 ∩ 𝐵 is the
volume that the 2 segmentations have in common (Figure 15).
DSI is given in percent, i.e. if the result is 100%, it means that the manual and the automatic
segmentation completely overlap. 0% means there is no overlap between the two
segmentations (28,35).
Some papers use the volumetric Overlap Error (OE) as a metric. OE stands for the percentage
of segmentation between the manual segmentation and the automatic segmentation that does
not overlap. In other words, when the result is 0% it is a perfect segmentation, when it is 100%
there is no overlap between the two segmentations.
Since
𝑂𝐸 = 1 −(¨𝐴 ∩ 𝐵¨)(¨𝐴 ∪ 𝐵¨)
(7)
𝐴 ∩ 𝐵
Figure 15: Illustration of the Dice Similarity Index
(DSI)
𝐴 𝐵
26
and
(¨𝐴 ∪ 𝐵¨) = (¨𝐴¨ + ¨𝐵¨) − ¨𝐴 ∩ 𝐵¨
(8)
we can convert the volumetric overlap error to the Dice Similarity Index: (50)
𝐷𝑆𝐼 = 2 ×(1 − 𝑂𝐸)(2 − 𝑂𝐸)
(9)
We can conclude that these are good parameters to compare different studies, but it is
important to note that a comparison of the different methods is not easy and must be
interpreted with caution because of different datasets, different parameters to evaluate and
different methods being used (28,46,56,57).
27
2 MATERIALS AND METHODOLOGY
To start, the program that is used to do manual segmentation is briefly explained. Some
techniques that make the manual segmentation less time-consuming are mentioned. Then,
the pipeline of the proposed method for automatic segmentation is presented. Each step will
be explained and illustrated by figures. In the third part, the method of validation is explained.
At last, the validation metrics and the use of inter- and intra-observer differences are
mentioned.
The automatic segmentation technique performed 210 CT automatic segmentations. Only 10
of them were used for the validation since a manual segmentation was necessary for
comparison. Furthermore, two observers independently segmented three cases and one
observer segmented two times the same case by using Mimics. These inter-observer and intra-
observer variation results were used for the validation.
2.1 Manual segmentation using Mimics
The first part of my work during this study was to create manual segmentations, which could
be used for the golden standard. The manual segmentation is done using Mimics version 17.1,
a program developed by Materialise Leuven. Mimics is a 3D medical image processing
software that allows, with user interaction, to turn scans into accurate 3D models (58). First
there is a need of a full CT scan in DICOM files. These files need to be uploaded in Mimics.
Then, the manual segmentation can begin. The aim is to select every pixel that presents bone
and when this is accomplished, Mimics allows this selection to be turned into a 3D shape. To
make it somewhat less time consuming, Mimics has a couple of features to make manual
segmentation easier such as thresholding, the possibility of selecting multiple pixels at once
and region growing. Thus, in Mimics, it is possible to create a threshold: every pixel with a HU
above the threshold will be defined as bone. One should be careful not to place the threshold
at a too high HU value, because then soft tissue will be defined as bone, and this will create
more work to remove the label ‘bone’ of these wrongly defined pixels. Region growing makes
it possible to close open regions and fill holes. By using these features, the manual
segmentation goes faster, but still it takes 100 man hours to manually segment a full lower
limb (6). This is the reason why there is a need for automatic segmentation techniques, so
professionals can use their time more efficiently.
28
2.2 Construction of the SSM
Statistical shape models are used in our proposed automatic segmentation technique. First, a
low-resolution statistical shape model is used. This low-resolution SSM contains the full lower
limb. Second, a high-resolution statistical shape model is applied to the CT data set and only
encloses one bone. For the construction of these SSM models, manual segmentations are
necessary. As a result of the manual segmentation, explained above, a 3D model is formed.
This 3D model consists of landmarks, connected to each other. Each landmark is connected
with two neighbour’s landmarks with the aim to form triangles. All the triangles together form
the surface of the 3D model.
In this study, a minimal of 10 manual segmentations was made to construct the first version of
the SSM model. An average model of these 10 manual segmentations was computed and
Principal Component Analysis (PCA) was applied to determine the modes of variation of the
average shape model (see 1.4.3). The more cases segmented by the SSM, the more accurate
the SSM will be (see beneath).
Figure 16 shows us the average shape and appearance of the low-resolution full limb model
and the average shape of the high-resolution model of different bones. These statistical shape
models were used in our proposed automatic segmentation technique.
Figure 16: Average shape and appearance of the low-resolution full limb SSM on the left and the average
shape of the high-resolution SSM of the 6 lower vertebrae, the foot bones, the sacrum, the tibia and fibula
on the right.
29
Attention was paid to the exclusion of segmentations that were included in the statistical shape
models, because the automatic segmentation of a case that is included in the SMM will be
easier to recognise for the automatic segmentation program and therefore will affect the
results. Excluding these segmentations was done by applying the “leave one out” technique,
which means that when an evaluation needs to be done of a bone that is included in the SSM,
this bone will be left out of the SSM.
2.3 The pipeline of the automatic segmentation of the full lower limb
In the overview beneath, the automatic segmentation pipeline of our newly developed program
is illustrated. The fully automatic segmentation is divided into three phases: firstly, there is an
initialisation and segmentation phase. This first phase is separated in four steps. Secondly,
the training of the shape model is supervised and the segmentation pipeline is continuously
updated. Finally, there is the appearance and solids phase.
2.3.1 Initialisation and segmentation
To illustrate this phase, the reader is referred to Figure 17.
2.3.1.1 Step 1: Initialisation to estimate position, orientation and scale of the lower limb on an
isosurface reconstruction following simple thresholding
First, the volume of the CT images is down-sampled to reduce calculation time for the
initialisation phase. Afterwards, with the use of two thresholds corresponding to Otsu, the
image is segmented into three levels: background, soft tissue and bone (37). With the use of
Appearance and solids
Supervised training of the shape models and continuous updating of the segmentation pipeline
Initialisation and segmentationInitialisation of the lower limb on a
isosurface reconstruction following simple thresholding
Statistical shape adaption to provide a good local initialisation for the final in-
image segmentation processIn-image statistical shape adaptation Free form segementation
30
3D binary morphological operations, a binary dataset is generated. These operations include:
closing open regions, breaking small connections, filling holes and the removal of unconnected
noise. After the application of these operations, an isosurface is generated from this volume
data to provide a first estimation of the geometry of the cortical surfaces of the full lower limb.
2.3.1.2 Step 2: Statistical shape adaptation to provide a good local initialisation for the final
in image segmentation process
To create a first estimation on shape and position of the lower limb bones, the low-resolution
Statistical Shape Model (SSM) is fitted onto the isosurface. This fitting procedure iteratively
searches for the closest points from the source image to the target as well as from the target
to the source. It also excludes outlier points which can be caused by noise or other bony data.
2.3.1.3 Step 3: In-image statistical shape adaptation
The low resolution SSM gives a first good estimation of shape and location of the lower limb
bones. The third step includes local optimisation with the use of the high-resolution statistical
shape model fitting and is performed bone by bone. Penetration of surfaces in difficult areas
like joints is avoided by digital subtraction of the neighbouring bones during the fitting
procedure.
2.3.1.4 Step 4: The free form segmentation phase
This step is used to overcome the restrictions that SSM dictates and allows for the shape not
to be captured by the SSM, as explained above in 1.4.4.
31
2.3.2 Supervised training of the shape models and continuous updating of the segmentation pipeline
Each automatically segmented case is evaluated by a clinical expert. Segmentations which fit
well with the original scan data are added to the existing SSM and for each new segmentation
the updated versions are used. Cases that before did not fulfil quality requirements are
segmented once again with the updated pipeline.
2.3.3 Solids and appearance
With a correspondence feature, solids and appearance can be added to our 3D model of a
bone. Each bone has a periosteal surface, presented as an outer layer, and an endosteal
surface, presented as an inner layer. The correspondence feature makes it possible for our
automatic segmentation technique to connect one point of the outer mesh layer to the opposite
point of the inner mesh layer (Figure 18).
Figure 17: Overview of the automatic segmentation pipeline: A) Initial positioning of the
lower limb on an isosurface reconstruction B) fitting of the low-resolution shape model on
the isosurface C) followed by in-image statistical shape adaptation and D) free form
segmentation phase.
32
Therefore, during the construction of the shape model, the periosteal and endosteal surfaces
were non-rigidly registered onto each other to define dense correspondence between both
layers. The correspondence feature of inner and outer cortical mesh layers allows for
immediate construction of tetrahedron meshes as well as for the definition of local cortical
thickness. This cortical thickness can play an important role in the prediction of the fracture
risk. When bone is fragile, the risk of getting a fracture increases.
Tetrahedrons are created to form a 3D model. Every 3D bone model is constructed of
tetrahedrons, which are formed by interconnecting 4 points. These points are part of the
scatterplot or mesh, which is created after the pipeline was applied to the CT data set. Cartilage
layers can similarly be added by applying a distance map to the surface normal of a selected
area, and again tetrahedrons can be defined automatically (Figure 19).
Secondly, due to the correspondence feature, bone density in terms of Hounsfield units could
be directly measured during segmentation. This density or in other words the amount of bone
mineral in the bone tissue, is measured at the middle of corresponding outer-inner vertices.
For the 3D representation, different densities are given different colours. For example, low
density is given the colour blue and high density is given the colour red. Therefore, one can
16534
298 16534
9763 298
9763
Periostal surface
Endostal surface
Figure 18: Illustration of the correspondence feature
Figure 19: Illustration of solid cortical, cartilage and marrow volumes, presented by
tetrahedrons
33
predict the quality of the bone just by seeing the appearance of this 3D representation, since
bone density is the most important predictor for osteoporosis and bone fracture. The lower the
bone density, the higher the risk of osteoporosis and therefore the risk of bone fractures (59–
61).
2.4 Used validation metrics
As explained above, the mean error distance and the maximum fault are used as parameters
for validation of our newly developed automatic segmentation technique. DSI (Dice Similarity
Index) and OE (Overlap Error) will not be used in this study, but is frequently used in the
validation of other, previously developed programs.
2.4.1 Mean error distance (ASD & RMS)
The mean error distance is the Average Distance (AD) in mm, often called the Average Surface
Distance (ASD), between every point in the manual segmentation to the closest point in the
automatic segmentation.
�̅� =𝑋1 + 𝑋2 + 𝑋3 + ⋯ + 𝑋𝑛
𝑁
(10)
With �̅� the mean error or the AD, every 𝑋 distance between two points of the different
segmentation sets and N the number of the distances included in the formula (57).
A similar parameter of the average distance is the Root Mean Square (RMS). It is the square
of the formula above and then the square root is computed from the final result. Root mean
square is also called ‘the quadratic mean’.
�̅�𝑅𝑀𝑆 = √𝑋12 + 𝑋2
2 + 𝑋32 + ⋯ + 𝑋𝑛
2
𝑁
(11)
Where �̅�𝑅𝑀𝑆 is the root mean square (57,62).
34
2.4.2 Maximum fault (HD)
The Hausdorff Distance (HD) is the greatest distance of all distances from one point in the
manual segmentation to the correspondent point in the automatic segmentation and vice versa
(Figure 20). In other words, it measures the mismatch between the two segmentations (57,63).
2.5 Inter- and intra-observer differences
In manual segmentation, inter- and intra-observer differences were defined. Inter observer
differences are differences between the manual segmentations of the same bone, in the same
case made by two different clinical experts. The manual segmentation from one expert is the
golden standard and the manual segmentation of the other expert is compared to this golden
standard. Intra-observer differences are differences between the manual segmentation of the
same bone, in the same case by the same clinical expert. The intra-observer difference
measures in other words the reproducibility.
The inter- and intra-differences are measured to demonstrate that even the golden standard,
in other words the manual segmentation, has a degree of error. This makes it easier to interpret
the ASD and the maximum error that were found in our automatic segmentation technique
whilst comparing it to the golden standard.
Figure 20: Illustration of the Hausdorff Distance
35
3 RESULTS
A dell precision M6800 Laptop (Intel Core i7 - 4910MQ, 16 GB RAM, 64 bit) was used to
process a total of 210 CT data sets. Each scan data set consisted of an average of 1864
(ranging from 1320 to 2315) slices with an average pixel size of 0,703 mm (ranging from 0,575
mm to 0,975 mm). The processing of a full data set requires an average around 2 hours per
case. This time includes uploading all cases as well as exporting and writing of segmentation
results in obj. and stl. file format.
As mentioned in the introduction, different bones of the lower limb are included in this study.
Six lower vertebrae, sacrum, bilaterally the pelvic bone, femur, tibia, patella, fibula, talus and
calcaneus are included (21 osteological structures in total). A series of iterations and
progressive updating of the low-resolution as well as the high resolution SSM’s, following
quality clearance by the clinical experts was performed. Finally, all the scans were processed
and convergence of the algorithm was obtained in all the scans except one.
For the validation, manual segmentations were necessary to compare the automatically
segmented bones to the manually segmented bones, which are considered as the golden
standard. Because of this time-consuming task, 10 manual segmentations were made. These
manual segmentations included the pelvis, the femur, the tibia, the patella, the fibula, the
calcaneus and the talus. Since the six lower vertebrae and the sacrum were not manually
segmented, the results of the automatic segmentation are not given in Table 1 because they
are not comparable to the golden standard. Therefore, these bones are not included in the
validation of our automatic segmentation technique.
The results, expressed In average error (Average Surface Distance) and maximum error
(Hausdorff Distance) are presented in Table 1. The mean error or the ASD ranged from 0,53
mm to 0,76 mm, with an average of 0,65 mm and the maximum error or the Hausdorff distance
ranged from 2,02 mm to 7,84 mm, with an average of 4,05 mm. The smallest errors were found
in the foot bones (the talus and the calcaneus) and the largest errors were found in the pelvis.
36
Table 1: Accuracy and observer variation in segmentation of the respective segments
Automated segmentation (n = 10)
Manual segmentation
Inter-observer variability (n = 3)
Intra-observer variability (n = 1)
Average Error
(mm)
Maximum Error
(mm)
Average Error
(mm)
Maximum Error
(mm)
Average Error
(mm)
Maximum Error
(mm)
Pelvis 0,75 +/- 0,17 7,84 +/- 2,26 0,41 +/- 0,20 3,74 +/- 2,68 0,18 1,36
Femur 0,65 +/- 0,10 4,79 +/- 2,39 0,41 +/- 0,15 2,30 +/- 0,98 0,18 2,29
Tibia 0,63 +/- 0,11 4,07 +/- 2,15 0,39 +/- 0,19 1,88 +/- 0,47 0,23 0,89
Patella 0,65 +/- 0,15 2,02 +/- 0,39 0,42 +/- 0,11 1,96 +/- 0,29 0,18 0,78
Fibula 0,76 +/- 0,18 3,76 +/- 1,17 0,61 +/- 0,08 2,25 +/- 0,71 0,17 0,81
Calcaneus 0,53 +/- 0,16 2,90 +/- 0,77 0,40 +/- 0,12 1,67 +/- 0,34 0,32 1,12
Talus 0,57 +/- 0,12 2,97 +/- 0,59 0,44 +/- 0,08 2,21 +/- 0,41 0,26 0,78
The automatic segmentation of the pelvis has an average error of 0,75 mm and a Hausdorff
distance of 7,84 mm in comparison with the golden standard. The femur has an average
surface distance of 0,65 mm and a Hausdorff distance of 4,79 mm. The tibia has an average
surface distance of 0,63 mm and a maximum error of 4,07 mm. The patella has an average
error of 0,65 mm and a maximum error of 2,02 mm. The fibula has an average error of 0,76
mm and a Hausdorff distance of 3,76 mm. The calcaneus and the talus have an average
surface error of 0,53 mm and 0,57 mm respectively and a Hausdorff distance of 2,90 mm and
2,97 mm respectively (Table 1).
The inter-observer variability has the following results: an average error that ranges from 0,39
mm to 0,61 mm, with an average of 0,44 mm and a maximum error that ranges from 1,67 mm
till 3,74 mm, with an average of 2,29 mm. The intra-observer variability in one case was also
measured and the following results were found: an average error that ranges from 0,17 mm to
0,32 mm and a maximum error that ranges from 0,78 mm till 2,29 mm. These result show us
that even in the golden standard, degrees of error exist.
When we make a comparison between the errors made by the automatic segmentation and
the errors made by 2 experts whose manual segmentations of the same bone were compared
to each other, small differences are found. The maximum difference in average error between
the automatic segmentation and the inter observer variability research was found in the pelvis
since the average surface distance between the two manual segmentations, made by two
different experts, is 0,41 mm and the average error of the automatic segmentation is 0,75 mm.
37
Therefore, the difference in error is 0,34 mm. On the other hand, a minimum difference in
average surface distance of 0,13 mm is found in the calcaneus and talus since the average
surface distance between the two manual segmentations is 0,40 mm and 0,44 mm respectively
and the average surface distance of the automatic segmentation is 0,53 mm and 0,57 mm
respectively. The average difference between the automatic segmentation and the inter
observer variability research in average error is 0,21 mm and in maximum error is 1,76 mm.
The maximum difference in maximum error is 4,10 mm, found in the pelvis and the minimum
difference in maximum error is found in the patella with a difference of 0,06 mm.
Also in the intra-observer variability research errors are found. Here the maximum average
error difference between the intra-observer variability research and the automatic
segmentation is found in the fibula, with an average surface distance difference of 0,59 mm.
The minimal difference in average surface distance is 0,21 mm, found in the calcaneus. The
maximum difference in maximum error is 6,48 mm, found in the pelvis, and the minimum
difference in maximum error is found in the patella, with a difference of 1,24 mm. The average
difference between the automatic segmentation and the intra-observer variability research in
average error is 0,43 mm and in maximum error is 2,85 mm.
38
4 DISCUSSION
4.1 General
Image analysis is becoming an important tool in medicine, especially in diagnosis and medical
decision making. The progress in image analysis has caused an increase of diagnosis on the
basis of imaging technology. Image segmentation of Computed Tomography and Magnetic
Resonance Imaging plays an important role in the image analysis, but is often still done
manually, and by consequence is time-consuming (26). E.g. the time needed for extracting the
articular surfaces of the knee joint was estimated to approximately two days of work and a for
a full lower limb over 3 weeks of work were reported (6).
Because of this time-consuming task, several professionals (engineers and doctors in
medicine) have described different automatic segmentation techniques for image
segmentation. The segmentation technique that gives the most consistent results is the model
based technique, the technique based on using the SSM and ASM. But the disadvantage of
this technique is that there is a need of a large training data set to create an accurate shape
model. Other methods are less accurate in images with a lot of noise or artefacts (35,47,64).
Because of these disadvantages, there is a growing interest in automatic segmentation
programs which are able to process large data sets within minimal time and with a minimal
need of prior knowledge, and therefore a minimal need of human interaction (27).
An overall error of 0,65 mm was found using the current segmentation pipeline. The overall
error in the inter-observer variance was 0,44 mm, where the inter-observer variance are two
manual segmentations and therefore two golden standard segmentations compared to each
other. The absolute difference in overall error between the current segmentation pipeline and
the golden standard is therefore only 0,21 mm.
The current segmentation pipeline has an overall maximum error of 4,05 mm and the inter
observer variance has an overall maximum of 2,29 mm. The absolute difference in the overall
maximum error is therefore 1,76 mm.
On the basis of these results, we can conclude that a very good approximation of golden
standard segmentation can be guaranteed by using our proposed segmentation pipeline.
39
It is important to note that this study is the first of its kind to develop an automatic segmentation
program that segments the full lower limb. Multiple automatic segmentation techniques, which
were developed in the past, are able to segment only a few bones and organs or were
specifically developed to segment joints like the hip joint or knee joint (35,56,65).
4.2 Comparison with previously developed programs
As said above, multiple professionals have already developed semi-automatic or automatic
techniques for making segmentation a less time-consuming task. In Error! Reference source not found., the results of several studies will be presented and a comparison with our
proposed technique will be made.
The following studies are mentioned: Lamecker H. and colleagues (7), Seim H. and colleagues
(8), Almeida D.F. and colleagues (10), Krcah M. and colleagues (27), Uozumi Y. and
colleagues (26), Younes L.B. and colleagues (11), Wu D. and colleagues (9).
In the heading of the table, the following abbreviations are used: � patients stands for the
number of patients, ASD stands for average surface distance, RMS stands for root mean
square, HD stands for Hausdorff distance and OE stands for overlap error.
Lamecker H. et al. proposed a new statistical shape model of the pelvic bone, generated by
manually segmenting 23 CT data sets of male patients. They divided the pelvic bone into 11
regions. Just like our automatic segmentation pipeline, they performed a PCA (Principal
Component Analysis) and determined the main modes of variation. Also, they applied a gray-
value profile analysis where they use the thresholding technique. The results were described
for a “leave all in” segmentation as well as a “leave one out” segmentation. In the first case,
the SSM contains the CT data set that is going to be segmented. The ASD will be a lot smaller
if the SSM already knows the shape of the CT data set that needs to be segmented. Therefore,
these results are not comparable with ours, since our statistical shape model does not contain
any CT data sets that still need to be segmented. On the contrary, every new segmented CT
data set is added to the SSM. The “leave one out” segmentation on the other hand is fitted for
the comparison since the SSM does not know the shape in advance. The mean surface
distance in this study was 1,6 +/- 0,2 mm and the maximum distance was 14,6 +/- 3,8 mm. On
the basis of these results, we can conclude that our automatic segmentation is more accurate
(7).
40
Ta
ble
2: C
ompa
rison
with
pre
viou
sly
deve
lope
d au
tom
atic
seg
men
tatio
n te
chni
ques
41
Seim H. and colleagues presented an algorithm for an automatic segmentation of the pelvis.
Again, this study is based on a statistical shape model. The following three steps are used:
initial placement of the average shape (SSM) of the pelvis within the CT data, adaption of the
SSM to the image by a variation of the shape models and finally a free form deformation step,
which is used to overcome the restrictiveness of the SSM. We can conclude that this proposed
method has a lot of similarities with ours. For the validation, they used 50 manually segmented
CT data sets by performing the “leave one out” method. The results of this proposed automatic
segmentation pipeline were an average surface distance of 0,7 +/- 0,3 mm and a maximum
distance of 16,5 +/- 5 mm. In this study however, the pelvis contains the sacrum. We can
conclude that our research is an extension to their study since they suggested to include an
evaluation of the inter-user variability, which would help to compare the errors made by the
automatic segmentation to those made by professionals and therefore the golden standard.
For the results, their ASD is slightly better than ours with a minimal difference of 0,05 mm, but
the maximum distance is bigger, which makes this study equivalent to ours (8).
Almeida D.F. and colleagues described a new, fully, automatic segmentation pipeline for CT
images of the femur. They used an ASM that is based on a SSM and a LAM (Local Appearance
Model). Almeida D.F. et al. described the results for low-resolution CT scans as well as high-
resolution CT scans. Since we only used high-resolution CT scans in our study, we do not
mention the results of the low-resolution CT scans. 148 datasets were segmented, but only 10
were considered for the comparative analysis (due to the time consuming task of manual
segmentation) and resulted in an average surface distance of 1,014 +/- 0,474 mm and a
Hausdorff distance of 4,336 +/- 0,861 mm. Compared to our automatic segmentation pipeline
results, the ASD of our study is much lower but the HD of this study is with a difference of 0,36
mm slightly better than ours (10).
Krcah M. et al. proposed a fully automatic segmentation technique of the femur. Basically, they
worked with a threshold-like technique (graph-cuts and a bone boundary filter) and in the last
step they separated the individual bones, since working with thresholding, the risk of leakage
to adjacent bones exists. They performed this automatic segmentation on 197 femurs and had
as result an average Hausdorff distance of 5,4 mm. With a difference of 0,6 mm, we can
conclude that our automatic segmentation pipeline is more accurate (27).
Uozumi Y. et al. described an automatic bone segmentation method which was tested on six
patients. They worked in three different steps: the segmentation between the femur and the
tibia, the segmentation of the femur and the patella and the segmentation of the tibia and fibula.
Their main segmentation method is an automatisation of the thresholding technique and they
42
too, compare the segmentations, performed by the automatic segmentation program to the
manual segmentation or the golden standard segmentation. But due to their use of matching
rate (in %) as validation metric, the comparison with our proposed method is not useful.
However, it seemed to be an added value to mention their results, as their study showed good
results and was published recently (2013). The matching rates were 95,84 +/- 0,57 % for the
femur, 94,12 +/- 1,01 % for the tibia, 94,49 +/- 0,83 % for the patella and 86,37 +/- 4,28 % for
the fibula. Further extension of our results to the DSI (Dice similarity index) would be necessary
to compare these results (26).
Younes L.B. and colleagues presented a fully automatic segmentation of the femur using SSM.
The first step was a primitive shape recognition. The presented method is divided into three
steps: detection of the femoral head and the femoral shaft as a sphere and as a cylinder
respectively, registration between primitive shapes of the SSM and CT image to initialise the
SSM into the image and at last, the fitting of the SSM tot the CT image. 8 CT data sets were
segmented and had as result an average surface distance of 1,48 +/- 0,28 mm and a maximum
error of 10,53 +/- 3,19 mm. Both values are much higher than ours, which makes this technique
not the most accurate one (11).
At last, Wu D. et al. proposed an automatic segmentation method, based on a combination of
different techniques. First, they used a marginal space learning for bone detection and
deformed it with a SSM. Second, the deformed model was used as a shape prior in a graph
cut for refined segmentation. At last, a multi-layer graph cut is used because there was a need
for segmenting each bone separately since it is possible that their results overlap. They tested
their proposed method on 248 data sets and the following results were obtained: an average
surface distance of 0,82 +/- 0,33 mm for the femur, an ASD of 0,69 +/- 1,25 mm for the tibia,
an ASD of 0,96 +/- 4,29 mm for the fibula and an ASD of 0,68 +/- 2,06 mm for the patella.
When we compare these results to ours, we can conclude that our automatic segmentation
pipeline is more accurate (9).
Furthermore, many other studies were not included in this comparison since they only
segmented a part of the bone. Often, those studies focused on segmenting joints like the hip
joint or the knee joint. Therefore, their results are not comparable with ours. Still, a short
summary of their results is made since these studies had a big impact on other articles. The
segmentation methods will not be extensively mentioned as these studies will not be used for
comparison.
43
Chu C. et al proposed a fully automatic CT segmentation method for the hip joint. The main
method is the use of a SSM and for validation, the manual segmentation was taken as the
golden standard. An ASD of 0,52 +/- 0,10 mm for the pelvis, an ASD of 0,45 +/- 0,10 mm for
the left proximal femur and an ASD of 0,48 +/- 0,08 mm for the right proximal femur, were
obtained as results (56). Secondly, Cheng Y. et al described an automatic segmentation
technique for the acetabulum and the femoral head. Their method is a combination of
thresholding and an iterative process based on neighbourhood information with an average
ASD of 1,22 +/- 0,98 mm and a DSI of 91,55 +/- 4,82 % as result (35). Thirdly, Ramme A.J.
and colleagues presented a semi-automatic segmentation method for the knee joint. They
tested their method on the distal femur and the proximal tibia of 72 CT data sets. The following
results are obtained: A DSI of 95% for the femur as well as the tibia (66). Further, Zoroofi R.A.
et al. presented an automatic segmentation method for the hip joint (acetabulum and femoral
head), using 60 CT data sets. They used a combination of different techniques including
thresholding and obtained the following result: an ASD of 1,31 +/- 1,12 mm and a DSI of 90,36
+/- 5,31% (35,65). Moreover, Yokota F. and colleagues proposed an automated segmentation
method for a diseased hip. They made use of different SSM’s and had an ASD of 1,49 +/- 1,04
mm and a DSI of 90,14 +/- 1,95 % as results (35,64).
The only study found researching an automatic segmentation technique for the calcaneus was
the study of Görres J. and colleagues. They performed a segmentation of two calcaneal
surfaces on the basis of an ASM on 50 cases and have shown an ASD of 0,59 and 0,46 mm
respectively (67).
Unfortunately, there is no literature found on the automatic segmentation of the full calcaneus
and the talus. Thus, the results of our calcaneus and talus segmentation are not mentioned in
Error! Reference source not found. since comparison is not possible.
After comparing the results of the previously mentioned studies, which are the most important
studies comparable with ours, we can conclude that our proposed automatic segmentation
method is the most accurate one with the best results. Our segmentation method is similar to
the one used by Seim H. et al. (8). Their results were also close to ours, what makes this
technique the most promising one.
44
4.3 Limitations
This study was confronted with some limitations that have to be taken into account. Firstly, the
statistical shape models that are used (the low and high resolution statistical shape models)
contain in the beginning only a couple of CT data sets. As explained above, the SSM obtains
more accuracy by adding more CT data sets. After all, the ASM is a measurement of the mean
shape of the CT data sets that are included in the training data set. And it is logical that in case
of a large training data set, the average shape will have a more accurate shape, that ‘knows’
a lot of abnormalities and variations of the normal anatomy and therefore can be used for the
whole population. In case of a small training data set, the ASM will not be able to fit in a CT
data set with abnormalities, which is not included in the training data set. Since the program
adds every new segmented CT data set to the ASM, the ASM will become more accurate when
more CT data sets are segmented by the pipeline.
Secondly, the small number of manually segmented CT data sets is also a limitation for the
validation of our program. The manual segmentation is a very time-consuming task and for
validation there is a need for many manual segmentations, as the comparison between the
manual and the automatic segmentation are the basis of the validation. Because of the time-
consuming nature of this work, only a few full lower limb segmentations were performed
manually. Therefore, it was only possible to perform a global estimate of accuracy. Regional
analysis could not be performed, since the number of manual segmentations was too low for
this type of analysis. In the end, all other studies found, did not perform a regional analysis
either.
Finally, to compare our newly developed program to previously developed automatic
segmentations techniques, a literature study has been performed. Multiple studies that
included the automatic segmentation of the pelvis and femur were found. But on the other
hand, a very small number of studies were found that included the tibia, the fibula and the
patella. Not one study was found relating to the automatic segmentation of the calcaneus, talus
or the full lower limb. For this reason, a comparison with other programs could not be
performed optimally.
4.4 Suggestions for further research
The current automatic segmentation technique can be extended to the full human skeleton.
This study only includes the lower limb, but could be extended to the upper limb, thorax and
skull. Also, the automatization of segmentation is not only useful in orthopaedics, but can be
45
used in gastroenterology, cardiology, pneumology, gynaecology, neurology, oncology, etc.
This extension would imply that different statistical shape models must be created for each
organ, bone or tissue and for the validation, a lot of manual segmentations must be performed
in order to compare them to the automatic segmentation. As well as the creation of the
statistical shape model as the manual segmentation are time-consuming tasks and therefore
the extension of this program will cost a lot of time. But if the same results can be obtained,
segmentation and therefore analysis of the whole human body will be easier and quicker.
In this study, only the CT is included. It would be interesting to expand the segmentation
pipeline to other imaging techniques in particular Magnetic Resonance Imaging technology.
MRI is currently the most used imaging technology for the knee joint and the spine. However,
MRI will confront the researcher with some new challenges due to the different nature of
contrast intensity. The fact that one could start with elaborate SSM’s based on CT images
could facilitate the step towards to MRI segmentation.
46
5 CONCLUSION
The interest in automatic segmentation techniques is rising all over the world in various medical
specialties. Image segmentation makes it possible to see the region of interest in 3D, which
yields a lot more information than the 2D image obtained from CT or MRI. The need of
automatic segmentation techniques is high, since manual image segmentation is a time-
consuming task that can only be performed by trained professionals. With an average surface
distance ranging from 0,53 mm to 0,76 mm and a Hausdorff distance ranging from 2,01 mm
to 7,84 mm, the accuracy of our automatic segmentation technique has been illustrated.
Furthermore, the errors made by the automatic segmentation technique were compared to the
errors made by the manual segmentation or golden standard, resulting in a minimal average
difference in error of 0,21 mm.
Several scientists and engineers presented their own automatic segmentation techniques and
described an average surface distance ranging from 0,4 mm to 5,4 mm (10,12,13,27,35,56).
The ASD differs according to the different techniques, number of patients and quality of CT
scan, which makes comparison difficult. However, taking the differences into account, an
attempt to compare various studies with ours is made. We can conclude that our method has
highly competitive results and therefore is the most accurate one. Besides the study of Seim
H. and colleagues (8), neither one of the mentioned studies presented results similar to ours.
On one hand, it is important to note that this study is the first of its kind developing an automatic
segmentation technique for the entire skeleton of the lower limb. Previous studies focused on
one bone or on a joint only (56,64,65). On the other hand, further research can be performed.
For example, extension to MRI would be interesting. Also, the extension to the full skeleton
and to other specialties like cardiology and neurology would be an added value.
This study, with several advantages of automatic segmentation, can even be of use in other
specialties and even in non-medical professions like crime investigation (e.g. face and finger
print recognition) (32). The automatization of image segmentation has a lot of advantages for
professionals, since time is always a precious element. It makes transformation to 3D images
possible and therefore it can facilitate operations or it can allow for a diagnosis in an earlier
stage.
47
6 REFERENCES
1. Schiefsky MJ. Hippocrates On ancient medicine. Brill; 2005. 415 p. 2. O’malley CD. ANDREAS VESALIUS 1514-1564: IN MEMORIAM. Med Hist. 1964
Oct;8:299–308. 3. Shapiro L, Stockman G. Computer Vision. October. 2001. 608 p. 4. Saini S, Arora K. A Study Analysis on the Different Image Segmentation Techniques.
Int J Inf Comput Technol. 2014;4(14):1445–52. 5. Wikipedia, the free encyclopedia: Image segmentation [Internet]. [cited 2017 Oct 10].
Available from: https://en.wikipedia.org/wiki/Image_segmentation 6. Henak CR, Anderson AE, Weiss JA. Subject-Specific Analysis of Joint Contact
Mechanics: Application to the Study of Osteoarthritis and Surgical Planning. J Biomech Eng. 2013 Feb 11;135(2):21003.
7. Lamecker H, Seebass M, Hege H-C, Deuflhard P. A 3D statistical shape model of the pelvic bone for segmentation. In 2004. p. 1341.
8. Seim H, Kainmueller D, Heller M, Lamecker H, Zachow S, Hege H-C. Automatic Segmentation of the Pelvic Bones from CT Data Based on a Statistical Shape Model. In: Proceedings of the First Eurographics conference on Visual Computing for Biomedicine. 2008. p. 93–100.
9. Wu D, Sofka M, Birkbeck N, Zhou SK. Segmentation of multiple knee bones from CT for orthopedic knee surgery planning. Med Image Comput Comput Assist Interv. 2014;17(Pt 1):372–80.
10. Almeida DF, Ruben RB, Folgado J, Fernandes PR, Audenaert E, Verhegghe B, et al. Fully automatic segmentation of femurs with medullary canal definition in high and in low resolution CT scans. Med Eng Phys. 2016 Dec;38(12):1474–80.
11. Younes BL, Nakajima Y, Saito T. Fully automatic segmentation of the Femur from 3D-CT images using primitive shape recognition and statistical shape models. Int J Comput Assist Radiol Surg. 2014 Mar 8;9(2):189–96.
12. Almeida D, Folgado J, Fernandes PR, Ruben RB. 3D shape prior active contours for an automatic segmentation of a patient specific femur from a CT scan. In: Computational Vision and Medical Image Processing IV. 2014. p. 271–5.
13. Liu B, Zhang H, Hua S, Jiang Q, Huang R, Liu W, et al. An automatic segmentation system of acetabulum in sequential CT images for the personalized artificial femoral head design. Comput Methods Programs Biomed. 2016;127:318–35.
14. Kishner S, Chowdhry M. Hip Joint Anatomy: Overview, Gross Anatomy [Internet]. 2015 [cited 2017 Sep 30]. Available from: http://emedicine.medscape.com/article/1898964-overview#a2
15. Kishner S, Courseault J, Authement A. Knee Joint Anatomy: Overview, Gross Anatomy, Natural Variants [Internet]. 2015 [cited 2017 Sep 30]. Available from: http://emedicine.medscape.com/article/1898986-overview#a2
16. Kerckaert I. Locomotorisch stelsel en huid, Deel III: Osteologie. Gent: Academia Press; 2013. 58 p.
17. D’Herde K, Schoemaker E, Veening J. Sesam Atlas van de anatomie deel 1: Bewegingsapparaat. 10th ed. Amersfoort: ThiemeMeulenhoff; 2012. 467 p.
18. Paulsen F, Waschke J. Sobotta Atlas of Human Anatomy: General anatomy and musculoskeletal system. 15th ed. Munich: Elsevier GmbH,Urban & Fischer; 2011. 400 p.
19. Chapter 1: Introduction to CT physics [Internet]. 2004 [cited 2017 Oct 19]. p. 16. Available from: https://web.archive.org/web/20070926231241/http://www.intl.elsevierhealth.com/e-books/pdf/940.pdf
20. Understanding How MRI, CT and X-ray Are Different | CDI, Insight Imaging & Affiliated Providers [Internet]. [cited 2017 Oct 2]. Available from: https://www.mycdi.com/knowledge_center/difference_between_mri_ct_and_x-ray/
48
21. Computed Tomography (CT) | National Institute of Biomedical Imaging and Bioengineering [Internet]. [cited 2017 Oct 2]. Available from: https://www.nibib.nih.gov/science-education/science-topics/computed-tomography-ct
22. Thurston M, Nadrljanski MM. Computed tomography | Radiology Reference Article | Radiopaedia.org [Internet]. [cited 2017 Oct 3]. Available from: https://radiopaedia.org/articles/computed-tomography
23. Seeram E. Computed tomography : physical principles, clinical applications, and quality control. 487 p.
24. X-rays | National Institute of Biomedical Imaging and Bioengineering [Internet]. [cited 2017 Oct 2]. Available from: https://www.nibib.nih.gov/science-education/science-topics/x-rays
25. Matej “Retro” J. Pixels and voxels, the long answer – Retronator Magazine – Medium [Internet]. 2014 [cited 2017 Oct 3]. Available from: https://medium.com/retronator-magazine/pixels-and-voxels-the-long-answer-5889ecc18190
26. Uozumi Y, Nagamune K. An automatic bone segmentation method based on anatomical structure for the knee joint in MDCT image. In: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE; 2013. p. 7124–7.
27. Krcah M, Szekely G, Blanc R. Fully automatic and fast segmentation of the femur bone from 3D-CT images with no shape prior. In: 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro. IEEE; 2011. p. 2087–90.
28. Hosseini M-P, Nazem-Zadeh MR, Pompili D, Soltanian-Zadeh H. Statistical validation of automatic methods for hippocampus segmentation in MR images of epileptic patients. Conf Proc . Annu Int Conf IEEE Eng Med Biol Soc IEEE Eng Med Biol Soc Annu Conf. 2014;2014:4707–10.
29. Gonzalez RC, Woods RE. Digital image procession. 3rd ed. Horton Marcia J, McDonald Michael, Dworkin Alice, Disanno Scott, Kernan Rose, editors. Pearson Prentice Hall; 2008. 943 p.
30. Zaitoun NM, Aqel MJ. Survey on Image Segmentation Techniques. Procedia Comput Sci. 2015;65:797–806.
31. Umaa Mageswari S, Sridevi M, Mala C. An Experimental Study and Analysis of Different Image Segmentation Techniques. Procedia Eng. 2013;64:36–45.
32. Dass R, Priyanka, Devi S. Images Segmentation Techniques. IJECT. 1993;3(1):66–70. 33. Kang W-X, Yang Q-Q, Liang R-P. The Comparative Research on Image Segmentation
Algorithms. In: 2009 First International Workshop on Education Technology and Computer Science. IEEE; 2009. p. 703–7.
34. Pham DL, Xu C, Prince JL. Current Methods in Medical Image Segmentation. Annu Rev Biomed Eng. 2000 Aug;2(1):315–37.
35. Cheng Y, Zhou S, Wang Y, Guo C, Bai J, Tamura S. Automatic segmentation technique for acetabulum and femoral head in CT images. Pattern Recognit. 2013 Nov;46(11):2969–84.
36. Saleh Al-Amri S, Kalyankar N V, D KS. Image Segmentation by Using Thershod Techniques. J Comput. 2010;2(5):2151–9617.
37. Otsu N. A Threshold Selection Method from Gray-Level Histograms. IEEE Trans Syst Man Cybern. 1979 Jan;9(1):62–6.
38. Daniel P, Raju R, Neelima G. Image Segmentation by using Histogram Thresholding. IJCSET. 2012;2(1):776–9.
39. Zhang J, Yan C-H, Chui C-K, Ong S-H. Fast segmentation of bone in CT images using 3D adaptive thresholding. Comput Biol Med. 2010 Feb;40(2):231–6.
40. Pal NR, Pal SK. A REVIEW ON IMAGE SEGMENTATION TECHNIQUES. Pattern Recoynition. 1993;26(9):1277–94.
41. Kang Y, Engelke K, Kalender WA. A new accurate and precise 3-D segmentation method for skeletal structures in volumetric CT data. IEEE Trans Med Imaging. 2003 May;22(5):586–98.
42. Jacobs D. Image Gradients: Class Notes for CMSC 426. 2005 p. 3.
49
43. Kaur N, Singh J, Sharma V. Analysis and Comprehensive Study: Image Segmentation Techniques. 2015;2321–9653.
44. Heena A, Biradar N, Maroof NM. A Novel Approach to Review Various Image Segmentation Techniques. Int J Innov Res Comput Commun Eng An ISO Certif Organ. 2007;32975(2).
45. Khan AM, Ravi S. Image Segmentation Methods: A Comparative Study. Int J Soft Comput Eng. 2013;3(34):2231–307.
46. Sarkalkan N, Weinans H, Zadpoor AA. Statistical shape and appearance models of bones. Bone. 2014 Mar;60:129–40.
47. Heimann T, Meinzer H-P. Statistical shape models for 3D medical image segmentation: A review. Med Image Anal. 2009 Aug;13(4):543–63.
48. Cootes TF, Taylor CJ. Statistical Models of Appearance for Computer Vision. 2004. 49. Cootes TF, Taylor CJ, Cooper DH, Graham J. Active Shape Models - Their Training and
Application. Comput Vis image Underst. 1995;61(1):35–59. 50. Morse BS. Lecture 21: Image Understanding. 2000 p. 6. 51. Pichumani R. Snakes: an active model [Internet]. 1997 [cited 2017 Oct 25]. Available
from: http://homepages.inf.ed.ac.uk/rbf/CVonline/LOCAL_COPIES/RAMANI1/node31.html
52. Kass M, Witkin A, Terzopoulos D. Snakes: Active contour models. Int J Comput Vis. 1988;321–31.
53. Bebis G. Deformable/Active Contours (or Snakes). 54. Khelif M, Derraz F, Beladgham M. Application of Active Contour Models in Medical
Image Segmentation. Int Conf Inf Technol Coding Comput 2004 Proceedings ITCC 2004. 2004;2:675–81.
55. Xu C, Prince JL. Gradient Vector Flow: A New External Force for Snakes. IEEE Proc Conf Comp Vis Patt Recog. 1997;66–71.
56. Chu C, Chen C, Liu L, Zheng G. FACTS: Fully Automatic CT Segmentation of a Hip Joint. Ann Biomed Eng. 2015 May 4;43(5):1247–59.
57. Yeghiazaryan V, Voiculescu I. An Overview of Current Evaluation Methods Used in Medical Image Segmentation. 2015;21.
58. 3D Medical Image Processing Software | Materialise Mimics [Internet]. [cited 2017 Nov 1]. Available from: http://www.materialise.com/en/medical/software/mimics
59. Paz ICL A, Ldg B, Bruno L. Bone mineral density: review. Brazilian J Poult Sci Rev Bras. 2006;8(2):69–73.
60. McClung MR. The relationship between bone mineral density and fracture risk. Curr Osteoporos Rep. 2005 Jun;3(2):57–63.
61. Assessment of Fracture Risk [Internet]. 2005 [cited 2017 Nov 20]. Available from: https://www.medscape.org/viewarticle/506083_2
62. Weisstein EW. Root-Mean-Square [Internet]. Wolfram Research, Inc.; [cited 2017 Oct 20]. Available from: http://mathworld.wolfram.com/Root-Mean-Square.html
63. Huttenlocher DP, Klanderman GA, Rucklidge WJ. Comparing Images Using the Hausdorff Distance. IEEE Trans Pattern Anal Mach Intell. 1993;15(9):850–63.
64. Yokota F, Okada T, Takao M, Sugano N, Tada Y, Sato Y. Automated segmentation of the femur and pelvis from 3D CT data of diseased hip using hierarchical statistical shape model of joint structure. Med Image Comput Comput Assist Interv. 2009;12(Pt 2):811–8.
65. Zoroofi RA, Sato Y, Sasama T, Nishii T, Sugano N, Yonenobu K, et al. Automated segmentation of acetabulum and femoral head from 3-D CT images. IEEE Trans Inf Technol Biomed. 2003 Dec;7(4):329–43.
66. Ramme AJ, Criswell AJ, Wolf BR, Magnotta VA, Grosland NM. EM segmentation of the distal femur and proximal tibia: A high-throughput approach to anatomic surface generation. Ann Biomed Eng. 2011;39(5):1555–62.
67. Görres J, Brehler M, Franke J, Vetter SY, Grützner PA, Meinzer HP, et al. Articular surface segmentation using active shape models for intraoperative implant assessment. Int J Comput Assist Radiol Surg. 2016;11(9):1661–72.