Upload
others
View
7
Download
0
Embed Size (px)
Citation preview
I
1
RADIO GALAXY CLASSIFICATION
Classifying Radio Galaxies using Conventional Computer Vision and Machine Learning
Burger Becker(Masters Student)
Dr Trienko Grobler(Supervisor)
Computer Science Department
WHo ARE WE?
3
● Radio Galaxies have low visible light emission
● Require radio telescopes to be observed, such as the SKA
What is a Radio Galaxy?
4
Fanaroff-Riley Classification
Red is “hot”
FRI on the left, hotspots near the centre
FRII on the right, hotspots near the edges
5
Fanaroff-Riley Ratio
6
● R = B/A
● R < 0.5 is FRI
● R > 0.5 is FRII
Why is this problem relevant?● MeerKlass survey is expected to find upwards of 200 000 radio
sources
● 300 PB of data generated from the SKA per year (once it’s functional)
● Higher resolution than previous surveys on a larger data set, better morphological features identifiable
● ASKAP is expected to find around 70 million radio sources
7
Research Process
8
Preprocessing of Images
9
Preprocessing of images
10
Separating Background with k-means
11Clustering before preprocessing Clustering after preprocessing
Automatic Ratio Extraction
12
● Did not generalise well on all the data
● Harder problem than expected due to hotspots having large variability in position and brightness
● Not all sources have central hotspots
Why doesn’t it work?
13
Why doesn’t it work?
14
Why doesn’t it work?
15
Manual Ratio Calculation
16
Calculated the ratio of a 148 sources (70 FRI and 78 FRII)
Then performed automatic feature extraction on these sources
Lobe area, brightest hotspot and number of lobes were extracted automatically
Features
17
Results of ClassifierRandom Forest consistently gives the best results
18
Results of Classifier
19
Results of Classifier
20
Using all 4 features Using 3 featuresVery small sample
Is the FR-ratio still relevant?
21
Ratio at 0.311 Ratio at 0.31Yes!
We can make no assumptions regarding what the new ratio would be: 1. sample is too small 2. method is not standardized
Garbage in Garbage Out
22
● Set up a classifier on the same data as used by Wathela et al. (2018)
● Has an additional classes for Bent tails and Compact sources that “pollute” our original data
Wathela et al. (2018) Our Results on Random Forest
Results of Classifier
23
Mean accuracy of 83 (± 1.2)%
Area importance 0.43, Hotspot Brightness 0.52, Lobe area
0.05
Results of Classifier● Removing Number of Lobes as a feature still has a high
accuracy
● Accuracy of 81.7 (±1)%
24
Conclusion● Conventional techniques are a feasible alternative to
Deep Learning, especially in terms of set up time.
● The FR-ratio can potentially be used as a feature
● Be wary of blindly trusting Deep Learning methods without properly examining the feature space
25
Question to the Audience
Better method of automatic Ratio Extraction?
Preprocessing of images
28
The Curve Ball
29
● Bent tail radio galaxies do not have the linear structure of FRI/FRII type galaxies.
● This leads us to believe the underlying physical mechanism of the central galactic engine is different from that of the other two classes
● Visually distinct class
Results: Accuracy
30
Results: What does it get wrong?
31
Credits