9
1 FIRST APPROACH TO UNSUPERVISED CLASSIFICATION OF BROADBAND FISHERIES ACOUSTIC DATA FOR ECOSYSTEM MONITORING ALICIA MAURICE 1 , LAURENT BERGER 2 , MATHIEU DORAY 1 1: IFREMER,UNITÉ ECOLOGIE ET MODÈLES POUR L'HALIEUTIQUE, NANTES ²:IFREMER,IMN/NSE/A, BREST

Présentation PowerPoint - Weamec

Embed Size (px)

Citation preview

1

F I R S T A P P R O A C H T O

U N S U P E R V I S E D

C L A S S I F I C AT I O N O F

B R O A D B A N D F I S H E R I E S

A C O U S T I C D ATA F O R

E C O S Y S T E M M O N I T O R I N G

A L I C I A M A U R I C E 1 , L A U R E N T B E R G E R 2 ,

M A T H I E U D O R A Y 1

1 : I F R E M E R , U N I T É E C O L O G I E E T M O D È L E S

P O U R L ' H A L I E U T I Q U E , N A N T E S

² : I F R E M E R , I M N / N S E / A , B R E S T

Introduction

• Increase of broadband acoustic images (BAI) collection• Detailed spectral information• But curse of dimensionality, limited availability of labeled

samples and taxa mixing• Unsupervised classification approaches to BAI big data

challenge• New methodologies, (open) softwares, (super) computers and

common reference datasets for benchmarking• Hyperspectral image community legacy• First results of unsupervised classifications of BAI near an

offshore windmill in the Bay of Biscay (BoB, France,EchoSonde project)

16/04/2021 2

Material and methods• Software / hardware

• Python / py_movies3D package / datarmorHPC supercomputer

• Methods• Fine scale echo-integration (1m vert / 3m

horiz) at low threshold (-90dB)• Spatial-spectral classification• Several clustering and dimension reduction

techniques (sklearn library and Rasti et al1) onabsolute MVBS

• Data• BoB « echoscapes » (validated typical

frequency spectra)• PHOENIX2018 survey BAIs Day/Night data

acquisition around offshore windmillinstallation

• Platform and echosounders• Thalassa vessel fitted with Simrad EK80 (6 main

frequencies: 18kHz (CW), 38kHz, 70kHz,120kHz, 200kHz and 333kHz (FM))

• Pings emitted sequentially, 2 sec for a completecycle

16/04/2021 31 Feature Extraction for Hyperspectral Imagery by Rasti et al, IEEE Geoscience and Remote Sensing Magazine, 2020

Datarmor HPC supercomputer

Results (1): « echoscape » clusteringObj: separate BoB typical « echoscapes »

• Few reference labeled broadband acoustic datasets available=>use of unsupervised learning methods

• K-Means method fast and widely used but major drawbacks=>look for other methods to cluster the data

• Firstly, test of the efficiency of those methods on a set of validated in situ data (modeled by forward approach)=>8 clusters are to be identified 1

16/04/2021 41 Broadband acoustic evidences of the mesoscale distribution of gas-bearing siphonophores in the eutrophic Bay of Biscay, A. BLANLUET, IFREMER, 2019

Results (1): « echoscape » clusteringObj: separate BoB typical « echoscapes »

• Best compromise between quality of the results and computation time=>ModifiedLocally Linear Embedding (LLE) dimension reduction technique 1 + Kmeans clusteringmethod

• All clusters are correctly identified with this method

16/04/2021 51 LLE: Modified Locally Linear Embedding Using Multiple Weights, Zhang Z. & Wang L., MIT Press, 2007

Results (2): BAI clusteringObj: segment echogram to isolate backscatters

• Small fish schools are completely isolated when running LLE+Kmeans

• A bias occurs for very small fish schools (because of two main factors: the position of the transducers on the vessel and the insonification sequence) => Confidence intervals are then essential

16/04/2021 6A 10 metre-long fishschool

Results (3)Obj: segment echogram to isolate backscatters

• Identification of the thermocline when running LLE+Kmeans

16/04/2021 7

Discussion• Similarities with hyperspectral images processing method review /

libraries=>need for shared reference labeled datasets and (open) frameworks to benchmark classification methods: « echoscapes » + BAIs

• Clustering on absolute MVBS as backscatter level discriminant

• Kmeans computationaly efficient, but equally-sized clusters

• LLE+Kmeans performed better than Kmeans:

• spatial information accounted for in LLE: fish school separation, thermocline mapping ...

• dimension reduction to alleviate the curse of dimensionality: improved clustering and computation time

16/04/2021 8

Conclusions• Python + MOVIES3D + Datarmor + spatial-spectral

ordination/clustering to tackle BAI big data challenge

• EchoSonde dataset:

• vessel-borne BAI to be combined with BAIs from acoustic-optic profiler (WBTTUBE) and physical, optical and biological data

• LLE+Kmeans method:

• Test on larger dataset (CW dataset)

• Shallow approach promising, unsupervised deep learning will be tested for potential improvements

16/04/2021 9

This work was carried out within the framework of the WEAMEC (West Atlantic Marine Energy Community) EchoSonde project, with funding from the Pays de la Loire Region.