eprintid: 10205839 rev_number: 7 eprint_status: archive userid: 699 dir: disk0/10/20/58/39 datestamp: 2025-03-11 11:48:32 lastmod: 2025-03-11 11:48:32 status_changed: 2025-03-11 11:48:32 type: article metadata_visibility: show sword_depositor: 699 creators_name: Liu, W creators_name: Tsamados, M creators_name: Petty, A creators_name: Jin, T creators_name: Chen, W creators_name: Stroeve, J title: Enhanced sea ice classification for ICESat-2 using combined unsupervised and supervised machine learning ispublished: pub divisions: UCL divisions: B04 divisions: C06 divisions: F57 keywords: ICESat-2; Sea ice classification; Lead detection note: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions. abstract: ICESat-2 provides the potential for high-resolution and accurate measurements of the sea ice state. However, the current ATL07 sea ice height and type product relies on a threshold method for surface type classification, which introduces uncertainties in lead detection, especially in summer. In addition, it only categorizes into sea ice and lead types, excluding gray ice and the dark lead category has been shown to misclassify leads in cloudy conditions. To address these issues, we seek to improve the surface type classification by combining unsupervised and supervised machine learning methods and leveraging coincident imagery obtained from Sentinel-2. First, we use an unsupervised Gaussian Mixture Model (GMM) with four statistical parameters—photon rate, background rate, width of distribution, and height—to group ATL07 segments into 80 clusters. These clusters are then assigned specific surface types—sea ice, gray ice, or lead—based on coincident Sentinel-2 imagery. In the second step, we train a supervised K-nearest neighbor (KNN) classification model using the labeled segments from the GMM as training data. We conduct Leave One Group Out cross-validation of our model using coincident Sentinel-2 images as the ground truth, analyzing 717,009 strong beam and 702,843 weak beam ATL07 segments. The results demonstrate an improvement in lead detection, with precision values reaching approximately 98.6 % for strong beams and 97.5 % for weak beams and recall values of 91.8 % for strong beams and 90.3 % for weak beams. Our approach is applied to both Antarctic and Arctic sea ice, and is extended to include a new gray ice category, which agrees reasonably well with the coincident Sentinel-2 images. Our new sea ice and lead classification approach shows great promise for improving sea surface height and sea ice freeboard retrievals from ICESat-2 and highlights the significant value of coincident satellite imagery for classification training and validation. date: 2025-03 date_type: published publisher: ELSEVIER SCIENCE INC official_url: https://doi.org/10.1016/j.rse.2025.114607 full_text_type: other language: eng verified: verified_manual elements_id: 2357509 doi: 10.1016/j.rse.2025.114607 lyricists_name: Tsamados, Michel lyricists_name: Stroeve, Julienne lyricists_id: MCATS10 lyricists_id: JSTRO73 actors_name: Tsamados, Michel actors_id: MCATS10 actors_role: owner funding_acknowledgements: 42388102 [National Natural Science Foundation of China]; 42192531 [National Natural Science Foundation of China]; 42192533 [National Natural Science Foundation of China]; [Fundamental Research Funds for the Central Universities]; [China Scholarship Council] full_text_status: restricted publication: Remote Sensing of Environment volume: 318 article_number: 114607 pages: 21 citation: Liu, W; Tsamados, M; Petty, A; Jin, T; Chen, W; Stroeve, J; (2025) Enhanced sea ice classification for ICESat-2 using combined unsupervised and supervised machine learning. Remote Sensing of Environment , 318 , Article 114607. 10.1016/j.rse.2025.114607 <https://doi.org/10.1016/j.rse.2025.114607>. document_url: https://discovery.ucl.ac.uk/id/eprint/10205839/1/Manuscript_clean%20revision%20%281%29.pdf