--- layout: single title: "Primate Subsegment Sorting" categories: research tags: bioacoustics audio-classification deep-learning data-labeling signal-processing excerpt: "Binary subsegment presorting improves noisy primate sound classification." header: teaser: /assets/figures/19_binary_primates_teaser.jpg scholar_link: "https://scholar.google.de/citations?user=NODAd94AAAAJ&hl=en" ---  {:style="display:block; width:40%" .align-right} Automated acoustic classification plays a vital role in wildlife monitoring and bioacoustics research. This study introduces a sophisticated pre-processing and training strategy to significantly enhance the accuracy of multi-class audio classification, specifically targeting the identification of different primate species from field recordings. A key challenge in bioacoustics is dealing with datasets containing weak labels (where calls of interest occupy only a portion of a labeled segment), varying segment lengths, and poor signal-to-noise ratios (SNR). Our approach addresses this by: 1. **Subsegment Analysis:** Processing audio recordings represented as **MEL spectrograms**. 2. **Refined Labeling:** Meticulously **relabeling subsegments** within the spectrograms. This "binary presorting" step effectively identifies and isolates the actual vocalizations of interest within longer, weakly labeled recordings. 3. **CNN Training:** Training **Convolutional Neural Networks (CNNs)** on these refined, higher-quality subsegment inputs. 4. **Data Augmentation:** Employing innovative **data augmentation techniques** suitable for spectrogram data to further improve model robustness.