Spectrogram
Numerical Analysis of Two Side-by-Side Flexible Filaments in Shear Flow and Orbit Class Prediction using Immersed Boundary Method and Machine Learning Techniques Open
This study utilizes a second-order immersed boundary method (IBM) for investigating the orbital dynamics of two flexible, inextensible filaments placed side-by-side in low-Reynolds-number shear flows. A parametric analysis explores the eff…
Low-SNR Northern Right Whale Upcall Detection and Classification Using Passive Acoustic Monitoring to Reduce Adverse Human–Whale Interactions Open
Marine mammal vocalizations, such as those of the Northern Right Whale (NARW), are often masked by underwater acoustic noise. The acoustic vocalization signals are characterized by features such as their amplitude, timing, modulation, dura…
Enhancing underwater acoustic orthogonal frequency division multiplexing based channel estimation: a robust convolution-recurrent neural network framework with dynamic signal decomposition Open
Underwater acoustic (UWA) communication systems confront significant challenges due to the unique, dynamic, and unpredictable nature of acoustic channels, which are impacted by low signal-to-noise ratio (SNR), severe multipath…
AIRCRAFT CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS ON RCS DATA AND ISAR IMAGES Open
SONAR: Spectral-Contrastive Audio Residuals for Generalizable Deepfake Detection Open
Deepfake (DF) audio detectors still struggle to generalize to out-of-distribution inputs. A central reason is spectral bias, the tendency of neural networks to learn low-frequency structure before high-frequency (HF) details, which both ca…
IMR – Magnetometer Rmag Validation Open
This technical report presents the full validation of the magnetic coherence index Rmag within the IMR (Reflective Multidimensional Intelligence) framework, using real magnetometer data recorded across three cognitive conditions (Neutral / …
A Deep Learning Framework for Audio Data Augmentation to Promote Linguistic Diversity Open
In language technology, sustaining linguistic diversity is a critical challenge due to the lack of sufficient speech data for underrepresented languages and dialects. This paper addresses this scarcity by proposing a generative audio model…
Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model Open
Extracting individual elements from music mixtures is a valuable tool for music production and practice. While neural networks optimized to mask or transform mixture spectrograms into the individual source(s) have been the leading approach…
BERT-APC: A Reference-free Framework for Automatic Pitch Correction via Musical Context Inference Open
Automatic Pitch Correction (APC) enhances vocal recordings by aligning pitch deviations with the intended musical notes. However, existing APC systems either rely on reference pitches, which limits their practical applicability, or employ …
SEA-Bird: A Machine Learning–Ready Bird Sound Dataset for Ten Common Southeast Asian Species Open
The SEA-Bird dataset is a curated, machine-learning–ready collection of avian vocalizations from ten bird species that are among the most commonly found in Malaysia. Recordings are sourced primarily from Southeast Asia, with a small number…
First Deep Learning Approach to Hammering Acoustics for Stem Stability Assessment in Total Hip Arthroplasty Open
Audio event classification has recently emerged as a promising approach in medical applications. In total hip arthroplasty (THA), intra-operative hammering acoustics provide critical cues for assessing the initial stability of the femoral …
Ulysses Unified Radio and Plasma wave (URAP) Daily Color Dynamic Spectrogram Plot Open
From URAP Users Notes: Guide To The Archiving Of Ulysses Radio And Plasma Wave Data, by Roger Hess, Robert MacDowall, Denise Lengyel-Frey. March 15, 1995 - version 1.0; revised March 24, 1999 - version 1.1; revised June 8, 1999 - version 1.2. T…
Dynamic Multi-Species Bird Soundscape Generation with Acoustic Patterning and 3D Spatialization Open
Generation of dynamic, scalable multi-species bird soundscapes remains a significant challenge in computer music and algorithmic sound design. Birdsongs involve rapid frequency-modulated chirps, complex amplitude envelopes, distinctive aco…
Multimodal Real-Time Anomaly Detection and Industrial Applications Open
This paper presents the design, implementation, and evolution of a comprehensive multimodal room-monitoring system that integrates synchronized video and audio processing for real-time activity recognition and anomaly detection. We describ…
Cassini Radio and Plasma Wave Science (RPWS) Low Rate Full Resolution Open
The Cassini Radio and Plasma Wave Science (RPWS) Low Rate Full Resolution Calibrated (RPWS_LOW_RATE_FULL) is a data set including all spectral density measurements acquired by the RPWS in units of electric or magnetic field spectral densit…
High Resolution Seismic Waveform Generation Using Denoising Diffusion Open
Accurate prediction and synthesis of seismic waveforms are crucial for seismic‐hazard assessment and earthquake‐resistant infrastructure design. Existing prediction methods, such as ground‐motion models and physics‐based wavefield simulati…
Blackbody Radiation from Energy Resonance Field Theory (ERFT): A Numerical Validation Pipeline Based on Full 3D Field Dynamics Open
Energy Resonance Field Theory (ERFT) proposes that gravitational, inertial and quantum-like effects emerge from coherent resonant modes of a continuous energy substrate. A critical observable predicted by ERFT is the existence of emergent …
Identification and Evaluation of Vibration Sources from Experiments on Laboratory Drilling Equipment Open
Rotary rock drilling generates vibration signals that capture the dynamic behavior of the drilling system, the interaction between the tool and the rock, and the progression of tool wear. These signals, traditionally considered undesirable…
Intelligent modeling of music teaching feedback using audio spectrogram features and attention-weighted emotional factors Open
HarmoniousMusicFeedback is an intelligent framework for modeling music teaching fe…
Biotuner: A python toolbox integrating music theory and signal processing for harmonic analysis of physiological and natural time series Open
Biotuner provides an extensible framework that unifies music-theoretic constructs with biosignal processing, enabling hypothesis-driven analyses for researchers and, in parallel, creative exploration of complex natural patterns for artists.
Epileptic Seizure Type Detection Model Using CNN-Derived Features and Random Forests Open
Epilepsy is a chronic neurological disorder characterized by recurrent and unpredictable seizures, affecting nearly 1% of the global population and profoundly impacting daily life and overall well-being. Accurate detection and classificati…
IMPLEMENTATION OF WAVELET SCALOGRAMS FOR AUDIO SIGNAL ANALYSIS Open
In the past few decades, wavelets have found an important place in many fields due to their nature and advantages compared to other algorithms. When the focus is on signal processing, wavelets can be used as an algorithm for de-no…
Enhancing Quranic Learning: A Multimodal Deep Learning Approach for Arabic Phoneme Recognition Open
Recent advances in multimodal deep learning have greatly enhanced the capability of systems for speech analysis and pronunciation assessment. Accurate pronunciation detection remains a key challenge in Arabic, particularly in the context o…
Teager-Kaiser Energy Methods For EEG Feature Extraction In Biomedical Applications Open
Electroencephalography (EEG) signals are inherently non-linear, non-stationary, and vulnerable to noise sources, making the extraction of discriminative features a long-standing challenge. In this work, we investigate the non-linear Teager…