All Stories

  1. Working with AI Sound: Exploring the Future of Workplace AI Sound Technologies
  2. Acoustic Scene Classification Across Cities and Devices via Feature Disentanglement
  3. Attention-Based End-to-End Differentiable Particle Filter for Audio Speaker Tracking
  4. Separation of the aortic and pulmonary components of the second heart sound via alternating optimization
  5. Sound Event Detection: A tutorial
  6. Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation
  7. CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification
  8. Sparse Analysis Model Based Dictionary Learning for Signal Declipping
  9. Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification
  10. Learning With Out-of-Distribution Data for Audio Classification
  11. Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis
  12. PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition
  13. Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization
  14. Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data
  15. Sparse Recovery and Dictionary Learning From Nonlinear Compressive Measurements
  16. Weakly Labelled AudioSet Tagging With Attention Neural Networks
  17. Referenceless Performance Evaluation of Audio Source Separation using Deep Neural Networks
  18. Information-Theoretic Approaches to Neural Network Learning
  19. Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
  20. Acoustic Event Detection from Weakly Labeled Data Using Auditory Salience
  21. Acoustic Scene Generation with Conditional SampleRNN
  22. Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes
  23. Generalisation in Environmental Sound Classification: The ‘Making Sense of Sounds’ Data Set and Challenge
  24. Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering
  25. Sound Event Detection and Time–Frequency Segmentation from Weakly Labelled Data
  26. Musical Source Separation: An Introduction
  27. Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy
  28. Sound Event Localization and Detection Using CRNN on Pairs of Microphones
  29. Predicting the perceived level of reverberation using machine learning
  30. A Hierarchical Latent Mixture Model for Polyphonic Music Analysis
  31. Raw Multi-Channel Audio Source Separation using Multi-Resolution Convolutional Auto-Encoders
  32. A Contextual Study of Semantic Speech Editing in Radio Production
  33. A Demonstration of Hierarchical Structure Usage in Expressive Timing Analysis by Model Selection Tests
  34. PaperClip: A Digital Pen Interface for Semantic Speech Editing in Radio Production
  35. Inexact Proximal Operators for $\ell_{p}$-Quasinorm Minimization
  36. Orthogonality-Regularized Masked NMF for Learning on Weakly Labeled Audio Data
  37. A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data
  38. Audio Set Classification with Attention Model: A Probabilistic Perspective
  39. BSS Eval or PEASS? Predicting the Perception of Singing-Voice Separation
  40. Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network
  41. Synthesis of Images by Two-Stage Generative Adversarial Networks
  42. Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge
  43. Malicious User Detection Based on Low-Rank Matrix Completion in Wideband Spectrum Sensing
  44. Computational Analysis of Sound Scenes and Events
  45. Consistent Dictionary Learning for Signal Declipping
  46. Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks
  47. Latent Variable Analysis and Signal Separation
  48. Multi-Resolution Fully Convolutional Neural Networks for Monaural Audio Source Separation
  49. Supporting Audiography: Design of a System for Sentimental Sound Recording, Classification and Playback
  50. Single channel audio source separation using convolutional denoising autoencoders
  51. Graph-based clustering for identifying region of interest in eye tracker data analysis
  52. Binaural and log-power spectra features with deep neural networks for speech-noise separation
  53. Approaches to Complex Sound Scene Analysis
  54. Future Perspective
  55. Introduction to Sound Scene and Event Analysis
  56. Two-Stage Single-Channel Audio Source Separation Using Deep Neural Networks
  57. Using deep neural networks to estimate tongue movements from speech face motion
  58. Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging
  59. Learning the Mapping Function from Voltage Amplitudes to Sensor Positions in 3D-EMA Using Deep Neural Networks
  60. Automatic music transcription using low rank non-negative matrix decomposition
  61. Joint detection and classification convolutional neural network on weakly labelled bird audio detection
  62. Masked non-negative matrix factorization for bird detection using weakly labeled data
  63. Multivariate iterative hard thresholding for sparse decomposition with flexible sparsity patterns
  64. Polyphonic Sound Event Tracking Using Linear Dynamical Systems
  65. Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
  66. Convolutional gated recurrent neural network incorporating spatial features for audio tagging
  67. A greedy algorithm with learned statistics for sparse signal reconstruction
  68. A joint detection-classification model for audio tagging of weakly labelled data
  69. Assessment of musical noise using localization of isolated peaks in time-frequency domain
  70. Fast tagging of natural sounds using marginal co-regularization
  71. Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks
  72. Psychophysical Evaluation of Audio Source Separation Methods
  73. Automatic Environmental Sound Recognition: Performance Versus Computational Cost
  74. Combining Mask Estimates for Single Channel Audio Source Separation Using Deep Neural Networks
  75. Evaluation of audio source separation models using hypothesis-driven non-parametric statistical methods
  76. Wideband Spectrum Sensing on Real-Time Signals at Sub-Nyquist Sampling Rates in Single and Cooperative Multiple Nodes
  77. Detection of overlapping acoustic events using a temporally-constrained probabilistic model
  78. Non-Negative Group Sparsity with Subspace Note Modelling for Polyphonic Transcription
  79. Analysis SimCO Algorithms for Sparse Analysis Model Based Dictionary Learning
  80. The Clustering of Expressive Timing Within a Phrase in Classical Piano Performances by Gaussian Mixture Models
  81. Chime-home: A dataset for sound source recognition in a domestic environment
  82. Detection and Classification of Acoustic Scenes and Events
  83. Acoustic Scene Classification: Classifying environments from the sounds they produce
  84. Event-based Multitrack Alignment using a Probabilistic Framework
  85. A dynamic programming variant of non-negative matrix deconvolution for the transcription of struck string instruments
  86. Non-negative matrix factorisation incorporating greedy Hellinger sparse coding applied to polyphonic music transcription
  87. Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network
  88. Efficient compressive spectrum sensing algorithm for M2M devices
  89. Multichannel High-Resolution NMF for Modeling Convolutive Mixtures of Non-Stationary Signals in the Time-Frequency Domain
  90. Learning Incoherent Subspaces: Classification via Incoherent Dictionary Learning
  91. Large‐scale analysis of frequency modulation in birdsong data bases
  92. Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning
  93. Accounting for phase cancellations in non-negative matrix factorization using weighted distances
  94. Improving instrument recognition in polyphonic music through system integration
  95. Polyphonic piano transcription using non-negative Matrix Factorisation with group sparsity
  96. Score-Informed Source Separation for Musical Audio Recordings: An overview
  97. Best Practices for Scientific Computing
  98. Big Data for Musicology
  99. Dictionary learning via projected maximal exploration
  100. Learning overcomplete dictionaries with ℓ0-sparse Non-negative Matrix Factorisation
  101. Low-rank matrix completion based malicious user detection in cooperative spectrum sensing
  102. Detection and classification of acoustic scenes and events: An IEEE AASP challenge
  103. Multichannel HR-NMF for modelling convolutive mixtures of non-stationary signals in the time-frequency domain
  104. Learning incoherent subspaces for classification via supervised iterative projections and rotations
  105. Structured sparsity using backwards elimination for Automatic Music Transcription
  106. On Theorem 10 in “On Polar Polytopes and the Recovery of Sparse Representations” [Sep 07 3188-3195]
  107. Hearing the shape of a room
  108. Synchronizing Sequencing Software to a Live Drummer
  109. Automatic Music Transcription using row weighted decompositions
  110. Behavior of greedy sparse representation algorithms on nested supports
  111. Improved multiple birdsong tracking with distribution derivative method and Markov renewal process clustering
  112. Recognition of harmonic sounds in polyphonic audio using a missing feature approach
  113. Score informed audio source separation using constrained nonnegative matrix factorization and score synthesis
  114. Learning Incoherent Dictionaries for Sparse Approximation Using Iterative Projections and Rotations
  115. The Serendiptichord: Reflections on the Collaborative Design Process between Artist and Researcher
  116. Predictive Information in Gaussian Processes with Application to Music Analysis
  117. Using Oracle Analysis for Decomposition-Based Automatic Music Transcription
  118. A robust method for S1/S2 heart sounds detection without ECG reference based on music beat tracking
  119. Denoising and segmentation of the second heart sound using matching pursuit
  120. Cognitive music modelling: An information dynamics approach
  121. Analysis-based sparse reconstruction with synthesis-based solvers
  122. Audio Inpainting
  123. INK-SVD: Learning incoherent dictionaries for sparse representations
  124. Instrumentation-based music similarity using sparse representations
  125. Sound Software: Towards software reuse in audio and music research
  126. Structured sparsity for automatic music transcription
  127. A measure of statistical complexity based on predictive information with application to finite spin systems
  128. Dictionary Learning with Large Step Gradient Descent for Sparse Representations
  129. Group Polytope Faces Pursuit for Recovery of Block-Sparse Signals
  130. Performance Following: Real-Time Prediction of Musical Sequences Without a Score
  131. Reliability-Informed Beat Tracking of Musical Signals
  132. Learning Timbre Analogies from Unlabelled Data by Multivariate Tree Regression
  133. On the disjointess of sources in music using different time-frequency representations
  134. Onset Event Decoding Exploiting the Rhythmic Structure of Polyphonic Music
  135. Fast Dictionary Learning for Sparse Representations of Speech Signals
  136. Separating sources from sequentially acquired mixtures of heart signals
  137. A constrained matching pursuit approach to audio declipping
  138. Dictionary learning of convolved signals
  139. Sound Source Separation
  140. Measuring the Performance of Beat Tracking Algorithms Using a Beat Error Histogram
  141. Delayed Decision-making in Real-time Beatbox Percussion Classification
  142. Sparse Representations in Audio and Music: From Coding to Source Separation
  143. An L1 criterion for dictionary learning by subspace identification
  144. A Multichannel Spatial Compressed Sensing Approach for Direction of Arrival Estimation
  145. Gradient Polytope Faces Pursuit for large scale sparse recovery problems
  146. Non-negative mixtures
  147. Note onset detection using rhythmic structure
  148. Performance following: Tracking a performance without a score
  149. SMALLbox - An Evaluation Framework for Sparse Representations and Dictionary Learning Algorithms
  150. Evaluation of live human–computer music-making: Quantitative and qualitative approaches
  151. Towards a musical beat emphasis function
  152. Information dynamics: patterns of expectation and surprise in the perception of music
  153. Sparse reconstruction for compressed sensing using Stagewise Polytope Faces Pursuit
  154. Fast Multidimensional Entropy Estimation by $k$-d Partitioning
  155. Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation
  156. Information Dynamics and the Perception of Temporal Structure
  157. Using phase linearity in frequency-domain ICA to tackle the permutation problem
  158. Estimating Phase Linearity in the Frequency-Domain ICA Demixing Matrix
  159. Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation
  160. Efficient Bayesian inference for harmonic models via adaptive posterior factorization
  161. Audio analysis using sparse representations
  162. An adaptive stereo basis method for convolutive blind audio source separation
  163. Speech Separation Using an Adaptive Sparse Dictionary Algorithm
  164. An adaptive orthogonal sparsifying transform for speech signals
  165. Oracle estimation of adaptive cosine packet transforms for underdetermined audio source separation
  166. Theorems on Positive Data: On the Uniqueness of NMF
  167. On Polar Polytopes and the Recovery of Sparse Representations
  168. Audio source separation with a signal-adaptive local cosine transform
  169. Oracle estimators for the benchmarking of source separation algorithms
  170. Low Bit-Rate Object Coding of Musical Audio Using Bayesian Harmonic Models
  171. Context-Dependent Beat Tracking of Musical Audio
  172. B-Keeper
  173. Blind Source Separation using Space–Time Independent Component Analysis
  174. Flag Manifolds for Subspace ICA Problems
  175. Geometry and Manifolds for Independent Component Analysis
  176. Independent Component Analysis and Signal Separation
  177. On the Use of Entropy for Beat Tracking Evaluation
  178. Real-time beat-synchronous audio effects
  179. Information theory and sensory perception
  180. Fast Factorization-Based Inference for Bayesian Harmonic Models
  181. Sparse representations of polyphonic music
  182. Recovery of Sparse Representations by Polytope Faces Pursuit
  183. Riemannian Optimization Method on the Flag Manifold for Independent Subspace Analysis
  184. Riemannian Optimization Method on Generalized Flag Manifolds for Complex and Subspace ICA
  185. Single-Channel Mixture Decomposition Using Bayesian Harmonic Models
  186. Sparse Coding for Convolutive Blind Audio Source Separation
  187. Unsupervised Analysis of Polyphonic Music by Sparse Coding
  188. Geometrical methods for non-negative ICA: Manifolds, Lie groups and toral subalgebras
  189. Beat tracking with a two state model [music applications]
  190. Blind Separation of Positive Sources by Globally Convergent Gradient Search
  191. A "nonnegative PCA" algorithm for independent component analysis
  192. Application of Geometric Dependency Analysis to the Separation of Convolved Mixtures
  193. Lie Group Methods for Optimization with Orthogonality Constraints
  194. Optimization Using Fourier Expansion over a Geodesic for Non-negative ICA
  195. Algorithms for nonnegative independent component analysis
  196. Automatic Music Transcription and Audio Source Separation
  197. Conditions for nonnegative independent component analysis
  198. Do cortical maps adapt to optimize information density?
  199. Do cortical maps adapt to optimize information density?
  200. Maximizing information about a noisy signal with a single non-linear neuron
  201. Designing Neural Networks using a Genetic Rule-based System
  202. Unsupervised neural network learning procedures for feature extraction and classification
  203. Information processing in negative feedback neural networks
  204. Information processing in negative feedback neural networks
  205. Lyapunov functions for convergence of principal component algorithms
  206. Analysis of an Unsupervised Indirect Feedback Network
  207. Generation and adaptation of neural networks by evolutionary techniques (GANNET)
  208. Approximating Optimal Information Transmission using Local Hebbian Algorithms in a Double Feedback Loop
  209. Efficient information transfer and anti-Hebbian neural networks
  210. Information Theory and Neural Networks
  211. Direct Approaches to Improving the Robustness of Multilayer Neural Networks
  212. The effect of receptor signal-to-noise levels on optimal filtering in a sensory system
  213. Sensory adaptation: an information-theoretic viewpoint
  214. Musical audio analysis using sparse representations
  215. Audio Source Separation using Sparse Representations
  216. Probabilistic Modeling Paradigms for Audio Source Separation
  217. Natural Conjugate Gradient on Complex Flag Manifolds for Complex Independent Subspace Analysis
  218. Dictionary Learning for L1-Exact Sparse Coding
  219. The Role of High Frequencies in Convolutive Blind Source Separation of Speech Signals
  220. A prototype system for object coding of musical audio
  221. Identification of dental bacteria using statistical and neural approaches
  222. Information Density and Cortical Magnification Factors
  223. Communications and neural networks: theory and practice