All Stories

  1. RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection
  2. From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems
  3. Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss
  4. Position Paper: Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks
  5. Acoustic Prompt Tuning: Empowering Large Language Models With Audition Capabilities
  6. LC-Protonets: Multi-Label Few-Shot Learning for World Music Audio Tagging
  7. Velocity2DMs: A Contextual Modeling Approach to Dynamics Marking Prediction in Piano Performance
  8. Globally, songs and instrumental melodies are slower and higher and use more stable pitches than speech: A Registered Report
  9. A Data-Driven Analysis of Robust Automatic Piano Transcription
  10. ATGNN: Audio Tagging Graph Neural Network
  11. PiJAMA: Piano Jazz with Automatic MIDI Annotations
  12. Few-shot Class-incremental Audio Classification Using Dynamically Expanded Classifier with Self-attention Modified Prototypes
  13. Exploring Transformer’s Potential on Automatic Piano Transcription
  14. Improving Lyrics Alignment Through Joint Pitch Detection
  15. Learning Music Audio Representations Via Weak Language Supervision
  16. Automatic Quality Assessment of Digitized and Restored Sound Archives
  17. Measuring national mood with music: using machine learning to construct a measure of national valence from audio data
  18. Adaptive Scattering Transforms for Playing Technique Recognition
  19. Comparison of Feature Extraction Methods for Sound-Based Classification of Honey Bee Activity
  20. Detecting Cover Songs with Pitch Class Key-Invariant Networks
  21. Humanities and engineering perspectives on music transcription
  22. An Evaluation of Data Augmentation Methods for Sound Scene Geotagging
  23. Vocal Harmony Separation Using Time-Domain Neural Networks
  24. Violinist identification based on vibrato features
  25. MusCaps: Generating Captions for Music Audio
  26. Revisiting the Onsets and Frames Model with Additive Attention
  27. More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations
  28. Joint Multi-Pitch Detection and Score Transcription for Polyphonic Piano Music
  29. Prototypical Networks for Domain Adaptation in Acoustic Scene Classification
  30. The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy
  31. Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation
  32. Development of a Speech Quality Database Under Uncontrolled Conditions
  33. Memory Controlled Sequential Self Attention for Sound Recognition
  34. Deep generative variational autoencoding for replay spoof detection in automatic speaker verification
  35. Reliable Local Explanations for Machine Listening
  36. A Study on the Transferability of Adversarial Attacks in Sound Event Classification
  37. A-CRNN: A Domain Adaptation Model for Sound Event Detection
  38. Audio Impairment Recognition using a Correlation-Based Feature Representation
  39. Modeling Plate and Spring Reverberation Using A DSP-Informed Deep Neural Network
  40. Playing Technique Recognition by Joint Time–Frequency Scattering
  41. Deep Learning for Black-Box Modeling of Audio Effects
  42. Learning and Evaluation Methodologies for Polyphonic Music Sequence Prediction With LSTMs
  43. Dataset Artefacts in Anti-Spoofing Systems: A Case Study on the ASVspoof 2017 Benchmark
  44. City Classification from Multiple Real-World Sound Scenes
  45. Investigating Kernel Shapes and Skip Connections for Deep Learning-Based Harmonic-Percussive Separation
  46. Polyphonic Sound Event and Sound Activity Detection: A Multi-Task Approach
  47. Ensemble Models for Spoofing Detection in Automatic Speaker Verification
  48. Towards Joint Sound Scene and Polyphonic Sound Event Recognition
  49. Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF
  50. Optimal neural network feature selection for spatial-temporal forecasting
  51. Adapting the Quality of Experience Framework for Audio Archive Evaluation
  52. Audio-based Identification of Beehive States
  53. Automatic Transcription of Diatonic Harmonica Recordings
  54. SubSpectralNet – Using Sub-spectrogram Based Convolutional Neural Networks for Acoustic Scene Classification
  55. Automatic Music Transcription: An Overview
  56. Analysing The Predictions Of a CNN-Based Replay Spoofing Detection System
  57. ANALYSING REPLAY SPOOFING COUNTERMEASURE PERFORMANCE UNDER VARIED CONDITIONS
  58. Polyphonic Music Sequence Transduction with Meter-Constrained LSTM Networks
  59. Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization
  60. A supervised classification approach for note tracking in polyphonic piano transcription
  61. Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge
  62. A review of manual and computational approaches for the study of world music corpora
  63. A computational study on outliers in world music
  64. Automatic Transcription of Polyphonic Vocal Music
  65. Sound event detection in synthetic audio: Analysis of the dcase 2016 task results
  66. Approaches to Complex Sound Scene Analysis
  67. Polyphonic Sound Event Tracking Using Linear Dynamical Systems
  68. On-Bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts
  69. On the memory properties of recurrent neural models
  70. The Digital Music Lab
  71. A Morphological Model for Simulating Acoustic Scenes and Its Application to Sound Event Detection
  72. Speaker recognition with hybrid features from a deep belief network
  73. Digital music lab: A framework for analysing big music data
  74. An End-to-End Neural Network for Polyphonic Piano Music Transcription
  75. Detection of overlapping acoustic events using a temporally-constrained probabilistic model
  76. Automatic transcription of Turkish microtonal music
  77. Detection and Classification of Acoustic Scenes and Events
  78. Alternate level clustering for drum transcription
  79. A hybrid recurrent neural network for music transcription
  80. The temperament police
  81. Incremental Dataset Definition for Large Scale Musicological Research
  82. Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition
  83. Improving instrument recognition in polyphonic music through system integration
  84. Automatic transcription of pitched and unpitched sounds from polyphonic music
  85. Big Data for Musicology
  86. Detection and classification of acoustic scenes and events: An IEEE AASP challenge
  87. Automatic music transcription: challenges and future directions
  88. Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model
  89. Temporally-Constrained Convolutive Probabilistic Latent Component Analysis for Multi-pitch Detection
  90. A temporally-constrained convolutive probabilistic model for pitch detection
  91. Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription
  92. Polyphonic music transcription using note onset and offset detection
  93. Improving Music Genre Classification Using Automatically Induced Harmony Rules
  94. Auditory Spectrum-Based Pitched Instrument Onset Detection
  95. Non-Negative Tensor Factorization Applied to Music Genre Classification
  96. Computationally Efficient and Robust BIC-Based Speaker Segmentation
  97. A neural network approach to audio-assisted movie dialogue detection
  98. Systematic comparison of BIC-based speaker segmentation systems
  99. Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification
  100. Automatic Speaker Segmentation using Multiple Features and Distance Measures: A Comparison of Three Approaches