Audio data compression affects acoustic indices and reduces detections of birds by human listening and automated recognisers

Alexander G. MacPhail, Daniel A. Yip, Elly C. Knight, Richard Hedley, Michelle Knaggs, Julia Shonfield, Emily Upham-Mills & Erin M. Bayne (2024). Audio data compression affects acoustic indices and reduces detections of birds by human listening and automated recognisers. Bioacoustics, Volume 33 (1): 74 -90
Abstract: 

Increasing popularity in passive acoustic monitoring and the ease with which researchers can accumulate large quantities of acoustic data has resulted in challenges for audio recording storage, archiving, and management. Reductions in file size can be achieved by lowering sample rate and compressing to different formats; however, how these processes affect audio data quality, and the resulting interpretation of wildlife data is not well understood. We investigated the effect of sampling rate and lossy compression of audio recordings to MP3 from their native WAV format on the performance of four commonly applied avian bioacoustic applications: community listening, distance estimation, automated recognition, and acoustic indices. Compression to MP3 decreased the number of detections, including a reduction in total abundance of individuals when transcribing audio files for community listening and lower precision and recall for automated recognisers. Sampling rate reduction introduced systematic bias to acoustic indices and had an influence on precision and recall for recognisers as well. We recommend against the use of MP3 compression to reduce file volume and suggest other lossless forms of audio compression where an exact copy of the original recording can be recovered.

Keywords: 

Acoustic monitoring, audio compression, bird song, data storage, detectability, recogniser

Categories: