07. May 2025

“CoVox”: A Matched Vocal Dataset for Comparing Singing and Speech Styles

six picture tiles showing women singing or talking to each other or to a baby in their arms

From lullabies to opera arias: CoVox contains audio recordings from a total of 22 singers. (Collage: MPI for Empirical Aesthetics / L. Bittner)

The human voice is as diverse and individual as a fingerprint and can provide information about emotions, age, or health. In order to study vocal performances, researchers at the Max Planck Institute for Empirical Aesthetics (MPIEA) in Frankfurt am Main, Germany, have created a curated set of audio recordings with a total of 1,320 voice samples. The dataset is freely available and has been validated in a study recently published in Behavior Research Methods.

CoVox contains audio recordings of 22 Brazilian singers singing short melodies in three different styles—a lullaby, a pop song, and an opera aria. They also spoke the lyrics in two different styles: one as if they were addressing an adult and the other as if they were addressing a baby. This resulted in acoustic profiles that even laypeople could easily distinguish stylistically, the study found.

“What makes this dataset special is that it is fully matched: all the singing and speaking styles were performed by the same singers,” explains first author Camila Bruder of the MPIEA. “This consistency across performers makes it easier to compare the vocalizations across styles, making CoVox a controlled and directly comparable dataset.”

The recordings are available for download here and can be used under the Creative Commons license, which requires that the author of the dataset is credited, that the use is for non-commercial purposes only, and that any modifications are shared under the same license terms.

The dataset can be used as a source of experimental stimulus material for researchers to use in their own studies, but also as a subject of study in its own right, for example for comparisons between speech and singing,concludes Pauline Larrouy-Maestri, senior author at the MPIEA.

 

Original publication:

Bruder, C., & Larrouy-Maestri, P. (2025). CoVox: A Dataset of Contrasting Vocalizations. Behavior Research Methods57, 142. https://doi.org/10.3758/s13428-025-02664-9

 

Contact:

Camila Bruder

Pauline Larrouy‑Maestri

Contact

Ina Wittmann

Ina Wittmann

Press & Public Relations

Press Officer

+49 69 8300479-653

E-Mail

Ingrid Banciu

Ingrid Banciu

Press & Public Relations

Press Officer

+49 69 8300479-651

E-Mail

Sara Bertsche

Scientific Services

Press and Public Relations

+49 69 8300479-652

E-Mail

Keyvan Sarkhosh

Dr. Keyvan Sarkhosh

Press & Public Relations

Research Coordinator / Head of Press and Public Relations

+49 69 8300479-650

E-Mail