What is ASPIRE?

ASPIRE is a a first of its kind, audiovisual speech corpus recorded in real noisy environment (such as cafe, restaurants) which can be used to support reliable evaluation of multi-modal Speech Filtering technologies. This dataset follows the same sentence format as the audiovisual Grid corpus.

Some samples of the data


Acknowledgements

This research was funded by the UK Engineering and Physical Sciences Research Council (EPSRC project AV-COGHEAR, EP/M026981/1)

Download

The data can be downloaded from DOI.

Citing the corpus

  @article{gogate2020cochleanet,
  title={Cochleanet: A robust language-independent audio-visual model for speech enhancement},
  author={Gogate, Mandar and Dashtipour, Kia and Adeel, Ahsan and Hussain, Amir},
  journal={Information Fusion},
  year={2020},
  publisher={Elsevier}
}

Gogate, Mandar, Kia Dashtipour, Ahsan Adeel, and Amir Hussain. "Cochleanet: A robust language-independent audio-visual model for speech enhancement." Information Fusion (2020).