ASPIRE Corpus | Audiovisual SPeech in Real noisy Environments

ASPIRE Corpus

Audiovisual SPeech in Real noisy Environments
Know more Sample Data

What is ASPIRE?

ASPIRE is a a first of its kind, audiovisual speech corpus recorded in real noisy environment (such as cafe, restaurants) which can be used to support reliable evaluation of multi-modal Speech Filtering technologies. This dataset follows the same sentence format as the audiovisual Grid corpus.

Some samples of the data

Acknowledgements

This research was funded by the UK Engineering and Physical Sciences Research Council (EPSRC project AV-COGHEAR, EP/M026981/1)

Download

The data can be downloaded from .

Citing the corpus

  @article{gogate2020cochleanet,
  title={Cochleanet: A robust language-independent audio-visual model for speech enhancement},
  author={Gogate, Mandar and Dashtipour, Kia and Adeel, Ahsan and Hussain, Amir},
  journal={Information Fusion},
  year={2020},
  publisher={Elsevier}
}

Gogate, Mandar, Kia Dashtipour, Ahsan Adeel, and Amir Hussain. "Cochleanet: A robust language-independent audio-visual model for speech enhancement." Information Fusion (2020).