Team

Principal Investigator

Professor Amir Hussain

Principal Investigator

Postdoctoral Scientist

Dr. Ahsan Adeel

Postdoctoral Scientist

Doctoral Scientist

Mandar Gogate

Doctoral Scientist


With acknowledgments to our collaborators: Prof. Roger Watt (University of Stirling), Dr. Jon Barker (University of Sheffield), Dr. Ricard Marxer (University of Sheffield), Ashraya Samba Shiva (University of Stirling), Dr. Andrew Abel (Xi'an Jiaotong-Liverpool University), William Whitmer (MRC/CSO Inst. Of Hearing Research), Dr. Peter Derleth (Sonova AG, Switzerland)

INTERACTIVE PROTOTYPE DEMO

Headphones are recommended for best experience (as hearing-aids are more or less like headphones).

Step 2A: Lip Reading and Tracking
Step 2B: Visual Feature Extraction
Step 2C: Noisy Audio Feature Extraction
Step 3: Context-Aware Multimodal Fusion Mountain View
Step 4: Audio-Visually Derived Deep Learning
for Speech Enhancement
Mountain View
Step 5: Enhanced Speech Video
Next-generation audio-visual hearing aid user: Relieved!
Conventional audio-only hearing-aid user: Annoyed!

Spectrogram and Time Domain Signal Comparison

  • Benchmark vs. Proposed

    Headphones are recommended for best experience (as hearing-aids are more or less like headphones).

    Benchmark: (Audio-only) Deep learning
    (IEEE Spectrum 2017)


    Noisy (SNR ~ -10dB) -> Recovered

    The Man Called the Police

    It’s Getting Cold in Here

    They Ate the Lemon Pie

    Proposed: Lip Reading Driven Deep Learning

    Noisy (SNR ~ -10dB)

    Recovered

    Hello, My Name is Rachel

    Noisy



    Bin green A 9 again (SNR ~ -12dB)

    Bin green A 9 again (SNR ~ -6dB)

    Bin green A 9 again (SNR ~ 0dB)

    Bin green A 9 again (SNR ~ 6dB)

    Bin green A 9 again (SNR ~ 12dB)

    SS



    Bin green A 9 again (SNR ~ -12dB)

    Bin green A 9 again (SNR ~ -6dB)

    Bin green A 9 again (SNR ~ 0dB)

    Bin green A 9 again (SNR ~ 6dB)

    Bin green A 9 again (SNR ~ 12dB)

    LMMSE



    Bin green A 9 again (SNR ~ -12dB)

    Bin green A 9 again (SNR ~ -6dB)

    Bin green A 9 again (SNR ~ 0dB)

    Bin green A 9 again (SNR ~ 6dB)

    Bin green A 9 again (SNR ~ 12dB)

    EVWF (proposed)



    Bin green A 9 again (SNR ~ -12dB)

    Bin green A 9 again (SNR ~ -6dB)

    Bin green A 9 again (SNR ~ 0dB)

    Bin green A 9 again (SNR ~ 6dB)

    Bin green A 9 again (SNR ~ 12dB)