A speech-controlled environmental control system for people with severe dysarthria

Mark S Hawley; Pam Enderby; Phil Green; Stuart Cunningham; Simon Brownsell; James Carmichael; Mark Parker; Athanassios Hatzis; Peter O'Neill; Rebecca Palmer

doi:10.1016/j.medengphy.2006.06.009

Med Eng Phys. 2007 Jun;29(5):586-93. doi: 10.1016/j.medengphy.2006.06.009. Epub 2006 Oct 17.

Authors

Mark S Hawley¹, Pam Enderby, Phil Green, Stuart Cunningham, Simon Brownsell, James Carmichael, Mark Parker, Athanassios Hatzis, Peter O'Neill, Rebecca Palmer

Affiliation

¹ Department of Medical Physics and Clinical Engineering, Barnsley Hospital NHS Foundation Trust, UK. mark.hawley@nhs.net

PMID: 17049905

Abstract

Automatic speech recognition (ASR) can provide a rapid means of controlling electronic assistive technology. Off-the-shelf ASR systems function poorly for users with severe dysarthria because of the increased variability of their articulations. We have developed a limited-vocabulary, speaker-dependent speech recognition application that has greater tolerance to variability of speech, coupled with a computerised training package that helps dysarthric speakers improve the consistency of their vocalisations and provides more data for recogniser training. These applications, and their implementation as the interface for a speech-controlled environmental control system (ECS), are described. The results of field trials to evaluate the training program and the speech-controlled ECS are presented. The user-training phase increased the recognition rate from 88.5% to 95.4% (p<0.001). Recognition rates were good for people with even the most severe dysarthria in everyday usage in the home (mean word recognition rate 86.9%). Speech-controlled ECS were less accurate than switch-scanning systems (mean task completion accuracy 78.6% versus 94.8%) but were faster to use, even taking into account the need to repeat unsuccessful operations (mean task completion time 7.7 s versus 16.9 s, p<0.001). It is concluded that a speech-controlled ECS is a viable alternative to switch-scanning systems for some people with severe dysarthria and would lead, in many cases, to more efficient control of the home.

Publication types

Evaluation Study
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Artificial Intelligence
Communication Devices for People with Disabilities*
Dysarthria / rehabilitation*
Environment, Controlled*
Humans
Pattern Recognition, Automated / methods*
Software Design
Sound Spectrography / methods*
Speech Recognition Software*
User-Computer Interface*