A psychoacoustic approach to dysphonic voice quality perception

$240,071R01FY2014DCNIH

University Of Georgia, Athens GA

Investigators

Linked publications & trials

Paper 37019804 Paper 36907680 Paper 36516473 Paper 36260821 Paper 35867607 Paper 31932189 Paper 29162356 Paper 28318967 Paper 26775221 Paper 26723336 Paper 25944288 Paper 24116533 Paper 22423721 Paper 22361106 Paper 22215034 Paper 21428523 Paper 19896328 Paper 19185451

Abstract

1 Voice disorders often lead to changes in voice quality noticed by patients, clinicians, and conversation 2 partners, and improvement in voice quality is a critical outcome of treatment. However, we have limited 3 knowledge of how people perceive voice quality. This has restricted our ability to accurately quantify or 4 describe changes in quality, such as due to a disease or when resulting from treatment. This continuation 5 project combines concepts and techniques from voice science, speech science, hearing science, and 6 engineering to address this problem. In general, the research proceeds by first obtaining high-precision 7 measures of voice quality perception in the laboratory. These data are then used to develop mathematical 8 models of voice quality perception that accurately reflect listeners' data. To obtain a close match between 9 human judgments of voice quality and model output, models of auditory processing are used to obtain an 10 internal representation of the voice acoustic signal. Specific measures are then captured from this internal 11 auditory representation and used to model the perception of voice quality. Methods for obtaining perceptual 12 judgments of single voice quality dimensions, the transformation of the acoustic signal to its internal 13 representation, and the general form of the voice quality models have been completed for two different voice 14 quality dimensions (breathiness and roughness) using simple stimuli (vowel /a/ as in hot). In the proposed 15 work, these approaches will be developed further to establish a framework for comprehensive understanding of 16 voice quality perception and to enable translation to clinical practice. These approaches will be (1) used to 17 account for multiple, co-occurring voice quality dimensions; (2) applied to more natural and complex stimuli 18 (multiple vowels and syllables); and (3) leveraged to understand other voice quality dimensions (strain). (4) To 19 increase model accuracy and to expand their applicability to severely dysphonic voices (e.g. Type II and Type 20 III), methods to estimate the pitch and pitch strength of dysphonic voices will be developed and incorporated 21 into relevant models. (4) To enhance the measurement schemes in a manner that improves clinical utility, 22 model output will be transformed to a scale that is intuitively related to the perceptual magnitude of each voice 23 quality dimension. This will create a set of intuitive voice quality metrics that are easy to use and interpret. (5) 24 Finally, the feasibility of using these models and metrics in regular clinical assessment will be evaluated 25 through an initial clinical study.

View original record on NIH RePORTER →