Speech Recognition in Adverse Conditions

Speech Recognition in Adverse Conditions
Author: Sven Mattys
Publisher: Psychology Press
Total Pages: 420
Release: 2013-12-19
Genre: Psychology
ISBN: 1317836804


Download Speech Recognition in Adverse Conditions Book in PDF, Epub and Kindle

Speech recognition in ‘adverse conditions’ has been a familiar area of research in computer science, engineering, and hearing sciences for several decades. In contrast, most psycholinguistic theories of speech recognition are built upon evidence gathered from tasks performed by healthy listeners on carefully recorded speech, in a quiet environment, and under conditions of undivided attention. Building upon the momentum initiated by the Psycholinguistic Approaches to Speech Recognition in Adverse Conditions workshop held in Bristol, UK, in 2010, the aim of this volume is to promote a multi-disciplinary, yet unified approach to the perceptual, cognitive, and neuro-physiological mechanisms underpinning the recognition of degraded speech, variable speech, speech experienced under cognitive load, and speech experienced by theoretically relevant populations. This collection opens with a review of the literature and a formal classification of adverse conditions. The research articles then highlight those adverse conditions with the greatest potential for constraining theory, showing that some speech phenomena often believed to be immutable can be affected by noise, surface variations, or attentional set in ways that will force researchers to rethink their theory. This volume is essential for those interested in speech recognition outside laboratory constraints.

Speech Recognition in Adverse Conditions

Speech Recognition in Adverse Conditions
Author: Sven Mattys
Publisher: Psychology Press
Total Pages: 326
Release: 2013-12-19
Genre: Psychology
ISBN: 1317836812


Download Speech Recognition in Adverse Conditions Book in PDF, Epub and Kindle

Speech recognition in ‘adverse conditions’ has been a familiar area of research in computer science, engineering, and hearing sciences for several decades. In contrast, most psycholinguistic theories of speech recognition are built upon evidence gathered from tasks performed by healthy listeners on carefully recorded speech, in a quiet environment, and under conditions of undivided attention. Building upon the momentum initiated by the Psycholinguistic Approaches to Speech Recognition in Adverse Conditions workshop held in Bristol, UK, in 2010, the aim of this volume is to promote a multi-disciplinary, yet unified approach to the perceptual, cognitive, and neuro-physiological mechanisms underpinning the recognition of degraded speech, variable speech, speech experienced under cognitive load, and speech experienced by theoretically relevant populations. This collection opens with a review of the literature and a formal classification of adverse conditions. The research articles then highlight those adverse conditions with the greatest potential for constraining theory, showing that some speech phenomena often believed to be immutable can be affected by noise, surface variations, or attentional set in ways that will force researchers to rethink their theory. This volume is essential for those interested in speech recognition outside laboratory constraints.

The Cognitive and Neural Organisation of Speech Processing

The Cognitive and Neural Organisation of Speech Processing
Author: Patti Adank
Publisher: Frontiers Media SA
Total Pages: 148
Release: 2016-03-18
Genre: Neurosciences
ISBN: 2889197751


Download The Cognitive and Neural Organisation of Speech Processing Book in PDF, Epub and Kindle

Speech production and perception are two of the most complex actions humans perform. The processing of speech is studied across various fields and using a wide variety of research approaches. These fields include, but are not limited to, (socio)linguistics, phonetics, cognitive psychology, neurophysiology, and cognitive neuroscience. Research approaches range from behavioural studies to neuroimaging techniques such as Magnetoencephalography, electroencephalography (MEG/EEG) and functional Magnetic Resonance Imaging (fMRI), as well as neurophysiological approaches, such as the recording of Motor Evoked Potentials (MEPs), and Transcranial Magnetic Stimulation (TMS). Each of these approaches provides valuable information about specific aspects of speech processing. Behavioural testing can inform about the nature of the cognitive processes involved in speech processing, neuroimaging methods show where (fMRI and MEG) in the brain these processes take place and/or elucidate on the time-course of activation of these brain areas (EEG and MEG), while neurophysiological methods (MEPs and TMS) can assess critical involvement of brain regions in the cognitive process. Yet, what is currently unclear is how speech researchers can combine methods such that a convergent approach adds to theory/model formulation, above and beyond the contribution of individual component methods? We expect that such combinations of approaches will significantly forward theoretical development in the field. The present research topic comprise a collection of manuscripts discussing the cognitive and neural organisation of speech processing, including speech production and perception at the level of individual speech sounds, syllables, words, and sentences. Our goal was to use findings from a variety of disciplines, perspectives, and approaches to gain a more complete picture of the organisation of speech processing. The contributions are grouped around the following five main themes: 1) Spoken language comprehension under difficult listening conditions; 2) Sub-lexical processing; 3) Sensorimotor processing of speech; 4) Speech production. The contributions used a variety of research approaches, including behavioural experiments, fMRI, EEG, MEG, and TMS. Twelve of the 14 contributions were on speech perception processing, and the remaining two examined speech production. This Research Topic thus displays a wide variety of topics and research methods and this comprehensive approach allows an integrative understanding of currently available evidence as well as the identification of concrete venues for future research.

Speech Processing in the Auditory System

Speech Processing in the Auditory System
Author: Steven Greenberg
Publisher: Springer Science & Business Media
Total Pages: 487
Release: 2006-05-09
Genre: Science
ISBN: 0387215751


Download Speech Processing in the Auditory System Book in PDF, Epub and Kindle

Although speech is the primary behavioral medium by which humans communicate, its auditory basis is poorly understood, having profound implications on efforts to ameliorate the behavioral consequences of hearing impairment and on the development of robust algorithms for computer speech recognition. In this volume, the authors provide an up-to-date synthesis of recent research in the area of speech processing in the auditory system, bringing together a diverse range of scientists to present the subject from an interdisciplinary perspective. Of particular concern is the ability to understand speech in uncertain, potentially adverse acoustic environments, currently the bane of both hearing aid and speech recognition technology. There is increasing evidence that the perceptual stability characteristic of speech understanding is due, at least in part, to elegant transformations of the acoustic signal performed by auditory mechanisms. As a comprehensive review of speech's auditory basis, this book will interest physiologists, anatomists, psychologists, phoneticians, computer scientists, biomedical and electrical engineers, and clinicians.

The Role of Working Memory and Executive Function in Communication under Adverse Conditions

The Role of Working Memory and Executive Function in Communication under Adverse Conditions
Author: Mary Rudner
Publisher: Frontiers Media SA
Total Pages: 274
Release: 2016-06-20
Genre: Neurosciences. Biological psychiatry. Neuropsychiatry
ISBN: 2889198618


Download The Role of Working Memory and Executive Function in Communication under Adverse Conditions Book in PDF, Epub and Kindle

Communication is vital for social participation. However, communication often takes place under suboptimal conditions. This makes communication harder and less reliable, leading at worst to social isolation. In order to promote participation, it is necessary to understand the mechanisms underlying communication in different situations. Human communication is often speech based, either oral or written, but may also involve gesture, either accompanying speech or in the form of sign language. For communication to be achieved, a signal generated by one person has to be perceived by another person, attended to, comprehended and responded to. This process may be hindered by adverse conditions including factors that may be internal to the sender (e.g. incomplete or idiosyncratic language production), occur during transmission (e.g. background noise or signal processing) or be internal to the receiver (e.g. poor grasp of the language or sensory impairment). The extent to which these factors interact to generate adverse conditions may differ across the lifespan. Recent work has shown that successful speech communication under adverse conditions is associated with good cognitive capacity including efficient working memory and executive abilities such as updating and inhibition. Further, frontoparietal networks associated with working memory and executive function have been shown to be activated to a greater degree when it is harder to achieve speech comprehension. To date, less work has focused on sign language communication under adverse conditions or the role of gestures accompanying speech communication under adverse conditions. It has been proposed that the role of working memory in communication under such conditions is to keep fragments of an incomplete signal in mind, updating them as appropriate and inhibiting irrelevant information, until an adequate match can be achieved with lexical and semantic representations held in long term memory. Recent models of working memory highlight an episodic buffer whose role is the multimodal integration of information from the senses and long term memory. It is likely that the episodic buffer plays a key role in communication under adverse conditions. The aim of this research topic is to draw together multiple perspectives on communication under adverse conditions including empirical and theoretical approaches. This will facilitate a scientific exchange among individual scientists and groups studying different aspects of communication under adverse conditions and/or the role of cognition in communication. As such, this topic belongs firmly within the field of Cognitive Hearing Science. Exchange of ideas among scientists with different perspectives on these issues will allow researchers to identify and highlight the way in which different internal and external factors interact to make communication in different modalities more or less successful across the lifespan. Such exchange is the forerunner of broader dissemination of results which ultimately, may make it possible to take measures to reduce adverse conditions, thus facilitating communication. Such measures might be implemented in relation to the built environment, the design of hearing aids and public awareness.

Intelligibility, Oral Communication, and the Teaching of Pronunciation

Intelligibility, Oral Communication, and the Teaching of Pronunciation
Author: John M. Levis
Publisher: Cambridge University Press
Total Pages: 319
Release: 2018-10-04
Genre: Foreign Language Study
ISBN: 1108416624


Download Intelligibility, Oral Communication, and the Teaching of Pronunciation Book in PDF, Epub and Kindle

An intelligibility-based approach to teaching that presents pronunciation as critical, yet neglected, in communicative language teaching.