Authors for the SST conference were given the choice of submitting a full paper or a one-page abstract. Each submission was reviewed by at least two referees. The papers accepted after full-paper review are listed below.
1-6 |
Joanne Arciuli, Louisa Slowiaczek |
What does Dichotic Listening Reveal about the Processing of Stress Typicality? |
7-12 |
Mark J. Jones, Kirsty McDougall |
A Comparative Acoustic Study of Australian English Fricated /t/: Assessing the Irish (English) Link |
13-18 |
Francis Nolan, Kirsty McDougall, Gea de Jong, Toby Hudson |
A Forensic Phonetic Study of 'Dynamic' Sources of Variability in Speech: The DyViS Project |
19-23 |
Eric Choi, Julien Epps |
A Novel Front-end Based on Variable Frame Rate Analysis and Mel-filterbank Output Compensation for Robust ASR |
24-29 |
Maria Paola Bissiri, Hartmut R. Pfitzinger, Hans G. Tillmann |
Lexical Stress Training of German Compounds for Italian Speakers by means of Resynthesis and Emphasis |
30-35 |
Shunichi Ishihara |
Kyoto Osaka Japanese Citation Tone Acoustics: A Linguistic Tonetic Study |
36-41 |
David Gunawan, Deep Sen |
Spectral Envelope Sensitivity of Musical Instruments |
42-47 |
Janet Fletcher, Deborah Loakes |
Patterns of rising and falling intonation in Australian English |
48-51 |
Zhibo Liu, Fang Zheng, Xiaojun Wu, Mingxing Xu |
Towards A New Implementation Approach for Rapid Development of Text-based Dialog Systems |
52-57 |
Bo Yin, Eliathamby Ambikairajah, Fang Chen |
Language-dependent Fusion for Language Identification |
58-63 |
Philip Rose |
Zooming-in on Oujiang Wu: tonal homogeneity and acoustic reconstruction in a small subgroup of Chinese dialects |
64-69 |
Philip Rose |
The Intrinsic Forensic Discriminatory Power of Diphthongs |
70-75 |
Elisabeth Zetterholm |
Same speaker - different voices. A study of one impersonator and some of his different imitations |
76-81 |
Kamil Wojcicki, Benjamin Shannon, Kuldip Paliwal |
Spectral Subtraction with Variance Reduced Noise Spectrum Estimates |
82-87 |
William Steed |
Phonation Type and Tone Sandhi as Evidence of Chinese Stress in Longquan Wu |
88-93 |
Aik Ming Toh, Roberto Togneri, Sven Nordholm |
Combining MLLR adaptation and feature extraction for robust speech recognition in reverberant environments |
94-99 |
Roberto Togneri, Aik Ming Toh, Sven Nordholm |
Evaluation and modification of cepstral moment normalization for speech recognition in additive babble ensemble |
100-105 |
Christa Lam, Christine Kitamura |
Developmental trends in infant preferences for affective intent in mothers’ speech |
106-111 |
Anne Cutler, Jeesun Kim, Takashi Otake |
On the Limits of L1 Influence on Non-L1 Listening: Evidence from Japanese Perception of Korean |
112-117 |
Yuko Kinoshita, Takashi Osanai |
Within Speaker variation in diphthongal dynamics: what can we compare? |
118-123 |
Kimiko Tsukada, Thu Nguyen, Rungpat Roengpitya |
Cross-language perception of word-final stops by native Vietnamese listeners: Preliminary results on the role of specific, non-native phonetic experience |
124-129 |
Christine Kitamura, Robin Panneton, Maria Diehl, Anna Notley |
Attuning to the native dialect: When more means less |
130-135 |
Sigrid Roehling, Catherine Watson, Bruce MacDonald |
Towards Expressive Speech Synthesis in English on a Robotic Platform |
136-141 |
Mohaddeseh Nosratighods, Eliathamby Ambikairajah, Julien Epps, Michael Carey |
A Novel Technique for the selection of speech segments for speaker verification |
142-147 |
Iris-Corinna Schwarz, Denis Burnham, Judith A. Bowey |
Phoneme Sensitivity and vocabulary size in 2 1/2- to 3-year-olds |
148-152 |
Tharmarajah Thiruvaran, Eliathamby Ambikairajah, Julien Epps |
Speaker Identification using FM Features |
153-158 |
Dogu Erdener, Denis Burnham |
Auditory-Visual Speech Perception in School and Preschool Children |
159-164 |
Xiaobo Lu, Peter Bier, William Thorpe |
A time-varying three-dimensional model of the vocal tract |
165-169 |
Cheolwoo Jo, Jianglin Wang |
Performance of Gaussian Mixture Models as a Classifier for Pathological Voice |
170-175 |
Akiko Kato, Felicity Cox |
Development of Japanese length contrast: A longitudinal study of L2 vowels produced by Australian learners of Japanese |
176-181 |
Mary Stevens, John Hajek |
Blocking of word-boundary consonant lengthening in Sienese Italian: some auditory and acoustic evidence |
182-186 |
Hywel Stoakes, Janet Fletcher, Andrew Butcher |
Articulatory variability of intervocalic stop articulation in Bininj Gun-Wok |
187-192 |
Thu Nguyen, John Ingram |
Reduplication and word stress in Vietnamese |
193-198 |
John Ingram, Thu Nguyen |
Stress, tone and word prosody in Vietnamese compounds |
199-204 |
Serajul Haque, Roberto Togneri, Anthony Zaknich |
Zero-Crossings with Adaptation for Automatic Speech Recognition |
205-210 |
Deborah Loakes |
Variation in Long-Term Fundamental Frequency: Measurements from Vocalic Segments in Twins' Speech |
211-215 |
Shuai Wang, Deep Sen, Wenliang Lu |
Subband Analysis of Time Delay Estimation in STFT Domain |
216-221 |
Simon Hawkins |
Formant Estimation using an Autoassociative Neural Network |
222-227 |
Michael Tyler |
French listeners can use stress to segment words in an artificial language |
228-232 |
Yao Wang, Jiong An, Teddy Surya Gunawan, Eliathamby Ambikairajah |
Speech Enhancement Using Temporal Masking in the FFT Domain |
233-238 |
Michael Wagner |
Speaker Verification Using the Shape of the Glottal Excitation Function for Vowels |
239-242 |
John Hajek, Tim Cummins |
A preliminary investigation of vowel lengthening in non-final position in Friulian |
243-247 |
Christopher Groot, Chris Davis |
Auditory-Visual Speech Recognition with Amplitude and Frequency Modulations |
248-252 |
Chris Davis, Amanda Sironic, Jeesun Kim |
Perceptual processing of audiovisual Lombard speech |
253-258 |
Thomas Millhouse, Frantz Clermont |
Perceptual characterisation of the singer's formant region: A preliminary study |
259-263 |
Teddy Surya Gunawan, Eliathamby Ambikairajah |
On the Use of Simultaneous and Temporal Masking for Noise Suppression in Cochlear Implant |
264-269 |
Gordon McIntyre, Roland Goecke |
Researching Emotions in Speech |
270-275 |
Simone Graetzer |
Consonantal coarticulation resistance in vowel-consonant-vowel sequences in two Australian languages |
276-280 |
Qingning Zeng, Waleed Abdulla |
Speech Enhancement by Multi-channel Crosstalk Resistant Adaptive Noise Cancellation and Post-filtering |
281-286 |
Girija Chetty, Michael Wagner |
UCBN: A new audio-visual broadcast news corpus for multimodal speaker verification studies |
489-493 |
Benedict Tan, Deep Sen |
The Use of the Attack Transient Envelope in Instrument Recognition |