Fifth Australian International Conference on
Speech Science & Technology
Perth, 6-8 December, 1994
Page numbers refer to nominal page numbers assigned to each paper for purposes of citation.
Pages | Authors | Title | |
---|---|---|---|
2--7 | Yaxin Zhang. Roberto Togneri, Chris deSilva. Mike Alder | Optimization Of Phoneme-Based VQ Codebook In A Dhmm System | |
8--13 | Lunji Qin, Haiyun Yang and Soo Ngee Koh | Estimation Of Continuous Fundamental Frequency Of Speech Signals | |
14--19 | Andrew Luk, C.P. Cheung, S.H. Leung, and W.H. Lau | Cantonese Phonemes Recognition Via The Gated Neural Network |
Pages | Authors | Title | |
---|---|---|---|
22--27 | Jeffery Pittam | The Measurement Of Voice | |
28--33 | Andrew Butcher | On The Phonetics Of Small Vowel Systems: Evidence From Australian Languages | |
34--39 | Kuniko Kakita | Inter-Speaker Interaction Of The Duration Of Sentences And Intersentence Interval S |
Pages | Authors | Title | |
---|---|---|---|
40--45 | A. J. Fisher and S. Sridharan | Speech Enhancement For Forensic And Telecommunication Applications | |
46--50 | K. J. Popel and R. E. Bogner | Blind Separation Of Speech Signals | |
51--56 | Jie Huang and Noboru Ohnishi | Voice Separation Based On Multi-Channel Correlation And Components Tracing | |
57--62 | Yuchang Cao, Sridha Sridharan, Miles P. Moody | Multi-Channel Speech Signal Separation By Eigendecomposition And Its Application To Co-Talker Interference Removal | |
63--68 | Michael. S. Scordilis and Stuart Adams | Experiments In Multi-Microphone Speech Enhancement For Recognition |
Pages | Authors | Title | |
---|---|---|---|
70--75 | CHRISTINE M. KITAMURA | Infant Preferences For Age-Related Infant-Directed Speech: The Salience Of Vocal Affect | |
76--80 | L. Penny, A. Russell and C. Pemberton | Some Speech And Acoustic Measures Of The Aging Voice | |
81--86 | J.P. Scanlan | The Transformation Of Bird Sounds Into 'Speech' | |
87--91 | L. Penny and M. Carmody | The Acoustic Correlates Of Heightened Emotion: The Making Of Marriage Vows | |
92--97 | Duncan Markham | Is Foreign Accent Visible? |
Pages | Authors | Title | |
---|---|---|---|
100--105 | J.S. Pan, F. R. McInnes and M. A. Jack | Improvements In Extended Partial Distortion Search And Partial Distortion Search Algorithms VQ Search | |
106--111 | J. S. Pan, F. R. McInnes and M. A. Jack | Comparison Of Fast VQ Training Algorithms | |
112--117 | Jinho Choi | On Mse Of Celp Coder | |
118--123 | H.R. Sadegh Mohammadi and W.H. Holmes | Fine-Coarse Split Vector Quantization: An Efficient Method For Spectral Coding | |
124--129 | Haiyun Yang, Soo-Ngee Koh, Sivaprakassaipillai P | Enhancement Of Improved Multi-Band Excitation (IMBE) Using A Novel Method To Encode Spectral Amplitudes |
Pages | Authors | Title | |
---|---|---|---|
132--137 | Phil Rose | Any Advance On Eleven? Linguistic Tonetic Contrasts In A Bidialectal Thai Speaker | |
138--143 | Phil Rose | Wenzhou Tonal Acoustics - Depressor And Register Effects In Chinese Tonology | |
144--149 | Heather B. King | The Interrogative Intonation Of Dyirbal | |
150--155 | Yuancheng Zheng, Harald Trost, Ernst Buchberger, and Johannes Matiasek | The Intonational Model Used For German Text-To-Speech Generation | |
156--161 | Sandra Madureira | Pitch Patterns In Brazilian Portuguese: An Acoustic-Phonetic Analysis |
Pages | Authors | Title | |
---|---|---|---|
164--169 | S. Boland, M. Deriche, and S. Sridharan | Low Bit Rate Speech And Music Coding Using The Wavelet Transform | |
170--175 | W.N. Farrell and W.G. Cowley | A Rate 3/4 Tcm Decoder For Line Spectral Pairs Using Map Information | |
176--181 | J. Leis, S. Sridharan and W. Millan | Secure Speech Coding For Voice Messaging Applications | |
182--187 | Chenthurvasan Duraiappan and Yuliang Zheng | Improving Speech Security And Authentication In Mobile Communications | |
188--193 | Ira A. Gerson, Mark A. Jasiuk, Joseph M. Nowack, and Eric H. Winter | Speech And Channel Coding For The Half-Rate Gsm Channel |
Pages | Authors | Title | |
---|---|---|---|
196--201 | K.M. Hird | The Function Of Declination In Spontaneous Speech | |
202--208 | Hisham Darjazini and Dr Jo Tibbitts | The Construction Of Phonemic Knowledge Using Clustering Methodology | |
209--214 | J.Bruce Millar and Dave Davies | The Andor Interface To The Australian National Database Of Spoken Language | |
215--220 | Jonathan Harrington and Lydia K.H. So | Some Design Criteria In Segmenting And Labelling A Database Of Spoken Cantonese |
Pages | Authors | Title | |
---|---|---|---|
222--227 | Myléne Pijpers, Michael D. Alder and Roberto Togneri | Dimension Reduction Of Acoustic Vowel Data | |
228--233 | Fikret S Gurgen, Ting Fan, Julie Vonwiller | On The Analysis Of Phoneme-Based Features For Gender Identification With Neural Networks | |
234--237 | T. Schurer | Comparing Different Feature Extraction Methods For Telephone Speech Recognition Based On Hmm'S | |
238--243 | T. Matsuoka, N. Hayakawa, Y. Yashiba, Y. Ishida, T. Honda and Y. Ogawa | Pitch Estimation Using Discrete Analytic Signals |
Pages | Authors | Title | |
---|---|---|---|
244--248 | Munehiro Namba and Yoshihisa Ishida | Design And Implementation Using Neural Networks And Its Application To Hearing Aid | |
249--254 | Hiroyuki KAMATA, Hiroyuki OKA and Yoshihisa ISHIDA | Analysis And Synthesis Of Human Voice Considering The Nonstationary Based On The Glottis Open And Close Characteristics | |
255--260 | Xue YANG, J. Bruce Millar and Iain Macleod | On The Separation Of Speech Signal Variances From Two Sources | |
261--267 | Richard E Favero | Comparison Of Perceptual Scaling Of Wavelets For Speech Recognition |
Pages | Authors | Title | |
---|---|---|---|
268--273 | S. Ong and P. Castellano | Spectral Patterns And Speaker Identification Asymmetry | |
274--279 | SH Luo and R. W. King | Using Speech Signals To Improve Visual Facial Image Reconstruction: An Rnn Approach To Explore The Mutual Information | |
280--284 | R.H. Withnell & R.A. Wilde | Preliminary Report: Early Latency Auditory Evoked Potentials In Infants With Down Syndrome |
Pages | Authors | Title | |
---|---|---|---|
285--288 | Anne Cutler | HOW HUMAN SPEECH RECOGNITION IS AFFECTED BY PHONOLOGICAL DIVERSITY AMONG LANGUAGES |
Pages | Authors | Title | |
---|---|---|---|
325--330 | Christine Kitamura & Denis Burnham | Pitch & Communicative Intent In Infant-Directed Speech: Longitudinal Data | |
331--336 | S. McLeod. J. van Doorn, and V. Reed | Homonyms And Cluster Reduction In The Normal Development Of Children'S Speech | |
337--342 | P.F. McCormack & T. Knighton | Gender Differences In The Speech Patterns Of Two And A Half Year Old Children | |
343--348 | Christine Kitamura & Denis Burnham | Infant Preferences For Infant-Directed Speech: Is Vocal Affect More Salient Than Pitch'? |
Pages | Authors | Title | |
---|---|---|---|
290--295 | Simon Fox and Peter Tischer | Exact Sound Compression With Optimal Linear Predictors | |
296--301 | Andrew Hunt and Richard Favero | Using Principal Component Analysis With Wavelets In Speech Recognition | |
302--307 | Chee Wee Loke, Roberto Togneri | A Geometric Interpretation Of Hidden Markov Model |
Pages | Authors | Title | |
---|---|---|---|
310--315 | P.F. McCormack, J. C. Ingram | Tempo And The Rhythm Rule | |
316--321 | David Deterding | The Rhythm Of Singapore English | |
322--327 | J. Wang | Syllable Duration In Mandarin |
Pages | Authors | Title | |
---|---|---|---|
330--335 | Mechtild Tronnier | Tracing Nasality With The Help Of The Spectrum Of A Nasal Signal | |
336--341 | Richard E Favero | Compound Wavelets And Speech Recognition | |
342--347 | Simon Hawkins, Iain MacLeod, and Bruce Millar | Modelling Individual Speaker Characteristics By Describing A Speaker'S Vowel Distribution In Articulatory, Cepstral And Formant Space | |
348--353 | Simon Hawkins, Iain Macleod, and Bruce Millar | An Unsupervised Algorithm For The Extraction Of Formant-Like Features From Lpc-Cepstral Space | |
354--359 | Frantz Clermont and Parham Mokhtari | Frequency-Band Specification In Cepstral Distance Computation |
Pages | Authors | Title | |
---|---|---|---|
362--367 | Anne Cutler, James McQueen, Harald Baayen and Hens Drexler | Words Within Words In A Real-Speech Corpus | |
368--373 | Clive Cooper and Frantz Clermont | An Investigation Of The Speaker Factor In Vowel Nuclei | |
374--380 | Michael Ingleby, Wiebke Brockhaus and Carl Chalfont | Robust Techniques For Recognition Of New Knowledge-Based Speech Primitives | |
381--386 | Andrew Hunt | Two Linear Models Relating Acoustic Prosodics And Syntax | |
387--392 | Shuping Ran, Phil Rose, J.Bruce Millar and Iain Macleod | Automatic Vowel Quality Description Using A Cardinal Vowel Reference Model |
Pages | Authors | Title | |
---|---|---|---|
394--399 | D Hawthorn, C White | A Data-Driven Speech System | |
400--405 | P. Castellano and S. Sridharan | Speaker Identification With Projection Networks | |
406--410 | Malcolm B. Jones | Real-Time Speech Enhancement Using Median Filters | |
411--416 | A. Satriawan and J.B. Millar | Speaker Change Detection | |
417--422 | A. R. Kian Aleolfazlian & Brian L. Karlsen | The Cocktail Party Listener |
Page numbers refer to nominal page numbers assigned to each paper for purposes of citation.
Pages | Authors | Title | |
---|---|---|---|
424--429 | Robert H. Mannell | The Prediction Of "Perceptual Distance" From Spectral Distance Measures Based Upon Auditory And Non-Auditory Models Of Intensity Scaling | |
430--435 | U. Thein-Tun and D. Burnham | The Nature Of Information Processing In Speech Perception | |
436--441 | Jialong He, Li Liu and Gunther Palm | Perception Of Stop Consonants In Vcv Utterances Reconstructed From Partial Fourier Transform Information | |
442--447 | Li Liu, Jialong He and Gunther Palm | Perception Of Stop Consonants With Conflicting Phase And Magnitude | |
448--453 | Jordi Robert-Ribes, Jen-Luc Schwartz, Pierre Escudier | Audio-Visual Recognition Of Speech Units: A Tentative Functional Model Compatible With Psychological Data |
Pages | Authors | Title | |
---|---|---|---|
456--461 | P. Castellano and S. Sridharan | A Two Stage Fuzzy Decision Classifier For Speaker Identification | |
462--467 | N. Kasabov, C.Watson, S. Sinclair, R. Kilgour | Integrating Neural Networks And Fuzzy Systems For Speech Recognition | |
468--472 | Richard F. Favero | Comparison Of Mother Wavelets For Speech Recognition | |
473--478 | David B. Grayden and Michael S. Scordilis | A Hierarchical Approach To Phoneme Recognition Of Fluent Speech | |
479--484 | A. Samouelian | Knowledge Based Approach To Speech Recognition |
Pages | Authors | Title | |
---|---|---|---|
486--491 | Sameer Singh | Linguistic Computing In Speech And Language Disorders | |
492--497 | C. McKilligan, J. van Doorn & S. Pitt | The Intelligibility Of Speech In Cerebral Palsy: The Effects Of Manipulating The Acoustic Speech Signal | |
498--503 | P.J. Blamey, M.L. Grogan and M.B. Shields | Using An Automatic Word-Tagger To Analyse The Spoken Language Of Children With Impaired Hearing | |
504--509 | B.M. Chen, D.J. Calder and G. Mann | Computer-Based Multimedia Speech Training Tool For Dyspraxic Clients |
Pages | Authors | Title | |
---|---|---|---|
512--517 | Tetsuya Hoya, Hiroyuki Kamata and Yoshihisa Ishida | Spoken Digit Recognition Using Neural Network$ Trainee By Incremental Learning | |
518--521 | Kiyoshi Kondou, Hiroyuki Kamata and Yoshihisa Ishida | Spoken Japanese Digits Recognition System Using Lvq | |
522--527 | Danqing Zhang and J.Bruce Millar | Digit-Specific Feature Extraction For Multi-Speaker Isolated Digit Recognition Using Neural Networks | |
528--533 | F. Béchet, H. Meloni, P. Gilles | Knowledge Based Lexical Filtering: The Lexical Module Of The Spex System |
Pages | Authors | Title | |
---|---|---|---|
534--539 | Fikret S. Gurgen and H. C. Choi | On The Frame-Based And Segment-Based Nonlinear Spectral Transformation For Speaker Adaptation | |
540--545 | H.C. Choi and RW. King | A Two-Stage Spectral Transformation Approach To Fast Speaker Adaptation | |
546--550 | D. Cole, M. Moody and S. Sridharan | Measuring Intelligibility Of Reverberant Speech With And Without Enhancement | |
551--556 | Jianwei Miao | Vector Quantization Of Dct Components For Speech Coding |
Pages | Authors | Title | |
---|---|---|---|
558--563 | NAGAI Akito, ISHIKAWA Yasushi and NAKAJIMA Kunio | Concept-Driven Semantic Interpretation For Robust Spontaneous Speech Understanding | |
564--569 | Yaxin Zhang, Chee Wee Loke, Roberto Togneri, Mike Alder | A Comparison Of Pbdhmm And Chmm For Isolated Word Recognition | |
570--575 | C.C. Fung, C. Romeo and A. Gregory | Development Of A Microprocessor-Based Speech Recognition System For Remotely Operated Underwater Vehicle |
Pages | Authors | Title | |
---|---|---|---|
576--578 | Professor Robert Linggard | Speech Science And Technology - Review And Perspective |
Pages | Authors | Title | |
---|---|---|---|
581--586 | Caroline Henton | Techniques For Synthesizing Visible, Emotional Speech | |
587--592 | Sang-Hun Kim, Jung-Chul Lee | Korean Text-To-Speech System Using Time Domain-Pitch Synchronous Overlap And Add Method | |
593--598 | Yasushi Ishikawa and Kunio Nakajima | Speech Synthesis By Rule Using Synthesis Units Considering Prosodic Features |
Pages | Authors | Title | |
---|---|---|---|
600--605 | C.J. James, M.F. Cheesman, L. Cornelisse and L.T. Miller. | Response Times To Sentence Verification Tasks (SVTS) As A Measure Of Effort In Speech Perception | |
606--611 | Alain Marchal and Sophie Lapierre | "Can We Learn Something From Nonsense Words?" | |
612--619 | Christophe Vescovi, Eric Castelli | Gestural Supervisor For The Vocal Cords Of A Speaking Machine |
Pages | Authors | Title | |
---|---|---|---|
620--625 | K. C. Scott, D.S. Kagels, S.H. Watson, H. Rom, J.R. Wright, M. Lee, K.J. Hussey | Synthesis Of Speaker Facial Movement To Match Selected Speech Sequences | |
626--631 | Catherine I. Watson | The Visual Display Test: A Test To Assess The Usefulness Of A Visual Speech Aid | |
632--636 | K.M.Knill and S.J.Young | Keyword Training Using A Single Spoken Example For Applications In Audio Document Retrieval | |
637--642 | Joon Hyung Ryoo, Katunobu Itou, Satoru Hayamizu and Kazuyo Tanaka | Korean Speech Dialog System For Hotel Reservation | |
643--648 | Florence Sédes, Nadine Vigouroux, Philippe Truillet, Bernard Orloia | Hyperaudio : Vocal Navigation Strategies In A Hypermedia Environment |
Pages | Authors | Title | |
---|---|---|---|
650--655 | Katsumasa Shimizu | F0 In Phonation Types Of Initial-Stops | |
656--661 | Janet Fletcher, Jonathan Harrington and John Hajek | Phonemic Vowel Length And Prosody In Australian English | |
662--667 | J. Hajek | Phonological Length And Phonetic Duration In Bolognese: Are They Related? | |
668--673 | Corinne Roberts | Speech Rate Effects On Duration: An Articulatory Analysis | |
674--679 | Stefanie Jannedy | Prosodic And Segmental Influences On High Vowel Devoicing In Turkish |
Pages | Authors | Title | |
---|---|---|---|
682--687 | Jianming Song | Enhancement Of Hmm Through Discriminative Analysis | |
688--693 | G. Platt and M.D. Alder | A Dynamic Causal Filter Approach To Speech Trajectory Segmentation | |
694--699 | Hoi-Rin Kim, Kyu-Woong Hwang, Nam-Yong Han, and Young-Mok Ahn | Korean Continuous Speech Recognition System Using Context-Dependent Phone Schmms | |
700--705 | Andrew Hunt | Introducing Prosodic Constraints To Stochastic Language Modelling | |
706--712 | Shuping Ran, Bruce Millar, William Laverty, lain Macleod, Michael Wagner and Xiaoyuan Zhu | Speaker Recognition Using Continuous Ergodic Hmms |
Pages | Authors | Title | |
---|---|---|---|
714--719 | Mary O'Kane, P.E.Kenne, Hamish Pearcy, Tim Morgan, Gail Ransom and Kathryn Devoy | On The Feasibility Of Automatic Punctuation Of Transcribed Speech Without Prosody Or Parsing | |
720--725 | P.E. Kenne, M.J. O'Kane and H. Pearcy | Some Experiments Involving The Annotation Of A Large Speech And Natural Language Database | |
725--730 | Simon Hawkins, Iain Macleod and Bruce Millar | An Ab Initio Analysis Of Relationships Between Cepstral And Formant Spaces | |
731--736 | K.L. Jenkin and M.S. Scordilis | Automatic Methods Of Syllable Stress Classification In Continuous Speech | |
737--742 | Cioni Lorenzo | A Simple Program For The Visualisation Of F0 |
Pages | Authors | Title | |
---|---|---|---|
744--749 | J.Bruce Millar, Fangxin Chen, Iain Macleod, Shuping Ran, Hong Tang, Michael Wagner and Xiaoyuan Zhu. | Overview Of Speaker Verification Studies Towards Technology For Robust User-Conscious Secure Transactions | |
750--755 | Iain Macleod, Fangxin Chen, Bruce Millar and William Laverty | Optimal Cohort Design In Vq-Distortion Based Text-Independent Speaker Verification | |
756--761 | Xiaoyuan Zhu, Bruce Millar, Iain Macleod and Michael Wagner | Speaker Verification: Beyond The Absolute Threshold | |
762--767 | Shupinq Ran, William Laverty, Bruce Millar, Iain Macleod and Michael Wagner | Estimation Of False Acceptance Rate In Speaker Verification | |
768--773 | Hong Tang, Xiaoyuan Zhu, Bruce Millar, Iain Macleod and Michael Wagner | Robust Speaker Verification In Noisy Environments |
Pages | Authors | Title | |
---|---|---|---|
776--781 | Graeme K. Yates | Dynamic Range Compression In The Cochlea: Experiments And Models | |
782--787 | Michael Oerlemans and Peter Blamey | Multisensory Speech Perception: Integration Of Speech Information | |
788--793 | Peter J Blamey and Elvira S Parisi | Pitch And Vowel Perception In Cochlear Implant Users | |
794--799 | Ambikairajah, E., McDonagh, B. | An Active Model Of The Auditory Periphery With Realistic Temporal And Spectral Characteristics | |
800--805 | P.F. McCormack, J.C. Ingram | Speech Motor Control In Ataxic Dysarthria |
Pages | Authors | Title | |
---|---|---|---|
808--813 | P.E. Kenne, M.J. O'Kane and H. Pearcy | An Australian Speech Database Derived From Court Recordings | |
814--819 | Dong K. Kim | Automatically Assisted Annotation Of The Australian National Speech Database | |
820--825 | Yaxin Zhang, Mylene Pijpers, Roberto Togneri, Mike Alder | Cdigits: A Large Isolated English Digit Database | |
826--831 | M.Bijankhan, J.Sheikhzadegan, M.R.Roohani, Y.Samareh, K.Lucas, M.Tebyani | Farsdat - The Speech Database Of Farsi Spoken Language |
Pages | Authors | Title | |
---|---|---|---|
834--839 | Wenxian Li, Yiqing Zu, Chorkin Chan | A Chinese Speech Database (Putonghua Corpus) | |
840--845 | R.E.E.ROBINSON | Synthesising Facial Movement: Data Base Design | |
846--849 | Hitoshi Ihara, Hiroyuki Kamata and Yoshihisa Ishida | Speaker Identification Using Neural Networks | |
850--855 | J.Bruce Millar, Fangxin Chen and Michael Wagner. | The Efficacy Of Cohort Normalisation In A Speaker Verification Task Under Different Types Of Speech Signal Variance |
Pages | Authors | Title | |
---|---|---|---|
858--863 | Cioni Lorenzo | A Data Base For Speech Signal Processing | |
864--869 | D. Farrokhi, R. Togneri, Y. Zhang, and Y. Attikiouzel | Real Time Voice Processing (Voice/Speaker Recognition) | |
870--875 | Fumitake SUGANO, Tomoyuki MIZUTANI, Ayano SASAKI, Takefumi KITAYAMA, Hiroyuki KAMATA and Yoshihisa ISHIDA | Speech Training System For Hearing Impaired Children Using Technology Of Voice Recognition |