The principal objective of the conference is to bring together researchers addressing perceptually motivated speech and audio processing tasks with the tools of statistical signal processing and machine learning. The themes of the conference are: Statistical models for speech and audio processing motivated by human perception Developing the commonalities between speech recognition and synthesis to provide richer and more sophisticated models for speech Adaptive learning approaches to speech and audio signal processing and their incorporation into statistical models