Research


[Home] [Publications] [Research] [Teaching] [Short Bio] [Demo & Data] [Codes]


Funded Projects (Grants)

I appreciate the financial support for my research from the following bodies (since 2008): Engineering and Physical Sciences Research Council (EPSRC), Ministry of Defence (MoD), Defence Science and Technology Laboratory (Dstl), Department of Defence (DoD), Home Office (HO), Royal Academy of Engineering (RAENG), European Commission (EC), National Natural Science Foundation of China (NSFC), Shenzhen Science and Technology Innovation Council (SSTIC) of China, the University Research Support Fund (URSF), and the Ohio State University (OSU), as well as UK/EU industry partners including BBC, Atlas, Kaon, Huawei, Tencent, NPL, and Samsung. [Total award to Surrey where I am a Principal Investigator (PI) or Co-Investigator (CI): £13M+ (as PI £2.2M, as CI £14M+). Total grant award portfolio as PI/CI: £30M+]

  1. 04/2022-04/2026, PI, "Uncertainty modelling and quantification for heterogeneous sensor/effector networks", SAAB (Surrey investigators: Wenwu Wang (PI), Pei Xiao (CI)) Project partner: SAAB.

  2. 04/2022-04/2026, PI, "Cooperative sensor fusion and management for distributed sensor swarms", SAAB (Surrey investigators: Wenwu Wang (PI), Pei Xiao (CI)) Project partner: SAAB.

  3. 01/2022-01/2026, CI, "Differentiable particle filters for data-driven sequential inference", EPSRC & NPL (iCASE). (Surrey investigators: Yunpeng Li (PI), Wenwu Wang (CI)) Project partner: National Physical Laboratory.

  4. 07/2021-07/2026, CI, "BBC Prosperity Partnership: AI for Future Personalised Media Experiences", EPSRC (Prosperity Partnership scheme). (Surrey investigators: Adrian Hilton (PI, project lead), CIs: Philip Jackson, Armin Mustafa, Jean-Yves Guillemaut, Marco Volino, and Wenwu Wang; project manager: Elizabeth James) [The project is led by University of Surrey, jointly with BBC and University of Lancaster, with support from 10+ industrial partners.] (project website)

  5. 07/2021-07/2024, CI, "Uncertainty Quantification for Robust AI through Optimal Transport", University of Surrey (project-based Doctoral College studentship competition). (Surrey investigators: Yunpeng Li (PI), Wenwu Wang (CI))

  6. 02/2021-02/2023, PI, "Automated Captioning of Image and Audio for Visually and Hearing Impaired", British Council (Newton Institutional Links Award). (Surrey investigators: Wenwu Wang) [The project is led by University of Surrey, jointly with Izmir Katip Celebi University (IKCU) @ Turkey (Volkan Kilic).]

  7. 04/2021-04/2024, CI, "Multimodal video search by examples", EPSRC (responsive mode). (Surrey investigators: PI: Josef Kittler, CIs: Miroslaw Bober, Wenwu Wang and Mark Plumbley) [The project is led by Ulster University (Hui Wang), jointly with University of Surrey and University of Cambridge (Mark Gales).]

  8. 01/2021-10/2022, PI, "Acoustic surveillance", MoD (DASA call on countering drones). (Surrey investigators: Wenwu Wang) [The project is led by Airspeed Electronics.]

  9. 11/2020-11/2023, PI, "SIGNetS: signal and information gathering for networked surveillance", DoD & MoD (UDRC phase 3 call on the application theme Signal and Information Processing for Decentralized Intelligence, Surveillance, and Reconnaissance). (Surrey investigators: Wenwu Wang (PI) and Pei Xiao (CI)) [The project is a collaboration between University of Cambridge (Simon Godsill, project lead), University of Surrey, and University of Sheffield (Lyudmila Mihaylova).] (project website)

  10. 08/2020-08/2023, PI, "Particle flow PHD filtering for audio-visual multi-speaker speech tracking", Tencent (Rhino-Bird funding scheme). (Surrey investigator: Wenwu Wang) [Industry partner: Yong Xu @ Tencent AI Lab]

  11. 01/2020-10/2022, PI, "Deep embedding techniques for audio scene analysis and source separation", ASEM-DUO (Duo-India Professor Fellowship). [jointly with Dr Vipul Arora at Indian Institute of Technology (IIT) Kanpur] (Surrey investigator: Wenwu Wang).

  12. 03/2017-01/2023, CI, "Audio Visual Media Research", EPSRC (Platform grant). [Surrey investigators: PI: Adrian Hilton, CIs: Mark Plumbley, Josef Kittler, Wenwu Wang, John Collomosse, Philip Jackson, and Jean-Yves Guillemaut.]

    ------

  13. 08/2020-05/2021, PI, "Audio tagging for meta data generation of media for programme recommendation", EPSRC (impact acceleration account). (Surrey investigators: Wenwu Wang (PI) and Mark Plumbley (CI))

  14. 09/2018-03/2019, PI, "Array optimisation with sensor failure", EPSRC (impact acceleration account). [jointly with Kaon] [Surrey investigators: Wenwu Wang.]

  15. 02/2018-12/2018, PI, "Speech detection, separation and localisation with acoustic vector sensor", Huawei (HIRP). [Surrey investigators: Wenwu Wang.]

  16. 01/2017-01/2020, PI, "Improving the Robustness of UWAN Data Transmitting and Receiving Utilize Deep Learning and Statistical Model", NSFC (Youth Science Foundation). [Surrey investigators: Wenwu Wang.]

  17. 02/2016-02/2019, CI, "ACE-CReAte: Audio Commons", EC (Horizon 2020). [jointly with Universitat Pompeu Fabra, Queen Mary University of London, Jamendo SA, AudioGaming, and Waves Audio Ltd.] [Surrey investigators: PI: Mark Plumbley, CIs: Wenwu Wang, Tim Brookes, and David Plans.] (project website)

  18. 02/2016-02/2019, PI, "Marine environment surveillance technology based on underwater acoustic signal processing", SSTIC ('international collaboration' call). [jointly with Harbin Institute of Technology at Shenzhen] [Surrey investigators: Wenwu Wang.]

  19. 01/2016-01/2019, CI, "Making sense of sounds", EPSRC ('making sense from data' call). [jointly with Salford University] [Surrey investigators: Mark Plumbley (PI), CIs: Wenwu Wang, Philip Jackson and David Frohlich.] (project website)

  20. 01/2015-01/2019, CI, "MacSeNet: machine sensing training network", EC (Horizon 2020, Marie Curie Actions - Innovative Training Network). [jointly with INRIA (France), University of Edinburgh (UK), Technical University of Muenchen (Germany), EPFL (Switzerland), Computer Technology Institute (Greece), Institute of Telecommunications (Portugal), Tampere University of Technology (Finland), Fraunhofer IDMT (Germany), Cedar Audio Ltd (Cambridge, UK), Audio Analytic (Cambridge, UK), VisioSafe SA (Switzerland), and Noiseless Imaging Oy (Finland)] [Surrey investigators: Mark Plumbley (PI) and Wenwu Wang (CI)] (project website)

  21. 10/2014-10/2018, CI, "SpaRTaN: Sparse representation and compressed sensing training network", EC (FP7, Marie Curie Actions - Initial Training Network). [jointly with University of Edinburgh (UK), EPFL (Switzerland), Institute of Telecommunications (Portugal), INRIA (France), VisioSafe SA (Switzerland), Noiseless Imaging Oy (Finland), Tampere University of Technology (Finland), Cedar Audio Ltd (Cambridge, UK), and Fraunhofer IDMT (Germany)] [Surrey investigators: Mark Plumbley (PI) and Wenwu Wang (CI).] (project website)

  22. 01/2014-01/2019, CI, "S3A: future spatial audio for an immersive listener experience at home", EPSRC (programme grant). [jointly with University of Southampton, University of Salford, and BBC.] [Surrey investigators: PI: Adrian Hilton, CIs: Philip Jackson, Wenwu Wang, Tim Brookes, and Russell Mason.] (project website)

  23. 04/2013-06/2018, PI, "Signal processing solutions for a networked battlespace", EPSRC and Dstl ('signal processing' call). [jointly with Loughborough University, University of Strathclyde, and Cardiff University.] [Surrey investigators: Wenwu Wang (PI), Josef Kittler (CI), and Philip Jackson (CI)] (project website)

  24. 09/2015-06/2016, PI, "Array processing exploiting sparsity for submarine hull mounted arrays", Atlas Elektronik & MoD (MarCE scheme). [Surrey investigators: Wenwu Wang.]

  25. 03/2015-09/2015, PI, "Speech enhancement based on lip tracking", EPSRC (impact acceleration account). [jointly with SAMSUNG (UK)] [Surrey investigators: Wenwu Wang.]

  26. 10/2013-03/2014, PI, "Enhancing speech quality using lip tracking", SAMSUNG (industrial grant). [Surrey investigators: Wenwu Wang.]

  27. 12/2012-12/2013, PI, "Audio-visual cues based attention switching for machine listening", MILES and EPSRC (feasibility study). [jointly with School of Psychology and Department of Computing.] [Surrey investigators: PI: Wenwu Wang, CIs: Mandeep Dhami, Shujun Li, and Anthony Ho.]

  28. 11/2012-07/2013, PI, "Audio-visual blind source separation", NSFC (international collaboration scheme). [jointly with Nanchang University, China.] [Surrey investigators: Wenwu Wang.]

  29. 12/2011-03/2012, PI, "Enhancement of audio using video", HO (pathway to impact). [jointly with University of East Anglia.] [Surrey investigators: Wenwu Wang and Richard Bowden (CI).]

  30. 10/2010-10/2013, CI, "Audio and video based speech separation for multiple moving sources within a room environment", EPSRC (responsive mode). [jointly with Loughborough University.] [Surrey investigators: Josef Kittler (PI) and Wenwu Wang (CI).]

  31. 10/2009-10/2012, PI, "Multimodal blind source separation for robot audition", EPSRC and Dstl ('signal processing' call). [Surrey investigators: PI: Wenwu Wang, CIs: Josef Kittler and Philip Jackson.] (project website)

  32. 05/2008-06/2008, PI, "Convolutive non-negative sparse coding", RAENG (international travel grant). [Surrey investigators: Wang.]

  33. 02/2008-06/2008, PI, "Convolutive non-negative matrix factorization", URSF (small grant). [Surrey investigators: Wang.]

  34. 02/2008-03/2008, PI, "Computational audition", OSU (visiting scholarship). [Surrey investigators: Wang.] (Collaborator: Prof Deliang Wang)

Research Team

BSc Students

Research Collaborations

Current Opportunities

Postdoctoral Research Fellows

  1. Vacancy available: Research Fellow position in "Machine Learning for Audio Captioning" available. (Closing date for applications: 10/06/2021)

  2. Vacancy available: Research Fellow position in "Autonomous Sensor Management and Fusion for Distributed Sensor Networks" available. (Closing date for applications: 16/03/2021) CLOSED

  3. Vacancy available: Research Fellow position in "Acoustic Signal Processing and Machine Learning" available. (Closing date for applications: 03/01/2021) CLOSED

  4. Vacancy available: Research Fellow position in "Advanced Machine Learning for Audio Tagging" available. (Closing date for applications: 27/06/2020) CLOSED

  5. Vacancy available: Research Fellow position in "Deep Learning for Speech Source Separation" available. (Closing date for applications: 01/05/2018) CLOSED

  6. Vacancy available: Research Fellow position in "Machine Listening" available. (Closing date for applications: 01/05/2018) CLOSED

  7. Vacancy available: Research Fellow position in "Source Separation and Localisation" available. (Closing date for applications: 27/03/2017) CLOSED

  8. Vacancy available: Research Software Developer (Experimental Officer) position on "Making Sense of Sounds" available. (Closing date for applications: January 17, 2016) (CLOSED)

  9. Vacancy available: Research Fellow position in "Machine Listening" available. (Closing date for applications: December 13, 2015) (CLOSED)

  10. Vacancy available: Research Fellow position in "Semantic Audio-Visual Processing and Interaction" available. (Closing date for applications: December 13, 2015) (CLOSED)

  11. Vacancy available: Research Fellow position in "Audio-Visual Signal Processing" available. (Closing date for applications: January 27, 2015) (CLOSED)

  12. Vacancy available: Four research fellow positions in "Spatial Audio & Vision" available. (Closing date for applications: February 2, 2014) (CLOSED)

  13. Vacancy available: Research Fellow position in "Low-Complexity Source Separation Algorithms" (Fixed-term contract for three years. Closing date for applications: March 17, 2013) (CLOSED)

  14. Vacancy available: Research Fellow position in "Statistical anomaly detection" (Closing date for applications: February 28, 2013) (CLOSED)

  15. Vacancy available: Research Fellow position in "Audio and Video Based Speech Separation for Multiple Moving Sources Within a Room Environment" (Closing date for applications: August 9, 2010) (CLOSED)

Marie Curie Early Stage Researchers

  1. Vacancy available: MacSeNet: Marie Curie Early Stage Researcher position in "Audio Restoration and Inpainting" available. (Closing date for applications: April 30, 2015) (CLOSED)

  2. Vacancy available: MacSeNet: Marie Curie Early Stage Researcher position in "Sound Scene Analysis" available. (Closing date for applications: April 30, 2015) (CLOSED)

  3. Vacancy available: SpaRTaN: Marie Curie Early Stage Researcher position in "Sparse Time-Frequency Methods for Audio Source Separation" available. (Closing date for applications: January 25, 2015) (CLOSED)

  4. Vacancy available: SpaRTaN: Marie Curie Early Stage Researcher position in "Automatic Music Transcription Using Structured Sparsity" available. (Closing date for applications: January 25, 2015) (CLOSED)

PhD Students

If you wish to join CVSSP and work with me as a PhD student, please check the topic list and feel free to contact me if you have further inquiries. Students with a background in engineering, mathematics, computing, physics or other related subjects are all welcome to apply. Proposals for new project ideas that are not included in the list are also encouraged.

  1. Vacancy available: PhD Studentship in Uncertainty Quantification for Robust AI through Optimal Transport (Closing date for applications: May 19, 2021)

  2. Vacancy available: PhD Studentship in Multimodal BSS for Robot Audition (Closing date for applications: August 7, 2009) (CLOSED)

  3. Vacancy available: PhD Studentship in Signal Processing for Machine Audition and Perception (Closing date for applications: August 8, 2008) (CLOSED)

Visiting Scholar

I welcome collaborations nationally and internationally. Please do not hesitate to contact me if you are interested in joining CVSSP as a visiting scholar.

Current Topics

  1. Unsupervised learning techniques (including independent component analysis, independent vector analysis, latent variable analysis, sparse component analysis, non-negative matrix/tensor factorisation, low-rank representation, manifold learning, and subspace clustering)

  2. Supervised learning techniques (including deep learning, dictionary learning, multimodal learning, and learning with priors and signal properties)

  3. Computational auditory scene analysis (audio scene recognition, audio event detection, audio tagging, and audio captioning)

  4. Audio signal separation (convolutive audio source separation, underdetermined audio source separation including monaural source separation)

  5. Audio feature extraction and perception (including pitch detection, onset detection, rhythm detection, music transcription and low bit-rate audio coding)

  6. Sound source localisation (using audio, video, depth information, with particle filtering, PHD filtering, and/or particle flow filtering)

  7. Multimodal speech source separation (audio/visual source separation with model-based techniques such as Gaussian mixture models and learning-based methods such as audio-visual dictionary learning)

  8. Sparse representation and compressed sensing (synthesis model and analysis model based dictionary learning for sparse representation, with applications to audio source separation, speech enhancement, audio inpainting, and image enhancement)

  9. Cocktail party processing (using techniques such as independent component analysis, blind source separation, computational auditory scene analysis, sparse representation/dictionary learning, Gaussian mixture modelling and expectation maximisation, and multimodal fusion)

  10. Non-negative sparse coding of audio signals (including sparsity constrained non-negative matrix factorisation for audio analysis)

  11. 3D positional audio technology (including head-related transfer functions, binaural modelling, multiple loudspeaker panning, and room geometry estimation)

  12. Approximate joint diagonalization for source separation (including unitary or non-unitary constrained joint diagonalization approaches)

  13. Robust solutions for the permutation problem of frequency-domain independent component analysis (including approaches using filter constraints, statistical characteristics of signals, and beamforming)

  14. Convex and non-convex optimisation (gradient descent, Newton methods, interior point method, ADMM, etc.)

  15. Psychoacoustically motivated signal processing and machine learning methods (e.g. time-frequency masking, perceptually informed speech separation/enhancement, intelligibility adaptive speech separation algorithms)

... More information about my current research may be found in my publications.

Past Projects

From 1997 to 2007, I worked on a number of projects in both academic institutions and industrial companies, including:

  1. OpenSL ES (Led by Creative, jointly with other Khronos Group's member companies, such as Nokia, Samsung, Beatnik, Sonaptic, NVIDIA, Symbian, Texas Instruments, Ericsson, etc.)

  2. 3D Positional Audio for Mobile Devices (Sensaura, Creative Technology Ltd)

  3. Video Encoder/Decoder for SSEYO miniMIXA (Tao Group Ltd, jointly with Samsung Electronics Research Institute)

  4. Audio Distortion Generator for Intent Sound System (Tao Group Ltd)

  5. Floating/Fixed-Point Audio (Ogg Vorbis) Encoder/Decoder (Tao Group Ltd)

  6. Fixed-Point Pitch to MIDI Converter (Tao Group Ltd)

  7. Fixed-Point Sampling Rate Converter (Tao Group Ltd)

  8. Blind Signal Processing for Multichannel Speech Enhancement (initially with King's College London, then transferred to Cardiff University, jointly with Laboratory for Advanced Brain Signal Processing, RIKEN, Japan)

  9. Room Acoustics Parameters from Music (Cardiff University, jointly with Salford University and Manchester Metropolitan University)

  10. Video Assisted Speech Source Separation (Cardiff School of Engineering, jointly with Cardiff School of Computer Science)

  11. GPS/Celestial/Inertia Integrated Navigation System (Harbin Engineering University)

  12. Submarine Voyage Training Simulator (Harbin Engineering University, jointly with Qingdao Submarine Academy and Jiujiang Branch of China State Shipbuilding Corp.)

  13. Electronic Chart Display and Information System (Harbin Engineering University)

  14. SCM Communication System by Carrier Wave (Harbin Engineering University)

Academic Activities



Last Modified in May 2023
First created in May 2007