[Home] [Publications] [Research] [Teaching] [Short Bio] [Demo & Data] [Codes]
Publications
Copyright Notice:
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
The following copyright applies to any articles on this page published by IEEE: "Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE."
Refereed International Journal Papers:
- H. Liu, X. Xu, Y. Yuan, M. Wu, W. Wang, M.D. Plumbley, "SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound," IEEE Journal on Selected Topics in Signal Processing, 2024. [PDF] (accepted)
- S. Atito, M. Awais, W. Wang, M.D. Plumbley, and J. Kittler, "ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification," IEEE/ACM Transactions on Audio Speech and Language Processing, 2024. [PDF] [code] (in press)
- X. Mei, C. Meng, H. Liu, Q. Kong, T. Ko, C. Zhao, M. D. Plumbley, Y. Zou, and W. Wang, "WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research," IEEE/ACM Transactions on Audio Speech and Language Processing, 2024. [PDF] [arXiv] [code]
- X. Mei, X. Liu, J. Sun, M. D. Plumbley, and W. Wang, "Towards Generating Diverse Audio Captions via Adversarial Training," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 32, pp. 3311-3323, 2024. [PDF] [arXiv] [code]
- Y. Zhang, R. Du, Z.-H. Tan, W. Wang, Z. Ma, "Generating accurate and diverse audio captions through variational autoencoder framework," IEEE Signal Processing Letters, 2024. [PDF] [code]
- F. Nazarieh, Z. Feng, M. Awais, W. Wang, J. Kittler, "A survey of cross-modal visual content generation," IEEE Transactions on Circuits and Systems for Video Technology, 2024. [PDF]
- S. Peng, W. Wang, Y. Chen, X. Zhong, and Q. Hu, "Regression-based Hyperparameter Learning for Support Vector Machine," IEEE Transactions on Neural Networks and Learning Systems, 2024. [PDF]
- Z. Liu, J. Yang, X. Zhong, W. Wang, H. Chen, and Y. Chang, "A novel composite graph neural network," IEEE Transactions on Neural Networks and Learning Systems, 2024. [PDF] [code]
- W. Ma, Y. Li, S. Lan, W. Wang, W. Huang, W. Zhu, "Semantic-aware normalizing flow with feature fusion for image anomaly detection," Neurocomputing, 2024. [PDF] [code]
- H. Liu , Y. Yuan , X. Liu , X. Mei , Q. Kong , Q. Tian , Y. Wang , W. Wang, Y. Wang , M. D. Plumbley, "AudioLDM 2: Learning holistic audio generation with self-supervised pretraining," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 32, pp. 2871-2883, 2024. [PDF] [code] (One of the Most Downloaded Papers in IEEE Explorer (Top Accessed Article) - check out [the list])
- Y. Hou, B. Kang, A. Mitchell, W. Wang, J. Kang, and D. Botteldooren, "Cooperative scene-event modelling for acoustic scene classificaiton," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 32, pp. 68-82, 2024. [PDF] (One of the Most Downloaded Papers in IEEE Explorer (Top Accessed Article) - check out [the list])
- Y. Liu, Y. Xu, P. Wu, and W. Wang, "Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking," IEEE Transactions on Multimedia, vol. 26, pp. 2544 - 2559, 2024. [PDF]
- D.M. Wang, S.L. Zhu, L.T. Lu, Y.Q. Han, W. Wang, Q.H. Zhou, "Event-triggered adaptive tracking control for stochastic nonlinear systems under predetermined finite-time performance," International Journal of Adaptive Control and Signal Processing, 2024;1-20. doi: 10.1002/acs.3812, 2024. [PDF]
- J. Zhao, Y. Xu, X. Qian, H. Liu, M. D. Plumbley, and W. Wang,"Attention-Based End-to-End Differentiable Particle Filter for Audio Speaker Tracking," IEEE Open Journal of Signal Processing, vol. 5, pp. 449-458, 2024. [PDF]
- K. Li, S. Yang, L. Zhao, and W. Wang, "Weakly labelled sound event detection with a capsule-transformer model," Digital Signal Processing, vol. 146, 104347, March 2024. [PDF]
- F. Zhan, W. Wang, Q. Chen, Y. Guo, L. He, and L. Wang, "Three-direction fusion for accurate volumetric liver and tumor segmentation," IEEE Journal of Biomedical and Health Informatics, vol. 28, no. 4, 2024. [PDF]
- B. Xie, J. Qi, S. Yang, G. Sun, Z. Feng, B. Yin, W. Wang, "Sea surface temperature and marine heat waves predictions in the South China Sea: A 3D-Unet deep learning model integrating multi-source data," Atmosphere, 15(1), 86, https://doi.org/10.3390/atmos15010086, 2024. [PDF]
- W. Yuan, S. Wang, J. Wang, M. Unoki, and W. Wang, "Unsupervised Deep Unfolded Representation Learning for Singing Voice Separation," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 31, pp. 3206 - 3220, 2023. [PDF]
- Y. Zhang, H. Yu, R. Du, Z.-H. Tan, W. Wang, Z. Ma, Y. Dong, "ACTUAL: audio captioning with caption feature space regularization," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 31, pp. 2643 - 2657, 2023. [PDF] [code]
- Y. Hou, S. Song, C. Yu, W. Wang, and D. Botteldooren, "Audio event-relational graph representation learning for acoustic scene classification," IEEE Signal Processing Letters, vol. 30, pp. 1382 - 1386, 2023. [PDF] [code]
- J. Guan, Y. Liu, Q. Kong, F. Xiao, Q. Zhu, J. Tian, and W. Wang, "Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection," EURASIP Journal on Audio Speech and Music Processing, 2023, 42 (2023). https://doi.org/10.1186/s13636-023-00308-4. [PDF]
- S. Goudarzi, S. A. Soleymani, W. Wang, P. Xiao, "UAV-enabled mobile edge computing for resource allocation using cooperative evolutionary computation," IEEE Transactions on Areospace and Electronic Systems, vol. 59, no. 5, pp. 5134 - 5147, 2023. [PDF]
- M. Tharmakulasingam, W. Wang, M. Kerby, R. La Ragione, and A. Fernando, "TransAMR: An Interpretable Transformer Model for Accurate Prediction of Antimicrobial Resistance Using Antibiotic Administration Data," IEEE Access, vol. 11, pp. 75337 - 75350, 2023. [PDF]
- L. Shi, X. Wang, L. Yu, W. Wang, Z. Wang, M. Iqbal, C. C. Tsimenidis, and S. Mumtaz, "A long-range aerial acoustic communication scheme," Physical Communication, vol. 60, 2023. [DOI: 10.1016/j.phycom.2023.102135] [PDF]
- J. Dong, K. Wu, C. Liu, X. Mei, and W. Wang, "Discriminative analysis dictionary learning with adaptively ordinal locality preserving," Neural Networks, vol. 165, pp. 298-309, 2023. [PDF] [code]
- Y. Li, Y. Sun, W. Wang, and S. M. Naqvi, "U-shaped transformer with frequency-band aware attention for speech enhancement," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 31, pp. 1511-1521, 2023. [PDF] [code]
- Y. Guo, T. Liu, X. Zhang, A. Wang, and W. Wang, "End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network," Knowledge-Based Systems, vol. 177, paper 110837, 2023. [PDF] [code]
- F. Xiao, J. Guan, Q. Zhu, and W. Wang, "Graph attention for automated audio captioning," IEEE Signal Processing Letters, vol. 30, pp. 413-417, 2023. [PDF] [code]
- C. Xue, X. Zhong, M. Cai, H. Chen, and W. Wang, "Audio-visual event localization by learning spatial and semantic co-attention", IEEE Transactions on Multimedia, vol. 25, pp. 418-429, 2023. [PDF] [code]
- X. Mei, X. Liu, M. Plumbley, and W. Wang, "Automated audio captioning: an overview of recent progress and new challenges", EURASIP Journal on Audio Speech and Music Processing, 2022 [PDF] (Featured Article)
- J. Dong, L. Yang, C. Liu, W. Cheng, and W. Wang, "Support vector machine embedding discriminative dictionary pair learning for pattern classification," Neural Networks, vol. 155, pp. 498-511, 2022. [PDF]
- K. SongGong, W. Wang, and H. Chen, "Acoustic source localization in the circular harmonic domain using deep learning architecture," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 30, pp. 2475-2491, 2022. [PDF]
- F. Xiao, J. Guan, H. Lan, Q. Zhu, and W. Wang, "Local information assisted attention-free decoder for audio captioning," IEEE Signal Processing Letters, vol. 29, pp. 1604-1608, 2022. [PDF] [code]
- H. Li, S. Yang, and W. Wang, "Improved capsule routing for weakly labelled sound event detection," EURASIP Journal on Audio Speech and Music Processing, 2022, 5 (2022). https://doi.org/10.1186/s13636-022-00239-6 [PDF]
- L. Dong, J. Qi, B. Yin, H. Zhi, D. Li, S. Yang, W. Wang, H. Cai, and B. Xie, "Reconstruction of Subsurface Salinity Structure in the South China Sea Using Satellite Observations: A LightGBM-Based Deep Forest Method," Remote Sensing, vol. 14, paper 3494, 2022. [PDF]
- A. Shilandari, H. Marvi, H. Khosravi, W. Wang, "Speech Emotion Recognition using Data Augmentation Method by Cycle-Generative Adversarial Networks," Signal, Image and Video Processing, 2022. https://doi.org/10.1007/s11760-022-02156-9 [PDF]
- J. Guan, J. Liu, P. Feng, J. Sun, and W. Wang, "Multi-scale deep neural network with two-stage loss for SAR target recognition with small training set", IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2022. [PDF] [code]
- T. Liu, W. Wang, X. Zhang, Y. Guo, "One to multiple mapping dual learning: Learning multiple signals from one mixture," Digital Signal Processing, vol. 129, paper 103686, 2022. [PDF]
- Y. Xian, Y. Sun, W. Wang, and S.M. Naqvi, "Convolutional fusion network for monaural speech enhancement", Neural Networks, vol. 143, pp. 97-107, 2021. [PDF]
- K. SongGong, H. Chen, and W. Wang, "Indoor multi-speaker localization based on Bayesian nonparametrics in the circular harmonic domain", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 29, no. 5, pp. 1864-1880, 2021. [PDF]
- W. Yuan, B. Dong, S. Wang, M. Unoki, and W. Wang, "Evolving multi-resolution pooling CNN for monaural singing voice separation", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 29, pp. 807-821, 2021. [PDF] [code] [demo]
- Y. Huang, C. Xue, W. Wang, Y. Zhang, and J. A. Chambers, "Adaptive recursive decentralized cooperative localization for multi-robot systems with time-varying measurement accuracy", IEEE Transactions on Instrumentation and Measurement, vol. 70, article 8501525, 2021. [PDF]
- J. Yang, W. Chen, X. Zhong, and W. Wang, "Multiple acoustic source localization in microphone array networks", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 29, pp. 334-347, 2021. [PDF]
- B. Li, L. Rencker, J. Dong, Y. Luo, M. Plumbley, and W. Wang, "Sparse analysis dictionary learning model based signal declipping", IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 1, pp. 25-36, 2021. [PDF] [code]
- Y. Xian, Y. Sun, W. Wang, and M. Naqvi, "Multi-scale Feature Recalibration Network Architecture for End-to-End Single channel Speech Enhancement", IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 1, pp. 143-155, 2021. [PDF]
- Y. Guo, J. Chen, X. Ren, A. Wang, and W. Wang, "Joint raindrop and haze removal from a single image", IEEE Transactions on Image Processing, vol. 29, pp. 9508-9519, 2020. [PDF]
- Q. Kong, Y. Xu, W. Wang, and M. D. Plumbley, "Sound event detection of weakly Labelled data with CNN-Transformer and automatic threshold optimization", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 28, no. 8, pp. 2450-2460, 2020. [PDF] [code] (One of the Most Downloaded Papers in IEEE Explorer (Top Accessed Article))
- H. Wang, Y. Zou, D. Chong, and W. Wang, "Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network", IEEE Signal Processing Letters, 2020. [PDF]
- Q. Kong, Y. Cao, T. Iqbal, Y. Wang, W. Wang, and M. D. Plumbley, "PANNs: large-scale pretrained audio neural networks for audio pattern recognition", IEEE/ACM Transactions on Audio Speech and Language Processing, 2020. [PDF] [code] (One of the Most Downloaded Papers in IEEE Explorer (Top Accessed Article)) (IEEE Signal Processing Society Young Author Best Paper Award)
- D. Xu, J. Guan, P. Feng, and W. Wang, "Association Loss for Visual Object Detection", IEEE Signal Processing Letters, vol. 27, no. 7, pp. 1435-1439, 2020. [PDF]
- S. Zhu, Y. Zhao, Y. Zhang, Q. Li, W. Wang, and S. Yang, "Short-term traffic flow prediction with wavelet and multi-dimensional Taylor network model", IEEE Transactions on Intelligent Transportation Systems, 2020. [PDF]
- Y. Liu, V. Kilic, J. Guan, and W. Wang, "Audio-visual particle flow SMC-PHD filtering for multi-speaker tracking", IEEE Transactions on Multimedia, vol. 22, no. 4, pp. 934-948, 2020. [PDF]
- M. He, Y. Nian, L. Xu, L. Qiao and W. Wang, "Automated separation of respiratory and heartbeat signals among multiple people based on empirical wavelet transform", Sensors, 20, 4913, 2020. [PDF]
- Y. Guo, X. Zhao, A. Wang, J. Li, and W. Wang, "Blind multiple input multiple output image phase retrieval", IEEE Transactions on Industrial Electronics, vol. 67, no. 3, pp. 2220-2230, 2020. [PDF]
- S. Peng, Q. Hu, J. Dang, and W. Wang, "Optimal feasible step-size based working set selection for large scale SVMs training", Neurocomputing, vol. 407, pp. 366-375, September 2020. [PDF]
- J. Dong, Z. Xue, W. Wang, "Robust PCA using nonconvex rank approximation and sparse regularizer", Circuits, Systems, and Signal Processing, vol. 39, pp. 3086-3104, 2020. [PDF]
- Q. Liu, P. Jackson, and W. Wang, "A speech synthesis approach for high quality speech separation and generation", IEEE Signal Processing Letters, vol. 26, no. 12, pp. 1872-1876, December 2019. [PDF]
- L. Remaggi, P. Jackson, and W. Wang, "Modeling the Comb filter effect and interaural coherence for binaural source separation", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 27, no. 12, pp. 2263-2277, 2019. [PDF]
- L. Rencker, F. Bach, W. Wang, and M. D. Plumbley, "Sparse recovery and dictionary learning from nonlinear compressive measurements", IEEE Transactions on Signal Processing, vol. 67, no. 21, pp. 5659-5670, 2019. [PDF]
- W. Yuan, S. Wang, X. Li, M. Unoki, and W. Wang, "A skip attention mechanism for monaural singing voice separation", IEEE Signal Processing Letters, vol. 26, no. 10, pp. 1481-1485, 2019. [PDF]
- Q. Kong, C. Yu, Y. Xu, T. Iqbal, W. Wang, M. D. Plumbley, "Weakly labelled AudioSet tagging with attention neural networks", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 27, no. 11, pp. 1791-1802, 2019. [PDF] [code]
- Y. Sun, Y. Xian, W. Wang, S. M. Naqvi, "Monaural Source Separation in Complex Domain With Long Short-Term Memory Neural Network", IEEE Journal Selected Topics in Signal Processing, vol. 13, no. 2, pp. 359-369, 2019. [PDF]
- Q. Kong, Y. Xu, I. Sobieraj, W. Wang, and M. Plumbley, "Sound event detection and time-frequency segmentation from weakly labelled data", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 27, no. 4, pp. 777-787, 2019. [PDF] [code] (One of the Most Downloaded Papers in IEEE Explorer)
- Y. Chen, W. Wang, Z. Wang, and B. Xia, "A source counting method using acoustic vector sensor based on sparse modeling of DOA histogram", IEEE Signal Processing Letters, vol. 26, no. 1, pp. 69-73, January 2019. [PDF]
- Y. Sun, W. Wang, J.A. Chambers, and M. Naqvi, "Two-stage monaural source separation in reverberant room environments using deep neural networks", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 27. no. 1, pp. 125-139, January 2019. [PDF]
- Y. Guo, T. Wang, J. Li, A. Wang, and W. Wang, "Multiple input single output phase retrieval", Circuits Systems and Signal Processing, January 2019. [PDF]
- J. Wang, G. Li, L. Rencker, W. Wang, and Y. Gu, "A RIP-based performance guarantee of covariance-assisted matching pursuit", IEEE Signal Processing Letters, vol. 26, no. 6, pp. 828-832, 2018. [PDF]
- Y. Wang, Y. Zou, and W. Wang, "Manifold based visual object counting", IEEE Transactions on Image Processing, vol. 27, no. 7, pp. 3248-3263, 2018. [PDF]
- S. Chandna, and W. Wang, "Bootstrap averaging for model-based source separation in reverberant conditions", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 26, no. 4, 806-819, 2018. [PDF]
- J. Dong, Z. Xue, J. Guan, Z.-F. Han, and W. Wang, "Low rank matrix completion using truncated nuclear norm and sparse regularizer", Signal Processing: Image Communication, vol. 68, pp. 76-87, 2018. [PDF]
- Y. Tang, Q. Liu, W. Wang, and T. Cox, "A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones", Speech Communication, vol. 96, no. 2, pp. 116-128, 2018. [PDF]
- J. Kittler, C. Zor, I. Kaloskampis, Y. Hicks, and W. Wang, "Error sensitivity analysis of Delta divergence - A novel measure for classifier incongruence detection", Pattern Recognition, vol. 77, pp. 30-44, 2018. [PDF]
- Q. Liu, W. Wang, T.E. de Campos, P.J.B. Jackson, A.D.M. Hilton, "Multiple Speaker Tracking in Spatial Audio via PHD Filtering and Depth-Audio Fusion", IEEE Transactions on Multimedia, vol. 20, no. 7, pp. 1767-1780, 2018. [PDF]
- Y. Guo, A. Wang, and W. Wang, "Multi-Source Phase Retrieval from Multi-Channel Phaseless STFT Measurements", Signal Processing, vol. 144, pp. 36-40, 2018. [PDF]
- J. Guan, X. Wang, P. Feng, J. Dong, J.A. Chambers, Z. Jiang, W. Wang, "Polynomial dictionary learning algorithms in sparse representations", Signal Processing, vol. 142, pp. 492-503, Jan 2018. [PDF]
- S. Zubair, N. Chaudhary, Z. Khan, W. Wang, "Momentum fractional LMS for power signal parameter estimation", Signal Processing, vol. 142, pp. 441-449, 2018. [PDF]
- J. Liang, Q. Hu, P. Zhu, and W. Wang, "Efficient Multi-modal Geometric Mean Metric Learning", Pattern Recognition, vol. 75, pp. 188-198, March 2018. [PDF]
- D. Wang, Y. Zou, and W. Wang, "Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor", Journal of The Franklin Institute, vol. 355, no. 4, pp. 1692-1709, 2018. [PDF]
- F. Gu, S. Wang, and W. Wang, "Standard-independent I/Q imbalance estimation and compensation scheme in OFDM", Frontiers of IT & EE, vol. 19, no. 3, pp. 388-397, 2018. [PDF]
- A. Franck, W. Wang, and F.M. Fazi, "Sparse, L_1-Optimal Multi-Loudspeaker Panning and its Relation to Vector Base Amplitude Panning", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 25, no. 5, pp. 996 - 1010, May 2017. [PDF] [code]
- Y. Xu, Q. Huang, W. Wang, P. Foster, S. Sigtia, P. J. B. Jackson, and M. D. Plumbley, "Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging," IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 25, no. 6, pp. 1230 - 1241, June 2017. [PDF] [code] [One of the Most Downloaded Papers in IEEE Explorer]
- J. Dong, Z. Han, Y. Zhao, W. Wang, A. Prochazka, and J. Chambers, "Sparse Analysis Model Based Multiplicative Noise Removal with Enhanced Regularization", Signal Processing, vol. 137, pp. 160-176, 2017. [PDF] [code]
- L. Remaggi, P. Jackson, P. Coleman, and W. Wang, "Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 25, no. 2, pp. 296 - 309, February 2017. [PDF] [code]
- J. Liang, Q. Hu, W. Wang, Y. Han, "Semi-Supervised Online Multi-Kernel Similarity Learning for Image Retrieval", IEEE Transactions on Multimedia, vol. 19, no. 5, pp. 1077 - 1089, May 2017. [PDF]
- J. Guan, X. Wang, W. Wang, and L. Huang, "Sparse Blind Speech Deconvolution with Dynamic Range Regularization and Indicator Function", Circuits Systems and Signal Processing, vol. 36, no. 10, pp. 4145-4160, February 2017. [PDF] [code]
- P. Feng, W. Wang, S. Dlay, S.M. Naqvi, and J. A. Chambers "Social Force Model based MCMC-OCSVM Particle PHD Filter for Multiple Human Tracking", IEEE Transactions on Multimedia, vol. 19, no. 4, pp. 725-739, April 2017. [PDF] [code]
- M. Barnard, W. Wang, "Audio head pose estimation using direct to reverberant speech ratio", Speech Communication, vol. 85, no. 12, pp. 98-108, December 2016. [PDF]
- P. Feng, W. Wang, S.M. Naqvi, and J.A. Chambers, "Adaptive Retrodiction Particle PHD Filter for Multiple Human Tracking", IEEE Signal Processing Letters, vol. 23, no. 11, pp. 1592-1596, November 2016. [PDF]
- F. Gu, H. Zhang, W. Wang, and S. Wang, "An Expectation-Maximization Algorithm for Blind Separation of Noisy Mixtures Using Gaussian Mixture Model", Circuits Systems and Signal Processing, DOI 10.1007/s00034-016-0424-2, October 2016.
- V. Kilic, M. Barnard, W. Wang, A. Hilton, and J. Kittler, "Mean-Shift and Sparse Sampling Based SMC-PHD Filtering for Audio Informed Visual Speaker Tracking", IEEE Transactions on Multimedia, vol. 18, no. 10, October 2016. [PDF] [codes] [Invited paper]
- Y. Yu, W. Wang, and P. Han, "Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks", EURASIP Journal on Audio Speech and Music Processing, 2016:7, 18 pages, DOI 10.1186/s13636-016-0085-x, 2016. [PDF] [code]
- J. Dong, W. Wang, W. Dai, M. D. Plumbley, Z. Han, and J. A. Chambers, "Analysis SimCO Algorithms for Sparse Analysis Model Based Dictionary Learning", IEEE Transactions on Signal Processing, vol. 64, no. 2, pp. 417 - 431, 2016. [PDF] [code]
- F. Gu, H. Zhang, W. Wang, and C. Xiong, "A Promising Technique for Blind Identification: The Generic Statistics", Circuits Systems and Signal Processing, vol. 35, no. 7, pp. 2544-2562, DOI 10.1007/s00034-015-0162-x, 2016. [PDF]
- D. Wu, Y. Zhao, W. Wang, and Y. Hao, "Cosparsity-based Stagewise Matching Pursuit algorithm for reconstruction of the cosparse signals", EURASIP Journal on Advances in Signal Processing, 2015: 101, DOI 10.1186/s13634-015-0281-3, December 2015. [PDF]
- L. Zhao, Q. Hu, and W. Wang, "Heterogeneous Feature Selection with Multi-Modal Deep Neural Networks and Sparse Group Lasso", IEEE Transactions on Multimedia, vol. 17, no. 11, pp. 1936-1948, 2015. [PDF]
- X. Chen, W. Wang, Y. Wang, X. Zhong, and A. Alinaghi, "Reverberant speech separation with probabilistic time-frequency masking for B-format recordings", Speech Communication, vol. 68, pp. 41-54, 2015. [PDF]
- V. Kilic, M. Barnard, W. Wang, and J. Kittler, "Audio assisted robust visual tracking with adaptive particle filtering", IEEE Transactions on Multimedia, vol. 17, no. 2, pp. 186-200, 2015. [PDF] [codes]
- A. Alinaghi, P. Jackson, Q. Liu, and W. Wang, "Joint Mixing Vector and Binaural Model Based Stereo Source Separation", IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 22, no. 9, pp. 1434-1448, 2014. [PDF]
- Q. Liu, A. Aubery, and W. Wang, "Interference Reduction in Reverberant Speech Separation with Visual Voice Activity Detection", IEEE Transactions on Multimedia, vol. 16, no. 6, pp. 1610-1623, 2014. [demo] [PDF]
- M. Barnard, P.K. Koniusz, W. Wang, J. Kittler, S. M. Naqvi, and J.A. Chambers, "Robust Multi-Speaker Tracking via Dictionary
Learning and Identity Modelling", IEEE Transactions on Multimedia, vol. 16, no. 3, pp. 864-880, 2014. [PDF]
- B. Rivet, W. Wang, S.M. Naqvi, and J.A. Chambers, "Audio-Visual Speech Source Separation", IEEE Signal Processing Magazine, vol. 31, no. 3, pp. 125-134, 2014. [PDF]
- F. Gu, H. Zhang, W. Wang, and D. Zhu, "PARAFAC-Based Blind Identification of Underdetermined Mixtures Using Gaussian Mixture Model", Circuits Systems and Signal Processing, vol. 33, pp. 1841-1857, 2014. [PDF]
- Y. Zhang, T. Yu, and W. Wang, "An Analysis Dictionary Learning Algorithm under a Noisy Data Model with Orthogonality Constraint", Scientific World Journal: Signal Processing, volume 2014, Article ID 852978, 8 pages, http://dx.doi.org/10.1155/2014/852978, 2014. [PDF]
- Q. Liu, W. Wang, P. Jackson, M. Barnard, J. Kittler, and J.A. Chambers, "Source Separation of Convolutive and Noisy
Mixtures using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking", IEEE Transactions on Signal Processing, vol. 61, no. 22, pp. 5520-5535, 2013. [PDF]
- F. Gu, H. Zhang, W. Wang, and D. Zhu, "Generalized Generating Function with Tucker Decomposition and Alternating Least Squares for Underdetermined Blind Identification", EURASIP Journal on Advances in Signal Processing, 2013:124, doi:10.1186/1687-6180-2013-124, 2013. [PDF]
- M. S. Khan, S. M. Naqvi, Ata-ur-Rehman, W. Wang, and J.A. Chambers, "Video-Aided Model-Based Source Separation in Real Reverberant Rooms", IEEE Transactions on Audio Speech and Language Processing, vol. 21, no. 9, pp. 1900-1912, 2013. [PDF]
- S. Zubair, F. Yan, and W. Wang, "Dictionary Learning Based Sparse Coefficients for Audio Classification with Max and Average Pooling", Digital Signal Processing (Elsevier), vol. 23, pp. 960-970, 2013. [PDF]
- T. Xu, W. Wang, and W. Dai, "Sparse Coding with Adaptive Dictionary Learning for Underdetermined Blind Speech Separation", Speech Communication, vol. 55, no. 3, pp. 432-450, 2013. [PDF]
- W. Dai, T. Xu, and W. Wang, "Simultaneous Codeword Optimisation (SimCO) for Dictionary Update and Learning", IEEE Transactions on Signal Processing, vol. 60, no. 12, pp. 6340-6353, 2012. [PDF] [html] [code]
- Q. Liu, W. Wang, and P. Jackson, "Use of Bimodal Coherence to Resolve Permutation Problem in Convolutive BSS," Signal Processing, Special Issue on Latent Variable Analysis and Signal Separation, vol. 92, vol. 8, pp. 1916-1927, 2012. [PDF] (Invited Paper)
- S.M. Naqvi, W. Wang, M.S. Khan, M. Barnard, and J.A. Chambers, "Multimodal (Audio-Visual) Source Separation Exploiting Multi-Speaker Tracking, Robust Beamforming, and Time-Frequency Masking", IET Signal Processing, Special Issue on Multi-Sensor Signal Processing for Defence: Detection, Localisation & Classification, vol. 6, no. 5, pp. 466-477, 2012. [PDF] (Invited Paper)
- G.R. Naik and W. Wang, "Audio Analysis of Statistically Instantaneous Signals with Mixed Gaussian Probability Distributions", International Journal of Electronics, vol. 99, no. 10, pp. 1333-1350, 2012. [PDF]
- T. Jan, W. Wang, and D.L. Wang, "A Multistage Approach to Blind Separation of Convolutive Speech Mixtures," Speech Communication, vol. 53, pp. 524-539, 2011. [PDF]
- W. Wang, A. Cichocki, and J. A. Chambers, "A Multiplicative Algorithm for Convolutive Non-negative Matrix Factorization Based on Squared Euclidean Distance," IEEE Transactions on Signal Processing, vol. 57, no. 7, pp. 2858-2864, July 2009. [PDF]
- A. Cichocki, M. Morup, P. Smaragdis, W. Wang, and R. Zdunek, "Advances in Nonnegative Matrix and Tensor Factorization (Editorial)," Computational Intelligence and Neuroscience, vol. 2008, Article ID 852187, 3 pages, doi:10.1155/2008/852187, July 2008. [PDF] [Volume]
- W. Wang, Y. Luo, J. A. Chambers, and S. Sanei, "Note Onset Detection via Non-negative Factorization of Magnitude Spectrum," EURASIP Journal on Advances in Signal Processing, vol. 2008, Article ID 231367, 15 pages, doi:10.1155/2008/231367, June 2008. [PDF]
- Y. Luo, W. Wang, J. A. Chambers, S. Lambotharan, and I. Prouder, "Exploitation of Source Non-stationarity for Underdetermined Blind Source Separation With Advanced Clustering Techniques," IEEE Transactions on Signal Processing, vol. 54, no. 6, pp. 2198-2212, June 2006. [PDF]
- M. Jafari, W. Wang, J. A. Chambers, T. Hoya, and A. Cichocki, "Sequential Blind Source Separation Based Exclusively on Second Order Statistics Developed for a Class of Periodic Signals," IEEE Transactions on Signal Processing, vol. 54, no. 3, pp.1028-1040, March 2006. [PDF]
- L. Yuan, W. Wang, and J.A. Chambers, "A Variable Step-Size Sign Natural Gradient Algorithm for Sequential Blind Source Separation," IEEE Signal Processing Letters, vol. 12, no.8, pp. 589-592, August 2005. [PDF]
- W. Wang, S. Sanei, and J.A. Chambers, "Penalty Function Based Joint Diagonalization Approach for Convolutive Blind Separation of Nonstationary Sources," IEEE Transactions on Signal Processing, vol. 53, no. 5, pp. 1654-1669, May 2005. [PDF]
- L. Shoker, S. Sanei, W. Wang, and J.A. Chambers, "Removal of Eye Blinking Artifact from EEG Incorporating a New Constrained BSS Algorithm," IEE Journal on Medical & Biological Engineering & Computing, vol. 43, no. 2, pp. 290-295, March 2005. [PDF]
- W. Wang, M. Jafari, S. Sanei, and J.A. Chambers, "Blind Separation of Convolutive Mixtures of Cyclostationary Signals," International Journal of Adaptive Control and Signal Processing, Special Issue on BSS, vol. 18, no. 3, pp. 279-298, Apr. 2004. [PDF] (This paper was chosen as one of the three "Hot Papers" by Wiley/IEEE worldwide advert for books and journals in signal and image processing in Jan 2008.)
Refereed Papers in International Conference Proceedings:
- Y. Yuan, Z. Chen, X. Liu, H. Liu, X. Xu, D. Jia, Y. Chen, M. Plumbley, and W. Wang, "T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining," in Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2024), London, UK, September 22-25, 2024.
- X. Xu, A. Singh, M. Wu, W. Wang, and M. Plumbley, "Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning," in Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2024), London, UK, September 22-25, 2024.
- Q. Deng, Q. Yang, R. Yuan, Y. Huang, Y. Wang, X. Liu, Z. Tian, J. Pan, G. Zhang, H. Lin, Y. Li, Y. Ma, J. Fu, C. Lin, E. Benetos, W. Wang, G. Xia, W. Xue, and Y. Guo, "ComposerX: Multi-Agent Music Generation with LLMs," in Proceedings of the 25th International Society for Music Information Retrieval Conference (ISMIR 2024), San Francisco, USA, November 10-14, 2024. (accepted)
- X. Xu, H. Liu, M. Wu, W. Wang, and M. Plumbley, "Efficient Audio Captioning with Encoder-Level Knowledge Distillation," in Proceedings of Interspeech (INTERSPEECH 2024), Kos, Greece, 1-5 September 2024.
- J. Sun, W. Wang, M. Plumbley, "PFCA-Net: Pyramid Feature Fusion and Cross Content Attention Network for Automated Audio Captioning," in Proceedings of Interspeech (INTERSPEECH 2024), Kos, Greece, 1-5 September 2024.
- Q. Huang, X. Liu, T. Ko, B. Wu, W. Wang, Y. Zhang, and L. Tang, "Selective Prompting Tuning for Personalized Conversations with LLMs," in The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, August 1116, 2024. [PDF]
- J. Liang, H. Zhang, H. Liu, Y. Cao, Q. Kong, X. Liu, W. Wang, M. D Plumbley, H. Phan, and E. Benetos, "WavCraft: Audio Editing and Generation with Large Language Models," in LLMAgents Workshop @ the International Conference on Learning Representations (ICLR 2024). [PDF]
- J.-J. Brady, Y. Luo, W. Wang, V. Elvira, Y. Li, "Regime Learning for Differentiable Particle Filters," in Proceedings of the 27th International Conference on Information Fusion (FUSION 2024), Venice, Italy, July 7-11, 2024. [PDF]
- J. Zhao, X. Qian, Y. Xu, H. Liu, Y. Cao, D. Berghi, W. Wang, "Text-Queried Target Sound Event Localization," in Proceedings of the 32nd European Signal Processing Conference (EUSIPCO 2024), Lyon, France, August 26-30, 2024. [PDF]
- J. Zhao, X. Liu, J. Zhao, Y. Yuan, Q. Kong, M. Plumbley, and W. Wang, "Universal Sound Separation with Self-Supervised Audio Masked Autoencoder," in Proceedings of the 32nd European Signal Processing Conference (EUSIPCO 2024), Lyon, France, August 26-30, 2024. [PDF]
- Y. Yuan, H. Liu, X. Liu, Q. Huang, M. D. Plumbley, and W. Wang, "Retrieval-augmented text-to-audio generation," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- H. Lan, Q. Zhu, J. Guan, Y. Wei, and W. Wang, "Hierarchical metadata information constrained self-supervised learning for anomalous sound detection under domain shift," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- Y. Hou, Q. Ren, S. Song, Y. Song, W. Wang, and Dick Botteldooren, "Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- H. Liu, K. Chen, Q. Tian, W. Wang, and M. Plumbley, "AudioSR: versatile audio super-resolution at scale," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- D. Berghi, P. Wu, J. Zhao, W. Wang, and P. Jackson, "Fusion of audio and visual embeddings for sound event localization and detection," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- Y. Chen, R. Guo, X. Liu, P. Wu, G. Li, Z. Li, and W. Wang, "CM-PIE: cross-modal perception for interactive-enhanced audio-visual video parsing," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- K. SongGong, P. Zhang, X. Zhang, M. Sun, and W. Wang, "Multi-speaker localization in the circular harmonic domain on small aperture microphone arrays using deep convolutional networks," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- H. Zhang, Q. Zhu, J. Guan, H. Liu, F. Xiao, J. Tian, X. Mei, X. Liu, and W. Wang, "First-shot unsupervised anomalous sound detection with unknown anomalies estimated by metadata-assisted audio generation," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), April 14-19, 2024, Seoul, Korea. [PDF]
- H. Liu, X. Liu, Q. Kong, W. Wang, and M. D. Plumbley, "Learning Temporal Resolution in Spectrogram for Audio Classification," in Processing of 38th AAAI Conference on Artificial Intelligence (AAAI 2024), February, 20-27, 2024, Vancouver, Canada. [PDF] (Acceptance rate: 2342/9862=23.75%)
- Q. Huang, S. Fu, X. Liu, W. Wang, T. Ko, Y. Zhang, L. Tang, "Learning retrieval augmentation for personalized dialogue generation," in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 6-10 December 2023, Resorts World Convention Centre, Singapore. [PDF]
- Y. Yuan, H. Liu, X. Kang, P. Wu, M. D. Plumbley, and W. Wang, "Text-Driven Foley Sound Generation with Latent Diffusion Model", in Proceedings of the International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2023), 20-22 September, 2023, Tampere, Finland. [PDF] (The system described in this paper achieved the First Place in DCASE 2023 Challenge Task 7 (Foley Sound Synthesis) and was also given the Judges' Award.)
- P. Wu, J. Zhao, Y. Chen, B. Davide, Y. Yuan, C. Zhu, Y. Cao, Y. Liu, P. J.B. Jackson, M. D. Plumbley, and W. Wang, "PLDISET: Probabilistic Localization and Detection of Independent Sound Events with Transformers", in Proceedings of the International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2023), 20-22 September, 2023, Tampere, Finland. [PDF]
- J. Hu, Y. Cao, M. Wu, F. Yang, Z. Yu, W. Wang, M.D. Plumbley, and J. Yang, "META-SELD: Meta-Learning for Fast Adaptation to the New Environment in Sound Event Localization and Detection", in Proceedings of the International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2023), 20-22 September, 2023, Tampere, Finland. [PDF]
- X. Zhou, S. Lan, W. Wang, X. Li, S. Zhou, H. Yang, "Visual-Haptic-Kinesthetic Object Recognition with Multimodal Transformer," in 32nd International Conference on Artificial Neural Networks (ICANN 2023), 26-29 September 2023, Heraklion city, Crete, Greece. [PDF]
- B. Pu, C. Yang, H. Yang, S. Lan, W. Ma, W. Pan, and W. Wang, "GanNeXt: A New Convolutional GAN for Anomaly Detection," in 32nd International Conference on Artificial Neural Networks (ICANN 2023), 26-29 September 2023, Heraklion city, Crete, Greece. [PDF]
- P. Li, S. Sun, S. Lan, W. Wang, Y. Gao, Y. Yang, and G. Yu, "Siamese Network based on MLP and Multi-head Cross Attention for Visual Object Tracking," in 32nd International Conference on Artificial Neural Networks (ICANN 2023), 26-29 September 2023, Heraklion city, Crete, Greece. [PDF]
- X. Yin, S. Lan, W. Huang, Y. Ma, W. Wang, H. Yang, Y. Zheng, "DLAHSD: dynamic label adopted in auxiliary head for SAR detection", in Proc. IEEE International Conference on Image Processing (ICIP 2023), 8-11 October, 2023, Kuala Lumpur, Malaysia. [PDF]
- S. A. Soleymani, S. Goudarzi, X. Liu, L. Mihaylova, W. Wang, and P. Xiao, "Multi-target tracking using a swarm of UAVs by Q-learning algorithm", in Proc. IEEE Sensor Signal Processing for Defence (SSPD 2023), 12-13 September, 2023, Edinburgh, UK. [PDF]
- X. Liu, C. Lyu, S. A. Soleymani, W. Wang, and L. Mihaylova, "Joint sensor scheduling and target tracking with efficient Bayesian optimisation", in Proc. IEEE Sensor Signal Processing for Defence (SSPD 2023), 12-13 September, 2023, Edinburgh, UK. [PDF]
- Y. Li, Y. Sun, W. Wang, and M. Naqvi, "Joint learning with shared latent space for self-supervised monaural speech enhancement", in Proc. IEEE Sensor Signal Processing for Defence (SSPD 2023), 12-13 September, 2023, Edinburgh, UK. [PDF]
- O. Cayli, X. Liu, V. Kilic, W. Wang, "Knowledge distillation for efficient audio-visual video captioning," in Proc. 31st European Signal Processing Conference (EUSIPCO 2023), 4-8, September, 2023, Helsinki, Finland. [PDF]
- F. Xiao, Q. Zhu, J. Guan, and W. Wang, "Enhancing audio retrieval with attention-based encoder for audio feature representation," in Proc. 31st European Signal Processing Conference (EUSIPCO 2023), 4-8, September, 2023, Helsinki, Finland. [PDF]
- Y. Yuan, H. Liu, J. Liang, X. Liu, M. D. Plumbley, W. Wang, "Leveraging pre-trained AudioLDM for text to sound generation: a benchmark study," in Proc. 31st European Signal Processing Conference (EUSIPCO 2023), 4-8, September, 2023, Helsinki, Finland. [PDF]
- J. Liang, X. Liu, H. Liu, H. Phan, E. Benetos, M. Plumbley, W. Wang, "Adapting language-audio models as few-shot audio learners," in Proc. 24th Interspeech Conference (INTERSPEECH 2023), 20-24 August, 2023, Dublin, Ireland. [PDF]
- Y. Hou, S. Song, C. Luo, Q. Ren, A. Mitchell, W. Xie, J. Kang, W. Wang, D. Botteldooren, "Joint prediction of audio event and annoyance rating in an urban soundscape by hierarchical graph representation learning," in Proc. 24th Interspeech Conference (INTERSPEECH 2023), 20-24 August, 2023, Dublin, Ireland. [PDF]
- H. Liu, Q. Kong, X. Liu, X. Mei, W. Wang, M. Plumbley, "Ontology-aware learning and evaluation for audio tagging," in Proc. 24th Interspeech Conference (INTERSPEECH 2023), 20-24 August, 2023, Dublin, Ireland. [PDF]
- J. Sun, X. Liu, X. Mei, V. Kilic, M. Plumbley, and W. Wang, "Dual transformer decoder based features fusion network for automated audio captioning," in Proc. 24th Interspeech Conference (INTERSPEECH 2023), 20-24 August, 2023, Dublin, Ireland. [PDF]
- X. Liu, Q. Huang, X. Mei, H. Liu, Q. Kong, J. Sun, S. Li, T. Ko, Y. Zhang, H. Tang, M. Plumbley, V. Kilic, and W. Wang, "Visually-aware audio captioning with adaptive audio-visual attention," in Proc. 24th Interspeech Conference (INTERSPEECH 2023), 20-24 August, 2023, Dublin, Ireland. [PDF]
- H. Liu, Z. Chen, Y. Yuan, X. Mei, X. Liu, D. Mandic, W. Wang, M. D. Plumbley, "AudioLDM: text-to-audio generation with latent diffusion models," in Proc. IEEE International Conference on Machine Learning (ICML 2023), Hawaii, USA, 23-29 July, 2023. [PDF] ([Acceptance rate: 1827/6538=27.9%) [This work has been making significant impact since the release of its source codes and demos in February 2023. See Google entries, University Press Release, Hugging Face Spaces (one of the top ranked machine learning systems)] [See the project page for source codes and demos.]
- W. Ma, S. Lan, W. Huang, W. Wang, H. Yang, Y. Ma, and Y. Ma, "A semantics-aware normalizing flow model for anomaly detection," in Proc. IEEE International Conference on Multimedia and Expo (ICME 2023), Brisbane, Australia, 10-14 July, 2023. [PDF]
- W. Li, X. Chen, W. Wang, V. Elvira, Y. Li, "Differentiable bootstrap particle filters for regime-switching models," in Proc. IEEE International Conference on Statistical Signal Processing (SSP 2023), Hanoi, Vietnam, 2-5 July 2023, 2023. [PDF]
- Y. Hou, Y. Wang, W. Wang, and D. Botteldooren, "GCT: gated contextual transformer for sequential audio tagging," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, 4-9 June, 2023. [PDF]
- J. Guan, F. Xiao, Y. Liu, Q. Zhu, and W. Wang, "Anomalous sound detection using audio representation with machine ID based contrastive learning pretraining," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, 4-9 June, 2023. [PDF]
- W. Yuan, Y. Bian, S. Wang, M. Unoki, and W. Wang, "An improved optimal transport kernel embedding method with gating mechanism for singing voice separation and speaker identification," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, 4-9 June, 2023. [PDF]
- J. Guan, Y. Liu, Q. Zhu, T. Zheng, J. Han, and W. Wang, "Time-weighted frequency domain audio representation with GMM estimator for anomalous sound detection," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, 4-9 June, 2023. [PDF]
- X. Liu, H. Liu, Q. Kong, X. Mei, M. D. Plumbley, and W. Wang, "Simple pooling front-ends for efficient audio classification," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, 4-9 June, 2023. [PDF]
- Q. Huang, Y. Zhang, T. Ko, X. Liu, B. Wu, W. Wang, and L. Tang, "Personalized dialogue generation with persona-adaptive attention," in Proc. the 37th AAAI Conference on Artificial Intelligence (AAAI 2023), Washington, USA, 7-14 February, 2023. [PDF] ([Acceptance rate: 1721/8777=19.6%)
- B. Erabadda, G. Kulupana1, T. Mallikarachchi, W. Wang, and A. Fernando, "A hybrid approach to blind video quality prediction of user generated content," in Proceedings of Picture Coding Symposium (PCS 2022), San Jose, CA, USA, 7-9 December, 2022. [PDF]
- H. Liu, X. Liu, X. Mei, Q. Kong, W. Wang, M. D. Plumbley, "Segment-level metric learning for few-shot bioacoustics event detection," in Proceedings of the International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2022), 3-4, November, 2022, Nancy, France. [PDF] [The system described in this paper won the Second Place in DCASE 2022 Challenge: Task 5 - Few Shot Bioacoustic Event Detection (results)]
- Y. Xiao, X. Liu, J. King, A. Singh, E. S. Chng, M. D. Plumbley, and W. Wang, "Continual learning for on-device environmental sound classification," in Proceedings of the International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2022), 3-4, November, 2022, Nancy, France. [PDF]
- D. Yang, H. Wang, W. Wang, and Y. Zou, "A mixed supervised learning framework for target sound detection," in Proceedings of the International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2022), 3-4, November, 2022, Nancy, France. [PDF]
- X. Liu, H. Liu, Q. Kong, X. Mei, J. Zhao, Q. Huang, M.D. Plumbley, and W. Wang,"Separate What You Describe: Language-Queried Audio Source Separation," in Proc. 23rd Interspeech Conference (INTERSPEECH 2022), 18-22 September, 2022, Incheon, Korea. [PDF] [code]
- X. Mei, X. Liu, J. Sun, M. D. Plumbley, and W. Wang, "On Metric Learning for Audio-Text Cross-Modal Retrieval," in Proc. 23rd Interspeech Conference (INTERSPEECH 2022), 18-22 September, 2022, Incheon, Korea. [PDF] [code] [The system described in this paper won the Second Place in DCASE 2022 Challenge: Task 6b - Language based Audio Retrieval (results)]
- D. Yang, H. Wang, Z. Ye, Y. Zou, and W. Wang, "RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection," in Proc. 23rd Interspeech Conference (INTERSPEECH 2022), 18-22 September, 2022, Incheon, Korea. [PDF] [code]
- J. Zhao, P. Wu, X. Liu, S. Goudarzi, H. Liu, Y. Xu, and W. Wang, "Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter," in Proc. 23rd Interspeech Conference (INTERSPEECH 2022), 18-22 September, 2022, Incheon, Korea. [PDF]
- M. Cui, X. Liu, J. Zhao, J. Sun, G. Lian, T. Chen, M. D. Plumbley, D. Li, and W. Wang, "Fish feeding intensity assessment in aquaculture: a new audio dataset AFFIA3K and a deep learning algorithm", in IEEE 32nd International Workshop on Machine Learning for Signal Processing (MLSP 2022), 22-25 August, 2022, Xi'an, China. [PDF] [Dataset and code]
- C. Yang, S. Lan, W. Huang, W. Wang, G. Liu, H. Yang, W. Ma, and P. Li,"A Transformer-based GAN for Anomaly Detection," in Proc. of the 31st International Conference on Artificial Neural Networks (ICANN 2022), 6-9 September, 2022, Bristol, UK. [PDF]
- W. Huang, S. Lan, W. Wang, X. Yuan, H. Yang, P. Li, and W. Ma, "Face Super-Resolution with Spatial Attention Guided by Multiscale Receptive-Field Features," in Proc. of the 31st International Conference on Artificial Neural Networks (ICANN 2022), 6-9 September, 2022, Bristol, UK. [PDF]
- X. Liu, X. Mei, Q. Huang, J. Sun, J. Zhao, H. Liu, M. D. Plumbley, V. Kilic, and W. Wang, "Leverage pre-trained BERT for audio captioning," in Proc. 30th European Signal Processing Conferences (EUSIPCO 2022), Belgrade, Serbia, 29 August- 2 September, 2022. [PDF]
- O. T. Moral, V. Kilic, A. Onan, W. Wang, "Automated image captioning with multi-layer gated recurrent unit," in Proc. 30th European Signal Processing Conferences (EUSIPCO 2022), Belgrade, Serbia, 29 August- 2 September, 2022. [PDF]
- J. Sun, X. Liu, X. Mei, J. Zhao, M. D. Plumbley, V. Kilic, and W. Wang, "Deep neural decision forest for acoustic scene classification," in Proc. 30th European Signal Processing Conferences (EUSIPCO 2022), Belgrade, Serbia, 29 August- 2 September, 2022. [PDF]
- O. Cayli, V. Kilic, A. Onan, and W. Wang, "Auxiliary classifier based residual RNN for image captioning," in Proc. 30th European Signal Processing Conferences (EUSIPCO 2022), Belgrade, Serbia, 29 August- 2 September, 2022. [PDF]
- W. Wang, J. Guan, X. Che, and W. Wang, "MS-MLP: Multi-scale sampling MLP for ECG classification," in Proc. 30th European Signal Processing Conferences (EUSIPCO 2022), Belgrade, Serbia, 29 August- 2 September, 2022. [PDF]
- J. Zhao, P. Wu, S. Goudarzi, X. Liu, J. Sun, Y. Xu, and W. Wang, "Visually assisted self-supervised audio speaker localization and tracking", in Proc. 30th European Signal Processing Conferences (EUSIPCO 2022), Belgrade, Serbia, 29 August- 2 September, 2022. [PDF]
- S. Lan, Y. Ma, W. Huang, W. Wang, H. Yang, and P. Li, "DSTAGNN: dynamic spatial-temporal aware graph neural network for traffic flow forecasting," in Proc. 39th International Conference on Machine Learning (ICML 2022), Baltimore, Maryland, USA, July 17-23, 2022. [PDF] (Acceptance rate: 21.9%)
- T. Hussain, W. Wang, N. Bouaynaya, H. Fathallah-Shaykh, and L. Mihaylova, "Deep learning for audio visual emotion recognition", in Proc. of 25th International Conference on Information Fusion (FUSION 2022), July 4-7, 2022, Linkoping, Sweden. [PDF]
- S. Goudarzi, W. Wang, P. Xiao, L. Mihaylova, and S. Godsill, "UAV-enabled edge computing for optimal task distribution in target tracking", in Proc. of 25th International Conference on Information Fusion (FUSION 2022), July 4-7, 2022, Linkoping, Sweden. [PDF]
- X. Liu, Q. Li, J. Liang, J. Zhao, P. Wu, C. Lyu, S. Goudarzi, J. George, T. Pham W. Wang, L. Mihaylova, S. Godsill, "Advanced machine learning methods for autonomous classification of ground vehicles with acoustic data", in Proc. of the SPIE Defence + Commercial Sensing on Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications IV, 4-7 April, 2022, Orlando, Florida, USA.
[PDF]
- X. Mei, X. Liu, J. Sun, M. Plumbley, W. Wang, "Diverse audio captioning via adversarial training", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore, 22-27 May, 2022. [PDF] [code]
- P. Wu, J. Zhao, S. Goudarzi, and W. Wang, "Partial arithmetic consensu based distributed intensity particle flow SMC-PHD filter for multi-target tracking", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore, 22-27 May, 2022. [PDF]
- D. Yang, H. Wang, Y. Zou, Z. Ye, and W. Wang, "A mutual learning framework for few-shot sound event detection", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), pp. 811-815, Singapore, 22-27 May, 2022. [PDF] [code]
- J. Zhao, P. Wu, X. Liu, Y. Xu, L. Mihaylova, S. Godsill, and W. Wang, "Audio-visual tracking of multiple speakers via a PMBM filter", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore, 22-27 May, 2022. [PDF]
- Y. Liu, J. Guan, Q. Zhu, and W. Wang, "Anomalous sound detection using spectral-temporal information fusion", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022), Singapore, 22-27 May, 2022. [PDF] [code]
- T. Iqbal, Y. Cao, A. Bailey, M.D. Plumbley, W. Wang, "ARCA23K: An audio dataset for investigating open-set label noise", in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE 2021). [PDF] [code] [dataset]
- X. Mei, Q. Huang, X. Liu, G. Chen, J. Wu, Y. Wu, J. Zhao, S. Li, T. Ko, H.L. Tang, X. Shao, M.D. Plumbley, and W. Wang, "An encoder-decoder based audio captioning system with transfer and reinforcement learning", in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE 2021). [PDF] (The method described in this paper achieved the Third Place in DCASE 2021 challenge on Task 6 - Automated Audio Captioning)
- X. Mei, X. Liu, Q. Huang, M.D. Plumbley, and W. Wang, "Audio captioning transformer", in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE 2021). [PDF]
- X. Liu, Q. Huang, X. Mei, T. Ko, H. Tang, M.D. Plumbley, and W. Wang, "CL4AC: A Contrastive Loss for Audio Captioning", in Proceedings of the Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE 2021). [PDF]
- X. Liu, T. Iqbal, J.Zhao, Q. Huang, M.D. Plumbley, and W. Wang, "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", in IEEE 31th International Workshop on Machine Learning for Signal Processing (MLSP 2021), 2021. [PDF]
- B. Wang, Y. Huang, L. Luo, W. Wang, Y. Zhang, "An Improved Adaptive Kalman Filter for In-motion Initial Alignment of GPS-Aided SINS", in 2021 International Conference on
Autonomous Unmanned Systems (ICAUS 2021), Changsha, China, September 24-26, 2021. [PDF] (Won the Best Paper Award)
- W. Yuan, S. Wang, X.i Li, M. Unoki, and W. Wang, "Crossfire conditional generative adversarial networks for singing voice extraction", in Proc. Interspeech (INTERSPEECH 2021), Brno, Czech Republic, 30 Aug 2021 - 3 Sept 2021. [PDF]
- H. Wang, Y. Zou, and W. Wang, "SpecAugment++: A hidden space data augmentation method for acoustic scene classification", in Proc. Interspeech (INTERSPEECH 2021), Brno, Czech Republic, 30 Aug 2021 - 3 Sept 2021. [PDF]
- S. Lan, J. Li, S. Sun, X. Lai, and W. Wang, "Robust visual object tracking with spatial-temporal regularisation and discriminative occlusion deformation", in Proc. 28th IEEE International Conference on Image Processing (ICIP 2021), Anchorage-Alaska, USA, 19 Sept - 22 Sept 2021. [PDF]
- G. Liu, S. Lan, T. Zhang, W. Huang, and W. Wang, "SAGAN: Skip-attention GAN for anomaly detection", in Proc. 28th IEEE International Conference on Image Processing (ICIP 2021), Anchorage-Alaska, USA, 19 Sept - 22 Sept 2021. [PDF]
- C. Liu, X. Yang, D. Chong, W. Wang, and L. Li, "Enhancing Alzheimer's disease diagnosis via hierarchical 3D-FCN with multi-modal features", in Proc. 28th IEEE International Conference on Image Processing (ICIP 2021), Anchorage-Alaska, USA, 19 Sept - 22 Sept 2021. [PDF]
- L. Pham, C. Baume, Q. Kong, T. Hussain, W. Wang, and M.D. Plumbley, "An audio-based deep learning framework For BBC television programme classification", in Proc. 29th European Signal Processing Conference (EUSIPCO 2021), Dublin, Ireland, 23 Aug - 27 Aug 2021.
- T. Iqbal, K. Helwani, A. Krishnaswamy, and W. Wang, "Enhancing audio augmentation method with consistency learning", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada, 6-11 June, 2021. [PDF]
- Y. Cao, T. Iqbal, Q. Kong, F. An, W. Wang, and M.D. Plumbley, "An improved event-independent network for polyphonic sound event localization and detection", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada, 6-11 June, 2021. [PDF]
- H. Wang, Y. Zou, and W. Wang, "A global-local attention framework for weakly labelled audio tagging", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada, 6-11 June, 2021. [PDF]
- J. Zhang, M. D. Plumbley, W. Wang, "Weighted magnitude-phase loss for speech dereverberation", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada, 6-11 June, 2021. [PDF]
- S. Li, Y. Luo, J. Chambers, and W. Wang, "Dimension selected subspace clustering", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada, 6-11 June, 2021. [PDF]
- J. Guan, W. Wang, P. Feng, X. Wang, and W. Wang, "Low-dimensional denoising embedding transformer for ECG classification", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), Toronto, Canada, 6-11 June, 2021. [PDF]
- S. Hong, Y. Zou, and W. Wang, "Gated Multi-head Attention Pooling for Weakly Labelled Audio Tagging", in Proc. Interspeech 2020, Shanghai, China, 25-29, October, 2020. [PDF]
- H. Wang, Y. Zou, D. Chong, and W. Wang, "Environmental Sound Classification with Parallel Temporal-spectral Attention", in Proc. Interspeech, Shanghai, China, 25-29, October, 2020. [PDF]
- T. Iqbal, Y. Cao, M. D. Plumbley, and W. Wang, "Incorporating auxiliary data for urban sound tagging", in Proc. 5th International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020), Tokyo, Japan, 2-3 November, 2020. [PDF] (Won the Judge's Award)
- S. Safavi, T. Iqbal, W. Wang, P. Coleman, M.D. Plumbley, "Open Window: a sound event dataset for window status detection and recognition" in Proc. 5th International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020), Tokyo, Japan, 2-3 November, 2020. [PDF]
- Y. Cao, T. Iqbal, Q. Kong, Y. Zhong, W. Wang, and M.D. Plumbley, "Event-independent network for polyphonic sound event localization and detection", in Proc. 5th International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2020), Tokyo, Japan, 2-3 November, 2020. [PDF] (Won the Reproducible Systems Award)
- Y. Xian, Y. Sun, W. Wang, M. Naqvi, "Multi-scale residual Convolutional Encoder decoder with bidirectional long short-term memory for single-channel speech enhancement", in Proc. 28th European Signal Processing Conferences (EUSIPCO 2020). [PDF]
- B. Sabeti, H. A. Firouzjaee, R. Fahmi, S.H.E.M. Najafabadi, S. Safavi, W. Wang, and M.D. Plumbley, "Credit risk rating using state machines and machine learning", in Proc. of the 9th Int. Conf. on Economics and Finance Research (ICEFR 2020), June 17-19, 2020, Paris, France. [PDF]
- L. Shi, L. Yu, K. Huang, X. Zhu, Z. Wang, X. Li, W. Wang, and X. Wang, "A covert ultrasonic phone-to-phone communication scheme", in Proc. International Conference on 16th EAI International Conference on Collaborative Computing: Networking, Applications and Worksharing, October 16-18, 2020, Shanghai, China, via Cyberspace. [PDF]
- T. Iqbal, Y. Cao, Q. Kong, M. D. Plumbley, and W. Wang, "Learning with out of distribution data for audio classification", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain, 4-8 May, 2020. [PDF]
- T. Murakami, and W. Wang, "An analytical solution to Jacobsen estimator for windowed signals", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain, 4-8 May, 2020. [PDF]
- Q. Kong, Y. Wang, X. Song, Y. Cao, W. Wang, and M. D. Plumbley, "Source separation with weakly labelled data: an approach to computational auditory scene analysis", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain, 4-8 May, 2020. [PDF]
- J. Guan, J. Liu, J. Sun, P. Feng, T. Shuai, and W. Wang, "Meta metric learning for highly imbalanced aerial scene classification", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain, 4-8 May, 2020. [PDF]
- S. Hong, Y. Zou, W. Wang, and M. Cao, "Weakly labelled audio tagging via convolutional networks with spatial and channel-wise attention", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020), Barcelona, Spain, 4-8 May, 2020. [PDF]
- Q. Kong, Y. Xu, W. Wang, P. Jackson, and M. D. Plumbley, "Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks", in Proc. 28th International Joint Conference on Artificial Intelligence (IJCAI 2019), Macao, China, 10-16 August, 2019. [PDF] (Acceptance rate: 850/4752 = 17.9%)
- Y. Cao, T. Iqbal, Q. Kong, M. B. Galindo, W. Wang and M. Plumbley, "Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks", in Proc. 5th International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2019), New York, US, 25-26, October, 2019. [PDF] (Won the Reproducible Systems Award)
- J. Wang, S. Li, and W. Wang, "SVD-Based Channel Pruning For Convolutional Neural Network In Acoustic Scene Classification Model", in Proc. IEEE International Conference on Multimedia and Expo (ICME) 2019 (ICME 2019), Shanghai, China, 8-12 July, 2019. [PDF]
- Y. Liu, Q. Hu, Y. Zou, and W. Wang, "Labelled non-zero particle flow for SMC-PHD filtering", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May, 2019. [PDF] (Best Student Paper Award Finalist)
- W. Yuan, S. Wang, X. Li, M. Unoki, and W. Wang, "Proximal deep recurrent neural network for monaural singing voice separation", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May, 2019. [PDF]
- S. Li, Y. Gu, Y. Luo, J. Chambers, and W. Wang, "Enhanced streaming based subspace clustering applied to acoustic scene data clustering", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May, 2019. [PDF]
- Q. Kong, Y. Xu, T. Iqbal, Y. Cao, W. Wang, and M. D. Plumbley, "Acoustic scene generation with conditional SampleRNN", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May, 2019. [PDF]
- Y. Tang, Q. Liu, T. Cox, B. Fazenda, W. Wang, "Background adaptation for improved listening experience in broadcasting", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May, 2019. [PDF]
- Y. Zou, Y. Wang, W. Guan, and W. Wang, "Semantic super-resolution for extremely low-resolution vehicle license plate", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May, 2019. [PDF]
- C. Kroos, O. Bones, Y. Cao, L. Harris, P. J. B. Jackson, W. J. Davies, W. Wang, T. J. Cox, and M. D. Plumbley, "Geralisation in environmental sound classification: The 'making sense of sounds' dataset and challenge", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), Brighton, UK, 12-17 May, 2019. [PDF]
- M. Chen, L. Gan & W. Wang, "A New Sparse Linear Array With Three-Level Nested Structure", in Proc. IEEE Sensor Signal Processing for Defence, Brighton, 9-10 May, 2019. [PDF]
- Y. Xian, Y. Sun, W. Wang & S. M. Naqvi, "Two Stage Audio-Visual Speech Separation Using Multimodal Convolutional Neural Networks", in Proc. IEEE Sensor Signal Processing for Defence, Brighton, 9-10 May, 2019. [PDF]
- D. Chong, Y. Zou, and W. Wang, "Multi-channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification", in Proceedings of 25th International Conference on MultiMedia Modeling, pp. 157-168, Thessaloniki, Greece, January 8-11, 2019. [PDF]
- T. Iqbal, Q. Kong, M. D. Plumbley, and W. Wang, "Stacked convolutional neural networks for general-purpose audio tagging", in Proc. 3rd International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018), Surrey, UK, 19-20 November, 2018. [PDF] (The system described in this paper ranked the 3rd place among 558 submissions in the Kaggle challenge on "sound recognition" in 2018.)
- Q. Kong, T. Iqbal, Y. Xu, W. Wang, and M. D. Plumbley, "DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline", in Proc. 3rd International Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2018), Surrey, UK, 19-20 November, 2018. [PDF]
- Y. Liu, W. Wang, and V. Kilic, "Intensity particle flow SMC-PHD filter for audio speaker tracking", in Proc. IEEE Workshop on AASP Challenge on Acoustic Source Localization and Tracking (LOCATA 2018), Tokyo, Japan, 17-20, September, 2018. [PDF]
- L. Rencker, F. Bach, W. Wang, and M. D. Plumbley, "Fast iterative shringkage for signal declipping and dequantization", in Proc. International Travelling Workshop on Interactions between Low-Complexity Data Models and Sensing Techniques (iTWIST 2018), Marseille, France, November 21-23, 2018. [PDF]
- S. Safavi, W. Wang, M. D. Plumbley, A.J. Choobbasti, and G. Fazekas, "Predicting the Perceived Level of Reverberation using Features from Nonlinear Auditory Model", in Proc. 23rd IEEE FRUCT Conference (FRUCT 2018), Bologna, Italy, November 28-31, 2018. [PDF]
- S. Safavi, A. Pearce, W. Wang, and M. D. Plumbley, "Predicting the perceived level of reverberation using machine learning", in Proc. 52nd Asilomar Conference on Signals, Systems and Computers (Asilomar 2018), Pacific Grove, California, USA, October 28-31, 2018. [PDF] (Invited paper)
- Q. Liu, W. Wang, P.J.B. Jackson, and S. Safavi, "A Performance Evaluation of Several Deep Neural Networks for Reverberant Speech Separation", in Proc. 52nd Asilomar Conference on Signals, Systems and Computers (Asilomar 2018), Pacific Grove, California, USA, October 28-31, 2018. [PDF] (Invited paper)
- J. Gao, H. Shi, and W. Wang, "Spatially regularized low rank tensor optimisation for visual data completion", in Proc. 25th IEEE International Conference on Image Processing (ICIP 2018), Athens, Greece, October 7-10, 2018. [PDF]
- S. Li, and W. Wang, "Randomly Sketched Sparse Subspace Clustering for Acoustic Scene Clustering", in Proc. 26th European Signal Processing Conferences (EUSIPCO 2018), Rome, Italy, September 3-7, 2018. [PDF]
- T. Iqbal, Y. Xu, Q. Kong, and W. Wang, "Capsule Routing for Sound Event Detection", in Proc. 26th European Signal Processing Conferences (EUSIPCO 2018), Rome, Italy, September 3-7, 2018. [PDF]
- Y. Sun, W. Wang, J.A. Chambers, and M. Naqvi, "Enhanced Time-Frequency Masking by Using Neural Networks for Monaural Source Separation in Reverberant Room Environments", in Proc. 26th European Signal Processing Conferences (EUSIPCO 2018), Rome, Italy, September 3-7, 2018. [PDF]
- X. Zhang, Y. Zou, and W. Wang, "LD-CNN: A Lightweight Dilated Convolutional Neural Network for Environmental Sound Classification", in Proc. 24th International Conference on Pattern Recognition (ICPR 2018), Beijing, China, August 20-24, 2018. [PDF]
- H. Zhang, H. Shi, and W. Wang, "Cascade Deep Networks for Sparse Linear Inverse Problems", in Proc. 24th International Conference on Pattern Recognition (ICPR 2018), Beijing, China, August 20-24, 2018. [PDF]
- A. Zermini, Q. Kong, Y. Xu, M. D. Plumbley, and W. Wang, "Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks", in Proc. 14th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2018), Guildford, UK, July 2-6, 2018. [PDF]
- L. Rencker, F. Bach, W. Wang, and M. D. Plumbley, "Consistent Dictionary Learning for Signal Declipping", in Proc. 14th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2018), Guildford, UK, July 2-6, 2018. [PDF] (Best student paper award)
- Y. Liu, A. Hilton, J.A. Chambers, Y. Zhao, W. Wang, "Non-zero diffusion particle flow SMC-PHD filter for audio-visual multi-speaker tracking", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF]
- Q. Liu, Y. Xu, P. Jackson, W. Wang, P. Coleman, "Iterative deep neural networks for speaker-independent binaural blind speech separation", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF]
- Y. Xu, Q. Kong, W. Wang, and M. D. Plumbley, "Large-scale weakly supervised audio classification using gated convolutional neural network", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF] (The system described in this paper won the 1st place in the DCASE 2017 challenge on audio tagging.)
- V.H. Tran, W. Wang, Y. Luo, and J.A. Chambers, "Bayesian inference for multi-line spectra in linear sensor array", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF]
- Q. Huang, P. Jackson, M. D. Plumbley, and W. Wang, "Synthesis of images by two-stage generative adversarial networks", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF]
- Q. Kong, Y. Xu, W. Wang, and M. D. Plumbley, "A joint separation-classification model for sound event detection of weakly labelled data", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF]
- J. Kittler, I. Kaloskampis, C. Zor, Y. Xu, Y. Hicks, and W. Wang, "An intelligent signal processing mechanism for nuanced anomaly detection in action audio-visual data streams", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF] (Invited paper)
- Q. Kong, Y. Xu, W. Wang, and M. D. Plumbley, "Audio set classification with attention model: a probabilistic perspective", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018), Calgary, Canada, 15-20 April, 2018. [PDF]
- A. Zermini, Q. Liu, Y. Xu, M. D. Plumbley, D. Betts, and W. Wang, "Binaural and Log-Power Spectra Features with Deep Neural Networks for Speech-Noise Separation", in Proc. IEEE 19th International Workshop on Multimedia Signal Processing (MMSP 2017), Luton, UK, October 16-18, 2017. [PDF]
- T. Iqbal, and W. Wang, "Approximate Message Passing Algorithms for Underdetermined Audio Source Separation", in Proc. Intelligent Signal Processing (ISP 2017), London, December 4-5, 2017. [PDF]
- J. Guan, X. Wang, Z. Xie, S. Qi, and W. Wang, "Joint L1-L2 Regularisation for Blind Speech Deconvolution", in Proc. Pacific-Rim Conference on Multimedia (PCM 2017), Harbin, China, September 28-29, 2017. [PDF]
- J. Guan, X. Wang, S. Qi, J. Dong, and W. Wang, "Blind Speech Deconvolution via Pretrained Polynomial Dictionary and Sparse Representation", in Proc. Pacific-Rim Conference on Multimedia (PCM 2017), Harbin, China, September 28-29, 2017. [PDF]
- Q. Liu, W. Wang, P. Jackson, and Y. Tang, "A Perceptually-Weighted Deep Neural Network for Monaural Speech Enhancement in Various Background Noise Conditions", in Proc. European Signal Processing Conference (EUSIPCO 2017), Kos Island, Greece, August 28- September 2, 2017. [PDF]
- M. Chen, W. Wang, M. Barnard, and J.A. Chambers, "Wideband DoA Estimation Based on Joint Optimisation of Array and Spatial Sparsity", in Proc. European Signal Processing Conference (EUSIPCO 2017), Kos Island, Greece, August 28- September 2, 2017. [PDF]
- L. Rencker, W. Wang, and M. D. Plumbley, "Multivariate Iterative Hard Thresholding for Sparse Decomposition with Flexible Sparsity Patterns", in Proc. European Signal Processing Conference (EUSIPCO 2017), Kos Island, Greece, August 28- September 2, 2017. [PDF]
- J. Guan, X. Wang, P. Feng, J. Dong, and W. Wang, "Matrix of Polynomials Model based Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling", in Proc. Interspeech (Interspeech 2017), Stockholm, Sweden, August 20-24, 2017. [PDF]
- Y. Xu, Q. Kong, Q. Huang, W. Wang and M. D. Plumbley, "Attention and Localization based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging", in Proc. Interspeech (Interspeech 2017), Stockholm, Sweden, August 20-24, 2017. [PDF]
- Y. Xu, Q. Kong, Q. Huang, W. Wang and M. D. Plumbley, "Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging", in Proc. IEEE International Joint Conference on Neural Networks (IJCNN 2017), Anchorage, Alaska, US, May 14-19, 2017. [PDF]
- Y. Liu, W. Wang, Y. Zhao, "Particle Flow for Sequential Monte Carlo Implementation of Probability Hypothesis Density", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, US, March 5-9, 2017. [PDF]
- L. Rencker, W. Wang, M. D. Plumbley, "A greedy algorithm with learned statistics for sparse signal reconstruction", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, US, March 5-9, 2017. [PDF]
- Q. Huang, Y. Xu, P. J. B. Jackson, W. Wang, M. D. Plumbley, "Fast Tagging of Natural Sounds Using Marginal Co-regularization", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, US, March 5-9, 2017. [PDF]
- R. Hamon, V. Emiya, L. Rencker, W. Wang, M. D. Plumbley, "Assessment of musical noise using localization of isolated peaks in time-frequency domain", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, US, March 5-9, 2017. [PDF]
- Q. Kong, Y. Xu, W. Wang, M. D. Plumbley, "A joint detection-classification model for audio tagging of weakly labelled data", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, US, March 5-9, 2017. [PDF]
- M. Chen, L. Gan, W. Wang, "Co-prime Arrays with Reduced Sensors (CARS) for Direction-of-Arrival Estimation", Proc. IEEE Sensor Signal Processing for Defence, London, 9-10 May, 2017.
[PDF]
- Y. Liu, W. Wang, J. Chambers, V. Kilic, and A. Hilton, "Particle Flow SMC-PHD Filter for Audio-Visual Multi-speaker Tracking", in Proc. 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), Grenoble, France, February 21-23, 2017. [PDF] (Invited paper.)
- L. Rencker, and W. Wang, "Covariance-based regularization for sparse audio reconstruction", in Proc. of the 11th IMA International Conference on Mathematics in Signal Processing, Birmingham, 12-14, December 2016. [PDF]
- M. Chen and W. Wang, "Fisher information matrix constrained joint array and spatial sparsity optimisation for DoA estimation", in Proc. of the 11th IMA International Conference on Mathematics in Signal Processing, Birmingham, 12-14, December 2016. [PDF]
- A. Zermini, Y. Yu, Y. Xu, M. D. Plumbley, W. Wang, "Sparse Deep Neural Networks for Audio Source Separation with Contextual Information", in Proc. of the 11th IMA International Conference on Mathematics in Signal Processing, Birmingham, 12-14, December 2016. [PDF]
- J. Guan, X. Wang, W. Wang, Z. Xie, "Blind deconvolution for sparse acoustic system", in Proc. of the 11th IMA International Conference on Mathematics in Signal Processing, Birmingham, 12-14, December 2016. [PDF]
- M. Chen, M. Barnard, and W. Wang, "Joint array and spatial sparsity based optimisation for DoA estimation", in Proc. IEEE Sensor Signal Processing for Defence, Edinburgh, 22-23 September, 2016. [PDF]
- Y. Xu, Q. Huang, W. Wang, P.J.B. Jackson, and M. D. Plumbley, "Fully DNN-based multi-label regression for audio tagging", in Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2016), Budapest, Hungary, 3rd Sept 2016. [PDF]
- Y. Xu, Q. Huang, W. Wang, and M. D. Plumbley, "Hierarchical Learning for DNN-based Acoustic Scene Classification", in Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2016), Budapest, Hungary, 3rd Sept 2016. [PDF]
- Q. Kong, I. Sobieraj, W. Wang, and M. D. Plumbley, "Deep Neural Network Baseline for DCASE Challenge 2016", in Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2016), Budapest, Hungary, 3rd Sept 2016. [PDF]
- H. Wang, W. Fang, W. Wang, Y. Zhang, and S. Sanei, "Analysis Dictionary Learning Based on Max Transvection Function", in IEEE International Conference on Signal and Image Processing (ICSIP 2016), Beijing, China, August 13-15, 2016. [PDF]
- Q. Liu, Y. Tang, P. Jackson, and W. Wang, "Predicting binaural speech intelligibility from signals estimated by a blind source separation algorithm", in Proc. Interspeech (INTERSPEECH 2016), San Francisco, USA, Sept 8-12, 2016. [PDF]
- F. Gu, S. Wang, W. Wang, and J. Wei, "Higher-order circularity based I/Q imbalance compensation in direct-conversion receivers", in Proc. IEEE Vehicular Technology Conference - Fall, Montré Canada, al, Sept 18-21, 2016. [PDF]
- Q. Liu, T. deCampos, W. Wang, and A. Hilton, "Identity association using PHD filters in multiple head tracking with depth sensors", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, March 20-25, 2016. [PDF]
- P. Feng, W. Wang, S. M. Naqvi, S. Dlay, and J.A. Chambers, "Social force model aided robust particle PHD filter for multiple human tracking", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, March 20-25, 2016. [PDF]
- F. Font, T. Brookes, G. Fazekas, M. Guerber, A. La Burthe, D. Plans, M. D. Plumbley, M. Shaashua, W. Wang, and X. Serra, "Audio Commons: bringing creative commons audio content to the creative industries", in Proc. AES Int. Conf. on Audio for Games, London, UK, February 10-12, 2016. [PDF]
- Q. Liu, T. deCampos, W. Wang, P. Jackson and A. Hilton, "Person tracking using audio and depth cues", in Proc. IEEE Proc. ICCV Workshop on 3D Reconstruction and Understanding with Video and Sound (ICCV 2015), Santiago, Chile, December 11-18, 2015. [PDF]
- H. Deif, D. Fitzgerald, W. Wang, and L. Gan, "A local discontinuity based approach for monaural singing voice separation from accompanying music with multi-stage non-negative matrix factorization", in Proc. 3rd IEEE Global Conference on Signal & Information Processing (GlobalSIP 2015), Orlando, Florida, USA, December 14-16, 2015. [PDF]
- H. Deif, W. Wang, L. Gan, and S. Alhashmi, "Separation of Vocals From Monaural Music Recordings Using Diagonal Median Filters and Practical Time-Frequency Parameters", in Proc. 2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), pp. 163-167, 2015. [PDF]
- M. Barnard and W. Wang, "Adaptive Bayesian sparse representation for underwater acoustic signal denoising", in Proc. 2nd IET International Conference on Intelligent Signal Processing (ISP 2015), London, UK, December 1-2, 2015. [PDF]
- S. Shapoori, S. Sanei, and W. Wang, "Blind Source Separation Of Medial Temporal Discharges Via Partial Dictionary Learning", in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2015), Boston, USA, September 17-20, 2015. [PDF]
- J. Dong, W. Wang, J. A. Chambers, "Removing Speckle Noise by Analysis Dictionary Learning", in Proc. IEEE Sensor Signal Processing for Defence (SSPD 2015), Edinburgh, UK, September 9-10, 2015. [PDF]
- P. Feng, W. Wang, S. M. Naqvi, and J. A. Chambers, "Variational Bayesian PHD filter with Deep Learning Network Updating for Multiple Human Tracking", in Proc. IEEE Sensor Signal Processing for Defence (SSPD 2015), Edinburgh, UK, September 9-10, 2015. [PDF]
- X. Zhong, V.N. Hari, W. Wang, X. Shen, and H. Wang, "Particle Filtering for Channel Parameter Tracking in a Noisy Shallow Ocean Environment Using a Vertical Array", in Proc. 5th Asia-Pacific Conference on Synthetic Aperture Radar (APSAR 2015), Marina Bay Sands, Singapore, September 1-4, 2015. [PDF]
- Q. Liu, W. Wang, P. Jackson, and T. Cox, "A Source Separation Evaluation Method in Object-Based Spatial Audio", in Proc. 23rd European Signal Processing Conference (EUSIPCO 2015), pp. 1088-1092, Nice, France, August 31-September 4, 2015. [PDF]
- P. Feng, M. Yu, M. Naqvi, W. Wang, and J.A. Chambers, "A Robust Student's-t Distribution PHD Filter with OCSVM Updating for Multiple Human Tracking", in Proc. 23rd European Signal Processing Conference (EUSIPCO 2015), Nice, France, August 31-September 4, 2015. [PDF]
- J. Guan, J. Dong, X. Wang, and W. Wang, "A Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling", in Proc. 12th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2015), Liberec, Czech Republic, August 25-28, 2015. (Invited Paper) [PDF]
- Y. Yu, W. Wang, J. Luo, and P. Feng, "Localization Based Stereo Speech Separation Using Deep Networks", in Proc. IEEE International Conference on Digital Signal Processing (DSP 2015), Singapore, July 21-24, 2015. [PDF]
- P. Feng, W. Wang, S. M. Naqvi, and J.A. Chambers, "A Robust PHD Filter with Deep Learning Updating for Multiple Human Tracking", in Proc. IEEE International Conference on Digital Signal Processing (DSP 2015), Singapore, July 21-24, 2015. [PDF]
- J. Dong, W. Wang, J.A. Chambers, "Audio Super-Resolution Using Analysis Dictionary Learning", in Proc. IEEE International Conference on Digital Signal Processing (DSP 2015), Singapore, July 21-24, 2015. [PDF]
- V. Kilic, M. Barnard, W. Wang, A. Hilton, and J. Kittler, "Audio Informed Visual Speaker Tracking with SMC-PHD Filter", in Proc. IEEE International Conference on Multimedia and Expo (ICME 2015), Torino, Italy, June 29 - July 3, 2015. (Top 15% paper award) [PDF]
- S. Shapoori, S. Sanei, and W. Wang, "A Novel Approach for Detection of Medial Temporal Discharges Using Blind Source Separation Incorporating Dictionary Look Up", in Proc. 7th International IEEE EMBS Conference on Neural Engineering (NER 2015), Montpellier, France, April 22-24, 2015. [PDF]
- L. Remaggi, P.J.B. Jackson, W. Wang, and J.A. Chambers, "A 3D Model for Room Boundary Estimation", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015), Brisbane, Australia, April 19-25, 2015. [PDF]
- Y. Yu and W. Wang, "Unsupervised Feature Learning for Stereo Source Separation", in Proc. 10th International Conference on Mathematics in Signal Processing (IMA 2014), Birmingham, UK, 15-17, December, 2014.
- Y. Zhang, H. Wang, and W. Wang, "An Analysis Dictionary Learning Algorithm based on Recursive Least Squares", in Proc. International Conference on Signal Processing (ICSP 2014), Hangzhou, China, 19-23 October, 2014. [PDF]
- F. Gu, W. Li, and W. Wang, "Fourth-Order Cumulant based Source Number Estimation from Mixtures of Unknown Number of Sources", in Proc. International Conference on
Wireless Communications and Signal Processing (WCSP 2014), Hefei, China, 23-25 October, 2014. [PDF]
- L. Remaggi, P. Jackson, P. Coleman, W. Wang, "Room Boundary Estimation from Acoustic Room Impulse Responses", in Proc. Sensor Signal Processing for Defence (SSPD 2014), Edinburgh, UK, 8-9 September, 2014. [PDF]
- S. Zubair and W. Wang, "Signal Classification Based on Block Sparse Tensor Representation", in Proc. 19th International Conference on Digital Signal Processing (DSP 2014), Hong Kong, China, 20-23 August, 2014. [PDF]
- S. Chandna and W. Wang, "Improving Model-Based Convolutive Blind Source Separation Techniques via Bootstrap", in Proc. IEEE Statistical Signal Processing Workshop (SSP 2014), Gold Coast, Queensland, Australia, 29 June -02 July, 2014. [PDF]
- V. Kilic, X. Zhong, M. Barnard, W. Wang, and J. Kittler, "Audio-Visual Tracking of a Variable Number of Speakers with a Random Finite Set Approach", in Proc. 16th International Conference on Information Fusion (FUSION 2014), Salamanca, Spain, July 7-10, 2014. (Invited Paper) [PDF]
- X. Zhong, W. Wang, S. Naqvi, and E. S. Chng, "A Bayesian Performance Bound for Time-Delay of Arrival based Acoustic Source Tracking in a Reverberant Environment", in Proc. 16th International Conference on Information Fusion (FUSION 2014), Salamanca, Spain, July 7-10, 2014. (Invited Paper) [PDF]
- J. Dong and W. Wang, "Analysis Dictionary Learning Based on Nesterov's Gradient with Application to SAR Image Despeckling", in Proc. of the 6th International Symposium on Communications, Control, and Signal Processing (ISCCSP 2014), Athens, Greece, May 21-24, 2014. (Invited Paper) [PDF]
- S. Zubair, W. Wang, and J.A. Chambers, "Discriminative Tensor Dictionaries and Sparsity for Speaker Identification", in Proc. of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA 2014), Nancy, France, May 12-14, 2014. (Invited Paper) [PDF]
- J. Dong, W. Wang, and W. Dai, "Analysis SimCO: A New Algorithm for Analysis Dictionary Learning", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), Florence, Italy, May 4-9, 2014. [PDF]
- X. Zhong, V. N. Hari, W. Wang, A. B. Premkumar, and C. T. Lau, "Characterisation of Acoustic Channel in Noisy Shallow Ocean Environment Using a Rao-Blackwellised Particle Filter", in Proc. 2nd International Conference on Signal, Image Processing and Pattern Recognition (SIPP 2014), Sydney, Australia, February 21-22, 2014. [PDF]
- V. Popa, W. Wang, and A. Alinaghi, "Underdetermined Model-Based Blind Source Separation of Reverberant Speech Mixtures using Spatial Cues in a Variational Bayesian Framework", in Proc. IET International Conference on Intelligent Signal Processing (ISP 2013), London, UK, December 3-4, 2013. [PDF]
- A. Alinaghi, P Jackson, and W. Wang, "Comparison between the Statistical cues in BSS techniques and Binaural cues in CASA approaches for reverberant speech separation", in Proc. IET International Conference on Intelligent Signal Processing (ISP 2013), London, UK, December 3-4, 2013. [PDF]
- Y. Zhang, H. Wang, W. Wang, and S. Sanei, "K-Plane Clustering Algorithm For Analysis Dictionary Learning", in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2013), Southampton, UK, September 22-25, 2013. [PDF]
- S. Shapoori, W. Wang, and S. Sanei, "A Constrained Appoach for Extraction of Pre-ictal Discharges from Scalp", in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2013), Southampton, UK, September 22-25, 2013. [PDF]
- X. Zhong, X. Chen, W. Wang, A. Alinaghi, and A.B. Premkumar, "Acoustic Vector Sensor Based Reverberant Speech Separation with Probabilistic Time-Frequency Masking", in Proc. 21st European Signal Processing Conference (EUSIPCO 2013), Marrakech, Morocco, 9-13 September, 2013. [PDF]
- Y. Zhang, H. Wang, T. Yu, and W. Wang, "Subset Pursuit for Analysis Dictionary Learning", in Proc. 21st European Signal Processing Conference (EUSIPCO 2013), Marrakech, Morocco, 9-13 September, 2013. [PDF]
- V. Kilic, M. Barnard, W. Wang, and J. Kittler, "Adaptive Particle Filtering Approach to Audio-Visual Tracking", in Proc. 21st European Signal Processing Conference (EUSIPCO 2013), Marrakech, Morocco, 9-13 September, 2013. [PDF]
- X. Zhong, A. Mohammadi, W. Wang, A.B. Premkumar, and A. Asif, "Acoustic Source Tracking in a Reverberant Environment Using a Pairwise Synchronous Microphone Network", in Proc. 16th International Conference on Information Fusion (FUSION 2013), Istanbul, Turkey, July 9-12, 2013. (Invited Paper) [PDF]
- M. Barnard, W. Wang, J. Kittler, S.M. Naqvi, and J.A. Chambers, "Audio-Visual Face Detection for Tracking in a Meeting Room Environment", in Proc. 16th International Conference on Information Fusion (FUSION 2013), Istanbul, Turkey, July 9-12, 2013. (Invited Paper) [PDF]
- Q. Liu, W. Wang, "Show-Through Removal for Scanned Images Using Nonlinear NMF with Adaptive Smoothing", in Proc. IEEE China Summit and International Conference on Signal and Information Processing (CHINASIP 2013), Beijing, China, July 6-10, 2013. [PDF]
- X. Chen, A. Alinaghi, X. Zhong, and W. Wang, "Acoustic Vector Sensor based Speech Source Separation with Mixed Gaussian-Laplacian Distributions", in Proc. 18th International Conference on Digital Signal Processing (DSP 2013), Santorini, Greece, July 1-3, 2013. [PDF]
- S. Zubair and W. Wang, "Tensor Dictionary Learning with Sparse Tucker Decomposition", in Proc. 18th International Conference on Digital Signal Processing (DSP 2013), Santorini, Greece, July 1-3, 2013. [PDF]
- X. Zhao, T. Xu, G. Zhou, W. Dai, and W. Wang, "Joint Image Separation and Dictionary Learning", in Proc. 18th International Conference on Digital Signal Processing (DSP 2013), Santorini, Greece, July 1-3, 2013. [PDF]
- A. Alinaghi, W. Wang, and P.J.B. Jackson, "Spatial and Coherence Cues Based Time-Frequency Masking for Binaural Reverberant Speech Separation", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), pp. 684-688, Vancouver, Canada, May 26-31, 2013. [PDF]
- M. Barnard, W. Wang, and J. Kittler, "Audio Head Pose Estimation Using the Direct to Reverberant Speech Ratio", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), pp. 8056-8060, Vancouver, Canada, May 26-31, 2013. [PDF]
- V. Kilic, M. Barnard, W. Wang, and J. Kittler, "Audio Constrained Particle Filter Based Visual Tracking", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), pp. 3627-3631, Vancouver, Canada, May 26-31, 2013. [PDF]
- X. Zhong, W. Wang, and A.B. Premkumar, "Direction of Arrival Tracking of an Underwater Acoustic Source Using Particle Filtering: Real Data Experiments", in Proc. IEEE Tencon Spring Conference, Sydney, Australia, April 17-19, 2013. [PDF]
- S. Zubair, W. Dai, and W. Wang, "Sparseness Constrained Tensor Factorization Algorithm for Dictionary Learning over High-Dimensional Space", in Proc. 9th IMA International Conference on Mathematics in Signal Processing (IMA 2012), Birmingham, UK, 17-20 December, 2012. [PDF]
- T. Xu, W. Wang, and W. Dai, "Fast Dictionary Learning Algorithm via Codeword Clustering and Hierarchical Sparse Coding", in Proc. 9th IMA International Conference on Mathematics in Signal Processing (IMA 2012), Birmingham, UK, 17-20 December, 2012. [PDF]
- A. Alinaghi, P. Jackson, and W. Wang, "Separation of Underdetermined Reverberant Speech Mixtures by Monaural, Binaural and Statistical Cue Combination", in Proc. 9th IMA International Conference on Mathematics in Signal Processing (IMA 2012), Birmingham, UK, 17-20 December, 2012. [PDF]
- X. Zhao, G. Zhou, W. Wang, and W. Dai, "Weighted SimCO: A Novel Algorithm for Dictionary Update", in Proc. Sensor Signal Processing for Defence (SSPD 2012), London, UK, 26-27 September, 2012. [PDF]
- T. Jan, and W. Wang, "Frequency Dependent Statistical Model for the Suppression of Late Reverberations", in Proc. Sensor Signal Processing for Defence (SSPD 2012), London, UK, 26-27 September, 2012. [PDF]
- A. Ur-Rehman, S.M. Naqvi, R. Phan, W. Wang, and J. Chambers, "MCMC-PF Based Multiple Head Tracking in a Room Environment", in Proc. BMVC Computer Vision Workshop (BMVW 2012), Guildford, UK, 3-7 September, 2012. [PDF]
- T. Jan and W. Wang, "Blind Reverberation Time Estimation Based on Laplace Distribution," in Proc. 20th European Signal Processing Conference (EUSIPCO 2012), Bucharest, Romania, 26-31 August, 2012. [PDF]
- T. Jan and W. Wang, "Joint Blind Dereverberation and Separation of Speech Mixtures," in Proc. 20th European Signal Processing Conference (EUSIPCO 2012), Bucharest, Romania, 26-31 August, 2012. [PDF]
- Q. Liu, W. Wang, P. Jackson and M. Barnard, "Reverberant Speech Separation Based on Audio-visual Dictionary Learning and Binaural Cues", in Proc. IEEE Statistical Signal Processing Workshop (SSP 2012), pp. 664-667, Ann Arbor, USA, 5-8 August, 2012. [PDF]
- W. Dai, T. Xu, and W. Wang, "Dictionary Learning and Update based on Simultaneous Codeword Optimisation (SIMCO)," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), Kyoto, Japan, March 25-30, 2012. [PDF]
- M. Barnard, W. Wang, J. Kittler, S.M.R. Naqvi, and J.A. Chambers, "A Dictionary Learning Approach to Tracking," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), Kyoto, Japan, March 25-30, 2012. [PDF]
- W. Dai, T. Xu, and W. Wang, "Simultaneous Codeword Optimization (SimCO) for Dictionary Learning," in Proc. 49th Annual Allerton Conference on Communication, Control, and Computing (ALLERTON 2011), Monticello, Illinois, USA, Sept 28-30, 2011. (Invited Paper) [PDF]
- Q. Liu, W. Wang, and P. Jackson, "A Visual Voice Activity Detection Method with Adaboosting," in Proc. IEEE Sensor Signal Processing for Defence (SSPD 2011), London, UK, Sept 28-29, 2011. [PDF]
- S. Zubair and W. Wang, "Audio Classification Based on Sparse Coefficients," in Proc. IEEE Sensor Signal Processing for Defence (SSPD 2011), London, UK, Sept 28-29, 2011. [PDF]
- Q. Liu and W. Wang, "Blind source separation and visual voice activity detection for target speech extraction," in Proc. IEEE 3rd International Conference on Awareness Science and Technology (ICAST 2011), pp. 457-460, Dalian, China, Sept 27-30, 2011. (Invited Paper) [PDF]
- T. Xu and W. Wang, "Methods for Learning Adaptive Dictionary for Underdetermined Speech Separation," in Proc. IEEE 21st International Workshop on Machine Learning for Signal Processing (MLSP 2011), Beijing, China, Sept 18-21, 2011. [PDF]
- S. Grima, M. Barnard, and W. Wang, "Robust Muti-Camera Audio-Visual Tracking," in Proc. 11th UK Workshop on Computational Intelligence (UKCI 2011), Manchester, UK, Sept 7-9, 2011. (Invited Paper) [PDF]
- T. Jan and W. Wang, "Empirical Mode Decomposition for Joint Denoising and Dereverberation," in Proc. 19th European Signal Processing Conference (EUSIPCO 2011), Barcelona, Spain, Aug 29 - Sept 2, 2011. [PDF]
- Q. Liu, S.M.R. Naqvi, W. Wang, P. Jackson, and J.A. Chambers, "Robust Feature Selection for Scaling Ambiguity Reduction in Audio-Visual Convolutive BSS," in Proc. 19th European Signal Processing Conference (EUSIPCO 2011), pp. 1060-1064, Barcelona, Spain, Aug 29 - Sept 2, 2011. [PDF]
- S.M.R. Naqvi, M.S. Khan, Q. Liu, W. Wang, and J.A. Chambers, "Multimodal Blind Source Separation with a Circular Microphone Array and Robust Beamforming," in Proc. 19th European Signal Processing Conference (EUSIPCO 2011), pp. 1050-1054, Barcelona, Spain, Aug 29 - Sept 2, 2011. [PDF]
- A. Alinaghi, W. Wang, and P. Jackson, "Integrating Binaural Cues and Blind Source Separation Method for Separating Reverberant Speech Mixtures," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011), pp. 209-212, Prague, Czech Republic, May 22-27, 2011. [PDF]
- Q. Liu, W. Wang, and P. Jackson, "Audio-visual Convolutive Blind Source Separation," in Proc. Sensor Signal Processing for Defence (SSPD 2010), pp. 1-5, London, UK, Sept 29-30, 2010. [PDF]
- Q. Liu, W. Wang, and P. Jackson, "Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS," in Lecture Notes in Computer Science (LNCS 6365), Springer-Verlag. Proc. 9th International Conference on Latent Variable Analysis and Signal Separation (formerly the International Conference on Independent Component Analysis and Signal Separation) (LVA/ICA 2010), pp. 131-139, St. Malo, France, Sept 27-30, 2010. [PDF] (Best Student Paper Award Normination)
- Q. Liu, W. Wang, and P. Jackson, "Bimodal Coherence based Scale Ambiguity Cancellation for Target Speech Extraction and Enhancement," in Proc. Interspeech (INTERSPEECH 2010), pp. 438-441, Makuhari, Japan, Sept 26-30, 2010. [PDF]
- T. Xu and W. Wang, "Learning Dictionary for Underdetermined Blind Speech Separation Based on Compressed Sensing Method," in Proc. INSPIRE Conference on Information Representation and Estimation (INSPIRE 2010), London, UK, Sept 6-8, 2010. [PDF]
- H. Mustafa and W. Wang, "Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification," in Proc. 7th International Symposium on Computer Music Modeling and Retrieval (CMMR 2010), Malaga, Spain, June 21-24, 2010. (Invited Paper) [PDF]
- T. Xu and W. Wang, "A Block-based Compressed Sensing Method for Underdetermined Blind Speech Separation Incorporating Binary Mask," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), Dallas, Texas, USA, March 14-19, 2010. [PDF]
- Y. Liang, W. Wang, and J.A. Chambers, "Adaptive signal processing techniques for clutter removal in radar-based navigation systems," in Proc. IEEE 43rd Asilomar Conference on Signals, Systems and Computers (Asilomar 2009), Pacific Grove, California, USA, November 1-4, 2009. [PDF] (Invited Paper)
- T. Xu and W. Wang, "A Compressed Sensing Approach for Underdetermined Blind Audio Source Separation with Sparse Representations," in Proc. IEEE International Workshop on Statistical Signal Processing (SSP 2009), Cardiff, UK, August 31-Sept 3, 2009. [PDF] (Top Accessed Article in IEEE Xplore May, June 2010)
- S. Soltuz, W. Wang, and P. Jackson, "A Hybrid Iterative Algorithm for Non-negative Matrix Factorization," in Proc. IEEE International Workshop on Statistical Signal Processing (SSP 2009), Cardiff, UK, August 31-Sept 3, 2009. [PDF]
- T. Jan, W. Wang, and D.L. Wang, "A Multistage Approach for Blind Separation of Convolutive Speech Mixtures," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009), Taipei, Taiwan, April 19-24, 2009. [PDF]
- T. Jan, W. Wang, and D.L. Wang, "Binaural Speech Separation Based on Convolutive ICA and Ideal Binary Mask Coupled with Cepstral Smoothing," in Proc. 8th IMA International Conference on Mathematics in Signal Processing (IMA 2008), Cirencester, UK, December 16-18, 2008. [PDF]
- W. Wang,"One Microphone Audio Source Separation Using Convolutive Non-negative Matrix Factorization with Sparseness Constraints," in Proc. 8th IMA International Conference on Mathematics in Signal Processing (IMA 2008), Cirencester, UK, December 16-18, 2008. [PDF]
- X. Zou, W. Wang, and J. Kittler, "Non-negative Matrix Factorization for Face Illumination Analysis," in Proc. ICA Research Network International Workshop (ICARN 2008), pp. 52-55, Liverpool, UK, September 25-26, 2008. [PDF]
- W. Wang and X. Zou, "Non-Negative Matrix Factorization based on Projected Nonlinear Conjugate Gradient Algorithm," in Proc. ICA Research Network International Workshop (ICARN 2008), pp. 5-8, Liverpool, UK, September 25-26, 2008. [PDF]
- W. Wang, "Convolutive Non-negative Sparse Coding," in Proc. IEEE 5th World Congress on Computational Intelligence (WCCI 2008) & Proc. 21st International Joint Conference on Neural Networks (IJCNN 2008), HongKong, China, June 1-6, 2008. [PDF]
- W. Wang, "Squared Euclidean Distance Based Convolutive Non-negative Matrix Factorization with Multiplicative Learning Rules for Audio Pattern Separation," in Proc. 7th IEEE International Symposium on Signal Processing and Information Technology (ISSPIT 2007), Cairo, Egypt, December 15-18, 2007. [PDF]
- Y. Zhang, J.A. Chambers, W. Wang, P. Kendrick, and T.J. Cox, "A New Variable Step-Size LMS Algorithm with Robustness to Nonstationary Noise," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), vol. III, pp. 1349-1352, Honolulu, Hawaii, USA, April 15-20, 2007. [PDF]
- W. Wang, Y. Luo, J.A. Chambers, and S. Sanei, "Non-negative Matrix Factorization for Note Onset Detection of Audio Signals," in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2006), pp. 447-452, Maynooth, Ireland, September 6-8, 2006. [PDF] (Top Accessed Article in IEEE Xplore June 2010)
- W. Wang, D. Cosker, Y. Hicks, S. Sanei, and J. A. Chambers, "Video Assisted Speech Source Separation," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2005), vol. V, pp.425-428, Philadelphia, USA, March 18-23, 2005. [PDF]
- L. Yuan, E. Sang, W. Wang, and J.A. Chambers, "An Effective Method to Improve Convergence for Sequential Blind Source Separation," Lecture Notes in Computer Science (LNCS 3610), Springer-Verlag, ISBN: 3-540-28323-4. Proc. 1st International Conference on Natural Computation & 2nd International Conference on Fuzzy Systems and Knowledge Discovery (ICNC'05-FSKD'05), pp. 199-208, Changsha, China, August 27-29, 2005. [PDF]
- W. Wang, J.A. Chambers, and S. Sanei, "Subband Decomposition for Blind Speech Separation Using a Cochlear Filterbank," Proc. IMA 6th International Conference on Mathematics in Signal Processing (IMA 2004), pp. 207-210, Cirencester, UK, Dec. 14-16, 2004.
- S. Sanei, L. Spyrou, W. Wang, and J.A. Chambers, "Localization of P300 Sources in Schizophrenia Patients Using Constrained BSS," Lecture Notes in Computer Science (LNCS 3195), Springer-Verlag, ISBN: 3-540-23056-4. Proc. 5th International Conference on Independent Component Analysis and Blind Signal Separation (ICA 2004), pp. 176-183, Granada, Spain, Sept. 22-24, 2004. [PDF]
- W. Wang, J.A. Chambers, and S. Sanei, "Penalty Function Approach for Constrained Convolutive Blind Source Separation," Lecture Notes in Computer Science (LNCS 3195), Springer-Verlag, ISBN: 3-540-23056-4. Proc. 5th International Conference on Independent Component Analysis and Blind Signal Separation (ICA 2004), pp. 657-664, Granada, Spain, Sept. 22-24, 2004. [PDF]
- W. Wang, J.A. Chambers, and S. Sanei, "A Novel Hybrid Approach to the Permutation Problem of Frequency Domain Blind Source Separation," Lecture Notes in Computer Science (LNCS 3195), Springer-Verlag, ISBN: 3-540-23056-4. Proc. 5th International Conference on Independent Component Analysis and Blind Signal Separation (ICA 2004), pp. 530-537, Granada, Spain, Sept. 22-24, 2004. [PDF]
- W. Wang, J.A. Chambers, and S. Sanei, "Penalty Function Based Joint Diagonalization Approach for Convolutive Constrained BSS of Nonstationary Signals," Proc. 12th European Signal Processing Conference (EUSIPCO 2004), Vienna, Austria, Sept. 7-10, 2004.
- S. Sanei, W. Wang, and J.A. Chambers, "A Coupled HMM for Solving the Permutation Problem in Frequency Domain BSS," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2004), vol. V, pp. 565-568, Montreal, Canada, May 17-21, 2004. [PDF]
- W. Wang, S. Sanei, and J.A. Chambers, "Hybrid Scheme of Convolutive BSS and Beamforming for Speech Signal Separation Using Psychoacousitcs Filtering," Proc. International Conference on Control Science and Engineering (ICCSE 2003), Harbin, China, Dec. 18-20, 2003.
- W. Wang, M. Jafari, S. Sanei, and J.A. Chambers, "Blind Separation of Convolutive Mixtures of Cyclostationary Sources Using an Extended Natural Gradient Method," Proc. IEEE 7th International Symposium on Signal Processing and its Applications (ISSPA 2003), vol. II, pp. 93-96, Paris, France, Jul. 1-4, 2003. [PDF]
- W. Wang, J.A. Chambers, and S. Sanei, "A Joint Diagonalization Method for Convolutive Blind Separation of Nonstationary Sources in the Frequency Domain," Proc. 4th International Symposium on Independent Component Analysis and Blind Signal Separation (ICA 2003), pp. 939-944, Nara, Japan, Apr. 1-4, 2003. [PDF]
- Y. Wang, W. Wang, F. Sun, and C. Liu, "Modeling and Simulation of Submarine Emergency Maneuver," Proc. 5th International Conference on System Simulation and Scientific Computing (ICSC 2002), Shanghai China, Nov. 3-6, 2002.
- Y. Wang, W. Wang, F. Sun, and G. Wang, "Simulation of Submarine Near-Surface Motion under Disturbance Forces," Proc. International Conference of the Society for Computer Simulation International (SCS): 2002 Summer Computer Simulation Conference (SCSC2002), San Diego, USA, Jul. 14-18, 2002.
- W. Wang, H. Liu, X. Zhang, and Y. Wang, "The Design of Data Structure in Chart Modification of ECDIS," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 115-118, Harbin, China, Oct. 10-12, 2001. [HTML]
- W. Wang, Y. Wang, K. Yin, J. Rong, and Y. Xu, "Modeling and Simulation of Six DOF Maneuvering for Submarine," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 428-432, Harbin, China, Oct. 10-12, 2001. [HTML]
- X. Zhan, W. Wang, and L. Zhao, "Design of a Neural Network PID Controller and the Application in Ship Autopilot," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 451-454, Harbin, China, Oct. 10-12, 2001. [HTML]
- R. Xie, W. Wang, and F. Sun, "A Semi-fragile Digital Watermarking Technique Based on Wavelet Transform For Image Authentication," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 408-412, Harbin, China, Oct. 10-12, 2001. [HTML]
- S. Hu, W. Wang, Y. Wang, and X. Zhang, "Multidimensional Data Analysis of Data Warehouse," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 433-435, Harbin, China, Oct. 10-12, 2001. [HTML]
- Y. Wang, W. Wang, L. Zhao, and X. Shen, "The Design and Implementation of Visual System of a Submarine Voyage Training Simulator," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 440-444, Harbin, China, Oct. 10-12, 2001. [HTML]
- C. Li, R. Xie, W. Wang, and S. Yang, "The Application of the Back-Propagation Neural Network in the SINS/GPS System Malfunction Identification," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 43-46, Harbin, China, Oct. 10-12, 2001. [HTML]
- X. Zhang, Q. Wu, W. Wang, and D. Li, "Multiplex Coupled Chaos Synchronization Based on Lyapunov Function," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 198-202, Harbin, China, Oct. 10-12, 2001. [HTML]
- X. Zhang, H. Lin, W. Wang, and D. Li, "Frequency Hopping Sequence Based on Chebyshev Map," Proc. International Conference on Navigation, Guidance and Control (ICNGC 2001), pp. 458-460, Harbin, China, Oct. 10-12, 2001. [HTML]
Edited Books
- G. Naik and W. Wang (eds), Blind Source Separation: Advances in Theory, Algorithms and Applications, ISBN 978-3-642-55015-7, Springer, 2014.
- W. Wang (ed), Machine Audition: Principles, Algorithms and Systems, ISBN13: 9781615209194, ISBN10: 1615209190, IGI Global Press, 532 pages, ISBN-13: 978-1615209194, August, 2010. [link]
Edited Conference Proceedings
- S.M. Zhou and W. Wang (eds), Proceedings of 2009 IEEE/WRI Global Congress on Intelligent Systems, ISBN: 978-0-7695-3571-5, IEEE Computer Society Press, 2397 pages, May, 2009. [Front/Back Cover and CD-ROM]
Book Chapters
- V. Kilic, W. Wang, "Audio-Visual Speaker Tracking", in Motion Tracking and Gesture Recognition, InTech, 2017. (in press)
- X. Zhao, G. Zhou, W. Dai and W. Wang, "Blind Source Separation Based on Dictionary Learning: A Singularity-Aware Approach", in G.R. Naik and W. Wang (eds), Blind Source Separation: Advances in Theory, Algorithms and Applications, ISBN 978-3-642-55015-7, Springer, pp. 39-60, 2014. [PDF]
- Y. Liang, S. M. Naqvi, W. Wang, and J. A. Chambers, "Frequency Domain Blind Source Separation Based on Independent Vector Analysis with a Multivariate Generalised Gausian Source Prior", in G.R. Naik and W. Wang (eds), Blind Source Separation: Advances in Theory, Algorithms and Applications, ISBN 978-3-642-55015-7, Springer, pp. 131-150, 2014. [PDF]
- G. Naik, and W. Wang, "Preface (Editorial)", in G.R. Naik and W. Wang (eds), Blind Source Separation: Advances in Theory, Algorithms and Applications, ISBN 978-3-642-55015-7, Springer, pp. 131-150, 2014. [PDF]
- A. Lal and W. Wang, "Music Audio Separation using Spectral Template and Isolated Note Information", in G.R. Naik (ed), Independent Component Analysis for Audio and Biosignal Applications, ISBN 980-953-307-197-3, InTech Press, 2012. (Invited chapter) [PDF]
- W. Wang and H. Mustafa, "Single Channel Music Sound Separation Based on Spectrogram Decomposition and Note Classification," in S. Ystad, M. Aramaki, R. Kronland-Martinet, and K. Jensen (eds), Computer Music Modeling and Retrieval, ISBN 978-3-642-23125-4, Springer, 2011. (Invited chapter) [PDF]
- T. Jan and W. Wang, "Cocktail Party Problem: Source Separation Issues and Computational Methods," in W. Wang (ed), Machine Audition: Principles, Algorithms and Systems, ISBN13: 9781615209194, ISBN10: 1615209190, IGI Global Press, pp. 61-79, August, 2010.
[PDF]
- W. Wang, "Instantaneous versus Convolutive Non-negative Matrix Factorization: Models, Algorithms and Applications to Audio Pattern Separation," in W. Wang (ed), Machine Audition: Principles, Algorithms and Systems, ISBN13: 9781615209194, ISBN10: 1615209190, IGI Global Press, pp. 353-370, August, 2010. [PDF]
- W. Wang, "Preface (Editorial)," in W. Wang (ed), Machine Audition: Principles, Algorithms and Systems, ISBN13: 9781615209194, ISBN10: 1615209190, IGI Global Press, pp. xv-xxi, August, 2010. [link] [PDF]
Earlier Journal Papers published in Chinese:
- Y. Wang, W. Wang, F. Sun, and G.F. Wang, "Modeling and Simulation of Submarine Emergency Maneuver," Journal of Computer Simulation (a prestigious Chinese journal, same for the following journal papers), vol. 20, no. 6, pp. 1-3, 2003. [PDF]
- Y. Wang, W. Wang, F. Sun, and G.F. Wang, "Simulation of Submarine Near-Surface Motion under Disturbance Force," Journal of System Simulation, vol. 15, no.1, pp. 84-87, 2003. [PDF]
- X. Wang, X. Zhang, W. Wang, and D. Li, " Chaotic Scalar Signal's Synchronization of Chua's Circuit Using Adaptive Control," Journal of Computer Simulation, vol. 19, no. 2, pp. 89-96, April, 2002.
- X. Zhang, D. Li, S. Chen, and W. Wang, " High precision Synchronization of Hyperchaotic System Based on State Observer," Journal of Circuits and Systems, vol. 6, no.4, pp. 15-19, December, 2001
- W. Wang, L. Zhao, Y. Hao, and S. Yang, "The Design of Submarine Voyage Training Simulator Based on Virtual Reality," Journal of System Simulation, vol. 13, no. 5, pp. 599-601, October 2001.
- W. Wang, F. Sun, X. Zhan, and L. Zhao, "Design and Implementation of the Visual System of the Submarine Voyage Training Simulator Based on VR," Journal of Computer Engineering and Applications, vol. 37, no. 22, November 2001.
- W. Wang, C. Liu, S. Yang, and Y. Hao, "Military Information Countermining in Computer Network," Journal of Ship Electronic Engineering, no. 2, pp. 42-46, February 2001.
- X. Zhang, S. Chen, W. Wang, and D. Li, "Realization of High-Precision Synchronization of Hyper-chaotic Signals by State Observers," Journal of Harbin Engineering University, vol. 22, no. 4, pp. 29-34, August 2001.
- W. Wang, and Y. Hao, "Encoding and Decoding Software by Using the Property of a Hard Disk," Journal of Applied Science and Technology, no. 11, pp. 45-47, November 2001.
- Y. Wang, L. Zhao, X. Shen, and W. Wang, "Design and Implementation of Visual System in Submarine Voyage Training Simulator," Journal of Harbin Engineering University, vol. 22, no. 2, pp. 30-32, April 2001.
- W. Wang, F. Sun, C. Liu, and R. Xie, "The Algorithm Continuously Used to Calculate the Apparent Place of Celestial Bodies in the Solar System," Journal of Harbin Engineering University, vol. 21, no. 5, pp. 18-23, October 2000.
- W. Wang, and Y. Hao, "Design Scheme for the Marine Celestial Navigation System Based on the ECDIS System," Journal of Chinese Navigation, vol. 46, no. 1, pp. 71-77, June 2000.
- W. Wang, B. Qiao, and G. Qu, "How to Play Multimedia Animation in Visual C," Modern Computer, vol. 75, no. 6, pp. 65-67, June 2000.
- W. Wang, C. Wang, Y. Hao, and J. Zhou, "Making and Using Multimedia Timer," Computer Applications, vol. 17, no. 3, pp. 16-19, March 2000. [PDF]
- S. Yang, J. Tang, W. Wang, and C. Li, "Research to Life of Ship Electric Power System," Journal of Ship and Electric Technique, vol. 20, no. 2, pp. 1-7, 2000.
- J. Zhou, K. Wang, W. Wang, and J. Zhang, "Curriculum Schedule Arrangement Expert System," Computer Applications, vol. 20, no. 5, pp. 76-78, May 2000.
- W. Wang, H. Wang, and F. Sun, "The Algorithm Continuously Used to Calculate the Apparent Place of Stars," Journal of Harbin Engineering University, vol. 19, no. 6, pp. 35-41, December 1998. (Excellent Paper Award from HEU).
Refereed Conference Abstracts:
- A. Zermini, Y. Yu, Y. Xu, M.D. Plumbley, and W. Wang, "Sparse Deep Neural Networks for Audio Source Separation," in UK Speech Conference (UKSpeech 2016), Sheffield, UK, June 20-21, 2016.
- L. Rencker, and W. Wang, "Sparsity based declipping of speech signals," in UK Speech Conference (UKSpeech 2016), Sheffield, UK, June 20-21, 2016.
- S. Zubair, W. Wang, and J. Chambers, "Block-structured sparse tensor decomposition for classification of multi-dimensional data," in UCL-Duke Workshop on Sensing and Analysis of High-Dimensional Data (SAHD 2014), London, UK, September 4-5, 2014.
- M. Barnard, W. Wang, and J. Kittler, "Head Pose Estimation from Reverberant Speech," in Proc. CLEAR Workshop on Enhancement of Degraded Speech: Processing, Modelling, Evaluation (CLEAR 2012), London, UK, October 31, 2012.
- X. Zhao, G. Zhou, W. Wang, and W. Dai, "An Algorithm for Dictionary Learning: SimCO with Weighted Objective Functions," in Proc. 3rd IMA Conference on Numerical Linear Algebra and Optimisation (NLAO 2012), Birmingham, UK, September 10-12, 2012.
- Q. Liu, and W. Wang, "Bimodal Dictionary Learning for Model-Based Source Separation of Noisy Mixtures," in Proc. SMALL Workshop on Sparsity, Localisation and Dictionary Learning (SMALL 2012), London, UK, June 26-27, 2012.
- S. Shahrzad, S. Sanei, and W. Wang, "Blind Separation of Sparse Sources for Detection of Deep Brain Sources," in Proc. 8th Annual Computing Department PhD Conference (CompConf 2012), Guildford, UK, March 21, 2012.
- T. Jan and W. Wang, "Suppression of Late Reverberations Based on a Frequency Dependent Model," in Proc. 2012 SCANDAL Workshop on Sounds and Sound Processing in Natural and Artificial Systems: Making Sense of Sound (SCANDAL 2012), Plymouth, UK, Feb 20-21, 2012.
- A. Alinaghi, W. Wang, and P. Jackson, "Underdetermined Reverberant Speech Separation Using Binaural Cues and Blind Source Separation Approach," in Proc. 2011 AUDIS Conference on Signal Processing and Audiology - From Front-end to Perception (AUDIS 2011), Southampton, UK, September 12, 2011.
- T. Xu and W. Wang, "Methods for Training Adaptive Dictionary for Underdetermined Speech Separation," in Proc. 4th Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS 2011), Edinburgh, UK, June 27-30, 2011. [PDF]
- T. Jan, W. Wang, and D.L. Wang, "A Novel Multi-stage Approach For Blind Separation of Convolutive Speech Mixtures," In Proc. One-day Meeting for Young Speech Researchers (UK SPEECH 2008), Guildford, UK, July 16, 2008.
Invited Poster Presentation:
- A. Alinaghi, W. Wang, and P. Jackson, "Separation and Enhancement of Reverberant Speech Mixtures using Binaural cues, Statistical properties and Precedence effect", in UK and IE Speech Conference, Birmingham, UK, December 17-18, 2012. (Organizers: Edinburgh Speech Group)
- W. Wang, and M. Barnard, "Audio and Audio-Visual Source Separation for Machine Listening," in BBC Audio Research Partnership Annual Meeting, MediaCityUK, Manchester, UK, September 11-12, 2012. (Organizers: BBC R&D Audio Team)
- V. Kilic, M. Barnard, and W. Wang, "Audio-Visual Tracking," in BMVA Summer School, Manchester, UK, September 11-12, 2012. (Organizers: BMVA)
- W. Wang, "Audio and Audio-Visual Source Separation for Machine Listening," in BBC Audio Research Partnership Launch Meeting, MediaCityUK, Manchester, UK, July 8, 2011. (Organizers: BBC R&D Audio Team) (For more information about the partnership, see media coverage on BBC, How-Do, TVBEurope, and also Graham's blog)
- T. Xu and W. Wang, "Adaptive Dictionary Learning Based Compressive Sensing for Underdetermined Speech Separation," in Machine Listening Workshop (MLW 2010), Queen Mary University of London, UK, December 20, 2010. (Organizers: Prof Mark Plumbley and Dr Matthew Davies)
- A. Alinaghi, W. Wang and P. Jackson, "Blind Separation of Reverberant Speech Mixtures (via Statistical Modeling of Binaural Cues and Mixing Vectors)," in Machine Listening Workshop (MLW 2010), Queen Mary University of London, UK, December 20, 2010. (Organizers: Prof Mark Plumbley and Dr Matthew Davies)
- T. Jan, W. Wang, and D.L. Wang, "A Multi-stage Approach For Blind Separation of Convolutive Speech Mixtures," Open Afternoon in ICA Research Network International Workshop (ICARN 2008), Liverpool, UK, September 26, 2008. (Organizer: Dr Mark Plumbley)
Plenary/Keynote/Tutorial Speech on International Conferences/Workshops/Seasonal Schools:
- Keynote Speaker, IEEE International Conference on Signal, Information and Data Processing (ICSIDP 2019), Chongqing, China, 11-13 December 2019. (1000+ attendees)
- Keynote Speaker, International Conference on Digital Image and Signal Processing (DISP 2019), Oxford, UK, April 29-30, 2019.
- Keynote Speaker, China Sound and Music Technology Conference (CSMT 2018), Xiamen University, Xiamen, China, November 24-26, 2018.
- Plenary Speaker, The 7th Int'l Conference on Signal and Image Processing (CSIP 2018), Sanya, China, November 28-30, 2018.
- Keynote Speaker, CCF Workshop on Sparse Representation and Deep Learning, Shenzhen, China, August 17, 2018.
- Keynote Speaker, IEEE Int. Conf. on Signal Processing and Integrated Networks, February 22-23, 2018.
- Plenary Speaker, Korea-UK Focal Point Workshop on Intelligent Virtual Reality: Deep Audio-Visual Representation Learning for Multimedia Perception and Reproduction, January 24, 2018.
- Plenary Speaker, IET Intelligent Signal Processing Conference, London, UK, December 4-5, 2017.
- Plenary Speaker, Alan Turing Institute Workshop on Data Science and Signal Processing, November 29, 2017.
- Tutorial Speaker, Intelligent Sensing Summer School, London, UK, September 7-9, 2017.
- Tutorial Speaker, UDRC Summer School on Signal Processing for Defence, Surrey, UK, June 26-29, 2017.
- Keynote Speaker, The 2nd International Conference on Fuzzy Systems and Data Mining, Macau, China, December 11-14, 2016.
- Tutorial Speaker, "Introduction to Source Separation", UDRC Summer School, Edinburgh, UK, June 27-30, 2016. [PDF]
- Tutorial Speaker, "Convolutive source separation", UDRC Summer School, Edinburgh, UK, June 27-30, 2016. [PDF]
- Tutorial Speaker, "Sparse representation and dictionary learning for source separation, localisation and tracking", SpaRTaN-MacSeNet Spring School on Sparse Representations and Compressed Sensing, Ilmenau, Germany, 4th-8th April 2016. [PDF]
- Tutorial Speaker, "Introduction to Source Separation", UDRC Summer School, Surrey, UK, July 23, 2015. [PDF]
- Tutorial Speaker, "Frequency Domain Source Separation", UDRC Summer School, Surrey, UK, July 23, 2015. [PDF]
- Tutorial Speaker, "Audio Visual and Sparsity Based Source Separation", UDRC Summer School on Signal Processing, Edinburgh, UK, June 26, 2014. [PDF]
- Tutorial Speaker, "Convolutive Source Separation & Demonstrations", UDRC Summer School on Signal Processing, Edinburgh, UK, June 26, 2014. [PDF]
- Tutorial Speaker, (with W. Dai, and B. Mailh), "Dictionary Learning for Sparse Representations: Algorithms and Applications", ICASSP, Vancouver, Canada, May 26-31, 2013. [PDF]
- Plenary speaker, "Machine Audition at CVSSP", in UK & IE Speech Conference, Birmingham, UK, December 17-18, 2012.
- Plenary speaker, (with J.A. Chambers and S. Sanei) "Has the Permutation Problem in Transform Domain BSS Been Solved?," IEE Workshop on Independent Component Analysis: Generalizations, Algorithms and Applications, Queen Mary University of London, London, Dec. 20, 2002.
Invited Seminars:
- W. Wang "Convolutive source separation", University of West London, Computer Science Department, London, UK, October 11, 2018. (Organiser: Prof Henry Wang)
- W. Wang "Deep learning for audio classification", Harbin Institute of Technology, Shenzhen Graduate School, Shenzhen, China, August 10, 2018. (Organiser: Prof Xuan Wang)
- W. Wang "Deep learning for audio classification", Oxford University, Oxford, UK, June 29, 2018. (Organiser: Prof Steve Robert and Dr Yunpeng Li)
- W. Wang "Deep learning for audio classification", Tianjin University, Tianjin, China, December 21, 2017. (Organiser: Qinghua Hu)
- W. Wang "Acoustic reflector localisation and its application in blind source separation", Shenzhen University, August 20, 2017. (Organiser: Prof Lei Huang)
- W. Wang "Acoustic reflector localisation and its application in blind source separation", Harbin Institute of Technology, Shenzhen Graduate School, August 17, 2017. (Organiser: Prof Xuan Wang)
- W. Wang "Convolutive blind source separation", Beijing Institute of Technology, Beijing, China, April 24, 2017. (Organiser: Prof Qiang Fu)
- W. Wang "Audio-visual tracking of multiple moving sources", Tsinghua University, Beijing, China, April 24, 2017. (Organiser: Dr Yuantao Gu)
- W. Wang "Convolutive blind source separation", Shenzhen University, Shenzhen, China, April 19, 2017. (Organiser: Prof Lei Huang)
- W. Wang "Audio-visual tracking of multiple moving sources", Harbin Institute of Technology at Shenzhen, China, April 18, 2017. (Organiser: Prof Xuan Wang)
- W. Wang "Sparse analysis model based dictionary learning and signal reconstruction", Harbin Engineering University, Harbin, China, April 14, 2017. (Organiser: Prof. Jinwei Yin and Dr Juan Hui)
- W. Wang "Convolutive blind source separation”, Harbin Engineering University, Harbin, China, April 12, 2017. (Organiser: Prof. Jinwei Yin and Dr Juan Hui)
- W. Wang, "Polynomial dictionary learning and sparse representation", The 1st Workshop on Polynomial Matrix Decompositions and Their Applications, The Royal Society, Kavli International Research Centre, Chicheley Hall, UK, August 24-26, 2016. (Organiser: Prof John McWhirter and Dr Stephan Weiss)
- W. Wang, "Sparse analysis model based dictionary learning and signal reconstruction", School of Electronic and Computer Engineering, Peking University, Shenzhen Graduate School, Shenzhen, China, 3pm, August 19, 2016. (Organiser: Prof Yuexian Zou) [news]
- W. Wang, "Sparse analysis model based dictionary learning and signal reconstruction", School of Electronic and Computer Engineering, Harbin Institute of Technology, Shenzhen Graduate School, Shenzhen, China, 10am, August 19, 2016. (Organiser: Dr Lin Jiang)
- W. Wang, "Sparse analysis model based dictionary learning and signal reconstruction", College of Information Engineering, Shenzhen University, Shenzhen, China, August 17, 2016. (Organiser: Prof Lei Huang)
- W. Wang, "Speech source separation", MRC Microphone Network Meeting, Cardiff University, Cardiff, UK, June 14, 2016. (Organiser: Prof John Culling)
- W. Wang, "Probabilistic Time-Frequency Masking for Convolutive Blind Source Separation", School of Electronic and Computer Engineering, Peking University, Shenzhen Graduate School, Shenzhen, China, August 18, 2015. (Organiser: Prof Yuexian Zou) [news]
- W. Wang, "Probabilistic Time-Frequency Masking for Convolutive Blind Source Separation", School of Engineering and Digital Arts, Kent University, Canterbury, UK, June 03, 2015. (Organiser: Prof Steven Gao)
- W. Wang, "Dictionary Learning for Sparse Representations - Applications to Image Denoising, Source Separation, and Visual Tracking", School of Electronic and Computer Engineering, Tianjin University, Tianjing, China, December 23, 2014. (Organiser: Prof Qinghua Hu)
- W. Wang, "Dictionary Learning based Sparse Representations for Audio-Visual Signal Processing", Shenzhen Graduate School of Harbin Institute of Technology, Shenzhen, China, April 22, 2014. (Organiser: Prof Xuan Wang and Mr Jian Guan)
- W. Wang, "Dictionary Learning in Sparse Representations: New Algorithms and Applications", PLA University of Science & Technology, Nanjing, China, December 17, 2013. (Organiser: Prof Hang Zhang)
- W. Wang, "Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking in Convolutive and Noisy Source Separation", UDRC Source Separation and Sparsity Theme Meeting, University of Edinburgh, Edinburgh, UK, October 31, 2013. (Organiser: Dr Janet Forbes and Prof Mike Davies)
- W. Wang, "Source Separation of Convolutive and Noisy Mixtures using Audio-Visual Dictionary Learning and Probabilistic Time-Frequency Masking", University of Reading, Reading, UK, October 23, 2013. (Organiser: Dr Sillas Hadjiloucas and Prof Xia Hong)
- W. Wang, "Source Separation and Sparse Representation", Harbin Engineering University, Harbin, China, September 26, 2013. (Organiser: Dr Yuxin Zhao)
- W. Wang, "Audio and Audio-Visual Source Separation, Localisation, and Tracking", BAE Systems, Guildford, UK, June 20, 2013. (Organiser: Philip Pring)
- W. Wang, "Dictionary Learning Algorithms in Sparse Representations and Signal Processing," (Organizer: Dr Wei Liu), Department of Eletronic and Electrical Engineering , Sheffield University, October 24, 2012.
- W. Wang, "Dictionary Learning Algorithms in Signal Processing," (Organizer: Dr Lu Gan), School of Engineering and Design, Brunel University, August 1, 2012.
- W. Wang, "Adaptive Dictionary Learning Algorithms for Image Denoising, Source Separation, and Visual Tracking," (Organizer: Dr Andrew Aubrey), Cardiff School of Computer Science and Informatics, Cardiff University, May 24, 2012.
- W. Wang, "Dictionary Learning Algorithms and Their Applications in Source separation, Speaker Tracking, and Image Denoising," (Organizer: Prof Mark Plumbley), School of Electronic Engineering and Computer Science, Queen Mary University of London, April 25, 2012.
- W. Wang, "Audio and Audio-Visual Source Separation," (Organizer: Dr Xiaorong Shen), School of Automation Science and Electrical Engineering, Beihang University, Beijing, September 20, 2011.
- T. Xu and W. Wang, "Compressive Sensing," (Organizer: Prof. Anthony Ho), Department of Computer Science, University of Surrey, Guildford, January 11, 2010.
- W. Wang, "Multimodal Blind Source Separation for Robot Audition," (Organizer: Dr. Tania Stathaki), MOD University Defence Research Centre Launch & Theme Meeting, Imperial College London, London, November 5, 2009.
- W. Wang, "Two-microphone Speech Separation Based on Convolutive ICA and Ideal Binary Mask Coupled with Cepstral Smoothing," (Organizer: Prof. Francis Rumsey), Institute of Sound Recording (IoSR), University of Surrey, Guildford, October 21, 2008.
- W. Wang, "Convolutive ICA and NMF for Audio Source Separation and Perception," (Organizers: Prof. Vladimir M. Sloutsky & Prof. DeLiang Wang), Center for Cognitive Science, Ohio State University, Columbus, April 11, 2008.
- W. Wang, "Audio Source Separation and Perception," (Organizer: Prof. DeLiang Wang), Perception and Neurodynamics Laboratory (PNL), Department of Computer Science and Engineering, Ohio State University, Columbus, March 07, 2008.
- W. Wang, "Intelligent Data Fusion Based Blind Source Separation," (Organizer: Dr Nathan Wood), Royal Academy of Engineering, London, April 11, 2005.
- W. Wang and J.A. Chambers, "Frequency Domain Blind Source Separation," IEE Seminar on Blind Source Separation in Biomedicine (Organizer: Dr. Christopher J. James), British Institute of Radiology, London, 1 Dec. 2004.
- W. Wang, "Frequency Domain BSS and its Associated Permutation Problem," Contract Researchers Conference at Cardiff School of Engineering (Organizer: Dr. Adrian Porch), Cardiff University, Cardiff, July 16, 2004.
- W. Wang, "Blind Signal Processing and Speech Enhancement," Series Forum for Celebration of the 50th Anniversary of Harbin Engineering University (Organizer: Prof. Yanling Hao), Harbin, Apr. 11, 2003.
Technical Reports/Theses:
- W. Wang and J.A. Chambers, "Blind Signal Processing for Multichannel Speech Enhancement," EPSRC Final Report, Cardiff University, April 2005.
- W. Wang, J.A. Chambers, "Blind Source Separation for Convolutive Mixtures and its Application in Speech Enhancement: An Overview," EPSRC Technical Report, King's College London, Sept. 2002.
- W. Wang, "Information Fusion Theory and Its Application in Integrated Navigation System" (in Chinese with English abstract), Technical Report, Harbin Engineering University, Aug. 2000.
- W. Wang, "Study on Submarine Voyage Training Simulator Based on Virtual Reality," Ph.D. Thesis, Harbin Engineering University, Mar. 2002 (in Chinese with English abstract, Supervisor: Prof. Yanling Hao).
- W. Wang, "Research of Ship Celestial Navigation System and Its Application in ECDIS," M.E. Thesis, Harbin Engineering University, Dec. 1999 (in Chinese with English abstract, Supervisor: Prof. Yanling Hao; (Excellent Thesis Award).
- W. Wang, "Design of Carrier Wave Communication System Based on SCM," Graduate Thesis, Harbin Engineering University, Jul. 1997 (in Chinese with English abstract, Supervisor: Prof. Qidan Zhu).
[Home] [Publications] [Research] [Teaching] [Short Bio] [Demo & Data] [Codes]
Last updated in May 2023
First created in May 2007
|