通信工程学院

个人简介

杭州电子科技大学通信工程学院，特聘副教授，硕士生导师。

上海交通大学电子工程系博士、计算机科学与工程系博士后。研究方向为智能语音信息处理，主要包括：语音增强、语音唤醒、语音识别、语音理解大模型、低码率语音编解码、类脑听觉等。主要成果：在声学与语音信号处理领域国际顶级期刊及会议IEEE TASLP、IEEE SPL、ICASSP、Interspeech等累计发表/录用论文十余篇，授权国家发明专利十余项；主持科技部创新专项课题、浙江省自然科学基金及多项企业横向项目，参与多项国家级科研任务；现任IEEE会员、CCF语音对话与听觉专业委员会执行委员，长期担任IEEE TASLP、ICASSP、Interspeech、IEEE SLT、NCMMSC等期刊和会议的审稿人。

主持项目/课题：

1. 科技部-长三角科技创新项目课题，多任务语音理解大模型，主持，在研

2. 浙江省自然科学基金-探索项目，跨模态知识引导的生成式语音增强方法研究，主持，在研

3. 省属高校基本科研业务费项目，先验知识引导的生成式语音增强技术产业化推广，主持，在研

4. 横向项目，语音感知前沿技术研究，主持，在研

5. 横向项目，智能家居语音指令控制系统研发，主持，在研

6. 横向项目，智能语音交互的语音数据采集系统开发，主持，结题

7. 横向项目，智能语音交互的模型优化与测试系统开发，主持，结题

已发表论文：Google Scholar

● 2024~Now

[1] T. Meng, W. Jiang*, H. Zhang, Y. Zhou, H. Yin, “Neuromorphic Speech Enhancement with Dual-Branch Spiking Neural Networks,” in Proc. Interspeech 2026, Accepted.

[2] J. Li, W. Jiang*, J. Hu, “KFC-KWS: Keyframe Fusion with CTC for User-Defined Keyword Spotting,” in Proc. Interspeech 2026, Accepted.

[3] W. Zhang, W. Jiang*, Y. Zhang, X. Zhou, “Time-Unconditional Generative Speech Enhancement via Autonomous Rectified Flow,” in Proc. Interspeech 2026, Accepted.

[4] W. Jiang, F. Wen and K. Yu, “SelfSE: Self-Supervised Speech Enhancement Via Noisy Speech Refinement,” in IEEE Transactions on Audio, Speech and Language Processing, vol. 34, pp.2115-2127, 2026.

[5] Y. Zhang, W. Jiang*, Z. Wang, K. Wu, W. Zhang, F. Wen, “HyFlowSE: Hybrid End-to-End Flow-Matching Speech Enhancement via Generative-Discriminative Learning,” in proc. ICASSP 2026, pp.16177-16181.

[6] X. Wang, W. Jiang*, J. Wang, Y. You, S. Fang, F. Wen, “Switchcodec: Adaptive Residual-expert Sparse Quantization For High-fidelity Neural Audio Coding,” in proc. ICASSP 2026, pp. 14462-14466.

[7] F. Wen*, W. Wang, Z. Yan, W. Jiang*, “Optimal Transport Based Unsupervised Restoration Learning Exploiting Degradation Sparsity,” in proc. ICASSP 2026, pp.9042-9046.

[8] W. Jiang, F. Wen and K. Yu, “MOS-GAN: Mean Opinion Score GAN for Unsupervised Speech Enhancement”, IEEE Signal Processing Letters, vol. 32, pp. 3465-3469, 2025.

[9] J. Li, W. Jiang*, Y. Tian, and Z. Li. “NC-KWS: Few-Shot Class-Incremental Keyword Spotting Based on Neural Collapse,” Proc. National Conference on Man-Machine Speech Communication, 2025, Springer, Singapore.

[10] 张雯, 江文斌*, 吴开颖, 张杨, 蔡轩昊. 融合相位估计的声码器语音增强算法[J]. 人工智能,2025,(05):46-53.

[11] W. Jiang, K. Yu, and F. Wen, “Unsupervised Speech Enhancement Using Optimal Transport and Speech Presence Probability,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.32, pp.4445 - 4455, 2024.

● 2022~2023

[1] W. Jiang and K. Yu, “Speech Enhancement With Integration of Neural Homomorphic Synthesis and Spectral Masking,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 31, pp. 1758–1770, 2023.

[2] Y. Zhang, W. Jiang*, Q. Zhuo, and K. Yu, “Iterative Noisy-Target Approach: Speech Enhancement Without Clean Speech,” in Proc. National Conference on Man-Machine Speech Communication, 2023, pp. 256-264.

[3] Q. Pan, W. Jiang*, Q. Zhuo, and K. Yu, “A Framework Combining Separate and Joint Training for Neural Vocoder-Based Monaural Speech Enhancement,” in Proc. National Conference on Man-Machine Speech Communication, 2023, pp. 189-202.

[4] W. Jiang, F. Wen, Y. Zhang, and K. Yu, “UnSE: Unsupervised Speech Enhancement using Optimal Transport,” in Proc. Interspeech, 2023, pp. 4029–4033.

[5] W. Jiang, Z. Liu, K. Yu, and F. Wen, “Speech enhancement with neural homomorphic synthesis,” in Proc. ICASSP, 2022, pp. 376–380.

[6] W. Jiang, T. Liu, and K. Yu, “Efficient speech enhancement with neural homomorphic synthesis,” in Proc. Interspeech, 2022, pp. 986–990.

● Before 2020

[1] W. Jiang, F. Wen, and P. Liu, “Robust Beamforming for Speech Recognition Using DNN-Based Time-Frequency Masks Estimation,” IEEE Access, vol. 6, pp. 52385–52392, 2018.

[2] W. Jiang, P. Liu, and F. Wen, “Speech Magnitude Spectrum Reconstruction from MFCCs Using Deep Neural Network,” Chinese Journal of Electronics, vol. 27, no. 2, pp. 393–398, Mar. 2018.

[3] W. Jiang, P. Liu, and F. Wen, “An improved vector quantization method using deep neural network,” AEU - International Journal of Electronics and Communications, vol. 72, pp. 178–183, Feb. 2017.

[4] W. Jiang, R. Ying, and P. Liu, “Noise identification for model-based speech enhancement,” in Proc. ICSP, 2014, pp. 478–483.

[5] W. Jiang, Y. Rendong, and L. Peilin, “Speech reconstruction for MFCC-based low bit-rate speech coding,” in Proc. ICMEW, 2014, pp. 1–6.

[6] W. Jiang, Y. Rendong, and L. Peilin, “A novel speech reconstruction algorithm for DSR back-end,” in International conference on audio, language and image processing, 2014, pp. 367–371.

学生工作：

大学生创新创业训练计划项目（第一指导老师）

1. 一种基于条件流匹配和声码器的语音增强算法，张杨，国家级，2025

2. 面向边缘设备的语音增强与识别一体化小模型，周禹含，省级，2025

3. 一种融合声码器与相位估计的语音增强算法，蔡轩昊，国家级，2024

4. 基于在线无监督学习的可扩展指令识别系统，李锦，国家级，2024

5. 基于神经同态合成的语音增强算法研究，魏婕，校级，2024

6. 一种基于倒谱-余弦域的双阶段实时语音增强算法，张杨，院级，2024

7. 面向边缘设备的语音增强与识别一体化小模型，周禹含，院级，2024

8. 基于神经坍塌的少样本类增量学习语音指令识别研究，詹蝉瑜，院级，2024

教育经历

工作经历

社会职务

研究领域

教学与课程

横向科研

纵向科研

论文

著作

专利成果

软件成果

荣誉及奖励

教职工个人主页

江文斌