Chao Weng

2.5k total citations · 2 hit papers
71 papers, 1.4k citations indexed

About

Chao Weng is a scholar working on Artificial Intelligence, Signal Processing, and Computer Vision and Pattern Recognition. According to data from OpenAlex, Chao Weng has authored 71 papers receiving a total of 1.4k indexed citations (citations by other indexed papers that have themselves been cited), including 61 papers in Artificial Intelligence, 58 papers in Signal Processing and 5 papers in Computer Vision and Pattern Recognition. Recurrent topics in Chao Weng's work include Speech Recognition and Synthesis (60 papers), Speech and Audio Processing (54 papers) and Music and Audio Processing (45 papers). Chao Weng is often cited by papers focused on Speech Recognition and Synthesis (60 papers), Speech and Audio Processing (54 papers) and Music and Audio Processing (45 papers). Chao Weng collaborates with scholars based in China, the United States and Hong Kong. Chao Weng's co-authors include Dong Yu, Dan Su, Shinji Watanabe, Chengzhu Yu, Helen Meng, Jianwei Yu, Meng Yu, Biing-Hwang Juang, Yuexian Zou and Dongchao Yang. Chao Weng has published in prestigious journals such as IEEE Signal Processing Letters, Computer Speech & Language and IEEE/ACM Transactions on Audio Speech and Language Processing.

In The Last Decade

Chao Weng

67 papers receiving 1.3k citations

Hit Papers

Diffsound: Discrete Diffusion Model for Text-to-Sound Gen...

Peers

Chao Weng
Comparison fields: 5 of 85
  • Artificial Intelligence 1.0k
  • Signal Processing 992
  • Computer Vision and Pattern Recognition 209
  • Computational Mechanics 88
  • Cognitive Neuroscience 43
Peers and their countries:
  • Ehsan Variani, United States
  • Xiao-Lei Zhang, China
  • Khe Chai Sim, Singapore
  • Tian Tan, China
  • Liang Lu, United States
  • Shigeki Karita, Japan
  • Dimitrios Dimitriadis, United States
  • Tom Ko, China
  • Jahn Heymann, Germany
  • Atsunori Ogawa, Japan
Citations per field, relative to Chao Weng
Chao Weng · 1×
Citations per year, relative to Chao Weng
Chao Weng · 1×

Countries citing papers authored by Chao Weng


This map shows the geographic impact of Chao Weng's research: the number of citations coming from papers published by authors working in each country. The map can also be colored by specialization, which compares the number of citations Chao Weng received from a country with the number expected from that country's size and research output (values larger than one mean the country cites Chao Weng more than expected).

Fields of papers citing papers by Chao Weng

Physical Sciences · Health Sciences · Life Sciences · Social Sciences

This network shows the impact of papers produced by Chao Weng. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Chao Weng. The network helps show where Chao Weng may publish in the future.

Co-authorship network of Chao Weng's co-authors

This figure shows the co-authorship network connecting the top 25 collaborators of Chao Weng. Collaborators are ranked by the total number of citations received by their joint publications with Chao Weng. Edge widths represent the number of papers two authors have co-authored. Node borders indicate the number of papers an author has published with Chao Weng. Chao Weng is excluded from the visualization to improve readability, since they are connected to every node in the network.

All Works

20 of 20 papers shown
| # | Title | Journal | Authors | Indexed citations |
|---|-------|---------|---------|-------------------|
| 1 | InstructTTS: Modelling Expressive TTS in Discrete Latent Space With Natural Language Style Prompt | IEEE/ACM Transactions on Audio Speech and Language Processing | Dongchao Yang, Songxiang Liu et al. | 21 |
| 2 | Opine: Leveraging a Optimization-Inspired Deep Unfolding Method for Multi-Channel Speech Enhancement | | Andong Li, Yu Gu et al. | 0 |
| 3 | VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models | | Haoxin Chen, Yong Zhang et al. | 46 |
| 4 | Sifisinger: A High-Fidelity End-to-End Singing Voice Synthesizer Based on Source-Filter Model | | Jianwei Cui, Yu Gu et al. | 1 |
| 5 | Diffsound: Discrete Diffusion Model for Text-to-Sound Generation | IEEE/ACM Transactions on Audio Speech and Language Processing | Dongchao Yang, Jianwei Yu et al. | 108 |
| 6 | NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS | | Dongchao Yang, Songxiang Liu et al. | 9 |
| 7 | High Fidelity Speech Enhancement with Band-split RNN | | Jianwei Yu, Hangting Chen et al. | 21 |
| 8 | Integrating Lattice-Free MMI Into End-to-End Speech Recognition | IEEE/ACM Transactions on Audio Speech and Language Processing | Jianwei Yu, Chao Weng et al. | 9 |
| 9 | Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition | Computer Speech & Language | Aswin Shanmugam Subramanian, Chao Weng et al. | 59 |
| 10 | Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model | IEEE Signal Processing Letters | Jianwei Yu, Chao Weng et al. | 8 |
| 11 | Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-Based Multi-Modal Context Modeling | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | Chenyi Li, Zhiyong Wu et al. | 13 |
| 12 | GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10,000 Hours of Transcribed Audio | | Guoguo Chen, Wei-Qiang Zhang et al. | 117 |
| 13 | Replay and Synthetic Speech Detection with Res2Net Architecture | | Xu Li, Na Li et al. | 113 |
| 14 | Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition | | Max W. Y. Lam, Jun Wang et al. | 1 |
| 15 | Video Human Action Recognition with Channel Attention on ST-GCN | Journal of Physics Conference Series | Deyuan Zhang, Chao Weng et al. | 1 |
| 16 | Dfsmn-San with Persistent Memory Model for Automatic Speech Recognition | | Zhao You, Dan Su et al. | 6 |
| 17 | Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition | | Chao Weng, Jia Cui et al. | 38 |
| 18 | Deep learning vector quantization for acoustic information retrieval | | Zhen Huang, Chao Weng et al. | 10 |
| 19 | Latent semantic rational kernels for topic spotting on spontaneous conversational speech | | Chao Weng, Biing-Hwang Juang | 2 |
| 20 | Discriminative training using non-uniform criteria for keyword spotting on spontaneous speech | | Chao Weng, Biing-Hwang Juang et al. | 3 |

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.


Rankless by CCL
2026