Chao Weng

2.5k total citations · 2 hit papers
71 papers, 1.4k citations indexed

About

Chao Weng is a scholar working on Artificial Intelligence, Signal Processing and Computer Vision and Pattern Recognition. According to data from OpenAlex, Chao Weng has authored 71 papers receiving a total of 1.4k indexed citations (citations by other indexed papers that have themselves been cited), including 61 papers in Artificial Intelligence, 58 papers in Signal Processing and 5 papers in Computer Vision and Pattern Recognition. Recurrent topics in Chao Weng's work include Speech Recognition and Synthesis (60 papers), Speech and Audio Processing (54 papers) and Music and Audio Processing (45 papers). Chao Weng is often cited by papers focused on Speech Recognition and Synthesis (60 papers), Speech and Audio Processing (54 papers) and Music and Audio Processing (45 papers). Chao Weng collaborates with scholars based in China, United States and Hong Kong. Chao Weng's co-authors include Dong Yu, Dan Su, Shinji Watanabe, Chengzhu Yu, Helen Meng, Jianwei Yu, Meng Yu, Biing-Hwang Juang, Yuexian Zou and Dongchao Yang and has published in prestigious journals such as IEEE Signal Processing Letters, Computer Speech & Language and IEEE/ACM Transactions on Audio Speech and Language Processing.

In The Last Decade

Chao Weng

67 papers receiving 1.3k citations

Hit Papers

align trajectories

What are hit papers?

Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the same subfield and year (this is the minimum needed to enter the top 1%, not the average within it), or reaches the top citation threshold in at least one of its specific research topics.

Diffsound: Discrete Diffusion Model for Text-to-Sound Generation

2023 108 citations Dongchao Yang, Jianwei Yu et al. IEEE/ACM Transactions on Audio Speech and Language Processing profile →
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

2024 46 citations Haoxin Chen, Yong Zhang et al. profile →

Peers

Countries citing papers authored by Chao Weng

Since Specialization

Citations

This map shows the geographic impact of Chao Weng's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Chao Weng with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Chao Weng more than expected).

Fields of papers citing papers by Chao Weng

Since Specialization

Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Chao Weng. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Chao Weng. The network helps show where Chao Weng may publish in the future.

Co-authorship network of co-authors of Chao Weng

This figure shows the co-authorship network connecting the top 25 collaborators of Chao Weng. A scholar is included among the top collaborators of Chao Weng based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Chao Weng. Chao Weng is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

Sort: Min cites: Since: Top N: Style:

20 of 20 papers shown

#	Title	Journal	Authors	Indexed citations
1	InstructTTS: Modelling Expressive TTS in Discrete Latent Space With Natural Language Style Prompt	IEEE/ACM Transactions on Audio Speech and Language Processing	Dongchao Yang, Songxiang Liu et al.	21
2	Opine: Leveraging a Optimization-Inspired Deep Unfolding Method for Multi-Channel Speech Enhancement		Andong Li, Yu Gu et al.	0
3	VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models breakdown →		Haoxin Chen, Yong Zhang et al.	46
4	Sifisinger: A High-Fidelity End-to-End Singing Voice Synthesizer Based on Source-Filter Model		Jianwei Cui, Yu Gu et al.	1
5	Diffsound: Discrete Diffusion Model for Text-to-Sound Generation breakdown →	IEEE/ACM Transactions on Audio Speech and Language Processing	Dongchao Yang, Jianwei Yu et al.	108
6	NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS		Dongchao Yang, Songxiang Liu et al.	9
7	High Fidelity Speech Enhancement with Band-split RNN		Jianwei Yu, Hangting Chen et al.	21
8	Integrating Lattice-Free MMI Into End-to-End Speech Recognition	IEEE/ACM Transactions on Audio Speech and Language Processing	Jianwei Yu, Chao Weng et al.	9
9	Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition	Computer Speech & Language	Aswin Shanmugam Subramanian, Chao Weng et al.	59
10	Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model	IEEE Signal Processing Letters	Jianwei Yu, Chao Weng et al.	8
11	Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-Based Multi-Modal Context Modeling	ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)	Chenyi Li, Zhiyong Wu et al.	13
12	GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10,000 Hours of Transcribed Audio		Guoguo Chen, Wei-Qiang Zhang et al.	117
13	Replay and Synthetic Speech Detection with Res2Net Architecture		Xu Li, Na Li et al.	113
14	Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition		Max W. Y. Lam, Jun Wang et al.	1
15	Video Human Action Recognition with Channel Attention on ST-GCN	Journal of Physics Conference Series	Deyuan Zhang, Chao Weng et al.	1
16	Dfsmn-San with Persistent Memory Model for Automatic Speech Recognition		Zhao You, Dan Su et al.	6
17	Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition		Chao Weng, Jia Cui et al.	38
18	Deep learning vector quantization for acoustic information retrieval		Zhen Huang, Chao Weng et al.	10
19	Latent semantic rational kernels for topic spotting on spontaneous conversational speech		Chao Weng, Biing‐Hwang Juang	2
20	Discriminative training using non-uniform criteria for keyword spotting on spontaneous speech		Chao Weng, Biing‐Hwang Juang et al.	3

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact