Haohe Liu

1.1k total citations · 4 hit papers
35 papers, 442 citations indexed

About

Haohe Liu is a scholar working on Signal Processing, Artificial Intelligence and Computer Vision and Pattern Recognition. According to data from OpenAlex, Haohe Liu has authored 35 papers receiving a total of 442 indexed citations (citations by other indexed papers that have themselves been cited), including 30 papers in Signal Processing, 15 papers in Artificial Intelligence and 10 papers in Computer Vision and Pattern Recognition. Recurrent topics in Haohe Liu's work include Music and Audio Processing (26 papers), Speech and Audio Processing (24 papers) and Speech Recognition and Synthesis (12 papers). Haohe Liu is often cited by papers focused on Music and Audio Processing (26 papers), Speech and Audio Processing (24 papers) and Speech Recognition and Synthesis (12 papers). Haohe Liu collaborates with scholars based in United Kingdom, China and United States. Haohe Liu's co-authors include Wenwu Wang, Mark D. Plumbley, Qiuqiang Kong, Xinhao Mei, Xubo Liu, Jinzheng Zhao, Tom Ko, Yuexian Zou, Chengqi Zhao and Yu-Ping Wang and has published in prestigious journals such as IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Journal of Selected Topics in Signal Processing and Reviews in Aquaculture.

In The Last Decade

Haohe Liu

32 papers receiving 424 citations

Hit Papers

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Caption... 2024 2026 2025 2024 2024 2024 2025 20 40 60

Peers

Haohe Liu
Haohe Liu
Citations per year, relative to Haohe Liu Haohe Liu (= 1×) peers Hervé Glotin

Countries citing papers authored by Haohe Liu

Since Specialization
Citations

This map shows the geographic impact of Haohe Liu's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Haohe Liu with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Haohe Liu more than expected).

Fields of papers citing papers by Haohe Liu

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Haohe Liu. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Haohe Liu. The network helps show where Haohe Liu may publish in the future.

Co-authorship network of co-authors of Haohe Liu

This figure shows the co-authorship network connecting the top 25 collaborators of Haohe Liu. A scholar is included among the top collaborators of Haohe Liu based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Haohe Liu. Haohe Liu is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Cui, Meng, et al.. (2025). Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Survey. Reviews in Aquaculture. 17(1). 21 indexed citations breakdown →
2.
Yuan, Yi, et al.. (2025). FlowSep: Language-Queried Sound Separation with Rectified Flow Matching. 1–5. 1 indexed citations
3.
Liu, Haohe, Meng Cui, Jinhua Liang, et al.. (2025). WavJourney: Compositional Audio Creation With Large Language Models. IEEE Transactions on Audio Speech and Language Processing. 33. 2830–2844.
4.
Das, Rohan Kumar, et al.. (2025). EnvSDD: Benchmarking Environmental Sound Deepfake Detection. 201–205. 1 indexed citations
5.
Zhao, Jinzheng, Yong Xu, Xinyuan Qian, et al.. (2024). Attention-Based End-to-End Differentiable Particle Filter for Audio Speaker Tracking. IEEE Open Journal of Signal Processing. 5. 449–458. 2 indexed citations
6.
Tan, Xu, Jiawei Chen, Haohe Liu, et al.. (2024). NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality. IEEE Transactions on Pattern Analysis and Machine Intelligence. 46(6). 4234–4245. 49 indexed citations breakdown →
7.
Ju, Zeqian, Haohe Liu, Xu Tan, et al.. (2024). FlashSpeech: Efficient Zero-Shot Speech Synthesis. Rare & Special e-Zone (The Hong Kong University of Science and Technology). 6998–7007. 3 indexed citations
8.
Mei, Xinhao, Haohe Liu, Qiuqiang Kong, et al.. (2024). WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 3339–3354. 63 indexed citations breakdown →
9.
Liu, Haohe, et al.. (2024). SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound. IEEE Journal of Selected Topics in Signal Processing. 18(8). 1448–1461. 7 indexed citations
10.
Cui, Meng, Xubo Liu, Haohe Liu, et al.. (2024). Multimodal Fish Feeding Intensity Assessment in Aquaculture. IEEE Transactions on Automation Science and Engineering. 22. 9485–9497. 13 indexed citations
11.
Yuan, Yi, Zhuo Chen, Haohe Liu, et al.. (2024). T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. View. 1–6. 5 indexed citations
12.
Zhao, Jinzheng, Xinyuan Qian, Yong Xu, et al.. (2024). Text-Queried Target Sound Event Localization. 261–265. 2 indexed citations
13.
Liu, Haohe, Xubo Liu, Qiuqiang Kong, Wenwu Wang, & Mark D. Plumbley. (2024). Learning Temporal Resolution in Spectrogram for Audio Classification. Proceedings of the AAAI Conference on Artificial Intelligence. 38(12). 13873–13881. 3 indexed citations
14.
Yuan, Yi, et al.. (2024). Retrieval-Augmented Text-to-Audio Generation. 581–585. 12 indexed citations
15.
Xu, Xuenan, Haohe Liu, Mengyue Wu, Wenwu Wang, & Mark D. Plumbley. (2024). Efficient Audio Captioning with Encoder-Level Knowledge Distillation. View. 1160–1164. 2 indexed citations
16.
Liu, Xubo, Xinhao Mei, Haohe Liu, et al.. (2023). Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. View. 2838–2842. 11 indexed citations
17.
Yuan, Yi, et al.. (2023). Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study. 765–769. 5 indexed citations
18.
Liang, Jinhua, Xubo Liu, Haohe Liu, et al.. (2023). Adapting Language-Audio Models as Few-Shot Audio Learners. Queen Mary Research Online (Queen Mary University of London). 276–280. 12 indexed citations
19.
Liu, Haohe, Woosung Choi, Xubo Liu, et al.. (2022). Neural Vocoder is All You Need for Speech Super-resolution. Interspeech 2022. 4227–4231. 24 indexed citations
20.
Kong, Qiuqiang, Yin Cao, Haohe Liu, Keunwoo Choi, & Yuxuan Wang. (2021). Music Source Separation PyTorch Checkpoints. Zenodo (CERN European Organization for Nuclear Research). 1 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026