Xinhao Mei

655 total citations · 2 hit papers
16 papers, 289 citations indexed

About

Xinhao Mei is a scholar working on Signal Processing, Artificial Intelligence and Computer Vision and Pattern Recognition. According to data from OpenAlex, Xinhao Mei has authored 16 papers receiving a total of 289 indexed citations (citations by other indexed papers that have themselves been cited), including 14 papers in Signal Processing, 9 papers in Artificial Intelligence and 8 papers in Computer Vision and Pattern Recognition. Recurrent topics in Xinhao Mei's work include Music and Audio Processing (14 papers), Speech and Audio Processing (8 papers) and Speech Recognition and Synthesis (5 papers). Xinhao Mei is often cited by papers focused on Music and Audio Processing (14 papers), Speech and Audio Processing (8 papers) and Speech Recognition and Synthesis (5 papers). Xinhao Mei collaborates with scholars based in United Kingdom, China and Türkiye. Xinhao Mei's co-authors include Wenwu Wang, Mark D. Plumbley, Xubo Liu, Haohe Liu, Qiuqiang Kong, Tom Ko, Jinzheng Zhao, Chengqi Zhao, Yuexian Zou and Yuxuan Wang and has published in prestigious journals such as Computers and Electronics in Agriculture, IEEE/ACM Transactions on Audio Speech and Language Processing and EURASIP Journal on Audio Speech and Music Processing.

In The Last Decade

Xinhao Mei

14 papers receiving 278 citations

Hit Papers

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Caption... 2024 2026 2025 2024 2024 20 40 60

Peers

Xinhao Mei
Soham Deshmukh United Kingdom
Mahmoud Al Ismail United States
Emiru Tsunoo United States
Ruibo Fu China
Jan Nouza Czechia
Brendan Shillingford United Kingdom
Xinhao Mei
Citations per year, relative to Xinhao Mei Xinhao Mei (= 1×) peers Xingjian Du

Countries citing papers authored by Xinhao Mei

Since Specialization
Citations

This map shows the geographic impact of Xinhao Mei's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Xinhao Mei with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Xinhao Mei more than expected).

Fields of papers citing papers by Xinhao Mei

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Xinhao Mei. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Xinhao Mei. The network helps show where Xinhao Mei may publish in the future.

Co-authorship network of co-authors of Xinhao Mei

This figure shows the co-authorship network connecting the top 25 collaborators of Xinhao Mei. A scholar is included among the top collaborators of Xinhao Mei based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Xinhao Mei. Xinhao Mei is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

16 of 16 papers shown
1.
Cui, Meng, Tan Wang, Xinhao Mei, et al.. (2025). Enhanced audio-based fish feeding intensity recognition via decomposed visually-guided cross-modality distillation. Computers and Electronics in Agriculture. 239. 111132–111132.
2.
Mei, Xinhao, et al.. (2024). Towards Generating Diverse Audio Captions via Adversarial Training. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 3311–3323.
3.
Mei, Xinhao, Haohe Liu, Qiuqiang Kong, et al.. (2024). WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 3339–3354. 63 indexed citations breakdown →
4.
Mei, Xinhao, et al.. (2024). Foleygen: Visually-Guided Audio Generation. 1–6. 4 indexed citations
5.
Zhu, Qiaoxi, Jian Guan, Haohe Liu, et al.. (2024). First-Shot Unsupervised Anomalous Sound Detection with Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. 1271–1275. 4 indexed citations
6.
Liu, Haohe, Xinhao Mei, Qiuqiang Kong, et al.. (2024). AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 2871–2883. 60 indexed citations breakdown →
7.
Liu, Xubo, Xinhao Mei, Haohe Liu, et al.. (2023). Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. View. 2838–2842. 11 indexed citations
8.
Liu, Xubo, et al.. (2023). Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. View. 4164–4168. 2 indexed citations
9.
Liu, Haohe, et al.. (2023). Simple Pooling Front-Ends for Efficient Audio Classification. View. 1–5. 9 indexed citations
10.
Liu, Haohe, Qiuqiang Kong, Xubo Liu, et al.. (2023). Ontology-aware Learning and Evaluation for Audio Tagging. View. 3799–3803. 2 indexed citations
11.
Mei, Xinhao, et al.. (2022). On Metric Learning for Audio-Text Cross-Modal Retrieval. Interspeech 2022. 4142–4146. 32 indexed citations
12.
Mei, Xinhao, et al.. (2022). Diverse Audio Captioning Via Adversarial Training. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 8882–8886. 15 indexed citations
13.
Liu, Xubo, Xinhao Mei, Jinzheng Zhao, et al.. (2022). Deep Neural Decision Forest for Acoustic Scene Classification. 2022 30th European Signal Processing Conference (EUSIPCO). 772–776. 6 indexed citations
14.
Mei, Xinhao, Xubo Liu, Mark D. Plumbley, & Wenwu Wang. (2022). Automated audio captioning: an overview of recent progress and new challenges. EURASIP Journal on Audio Speech and Music Processing. 2022(1). 27 indexed citations
15.
Liu, Xubo, Haohe Liu, Qiuqiang Kong, et al.. (2022). Separate What You Describe: Language-Queried Audio Source Separation. Interspeech 2022. 1801–1805. 40 indexed citations
16.
Liu, Xubo, Xinhao Mei, Jinzheng Zhao, et al.. (2022). Leveraging Pre-trained BERT for Audio Captioning. 2022 30th European Signal Processing Conference (EUSIPCO). 1145–1149. 14 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026