Xinhao Mei

655 total citations · 2 hit papers
16 papers, 289 citations indexed

About

Xinhao Mei is a scholar working on Signal Processing, Artificial Intelligence and Computer Vision and Pattern Recognition. According to data from OpenAlex, Xinhao Mei has authored 16 papers receiving a total of 289 indexed citations (citations by other indexed papers that have themselves been cited), including 14 papers in Signal Processing, 9 papers in Artificial Intelligence and 8 papers in Computer Vision and Pattern Recognition. Recurrent topics in Xinhao Mei's work include Music and Audio Processing (14 papers), Speech and Audio Processing (8 papers) and Speech Recognition and Synthesis (5 papers). Xinhao Mei is often cited by papers focused on Music and Audio Processing (14 papers), Speech and Audio Processing (8 papers) and Speech Recognition and Synthesis (5 papers). Xinhao Mei collaborates with scholars based in United Kingdom, China and Türkiye. Xinhao Mei's co-authors include Wenwu Wang, Mark D. Plumbley, Xubo Liu, Haohe Liu, Qiuqiang Kong, Tom Ko, Jinzheng Zhao, Chengqi Zhao, Yuexian Zou and Yuxuan Wang and has published in prestigious journals such as Computers and Electronics in Agriculture, IEEE/ACM Transactions on Audio Speech and Language Processing and EURASIP Journal on Audio Speech and Music Processing.

In The Last Decade

Xinhao Mei

14 papers receiving 278 citations

Hit Papers

align trajectories

What are hit papers?

Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the same subfield and year (this is the minimum needed to enter the top 1%, not the average within it), or reaches the top citation threshold in at least one of its specific research topics.

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

2024 63 citations Xinhao Mei, Haohe Liu et al. IEEE/ACM Transactions on Audio Speech and Language Processing profile →
AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining

2024 60 citations Haohe Liu, Xinhao Mei et al. IEEE/ACM Transactions on Audio Speech and Language Processing profile →

Peers

Xinhao Mei

SP

CVPR

AI

LL

MUSIC

Xingjian Du China

Soham Deshmukh United Kingdom

Mahmoud Al Ismail United States

Emiru Tsunoo United States

Chitralekha Gupta Singapore

Ruibo Fu China

Kevin Kilgour Germany

Marc Chemillier France

Jan Nouza Czechia

Brendan Shillingford United Kingdom

Xingjian Du China

Xinhao Mei

175 ×0.8

SP

76 ×0.6

CVPR

101 ×0.8

AI

6 ×0.3

LL

17 ×1.4

MUSIC

Citations per year, relative to Xinhao Mei Xinhao Mei (= 1×) peers Xingjian Du

Countries citing papers authored by Xinhao Mei

Since Specialization

Citations

This map shows the geographic impact of Xinhao Mei's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Xinhao Mei with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Xinhao Mei more than expected).

Fields of papers citing papers by Xinhao Mei

Since Specialization

Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Xinhao Mei. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Xinhao Mei. The network helps show where Xinhao Mei may publish in the future.

Co-authorship network of co-authors of Xinhao Mei

This figure shows the co-authorship network connecting the top 25 collaborators of Xinhao Mei. A scholar is included among the top collaborators of Xinhao Mei based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Xinhao Mei. Xinhao Mei is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

Sort: Min cites: Since: Top N: Style:

16 of 16 papers shown

1.

Cui, Meng, Tan Wang, Xinhao Mei, et al.. (2025). Enhanced audio-based fish feeding intensity recognition via decomposed visually-guided cross-modality distillation. Computers and Electronics in Agriculture. 239. 111132–111132.

2.

Mei, Xinhao, et al.. (2024). Towards Generating Diverse Audio Captions via Adversarial Training. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 3311–3323.

3.

Mei, Xinhao, Haohe Liu, Qiuqiang Kong, et al.. (2024). WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 3339–3354. 63 indexed citations breakdown →

4.

Mei, Xinhao, et al.. (2024). Foleygen: Visually-Guided Audio Generation. 1–6. 4 indexed citations

5.

Zhu, Qiaoxi, Jian Guan, Haohe Liu, et al.. (2024). First-Shot Unsupervised Anomalous Sound Detection with Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. 1271–1275. 4 indexed citations

6.

Liu, Haohe, Xinhao Mei, Qiuqiang Kong, et al.. (2024). AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 2871–2883. 60 indexed citations breakdown →

7.

Liu, Xubo, Xinhao Mei, Haohe Liu, et al.. (2023). Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. View. 2838–2842. 11 indexed citations

8.

Liu, Xubo, et al.. (2023). Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. View. 4164–4168. 2 indexed citations

9.

Liu, Haohe, et al.. (2023). Simple Pooling Front-Ends for Efficient Audio Classification. View. 1–5. 9 indexed citations

10.

Liu, Haohe, Qiuqiang Kong, Xubo Liu, et al.. (2023). Ontology-aware Learning and Evaluation for Audio Tagging. View. 3799–3803. 2 indexed citations

11.

Mei, Xinhao, et al.. (2022). On Metric Learning for Audio-Text Cross-Modal Retrieval. Interspeech 2022. 4142–4146. 32 indexed citations

12.

Mei, Xinhao, et al.. (2022). Diverse Audio Captioning Via Adversarial Training. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 8882–8886. 15 indexed citations

13.

Liu, Xubo, Xinhao Mei, Jinzheng Zhao, et al.. (2022). Deep Neural Decision Forest for Acoustic Scene Classification. 2022 30th European Signal Processing Conference (EUSIPCO). 772–776. 6 indexed citations

14.

Mei, Xinhao, Xubo Liu, Mark D. Plumbley, & Wenwu Wang. (2022). Automated audio captioning: an overview of recent progress and new challenges. EURASIP Journal on Audio Speech and Music Processing. 2022(1). 27 indexed citations

15.

Liu, Xubo, Haohe Liu, Qiuqiang Kong, et al.. (2022). Separate What You Describe: Language-Queried Audio Source Separation. Interspeech 2022. 1801–1805. 40 indexed citations

16.

Liu, Xubo, Xinhao Mei, Jinzheng Zhao, et al.. (2022). Leveraging Pre-trained BERT for Audio Captioning. 2022 30th European Signal Processing Conference (EUSIPCO). 1145–1149. 14 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact