Kaitao Song

9.1k total citations · 2 hit papers
27 papers, 4.5k citations indexed

About

Kaitao Song is a scholar working on Artificial Intelligence, Computer Vision and Pattern Recognition and Signal Processing. According to data from OpenAlex, Kaitao Song has authored 27 papers receiving a total of 4.5k indexed citations (citations by other indexed papers that have themselves been cited), including 21 papers in Artificial Intelligence, 8 papers in Computer Vision and Pattern Recognition and 6 papers in Signal Processing. Recurrent topics in Kaitao Song's work include Natural Language Processing Techniques (10 papers), Topic Modeling (9 papers) and Speech Recognition and Synthesis (7 papers). Kaitao Song is often cited by papers focused on Natural Language Processing Techniques (10 papers), Topic Modeling (9 papers) and Speech Recognition and Synthesis (7 papers). Kaitao Song collaborates with scholars based in China, Hong Kong and United States. Kaitao Song's co-authors include Deng-Ping Fan, Xiang Li, Wenhai Wang, Ding Liang, Ling Shao, Enze Xie, Tong Lü, Ping Luo, Jianfeng Lu and Tao Qin and has published in prestigious journals such as IEEE Transactions on Image Processing, Knowledge-Based Systems and Information Fusion.

In The Last Decade

Kaitao Song

21 papers receiving 4.4k citations

Hit Papers

Pyramid Vision Transformer: A Versatile Backbone for Dens... 2021 2026 2022 2024 2021 2022 1000 2.0k 3.0k

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
Kaitao Song China 11 3.0k 1.2k 853 441 418 27 4.5k
Ding Liang China 15 3.7k 1.2× 1.4k 1.2× 969 1.1× 497 1.1× 501 1.2× 39 5.5k
Tete Xiao United States 8 2.7k 0.9× 1.3k 1.1× 490 0.6× 408 0.9× 433 1.0× 9 5.3k
Chao-Yuan Wu United States 14 3.0k 1.0× 2.1k 1.8× 537 0.6× 549 1.2× 320 0.8× 20 5.8k
Enze Xie China 21 4.8k 1.6× 1.6k 1.4× 1.6k 1.8× 520 1.2× 586 1.4× 32 6.6k
Kai Han China 21 3.1k 1.1× 1.1k 1.0× 652 0.8× 268 0.6× 474 1.1× 48 5.0k
Daquan Zhou China 11 2.8k 1.0× 943 0.8× 740 0.9× 366 0.8× 559 1.3× 29 4.9k
Jun Fu China 11 3.8k 1.3× 1.4k 1.2× 1.2k 1.4× 611 1.4× 329 0.8× 18 5.4k
Yabiao Wang China 18 2.9k 1.0× 1.5k 1.3× 606 0.7× 318 0.7× 222 0.5× 60 4.0k
Chunjing Xu China 23 4.4k 1.5× 1.4k 1.2× 1.2k 1.4× 284 0.6× 588 1.4× 63 6.3k

Countries citing papers authored by Kaitao Song

Since Specialization
Citations

This map shows the geographic impact of Kaitao Song's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Kaitao Song with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Kaitao Song more than expected).

Fields of papers citing papers by Kaitao Song

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Kaitao Song. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Kaitao Song. The network helps show where Kaitao Song may publish in the future.

Co-authorship network of co-authors of Kaitao Song

This figure shows the co-authorship network connecting the top 25 collaborators of Kaitao Song. A scholar is included among the top collaborators of Kaitao Song based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Kaitao Song. Kaitao Song is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Chen, Xueyu, Liang Hu, Qi Zhang, et al.. (2025). Beyond topology-based graph mining: Deep analysis research networks via evolutionary topology and content fusion. Information Fusion. 127. 103922–103922.
2.
Song, Kaitao, Jiangjie Chen, Yongliang Shen, et al.. (2025). EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction. 951–972. 2 indexed citations
3.
4.
Li, Dongsheng, Weiming Lü, Kan Ren, et al.. (2024). TaskBench: Benchmarking Large Language Models for Task Automation. 4540–4574.
5.
Shen, Yongliang, et al.. (2023). DiffusionNER: Boundary Diffusion for Named Entity Recognition. 3875–3890. 34 indexed citations
6.
Song, Kaitao, et al.. (2023). MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models. 246–255. 6 indexed citations
7.
Zou, Yicheng, Kaitao Song, Xu Tan, et al.. (2023). Towards Understanding Omission in Dialogue Summarization. 14268–14286. 1 indexed citations
8.
Lü, Weiming, et al.. (2023). HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face. 38154–38180. 1 indexed citations
9.
Li, Jinchao, Xixin Wu, Kaitao Song, et al.. (2023). A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition. 1–5.
10.
Leng, Yichong, et al.. (2023). SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition. Proceedings of the AAAI Conference on Artificial Intelligence. 37(11). 13034–13042. 6 indexed citations
11.
Song, Kaitao, Huiqiang Jiang, Yuqing Yang, et al.. (2023). End-to-End Word-Level Pronunciation Assessment with MASK Pre-training. 969–973. 1 indexed citations
12.
Wang, Wenhai, Enze Xie, Xiang Li, et al.. (2022). PVT v2: Improved baselines with pyramid vision transformer. Computational Visual Media. 8(3). 415–424. 1259 indexed citations breakdown →
13.
Zhang, Guangyan, Yichong Leng, Ying Qin, et al.. (2022). A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 32. 6087–6091. 1 indexed citations
14.
Zhang, Guangyan, Kaitao Song, Xu Tan, et al.. (2022). Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech. Interspeech 2022. 456–460. 12 indexed citations
15.
Wang, Wenhai, Enze Xie, Xiang Li, et al.. (2021). Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. 2021 IEEE/CVF International Conference on Computer Vision (ICCV). 548–558. 3008 indexed citations breakdown →
16.
Song, Kaitao, Xu Tan, Yi Ren, et al.. (2021). SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint. Proceedings of the AAAI Conference on Artificial Intelligence. 35(15). 13798–13805. 30 indexed citations
17.
Song, Kaitao, Xu Tan, Nevin L. Zhang, et al.. (2021). DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling. Rare & Special e-Zone (The Hong Kong University of Science and Technology). 69–81. 13 indexed citations
18.
Song, Kaitao, Xiu-Shen Wei, Xiangbo Shu, Renjie Song, & Jianfeng Lu. (2020). Bi-Modal Progressive Mask Attention for Fine-Grained Recognition. IEEE Transactions on Image Processing. 29. 7006–7018. 50 indexed citations
19.
Song, Kaitao, Xu Tan, Tao Qin, Jianfeng Lu, & Tie‐Yan Liu. (2020). MPNet: Masked and Permuted Pre-training for Language Understanding. arXiv (Cornell University). 33. 16857–16867. 18 indexed citations
20.
Song, Kaitao, Xu Tan, & Jianfeng Lu. (2020). Neural Machine Translation with Error Correction. 3891–3897. 7 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026