Tom Ko
Impact in
- Signal Processing top 0.5%
- Speech and Audio Processing
- Music and Audio Processing
- Artificial Intelligence top 0.5%
- Speech Recognition and Synthesis
- Natural Language Processing Techniques
- Topic Modeling
- Speech and dialogue systems
Papers in
-
- Speech and Audio Processing 19
- Music and Audio Processing 13
-
- Speech Recognition and Synthesis 35
- Natural Language Processing Techniques 19
- Topic Modeling 13
- Speech and dialogue systems 7
- Domain Adaptation and Few-Shot Learning 4
- Co-authors
- Daniel PoveySanjeev KhudanpurVijayaditya PeddintiMichael L. SeltzerBrian MakDavid SnyderQing LiLong Zhou
- Journals
- Speech Communication (1 paper)IEEE/ACM Transactions on Audio Speech and Language Processing (1 paper)IEEE Transactions on Audio Speech and Language Processing (1 paper)View (1 paper)PolyU Institutional Research Archive (Hong Kong Polytechnic University) (1 paper)
- Partner nations
- ChinaHong KongUnited States
In The Last Decade
Tom Ko
44 papers receiving 1.6k citations
Hit Papers
Peers
Comparison fields: 5 of 84
- Signal Processing 1.3k
- Artificial Intelligence 1.6k
- Computer Vision and Pattern Recognition 167
- Experimental and Cognitive Psychology 107
- Developmental Biology 8
Countries citing papers authored by Tom Ko
This map shows the geographic impact of Tom Ko's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Tom Ko with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Tom Ko more than expected).
Fields of papers citing papers by Tom Ko
This network shows the impact of papers produced by Tom Ko. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Tom Ko. The network helps show where Tom Ko may publish in the future.
Co-authorship network
The 25 scholars most cited alongside Tom Ko, linked wherever they have co-authored with each other. Click a name or a connecting line to browse the papers they share.
All Works
| # | Work | ||
|---|---|---|---|
| 1 | 2024 | 11 | |
| 2 | 2024 | 1 | |
| 3 | WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research Hit paper breakdown → | 2024 | 63 |
| 4 | 2023 | 13 | |
| 5 | 2023 | 10 | |
| 6 | 2023 | 2 | |
| 7 | 2023 | 2 | |
| 8 | 2023 | 11 | |
| 9 | 2023 | 10 | |
| 10 | 2022 | 70 | |
| 11 | 2022 | 22 | |
| 12 | 2022 | 3 | |
| 13 | 2021 | 9 | |
| 14 | 2021 | 5 | |
| 15 | 2020 | 12 | |
| 16 | 2018 | 149 | |
| 17 | Meta Learning for Few-shot Keyword Spotting. | 2018 | 4 |
| 18 | Audio augmentation for speech recognition Hit paper breakdown → | 2015 | 694 |
| 19 | 2013 | 4 | |
| 20 | 2011 | 6 |
About Tom Ko
Tom Ko is a scholar working on Signal Processing, Artificial Intelligence, Computer Vision and Pattern Recognition, Human-Computer Interaction and Language and Linguistics, having authored 44 papers that have together received 1.8k indexed citations. Recurring topics across this work include Speech Recognition and Synthesis (35 papers), Speech and Audio Processing (19 papers), Natural Language Processing Techniques (19 papers), Music and Audio Processing (13 papers), Topic Modeling (13 papers), Speech and dialogue systems (7 papers), Domain Adaptation and Few-Shot Learning (4 papers) and Multimodal Machine Learning Applications (2 papers). The work is most often cited by research in Signal Processing (1.3k citations), Artificial Intelligence (1.6k citations), Computer Vision and Pattern Recognition (167 citations), Experimental and Cognitive Psychology (107 citations) and Developmental Biology (8 citations). Tom Ko has collaborated with scholars based in China, Hong Kong and United States. Frequent co-authors include Daniel Povey, Sanjeev Khudanpur, Vijayaditya Peddinti, Michael L. Seltzer, Brian Mak, David Snyder, Qing Li, Long Zhou, Vimal Manohar and Wenwu Wang. Their work appears in journals such as Speech Communication, IEEE/ACM Transactions on Audio Speech and Language Processing, IEEE Transactions on Audio Speech and Language Processing, View and PolyU Institutional Research Archive (Hong Kong Polytechnic University).
Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.