Yashesh Gaur

1.3k total citations
48 papers, 719 citations indexed

About

Yashesh Gaur is a scholar working on Artificial Intelligence, Signal Processing and Computational Mechanics. According to data from OpenAlex, Yashesh Gaur has authored 48 papers receiving a total of 719 indexed citations (citations by other indexed papers that have themselves been cited), including 43 papers in Artificial Intelligence, 32 papers in Signal Processing and 5 papers in Computational Mechanics. Recurrent topics in Yashesh Gaur's work include Speech Recognition and Synthesis (42 papers), Music and Audio Processing (29 papers) and Speech and Audio Processing (26 papers). Yashesh Gaur is often cited by papers focused on Speech Recognition and Synthesis (42 papers), Music and Audio Processing (29 papers) and Speech and Audio Processing (26 papers). Yashesh Gaur collaborates with scholars based in United States, India and Finland. Yashesh Gaur's co-authors include Jinyu Li, Zhong Meng, Naoyuki Kanda, Takuya Yoshioka, Yifan Gong, Xiaofei Wang, Yu Wu, Liang Lu, Shujie Liu and Rui Zhao and has published in prestigious journals such as IEEE/ACM Transactions on Audio Speech and Language Processing, Defence Science Journal and arXiv (Cornell University).

In The Last Decade

Yashesh Gaur

47 papers receiving 660 citations

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
Yashesh Gaur United States 16 655 442 28 21 17 48 719
Hank Liao United States 12 687 1.0× 570 1.3× 83 3.0× 8 0.4× 8 0.5× 18 828
Christian Fügen United States 14 526 0.8× 183 0.4× 85 3.0× 17 0.8× 17 1.0× 36 651
Hideki Kashioka Japan 15 546 0.8× 180 0.4× 64 2.3× 18 0.9× 8 0.5× 97 637
Ching-Feng Yeh Taiwan 13 365 0.6× 227 0.5× 31 1.1× 5 0.2× 7 0.4× 28 463
Christian Fuegen United States 12 691 1.1× 427 1.0× 54 1.9× 7 0.3× 8 0.5× 33 789
Jaesung Huh South Korea 11 425 0.6× 378 0.9× 57 2.0× 32 1.5× 17 1.0× 24 556
Téva Merlin France 7 573 0.9× 548 1.2× 118 4.2× 11 0.5× 12 0.7× 16 706
Torbjørn Svendsen Norway 14 602 0.9× 470 1.1× 73 2.6× 6 0.3× 4 0.2× 73 676
Sylvain Meignier France 14 949 1.4× 871 2.0× 183 6.5× 14 0.7× 14 0.8× 39 1.1k
Thomas Kemp Germany 13 492 0.8× 330 0.7× 121 4.3× 19 0.9× 5 0.3× 26 641

Countries citing papers authored by Yashesh Gaur

Since Specialization
Citations

This map shows the geographic impact of Yashesh Gaur's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Yashesh Gaur with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Yashesh Gaur more than expected).

Fields of papers citing papers by Yashesh Gaur

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Yashesh Gaur. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Yashesh Gaur. The network helps show where Yashesh Gaur may publish in the future.

Co-authorship network of co-authors of Yashesh Gaur

This figure shows the co-authorship network connecting the top 25 collaborators of Yashesh Gaur. A scholar is included among the top collaborators of Yashesh Gaur based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Yashesh Gaur. Yashesh Gaur is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Zhao, Jinzheng, Niko Moritz, Kateřina Žmolíková, et al.. (2025). Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens. 1–5.
2.
Zhou, Long, Ziqiang Zhang, Yu Wu, et al.. (2024). VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation. IEEE/ACM Transactions on Audio Speech and Language Processing. 32. 3709–3716. 4 indexed citations
3.
Gaur, Yashesh, et al.. (2023). Streaming, Fast and Accurate on-Device Inverse Text Normalization for Automatic Speech Recognition. abs/1611. 00068. 237–244. 2 indexed citations
4.
Kanda, Naoyuki, Xiong Xiao, Yashesh Gaur, et al.. (2022). Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers Using End-to-End Speaker-Attributed ASR. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 8082–8086. 18 indexed citations
5.
Kanda, Naoyuki, et al.. (2022). Streaming Multi-Talker ASR with Token-Level Serialized Output Training. Interspeech 2022. 3774–3778. 2 indexed citations
6.
Xue, Jian, Peidong Wang, Jinyu Li, Matt Post, & Yashesh Gaur. (2022). Large-Scale Streaming End-to-End Speech Translation with Neural Transducers. Interspeech 2022. 3263–3267. 14 indexed citations
8.
Kanda, Naoyuki, Zhong Meng, Liang Lu, et al.. (2021). Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR. 6503–6507. 9 indexed citations
9.
Dimitriadis, Dimitrios, Kenichi Kumatani, Yashesh Gaur, et al.. (2021). Ensemble Combination between Different Time Segmentations. 6768–6772. 2 indexed citations
10.
Kanda, Naoyuki, Guoli Ye, Yashesh Gaur, et al.. (2021). End-to-End Speaker-Attributed ASR with Transformer. 4413–4417. 14 indexed citations
11.
Meng, Zhong, Naoyuki Kanda, Yashesh Gaur, et al.. (2021). Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition. 7338–7342. 31 indexed citations
12.
Kanda, Naoyuki, Xuankai Chang, Yashesh Gaur, et al.. (2021). Investigation of End-to-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings. 809–816. 28 indexed citations
13.
Li, Jinyu, Yu Wu, Yashesh Gaur, et al.. (2020). On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition. 1–5. 87 indexed citations
14.
Kanda, Naoyuki, Yashesh Gaur, Xiaofei Wang, Zhong Meng, & Takuya Yoshioka. (2020). Serialized Output Training for End-to-End Overlapped Speech Recognition. 2797–2801. 31 indexed citations
15.
Dimitriadis, Dimitrios, Kenichi Kumatani, Robert Gmyr, Yashesh Gaur, & Şefik Emre Eskimez. (2020). A Federated Approach in Training Acoustic Models. 981–985. 21 indexed citations
16.
Meng, Zhong, Yashesh Gaur, Jinyu Li, & Yifan Gong. (2019). Character-Aware Attention-Based End-to-End Speech Recognition. abs 1612 2695. 949–955. 6 indexed citations
17.
Meng, Zhong, Yashesh Gaur, Jinyu Li, & Yifan Gong. (2019). Speaker Adaptation for Attention-Based End-to-End Speech Recognition. arXiv (Cornell University). 241–245. 28 indexed citations
18.
Gaur, Yashesh, Walter S. Lasecki, Florian Metze, & Jeffrey P. Bigham. (2016). The effects of automatic speech recognition quality on human transcription latency. 1–8. 30 indexed citations
19.
Gaur, Yashesh, et al.. (2012). Performance Comparison of OMP and CoSaMP Based Channel Estimation in AF-TWRN Scenario. 186–190. 5 indexed citations
20.
Verma, P.D.S. & Yashesh Gaur. (1974). Laminar swirling flow in an annulus with porous walls. Proceedings of the Indian Academy of Sciences - Section A. 80(5). 211–222. 1 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026