Roger Hsiao

747 total citations
46 papers, 546 citations indexed

About

Roger Hsiao is a scholar working on Artificial Intelligence, Signal Processing and Computer Vision and Pattern Recognition. According to data from OpenAlex, Roger Hsiao has authored 46 papers receiving a total of 546 indexed citations (citations by other indexed papers that have themselves been cited), including 41 papers in Artificial Intelligence, 30 papers in Signal Processing and 6 papers in Computer Vision and Pattern Recognition. Recurrent topics in Roger Hsiao's work include Speech Recognition and Synthesis (34 papers), Speech and Audio Processing (29 papers) and Music and Audio Processing (21 papers). Roger Hsiao is often cited by papers focused on Speech Recognition and Synthesis (34 papers), Speech and Audio Processing (29 papers) and Music and Audio Processing (21 papers). Roger Hsiao collaborates with scholars based in United States, Hong Kong and Czechia. Roger Hsiao's co-authors include Brian Mak, Stavros Tsakalidis, Tim Ng, William M. Hartmann, Tanja Schultz, Damianos Karakos, Richard Schwartz, Le Zhang, Long Nguyen and František Grézl and has published in prestigious journals such as Communications of the ACM, IEEE Signal Processing Letters and IEEE Transactions on Audio Speech and Language Processing.

In The Last Decade

Roger Hsiao

41 papers receiving 443 citations

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
Roger Hsiao United States 15 500 345 49 27 10 46 546
Anton Ragni United Kingdom 17 688 1.4× 405 1.2× 36 0.7× 27 1.0× 13 1.3× 50 739
Suwon Shon United States 13 372 0.7× 278 0.8× 52 1.1× 28 1.0× 13 1.3× 35 478
Bagher BabaAli Iran 8 267 0.5× 197 0.6× 45 0.9× 35 1.3× 8 0.8× 37 366
Yatharth Saraf United States 9 385 0.8× 225 0.7× 57 1.2× 23 0.9× 5 0.5× 17 459
Vitaly Lavrukhin United States 6 312 0.6× 214 0.6× 44 0.9× 15 0.6× 11 1.1× 20 371
Kyu J. Han United States 12 371 0.7× 311 0.9× 44 0.9× 50 1.9× 7 0.7× 38 456
Yuya Unno United States 4 789 1.6× 543 1.6× 65 1.3× 43 1.6× 15 1.5× 5 863
Aaron Lawson United States 11 439 0.9× 413 1.2× 40 0.8× 19 0.7× 3 0.3× 41 500
Sankaran Panchapagesan United States 9 322 0.6× 263 0.8× 38 0.8× 27 1.0× 5 0.5× 17 385
Harald Höge Germany 11 362 0.7× 280 0.8× 49 1.0× 51 1.9× 7 0.7× 52 428

Countries citing papers authored by Roger Hsiao

Since Specialization
Citations

This map shows the geographic impact of Roger Hsiao's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Roger Hsiao with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Roger Hsiao more than expected).

Fields of papers citing papers by Roger Hsiao

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Roger Hsiao. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Roger Hsiao. The network helps show where Roger Hsiao may publish in the future.

Co-authorship network of co-authors of Roger Hsiao

This figure shows the co-authorship network connecting the top 25 collaborators of Roger Hsiao. A scholar is included among the top collaborators of Roger Hsiao based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Roger Hsiao. Roger Hsiao is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Lee, Inhee, Roger Hsiao, Mingyu Yang, et al.. (2024). mSAIL: Milligram-Scale Multi-Modal Sensor Platform for Monarch Butterfly Migration Tracking. Communications of the ACM. 67(6). 93–101. 1 indexed citations
2.
Hsiao, Roger, et al.. (2024). Optimizing Byte-Level Representation For End-To-End ASR. 462–467.
3.
Lee, Inhee, Roger Hsiao, Mingyu Yang, et al.. (2021). mSAIL. 517–530. 7 indexed citations
4.
Yang, Mingyu, Roger Hsiao, Jaechan Lim, et al.. (2020). Migrating Monarch Butterfly Localization Using Multi-Modal Sensor Fusion Neural Networks. 1792–1796. 6 indexed citations
5.
Hsiao, Roger, Tim Ng, & Man-Hung Siu. (2017). Unsupervised adaptation for deep neural networks using Alternating Direction Method of Multipliers. 9. 5180–5184. 1 indexed citations
6.
Hartmann, William M., et al.. (2017). Improved Single System Conversational Telephone Speech Recognition with VGG Bottleneck Features. 112–116. 5 indexed citations
7.
Hartmann, William M., et al.. (2016). Comparison of Multiple System Combination Techniques for Keyword Spotting. 1913–1917. 10 indexed citations
8.
Hsiao, Roger, Jeff Ma, William M. Hartmann, et al.. (2015). Robust speech recognition in unknown reverberant and noisy conditions. 533–538. 26 indexed citations
9.
Karakos, Damianos, Richard Schwartz, Stavros Tsakalidis, et al.. (2013). Score normalization and system combination for improved keyword spotting. 210–215. 77 indexed citations
10.
Hsiao, Roger, Tim Ng, František Grézl, et al.. (2013). Discriminative semi-supervised training for keyword search in low resource languages. 440–445. 19 indexed citations
11.
Hsiao, Roger & Tanja Schultz. (2012). Towards single pass discriminative training for speech recognition. 4093–4096.
12.
Tsakalidis, Stavros, Xiaodan Zhuang, Roger Hsiao, et al.. (2012). Robust event detection from spoken content in consumer domain videos. 2101–2104. 6 indexed citations
13.
Hsiao, Roger & Tanja Schultz. (2011). Generalized Baum-welch algorithm and its implication to a new extended Baum-welch algorithm. 773–776. 9 indexed citations
14.
Hsiao, Roger, Florian Metze, & Tanja Schultz. (2010). Improvements to generalized discriminative feature transformation for speech recognition. Figshare. 1361–1364. 3 indexed citations
15.
Hsiao, Roger, Wilson Tam, & Tanja Schultz. (2009). Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition system. 19. 3769–3772. 9 indexed citations
16.
Bách, Nguyễn, Matthias Eck, Sebastian Stüker, et al.. (2007). The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System. Repository KITopen (Karlsruhe Institute of Technology). 18 indexed citations
17.
Hsiao, Roger, Shajith Ikbal, Qin Jin, et al.. (2006). The ISL TC-STAR Spring 2006 ASR Evaluation Systems. Journal of the Pediatric Infectious Diseases Society. 7(2). 100–103. 15 indexed citations
18.
19.
Mak, Brian, et al.. (2006). Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting. IEEE Transactions on Audio Speech and Language Processing. 14(4). 1267–1280. 16 indexed citations
20.
Mak, Brian & Roger Hsiao. (2004). Improving eigenspace-based MLLR adaptation by kernel PCA. Rare & Special e-Zone (The Hong Kong University of Science and Technology). 13–16. 11 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026