Kalin Stefanov

516 total citations
31 papers, 248 citations indexed

About

Kalin Stefanov is a scholar working on Computer Vision and Pattern Recognition, Artificial Intelligence and Human-Computer Interaction. According to data from OpenAlex, Kalin Stefanov has authored 31 papers receiving a total of 248 indexed citations (citations by other indexed papers that have themselves been cited), including 15 papers in Computer Vision and Pattern Recognition, 14 papers in Artificial Intelligence and 10 papers in Human-Computer Interaction. Recurrent topics in Kalin Stefanov's work include Speech and dialogue systems (6 papers), Speech and Audio Processing (6 papers) and Gaze Tracking and Assistive Technology (6 papers). Kalin Stefanov is often cited by papers focused on Speech and dialogue systems (6 papers), Speech and Audio Processing (6 papers) and Gaze Tracking and Assistive Technology (6 papers). Kalin Stefanov collaborates with scholars based in Australia, United States and Sweden. Kalin Stefanov's co-authors include Jonas Beskow, Abhinav Dhall, Giampiero Salvi, Munawar Hayat, Mohammad Soleymani, Shreya Ghosh, Jianfei Cai, Hamid Rezatofighi, Jonathan Gratch and Tom Gedeon and has published in prestigious journals such as IEEE Transactions on Multimedia, Computer Vision and Image Understanding and Language Resources and Evaluation.

In The Last Decade

Kalin Stefanov

30 papers receiving 245 citations

Peers

Kalin Stefanov
Kalin Stefanov
Citations per year, relative to Kalin Stefanov Kalin Stefanov (= 1×) peers Chaoran Liu

Countries citing papers authored by Kalin Stefanov

Since Specialization
Citations

This map shows the geographic impact of Kalin Stefanov's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Kalin Stefanov with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Kalin Stefanov more than expected).

Fields of papers citing papers by Kalin Stefanov

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Kalin Stefanov. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Kalin Stefanov. The network helps show where Kalin Stefanov may publish in the future.

Co-authorship network of co-authors of Kalin Stefanov

This figure shows the co-authorship network connecting the top 25 collaborators of Kalin Stefanov. A scholar is included among the top collaborators of Kalin Stefanov based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Kalin Stefanov. Kalin Stefanov is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Stefanov, Kalin, et al.. (2025). S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. IEEE Transactions on Multimedia. 27. 4321–4332.
2.
Stefanov, Kalin, et al.. (2025). GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction. 7876–7886. 1 indexed citations
3.
Butler, Matthew, et al.. (2025). Enhancing Tactile Learning: A Co-Designed System for Supporting Speech Interaction with Multi-Part 3D Printed Models by Students who are Blind. Monash University Research Portal (Monash University). 1–18. 1 indexed citations
4.
Ghosh, Shreya, et al.. (2024). AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset. Monash University Research Portal (Monash University). 7414–7423. 12 indexed citations
5.
Wong, KokSheik, et al.. (2024). Histohdr-Net: Histogram Equalization for Single LDR to HDR Image Translation. 2730–2736. 1 indexed citations
6.
Dhall, Abhinav, Shreya Ghosh, Munawar Hayat, et al.. (2024). 1M-Deepfakes Detection Challenge. Monash University Research Portal (Monash University). 11355–11359. 5 indexed citations
7.
Ghosh, Shreya, et al.. (2023). Glitch in the matrix: A large scale benchmark for content driven audio–visual forgery detection and localization. Computer Vision and Image Understanding. 236. 103818–103818. 19 indexed citations
8.
Stefanov, Kalin, et al.. (2021). Analysis of Behavior Classification in Motivational Interviewing. PubMed. 2021. 110–115. 7 indexed citations
9.
Stefanov, Kalin, et al.. (2021). Group-Level Focus of Visual Attention for Improved Next Speaker Prediction. Monash University Research Portal (Monash University). 4838–4842. 6 indexed citations
10.
Su, Lei, Kalin Stefanov, & Jonathan Gratch. (2020). Emotion or expressivity? An automated analysis of nonverbal perception in a social dilemma. 242. 544–551. 5 indexed citations
11.
Stefanov, Kalin, et al.. (2020). OpenSense: A Platform for Multimodal Data Acquisition and Behavior Perception. 660–664. 12 indexed citations
12.
Stefanov, Kalin, et al.. (2019). Multimodal Learning for Identifying Opportunities for Empathetic Responses. 95–104. 12 indexed citations
13.
Stefanov, Kalin, et al.. (2019). Towards Digitally-Mediated Sign Language Communication. 286–288. 3 indexed citations
14.
Stefanov, Kalin & Jonas Beskow. (2017). A real-time gesture recognition system for isolated Swedish Sign Language signs. 18–27. 3 indexed citations
15.
Stefanov, Kalin & Jonas Beskow. (2016). A multi-party multi-modal dataset for focus of visual attention in human-human and human-robot interaction. Language Resources and Evaluation. 4440–4444. 9 indexed citations
16.
Stefanov, Kalin & Jonas Beskow. (2016). Gesture Recognition System for Isolated Sign Language Signs. 57–59. 1 indexed citations
17.
Koutsombogera, Maria, Samer Al Moubayed, Bajibabu Bollepalli, et al.. (2014). The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue. Language Resources and Evaluation. 4196–4201. 1 indexed citations
18.
Stefanov, Kalin, et al.. (2014). A Data-driven Approach to Detection of Interruptions in Human-human Conversations. Monash University Research Portal (Monash University). 29–32. 1 indexed citations
19.
Moubayed, Samer Al, Jonas Beskow, Bajibabu Bollepalli, et al.. (2014). Human-robot collaborative tutoring using multiparty multimodal spoken dialogue. 112–113. 2 indexed citations
20.
Moubayed, Samer Al, Gabriel Skantze, Jonas Beskow, Kalin Stefanov, & Joakim Gustafson. (2012). Multimodal multiparty social interaction with the furhat head. 293–294. 6 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026