Tomaž Erjavec

2.7k citations
141 papers · 1.4k indexed · h-index 18
Topics
Natural Language Processing Techniques (96 papers)Topic Modeling (34 papers)Lexicography and Language Studies (21 papers)
Journals
SHILAP Revista de lepidopterologíaLanguage Resources and EvaluationScience of Computer Programming

In The Last Decade

Tomaž Erjavec

127 papers receiving 1.2k citations

Peers

Tomaž Erjavec
Comparison fields: 5 of 57
  • Artificial Intelligence 1.2k
  • Language and Linguistics 339
  • Information Systems 99
  • Molecular Biology 56
  • Sociology and Political Science 46
Replace Nicoletta Calzolari with:
Nicoletta Calzolari Italy
Pranav Anand United States
Harold Somers United Kingdom
Hans Uszkoreit Germany
Laurent Romary France
Hans van Halteren Netherlands
Lori Levin United States
Josef van Genabith Ireland
Scott Piao United Kingdom
Stefan Evert Germany
Tomaž Erjavec relative to Nicoletta Calzolari Italy Nicoletta Calzolari's profile →
Citations per field
00.5×1.5×
Nicoletta Calzolari · 1×
Citations per year

Countries citing papers authored by Tomaž Erjavec

Since Specialization
Citations

This map shows the geographic impact of Tomaž Erjavec's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Tomaž Erjavec with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Tomaž Erjavec more than expected).

Fields of papers citing papers by Tomaž Erjavec

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Tomaž Erjavec. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Tomaž Erjavec. The network helps show where Tomaž Erjavec may publish in the future.

Co-authorship network of co-authors of Tomaž Erjavec

This figure shows the co-authorship network connecting the top 25 collaborators of Tomaž Erjavec. A scholar is included among the top collaborators of Tomaž Erjavec based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Tomaž Erjavec. Tomaž Erjavec is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
#WorkIndexed citations
1
Gigafida 2.0: The Reference Corpus of Written Standard Slovene
6
2 2
3
The Sloleks Morphological Lexicon and its Future Development
2
4 3
5
Leksikon besednih oblik Sloleks in smernice njegovega razvoja
2
6
Corpus-Based Diacritic Restoration for South Slavic Languages.
9
7
Gold-Standard Datasets for Annotation of Slovene Computer-Mediated Communication.
3
8
Corpus vs. Lexicon Supervision in Morphosyntactic Tagging: the Case of Slovene
15
9
Normalising Slovene data: historical texts vs. user-generated content.
20
10
Text mining platform for NLP workflow design, replication and reuse
2
11
The goo300k corpus of historical Slovene
5
12
OD BIOGRAFSKEGA LEKSIKONA DO ZNANSTVENOKRITIČNE IZDAJE: VPRAŠANJE TRAJNOSTI ELEKTRONSKIH BESEDIL
0
13
Designing and evaluating a Russian tagset
28
14
A Low Cost Approach to Building a Japanese-Slovene Parallel Corpus
2
15
The JOS Morphosyntactically Tagged Corpus of Slovene
8
16
Compiling and Using the IJS-ELAN Parallel Corpus.
3
17
Automatic Sense Tagging Using Parallel Corpora.
18
18
Morphosyntactic Tagging of Slovene: Evaluating Taggers and Tagsets.
17
19
The MULTEXT-East Corpus
27
20
East meets West: Producing Multilingual Resources in a European Context
8

About Tomaž Erjavec

Tomaž Erjavec is a scholar working on Language and Linguistics, Artificial Intelligence and Human-Computer Interaction, having authored 141 papers that have together received 1.4k indexed citations. Recurring topics across this work include Natural Language Processing Techniques (96 papers), Topic Modeling (34 papers) and Lexicography and Language Studies (21 papers). The work is most often cited by research in Artificial Intelligence (1.2k citations), Language and Linguistics (339 citations) and Linguistics and Language (37 citations). Tomaž Erjavec has collaborated with scholars based in Slovenia, Croatia and United States. Frequent co-authors include Dan Tufiş, Darja Fišer, Nikola Ljubešić, Nancy Ide, Bruno Pouliquen, Camelia Ignat, Ralf Steinberger, Sašo Džeroski, Dániel Varga and Simon Krek. Their work appears in journals such as SHILAP Revista de lepidopterología, Language Resources and Evaluation and Science of Computer Programming.

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026