Jan Hajič
Impact in
- Artificial Intelligence top 0.2%
- Natural Language Processing Techniques
- Topic Modeling
- Text Readability and Simplification
- Speech and dialogue systems
- Semantic Web and Ontologies
- Speech Recognition and Synthesis
- Advanced Text Analysis Techniques
- Language and Linguistics top 2%
Papers in
-
- Natural Language Processing Techniques 93
- Topic Modeling 55
- Semantic Web and Ontologies 18
- Speech and dialogue systems 10
- Text Readability and Simplification 10
- Speech Recognition and Synthesis 8
-
- Lexicography and Language Studies 10
- Co-authors
- Ryan McDonaldMilan StrakaJun’ichi TsujiiJana StrakováKiril RibarovFernando PereiraDaniel ZemanFilip Ginter
- Journals
- Language Resources and Evaluation (23 papers)Artificial Intelligence in Medicine (1 paper)Transactions of the Association for Computational Linguistics (1 paper)International Journal of Lexicography (1 paper)Meta Journal des traducteurs (1 paper)
- Partner nations
- CzechiaUnited StatesSweden
In The Last Decade
Jan Hajič
95 papers receiving 3.3k citations
Hit Papers
Peers
Comparison fields: 5 of 102
- Artificial Intelligence 3.6k
- Language and Linguistics 239
- Computer Vision and Pattern Recognition 273
- Information Systems 247
- Linguistics and Language 42
Countries citing papers authored by Jan Hajič
This map shows the geographic impact of Jan Hajič's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Jan Hajič with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Jan Hajič more than expected).
Fields of papers citing papers by Jan Hajič
This network shows the impact of papers produced by Jan Hajič. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Jan Hajič. The network helps show where Jan Hajič may publish in the future.
Co-authors
The 25 scholars most cited alongside Jan Hajič, linked wherever they have co-authored with each other. Click a name or a connecting line to browse the papers they share.
All Works
| # | Work | ||
|---|---|---|---|
| 1 | 2019 | 175 | |
| 2 | 2019 | 0 | |
| 3 | Creating a Verb Synonym Lexicon Based on a Parallel Corpus | 2018 | 1 |
| 4 | CoNLL 2018 Shared Task : Multilingual Parsing from Raw Text to Universal Dependencies | 2018 | 96 |
| 5 | Diacritics Restoration Using Neural Networks. | 2018 | 16 |
| 6 | Universal Dependencies v1: A Multilingual Treebank Collection Hit paper breakdown → | 2016 | 575 |
| 7 | QTLeap WSD/NED Corpora: Semantic Annotation of Parallel Corpora in Six Languages | 2016 | 7 |
| 8 | UDPipe: Trainable Pipeline for Processing CoNLL-U Files Performing Tokenization, Morphological Analysis, POS Tagging and Parsing | 2016 | 201 |
| 9 | 2014 | 69 | |
| 10 | Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain | 2014 | 2 |
| 11 | Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers Hit paper breakdown → | 2014 | 303 |
| 12 | An Analysis of Annotation of Verb-Noun Idiomatic Combinations in a Parallel Dependency Corpus | 2013 | 4 |
| 13 | HamleDT: To Parse or Not to Parse? | 2012 | 32 |
| 14 | Proceedings of the Thirteenth Conference on Computational Natural Language Learning (CoNLL 2009): Shared Task | 2009 | 4 |
| 15 | Validating the Quality of Full Morphological Annotation. | 2008 | 1 |
| 16 | Issues in annotation of the Czech spontaneous speech corpus in the MALACH project | 2004 | 7 |
| 17 | Annotation Lexicons: Using the Valency Lexicon for Tectogrammatical Annotation. | 2003 | 2 |
| 18 | 1998 | 2 | |
| 19 | Czech language processing, POS tagging. | 1998 | 9 |
| 20 | 1987 | 10 |
About Jan Hajič
Jan Hajič is a scholar working on Artificial Intelligence, Language and Linguistics, General Social Sciences, Linguistics and Language and Information Systems, having authored 110 papers that have together received 3.8k indexed citations. Recurring topics across this work include Natural Language Processing Techniques (93 papers), Topic Modeling (55 papers), Semantic Web and Ontologies (18 papers), Lexicography and Language Studies (10 papers), Speech and dialogue systems (10 papers), Text Readability and Simplification (10 papers), Speech Recognition and Synthesis (8 papers) and Biomedical Text Mining and Ontologies (7 papers). The work is most often cited by research in Artificial Intelligence (3.6k citations), Language and Linguistics (239 citations), Computer Vision and Pattern Recognition (273 citations), Information Systems (247 citations) and Linguistics and Language (42 citations). Jan Hajič has collaborated with scholars based in Czechia, United States and Sweden. Frequent co-authors include Ryan McDonald, Milan Straka, Jun’ichi Tsujii, Jana Straková, Kiril Ribarov, Fernando Pereira, Daniel Zeman, Filip Ginter, Joakim Nivre and Slav Petrov. Their work appears in journals such as Language Resources and Evaluation, Artificial Intelligence in Medicine, Transactions of the Association for Computational Linguistics, International Journal of Lexicography and Meta Journal des traducteurs.
Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.