Jan Pomikálek

483 citations
20 papers · 291 indexed · h-index 8
Topics
Natural Language Processing Techniques (17 papers)Topic Modeling (6 papers)Lexicography and Language Studies (5 papers)
Journals
Language Resources and EvaluationInstitutional Research Information System (Università degli Studi di Trento)˜The œPrague Bulletin of Mathematical Linguistics
Partner nations
CzechiaIndiaGermany

In The Last Decade

Jan Pomikálek

15 papers receiving 213 citations

Peers

Jan Pomikálek
Comparison fields: 5 of 33
  • Artificial Intelligence 257
  • Language and Linguistics 87
  • Information Systems 67
  • Developmental and Educational Psychology 32
  • Literature and Literary Theory 14
Replace Eckhard Bick with:
Eckhard Bick South Korea
Gil Francopoulo France
Anna Feldman United States
Violeta Seretan Switzerland
Wolfgang Lezius Germany
Elena Volodina Sweden
Keith Suderman United States
Ines Rehbein Germany
Siew Mei Wu Singapore
Johanna Monti Italy
Jan Pomikálek relative to Eckhard Bick South Korea Eckhard Bick's profile →
Citations per field
00.5×3.2×
Eckhard Bick · 1×
Citations per year

Countries citing papers authored by Jan Pomikálek

Since Specialization
Citations

This map shows the geographic impact of Jan Pomikálek's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Jan Pomikálek with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Jan Pomikálek more than expected).

Fields of papers citing papers by Jan Pomikálek

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Jan Pomikálek. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Jan Pomikálek. The network helps show where Jan Pomikálek may publish in the future.

Co-authorship network of co-authors of Jan Pomikálek

This figure shows the co-authorship network connecting the top 25 collaborators of Jan Pomikálek. A scholar is included among the top collaborators of Jan Pomikálek based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Jan Pomikálek. Jan Pomikálek is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
#WorkIndexed citations
1
Flexible Similarity Search of Semantic Vectors Using Fulltext Search Engines
0
2
Text Tokenisation Using unitok
11
3
Domain Specific Corpora from the Web
2
4
Setting Up for Corpus Lexicography
4
5
Building a 70 billion word corpus of English from ClueWeb
17
6
Efficient Web Crawling for Large Text Corpora
41
7
Building a 50M Corpus of Tajik Language
2
8
Practical Web Crawling for Text Corpora
1
9
chared: Character Encoding Detection with a Known Language
0
10
Comparable corpora BootCaT
2
11
Removing Boilerplate and Duplicate Content from Web Corpora
60
12
A Corpus Factory for Many Languages
51
13
Scaling to Billion-plus Word Corpora
19
14
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case
7
15
Detecting Co-Derivative Documents in Large Text Collections
6
16 7
17
Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
0
18
LEMPAS: A Make-Do Lemmatizer for the Swedish PAROLE-Corpus
2
19
WebBootCaT: a Web Tool for Instant Corpora
31
20
WebBootCaT. Instant Domain-Specific Corpora to Support Human Translators
28

About Jan Pomikálek

Jan Pomikálek is a scholar working on Language and Linguistics, Artificial Intelligence and Information Systems, having authored 20 papers that have together received 291 indexed citations. Recurring topics across this work include Natural Language Processing Techniques (17 papers), Topic Modeling (6 papers) and Lexicography and Language Studies (5 papers). The work is most often cited by research in Language and Linguistics (87 citations), Artificial Intelligence (257 citations) and Information Systems (67 citations). Jan Pomikálek has collaborated with scholars based in Czechia, India and Germany. Frequent co-authors include Adam Kilgarriff, Pavel Rychlý, Vít Suchomel, Marco Baroni, Siva Reddy, Miloš Jakubíček, Jan Michelfeit, Silvie Cinková, Diana McCarthy and Kremena Ivanova. Their work appears in journals such as Language Resources and Evaluation, Institutional Research Information System (Università degli Studi di Trento) and ˜The œPrague Bulletin of Mathematical Linguistics.

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026