Kenneth Heafield

5.9k citations

56 papers · 2.8k indexed · 3 hit papers · h-index 22

Artificial Intelligence top 0.5%
- Natural Language Processing Techniques 46
- Topic Modeling 45
- Text Readability and Simplification 6
- Algorithms and Data Compression 5
- Speech and dialogue systems 4
- Semantic Web and Ontologies 4
- Speech Recognition and Synthesis 4
Computer Vision and Pattern Recognition top 2%
- Multimodal Machine Learning Applications 12
Information Systems top 2%
Software top 10%
Signal Processing top 5%

Co-authors: Philipp Koehn Jonathan H. Clark Anna Currey Roman Grundkiewicz Marcin Junczys-Dowmunt Antonio Valerio Miceli Barone Santonu Sarkar Alon Lavie
Cited by: Artificial Intelligence Computer Vision and Pattern Recognition Information Systems
Journals: Language Resources and Evaluation (1 paper)Workshop on Statistical Machine Translation (5 papers)North American Chapter of the Association for Computational Linguistics (1 paper)
Partner nations: United Kingdom United States Belgium

In The Last Decade

Kenneth Heafield

53 papers receiving 2.5k citations

Hit Papers

align trajectories log scale

What are hit papers?

Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the same subfield and year (this is the minimum needed to enter the top 1%, not the average within it), or reaches the top citation threshold in at least one of its specific research topics.

2016 Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics
2013 Meeting of the Association for Computational Linguistics
2011 Workshop on Statistical Machine Translation

Peers

Countries citing papers authored by Kenneth Heafield

Since Specialization

Citations

This map shows the geographic impact of Kenneth Heafield's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Kenneth Heafield with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Kenneth Heafield more than expected).

Fields of papers citing papers by Kenneth Heafield

Since Specialization

Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Kenneth Heafield. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Kenneth Heafield. The network helps show where Kenneth Heafield may publish in the future.

Co-authorship network

The 25 scholars most cited alongside Kenneth Heafield, linked wherever they have co-authored with each other. Click a name or a connecting line to browse the papers they share.

Border = papers with Kenneth Heafield Line = papers co-authored together Kenneth Heafield links everyone, so they are left out of the graph.

All Works

Sort: Min cites: Since: Top N: Style:

20 of 20 papers shown

#	Work
1	Document-Level Machine Translation with Large-Scale Public Parallel Corpora David Fuelling,Alexandra Birch,Kenneth Heafield	2024	1
2	ParaCrawl: Web-Scale Acquisition of Parallel Corpora ePrints Soton (University of Southampton) ·Ayumi Houri,Pinzhen Chen,Barry Haddow,Kenneth Heafield,Hieu T. Hoang,Miquel Esplà-Gomis,Mikel L. Forcada,Amir Kamran,Prem Sagar Vasanth Kumar,Philipp Koehn,Mihalea Craciunescu,Chung Seok Oh,Gema Ramírez-Sánchez,Michael W Busch,Heiko Webert,Brian J. Thompson,William Waites,Ranon Altman,Jason Grant Britton	2020	70
3	Speed-optimized, Compact Student Models that Distill Knowledge from a Larger Teacher Model: the UEDIN-CUNI Submission to the WMT 2020 News Translation Task Edinburgh Research Explorer (University of Edinburgh) ·Ulrich Germann,Roman Grundkiewicz,Martin Popel,Norainmuni Hamid,Nikolay Bogoychev,Kenneth Heafield	2020	1
4	Edinburgh’s Submissions to the 2020 Machine Translation Efficiency Task Edinburgh Research Explorer ·Nikolay Bogoychev,Roman Grundkiewicz,Alham Fikri Aji,نسرين مصطو شرفاني,Kenneth Heafield,ANDRÉIA SANGALLI,James T. Nawalaniec,Sutthira Khumkratok	2020	3
5	Losing Heads in the Lottery: Pruning Transformer Attention in Neural Machine Translation نسرين مصطو شرفاني,Kenneth Heafield	2020	28
6	Incorporating Source Syntax into Transformer-Based Neural Machine Translation Edinburgh Research Explorer (University of Edinburgh) ·Anna Currey,Kenneth Heafield	2019	33
7	Neural Grammatical Error Correction Systems with Unsupervised Pre-training on Synthetic Data Edinburgh Research Explorer (University of Edinburgh) ·Roman Grundkiewicz,Marcin Junczys-Dowmunt,Kenneth Heafield	2019	106
8	Voting on N-grams for Machine Translation System Combination Figshare ·Kenneth Heafield,Alon Lavie	2018	1
9	Proceedings of the 54th Annual Meeting of the Association for Computational Linguisticsbreakdown → Kenneth Heafield	2016	589
10	Edinburghâ€™s Phrase-based Machine Translation Systems for WMT-14 Workshop on Statistical Machine Translation ·Nadir Durrani,Barry Haddow,Philipp Koehn,Kenneth Heafield	2014	1
11	N-gram Counts and Language Models from the Common Crawl Language Resources and Evaluation ·Christian Buck,Kenneth Heafield,武雄藤岡	2014	77
12	Scalable Modified Kneser-Ney Language Model Estimationbreakdown → Meeting of the Association for Computational Linguistics ·Kenneth Heafield,M Berta,Jonathan H. Clark,Philipp Koehn	2013	319
13	Grouping Language Model Boundary Words to Speed K--Best Extraction from Hypergraphs North American Chapter of the Association for Computational Linguistics ·Kenneth Heafield,Philipp Koehn,Arnon Lavie	2013	13
14	Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL 2012, July 12-14, 2012, Jeju Island, Korea Kenneth Heafield,Philipp Koehn,Arnon Lavie	2012	27
15	Language Model Rest Costs and Space-Efficient Storage Empirical Methods in Natural Language Processing ·Kenneth Heafield,Philipp Koehn,Alon Lavie	2012	7
16	KenLM: Faster and Smaller Language Model Queriesbreakdown → Workshop on Statistical Machine Translation ·Kenneth Heafield	2011	710
17	CMU System Combination in WMT 2011 Workshop on Statistical Machine Translation ·Kenneth Heafield,Alon Lavie	2011	6
18	Proceedings of the Sixth Workshop on Statistical Machine Translation Kenneth Heafield,Alon Lavie	2011	9
19	Left language model state for syntactic machine translation. Edinburgh Research Explorer (University of Edinburgh) ·Kenneth Heafield,Hieu Hoang,Philipp Koehn,Azza Al Subhi,Marcello Federico	2011	13
20	CMU Multi-Engine Machine Translation for WMT 2010 Workshop on Statistical Machine Translation ·Kenneth Heafield,Alon Lavie	2010	7

About Kenneth Heafield

Kenneth Heafield is a scholar working on Artificial Intelligence, Computer Vision and Pattern Recognition and Software, having authored 56 papers that have together received 2.8k indexed citations. Recurring topics across this work include Natural Language Processing Techniques (46 papers), Topic Modeling (45 papers), Multimodal Machine Learning Applications (12 papers), Text Readability and Simplification (6 papers), Algorithms and Data Compression (5 papers), Speech and dialogue systems (4 papers), Semantic Web and Ontologies (4 papers) and Speech Recognition and Synthesis (4 papers). The work is most often cited by research in Artificial Intelligence (2.5k citations), Computer Vision and Pattern Recognition (550 citations) and Information Systems (311 citations). Kenneth Heafield has collaborated with scholars based in United Kingdom, United States and Belgium. Frequent co-authors include Philipp Koehn, Jonathan H. Clark, Anna Currey, Roman Grundkiewicz, Marcin Junczys-Dowmunt, Antonio Valerio Miceli Barone, Santonu Sarkar, Alon Lavie, Christopher D. Manning and Barry Haddow. Their work appears in journals such as Language Resources and Evaluation, Workshop on Statistical Machine Translation, North American Chapter of the Association for Computational Linguistics, Edinburgh Research Explorer and Edinburgh Research Explorer (University of Edinburgh).

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact