Katherine Lee

17.0k total citations · 1 hit paper
13 papers, 375 citations indexed

About

Katherine Lee is a scholar working on Artificial Intelligence, Safety Research and Molecular Biology. According to data from OpenAlex, Katherine Lee has authored 13 papers receiving a total of 375 indexed citations (citations by other indexed papers that have themselves been cited), including 9 papers in Artificial Intelligence, 4 papers in Safety Research and 1 paper in Molecular Biology. Recurrent topics in Katherine Lee's work include Ethics and Social Impacts of AI (4 papers), Topic Modeling (3 papers) and Law, AI, and Intellectual Property (3 papers). Katherine Lee is often cited by papers focused on Ethics and Social Impacts of AI (4 papers), Topic Modeling (3 papers) and Law, AI, and Intellectual Property (3 papers). Katherine Lee collaborates with scholars based in United States, Switzerland and Germany. Katherine Lee's co-authors include Daphne Ippolito, Nicholas Carlini, Chiyuan Zhang, Douglas Eck, Chris Callison-Burch, Florian Tramèr, Reza Shokri, Fatemehsadat Mireshghallah, Ashish Agarwal and David Sussillo and has published in prestigious journals such as Journal of Pain and Symptom Management, Statistics in Biosciences and Repository for Publications and Research Data (ETH Zurich).

In The Last Decade

Katherine Lee

13 papers receiving 348 citations

Hit Papers

Deduplicating Training Data Makes Language Models Better 2022 2026 2023 2024 2022 40 80 120

Peers

Katherine Lee
Comparison fields: 5 of 76
  • Artificial Intelligence 265
  • Information Systems 55
  • Computer Vision and Pattern Recognition 42
  • Health Informatics 42
  • Safety Research 35
Replace Rishi Bommasani with:
Rishi Bommasani United States
Laria Reynolds United States
Ilia Shumailov United Kingdom
Willy Chung Hong Kong
Albert Meroño-Peñuela United Kingdom
Faisal Ladhak United States
Yanai Elazar Israel
Timo Schick Germany
Gaole He Netherlands
Albert Webson United States
Rishi Bommasani United States View profile →
Citations per field, relative to Katherine Lee
Katherine Lee · 1×
Citations per year, relative to Katherine Lee
Katherine Lee · 1×

Countries citing papers authored by Katherine Lee

Since Specialization
Citations

This map shows the geographic impact of Katherine Lee's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Katherine Lee with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Katherine Lee more than expected).

Fields of papers citing papers by Katherine Lee

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Katherine Lee. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Katherine Lee. The network helps show where Katherine Lee may publish in the future.

Co-authorship network of co-authors of Katherine Lee

This figure shows the co-authorship network connecting the top 25 collaborators of Katherine Lee. A scholar is included among the top collaborators of Katherine Lee based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Katherine Lee. Katherine Lee is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

13 of 13 papers shown
# Work Indexed citations
1 3
2 1
3 5
4 11
5 11
6 20
7 1
8 3
9
Deduplicating Training Data Makes Language Models Better breakdown →
147
10 88
11 2
12 19
13
Hallucinations in Neural Machine Translation
64

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026