Gorka Labaka

3.2k total citations
51 papers, 908 citations indexed

About

Gorka Labaka is a scholar working on Artificial Intelligence, Language and Linguistics and Molecular Biology. According to data from OpenAlex, Gorka Labaka has authored 51 papers receiving a total of 908 indexed citations (citations by other indexed papers that have themselves been cited), including 46 papers in Artificial Intelligence, 5 papers in Language and Linguistics and 4 papers in Molecular Biology. Recurrent topics in Gorka Labaka's work include Natural Language Processing Techniques (46 papers), Topic Modeling (37 papers) and Text Readability and Simplification (10 papers). Gorka Labaka is often cited by papers focused on Natural Language Processing Techniques (46 papers), Topic Modeling (37 papers) and Text Readability and Simplification (10 papers). Gorka Labaka collaborates with scholars based in Spain, United States and Qatar. Gorka Labaka's co-authors include Eneko Agirre, Mikel Artetxe, Kepa Sarasola, Adrián Núñez-Marcos, Alberto Barrón‐Cedeño, Paolo Rosso, Arantza Díaz de Ilarraza, Oscar Sainz, Oier López de Lacalle and Iñaki Alegria and has published in prestigious journals such as PLoS ONE, Expert Systems with Applications and Human Reproduction.

In The Last Decade

Gorka Labaka

45 papers receiving 822 citations

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
Gorka Labaka Spain 12 818 142 53 53 45 51 908
Ramy Eskander United States 14 970 1.2× 110 0.8× 95 1.8× 20 0.4× 16 0.4× 30 1.0k
Grzegorz Chrupała Netherlands 13 544 0.7× 149 1.0× 80 1.5× 8 0.2× 26 0.6× 46 646
Natalia Silveira United States 6 901 1.1× 96 0.7× 75 1.4× 7 0.1× 40 0.9× 8 972
Marc Light United States 13 666 0.8× 48 0.3× 109 2.1× 9 0.2× 85 1.9× 26 747
Stephen Roller United States 12 597 0.7× 130 0.9× 54 1.0× 13 0.2× 25 0.6× 16 720
Mohamed Al-Badrashiny United States 11 685 0.8× 94 0.7× 68 1.3× 16 0.3× 12 0.3× 27 730
Maja Popović Germany 18 1.5k 1.9× 277 2.0× 137 2.6× 23 0.4× 63 1.4× 94 1.7k
Beáta Megyesi Sweden 12 479 0.6× 79 0.6× 47 0.9× 7 0.1× 34 0.8× 77 567
Johannes Knopp Germany 3 345 0.4× 46 0.3× 71 1.3× 15 0.3× 28 0.6× 6 441
Katja Filippova United States 17 919 1.1× 117 0.8× 127 2.4× 10 0.2× 29 0.6× 29 1.0k

Countries citing papers authored by Gorka Labaka

Since Specialization
Citations

This map shows the geographic impact of Gorka Labaka's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Gorka Labaka with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Gorka Labaka more than expected).

Fields of papers citing papers by Gorka Labaka

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Gorka Labaka. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Gorka Labaka. The network helps show where Gorka Labaka may publish in the future.

Co-authorship network of co-authors of Gorka Labaka

This figure shows the co-authorship network connecting the top 25 collaborators of Gorka Labaka. A scholar is included among the top collaborators of Gorka Labaka based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Gorka Labaka. Gorka Labaka is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Gamallo, Pablo & Gorka Labaka. (2021). Using Dependency-Based Contextualization for transferring Passive Constructions from English to Spanish. Procesamiento del lenguaje natural. 66(66). 53–64.
2.
Artetxe, Mikel, Gorka Labaka, Noé Casas, & Eneko Agirre. (2020). Do all Roads Lead to Rome? Understanding the Role of Initialization in Iterative Back-Translation. arXiv (Cornell University). 2 indexed citations
3.
Ilarraza, Arantza Díaz de, et al.. (2020). Learning about phraseology from corpora: A linguistically motivated approach for Multiword Expression identification. PLoS ONE. 15(8). e0237767–e0237767. 3 indexed citations
4.
Labaka, Gorka, et al.. (2020). Ixamed’s submission description for WMT20 Biomedical shared task: benefits and limitations of using terminologies for domain adaptation. Empirical Methods in Natural Language Processing. 875–880.
5.
Artetxe, Mikel, Gorka Labaka, & Eneko Agirre. (2019). Unsupervised Neural Machine Translation, a new paradigm solely based on monolingual text. Procesamiento del lenguaje natural. 63. 151–154. 2 indexed citations
6.
Ilarraza, Arantza Díaz de, et al.. (2018). Konbitzul: an MWE-specific database for Spanish-Basque. Language Resources and Evaluation. 1 indexed citations
7.
Agerri, Rodrigo, et al.. (2018). Building Named Entity Recognition Taggers via Parallel Corpora. Language Resources and Evaluation. 7 indexed citations
8.
Artetxe, Mikel, Gorka Labaka, & Eneko Agirre. (2017). Learning bilingual word embeddings with (almost) no bilingual data. Communities in ADDI (University of the Basque Country). 451–462. 237 indexed citations
9.
Labaka, Gorka, Iñaki Alegria, & Kepa Sarasola. (2016). Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation. Language Resources and Evaluation. 2209–2213. 1 indexed citations
10.
Ilarraza, Arantza Díaz de, et al.. (2016). Using linguistic data for English and Spanish verb-noun combination identification. Sussex Research Online (University of Sussex). 857–867. 3 indexed citations
11.
Labaka, Gorka, et al.. (2016). Detecting the central units in two different genres and languages: a preliminary study of Brazilian Portuguese and Basque texts. Procesamiento del lenguaje natural. 56(56). 65–72. 5 indexed citations
12.
Agirre, Eneko, Iñaki Alegria, Mikel Artetxe, et al.. (2015). Lexical semantics, Basque and Spanish in QTLeap: Quality Translation by Deep Language Engineering Approaches. Procesamiento del lenguaje natural. 55(55). 169–172. 2 indexed citations
13.
Labaka, Gorka, et al.. (2014). Comparison of post-editing productivity between professional translators and lay users.. Conference of the Association for Machine Translation in the Americas. 20–33. 18 indexed citations
14.
Hulden, Mans, et al.. (2012). Developing an Open-Source FST Grammar for Verb Chain Transfer in a Spanish-Basque MT System. 65–69. 1 indexed citations
15.
Alegria, Iñaki, et al.. (2011). Matxin-Informatika: versión del traductor Matxin adaptada al dominio de la informática. Procesamiento del lenguaje natural. 47(47). 323–324.
16.
Barrón‐Cedeño, Alberto, Paolo Rosso, Eneko Agirre, & Gorka Labaka. (2010). Plagiarism Detection across Distant Language Pairs. International Conference on Computational Linguistics. 37–45. 71 indexed citations
17.
Alegria, Iñaki, et al.. (2009). Evaluación de un sistema de traducción automática basado en reglas o por qué BLEU sólo sirve para lo que sirve. Procesamiento del lenguaje natural. 43(43). 197–205. 2 indexed citations
18.
Alegria, Iñaki, et al.. (2008). Spanish-to-Basque MultiEngine Machine Translation for a Restricted Domain. Conference of the Association for Machine Translation in the Americas. 37–45. 1 indexed citations
19.
Alegria, Iñaki, et al.. (2008). Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open-source. International Joint Conference on Natural Language Processing. 59–64. 3 indexed citations
20.
Labaka, Gorka, et al.. (2007). Comparing rule-based and data-driven approaches to Spanish-to-Basque machine translation. Arrow@dit (Dublin Institute of Technology). 14 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026