Anish Das Sarma

47 papers receiving 2.1k citations

Hit Papers

Detecting near-duplicates for web crawling

Peers

Anish Das Sarma
Comparison fields: 5 of 84
  • Artificial Intelligence 1.1k
  • Computer Networks and Communications 1.0k
  • Information Systems 839
  • Signal Processing 756
  • Management Science and Operations Research 670
Peers available for comparison:
Neoklis Polyzotis United States
Nilesh Dalvi United States
Tova Milo Israel
Sai Wu China
Vanja Josifovski United States
Jayant Madhavan United States
Fabian M. Suchanek Germany
Philip Bohannon United States
Wang-Chiew Tan United States
Shuai Ma China

Countries citing papers authored by Anish Das Sarma


This map shows the geographic impact of Anish Das Sarma's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Anish Das Sarma with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Anish Das Sarma more than expected).

Fields of papers citing papers by Anish Das Sarma

Field categories: Physical Sciences, Health Sciences, Life Sciences, Social Sciences

This network shows the impact of papers produced by Anish Das Sarma. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Anish Das Sarma. The network helps show where Anish Das Sarma may publish in the future.

Co-authorship network of co-authors of Anish Das Sarma

This figure shows the co-authorship network connecting the top 25 collaborators of Anish Das Sarma. A scholar is included among the top collaborators of Anish Das Sarma based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Anish Das Sarma. Anish Das Sarma is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
# | Work | Indexed citations
1 | (title missing) | 39
2 | (title missing) | 26
3 | (title missing) | 69
4 | (title missing) | 10
5 | (title missing) | 105
6 | (title missing) | 3
7 | Ibis: A Provenance Manager for Multi-Layer Systems. | 4
8 | (title missing) | 54
9 | (title missing) | 45
10 | (title missing) | 7
11 | (title missing) | 25
12 | Functional Dependency Generation and Applications in Pay-As-You-Go Data Integration Systems. | 28
13 | Discovering Functional Dependencies in Pay-As-You-Go Data Integration Systems | 2
14 | Uncertainty in Data Integration | 5
15 | Schema Design for Uncertain Databases | 6
16 | Detecting near-duplicates for web crawling | 377
17 | Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS | 40
18 | (title missing) | 297
19 | An Introduction to ULDBs and the Trio System | 59
20 | Generic Text Summarization Using WordNet | 21

About Anish Das Sarma

Anish Das Sarma is a scholar working on Management Science and Operations Research, Signal Processing, and Computer Networks and Communications, having authored 47 papers that have together received 2.2k indexed citations. Recurring topics across this work include Advanced Database Systems and Queries (24 papers), Data Quality and Management (20 papers), and Data Management and Algorithms (19 papers). The work is most often cited by research in Signal Processing (756 citations), Management Science and Operations Research (670 citations), and Computer Networks and Communications (1.0k citations). Anish Das Sarma has collaborated with scholars based in the United States, Spain, and the United Kingdom. Frequent co-authors include Alon Halevy, Jennifer Widom, Omar Benjelloun, Arvind Kumar Jain, Gurmeet Singh Manku, Xin Luna Dong, Martin Theobald, Aditya Parameswaran, Héctor García-Molina, and Cong Yu. Their work appears in journals such as Proceedings of the VLDB Endowment, ACM Transactions on Database Systems, and The VLDB Journal.

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.


Rankless by CCL
2026