Roma Tre University
Engineering Department, Computer Science and Automation
Big Data and Databases Research Group
I joined Roma Tre as post-doc in 2016 and started as assistant professor (ricercatore) in 2019. My research activity is dedicated to the study, development, and application of algorithmic methods for different aspects of data management, including data integration, knowledge discovery and model interpretability for data integration tasks.
I received my Ph.D. in Computer science and Engineering at Sapienza University and I was visiting student at AT&T Labs with a Visitor Grant from Rutgers University.
Last accepted papers:
Recent selected papers:
CIKM 2019, "Shared-Nothing Distributed Enumeration of 2-Plexes", with Alessio Conte, Maurizio Patrignani and Riccardo Torlone
IEEE ICWS 2019, "On Computing Throttling Rate Limits in Web APIs through Statistical Inference", with Francesco Leotta and Massimo Mecella
Full list of papers:
SIGMOD 2018, "Robust Entity Resolution using Random Graphs", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
KDD 2018, "Towards Knowledge Discovery from the Vatican Secret Archives. In Codice Ratio - Episode 1: Machine Transcription of the Manuscript", with Marco Maiorino, Paolo Merialdo and Elena Nieddu
KDD 2017, "Fast Enumeration of Large k-Plexes", with Alessio Conte, Caterina Mordente, Maurizio Patrignani and Riccardo Torlone
VLDB 2016, "Online Entity Resolution Using an Oracle", with Barna Saha and Divesh Srivastava
In Codice Ratio. The project aims at developing novel methods and tools to support content analysis and knowledge discovery from large collections of historical documents. Read the MIT Technology Review article on the project.
Alaska Benchmark. The project aims at building an end-to-end benchmark, designed up-front for dealing with the complexity of different integration tasks.
2019: IEEE ICWS Best Paper Award "On Computing Throttling Rate Limits in Web APIs through Statistical Inference", with Francesco Leotta and Massimo Mecella
2018: SIGMOD Reproducibility Award for the paper "Robust Entity Resolution using Random Graphs", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
(co-advised with Paolo Merialdo)
expected 2023: Tommaso Teofili
expected 2022: Andrea Rossi
expected 2021: Elena Nieddu
Recent conference/workshop organization:
Recent industry/government projects:
Master data management of administrations, with Dipartimento della funzione pubblica. The project aims at providing a unified view over different administration indices, such as IPA and ISTAT, and a collection of tools for semi-automatic data integration and data quality management.
Data extraction from fiscal documents, with LAMBO. The project aimed at prototyping advanced OCR tools for fiscal documents, with high-performance on mobile devices (possibly without Internet access). Read the blog article on our results. (in Italian)
Data managment of fiscal documents, with Mediatica. The project aimed at prototyping indexing and image processing tools for fiscal documents, in order to recognize their main layout features efficiently and speed up their manual processing.