Donatella Firmani
Assistant Professor
Roma Tre University
Engineering Department, Computer Science and Automation
Big Data and Databases Research Group
donatella.firmani@uniroma3.it
Bio Sketch
I joined Roma Tre as a postdoc in 2016 and became an assistant professor (ricercatore) in 2019. My research is dedicated to the study, development, and application of algorithmic methods for different aspects of data management, including data integration, knowledge discovery, and model interpretability for data integration tasks.
I received my Ph.D. in Computer Science and Engineering from Sapienza University, and I was a visiting student at AT&T Labs with a Visitor Grant from Rutgers University.
Publications
Last accepted papers:
-
VLDB Journal 2021, "Efficient and Effective ER with Progressive Blocking", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
-
ACM Trans. on Knowledge Discovery from Data 2020, "Knowledge Graph Embedding for Link Prediction: A Comparative Analysis", with Andrea Rossi, Antonio Matinata, Denilson Barbosa and Paolo Merialdo
Recent selected papers:
-
SIGMOD 2018, "Robust Entity Resolution using Random Graphs", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
-
KDD 2018, "Towards Knowledge Discovery from the Vatican Secret Archives. In Codice Ratio - Episode 1: Machine Transcription of the Manuscript", with Marco Maiorino, Paolo Merialdo and Elena Nieddu
-
KDD 2017, "Fast Enumeration of Large k-Plexes", with Alessio Conte, Caterina Mordente, Maurizio Patrignani and Riccardo Torlone
-
VLDB 2016, "Online Entity Resolution Using an Oracle", with Barna Saha and Divesh Srivastava
Full list of papers:
Research Projects
-
In Codice Ratio. The project aims at developing novel methods and tools to support content analysis and knowledge discovery from large collections of historical documents.
-
Alaska Benchmark. The project aims at building an end-to-end benchmark, designed up-front for dealing with the complexity of different integration tasks.
Awards
-
2019: IEEE ICWS Best Paper Award for the paper "On Computing Throttling Rate Limits in Web APIs through Statistical Inference", with Francesco Leotta and Massimo Mecella
-
2018: SIGMOD Reproducibility Award for the paper "Robust Entity Resolution using Random Graphs", with Sainyam Galhotra, Barna Saha and Divesh Srivastava
Ph.D. Students
(co-advised with Paolo Merialdo)
-
expected 2023: Tommaso Teofili
-
expected 2022: Andrea Rossi
-
expected 2021: Elena Nieddu
Professional
Editorial:
Recent conference/workshop organization:
Recent industry/government projects:
-
Master data management of administrations, with Dipartimento della funzione pubblica. The project aims at providing a unified view over different administration indices, such as IPA and ISTAT, and a collection of tools for semi-automatic data integration and data quality management.
-
Data extraction from fiscal documents, with LAMBO. The project aimed at prototyping advanced OCR tools for fiscal documents, with high performance on mobile devices (possibly without Internet access). Read the blog article on our results. (in Italian)
-
Data management of fiscal documents, with Mediatica. The project aimed at prototyping indexing and image processing tools for fiscal documents, in order to recognize their main layout features efficiently and speed up their manual processing.
Teaching
-
A.A. 2019-20: Modern approaches to Entity Resolution, Ph.D. Course, Roma Tre University.
-
A.A. 2019-20: L’esperienza di “In Codice Ratio”, Master in Management-Promozione-Innovazioni Tecnologiche nella Gestione dei Beni Culturali, Roma Tre University. (in Italian)
-
A.A. 2019-20: I progressi dell’OCR sulle scritture a mano, Seminari di informatica umanistica, Scuola di Alta Formazione A. Varvaro. (in Italian)
-
A.A. 2019-20: Elementi di Informatica, Laurea in Ingegneria Meccanica, Roma Tre University. (in Italian)
-
A.A. 2016-19: Fondamenti di Informatica, Laurea in Ingegneria Elettronica, Roma Tre University. (in Italian)