EEMCS-CS-DMB

He received his mastersā€™ degree in computer science at the University of Twente in 1992 and completed his PhD on Formal operation definition in object-oriented databases in 1997. His research targets robustness in data science focusing on two main threats to data science reliability: data quality and undesirable machine learning behaviour. The former is focused on data integration, semi-structured data, natural language processing, and data quality issues involved in these. He co-developed one of the most scalable XML database systems of its time: MonetDB/XQuery. Furthermore, he proposed a data integration approach, called Probabilistic Data Integration, which fundamentally incorporates handling of uncertain and of lesser quality data. He developed a probabilistic database system, called DuBio, which allows the scalable storage, manipulation and management of such uncertain data. On the threat of undesirable machine learning behaviour, he focuses on Explainable AI with the intrinsically explainable deep learning approach ProtoTree as one of the notable results of this. He is secretary of the executive board of the EDBT Association (Extending Database Technology). He is the (co-) author of about 200 publications that accumulated about 2000 citations.

Expertise

  • Computer Science

    • Events
    • Database
    • Models
    • Case Study
    • Data Integration
    • Real World
    • Social Media
    • Machine Learning

Organisations

Publications

2025

Dynamic Predictive Models for Side Effects Following Cancer or Cancer Treatment: A Systematic Review (2025)[Contribution to conference › Abstract] 10th Dutch Biomedical Engineering Conference, BME 2025. Fatime, O. D., Schipper, R., Berendsen, A., Nane, G. F., van Keulen, M., Witteveen, A. & John, A.

2024

Impact of Camera Settings on 3D Reconstruction Quality: Insights from NeRF and Gaussian Splatting (2024)Sensors (Switzerland), 24(23). Article 7594. Rangelov, D., Waanders, S., Waanders, K., van Keulen, M. & Miltchev, R.https://doi.org/10.3390/s24237594MetaLIRS: Meta-learning for Imputation and Regression Selection (2024)In Intelligent Data Engineering and Automated Learning - IDEAL 2024: 25th International Conference, Valencia, Spain, November 20-22, 2024. Proceedings, Part I (pp. 155-166) (Lecture Notes in Computer Science; Vol. 15346). Springer. Baysal Erez, I., Flokstra, J., Poel, M. & van Keulen, M.https://doi.org/10.1007/978-3-031-77731-8_15Dynamic Sparse Training versus Dense Training: The Unexpected Winner in Image Corruption Robustness (2024)[Working paper › Preprint]. ArXiv.org. Wu, B., Xiao, Q., Wang, S., Strisciuglio, N., Pechenizkiy, M., van Keulen, M., Mocanu, D. C. & Mocanu, E.https://doi.org/10.48550/arXiv.2410.03030Insights into Dynamic Sparse Training: Theory Meets Practice (2024)[Contribution to conference › Poster] European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2024. Wu, B., van Keulen, M., Mocanu, D. C. & Mocanu, E.Finding blind spots: Investigating identity data matching in transnational commercialized security infrastructures and beyond (2024)[Thesis › PhD Thesis - Research UT, graduation UT]. University of Twente. Van Rossem, W.https://doi.org/10.3990/1.9789036561778Prototype-Based Interpretable Breast Cancer Prediction Models: Analysis andĀ Challenges (2024)In Explainable Artificial Intelligence - 2nd World Conference, xAI 2024, Proceedings (pp. 21-42) (Communications in Computer and Information Science; Vol. 2153 CCIS). Springer. Pathak, S., Schlötterer, J., Veltman, J., Geerdink, J., van Keulen, M. & Seifert, C.https://doi.org/10.1007/978-3-031-63787-2_2The interaction between imputation and regression models (2024)[Contribution to conference › Poster] 22nd International Conference of AI in Medicine, AIME 2024. Baysal Erez, I., Flokstra, J., Poel, M. & van Keulen, M.Are Large Language Models the New Interface for Data Pipelines? (2024)In Proceedings of the International Workshop on Big Data in Emergent Distributed Environments, BIDEDE 2024, in conjunction with the 2024 ACM SIGMOD/PODS Conference. Article 6 (Proceedings of the International Workshop on Big Data in Emergent Distributed Environments, BIDEDE 2024, in conjunction with the 2024 ACM SIGMOD/PODS Conference). Association for Computing Machinery. Barbon, S., Ceravolo, P., Groppe, S., Jarrar, M., Maghool, S., Sèdes, F., Sahri, S. & Van Keulen, M.https://doi.org/10.1145/3663741.3664785Are Large Language Models the New Interface for Data Pipelines? (2024)[Working paper › Preprint]. ArXiv.org. Junior, S. B., Ceravolo, P., Groppe, S., Jarrar, M., Maghool, S., Sèdes, F., Sahri, S. & van Keulen, M.https://doi.org/10.48550/arXiv.2406.06596

Research profiles

Affiliated study programs

Courses academic year 2024/2025

Courses in the current academic year are added at the moment they are finalised in the Osiris system. Therefore it is possible that the list is not yet complete for the whole academic year.

Courses academic year 2023/2024

Address

University of Twente

Zilverling (building no. 11), room 4061
Hallenweg 19
7522 NH Enschede
Netherlands

Navigate to location

Organisations

Scan the QR code or
Download vCard