In response to the need to tackle increasingly complex medical research questions, a growing amount of human health data is being collected, either in routine Electronic Healthcare Record (EHR) databases, through research-driven cohort studies, in biobanks or related efforts. However, data sources are typically fragmented and contain information gaps which prevent their full exploitation. EMIF aim ...