Learning from Massive, Incompletely annotated, and Structured Data

Acronimo: MAESTRA

Data di inizio

2014-02-01

Data di fine

2017-07-31

Capofila/Coordinatore

INSTITUT JOZEF STEFAN (SI)


Abstract

The need for machine learning (ML) and data mining (DM) is ever growing due to the increased pervasiveness of data analysis tasks in almost every area of life, including business, science and technology. Not only is the pervasiveness of data analysis tasks increasing, but so is their complexity. We are increasingly often facing predictive modelling tasks involving one or several of the following complexity aspects: (a)structured data as input or output of the prediction process, (b)very large/massive datasets, with many examples and/or many input/output dimensions, where data may be streaming at high rates, (c)incompletely/partially labelled data, and (d)data placed in a spatio-temporal or network context. Each of these is a major challenge to current ML/DM approaches and is the central topic of active research in areas such as structured-output prediction, mining data streams, semi-supervised learning, and mining network data. The simultaneous presence of several of them is a much harder, currently insurmountable, challenge and severely limits the applicability of ML/DM approaches.The proposed project will develop predictive modelling methods capable of simultaneously addressing several (ultimately all) of the above complexity aspects. In the most complex case, the methods would be able to address massive sets of network data incompletely labelled with structured outputs. We will develop the foundations (basic concepts and notions) for and the methodology (design and implementation of algorithms) of such approaches. We will demonstrate the potential and utility of the methods on showcase problems from a diverse set of application areas (molecular biology, sensor networks, mutimedia, and social networks). Some of these applications, such as relating the composition of microbiota to human health and the design of social media aggregators, have the potential of transformational impact on important aspects of society, such as personalized medicine and social media.


Programma

FP7-ICT

Call

FP7-ICT-2013-X


Partecipanti Pugliesi
Partner Ruolo Contributo UE Referente
Università Degli Studi Di Bari Aldo Moro Partecipante 508,800.00€ Rosaria Lacalamita

Partner Ruolo Paese
Ss. CYRIL AND METHODIUS UNIVERSITY IN SKOPJE Partecipante MK
INESC TEC - INSTITUTO DE ENGENHARIA DE SISTEMAS E COMPUTADORES, TECNOLOGIA E CIENCIA Partecipante PT
RUDER BOSKOVIC INSTITUTE Partecipante HR

Budget Totale

2,293,310.00€

Contributo UE

1,748,295.00€