You are here:Home/RICERCA/STRATEGIA E RISULTATI/Risultati e prodotti della ricerca/Progetti finanziati/I progetti di ricerca europei dell’Università di Pisa 2014-2020/Excellent Science/Marie Skłodowska-Curie Actions/Marie Skłodowska-Curie Actions under Horizon 2020/ALPACA

ALPACA

Marie Curie Acions logo UNIPI Team Leader: Prof.ssa Nadia Pisanti, Dipartimento di Informatica

Genomes are strings over the letters A,C,G,T, which represent nucleotides, the building blocks of DNA. In view of ultra-large amounts of genome sequence data emerging from ever more and technologically rapidly advancing genome sequencing devices—in the meantime, amounts of sequencing data accrued are reaching into the exabyte scale—the driving, urgent question is: how can we arrange and analyze these data masses in a formally rigorous, computationally efficient and biomedically rewarding manner?

Graph based data structures have been pointed out to have disruptive benefits over traditional sequence based structures when representing pan-genomes, sufficiently large, evolutionarily coherent collections of genomes. This idea has its immediate justification in the laws of genetics: evolutionarily closely related genomes vary only in relatively little amounts of letters, while sharing the majority of their sequence content. Graph-based pan-genome representations that allow to remove redundancies without having to discard individual differences, make utmost sense. In this project, we will put this shift of paradigms—from sequence to graph based representations of genomes—into full effect. As a result, we can expect a wealth of practically relevant advantages, among which arrangement, analysis, compression, integration and exploitation of genome data are the most fundamental points. In addition, we will also open up a significant source of inspiration for computer science itself.

For realizing our goals, our network will (i) decisively strengthen and form new ties in the emerging community of computational pan-genomics, (ii) perform research on all relevant frontiers, aiming at significant computational advances at the level of important breakthroughs, and (iii) boost relevant knowledge exchange between academia and industry. Last but not least, in doing so, we will train a new, “paradigm-shift-aware” generation of computational genomics researchers.

Coordinator

UNIVERSITAET BIELEFELD, Germany

Participants

CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE CNRS, France
INSTITUT NATIONAL DE RECHERCHE ENINFORMATIQUE ET AUTOMATIQUE, France
UNIVERSITA DI PISA, Italy
UNIVERSITA' DEGLI STUDI DI MILANO-BICOCCA, Italy
STICHTING NEDERLANDSE WETENSCHAPPELIJK ONDERZOEK INSTITUTEN, Netherlands
HEINRICH-HEINE-UNIVERSITAET DUESSELDORF, Germany
EUROPEAN MOLECULAR BIOLOGY LABORATORY, Germany
UNIVERZITA KOMENSKEHO V BRATISLAVE, Slovakia
HELSINGIN YLIOPISTO, Finland
INSTITUT PASTEUR, France
THE CHANCELLOR MASTERS AND SCHOLARSOF THE UNIVERSITY OF CAMBRIDGE, United Kingdom
GENETON S.R.O., Slovakia

Start date 1 January 2021
End date 31 December 2024
Project cost € 3 725 035,20
Project funding € 3 725 035,20
UNIPI quota € 261 499,68
Call title H2020-MSCA-ITN-2020
Funding scheme MSCA-ITN-ETN - European Training Networks
UNIPI role Partner

Ultima modifica: Gio 04 Mag 2023 - 07:42