Monopartite projections of bipartite networks are useful tools for modeling indirect interactions in complex systems. The standard approach to identify significant links is statistical validation using a suitable null network model, such as the popular configuration model (CM) that constrains node degrees and randomizes everything else. However different CM formulations exist, depending on how the constraints are imposed and for which sets of nodes. Here we systematically investigate the application of these formulations in validating the same network, showing that they lead to different results even when the same significance threshold is used. Instead a much better agreement is obtained for the same density of validated links. We thus propose a meta-validation approach that allows to identify model-specific significance thresholds for which the signal is strongest, and at the same time to obtain results independent of the way in which the null hypothesis is formulated. We illustrate this procedure using data on scientific production of world countries.The configuration model, in its various formulations, is a widely used null model for statistical validation of bipartite network projections. Here, the authors show that different formulations might bring to very different results, and propose a meta-validation approach that allows to identify model-specific significance thresholds while remaining null-model independent.
Cimini, G., Carra, A., Didomenicantonio, L., Zaccaria, A. (2022). Meta-validation of bipartite network projections. COMMUNICATIONS PHYSICS, 5(1) [10.1038/s42005-022-00856-9].
Meta-validation of bipartite network projections
Cimini G.
;
2022-01-01
Abstract
Monopartite projections of bipartite networks are useful tools for modeling indirect interactions in complex systems. The standard approach to identify significant links is statistical validation using a suitable null network model, such as the popular configuration model (CM) that constrains node degrees and randomizes everything else. However different CM formulations exist, depending on how the constraints are imposed and for which sets of nodes. Here we systematically investigate the application of these formulations in validating the same network, showing that they lead to different results even when the same significance threshold is used. Instead a much better agreement is obtained for the same density of validated links. We thus propose a meta-validation approach that allows to identify model-specific significance thresholds for which the signal is strongest, and at the same time to obtain results independent of the way in which the null hypothesis is formulated. We illustrate this procedure using data on scientific production of world countries.The configuration model, in its various formulations, is a widely used null model for statistical validation of bipartite network projections. Here, the authors show that different formulations might bring to very different results, and propose a meta-validation approach that allows to identify model-specific significance thresholds while remaining null-model independent.File | Dimensione | Formato | |
---|---|---|---|
s42005-022-00856-9.pdf
accesso aperto
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
2.72 MB
Formato
Adobe PDF
|
2.72 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.