Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research in this area, listing bubbles remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, the concept of a bubble generator was introduced in [1] as a way to obtain a compact representation of the bubble space of a graph. Although this bubble generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-the-art methods. In this paper we propose a new family of bubble generators that improve substantially on previous work: bubble generators in this new family are about two orders of magnitude faster and still achieve similar precision in identifying AS events. To highlight the practical value of our new bubble generators, we also report some experimental results on real datasets.

Acuña, V., Ishi Soares de Lima, L., Italiano, G., Pepe' Sciarria, L., Sagot, M., Sinaimeri, B. (2021). A family of tree-based generators for bubbles in directed graphs. JOURNAL OF GRAPH ALGORITHMS AND APPLICATIONS, 25(1), 563-580 [10.7155/jgaa.00572].

A family of tree-based generators for bubbles in directed graphs

Ishi Soares de Lima, L; Italiano, GF; Pepe' Sciarria, L
2021-01-01

Published
International relevance
Article
Yes, but type not specified
Sector IINF-05/A - Information Processing Systems
English
Alternative splicing
Bubble generator
Directed graphs
Journal article
Files in this item:
File: 572.pdf
Access: open access
Type: Publisher's version (PDF)
License: Creative Commons
Size: 601.14 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/411763