We perform a preliminary study on large graph efficient indexing using a gap-based compression techniques and different node labelling functions. As baseline we use the Webgraph + LLP labelling function. To index the graph we use three labelling functions: Pagerank, HITS, and Pagerank with random walks choosing restart nodes with HITS authority scores. To compress the graphs we use Varint GB, with and without d-gaps, derived by rank value of the labelling function. Overall, we compare 8 different methods on different datasets composed by the WebGraph eu-2005, uk-2007 05@100000, cnr-2000, and the social networks, enron, ljournal-2008, provided by the Laboratory for Web Algorithmics (LAW)

Cruciani, A., Pasquini, D., Amati, G., Vocca, P. (2019). About graph index compression techniques. In E.D.B. Maristella Agosti (a cura di), IIR 2019: 10th Italian Information Retrieval Workshop: proceedings of the 10th Italian Information Retrieval Workshop (pp. 21-24). Aachen : CEUR.

About graph index compression techniques

Antonio Cruciani;Daniele Pasquini;Giambattista Amati;Vocca P
2019-01-01

Abstract

We perform a preliminary study on large graph efficient indexing using a gap-based compression techniques and different node labelling functions. As baseline we use the Webgraph + LLP labelling function. To index the graph we use three labelling functions: Pagerank, HITS, and Pagerank with random walks choosing restart nodes with HITS authority scores. To compress the graphs we use Varint GB, with and without d-gaps, derived by rank value of the labelling function. Overall, we compare 8 different methods on different datasets composed by the WebGraph eu-2005, uk-2007 05@100000, cnr-2000, and the social networks, enron, ljournal-2008, provided by the Laboratory for Web Algorithmics (LAW)
2019
Settore INFO-01/A - Informatica
English
Rilevanza internazionale
Articolo scientifico in atti di convegno
Graph compression
Webgraph LLP
Varint GB
PageRank
HITS
Cruciani, A., Pasquini, D., Amati, G., Vocca, P. (2019). About graph index compression techniques. In E.D.B. Maristella Agosti (a cura di), IIR 2019: 10th Italian Information Retrieval Workshop: proceedings of the 10th Italian Information Retrieval Workshop (pp. 21-24). Aachen : CEUR.
Cruciani, A; Pasquini, D; Amati, G; Vocca, P
Contributo in libro
File in questo prodotto:
File Dimensione Formato  
IIR2019ACM.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 518.05 kB
Formato Adobe PDF
518.05 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/396805
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact