Comparative study of pagerank and hits algorithms for reciprocal link prediction in online social networks

dc.contributor.authorPallangyo, Brian Somi
dc.date.accessioned2020-09-30T08:33:38Z
dc.date.available2020-09-30T08:33:38Z
dc.date.issued2020
dc.descriptionDissertation (MSc Computer Science)en_US
dc.description.abstractOnline Social Networks (OSN) provides active space for digital human interaction and are used daily. Human engagement is reflected by exploiting the dynamics of OSN, where the fundamental problem is to infer future interactions on the network, called link prediction. Most studies have employed classical algorithms which consider node similarity but neglected the link analysis algorithms which consider topological structure. This study focused on the comparative study of predicting reciprocal interaction from para-social interaction using algorithms. Particularly, this study selected PageRank and HITS, which are considered famous link analysis algorithms with high order heuristics. Network simulation was performed to understand the performance of the algorithms when used to predict reciprocal link formation by employing machine learning techniques. For the experiment, two datasets were used to ensure the reliability of the results. Initially, the publicly available secondary dataset of Twitter was used followed by primary dataset crawled from Mayocoo, both of which are directed networks. The resulting networks from both datasets adhere to power-law distribution. Resource allocation was used as the baseline for the study after outperforming Adamic-Adar, Jaccard Coefficient, and Preferential Attachment. The result of this study showed that both PageRank and HITS surpassed the baseline in performance of prediction. Thus, PageRank has an accuracy improvement of 1.8% with precision and recall of 4.8% and 1.1%, respectively. Furthermore, this improvement comes with a balance of 3% (f1-measure). When HITS is used, there is an improvement accuracy by 5%, with 15.1% (precision), 7.9% (recall) and 11.5% (f1-measure). These empirical results demonstrate that HITS outperforms PageRank in prediction performance. Also, the results from the computational test showed that PageRank uses less computational resources compared to HITS. This study suggests the use of link analysis algorithms over classical algorithms for reciprocal link prediction in OSN. Furthermore, the use of HITS is recommended when prediction performance is vital compared to computational cost, otherwise, PageRank in cases were computational resources are minimal.en_US
dc.identifier.citationPallangyo, B. S. (2020) Comparative study of pagerank and hits algorithms for reciprocal link prediction in online social networks (Master’s dissertation). The University of Dodoma, Dodoma.en_US
dc.identifier.urihttp://hdl.handle.net/20.500.12661/2492
dc.language.isoenen_US
dc.publisherThe University of Dodomaen_US
dc.subjectPageranken_US
dc.subjectHits algorithmsen_US
dc.subjectReciprocal linken_US
dc.subjectPredictionen_US
dc.subjectOnline social networksen_US
dc.subjectSocial networksen_US
dc.subjectOSNen_US
dc.subjectAlgorithmen_US
dc.subjectReciprocal interactionen_US
dc.subjectDigital human interactionsen_US
dc.titleComparative study of pagerank and hits algorithms for reciprocal link prediction in online social networksen_US
dc.typeDissertationen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Pallangyo, Brian.pdf
Size:
2.67 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: