Retrieval of trending keywords in a peer-to-peer micro-blogging OSN

H. Asthana; Ingemar Cox

Retrieval of trending keywords in a peer-to-peer micro-blogging OSN

1 Citationer (Scopus)

Abstract

We investigate the problem of identifying trending information in a peer-to-peer micro-blogging online social network. In a distributed decentralized environment, the participating nodes do not have access to global statistics such as the frequencies of the keywords and the information creation rate. We propose a two step solution. First, nodes make a local estimate of the frequency of keywords in the network based on their local information. At each iteration a subset of nodes collect this information from a small subset of random nodes in the network and aggregate the results. The most frequently occurring keywords are identified. In the second step, a node requests another small random subset of nodes to identify when, in the recent past, the more frequently occurring keywords were seen in micro-blogs. Once again this information is aggregated the fraction of time within a consecutive period that keywords were encountered is calculated. If this fraction, referred to as the trending fraction, is close to 1, then the keyword is predicted to be trending. A simulation on a network of 10, 000 nodes shows that the solution is capable of detecting multiple trending keywords with a moderate increase in bandwidth.

Originalsprog	Engelsk
Titel	Proceedings of the 22nd ACM international conference on Conference on information knowledge management
Antal sider	4
Publikationsdato	2013
Sider	1229-1232
Status	Udgivet - 2013
Udgivet eksternt	Ja
Begivenhed	ACM International Conference on Information & Knowledge Management - San Francisco, USA Varighed: 27 okt. 2013 → 1 nov. 2013 Konferencens nummer: 22

Konference

Konference	ACM International Conference on Information & Knowledge Management
Nummer	22
Land/Område	USA
By	San Francisco
Periode	27/10/2013 → 01/11/2013

Citationsformater

@inbook{33ab08dae12e4fe593ac5cfdd75c9984,

title = "Retrieval of trending keywords in a peer-to-peer micro-blogging OSN",

abstract = "We investigate the problem of identifying trending information in a peer-to-peer micro-blogging online social network. In a distributed decentralized environment, the participating nodes do not have access to global statistics such as the frequencies of the keywords and the information creation rate. We propose a two step solution. First, nodes make a local estimate of the frequency of keywords in the network based on their local information. At each iteration a subset of nodes collect this information from a small subset of random nodes in the network and aggregate the results. The most frequently occurring keywords are identified. In the second step, a node requests another small random subset of nodes to identify when, in the recent past, the more frequently occurring keywords were seen in micro-blogs. Once again this information is aggregated the fraction of time within a consecutive period that keywords were encountered is calculated. If this fraction, referred to as the trending fraction, is close to 1, then the keyword is predicted to be trending. A simulation on a network of 10, 000 nodes shows that the solution is capable of detecting multiple trending keywords with a moderate increase in bandwidth.",

author = "H. Asthana and Ingemar Cox",

year = "2013",

language = "English",

pages = "1229--1232",

booktitle = "Proceedings of the 22nd ACM international conference on Conference on information knowledge management",

note = "ACM International Conference on Information & Knowledge Management ; Conference date: 27-10-2013 Through 01-11-2013",

}

TY - CHAP

T1 - Retrieval of trending keywords in a peer-to-peer micro-blogging OSN

AU - Asthana, H.

AU - Cox, Ingemar

N1 - Conference code: 22

PY - 2013

Y1 - 2013

N2 - We investigate the problem of identifying trending information in a peer-to-peer micro-blogging online social network. In a distributed decentralized environment, the participating nodes do not have access to global statistics such as the frequencies of the keywords and the information creation rate. We propose a two step solution. First, nodes make a local estimate of the frequency of keywords in the network based on their local information. At each iteration a subset of nodes collect this information from a small subset of random nodes in the network and aggregate the results. The most frequently occurring keywords are identified. In the second step, a node requests another small random subset of nodes to identify when, in the recent past, the more frequently occurring keywords were seen in micro-blogs. Once again this information is aggregated the fraction of time within a consecutive period that keywords were encountered is calculated. If this fraction, referred to as the trending fraction, is close to 1, then the keyword is predicted to be trending. A simulation on a network of 10, 000 nodes shows that the solution is capable of detecting multiple trending keywords with a moderate increase in bandwidth.

AB - We investigate the problem of identifying trending information in a peer-to-peer micro-blogging online social network. In a distributed decentralized environment, the participating nodes do not have access to global statistics such as the frequencies of the keywords and the information creation rate. We propose a two step solution. First, nodes make a local estimate of the frequency of keywords in the network based on their local information. At each iteration a subset of nodes collect this information from a small subset of random nodes in the network and aggregate the results. The most frequently occurring keywords are identified. In the second step, a node requests another small random subset of nodes to identify when, in the recent past, the more frequently occurring keywords were seen in micro-blogs. Once again this information is aggregated the fraction of time within a consecutive period that keywords were encountered is calculated. If this fraction, referred to as the trending fraction, is close to 1, then the keyword is predicted to be trending. A simulation on a network of 10, 000 nodes shows that the solution is capable of detecting multiple trending keywords with a moderate increase in bandwidth.

M3 - Book chapter

SP - 1229

EP - 1232

BT - Proceedings of the 22nd ACM international conference on Conference on information knowledge management

T2 - ACM International Conference on Information & Knowledge Management

Y2 - 27 October 2013 through 1 November 2013

ER -

Retrieval of trending keywords in a peer-to-peer micro-blogging OSN

Abstract

Konference

Fingeraftryk

Citationsformater