SlideShare uma empresa Scribd logo
1 de 8
Baixar para ler offline
Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media




            Measuring User Influence in Twitter: The Million Follower Fallacy

      Meeyoung Cha∗                   Hamed Haddadi†              Fabr´cio Benevenuto‡
                                                                      ı                             Krishna P. Gummadi∗
                                 ∗
                                     Max Planck Institute for Software Systems (MPI-SWS), Germany
                                 †
                                     Royal Veterinary College, University of London, United Kingdom
                                     ‡
                                       CS Dept., Federal University of Minas Gerais (UFMG), Brazil



                            Abstract                                   a minority of users, called influentials, excel in persuading
                                                                       others (Rogers 1962). This theory predicts that by target-
     Directed links in social media could represent anything           ing these influentials in the network, one may achieve a
     from intimate friendships to common interests, or even
     a passion for breaking news or celebrity gossip. Such
                                                                       large-scale chain-reaction of influence driven by word-of-
     directed links determine the flow of information and               mouth, with a very small marketing cost (Katz and Lazars-
     hence indicate a user’s influence on others—a concept              feld 1955). A more modern view, in contrast, de-emphasizes
     that is crucial in sociology and viral marketing. In this         the role of influentials. Instead, it posits that the key fac-
     paper, using a large amount of data collected from Twit-          tors determining influence are (i) the interpersonal rela-
     ter, we present an in-depth comparison of three mea-              tionship among ordinary users and (ii) the readiness of a
     sures of influence: indegree, retweets, and mentions.              society to adopt an innovation (Watts and Dodds 2007;
     Based on these measures, we investigate the dynam-                Domingos and Richardson 2001). This modern view of in-
     ics of user influence across topics and time. We make              fluence leads to marketing strategies such as collaborative
     several interesting observations. First, popular users            filtering. These theories, however, are still just theories, be-
     who have high indegree are not necessarily influential
     in terms of spawning retweets or mentions. Second,
                                                                       cause there has been a lack of empirical data that could be
     most influential users can hold significant influence over           used to validate either of them. The recent advent of social
     a variety of topics. Third, influence is not gained spon-          networking sites and the data within such sites now allow
     taneously or accidentally, but through concerted effort           researchers to empirically validate these theories.
     such as limiting tweets to a single topic. We believe that           Moving from theory into practice, we find that there are
     these findings provide new insights for viral marketing            many other unanswered questions about how influence dif-
     and suggest that topological measures such as indegree            fuses through a population and whether it varies across top-
     alone reveals very little about the influence of a user.           ics and time. People have different levels of expertise on
                                                                       various subjects. When it comes to marketing, however, this
                        Introduction                                   fact is generally ignored. Marketing services actively search
                                                                       for potential influencers to promote various items. These
Influence has long been studied in the fields of sociology,              influencers range from “cool” teenagers, local opinion lead-
communication, marketing, and political science (Rogers                ers, all the way to popular public figures. However, the ad-
1962; Katz and Lazarsfeld 1955). The notion of influence                vertised items are often far outside the domain of expertise
plays a vital role in how businesses operate and how a soci-           of these hired individuals. So how effective are these mar-
ety functions—for instance, see observations on how fashion            keting strategies? Can a person’s influence in one area be
spreads (Gladwell 2002) and how people vote (Berry and                 transferred to other areas?
Keller 2003). Studying influence patterns can help us bet-
                                                                          In this paper, we present an empirical analysis of influ-
ter understand why certain trends or innovations are adopted
                                                                       ence patterns in a popular social medium. Using a large
faster than others and how we could help advertisers and
                                                                       amount of data gathered from Twitter, we compare three dif-
marketers design more effective campaigns. Studying influ-
                                                                       ferent measures of influence: indegree, retweets, and men-
ence patterns, however, has been difficult. This is because
                                                                       tions.1 Focusing on different topics, we examine how the
such a study does not lend itself to readily available quan-
                                                                       three types of influential users performed in spreading pop-
tification, and essential components like human choices and
                                                                       ular news topics. We also investigate the dynamics of an
the ways our societies function cannot be reproduced within
                                                                       individual’s influence by topic and over time. Finally, we
the confines of the lab.
                                                                       characterize the precise behaviors that make ordinary indi-
   Nevertheless, there have been important theoretical stud-           viduals gain high influence over a short period of time.
ies on the diffusion of influence, albeit with radically dif-
ferent results. Traditional communication theory states that              1
                                                                          Indegree is the number of people who follow a user; retweets
Copyright c 2010, Association for the Advancement of Artificial         mean the number of times others “forward” a user’s tweet; and
Intelligence (www.aaai.org). All rights reserved.                      mentions mean the number of times others mention a user’s name.



                                                                  10
The Twitter dataset used in this paper consists of 2 billion           In contrast, a more modern view of information flow em-
follow links among 54 million users who produced a total of            phasizes the importance of prevailing culture more than the
1.7 billion tweets. We refer readers to our project webpage            role of influentials. Some researchers have reasoned that
http://twitter.mpi-sws.org/ for a detailed description of the          people in the new information age make choices based on
dataset and our data sharing plan.                                     the opinions of their peers and friends, rather than by influ-
   Our study provides several findings that have direct impli-          entials (Domingos and Richardson 2001). These researchers
cations in the design of social media and viral marketing:             argued that direct marketing through influentials would not
1) Analysis of the three influence measures provides a better           be as profitable as using “network”-based advertising such
   understanding of the different roles users play in social           as collaborative filtering.
   media. Indegree represents popularity of a user; retweets              The traditional influentials theory has also been criticized
   represent the content value of one’s tweets; and mentions           because its information flow process does not take into ac-
   represent the name value of a user. Hence, the top users            count the role of ordinary users. In order to compare the
   based on the three measures have little overlap.                    role of influentials and ordinary users, researchers have de-
                                                                       veloped a series of simulations, in which information flows
2) Our finding on how influence varies across topics could               freely between users, and a user adopts an innovation when
   serve as a useful test for answering how effective adver-           he is influenced by more than a threshold of the sample pop-
   tisement in Twitter would be if one is to employ influential         ulation (Watts and Dodds 2007). Influentials were defined
   users. Our analysis shows that most influential users hold           as those in the top 10% of influence distribution. The simu-
   significant influence over a variety of topics.                       lation showed that influentials initiated more frequent and
3) Ordinary users can gain influence by focusing on a sin-              larger cascades than average users, but they were neither
   gle topic and posting creative and insightful tweets that           necessary nor sufficient for all diffusions, as suggested in
   are perceived as valuable by others, as opposed to simply           the traditional theory. Moreover, in homogeneous networks,
   conversing with others.                                             influentials were no more successful in running long cas-
                                                                       cades than ordinary users. This means that a trend’s success
   These findings provide new insights for viral marketing.
                                                                       depends not on the person who starts it, but on how suscep-
The first finding in particular indicates that indegree alone
                                                                       tible the society is overall to the trend (Watts 2007). In fact,
reveals little about the influence of a user. This has been
                                                                       a trend can be initiated by any one, and if the environment is
coined the million follower fallacy by Avnit (Avnit 2009),
                                                                       right, it will spread. Therefore, Watts dubbed early adopters
who pointed to anecdotal evidence that some users follow
                                                                       or opinion leaders “accidental” influentials.
others simply for etiquette—it’s polite to follow someone
                                                                          The above competing ideas have remained as hypotheses
who’s following you—and many do not read all the broad-
                                                                       for several reasons. First is the lack of data that could be
cast tweets. We have empirically demonstrated that having a
                                                                       used to empirically test them. Although there exist a handful
million followers does not always mean much in the Twitter
                                                                       of empirical studies on word-of-mouth influence (Leskovec,
world. Instead, we claim that it is more influential to have
                                                                       Adamic, and Huberman 2007; Cha, Mislove, and Gummadi
an active audience who retweets or mentions the user.
                                                                       2009), no work has been conducted on the relative order
                                                                       of influence among individuals. A second issue is the va-
            Defining influence on Twitter                                riety of ways that influence has been defined (Watts 2007;
We start by reviewing studies of diffusion of influence and             Goyal, Bonchi, and Lakshmanan 2010). It has been unclear
related work on influence propagation on Twitter.                       what exactly influence means. Finally, decades have passed
                                                                       since the influentials theory appeared. Even if the theory
Background                                                             was reasonably accurate when it was proposed, things have
There are a number of conflicting ideas and theories about              changed and now we have much more variability in the flow
how trends and innovations get adopted and spread.                     of influence. In particular, online communities have become
   The traditional view assumes that a minority of mem-                a significant way we receive new information and influence
bers in a society possess qualities that make them excep-              in such communities needs to be explored. In this paper, we
tionally persuasive in spreading ideas to others. These ex-            investigate the notion of influence using a large amount of
ceptional individuals drive trends on behalf of the major-             data collected from a popular social medium, Twitter.
ity of ordinary people. They are loosely described as be-
ing informed, respected, and well-connected; they are called           Measuring influence on Twitter
the opinion leaders in the two-step flow theory (Katz and               The Merriam-Webster dictionary defines influence as “the
Lazarsfeld 1955), innovators in the diffusion of innovations           power or capacity of causing an effect in indirect or intan-
theory (Rogers 1962), and hubs, connectors, or mavens in               gible ways.” Despite the large number of theories of influ-
other work (Gladwell 2002). The theory of influentials is               ence in sociology, there is no tangible way to measure such
intuitive and compelling. By identifying and convincing a              a force nor is there a concrete definition of what influence
small number of influential individuals, a viral campaign can           means, for instance, in the spread of news.
reach a wide audience at a small cost. The theory spread well             In this paper, we analyze the Twitter network as a news
beyond academia and has been adopted in many marketing                 spreading medium and study the types and degrees of influ-
businesses, e.g., RoperASW and Tremor (Gladwell 2002;                  ence within the network. Focusing on an individual’s po-
Berry and Keller 2003).                                                tential to lead others to engage in a certain act, we highlight



                                                                  11
three “interpersonal” activities on Twitter. First, users in-          us to gather large amounts of data. We used the Twitter API
teract by following updates of people who post interesting             to gather information about a user’s social links and tweets.
tweets. Second, users can pass along interesting pieces of                We launched our crawler in August 2009 for all user IDs
information to their followers. This act is popularly known            ranging from 0 to 80 million. We did not look beyond 80
as retweeting and can typically be identified by the use                million, because no single user in the collected data had a
of RT @username or via @username in tweets. Fi-                        link to a user whose ID was greater than that value. Out of 80
nally, users can respond to (or comment on) other people’s             million possible IDs, we found 54,981,152 in-use accounts,
tweets, which we call mentioning. Mentioning is identified              which were connected to each other by 1,963,263,821 social
by searching for @username in the tweet content, after ex-             links. We gathered information about a user’s follow links
cluding retweets. A tweet that starts with @username is                and all tweets ever posted by each user since the early days
not broadcast to all followers, but only to the replied user. A        of the service. In total, there were 1,755,925,520 tweets.
tweet containing @username in the middle of its text gets              Nearly 8% of all user accounts were set private, so that only
broadcast to all followers. These three activities represent           their friends could view their tweets. We ignore these users
the different types of influence of a person:                           in our analysis. The social link information is based on the
                                                                       final snapshot of the network topology at the time of crawl-
1. Indegree influence, the number of followers of a user,
                                                                       ing and we do not know when the links were formed.
   directly indicates the size of the audience for that user.
                                                                          The network of Twitter users comprises a single dispro-
2. Retweet influence, which we measure through the num-                 portionately large connected component (containing 94.8%
   ber of retweets containing one’s name, indicates the abil-          of users), singletons (5%), and smaller components (0.2%).
   ity of that user to generate content with pass-along value.         The largest component contains 99% of all links and tweets.
3. Mention influence, which we measure through the num-                 Our goal is to explore influence of users, hence we focus on
   ber of mentions containing one’s name, indicates the abil-          the largest component of the network, which is conceptually
   ity of that user to engage others in a conversation.                a single interaction domain for users.
                                                                          Because it is hard to determine influence of users who
                                                                       have few tweets, we borrowed the concept of “active users”
Related work on Twitter                                                from the traditional media research (Levy and Windhal
Several recent efforts have been made to track influence                1985) and focused on those users with some minimum level
on Twitter. The Web Ecology Project tracked 12 popu-                   of activity. We ignored users who had posted fewer than 10
lar Twitter users for a 10-day period and grouped a user’s             tweets during their entire lifetime. We also ignored users
influence into two types: conversation-based and content-               for whom we did not have a valid screen name, because this
based (Leavitt et al. 2009). This work concluded that news             information is crucial in identifying the number of times a
media are better at spreading content, while celebrities are           user was mentioned or retweeted by others. After filtering,
better at simply making conversation. Our work extends                 there were 6,189,636 users, whom we focus on in the re-
their notion of influence and uses extensive data to further            mainder of this paper. To measure the influence of these 6
examine the spatial and temporal dynamics of influence.                 million users, however, we looked into how the entire set of
   More recently, a PageRank-like measure has been pro-                52 million users interacted with these active users.
posed to quantify influence on Twitter (Weng et al. 2010).
The authors found high link reciprocity (72%) from a non-              Methodology for comparing user influence
random sample of 6,748 Singapore-based users, and argued               For each of the 6 million users, we computed the value of
that high reciprocity is indicative of homophily. They then            each influence measure and compared them. Rather than
exploited this fact in computing a user’ influence rank. Our            comparing the values directly, we used the relative order of
study, however, contradicts the observation about high reci-           users’ ranks as a measure of difference. In order to do this,
procity; near-complete data of Twitter shows low reciprocity           we sorted users by each measure, so that the rank of 1 in-
(10%). Thus, we predict that social links on Twitter repre-            dicates the most influential user and increasing rank indi-
sent an influence relationship, rather than homophily. Ac-              cates a less influential user. Users with the same influence
cordingly, we ask what are the different activities on Twitter         value receive the average of the rank amongst them (Buck
that represent influence of a user and to what extent a per-            1980). Once every user is assigned a rank for each influence
son’s influence varies across tweet topic and time.                     measure, we are ready to quantify how a user’s rank varies
                                                                       across different measures and examine what kinds of users
       Characteristics of the top influentials                          are ranked high for a given measure.
                                                                          We used Spearman’s rank correlation coefficient
We describe how we collected the Twitter data and present
the characteristics of the top users based on three influence                                      6    (xi − yi )2
                                                                                         ρ=1−                                   (1)
measures: indegree, retweets, and mentions.                                                          N3 − N
                                                                       as a measure of the strength of the association between two
Dataset                                                                rank sets, where xi and yi are the ranks of users based on
We asked Twitter administrators to allow us to gather data             two different influence measures in a dataset of N users.
from their site at scale. They graciously white-listed the IP          Spearman’s rank correlation coefficient calculation is a non-
address range containing 58 of our servers, which allowed              parametric test; the coefficient assesses how well an arbi-



                                                                  12
trary monotonic function could describe the relationship be-
tween two variables, without making any other assumptions
about the particular nature of the relationship between the
                                                                                                          
variables. Our inclusive and complete dataset guarantees re-
liability of the correlation estimates. The closer ρ is to +1
                                                                                                              
or −1, the stronger the likely correlation. A perfect positive
correlation is +1 and a perfect negative correlation is −1.

Comparing three measures of user influence                                              



                                                                                                      
To see what kinds of users are the most influential, we visited                                              
  
the Twitter pages of the top-20 users based on each measure.
The top influentials The most followed users span a wide                 Figure 1: Venn diagram of the top-100 influentials across
variety of public figures and news sources. They were                    measures: The chart is normalized so that the total is 100%.
news sources (CNN, New York Times), politicians (Barack
Obama), athletes (Shaquille O’Neal), as well as celebrities             lap, as shown in Figure 1, indicating that the three measures
like actors, writers, musicians, and models (Ashton Kutcher,            capture different types of influence.
Britney Spears). As the list suggests, indegree measure is
useful when we want to identify users who get lots of at-               Relative influence ranks In order to investigate how the
tention from their audience through one-on-one interactions,            three measures correlate, we compared the relative influence
i.e., the audience is directly connected to influentials.                ranks of all 6 million users (Table 1). We see a moderately
                                                                        high correlation (above 0.5) across all pairs. However, the
   The most retweeted users were content aggregation ser-
                                                                        high correlation appears to be an artifact of the tied ranks
vices (Mashable, TwitterTips, TweetMeme), businessmen
                                                                        among the least influential users, e.g., many of the least con-
(Guy Kawasaki), and news sites (The New York Times, The
                                                                        nected users also received zero retweet and mention. To
Onion). They are trackers of trending topic and knowledge-
                                                                        avoid this bias, we focused on the set of relatively popu-
able people in different fields, whom other users decide to
                                                                        lar users. We considered users in the top 10th and 1st per-
retweet. Unlike indegree, retweets represent influence of a
                                                                        centiles based on indegree, in the hope that users who get
user beyond one’s one-to-one interaction domain; popular
                                                                        retweeted or mentioned must have some followers.
tweets could propagate multiple hops away from the source
before they are retweeted throughout the network. Further-                   Table 1: Spearman’s rank correlation coefficients
more, because of the tight connection between users as sug-               Correlation                All     Top 10% Top 1%
gested in the triadic closure (Granovetter 1973), retweeting              Indegree vs retweets      0.549      0.122      0.109
in a social network can serve as a powerful tool to reinforce             Indegree vs mentions 0.638           0.286      0.309
a message—for instance, the probability of adopting an in-                Retweets vs mentions 0.580           0.638      0.605
novation increases when not one but a group of users repeat
the same message (Watts and Dodds 2007).
                                                                           After this filtering step, the top users showed a strong
   The most mentioned users were mostly celebrities. Ordi-
                                                                        correlation in their retweet influence and mention influence.
nary users showed a great passion for celebrities, regularly
                                                                        Sampling the top users based on retweets or mentions leads
posting messages to them or mentioning them, without nec-
                                                                        to similar results. This means that, in general, users who get
essarily retweeting their posts. This indicates that celebrities
                                                                        mentioned often also get retweeted often, and vice versa. In-
are often in the center of public attention and celebrity gos-
                                                                        degree, however, was not related to the other measures. We
sip is a popular activity among Twitter users.
                                                                        conclude that the most connected users are not necessarily
   If retweets represent a citation of another user’s con-
                                                                        the most influential when it comes to engaging one’s audi-
tent, mentions represent a public response to another user’s
                                                                        ence in conversations and having one’s messages spread.
tweet—the focus of a tweet is on content for retweets, while
the focus is on the replied user for mentions. This can be              Discussion of methodology Normalizing retweets and
confirmed from the usage of conventions in tweets: 92% of                mentions by total tweets would yield a different measure
tweets that had a RT or via marker contained a URL and                  of influence, which might have led to very different results.
97% of them also contained the @username field. This                     When we tried normalizing the data, we identified local
means that retweets are about the content (indicated by the             opinion leaders as the most influential. However, normal-
embedded URL) and that people typically cite the authen-                ization failed to rank users with the highest sheer number of
tic source when they retweet. However, fewer than 30% of                retweets as influential. Therefore, in this paper, we use the
tweets that were classified as mentions contained any URL,               sheer number of retweets and mentions without normalizing
indicating that a mention is more identity-driven.                      these values by the total tweets of a user.
   Across all three measures, the top influentials were gener-              Other measures such as the number of tweets and out-
ally recognizable public figures and websites. Interestingly,            degree (i.e., the number of people a user follows) were not
we saw marginal overlap in these three top lists. These top-            found to be useful, because they identified robots and spam-
20 lists only had 2 users in common: Ashton Kutcher and                 mers as the most influential, respectively. Therefore, we do
Puff Daddy. The top-100 lists also showed marginal over-                not use these measures.



                                                                   13
Table 2: Summary information of the three major topics events studied
          Topic        Period            Keywords                              Users          Tweets                                 Audience
          Iran         Jun 11—Aug 10 #iranelection, names of politicians 302,130          1,482,038                                 22,177,836
          Swine        May 3—July 2      Mexico flu, H1N1, swine               239,329        495,825                                20,977,793
          Jackson      Jun 25—Aug 24 Michael Jackson, #mj                     610,213     1,418,356                                 23,550,211


   Finally, we calculated indegree based on the snapshot of               from diverse genres. These properties make this group ideal
the network at the time of crawling in 2009, because we                   for studying how a user’s influence varies across vastly dif-
do not know the time when each follow link was formed.                    ferent topic genres.
For the calculation of retweets and mentions, however, we
used longitudinal data (i.e., since the beginning of the Twit-            Distribution of the influence ranks
ter service in 2006). This difference could have resulted in              To get a measure of influence for a given topic, we count
a weaker correlation between indegree and the other influ-                 only the retweets and mentions a user spawned on the given
ence measures (Table 1). Nevertheless, nearly three quarters              topic. Because indegree is invariant across topics, we do not
of Twitter users joined in 2009, suggesting that the effect on            use the measure in this analysis.
the correlation would have been minimal.                                     Before investigating the dynamics of influence across top-
                                                                          ics, we first need to understand the high-level characteris-
  Does influence hold across different topics?                             tics of user influence, such as the degree to which users’
Having defined three different measures of influence on                     influence can differ. Figure 2 displays the influence ranks
Twitter, we now investigate whether a user’s influence varies              of users based on retweets and mentions across the Iran,
by topic genres. In order to investigate this, we first need to            Swine, and Jackson topics. The influence ranks are calcu-
find users who tweeted about a diverse set of topics.                      lated for each topic and a user’s rank may differ for different
                                                                          topics. The plots show a straight line on a log-log plot, a
Methodology for identifying target topics                                 property that is referred to as the power-law characteristic.
                                                                          The power-law pattern is indicative of the fact that users’
To find as many users to monitor as possible, we picked three
                                                                          degree of influence can differ by orders of magnitude: the
of the most popular topics in 2009 that were considered en-
                                                                          top influentials were retweeted or mentioned disproportion-
gaging and revolutionary in Twitter2 : the Iranian presiden-
                                                                          ately more times than the majority of users. This suggests
tial election, the outbreak of the H1N1 influenza, and the
                                                                          that utilizing top influentials has a great potential payoff in
death of Michael Jackson. While all three topics were popu-
                                                                          marketing strategy.
lar, they span political, health, and social genres. To extract
tweets relevant to these events, we first identified the set of
keywords describing the topics by consulting news websites                                           10000
and informed individuals. Using the selected keywords, we
                                                                                Number of retweets




then identified relevant tweets by searching for the keywords                                                                       Iran
                                                                                                     1000
in the tweet dataset.                                                                                                              Swine
   Table 2 displays the keywords and the total number of                                                                           Jackson
                                                                                                      100
users and tweets for each topic. We focused on a period of
60 days starting from one day prior to the start of each event.                                        10
We limited the study duration because popular keywords                                                  1
were typically hijacked by spammers after certain time. The                                             0
table also shows the total number of users who received any                                                   1      10       100    1000    10000
tweet on the topic (termed audience). Each topic reached                                                                  User ranks
an audience of over 20 million, indicating that over 40% of                                                  (a) Retweet influence ranks
users in Twitter were aware of at least one of the three topics.
   Among the users who tweeted about any of these topics,                                            1000
                                                                                Number of ment ons




fewer than 2% discussed all three topics. Although 2% is a                                                                         Iran
small fraction, this set contains 13,219 users, which is a large                                                                   Swine
                                                                                                      100
                                                                                                                                   Jackson
enough sample size for statistical analysis. These users were
generally well connected; they had on average 2,037 follow-                                            10
ers, and together reached an audience of 16 million. Fur-
thermore, none of these users were dedicated to these three                                             1
                                                                                                        0
topics; no user had dedicated more than 60% of their tweets
to these topics. This means that the set of 13,219 users is                                                  1       10       100    1000    10000
enthusiastic about sharing thoughts on popular news topics                                                                User ranks

   2
                                                                                                             (b) Mention influence ranks
     The top Twitter trends identified by the Twitter Research team
are listed at http://tinyurl.com/yb4965e.                                    Figure 2: Distribution of user ranks for a given topic



                                                                     14
Variation of a user’s influence across topics
                                                                                                      100




                                                                             Ranks (Sw ne, Jackson)
                                                                                                                Swine
Next, in order to examine how volatile a user’s rank is across                                                  Jackson
different topics, we compare the relative order of influence                                            80

ranks across topics in Table 3. For the same reason we previ-                                          60
ously discussed in Table 1, we ignore the least popular users
who have tied ranks and focus on the set of relatively pop-                                            40
ular users, as measured by indegree. The top 10th and 1st                                              20
percentiles included 1,322 and 132 users, respectively.
                                                                                                        0
                                                                                                            0         5       10         15   20
Table 3: Spearman’s rank correlation coefficients over topics
                            Retweet            Mentions                                                                   Ranks (Iran)
         Topics         Top 10% 1% Top 10% 1%                                  Figure 3: Users’ retweet influence ranks over topics
    Iran vs. Swine        0.54       0.62     0.59       0.68
  Iran vs. Jackson        0.48       0.54     0.59       0.63                  The rise and fall of influentials over time
 Swine vs. Jackson        0.55       0.50     0.80       0.68            Many factors—social, political, and economical—affect
                                                                         popularity and influence of individuals and organizations. In
   The rank correlation is generally high (above 0.5) and gets           online social media, such dynamics is facilitated by easy en-
stronger for the top 1% of users. Mentions show an even                  try of competition. It only costs 140 characters to generate
stronger correlation across topics than retweets. This means             a tweet for any user. Likewise, it will be challenging for in-
that a popular user who is good at spawning mentions from                fluentials to maintain their status when many emerging local
others can do so over a wide range of topics, more easily                opinion leaders and evangelists enter the arena.
than when she is retweeted over diverse topics. Among topic                 Here we analyze the dynamics of individual’s influence
pairs, Swine and Jackson showed the highest correlation for              over time in two ways. First, we track the popularity of top
the top 10% of well-connected users for both the retweet and             influentials over a long term period and check how well they
mention influence. This is perhaps due to the more social                 maintain their ranks. Second, we focus on users who in-
nature of these two events, which differs from the Iran topic            creased their influence in a specific topic over a short time
where special interest groups like politicians and bloggers              period, in order to understand what behaviors make ordinary
played a major role.                                                     individuals influential.
   Given that the rank correlation gets stronger for users with
high indegree, we investigated how widely the ranks of the               Maintaining engagement of the top influentials
top listed users change by topic. Figure 3 shows the retweet             Out of all 6 million active users in the Twitter network, we
ranks of the top users in the Iran topic against the relative            picked the top 100 users based on each of the three mea-
retweet ranks of the same users in the Swine and Jackson                 sures: indegree, retweets, and mentions. We used all the
topics. It is clear from the figure that the top 5 users in               tweets ever posted on Twitter in identifying these influen-
the Iran topic retained their relative ranks in the other two            tials. Due to the overlap we discussed in Figure 1, there
topics. These users were Mashable, CnnBrk, TweetMeme,                    were 233 distinct users, whom we call all-time influentials.
Time, and BreakingNews, who were all in the category of                  In order to see how the influence of these all-time influen-
authoritative news sources and content trackers. The men-                tials varied over time, we tracked their influence scores over
tion influence also showed a similar trend (not shown here);              an 8 month period from January to August 2009.
most influentials ranked consistently high amongst different                 To get a time-varying measure of influence, we counted
topics. And we have observed this trend not only for the                 the number of retweets and mentions the all-time influen-
hyper-influentials: moderate influentials like opinion lead-               tials spawned in every 15 days over the 8 month period.
ers and evangelists also had consistent influence ranks over              Because we only know the indegree information based on
diverse topics, as shown in Table 3.                                     the final snapshot of the Twitter network, we do not use this
   Our findings about the highly skewed ability of users to               measure. For each user, we computed a single explanatory
influence others (Figure 2) and the strong correlation in a               variable P: the probability that a random tweet posted on
user’s influence rank across different topics (Table 3) to-               Twitter during a 15 day period is a retweet (or a mention) of
gether lead to two interesting conclusions. First, most influ-            that user. Normalizing by the total number of tweets posted
ential users hold significant influence over a variety of top-             on Twitter is essential to cancel out any variable effect on the
ics. This means that local opinion leaders and highly popular            data and allows the underlying characteristics of the data sets
figures could indeed be used to spread information outside                to be compared. For instance, because the Twitter network
their area of expertise. In fact, new advertisement campaigns            quadrupled over time in terms of the registered users, the to-
have recently been launched that insert advertisement links              tal volume of tweets merely increased over time. Hence, if
into a popular person’s tweet (Fiorillo 2009). Second, the               we didn’t normalize the results, the trend wouldn’t be inter-
power-law trend in the difference among influence of indi-                esting. Google similarly normalizes the data when analyzing
viduals indicates that it is substantially more effective to tar-        their search trends (Ginsberg et al. 2009).
get the top influentials than to employ a massive number of                  Figure 4 displays the time evolution of the normalized
non-popular users in order to kick start a viral campaign.               retweets and mentions of the 233 all-time influential users.



                                                                    15
0.0015                                                                                     0.004
Retweet probability P




                                                                                           Mention probability P
                                             Top 10 users                                                                                                           Top 10 users
                        0.0012
                                             Top 11 100                                                            0.003                                            Top 11 100
                                             Top 101 233                                                                                                            Top 101 233
                        0.0009
                                                                                                                   0.002
                        0.0006
                                                                                                                   0.001
                        0.0003

                            0                                                                                          0
                                 Jan   Feb     Mar     Apr    May       Jun   Jul   Aug                                    Jan   Feb     Mar     Apr    May       Jun   Jul   Aug
                                                     Time (year 2009)                                                                          Time (year 2009)
                                             (a) Retweet influence                                                                      (b) Mention influence
Figure 4: The temporal evolution of retweets and mentions for the all-time influential users. For each data point, the error bars
are centered on the average retweet (or mention) probability, and they extend up and down by two standard deviations.


Although the values of P appear small, they account for                                                            cant efforts in conversing with others (e.g., replying to their
a large volume of retweets and mentions: a single most                                                             audience). In a sense, they need self-advertisement the most,
popular influential user spawned up to 20,000 retweets and                                                          because mass media and celebrities have many other on- and
50,000 mentions over a random 15 day period. In order to                                                           off-line channels of to promote themselves.
capture the trend in more detail, we classified the influen-                                                            While our findings provide an interesting view of how dif-
tials into three groups based on indegree: the top 10 users,                                                       ferent groups of people maintain their popularity, we should
who were mostly the mainstream news sources; the next 90                                                           also emphasize that our analysis is in retrospect but not
users, who were mainly celebrities; and the rest, who were a                                                       causal. These findings are based on the set of users who
mixed group of public figures and opinion leaders that com-                                                         ultimately became popular. We also mention that all influ-
peted with the traditional mass media.                                                                             entials put efforts in posting creative and interesting tweets,
   The evolution of the retweet probability in Figure 4(a)                                                         as shown from the high correlation in their retweet ranks.
shows that all three groups mildly increased their influence
over time. Large error ranges for the top 10 users indicate a                                                      Rising influence of the ordinary users
much wider variability in the popularity of this group. The                                                        Finally we examined the users who increased their influence
growth, however, is marginal for all three groups. We con-                                                         over a short time period to understand what behaviors make
jecture that the marginal increase is due to the limited num-                                                      ordinary users influential. We focused on the set of users
ber of tweets users post a day. Broadcasting too many tweets                                                       who talked about only one news topic, out of the three news
puts even popular users at a risk of being classified as spam-                                                      topics in Table 2, and picked the top 20 users for each news
mers. Hence, Twitter users should moderate the number of                                                           topic based on indegree. We call these users topical influ-
broadcast tweets in order to avoid crossing their followers’                                                       entials. The topical influentials included dedicated accounts
information processing limit.                                                                                      like iranbaan, oxfordgirl, and TM Outbreak who suddenly
   The evolution of the mention probability in Figure 4(b)                                                         became popular over the course of the event. These users
shows distinct patterns for the three groups. The top 10                                                           were literally unheard-of at the beginning of the news events
users fell in popularity over time; the next group had consis-                                                     and didn’t receive any retweets or mentions prior to the rel-
tent influence scores over time; and the last group (the least                                                      evant news event. The list also included users like kevinrose
connected among the three) increased their popularity over                                                         (the founder of digg.com) and 106andpark (BET.com’s mu-
time. Users in the last group, surprisingly, spawned on aver-                                                      sic video site) who were already popular on a specific topic,
age more mentions than the top 100 users. While this trend                                                         and used the news events to extend their popularity.
is counter-intuitive at first, the differences between mentions                                                        We tracked the influence scores of the 60 topical influ-
and retweets can explain the trend.                                                                                entials over the 8 month in 2009. Again, we computed the
   The mainstream news organizations in the first group are                                                         variable P as the probability that a random tweet posted in
retweeted the most, but they are not mentioned the most.                                                           a 15 day period is a retweet (or a mention) to that user. Fig-
This is because their names come up mostly when their con-                                                         ure 5 displays the temporal evolution of influence for the
tent get retweeted; it is hard for media sources to engage                                                         topical influentials. Overall, the influence scores are much
users with their identities alone. The second group, com-                                                          lower than that of the global influentials in Figure 4. This is
prised of celebrities, is more often mentioned than retweeted                                                      expected since topical influentials had 3 to 180 times fewer
because of their name value. Their tweets also get retweeted,                                                      followers than the global influentials.
when influence is transferred across topics (Table 3). Evan-                                                           Influentials on Swine flu experienced relatively stable in-
gelists in the last group successfully increased their influ-                                                       fluence scores for both retweets and mentions. This is be-
ence. While many factors could explain this phenomenon,                                                            cause no single day involved a catastrophic event due to
our manual inspection revealed that these users put signifi-                                                        Swine flu. Influentials on the Iran election increased their



                                                                                          16

Mais conteúdo relacionado

Mais procurados

Who Says What to Whom on Twitter
Who Says What to Whom on TwitterWho Says What to Whom on Twitter
Who Says What to Whom on TwitterAlcance Media Group
 
051115_Writing Sample_Kaorop
051115_Writing Sample_Kaorop051115_Writing Sample_Kaorop
051115_Writing Sample_KaoropMukkamol Kaorop
 
Suazo%2c martínez & elgueta english version
Suazo%2c martínez & elgueta english versionSuazo%2c martínez & elgueta english version
Suazo%2c martínez & elgueta english version2011990
 
The Internet is a magnifying glass
The Internet is a magnifying glassThe Internet is a magnifying glass
The Internet is a magnifying glassSorin Adam Matei
 
Karenclassslides
KarenclassslidesKarenclassslides
Karenclassslideskcarter14
 
Peer Influence & Social Media Research Paper_A Watson
Peer Influence & Social Media Research Paper_A WatsonPeer Influence & Social Media Research Paper_A Watson
Peer Influence & Social Media Research Paper_A WatsonAlexandra Watson
 
All a twitter reddit edition
All a twitter reddit editionAll a twitter reddit edition
All a twitter reddit editionAndy Crosby
 
Relational Development & Interpersonal Communication In Computer Mediated Con...
Relational Development & Interpersonal Communication In Computer Mediated Con...Relational Development & Interpersonal Communication In Computer Mediated Con...
Relational Development & Interpersonal Communication In Computer Mediated Con...maxbury
 
Suazo, martínez & elgueta english version
Suazo, martínez & elgueta english versionSuazo, martínez & elgueta english version
Suazo, martínez & elgueta english version2011990
 
Jamie's FINAL poster
Jamie's FINAL posterJamie's FINAL poster
Jamie's FINAL posterJamie McCain
 
Advertising Versus Public Relations
Advertising Versus Public RelationsAdvertising Versus Public Relations
Advertising Versus Public Relationsdmichaelson
 
Always-on Research via MROCs
Always-on Research via MROCsAlways-on Research via MROCs
Always-on Research via MROCsTom De Ruyck
 

Mais procurados (17)

Who Says What to Whom on Twitter
Who Says What to Whom on TwitterWho Says What to Whom on Twitter
Who Says What to Whom on Twitter
 
A research paper on Twitter_Intrinsic versus image related utility in social ...
A research paper on Twitter_Intrinsic versus image related utility in social ...A research paper on Twitter_Intrinsic versus image related utility in social ...
A research paper on Twitter_Intrinsic versus image related utility in social ...
 
Jf2516311637
Jf2516311637Jf2516311637
Jf2516311637
 
051115_Writing Sample_Kaorop
051115_Writing Sample_Kaorop051115_Writing Sample_Kaorop
051115_Writing Sample_Kaorop
 
Web 3.0
Web 3.0Web 3.0
Web 3.0
 
Suazo%2c martínez & elgueta english version
Suazo%2c martínez & elgueta english versionSuazo%2c martínez & elgueta english version
Suazo%2c martínez & elgueta english version
 
The Internet is a magnifying glass
The Internet is a magnifying glassThe Internet is a magnifying glass
The Internet is a magnifying glass
 
Karenclassslides
KarenclassslidesKarenclassslides
Karenclassslides
 
Peer Influence & Social Media Research Paper_A Watson
Peer Influence & Social Media Research Paper_A WatsonPeer Influence & Social Media Research Paper_A Watson
Peer Influence & Social Media Research Paper_A Watson
 
All a twitter reddit edition
All a twitter reddit editionAll a twitter reddit edition
All a twitter reddit edition
 
Dv31821825
Dv31821825Dv31821825
Dv31821825
 
Relational Development & Interpersonal Communication In Computer Mediated Con...
Relational Development & Interpersonal Communication In Computer Mediated Con...Relational Development & Interpersonal Communication In Computer Mediated Con...
Relational Development & Interpersonal Communication In Computer Mediated Con...
 
Suazo, martínez & elgueta english version
Suazo, martínez & elgueta english versionSuazo, martínez & elgueta english version
Suazo, martínez & elgueta english version
 
Jamie's FINAL poster
Jamie's FINAL posterJamie's FINAL poster
Jamie's FINAL poster
 
Knowing your public
Knowing your publicKnowing your public
Knowing your public
 
Advertising Versus Public Relations
Advertising Versus Public RelationsAdvertising Versus Public Relations
Advertising Versus Public Relations
 
Always-on Research via MROCs
Always-on Research via MROCsAlways-on Research via MROCs
Always-on Research via MROCs
 

Semelhante a Measuring User Influence in Twitter

World Civilization I Professor Cieglo Spring 2019 .docx
World Civilization I Professor Cieglo Spring 2019 .docxWorld Civilization I Professor Cieglo Spring 2019 .docx
World Civilization I Professor Cieglo Spring 2019 .docxdunnramage
 
Synergizing Natural and Research Communities
Synergizing Natural and Research CommunitiesSynergizing Natural and Research Communities
Synergizing Natural and Research CommunitiesTom De Ruyck
 
Synergizing natural and research communities: Caring about the research ecosy...
Synergizing natural and research communities: Caring about the research ecosy...Synergizing natural and research communities: Caring about the research ecosy...
Synergizing natural and research communities: Caring about the research ecosy...InSites Consulting
 
Estudio Influencia Y Pasividad En Social Media
Estudio Influencia Y Pasividad En Social MediaEstudio Influencia Y Pasividad En Social Media
Estudio Influencia Y Pasividad En Social Mediaeliasvillagran
 
Audience Theories: From Source-dominated to Active Audience Theories
Audience Theories:  From Source-dominated to Active Audience TheoriesAudience Theories:  From Source-dominated to Active Audience Theories
Audience Theories: From Source-dominated to Active Audience TheoriesIslam Muhammad
 
Riwa Malaeb : Two Step Flow Theory powerpoint
Riwa Malaeb : Two Step Flow Theory powerpointRiwa Malaeb : Two Step Flow Theory powerpoint
Riwa Malaeb : Two Step Flow Theory powerpointriwamalaeb2
 
Towards Decision Support and Goal AchievementIdentifying Ac.docx
Towards Decision Support and Goal AchievementIdentifying Ac.docxTowards Decision Support and Goal AchievementIdentifying Ac.docx
Towards Decision Support and Goal AchievementIdentifying Ac.docxturveycharlyn
 
Do not blame it on the algorithm an empirical assessment of multiple recommen...
Do not blame it on the algorithm an empirical assessment of multiple recommen...Do not blame it on the algorithm an empirical assessment of multiple recommen...
Do not blame it on the algorithm an empirical assessment of multiple recommen...Tel-Aviv Journalists' Association
 
Networks, Crowds & Markets Final Paper
Networks, Crowds & Markets Final PaperNetworks, Crowds & Markets Final Paper
Networks, Crowds & Markets Final PaperJason Rawlins
 
Gate-Keeping Theory in the Course Media Theories
Gate-Keeping Theory in the Course Media TheoriesGate-Keeping Theory in the Course Media Theories
Gate-Keeping Theory in the Course Media TheoriesShazaHijazi
 
Concurrent Distractions: A Cross-Cultural Study of Media Multitasking Behavior
Concurrent Distractions: A Cross-Cultural Study of Media Multitasking BehaviorConcurrent Distractions: A Cross-Cultural Study of Media Multitasking Behavior
Concurrent Distractions: A Cross-Cultural Study of Media Multitasking BehaviorAJHSSR Journal
 
Social media marketing vignesh vaidyanathan-converted
Social media marketing   vignesh vaidyanathan-convertedSocial media marketing   vignesh vaidyanathan-converted
Social media marketing vignesh vaidyanathan-convertedVIGNESH VAIDYANATHAN
 
Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...Axel Bruns
 
A Study On The Changing Trends In Social Media And Its Impact Globally
A Study On The Changing Trends In Social Media And Its Impact GloballyA Study On The Changing Trends In Social Media And Its Impact Globally
A Study On The Changing Trends In Social Media And Its Impact GloballyAlicia Edwards
 
Corporate Communication
Corporate CommunicationCorporate Communication
Corporate CommunicationGaganbatth1987
 
Comprehensive Social Media Security Analysis & XKeyscore Espionage Technology
Comprehensive Social Media Security Analysis & XKeyscore Espionage TechnologyComprehensive Social Media Security Analysis & XKeyscore Espionage Technology
Comprehensive Social Media Security Analysis & XKeyscore Espionage TechnologyCSCJournals
 

Semelhante a Measuring User Influence in Twitter (20)

Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
Laurence Favier, University Charles De Gaulle – Lille 3: Social Influence and...
 
World Civilization I Professor Cieglo Spring 2019 .docx
World Civilization I Professor Cieglo Spring 2019 .docxWorld Civilization I Professor Cieglo Spring 2019 .docx
World Civilization I Professor Cieglo Spring 2019 .docx
 
Synergizing Natural and Research Communities
Synergizing Natural and Research CommunitiesSynergizing Natural and Research Communities
Synergizing Natural and Research Communities
 
Synergizing natural and research communities: Caring about the research ecosy...
Synergizing natural and research communities: Caring about the research ecosy...Synergizing natural and research communities: Caring about the research ecosy...
Synergizing natural and research communities: Caring about the research ecosy...
 
Estudio Influencia Y Pasividad En Social Media
Estudio Influencia Y Pasividad En Social MediaEstudio Influencia Y Pasividad En Social Media
Estudio Influencia Y Pasividad En Social Media
 
Always-on research
Always-on researchAlways-on research
Always-on research
 
Audience Theories: From Source-dominated to Active Audience Theories
Audience Theories:  From Source-dominated to Active Audience TheoriesAudience Theories:  From Source-dominated to Active Audience Theories
Audience Theories: From Source-dominated to Active Audience Theories
 
Research Proposal Writing
Research Proposal Writing Research Proposal Writing
Research Proposal Writing
 
Riwa Malaeb : Two Step Flow Theory powerpoint
Riwa Malaeb : Two Step Flow Theory powerpointRiwa Malaeb : Two Step Flow Theory powerpoint
Riwa Malaeb : Two Step Flow Theory powerpoint
 
Towards Decision Support and Goal AchievementIdentifying Ac.docx
Towards Decision Support and Goal AchievementIdentifying Ac.docxTowards Decision Support and Goal AchievementIdentifying Ac.docx
Towards Decision Support and Goal AchievementIdentifying Ac.docx
 
Do not blame it on the algorithm an empirical assessment of multiple recommen...
Do not blame it on the algorithm an empirical assessment of multiple recommen...Do not blame it on the algorithm an empirical assessment of multiple recommen...
Do not blame it on the algorithm an empirical assessment of multiple recommen...
 
Networks, Crowds & Markets Final Paper
Networks, Crowds & Markets Final PaperNetworks, Crowds & Markets Final Paper
Networks, Crowds & Markets Final Paper
 
Gate-Keeping Theory in the Course Media Theories
Gate-Keeping Theory in the Course Media TheoriesGate-Keeping Theory in the Course Media Theories
Gate-Keeping Theory in the Course Media Theories
 
Concurrent Distractions: A Cross-Cultural Study of Media Multitasking Behavior
Concurrent Distractions: A Cross-Cultural Study of Media Multitasking BehaviorConcurrent Distractions: A Cross-Cultural Study of Media Multitasking Behavior
Concurrent Distractions: A Cross-Cultural Study of Media Multitasking Behavior
 
Social media marketing vignesh vaidyanathan-converted
Social media marketing   vignesh vaidyanathan-convertedSocial media marketing   vignesh vaidyanathan-converted
Social media marketing vignesh vaidyanathan-converted
 
Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...Information Contagion through Social Media: Towards a Realistic Model of the ...
Information Contagion through Social Media: Towards a Realistic Model of the ...
 
A Study On The Changing Trends In Social Media And Its Impact Globally
A Study On The Changing Trends In Social Media And Its Impact GloballyA Study On The Changing Trends In Social Media And Its Impact Globally
A Study On The Changing Trends In Social Media And Its Impact Globally
 
Corporate Communication
Corporate CommunicationCorporate Communication
Corporate Communication
 
H018144450
H018144450H018144450
H018144450
 
Comprehensive Social Media Security Analysis & XKeyscore Espionage Technology
Comprehensive Social Media Security Analysis & XKeyscore Espionage TechnologyComprehensive Social Media Security Analysis & XKeyscore Espionage Technology
Comprehensive Social Media Security Analysis & XKeyscore Espionage Technology
 

Mais de augustodefranco .

Franco, Augusto (2017) Conservadorismo, liberalismo econômico e democracia
Franco, Augusto (2017) Conservadorismo, liberalismo econômico e democraciaFranco, Augusto (2017) Conservadorismo, liberalismo econômico e democracia
Franco, Augusto (2017) Conservadorismo, liberalismo econômico e democraciaaugustodefranco .
 
Franco, Augusto (2018) Os diferentes adversários da democracia no brasil
Franco, Augusto (2018) Os diferentes adversários da democracia no brasilFranco, Augusto (2018) Os diferentes adversários da democracia no brasil
Franco, Augusto (2018) Os diferentes adversários da democracia no brasilaugustodefranco .
 
A democracia sob ataque terá de ser reinventada
A democracia sob ataque terá de ser reinventadaA democracia sob ataque terá de ser reinventada
A democracia sob ataque terá de ser reinventadaaugustodefranco .
 
Algumas notas sobre os desafios de empreender em rede
Algumas notas sobre os desafios de empreender em redeAlgumas notas sobre os desafios de empreender em rede
Algumas notas sobre os desafios de empreender em redeaugustodefranco .
 
APRENDIZAGEM OU DERIVA ONTOGENICA
APRENDIZAGEM OU DERIVA ONTOGENICA APRENDIZAGEM OU DERIVA ONTOGENICA
APRENDIZAGEM OU DERIVA ONTOGENICA augustodefranco .
 
CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...
CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...
CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...augustodefranco .
 
NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...
NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...
NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...augustodefranco .
 
100 DIAS DE VERÃO BOOK DO PROGRAMA
100 DIAS DE VERÃO BOOK DO PROGRAMA100 DIAS DE VERÃO BOOK DO PROGRAMA
100 DIAS DE VERÃO BOOK DO PROGRAMAaugustodefranco .
 
Nunca a humanidade dependeu tanto da rede social
Nunca a humanidade dependeu tanto da rede socialNunca a humanidade dependeu tanto da rede social
Nunca a humanidade dependeu tanto da rede socialaugustodefranco .
 
Um sistema estatal de participação social?
Um sistema estatal de participação social?Um sistema estatal de participação social?
Um sistema estatal de participação social?augustodefranco .
 
Quando as eleições conspiram contra a democracia
Quando as eleições conspiram contra a democraciaQuando as eleições conspiram contra a democracia
Quando as eleições conspiram contra a democraciaaugustodefranco .
 
Democracia cooperativa: escritos políticos escolhidos de John Dewey
Democracia cooperativa: escritos políticos escolhidos de John DeweyDemocracia cooperativa: escritos políticos escolhidos de John Dewey
Democracia cooperativa: escritos políticos escolhidos de John Deweyaugustodefranco .
 
RELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELA
RELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELARELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELA
RELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELAaugustodefranco .
 
Diálogo democrático: um manual para practicantes
Diálogo democrático: um manual para practicantesDiálogo democrático: um manual para practicantes
Diálogo democrático: um manual para practicantesaugustodefranco .
 

Mais de augustodefranco . (20)

Franco, Augusto (2017) Conservadorismo, liberalismo econômico e democracia
Franco, Augusto (2017) Conservadorismo, liberalismo econômico e democraciaFranco, Augusto (2017) Conservadorismo, liberalismo econômico e democracia
Franco, Augusto (2017) Conservadorismo, liberalismo econômico e democracia
 
Franco, Augusto (2018) Os diferentes adversários da democracia no brasil
Franco, Augusto (2018) Os diferentes adversários da democracia no brasilFranco, Augusto (2018) Os diferentes adversários da democracia no brasil
Franco, Augusto (2018) Os diferentes adversários da democracia no brasil
 
Hiérarchie
Hiérarchie Hiérarchie
Hiérarchie
 
A democracia sob ataque terá de ser reinventada
A democracia sob ataque terá de ser reinventadaA democracia sob ataque terá de ser reinventada
A democracia sob ataque terá de ser reinventada
 
JERARQUIA
JERARQUIAJERARQUIA
JERARQUIA
 
Algumas notas sobre os desafios de empreender em rede
Algumas notas sobre os desafios de empreender em redeAlgumas notas sobre os desafios de empreender em rede
Algumas notas sobre os desafios de empreender em rede
 
AS EMPRESAS DIANTE DA CRISE
AS EMPRESAS DIANTE DA CRISEAS EMPRESAS DIANTE DA CRISE
AS EMPRESAS DIANTE DA CRISE
 
APRENDIZAGEM OU DERIVA ONTOGENICA
APRENDIZAGEM OU DERIVA ONTOGENICA APRENDIZAGEM OU DERIVA ONTOGENICA
APRENDIZAGEM OU DERIVA ONTOGENICA
 
CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...
CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...
CONDORCET, Marquês de (1792). Relatório de projeto de decreto sobre a organiz...
 
NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...
NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...
NIETZSCHE, Friederich (1888). Os "melhoradores" da humanidade, Parte 2 e O qu...
 
100 DIAS DE VERÃO BOOK DO PROGRAMA
100 DIAS DE VERÃO BOOK DO PROGRAMA100 DIAS DE VERÃO BOOK DO PROGRAMA
100 DIAS DE VERÃO BOOK DO PROGRAMA
 
Nunca a humanidade dependeu tanto da rede social
Nunca a humanidade dependeu tanto da rede socialNunca a humanidade dependeu tanto da rede social
Nunca a humanidade dependeu tanto da rede social
 
Um sistema estatal de participação social?
Um sistema estatal de participação social?Um sistema estatal de participação social?
Um sistema estatal de participação social?
 
Quando as eleições conspiram contra a democracia
Quando as eleições conspiram contra a democraciaQuando as eleições conspiram contra a democracia
Quando as eleições conspiram contra a democracia
 
100 DIAS DE VERÃO
100 DIAS DE VERÃO100 DIAS DE VERÃO
100 DIAS DE VERÃO
 
Democracia cooperativa: escritos políticos escolhidos de John Dewey
Democracia cooperativa: escritos políticos escolhidos de John DeweyDemocracia cooperativa: escritos políticos escolhidos de John Dewey
Democracia cooperativa: escritos políticos escolhidos de John Dewey
 
MULTIVERSIDADE NA ESCOLA
MULTIVERSIDADE NA ESCOLAMULTIVERSIDADE NA ESCOLA
MULTIVERSIDADE NA ESCOLA
 
DEMOCRACIA E REDES SOCIAIS
DEMOCRACIA E REDES SOCIAISDEMOCRACIA E REDES SOCIAIS
DEMOCRACIA E REDES SOCIAIS
 
RELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELA
RELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELARELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELA
RELATÓRIO DO HUMAN RIGHTS WATCH SOBRE A VENEZUELA
 
Diálogo democrático: um manual para practicantes
Diálogo democrático: um manual para practicantesDiálogo democrático: um manual para practicantes
Diálogo democrático: um manual para practicantes
 

Último

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Último (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Measuring User Influence in Twitter

  • 1. Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha∗ Hamed Haddadi† Fabr´cio Benevenuto‡ ı Krishna P. Gummadi∗ ∗ Max Planck Institute for Software Systems (MPI-SWS), Germany † Royal Veterinary College, University of London, United Kingdom ‡ CS Dept., Federal University of Minas Gerais (UFMG), Brazil Abstract a minority of users, called influentials, excel in persuading others (Rogers 1962). This theory predicts that by target- Directed links in social media could represent anything ing these influentials in the network, one may achieve a from intimate friendships to common interests, or even a passion for breaking news or celebrity gossip. Such large-scale chain-reaction of influence driven by word-of- directed links determine the flow of information and mouth, with a very small marketing cost (Katz and Lazars- hence indicate a user’s influence on others—a concept feld 1955). A more modern view, in contrast, de-emphasizes that is crucial in sociology and viral marketing. In this the role of influentials. Instead, it posits that the key fac- paper, using a large amount of data collected from Twit- tors determining influence are (i) the interpersonal rela- ter, we present an in-depth comparison of three mea- tionship among ordinary users and (ii) the readiness of a sures of influence: indegree, retweets, and mentions. society to adopt an innovation (Watts and Dodds 2007; Based on these measures, we investigate the dynam- Domingos and Richardson 2001). This modern view of in- ics of user influence across topics and time. We make fluence leads to marketing strategies such as collaborative several interesting observations. First, popular users filtering. These theories, however, are still just theories, be- who have high indegree are not necessarily influential in terms of spawning retweets or mentions. Second, cause there has been a lack of empirical data that could be most influential users can hold significant influence over used to validate either of them. The recent advent of social a variety of topics. Third, influence is not gained spon- networking sites and the data within such sites now allow taneously or accidentally, but through concerted effort researchers to empirically validate these theories. such as limiting tweets to a single topic. We believe that Moving from theory into practice, we find that there are these findings provide new insights for viral marketing many other unanswered questions about how influence dif- and suggest that topological measures such as indegree fuses through a population and whether it varies across top- alone reveals very little about the influence of a user. ics and time. People have different levels of expertise on various subjects. When it comes to marketing, however, this Introduction fact is generally ignored. Marketing services actively search for potential influencers to promote various items. These Influence has long been studied in the fields of sociology, influencers range from “cool” teenagers, local opinion lead- communication, marketing, and political science (Rogers ers, all the way to popular public figures. However, the ad- 1962; Katz and Lazarsfeld 1955). The notion of influence vertised items are often far outside the domain of expertise plays a vital role in how businesses operate and how a soci- of these hired individuals. So how effective are these mar- ety functions—for instance, see observations on how fashion keting strategies? Can a person’s influence in one area be spreads (Gladwell 2002) and how people vote (Berry and transferred to other areas? Keller 2003). Studying influence patterns can help us bet- In this paper, we present an empirical analysis of influ- ter understand why certain trends or innovations are adopted ence patterns in a popular social medium. Using a large faster than others and how we could help advertisers and amount of data gathered from Twitter, we compare three dif- marketers design more effective campaigns. Studying influ- ferent measures of influence: indegree, retweets, and men- ence patterns, however, has been difficult. This is because tions.1 Focusing on different topics, we examine how the such a study does not lend itself to readily available quan- three types of influential users performed in spreading pop- tification, and essential components like human choices and ular news topics. We also investigate the dynamics of an the ways our societies function cannot be reproduced within individual’s influence by topic and over time. Finally, we the confines of the lab. characterize the precise behaviors that make ordinary indi- Nevertheless, there have been important theoretical stud- viduals gain high influence over a short period of time. ies on the diffusion of influence, albeit with radically dif- ferent results. Traditional communication theory states that 1 Indegree is the number of people who follow a user; retweets Copyright c 2010, Association for the Advancement of Artificial mean the number of times others “forward” a user’s tweet; and Intelligence (www.aaai.org). All rights reserved. mentions mean the number of times others mention a user’s name. 10
  • 2. The Twitter dataset used in this paper consists of 2 billion In contrast, a more modern view of information flow em- follow links among 54 million users who produced a total of phasizes the importance of prevailing culture more than the 1.7 billion tweets. We refer readers to our project webpage role of influentials. Some researchers have reasoned that http://twitter.mpi-sws.org/ for a detailed description of the people in the new information age make choices based on dataset and our data sharing plan. the opinions of their peers and friends, rather than by influ- Our study provides several findings that have direct impli- entials (Domingos and Richardson 2001). These researchers cations in the design of social media and viral marketing: argued that direct marketing through influentials would not 1) Analysis of the three influence measures provides a better be as profitable as using “network”-based advertising such understanding of the different roles users play in social as collaborative filtering. media. Indegree represents popularity of a user; retweets The traditional influentials theory has also been criticized represent the content value of one’s tweets; and mentions because its information flow process does not take into ac- represent the name value of a user. Hence, the top users count the role of ordinary users. In order to compare the based on the three measures have little overlap. role of influentials and ordinary users, researchers have de- veloped a series of simulations, in which information flows 2) Our finding on how influence varies across topics could freely between users, and a user adopts an innovation when serve as a useful test for answering how effective adver- he is influenced by more than a threshold of the sample pop- tisement in Twitter would be if one is to employ influential ulation (Watts and Dodds 2007). Influentials were defined users. Our analysis shows that most influential users hold as those in the top 10% of influence distribution. The simu- significant influence over a variety of topics. lation showed that influentials initiated more frequent and 3) Ordinary users can gain influence by focusing on a sin- larger cascades than average users, but they were neither gle topic and posting creative and insightful tweets that necessary nor sufficient for all diffusions, as suggested in are perceived as valuable by others, as opposed to simply the traditional theory. Moreover, in homogeneous networks, conversing with others. influentials were no more successful in running long cas- cades than ordinary users. This means that a trend’s success These findings provide new insights for viral marketing. depends not on the person who starts it, but on how suscep- The first finding in particular indicates that indegree alone tible the society is overall to the trend (Watts 2007). In fact, reveals little about the influence of a user. This has been a trend can be initiated by any one, and if the environment is coined the million follower fallacy by Avnit (Avnit 2009), right, it will spread. Therefore, Watts dubbed early adopters who pointed to anecdotal evidence that some users follow or opinion leaders “accidental” influentials. others simply for etiquette—it’s polite to follow someone The above competing ideas have remained as hypotheses who’s following you—and many do not read all the broad- for several reasons. First is the lack of data that could be cast tweets. We have empirically demonstrated that having a used to empirically test them. Although there exist a handful million followers does not always mean much in the Twitter of empirical studies on word-of-mouth influence (Leskovec, world. Instead, we claim that it is more influential to have Adamic, and Huberman 2007; Cha, Mislove, and Gummadi an active audience who retweets or mentions the user. 2009), no work has been conducted on the relative order of influence among individuals. A second issue is the va- Defining influence on Twitter riety of ways that influence has been defined (Watts 2007; We start by reviewing studies of diffusion of influence and Goyal, Bonchi, and Lakshmanan 2010). It has been unclear related work on influence propagation on Twitter. what exactly influence means. Finally, decades have passed since the influentials theory appeared. Even if the theory Background was reasonably accurate when it was proposed, things have There are a number of conflicting ideas and theories about changed and now we have much more variability in the flow how trends and innovations get adopted and spread. of influence. In particular, online communities have become The traditional view assumes that a minority of mem- a significant way we receive new information and influence bers in a society possess qualities that make them excep- in such communities needs to be explored. In this paper, we tionally persuasive in spreading ideas to others. These ex- investigate the notion of influence using a large amount of ceptional individuals drive trends on behalf of the major- data collected from a popular social medium, Twitter. ity of ordinary people. They are loosely described as be- ing informed, respected, and well-connected; they are called Measuring influence on Twitter the opinion leaders in the two-step flow theory (Katz and The Merriam-Webster dictionary defines influence as “the Lazarsfeld 1955), innovators in the diffusion of innovations power or capacity of causing an effect in indirect or intan- theory (Rogers 1962), and hubs, connectors, or mavens in gible ways.” Despite the large number of theories of influ- other work (Gladwell 2002). The theory of influentials is ence in sociology, there is no tangible way to measure such intuitive and compelling. By identifying and convincing a a force nor is there a concrete definition of what influence small number of influential individuals, a viral campaign can means, for instance, in the spread of news. reach a wide audience at a small cost. The theory spread well In this paper, we analyze the Twitter network as a news beyond academia and has been adopted in many marketing spreading medium and study the types and degrees of influ- businesses, e.g., RoperASW and Tremor (Gladwell 2002; ence within the network. Focusing on an individual’s po- Berry and Keller 2003). tential to lead others to engage in a certain act, we highlight 11
  • 3. three “interpersonal” activities on Twitter. First, users in- us to gather large amounts of data. We used the Twitter API teract by following updates of people who post interesting to gather information about a user’s social links and tweets. tweets. Second, users can pass along interesting pieces of We launched our crawler in August 2009 for all user IDs information to their followers. This act is popularly known ranging from 0 to 80 million. We did not look beyond 80 as retweeting and can typically be identified by the use million, because no single user in the collected data had a of RT @username or via @username in tweets. Fi- link to a user whose ID was greater than that value. Out of 80 nally, users can respond to (or comment on) other people’s million possible IDs, we found 54,981,152 in-use accounts, tweets, which we call mentioning. Mentioning is identified which were connected to each other by 1,963,263,821 social by searching for @username in the tweet content, after ex- links. We gathered information about a user’s follow links cluding retweets. A tweet that starts with @username is and all tweets ever posted by each user since the early days not broadcast to all followers, but only to the replied user. A of the service. In total, there were 1,755,925,520 tweets. tweet containing @username in the middle of its text gets Nearly 8% of all user accounts were set private, so that only broadcast to all followers. These three activities represent their friends could view their tweets. We ignore these users the different types of influence of a person: in our analysis. The social link information is based on the final snapshot of the network topology at the time of crawl- 1. Indegree influence, the number of followers of a user, ing and we do not know when the links were formed. directly indicates the size of the audience for that user. The network of Twitter users comprises a single dispro- 2. Retweet influence, which we measure through the num- portionately large connected component (containing 94.8% ber of retweets containing one’s name, indicates the abil- of users), singletons (5%), and smaller components (0.2%). ity of that user to generate content with pass-along value. The largest component contains 99% of all links and tweets. 3. Mention influence, which we measure through the num- Our goal is to explore influence of users, hence we focus on ber of mentions containing one’s name, indicates the abil- the largest component of the network, which is conceptually ity of that user to engage others in a conversation. a single interaction domain for users. Because it is hard to determine influence of users who have few tweets, we borrowed the concept of “active users” Related work on Twitter from the traditional media research (Levy and Windhal Several recent efforts have been made to track influence 1985) and focused on those users with some minimum level on Twitter. The Web Ecology Project tracked 12 popu- of activity. We ignored users who had posted fewer than 10 lar Twitter users for a 10-day period and grouped a user’s tweets during their entire lifetime. We also ignored users influence into two types: conversation-based and content- for whom we did not have a valid screen name, because this based (Leavitt et al. 2009). This work concluded that news information is crucial in identifying the number of times a media are better at spreading content, while celebrities are user was mentioned or retweeted by others. After filtering, better at simply making conversation. Our work extends there were 6,189,636 users, whom we focus on in the re- their notion of influence and uses extensive data to further mainder of this paper. To measure the influence of these 6 examine the spatial and temporal dynamics of influence. million users, however, we looked into how the entire set of More recently, a PageRank-like measure has been pro- 52 million users interacted with these active users. posed to quantify influence on Twitter (Weng et al. 2010). The authors found high link reciprocity (72%) from a non- Methodology for comparing user influence random sample of 6,748 Singapore-based users, and argued For each of the 6 million users, we computed the value of that high reciprocity is indicative of homophily. They then each influence measure and compared them. Rather than exploited this fact in computing a user’ influence rank. Our comparing the values directly, we used the relative order of study, however, contradicts the observation about high reci- users’ ranks as a measure of difference. In order to do this, procity; near-complete data of Twitter shows low reciprocity we sorted users by each measure, so that the rank of 1 in- (10%). Thus, we predict that social links on Twitter repre- dicates the most influential user and increasing rank indi- sent an influence relationship, rather than homophily. Ac- cates a less influential user. Users with the same influence cordingly, we ask what are the different activities on Twitter value receive the average of the rank amongst them (Buck that represent influence of a user and to what extent a per- 1980). Once every user is assigned a rank for each influence son’s influence varies across tweet topic and time. measure, we are ready to quantify how a user’s rank varies across different measures and examine what kinds of users Characteristics of the top influentials are ranked high for a given measure. We used Spearman’s rank correlation coefficient We describe how we collected the Twitter data and present the characteristics of the top users based on three influence 6 (xi − yi )2 ρ=1− (1) measures: indegree, retweets, and mentions. N3 − N as a measure of the strength of the association between two Dataset rank sets, where xi and yi are the ranks of users based on We asked Twitter administrators to allow us to gather data two different influence measures in a dataset of N users. from their site at scale. They graciously white-listed the IP Spearman’s rank correlation coefficient calculation is a non- address range containing 58 of our servers, which allowed parametric test; the coefficient assesses how well an arbi- 12
  • 4. trary monotonic function could describe the relationship be- tween two variables, without making any other assumptions
  • 5. about the particular nature of the relationship between the variables. Our inclusive and complete dataset guarantees re- liability of the correlation estimates. The closer ρ is to +1 or −1, the stronger the likely correlation. A perfect positive correlation is +1 and a perfect negative correlation is −1. Comparing three measures of user influence To see what kinds of users are the most influential, we visited the Twitter pages of the top-20 users based on each measure. The top influentials The most followed users span a wide Figure 1: Venn diagram of the top-100 influentials across variety of public figures and news sources. They were measures: The chart is normalized so that the total is 100%. news sources (CNN, New York Times), politicians (Barack Obama), athletes (Shaquille O’Neal), as well as celebrities lap, as shown in Figure 1, indicating that the three measures like actors, writers, musicians, and models (Ashton Kutcher, capture different types of influence. Britney Spears). As the list suggests, indegree measure is useful when we want to identify users who get lots of at- Relative influence ranks In order to investigate how the tention from their audience through one-on-one interactions, three measures correlate, we compared the relative influence i.e., the audience is directly connected to influentials. ranks of all 6 million users (Table 1). We see a moderately high correlation (above 0.5) across all pairs. However, the The most retweeted users were content aggregation ser- high correlation appears to be an artifact of the tied ranks vices (Mashable, TwitterTips, TweetMeme), businessmen among the least influential users, e.g., many of the least con- (Guy Kawasaki), and news sites (The New York Times, The nected users also received zero retweet and mention. To Onion). They are trackers of trending topic and knowledge- avoid this bias, we focused on the set of relatively popu- able people in different fields, whom other users decide to lar users. We considered users in the top 10th and 1st per- retweet. Unlike indegree, retweets represent influence of a centiles based on indegree, in the hope that users who get user beyond one’s one-to-one interaction domain; popular retweeted or mentioned must have some followers. tweets could propagate multiple hops away from the source before they are retweeted throughout the network. Further- Table 1: Spearman’s rank correlation coefficients more, because of the tight connection between users as sug- Correlation All Top 10% Top 1% gested in the triadic closure (Granovetter 1973), retweeting Indegree vs retweets 0.549 0.122 0.109 in a social network can serve as a powerful tool to reinforce Indegree vs mentions 0.638 0.286 0.309 a message—for instance, the probability of adopting an in- Retweets vs mentions 0.580 0.638 0.605 novation increases when not one but a group of users repeat the same message (Watts and Dodds 2007). After this filtering step, the top users showed a strong The most mentioned users were mostly celebrities. Ordi- correlation in their retweet influence and mention influence. nary users showed a great passion for celebrities, regularly Sampling the top users based on retweets or mentions leads posting messages to them or mentioning them, without nec- to similar results. This means that, in general, users who get essarily retweeting their posts. This indicates that celebrities mentioned often also get retweeted often, and vice versa. In- are often in the center of public attention and celebrity gos- degree, however, was not related to the other measures. We sip is a popular activity among Twitter users. conclude that the most connected users are not necessarily If retweets represent a citation of another user’s con- the most influential when it comes to engaging one’s audi- tent, mentions represent a public response to another user’s ence in conversations and having one’s messages spread. tweet—the focus of a tweet is on content for retweets, while the focus is on the replied user for mentions. This can be Discussion of methodology Normalizing retweets and confirmed from the usage of conventions in tweets: 92% of mentions by total tweets would yield a different measure tweets that had a RT or via marker contained a URL and of influence, which might have led to very different results. 97% of them also contained the @username field. This When we tried normalizing the data, we identified local means that retweets are about the content (indicated by the opinion leaders as the most influential. However, normal- embedded URL) and that people typically cite the authen- ization failed to rank users with the highest sheer number of tic source when they retweet. However, fewer than 30% of retweets as influential. Therefore, in this paper, we use the tweets that were classified as mentions contained any URL, sheer number of retweets and mentions without normalizing indicating that a mention is more identity-driven. these values by the total tweets of a user. Across all three measures, the top influentials were gener- Other measures such as the number of tweets and out- ally recognizable public figures and websites. Interestingly, degree (i.e., the number of people a user follows) were not we saw marginal overlap in these three top lists. These top- found to be useful, because they identified robots and spam- 20 lists only had 2 users in common: Ashton Kutcher and mers as the most influential, respectively. Therefore, we do Puff Daddy. The top-100 lists also showed marginal over- not use these measures. 13
  • 6. Table 2: Summary information of the three major topics events studied Topic Period Keywords Users Tweets Audience Iran Jun 11—Aug 10 #iranelection, names of politicians 302,130 1,482,038 22,177,836 Swine May 3—July 2 Mexico flu, H1N1, swine 239,329 495,825 20,977,793 Jackson Jun 25—Aug 24 Michael Jackson, #mj 610,213 1,418,356 23,550,211 Finally, we calculated indegree based on the snapshot of from diverse genres. These properties make this group ideal the network at the time of crawling in 2009, because we for studying how a user’s influence varies across vastly dif- do not know the time when each follow link was formed. ferent topic genres. For the calculation of retweets and mentions, however, we used longitudinal data (i.e., since the beginning of the Twit- Distribution of the influence ranks ter service in 2006). This difference could have resulted in To get a measure of influence for a given topic, we count a weaker correlation between indegree and the other influ- only the retweets and mentions a user spawned on the given ence measures (Table 1). Nevertheless, nearly three quarters topic. Because indegree is invariant across topics, we do not of Twitter users joined in 2009, suggesting that the effect on use the measure in this analysis. the correlation would have been minimal. Before investigating the dynamics of influence across top- ics, we first need to understand the high-level characteris- Does influence hold across different topics? tics of user influence, such as the degree to which users’ Having defined three different measures of influence on influence can differ. Figure 2 displays the influence ranks Twitter, we now investigate whether a user’s influence varies of users based on retweets and mentions across the Iran, by topic genres. In order to investigate this, we first need to Swine, and Jackson topics. The influence ranks are calcu- find users who tweeted about a diverse set of topics. lated for each topic and a user’s rank may differ for different topics. The plots show a straight line on a log-log plot, a Methodology for identifying target topics property that is referred to as the power-law characteristic. The power-law pattern is indicative of the fact that users’ To find as many users to monitor as possible, we picked three degree of influence can differ by orders of magnitude: the of the most popular topics in 2009 that were considered en- top influentials were retweeted or mentioned disproportion- gaging and revolutionary in Twitter2 : the Iranian presiden- ately more times than the majority of users. This suggests tial election, the outbreak of the H1N1 influenza, and the that utilizing top influentials has a great potential payoff in death of Michael Jackson. While all three topics were popu- marketing strategy. lar, they span political, health, and social genres. To extract tweets relevant to these events, we first identified the set of keywords describing the topics by consulting news websites 10000 and informed individuals. Using the selected keywords, we Number of retweets then identified relevant tweets by searching for the keywords Iran 1000 in the tweet dataset. Swine Table 2 displays the keywords and the total number of Jackson 100 users and tweets for each topic. We focused on a period of 60 days starting from one day prior to the start of each event. 10 We limited the study duration because popular keywords 1 were typically hijacked by spammers after certain time. The 0 table also shows the total number of users who received any 1 10 100 1000 10000 tweet on the topic (termed audience). Each topic reached User ranks an audience of over 20 million, indicating that over 40% of (a) Retweet influence ranks users in Twitter were aware of at least one of the three topics. Among the users who tweeted about any of these topics, 1000 Number of ment ons fewer than 2% discussed all three topics. Although 2% is a Iran small fraction, this set contains 13,219 users, which is a large Swine 100 Jackson enough sample size for statistical analysis. These users were generally well connected; they had on average 2,037 follow- 10 ers, and together reached an audience of 16 million. Fur- thermore, none of these users were dedicated to these three 1 0 topics; no user had dedicated more than 60% of their tweets to these topics. This means that the set of 13,219 users is 1 10 100 1000 10000 enthusiastic about sharing thoughts on popular news topics User ranks 2 (b) Mention influence ranks The top Twitter trends identified by the Twitter Research team are listed at http://tinyurl.com/yb4965e. Figure 2: Distribution of user ranks for a given topic 14
  • 7. Variation of a user’s influence across topics 100 Ranks (Sw ne, Jackson) Swine Next, in order to examine how volatile a user’s rank is across Jackson different topics, we compare the relative order of influence 80 ranks across topics in Table 3. For the same reason we previ- 60 ously discussed in Table 1, we ignore the least popular users who have tied ranks and focus on the set of relatively pop- 40 ular users, as measured by indegree. The top 10th and 1st 20 percentiles included 1,322 and 132 users, respectively. 0 0 5 10 15 20 Table 3: Spearman’s rank correlation coefficients over topics Retweet Mentions Ranks (Iran) Topics Top 10% 1% Top 10% 1% Figure 3: Users’ retweet influence ranks over topics Iran vs. Swine 0.54 0.62 0.59 0.68 Iran vs. Jackson 0.48 0.54 0.59 0.63 The rise and fall of influentials over time Swine vs. Jackson 0.55 0.50 0.80 0.68 Many factors—social, political, and economical—affect popularity and influence of individuals and organizations. In The rank correlation is generally high (above 0.5) and gets online social media, such dynamics is facilitated by easy en- stronger for the top 1% of users. Mentions show an even try of competition. It only costs 140 characters to generate stronger correlation across topics than retweets. This means a tweet for any user. Likewise, it will be challenging for in- that a popular user who is good at spawning mentions from fluentials to maintain their status when many emerging local others can do so over a wide range of topics, more easily opinion leaders and evangelists enter the arena. than when she is retweeted over diverse topics. Among topic Here we analyze the dynamics of individual’s influence pairs, Swine and Jackson showed the highest correlation for over time in two ways. First, we track the popularity of top the top 10% of well-connected users for both the retweet and influentials over a long term period and check how well they mention influence. This is perhaps due to the more social maintain their ranks. Second, we focus on users who in- nature of these two events, which differs from the Iran topic creased their influence in a specific topic over a short time where special interest groups like politicians and bloggers period, in order to understand what behaviors make ordinary played a major role. individuals influential. Given that the rank correlation gets stronger for users with high indegree, we investigated how widely the ranks of the Maintaining engagement of the top influentials top listed users change by topic. Figure 3 shows the retweet Out of all 6 million active users in the Twitter network, we ranks of the top users in the Iran topic against the relative picked the top 100 users based on each of the three mea- retweet ranks of the same users in the Swine and Jackson sures: indegree, retweets, and mentions. We used all the topics. It is clear from the figure that the top 5 users in tweets ever posted on Twitter in identifying these influen- the Iran topic retained their relative ranks in the other two tials. Due to the overlap we discussed in Figure 1, there topics. These users were Mashable, CnnBrk, TweetMeme, were 233 distinct users, whom we call all-time influentials. Time, and BreakingNews, who were all in the category of In order to see how the influence of these all-time influen- authoritative news sources and content trackers. The men- tials varied over time, we tracked their influence scores over tion influence also showed a similar trend (not shown here); an 8 month period from January to August 2009. most influentials ranked consistently high amongst different To get a time-varying measure of influence, we counted topics. And we have observed this trend not only for the the number of retweets and mentions the all-time influen- hyper-influentials: moderate influentials like opinion lead- tials spawned in every 15 days over the 8 month period. ers and evangelists also had consistent influence ranks over Because we only know the indegree information based on diverse topics, as shown in Table 3. the final snapshot of the Twitter network, we do not use this Our findings about the highly skewed ability of users to measure. For each user, we computed a single explanatory influence others (Figure 2) and the strong correlation in a variable P: the probability that a random tweet posted on user’s influence rank across different topics (Table 3) to- Twitter during a 15 day period is a retweet (or a mention) of gether lead to two interesting conclusions. First, most influ- that user. Normalizing by the total number of tweets posted ential users hold significant influence over a variety of top- on Twitter is essential to cancel out any variable effect on the ics. This means that local opinion leaders and highly popular data and allows the underlying characteristics of the data sets figures could indeed be used to spread information outside to be compared. For instance, because the Twitter network their area of expertise. In fact, new advertisement campaigns quadrupled over time in terms of the registered users, the to- have recently been launched that insert advertisement links tal volume of tweets merely increased over time. Hence, if into a popular person’s tweet (Fiorillo 2009). Second, the we didn’t normalize the results, the trend wouldn’t be inter- power-law trend in the difference among influence of indi- esting. Google similarly normalizes the data when analyzing viduals indicates that it is substantially more effective to tar- their search trends (Ginsberg et al. 2009). get the top influentials than to employ a massive number of Figure 4 displays the time evolution of the normalized non-popular users in order to kick start a viral campaign. retweets and mentions of the 233 all-time influential users. 15
  • 8. 0.0015 0.004 Retweet probability P Mention probability P Top 10 users Top 10 users 0.0012 Top 11 100 0.003 Top 11 100 Top 101 233 Top 101 233 0.0009 0.002 0.0006 0.001 0.0003 0 0 Jan Feb Mar Apr May Jun Jul Aug Jan Feb Mar Apr May Jun Jul Aug Time (year 2009) Time (year 2009) (a) Retweet influence (b) Mention influence Figure 4: The temporal evolution of retweets and mentions for the all-time influential users. For each data point, the error bars are centered on the average retweet (or mention) probability, and they extend up and down by two standard deviations. Although the values of P appear small, they account for cant efforts in conversing with others (e.g., replying to their a large volume of retweets and mentions: a single most audience). In a sense, they need self-advertisement the most, popular influential user spawned up to 20,000 retweets and because mass media and celebrities have many other on- and 50,000 mentions over a random 15 day period. In order to off-line channels of to promote themselves. capture the trend in more detail, we classified the influen- While our findings provide an interesting view of how dif- tials into three groups based on indegree: the top 10 users, ferent groups of people maintain their popularity, we should who were mostly the mainstream news sources; the next 90 also emphasize that our analysis is in retrospect but not users, who were mainly celebrities; and the rest, who were a causal. These findings are based on the set of users who mixed group of public figures and opinion leaders that com- ultimately became popular. We also mention that all influ- peted with the traditional mass media. entials put efforts in posting creative and interesting tweets, The evolution of the retweet probability in Figure 4(a) as shown from the high correlation in their retweet ranks. shows that all three groups mildly increased their influence over time. Large error ranges for the top 10 users indicate a Rising influence of the ordinary users much wider variability in the popularity of this group. The Finally we examined the users who increased their influence growth, however, is marginal for all three groups. We con- over a short time period to understand what behaviors make jecture that the marginal increase is due to the limited num- ordinary users influential. We focused on the set of users ber of tweets users post a day. Broadcasting too many tweets who talked about only one news topic, out of the three news puts even popular users at a risk of being classified as spam- topics in Table 2, and picked the top 20 users for each news mers. Hence, Twitter users should moderate the number of topic based on indegree. We call these users topical influ- broadcast tweets in order to avoid crossing their followers’ entials. The topical influentials included dedicated accounts information processing limit. like iranbaan, oxfordgirl, and TM Outbreak who suddenly The evolution of the mention probability in Figure 4(b) became popular over the course of the event. These users shows distinct patterns for the three groups. The top 10 were literally unheard-of at the beginning of the news events users fell in popularity over time; the next group had consis- and didn’t receive any retweets or mentions prior to the rel- tent influence scores over time; and the last group (the least evant news event. The list also included users like kevinrose connected among the three) increased their popularity over (the founder of digg.com) and 106andpark (BET.com’s mu- time. Users in the last group, surprisingly, spawned on aver- sic video site) who were already popular on a specific topic, age more mentions than the top 100 users. While this trend and used the news events to extend their popularity. is counter-intuitive at first, the differences between mentions We tracked the influence scores of the 60 topical influ- and retweets can explain the trend. entials over the 8 month in 2009. Again, we computed the The mainstream news organizations in the first group are variable P as the probability that a random tweet posted in retweeted the most, but they are not mentioned the most. a 15 day period is a retweet (or a mention) to that user. Fig- This is because their names come up mostly when their con- ure 5 displays the temporal evolution of influence for the tent get retweeted; it is hard for media sources to engage topical influentials. Overall, the influence scores are much users with their identities alone. The second group, com- lower than that of the global influentials in Figure 4. This is prised of celebrities, is more often mentioned than retweeted expected since topical influentials had 3 to 180 times fewer because of their name value. Their tweets also get retweeted, followers than the global influentials. when influence is transferred across topics (Table 3). Evan- Influentials on Swine flu experienced relatively stable in- gelists in the last group successfully increased their influ- fluence scores for both retweets and mentions. This is be- ence. While many factors could explain this phenomenon, cause no single day involved a catastrophic event due to our manual inspection revealed that these users put signifi- Swine flu. Influentials on the Iran election increased their 16
  • 9. 0.0005 Retweet probability P Mention probability P Iran 0.0006 Iran 0.0004 Jackson Jackson Swine Swine 0.0003 0.0004 0.0002 0.0002 0.0001 0 0 Jan Feb Mar Apr May Jun Jul Aug Jan Feb Mar Apr May Jun Jul Aug Time (year 2009) Time (year 2009) (a) Retweet influence (b) Mention influence Figure 5: The temporal evolution of retweets and mentions for the topical influential users retweet influence substantially during the peak of the elec- References tion in June and July, but they did not spawn many men- Avnit, A. 2009. The Million Followers Fallacy, Internet Draft, tions. Our data shows that these influentials actively spread Pravda Media. http://tinyurl.com/nshcjg. information about protests and controversial news. Influen- Berry, J., and Keller, E. 2003. The Influentials: One American in tials on Michael Jackson experienced a mild increase in their Ten Tells the Other Nine How to Vote, Where to Eat, and What to retweet influence in June, whereas their mention influence Buy. Free Press. surged during the same time period. Our manual inspection Buck, W. 1980. Tests of Significance for Point-Biserial Rank Cor- revealed that users who limit their tweets to a single topic relation Coefficients in the Presence of Ties. Biometrical Journal showed the largest increase in their influence scores. 22(2):153–158. Cha, M.; Mislove, A.; and Gummadi, K. P. 2009. A Masurement- Conclusions Driven Analysis of Information Propagation in the Flickr Social Network. In WWW. This paper analyzed the influence of Twitter users by em- Domingos, P., and Richardson, M. 2001. Mining the Network ploying three measures that capture different perspectives: Value of Customers. In ACM SIGKDD. indegree, retweets, and mentions. We found that indegree Fiorillo, J. 2009. Twitter Advertising: Pay-Per-Tweet, Wikinomics. represents a user’s popularity, but is not related to other im- http://tinyurl.com/d89uu9. portant notions of influence such as engaging audience, i.e., Ginsberg, J.; Mohebbi, M. H.; Patel, R. S.; Brammer, L.; Smolin- retweets and mentions. Retweets are driven by the content ski, M. S.; and Brilliant, L. 2009. Detecting Influenza Epidemics value of a tweet, while mentions are driven by the name Using Search Engine Query Data. Nature 457. value of the user. Such subtle differences lead to dissimilar Gladwell, M. 2002. The Tipping Point: How Little Things Can groups of the top Twitter users; users who have high inde- Make a Big Difference. Back Bay Books. gree do not necessarily spawn many retweets or mentions. Goyal, A.; Bonchi, F.; and Lakshmanan, L. V. S. 2010. Learning This finding suggests that indegree alone reveals very little Influence Probabilities In Social Networks. In ACM WSDM. about the influence of a user. Granovetter, M. 1973. The Strength of Weak Ties. American Focusing on retweets and mentions, we studied the dy- Journal of Sociology 78(6). namics of influence across topics and time. Our spatial anal- Katz, E., and Lazarsfeld, P. 1955. Personal Influence: The Part ysis showed that most influential users can hold significant Played by People in the Flow of Mass Communications. New York: influence over a variety of topics. The top Twitter users The Free Press. had a disproportionate amount of influence, which was in- Leavitt, A.; Burchard, E.; Fisher, D.; and Gilbert, S. 2009. The dicated by a power-law distribution. Our temporal analy- Influentials: New Approaches for Analyzing Influence on Twitter. sis identified how different types of influentials interact with Web Ecology Project, http://tinyurl.com/lzjlzq. their audience. Mainstream news organizations consistently Leskovec, J.; Adamic, L. A.; and Huberman, B. A. 2007. The spawned a high level of retweets over diverse topics. In con- Dynamics of Viral Marketing. ACM Trans. on the Web (TWEB). trast, celebrities were better at inducing mentions from their Levy, M., and Windhal, S. 1985. Media Gratications Research, audience. This is because the name value of the mention in- Chapter The Concept of Audience Activity. Sage. fluentials helped them get responses from others, rather than Rogers, E. M. 1962. Diffusion of Innovations. Free Press. any inherent value in the content they posted. Watts, D., and Dodds, P. 2007. Influentials, Networks, and Public Finally, we found that influence is not gained sponta- Opinion Formation. Journal of Consumer Research. neously or accidentally, but through concerted effort. In or- Watts, D. 2007. Challenging the Influentials Hypothesis. Word of der to gain and maintain influence, users need to keep great Mouth Marketing Association. personal involvement. This could mean that influential users are more predictable than suggested by theory (Watts 2007), Weng, J.; Lim, E.-P.; Jiang, J.; and He, Q. 2010. TwitterRank: Finding Topic-Sensitive Influential Twitterers. In ACM WSDM. shedding light on how to identify emerging influential users. 17