SlideShare uma empresa Scribd logo
1 de 35
Exploring the Streams of Online
Community Conversation:
Insights into the Twitterverse
@stephendann
Australian National University
Usual House Rules
@stephendann for questions
#anzmac13 for commentary
A little context
The Past
• Dann (2010)
– Six top level twitter
categories
– 23 sub domains
• Dann (2011)
– Six top level
– 28 sub domain
The Present
• Dann (Today)
– Six Top Level Categories
• No sub domain analysis
– Secondary Processing
• Leximancer
• Linguistic Inquiry Word
Count
Twitter Analysis 2.0.14
The Procedure
Acquire Research Question
• Does Event X change the tweeting patterns of Account @Y?
• Do responses to the #hashtag event change over time?
– #EventTags in Time Period A will have more Status than in Time Period D
– Time Period D will have more Pass Along than Status
• What were they thinking?
– Dominant Categories of tweets over time within a selected account
• Do comments change by platform for account @X?
– mobile versus web versus desktop
• Does @BrandX engage with the community?
– Conversational over all other types over capture time period
Acquire your data
• Personal timelines
– Download from Twitter
• #Hashtag captures
– Hootsuite
• Time line captures
– Choose your own adventure
– Getting worse, harder and
Twitter’s API is less available.
• Try to avoid big data
Big Data
• If you are Axel Bruns, fine, continue
– http://mappingonlinepublics.net/
• For everyone else, what are you looking for?
– What sample suits your research question?
Process your data
• Stand by for ugliness and manual coding*
– Extract data into Excel
• Excel allows for additional data inputs as you progress the
analysis
– Keep tweet visible
• Only keep a column visible if it fits your research question
– Eg date, time, @user, platform
– Add column for Tweet ID, category, cat_n
• Sub category, sub_cat_n for the detailed version
*Automated coding? People are working on it. It’s a terrible idea that’ll happen anyway
Manual Coding
• Use the Dann (2010) or Dann (2011) top level
domains
– Dann (201X) is under development
• I broke something important earlier this year
• Manual coding is superior
– Nuance and interpretation counts.
Pick a box
1 Conversational Uses an @statement to address another user
2 News Events Identifiable news content
3 Pass along Tweets of endorsement of content
4 Phatic Content independent connected presence
5 Status
Tweets which address the statement "What are you doing?"
and "What's happening?" in terms of an account holder's
experiences
6 Spam Unsolicited content
Keep it on manual
Conversational Uses an @statement to address another user
1.1 Action
Activities involving other Twitter users, or tweets which
describe the presence of other Twitter users.
1.2 Query
Any statement style tweet that ends with a question mark, as it
represents an active attempt to engage responses from the
community
1.3 Referral
An @response which contains URLs or recommendation of
other Twitter users. (Excludes RT @user)
1.4 Response
Classification for tweets which commence with another user’s
name and which do not meet the requirements of the referral
category
1.5 Rhetoric Question
Asked and answered within the same tweet (distinct from
Conversational - Query) which may not require (but may elicit)
audience response
Upgrades
Pass along Tweets of endorsement of content
3.1 Automated
Endorsement Status announcements triggered by third party applications which publish URLs
3.2
Endorsement Links to web content not created by the sender
3.3 Retweet Any statement reproducing another Twitter status using the via @ or RT protocol
3.4 Secondary
Social Media Links to Facebook (fb.me) or similar social media platform
3.5 User
generated
content Links to own content created by the user
3.6 Quote
Comment marked with “ “ to represent a direct quote, paraphrase of a statement
without a source URL, including reference to offline speaker or overheard (OH)
3.7 Cite
Any tweet which contains a reference in a recognised Harvard, Oxford or similar
format
3.8 Modified
ReTweet Acknowledgement of the use of MT protocol to allow for an edited RT.
Speed Hacking Excel
• Speed hacks exist
– Alphabet Tweet Sort
• @, RT, MT cluster
• “Find all” selecting.
Coding Time!
• Cross check the coding
– Some variance is okay
– Resolve it through the
usual traditions
Sample Data #qldquake
Coded
Analysis Table Block
Category
Tweet
(TCat)
Tweet
Ratio
Max
Density
Actual
Characters
Character
Density
Density
Ratio
Conversational
News
Pass Along
Phatic
Spam
Status
n
Tweet Math Dude
• Tweet Count
– N per category
• Calculate the Tweet Ratio
– Tweet ratio is a normalized rank order of the highest
volume of tweets, where the most common category is
scored as 1
• Calculating the Tweet Ratio
– Highest number of tweets in a single category = TTMax
– Tweets per category = TCat
– Ratio is Tcat / TTMax
I’m only mildly mocking statistical analysis here
Maximum Character Density
• Max Density = 140 x TCat [number of tweets in
each category]
• Theoretical range for a tweet is between 1 and 140
characters
• Maximum tweet is 140 characters
• More characters used, more information density
• Calculate Character Density
– (Actual Character / Max Density)
• Divide each CharDensity score by the highest Char density
• Normalise CharDensity score to rank order
Reporting the Data
Category
Tweet
(TCat)
Tweet
Ratio
Max
Density
Actual
Characters
Character
Density
Density
Ratio
Conversational 39 0.08 5460 3533 65% 0.81
News 41 0.08 5740 3778 66% 0.83
Pass Along 481 1 67340 53491 79% 1.00
Phatic 21 0.04 2940 2179 74% 0.93
Spam 1 0.00 140 81 58% 0.73
Status 18 0.03 2520 1543 61% 0.77
n 601 84140 64605 77%
Reporting the Data
0
0.2
0.4
0.6
0.8
1
1.2
Conversational
News
Pass Along
Phatic
Spam
Status
Ratio Density
Text Analysis Wave 1
Linguistic Inquiry Word Count
So. Very. Fast.
LIWC
• http://www.liwc.net/
– text analysis software
– calculates the degree to which people use
different categories of words in texts
• 70 other language dimensions.
– positive or negative emotions,
– self-references,
– causal words,
A giant bucket of data
• 70 variables
– So have a hypothesis and a purpose for the
analysis
• Differences in tweet construction
– Word Counts
– Unique Words
Results
Average Word Count (AWC) Unique Word Count (UWC)
Category AWC AWC_Ratio
Conversational
12.82 0.78
News 13.56 0.82
Pass Along
16.35 1
Phatic 15.42 0.94
Status 12.94 0.79
Category UWC UWC_Ratio
Conversational
93 0.97
News 93 0.97
Pass Along
92 0.96
Phatic 93 0.97
Status 96 1
Results
Word Count Unique Word
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Conversational
News
Pass AlongPhatic
Status
0.93
0.94
0.95
0.96
0.97
0.98
0.99
1
Conversational
News
Pass AlongPhatic
Status
Chart Title
Text Analysis Wave 2
Leximancer
Leximancer
• Import into Leximancer as an individual
analysis (individual project)
– Edit Pre processing options: Sentence per block 1
– Run to Generate Outputs
– Generate Concept Map
Map time!
Four sample maps
Entirely because quadrants fit on screens better than hexes. No other reason
conversational
news
pass along
phatic
Tweet Network Density
• Calculate Network Density
– Count Nodes (n)
– Count Actual Connections (e) Edges (paths
between nodes)
– Calculate Network density based on 2e / n(n-1)
• Network Density Notes
– Calculate potential connections
Pass Along Network
Nodes Edges Network Density
15 15 0.14
Network Density Results
Category Nodes Edges
Network
Density
Conversational 13 12 0.15
News 18 17 0.11
Pass Along 15 15 0.14
Phatic 3 2 0.67
Status 4 3 0.50
n 19 17 0.10
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Conversational
News
Pass AlongPhatic
Status
One Bucket of Data
• This is why a research question is important
– You can map a range of information
– None of it is useful without the RQ / hypothesis
– It’s pretty, but not valuable
Category Tweet Density Network Ave.WC
Unique
Words
Conversatio
nal 0.081081 0.819075 0.814598 0.830959 0.96875
News 0.085239 0.83315 0.828595 0.878952 0.96875
Pass Along 1 1.005496 1 1.059722 0.958333
Phatic 0.043659 0.938173 0.933044 1 0.96875
Status 0.037422 0.775065 0.770829 0.838992 1
0
0.2
0.4
0.6
0.8
1
1.2
Tweet Density Network Ave.WC Unique Words
Chart Title
Conversational News Pass Along Phatic Status
Questions?
• @stephendann
• Stephen.dann@anu.edu.au
• stephen@stephendann.net

Mais conteúdo relacionado

Semelhante a Insights into the Twitterverse: Benchmarking and analysis twitter content

Tutorial of Sentiment Analysis
Tutorial of Sentiment AnalysisTutorial of Sentiment Analysis
Tutorial of Sentiment Analysis
Fabio Benedetti
 
Twitter analytics -digiworldhanoi.vn
Twitter analytics  -digiworldhanoi.vnTwitter analytics  -digiworldhanoi.vn
Twitter analytics -digiworldhanoi.vn
Digiword Ha Noi
 
NPIN twitter chat NCHCMM 2012
NPIN twitter chat NCHCMM 2012NPIN twitter chat NCHCMM 2012
NPIN twitter chat NCHCMM 2012
CDC NPIN
 

Semelhante a Insights into the Twitterverse: Benchmarking and analysis twitter content (20)

Twitter analysis - Data as factor for designing the right communication star...
Twitter analysis  - Data as factor for designing the right communication star...Twitter analysis  - Data as factor for designing the right communication star...
Twitter analysis - Data as factor for designing the right communication star...
 
#ANZMAC2014 Twitter Content Analysis Framework: Classification and Coding of ...
#ANZMAC2014 Twitter Content Analysis Framework: Classification and Coding of ...#ANZMAC2014 Twitter Content Analysis Framework: Classification and Coding of ...
#ANZMAC2014 Twitter Content Analysis Framework: Classification and Coding of ...
 
110917_0900_Karimi.pdf
110917_0900_Karimi.pdf110917_0900_Karimi.pdf
110917_0900_Karimi.pdf
 
Language of Politics on Twitter - 03 Analysis
Language of Politics on Twitter - 03 AnalysisLanguage of Politics on Twitter - 03 Analysis
Language of Politics on Twitter - 03 Analysis
 
Data Science Popup Austin: Using lda and Structural Topic Modeling to Explore...
Data Science Popup Austin: Using lda and Structural Topic Modeling to Explore...Data Science Popup Austin: Using lda and Structural Topic Modeling to Explore...
Data Science Popup Austin: Using lda and Structural Topic Modeling to Explore...
 
Conversations in Context: A Twitter Case for Social Media Systems Design
Conversations in Context: A Twitter Case for Social Media Systems DesignConversations in Context: A Twitter Case for Social Media Systems Design
Conversations in Context: A Twitter Case for Social Media Systems Design
 
Tutorial of Sentiment Analysis
Tutorial of Sentiment AnalysisTutorial of Sentiment Analysis
Tutorial of Sentiment Analysis
 
Explaining Controversy on Social Media via Stance Summarization
Explaining Controversy on Social Media via Stance SummarizationExplaining Controversy on Social Media via Stance Summarization
Explaining Controversy on Social Media via Stance Summarization
 
Twitter 101
Twitter 101Twitter 101
Twitter 101
 
Trending Topic in Social Networks
Trending Topic in Social NetworksTrending Topic in Social Networks
Trending Topic in Social Networks
 
A User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging
A User Modeling Oriented Analysis of Cultural Backgrounds in MicrobloggingA User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging
A User Modeling Oriented Analysis of Cultural Backgrounds in Microblogging
 
Identification and Classification of the Most Important Moments in Students’ ...
Identification and Classification of the Most Important Moments in Students’ ...Identification and Classification of the Most Important Moments in Students’ ...
Identification and Classification of the Most Important Moments in Students’ ...
 
Twitter analytics
Twitter analyticsTwitter analytics
Twitter analytics
 
Laboratorio Master BI&BDA (Modulo Web Data Analytics) : Reddit fashion insights
Laboratorio Master BI&BDA (Modulo Web Data Analytics) : Reddit fashion insightsLaboratorio Master BI&BDA (Modulo Web Data Analytics) : Reddit fashion insights
Laboratorio Master BI&BDA (Modulo Web Data Analytics) : Reddit fashion insights
 
How Companies Engage Customers Around Accessibility on Social Media
How Companies Engage Customers Around Accessibility on Social MediaHow Companies Engage Customers Around Accessibility on Social Media
How Companies Engage Customers Around Accessibility on Social Media
 
Planning to Evaluate Earned, Social/Digital Media Campaigns
Planning to Evaluate Earned, Social/Digital Media CampaignsPlanning to Evaluate Earned, Social/Digital Media Campaigns
Planning to Evaluate Earned, Social/Digital Media Campaigns
 
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
Sld-Natural-Language-Processing-for-large-volumes-of-human-text-data-Sozzi-Br...
 
Sentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 PlusSentiment Analysis with NVivo 11 Plus
Sentiment Analysis with NVivo 11 Plus
 
Twitter analytics -digiworldhanoi.vn
Twitter analytics  -digiworldhanoi.vnTwitter analytics  -digiworldhanoi.vn
Twitter analytics -digiworldhanoi.vn
 
NPIN twitter chat NCHCMM 2012
NPIN twitter chat NCHCMM 2012NPIN twitter chat NCHCMM 2012
NPIN twitter chat NCHCMM 2012
 

Mais de Stephen Dann

Mais de Stephen Dann (20)

Thursday Session Play for Today.pptx
Thursday Session Play for Today.pptxThursday Session Play for Today.pptx
Thursday Session Play for Today.pptx
 
Lego Serious Play and Cocreation
Lego Serious Play and CocreationLego Serious Play and Cocreation
Lego Serious Play and Cocreation
 
Questioning the lego cocreation talisman
Questioning the lego cocreation talismanQuestioning the lego cocreation talisman
Questioning the lego cocreation talisman
 
Cat herding or collaborative learning?
Cat herding or collaborative learning?Cat herding or collaborative learning?
Cat herding or collaborative learning?
 
AMSRS instagram Presentation (words on a screen mix)
AMSRS instagram Presentation (words on a screen mix)AMSRS instagram Presentation (words on a screen mix)
AMSRS instagram Presentation (words on a screen mix)
 
Measuring the “Telling” in the Selling of the Story presentation version
Measuring the “Telling” in the Selling of the Story presentation versionMeasuring the “Telling” in the Selling of the Story presentation version
Measuring the “Telling” in the Selling of the Story presentation version
 
English to legolish
English to legolishEnglish to legolish
English to legolish
 
Learning environment optimisation: Doing less with more for better outcomes
Learning environment optimisation: Doing less with more for better outcomesLearning environment optimisation: Doing less with more for better outcomes
Learning environment optimisation: Doing less with more for better outcomes
 
Mktg2023 assessment exam
Mktg2023 assessment examMktg2023 assessment exam
Mktg2023 assessment exam
 
Mktg2023 assessment lit review
Mktg2023 assessment lit reviewMktg2023 assessment lit review
Mktg2023 assessment lit review
 
Learning Environment Optimisation
Learning Environment OptimisationLearning Environment Optimisation
Learning Environment Optimisation
 
Brand on the Run: Political affiliations and Twitter social media presence
Brand on the Run: Political affiliations and Twitter social media presenceBrand on the Run: Political affiliations and Twitter social media presence
Brand on the Run: Political affiliations and Twitter social media presence
 
Bricks, Clicks and Order:
Bricks, Clicks and Order: Bricks, Clicks and Order:
Bricks, Clicks and Order:
 
The Antipodean Agenda
The Antipodean AgendaThe Antipodean Agenda
The Antipodean Agenda
 
Fear and lathering in las vegas
Fear and lathering in las vegasFear and lathering in las vegas
Fear and lathering in las vegas
 
The Antipodean Agenda
The Antipodean AgendaThe Antipodean Agenda
The Antipodean Agenda
 
Social media, politics and elections
Social media, politics and electionsSocial media, politics and elections
Social media, politics and elections
 
Australian Political Parties and social media: uses and attitudes
Australian Political Parties and social media: uses and attitudesAustralian Political Parties and social media: uses and attitudes
Australian Political Parties and social media: uses and attitudes
 
Education Showcase
Education ShowcaseEducation Showcase
Education Showcase
 
Mapping, planning, measuring
Mapping, planning, measuringMapping, planning, measuring
Mapping, planning, measuring
 

Último

Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
allensay1
 

Último (20)

Cannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 UpdatedCannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 Updated
 
Durg CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN durg ESCORTS
Durg CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN durg ESCORTSDurg CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN durg ESCORTS
Durg CALL GIRL ❤ 82729*64427❤ CALL GIRLS IN durg ESCORTS
 
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAIGetting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
 
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
 
SEO Case Study: How I Increased SEO Traffic & Ranking by 50-60% in 6 Months
SEO Case Study: How I Increased SEO Traffic & Ranking by 50-60%  in 6 MonthsSEO Case Study: How I Increased SEO Traffic & Ranking by 50-60%  in 6 Months
SEO Case Study: How I Increased SEO Traffic & Ranking by 50-60% in 6 Months
 
How to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityHow to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League City
 
Falcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investorsFalcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investors
 
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
Lundin Gold - Q1 2024 Conference Call Presentation (Revised)
 
joint cost.pptx COST ACCOUNTING Sixteenth Edition ...
joint cost.pptx  COST ACCOUNTING  Sixteenth Edition                          ...joint cost.pptx  COST ACCOUNTING  Sixteenth Edition                          ...
joint cost.pptx COST ACCOUNTING Sixteenth Edition ...
 
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All TimeCall 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
 
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
Ooty Call Gril 80022//12248 Only For Sex And High Profile Best Gril Sex Avail...
 
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
 
Pre Engineered Building Manufacturers Hyderabad.pptx
Pre Engineered  Building Manufacturers Hyderabad.pptxPre Engineered  Building Manufacturers Hyderabad.pptx
Pre Engineered Building Manufacturers Hyderabad.pptx
 
Uneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration PresentationUneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration Presentation
 
Falcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business PotentialFalcon Invoice Discounting: Unlock Your Business Potential
Falcon Invoice Discounting: Unlock Your Business Potential
 
Putting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptxPutting the SPARK into Virtual Training.pptx
Putting the SPARK into Virtual Training.pptx
 
UAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur Dubai
UAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur DubaiUAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur Dubai
UAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur Dubai
 
Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1
 
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDINGBerhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
 
New 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck TemplateNew 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck Template
 

Insights into the Twitterverse: Benchmarking and analysis twitter content

  • 1. Exploring the Streams of Online Community Conversation: Insights into the Twitterverse @stephendann Australian National University
  • 2. Usual House Rules @stephendann for questions #anzmac13 for commentary
  • 3. A little context The Past • Dann (2010) – Six top level twitter categories – 23 sub domains • Dann (2011) – Six top level – 28 sub domain The Present • Dann (Today) – Six Top Level Categories • No sub domain analysis – Secondary Processing • Leximancer • Linguistic Inquiry Word Count
  • 5. Acquire Research Question • Does Event X change the tweeting patterns of Account @Y? • Do responses to the #hashtag event change over time? – #EventTags in Time Period A will have more Status than in Time Period D – Time Period D will have more Pass Along than Status • What were they thinking? – Dominant Categories of tweets over time within a selected account • Do comments change by platform for account @X? – mobile versus web versus desktop • Does @BrandX engage with the community? – Conversational over all other types over capture time period
  • 6. Acquire your data • Personal timelines – Download from Twitter • #Hashtag captures – Hootsuite • Time line captures – Choose your own adventure – Getting worse, harder and Twitter’s API is less available. • Try to avoid big data
  • 7. Big Data • If you are Axel Bruns, fine, continue – http://mappingonlinepublics.net/ • For everyone else, what are you looking for? – What sample suits your research question?
  • 8. Process your data • Stand by for ugliness and manual coding* – Extract data into Excel • Excel allows for additional data inputs as you progress the analysis – Keep tweet visible • Only keep a column visible if it fits your research question – Eg date, time, @user, platform – Add column for Tweet ID, category, cat_n • Sub category, sub_cat_n for the detailed version *Automated coding? People are working on it. It’s a terrible idea that’ll happen anyway
  • 9. Manual Coding • Use the Dann (2010) or Dann (2011) top level domains – Dann (201X) is under development • I broke something important earlier this year • Manual coding is superior – Nuance and interpretation counts.
  • 10. Pick a box 1 Conversational Uses an @statement to address another user 2 News Events Identifiable news content 3 Pass along Tweets of endorsement of content 4 Phatic Content independent connected presence 5 Status Tweets which address the statement "What are you doing?" and "What's happening?" in terms of an account holder's experiences 6 Spam Unsolicited content
  • 11. Keep it on manual Conversational Uses an @statement to address another user 1.1 Action Activities involving other Twitter users, or tweets which describe the presence of other Twitter users. 1.2 Query Any statement style tweet that ends with a question mark, as it represents an active attempt to engage responses from the community 1.3 Referral An @response which contains URLs or recommendation of other Twitter users. (Excludes RT @user) 1.4 Response Classification for tweets which commence with another user’s name and which do not meet the requirements of the referral category 1.5 Rhetoric Question Asked and answered within the same tweet (distinct from Conversational - Query) which may not require (but may elicit) audience response
  • 12. Upgrades Pass along Tweets of endorsement of content 3.1 Automated Endorsement Status announcements triggered by third party applications which publish URLs 3.2 Endorsement Links to web content not created by the sender 3.3 Retweet Any statement reproducing another Twitter status using the via @ or RT protocol 3.4 Secondary Social Media Links to Facebook (fb.me) or similar social media platform 3.5 User generated content Links to own content created by the user 3.6 Quote Comment marked with “ “ to represent a direct quote, paraphrase of a statement without a source URL, including reference to offline speaker or overheard (OH) 3.7 Cite Any tweet which contains a reference in a recognised Harvard, Oxford or similar format 3.8 Modified ReTweet Acknowledgement of the use of MT protocol to allow for an edited RT.
  • 13. Speed Hacking Excel • Speed hacks exist – Alphabet Tweet Sort • @, RT, MT cluster • “Find all” selecting.
  • 14. Coding Time! • Cross check the coding – Some variance is okay – Resolve it through the usual traditions
  • 16. Coded
  • 18. Tweet Math Dude • Tweet Count – N per category • Calculate the Tweet Ratio – Tweet ratio is a normalized rank order of the highest volume of tweets, where the most common category is scored as 1 • Calculating the Tweet Ratio – Highest number of tweets in a single category = TTMax – Tweets per category = TCat – Ratio is Tcat / TTMax I’m only mildly mocking statistical analysis here
  • 19. Maximum Character Density • Max Density = 140 x TCat [number of tweets in each category] • Theoretical range for a tweet is between 1 and 140 characters • Maximum tweet is 140 characters • More characters used, more information density • Calculate Character Density – (Actual Character / Max Density) • Divide each CharDensity score by the highest Char density • Normalise CharDensity score to rank order
  • 20. Reporting the Data Category Tweet (TCat) Tweet Ratio Max Density Actual Characters Character Density Density Ratio Conversational 39 0.08 5460 3533 65% 0.81 News 41 0.08 5740 3778 66% 0.83 Pass Along 481 1 67340 53491 79% 1.00 Phatic 21 0.04 2940 2179 74% 0.93 Spam 1 0.00 140 81 58% 0.73 Status 18 0.03 2520 1543 61% 0.77 n 601 84140 64605 77%
  • 22. Text Analysis Wave 1 Linguistic Inquiry Word Count So. Very. Fast.
  • 23. LIWC • http://www.liwc.net/ – text analysis software – calculates the degree to which people use different categories of words in texts • 70 other language dimensions. – positive or negative emotions, – self-references, – causal words,
  • 24. A giant bucket of data • 70 variables – So have a hypothesis and a purpose for the analysis • Differences in tweet construction – Word Counts – Unique Words
  • 25. Results Average Word Count (AWC) Unique Word Count (UWC) Category AWC AWC_Ratio Conversational 12.82 0.78 News 13.56 0.82 Pass Along 16.35 1 Phatic 15.42 0.94 Status 12.94 0.79 Category UWC UWC_Ratio Conversational 93 0.97 News 93 0.97 Pass Along 92 0.96 Phatic 93 0.97 Status 96 1
  • 26. Results Word Count Unique Word 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Conversational News Pass AlongPhatic Status 0.93 0.94 0.95 0.96 0.97 0.98 0.99 1 Conversational News Pass AlongPhatic Status Chart Title
  • 27. Text Analysis Wave 2 Leximancer
  • 28. Leximancer • Import into Leximancer as an individual analysis (individual project) – Edit Pre processing options: Sentence per block 1 – Run to Generate Outputs – Generate Concept Map
  • 30. Four sample maps Entirely because quadrants fit on screens better than hexes. No other reason conversational news pass along phatic
  • 31. Tweet Network Density • Calculate Network Density – Count Nodes (n) – Count Actual Connections (e) Edges (paths between nodes) – Calculate Network density based on 2e / n(n-1) • Network Density Notes – Calculate potential connections
  • 32. Pass Along Network Nodes Edges Network Density 15 15 0.14
  • 33. Network Density Results Category Nodes Edges Network Density Conversational 13 12 0.15 News 18 17 0.11 Pass Along 15 15 0.14 Phatic 3 2 0.67 Status 4 3 0.50 n 19 17 0.10 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Conversational News Pass AlongPhatic Status
  • 34. One Bucket of Data • This is why a research question is important – You can map a range of information – None of it is useful without the RQ / hypothesis – It’s pretty, but not valuable Category Tweet Density Network Ave.WC Unique Words Conversatio nal 0.081081 0.819075 0.814598 0.830959 0.96875 News 0.085239 0.83315 0.828595 0.878952 0.96875 Pass Along 1 1.005496 1 1.059722 0.958333 Phatic 0.043659 0.938173 0.933044 1 0.96875 Status 0.037422 0.775065 0.770829 0.838992 1 0 0.2 0.4 0.6 0.8 1 1.2 Tweet Density Network Ave.WC Unique Words Chart Title Conversational News Pass Along Phatic Status