Linking Topics of News and Blogs with
Wikipedia for Complementary Navigation

      Yuki Sato† Daisuke Yokomoto†
   Hiroyuki Nakasaki† Mariko Kawaba‡
  Takehito Utsuro† Tomohiro Fukuhara††

             †University of Tsukuba
 ‡NTT Cyber Space Laboratories, NTT Corporation
             ††University of Tokyo

                                              1
3 Information Sources on the Web
[Diagram: Events in the Real World, covered by three sources]
  Wikipedia: fundamental background facts and knowledge
  News: report precise facts
  Blog: subjective information: personal opinions and experiences
                                                 2
3 Information Sources on the Web
[Diagram: Events in the Real World, covered by three sources]
  Wikipedia: fundamental background facts and knowledge
  News: report precise facts
  Blog: subjective information: personal opinions and experiences

  Purpose of the research:
  Linking Topics of News and Blogs with Wikipedia
                                                 3
July 2009, Dragon Quest 9
       (console role-playing game
        for the Nintendo DS)
         was released in Japan.


                                    4
3 Information Sources on the Web
[Diagram: the event "Dragon Quest 9 on sale" in the Real World, covered by three sources]
  Wikipedia: fundamental background facts and knowledge
  News: report precise facts
  Blog: subjective information: personal opinions and experiences
                                                 6
Overview of the Framework of Complementary Navigation
[Diagram: Wikipedia, News, and Blog arranged around Events in the Real World,
 with a Complementary Navigation link between each pair of sources]
  Wikipedia: fundamental background facts and knowledge
  News: report precise facts
  Blog: subjective information: personal opinions and experiences
                                                11
Outline of the Talk
• Purpose of the Work:
  Linking Topics of News and Blogs with Wikipedia
• Mock-up: from News to closely related Blog Posts
• Details of the Proposed Method
  – Ranking related Wikipedia Entries
    given a News Article
  – Ranking related Bloggers/Blog Posts
    given an Index Wikipedia Entry
• Evaluation
• Conclusion and Future Works
                                                12
[Mock-up screenshot: a News Article with a Blog Search result]
  Blog post found: queue of people waiting for Dragon Quest 9 to go on sale.
                                        14
Complementary Navigation from News to Blog:
                Our Approach
Wikipedia Entries as Conceptual Search Index
1. Search for Wikipedia Entries closely related to the given News Article
2. Search for Bloggers/Blog Posts closely related to the Index Wikipedia Entries
[Diagram: News -> Wikipedia (Use as Search Index) -> Blog]
                                                   15
[Mock-up screenshot: a News Article with Related Topics and a Blog Search result]
  Blog post found: queue of people waiting for Dragon Quest 9 to go on sale.
                                           16
Complementary Navigation from News to Blog:
                Our Approach
Wikipedia Entries as Conceptual Search Index
1. Search for Wikipedia Entries closely related to the given News Article
[Diagram: News -> Wikipedia (Use as Search Index), Blog]
                                          17
[Mock-up screenshot: a News Article with Related Topics and Blog Search]
                           18
[Mock-up screenshot: a News Article with Related Topics and a Blog Search result]
  Blog post found: queue of people waiting for Dragon Quest 9 to go on sale.
                                           19
Selected Topic
[Screenshot: Blog Post Ranking for the Selected Topic, showing the 1st and 2nd ranked blog posts]
                               20
[Mock-up screenshot: a News Article with Related Topics and a Blog Search result]
  Blog post found: queue of people waiting for Dragon Quest 9 to go on sale.
                                           21
Another Topic which damages
     Blog Post Ranking

 Blog Post Ranking (damaged)
    1st: Parade of a famous festival in Kyoto
    2nd: Queue of a Noodle Restaurant in Shibuya
                                                       22
Precision of Top 10 ranked Blog Posts per Topic
[Mock-up screenshot: a News Article, its Related Topics, and Blog Search]
  Precision per Related Topic: 50%, 40%, 60%, 0%, 10%, 30%, 10%, 0%, 0%, 10%
  21% on the Average
  Improved to 50% after Manually Selecting Relevant Topics
                                             23
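The averages on this slide can be checked with a few lines of Python. This is a minimal sketch: the ten per-topic values are taken from the slide, but which topics were manually selected is not stated, so the choice of the three highest-precision topics below is an assumption that merely happens to reproduce the reported 50%.

```python
def mean_precision(precisions_pct):
    """Average of per-topic precision values given in percent."""
    return sum(precisions_pct) / len(precisions_pct)

# Per-topic precision (%) of the top 10 ranked blog posts, as listed on the slide.
per_topic = [50, 40, 60, 0, 10, 30, 10, 0, 0, 10]
overall = mean_precision(per_topic)

# ASSUMPTION (not stated on the slide): the manually selected relevant topics
# are the three topics with the highest precision.
selected = sorted(per_topic, reverse=True)[:3]
after_selection = mean_precision(selected)

print(overall, after_selection)
```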
Ranking Wikipedia Entries with Topic Related Terms
[Diagram: a News Article ("Dragon Quest 9 on sale") linked to the Top ranked 10
 Related Wikipedia Entries, e.g. Nintendo, Dragon Quest series, Game]
                                               26
Extracting topic-related terms
         from Wikipedia
Wikipedia describes background knowledge of the Topic

            Types of topic-related terms
            1. Bold Text
            2. Redirect (paraphrase of the entry title)
            3. Title of each paragraph
            4. Noun phrase in body text

            Extracted Related Terms (example)
            Super Mario Bros, Pokémon, Nintendo DS, Wii, Game, Sony,
            Role-playing game, Fighting game
                                                           27
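Two of the four term types can be pulled out of raw wikitext with regular expressions. The sketch below is an illustration, not the authors' extractor: it assumes standard MediaWiki markup ('''bold''' and == Heading ==), takes redirects as an already-known list, and leaves out noun phrases, which would need a part-of-speech tagger. The function name and the dictionary keys are hypothetical.

```python
import re

def extract_terms(wikitext: str, redirects: list[str]) -> dict[str, list[str]]:
    """Collect topic-related terms of three types from one Wikipedia entry."""
    # Bold text is written as '''term''' in MediaWiki markup.
    bold = re.findall(r"'''(.+?)'''", wikitext)
    # Section titles are written as == Title == (two or more '=' on each side).
    headings = re.findall(r"^==+\s*(.+?)\s*==+\s*$", wikitext, flags=re.MULTILINE)
    # Redirect titles are not part of the entry text, so they are passed in.
    return {"bold": bold, "redirect": list(redirects), "heading": headings}

sample = "'''Nintendo''' is a company.\n== History ==\nText.\n=== Early years ===\n"
print(extract_terms(sample, ["Nintendo Co."]))
```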
Ranking Wikipedia Entries with Topic Related Terms

    \[ \mathrm{WikiNewsScore}(e, n) = \sum_{t} \mathrm{weight}(\mathrm{type}(t)) \times \mathrm{freq}(t) \]

    type(t)            type of related term t
    weight(type(t))    weight of the type type(t)

    weight(type(t) = Redirect) = 1
    weight(type(t) = Bold text) = 1
    weight(type(t) = Title of each paragraph) = 1
    weight(type(t) = Noun phrase in body text) = 1

[Diagram: a News Article ("Dragon Quest 9 on sale") linked to the Top ranked 10
 Related Wikipedia Entries, e.g. Nintendo, Dragon Quest series, Game]
                                                                             28
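The score can be sketched in Python as follows. Several points are assumptions rather than statements from the slides: e is read as a Wikipedia entry and n as a news article, freq(t) is taken to be the number of case-insensitive occurrences of term t in the article text, and the function names and type keys are hypothetical. Plain substring counting also counts a short term inside a longer one ("Nintendo" inside "Nintendo DS"), which a real system would probably handle differently.

```python
# Type weights from the slide: all four types are weighted 1.
NEWS_WEIGHTS = {"redirect": 1.0, "bold": 1.0, "heading": 1.0, "noun_phrase": 1.0}

def wiki_news_score(entry_terms, news_text, weights=NEWS_WEIGHTS):
    """Sum weight(type(t)) * freq(t) over the related terms t of one entry.

    entry_terms: list of (term, type) pairs extracted from the entry.
    ASSUMPTION: freq(t) is the substring count of t in the news text.
    """
    text = news_text.lower()
    return sum(weights[term_type] * text.count(term.lower())
               for term, term_type in entry_terms)

def rank_entries(entries, news_text, top_k=10):
    """Return the top_k (title, score) pairs, highest score first."""
    scored = [(title, wiki_news_score(terms, news_text))
              for title, terms in entries.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]

entries = {
    "Nintendo": [("Nintendo", "bold"), ("Nintendo DS", "noun_phrase")],
    "Kyoto": [("Kyoto", "bold")],
}
news = "Dragon Quest 9 for the Nintendo DS went on sale; Nintendo expects long queues."
print(rank_entries(entries, news))
```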
Complementary Navigation from News to Blog:
                Our Approach
Wikipedia Entries as Conceptual Search Index
2. Search for Bloggers/Blog Posts closely related to the Index Wikipedia Entries
[Diagram: News, Wikipedia -> Blog]
                                             29
Search for Bloggers/Blog Posts
   closely related to the Index Wikipedia Entries
[Diagram: a News Article ("Dragon Quest 9 on sale") linked to the Top ranked 10
 Related Wikipedia Entries (e.g. Nintendo, Dragon Quest series, Game); the
 Related Blog Posts of each entry are ranked (1st, 2nd, 3rd ranked Blog Post)]
                                                                         30
Topic-related blog feed (blogger) retrieval
[Kawaba et al., ICWSM2008; Nakasaki et al., ICWSM2009]

  1. Usual search with the Yahoo! Japan Search API
  2. Count the hits of the topic keyword that appear in each feed
  3. Re-ranked list: list of blog feeds which have high hits of the topic keyword

  Blog feed: a group of blog posts which are written by the same blogger.
                                                                   31
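The re-ranking step can be sketched as a sort on per-feed hit counts. This is only an illustration under stated assumptions: the search call itself is not shown, and the feed names, the hit counts, and the `rerank_feeds` helper are hypothetical stand-ins for whatever the search API returns.

```python
def rerank_feeds(search_results, keyword_hits):
    """Order feeds by topic-keyword hits, highest first.

    search_results: feed identifiers in the order returned by the usual search.
    keyword_hits:   hypothetical mapping from feed identifier to hit count.
    sorted() is stable, so feeds with equal hits keep their search order.
    """
    return sorted(search_results,
                  key=lambda feed: keyword_hits.get(feed, 0),
                  reverse=True)

usual_search = ["feedA", "feedB", "feedC"]
hits = {"feedA": 3, "feedB": 25, "feedC": 12}
print(rerank_feeds(usual_search, hits))
```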
Selecting blog posts by topic-related terms
Requirements
Blog Feed: Topic Keyword Frequency in the Blog Feed: threshold 10
Blog post: At least one Topic Related Term included in the Blog Post
[Screenshot: an example blog post]
                                                                 32
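The two requirements can be sketched as a filter over feeds and their posts. The slide text gives the number 10 without a surviving comparison operator, so reading it as a minimum is an assumption, as are the data layout (a dictionary from blogger to a list of post texts) and the function name.

```python
# ASSUMPTION: the threshold of 10 is read as "at least 10 occurrences".
MIN_FEED_KEYWORD_FREQ = 10

def select_posts(feeds, topic_keyword, related_terms):
    """Keep posts that mention a related term and come from a qualifying feed."""
    selected = []
    for posts in feeds.values():
        # Feed-level requirement: total keyword frequency across the feed.
        feed_freq = sum(post.count(topic_keyword) for post in posts)
        if feed_freq < MIN_FEED_KEYWORD_FREQ:
            continue
        # Post-level requirement: at least one topic-related term in the post.
        selected.extend(post for post in posts
                        if any(term in post for term in related_terms))
    return selected

feeds = {
    "gamer": ["Dragon Quest " * 6 + "on my Nintendo DS", "Dragon Quest " * 5, "lunch today"],
    "foodie": ["Dragon Quest queue near the noodle shop on Nintendo DS"],
}
print(select_posts(feeds, "Dragon Quest", ["Nintendo DS", "Wii"]))
```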
Ranking Blog Posts with Topic Related Terms

    \[ \mathrm{WikiBlogScore}(e, b) = \sum_{t} \mathrm{weight}(\mathrm{type}(t)) \times \mathrm{freq}(t) \]

    type(t)            type of related term t
    weight(type(t))    weight of the type type(t)

    weight(type(t) = Redirect) = 3
    weight(type(t) = Bold text) = 2
    weight(type(t) = Anchor text) = 0.5

[Diagram: Related terms from Wikipedia for the entry "Nintendo" (Super Mario Bros,
 Pokémon, Nintendo DS, Wii, Game, Sony, Role-playing game, Fighting game) turn the
 collected blog posts into ranked blog posts: Ranking blog posts with WikiBlogScore]
                                                                          33
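As a worked illustration with hypothetical counts (not taken from the slides), suppose a blog post b contains one Redirect term twice, one Bold text term once, and one Anchor text term four times. With the weights above, the score would come out as:

```latex
\[
\mathrm{WikiBlogScore}(e, b)
  = \underbrace{3 \times 2}_{\text{Redirect}}
  + \underbrace{2 \times 1}_{\text{Bold text}}
  + \underbrace{0.5 \times 4}_{\text{Anchor text}}
  = 6 + 2 + 2 = 10
\]
```

Under these weights a single Redirect match counts as much as six Anchor text matches, so posts that use a paraphrase of the entry title rise quickly in the ranking.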
Evaluation:
before/after Manual Selection of Relevant Wikipedia Entries
[Diagram: A News Article on Kyoto Protocol -> Wikipedia -> Blog]

  Top 10 Ranked Wikipedia Entries
    Post-Kyoto Protocol Negotiation, United Nations, Protocol, Carbon dioxide,
    United States, Debate, Kyoto, Greenhouse gas, Minister, Poland
                                                                                  35
Evaluation:
before/after Manual Selection of Relevant Wikipedia Entries
[Diagram: A News Article on Kyoto Protocol -> Wikipedia -> Blog]

  Manually Selected Relevant 3 Wikipedia Entries (shown among the same 10 entries)
    Post-Kyoto Protocol Negotiation, United Nations, Protocol, Carbon dioxide,
    United States, Debate, Kyoto, Greenhouse gas, Minister, Poland
                                                                                  36
Evaluation Results:
    Precision of Top Ranked Blog Posts
[Chart: precision (%) for three topics: missile, Hillary Clinton, Kyoto Protocol]
  Compared conditions:
  • Ranking Blog Posts with All of the Top 10 Wikipedia Entries, including
    General Terms which damage the Blog Posts Ranking
  • Ranking Blog Posts after Manually Selecting Relevant Specific Terms,
    excluding General Terms which damage the Blog Posts Ranking
                                                               37
Conclusion and Future Works
• Purpose of the Work:
  Linking Topics of News and Blogs with Wikipedia
• Details of the Proposed Method
   – Ranking related Wikipedia Entries given a News Article
   – Ranking related Bloggers/Blog Posts
     given an Index Wikipedia Entry

• Future Works:
   – Automatic Selection of Related Wikipedia Entries
   – Implementing Complementary Navigation System
   – Evaluation by Real Users
                                                         38

Linking Topics of News and Blogs with Wikipedia for Complementary Navigation

  • 1. Linking Topics of News and Blogs with Wikipedia for Complementary Navigation Yuki Sato† Daisuke Yokomoto† Hiroyuki Nakasaki† Mariko Kawaba‡ Takehito Utsuro† Tomohiro Fukuhara†† †Universityof Tsukuba ‡NTT Cyber Space Laboratories, NTT Corporation ††University of Tokyo 1
  • 2. 3 Information Wikipedia Sources on the Web fundamental background facts and knowledge Events report in the Real World subjective precise facts information: personal opinions and experiences News Blog 2
  • 3. 3 Information Wikipedia Sources on the Web fundamental background facts and knowledge Purpose of the research: Linking Topics of News and Blogs with Wikipedia Events report in the Real World subjective precise facts information: personal opinions and experiences News Blog 3
  • 4. July 2009, Dragon Quest 9 (console role-playing game for the Nintendo Entertainment System) was published in Japan. 4
  • 5. 3 Information Wikipedia Sources on the Web fundamental background facts and knowledge Dragon Quest 9 on sale Events in the Real World report subjective precise facts information: personal opinions and experiences News Blog 6
  • 6. Overview of the Wikipedia Framework of fundamental Complementary background Navigation facts and knowledge Complementary Complementary Navigation Navigation subjective report Events information: precise facts personal opinions in the Real World and experiences News Complementary Navigation Blog 11
  • 7. Outline of the Talk • Purpose of the Work: Linking Topics of News and Blogs with Wikipedia • Mock-up: from News to closely related Blog Posts • Details of the Proposed Method – Ranking related Wikipedia Entries given a News Article – Ranking related Bloggers/Blog Posts given an Index Wikipedia Entry • Evaluation • Conclusion and Future Works 12
  • 8. Outline of the Talk • Purpose of the Work: Linking Topics of News and Blogs with Wikipedia • Mock-up: from News to closely related Blog Posts • Details of the Proposed Method – Ranking related Wikipedia Entries given a News Article – Ranking related Bloggers/Blog Posts given an Index Wikipedia Entry • Evaluation • Conclusion and Future Works 13
  • 9. News Article Queue of guys who are waiting for the sales of Dragon Quest 9. Blog Search 14
  • 10. Complementary Navigation from News to Blog: Our Approach Wikipedia Entries as Conceptual Search Index 1. Search for 2. Search for Wikipedia Bloggers/Blog Entries closely Posts closely related to the related to the given News Index Article Use as Wikipedia Search Index Entries Wikipedia 15 News Blog
  • 11. News Article Queue of guys who are waiting for the sales of Dragon Quest 9. Blog Search Related Topics 16
  • 12. Complementary Navigation from News to Blog: Our Approach Wikipedia Entries as Conceptual Search Index 1. Search for Wikipedia Entries closely related to the given News Article Use as Search Index Wikipedia 17 News Blog
  • 13. News Article Related Topics Blog Search 18
  • 14. News Article Queue of guys who are waiting for the sales of Dragon Quest 9. Related Topics Blog Search 19
  • 15. Selected Topic Blog Post Ranking 1st DS 2nd 20
  • 16. News Article Queue of guys who are waiting for the sales of Dragon Quest 9. Related Topics Blog Search 21
  • 17. Another Topic which damages Blog Post Ranking Blog Post Ranking (damaged) 1st Parade of a famous festival in Kyoto 2nd Queue of a Noodle Restaurant in Shibuya 22
  • 18. News Article Precision of Top 10 ranked Blog Posts per Topic Related Topics 50% 40% 60% 0 10 Improved to 50% after Manually 21% on the Average 30 Selecting Relevant Topics 10 0 0 10 Blog Search 23
  • 19. Outline of the Talk • Purpose of the Work: Linking Topics of News and Blogs with Wikipedia • Mock-up: from News to closely related Blog Posts • Details of the Proposed Method – Ranking related Wikipedia Entries given a News Article – Ranking related Bloggers/Blog Posts given an Index Wikipedia Entry • Evaluation • Conclusion and Future Works 24
  • 20. Complementary Navigation from News to Blog: Our Approach Wikipedia Entries as Conceptual Search Index 1. Search for Wikipedia Entries closely related to the given News Article Use as Search Index Wikipedia 25 News Blog
  • 21. Ranking Wikipedia Entries with Topic Related Terms Related Wikipedia Entries Nintendo News Article Dragon Quest series Dragon Quest 9 on sale Game Top ranked 10 Entries 26
  • 22. Extracting topic-related terms from Wikipedia Wikipedia describes background knowledge of the Topic Types of topic-related terms 1. Bold Text 2. Redirect paraphrase of the entry title 3. Title of each paragraph 4. Noun phrase in body text Extracted Related Terms Super Mario Bros Pokémon Nintendo DS Wii Game Sony Role-playing game Fighting game 27
  • 23. Ranking Wikipedia Entries with Topic Related Terms WikiNewsScore(e, n) ¦ (weight (type(t )) u freq(t )) t Related Wikipedia type of related term tt type(t) Entries type(t) type of related term weight(type(t)) weight of the type type(t) weight(type(t)) weight of the type type(t) Nintendo weight(type(t) = Redirect) = 1 weight(type(t) = Redirect) = 1 weight(type(t) = Bold text) = 1 weight(type(t) = Bold text) = 1 News Article weight(type(t) = Title of each paragraph) = 1 weight(type(t) = Title of each paragraph) = 1 Dragon Quest series weight(type(t) = Noun phrase in body text) = 1 weight(type(t) = Noun phrase in body text) = 1 Dragon Quest 9 on sale Game Top ranked 10 Entries 28
  • 24. Complementary Navigation from News to Blog: Our Approach. Wikipedia Entries as Conceptual Search Index. Step 2: Search for Bloggers/Blog Posts closely related to the Index Wikipedia Entries. [Diagram: News, Wikipedia, Blog]
  • 25. Search for Bloggers/Blog Posts closely related to the Index Wikipedia Entries: Ranking related Blog Posts. [Diagram: the News Article "Dragon Quest 9 on sale" is linked to the top 10 ranked Related Wikipedia Entries (e.g. Nintendo, Dragon Quest series, Game), which in turn lead to Related Blog Posts ranked 1st, 2nd, 3rd, ...]
  • 26. Topic-related blog feed (blogger) retrieval [Kawaba et al., ICWSM2008; Nakasaki et al., ICWSM2009]. The result of a usual search through the Yahoo! Japan Search API is re-ranked by the hits of the topic keyword that appeared in each feed, giving a list of blog feeds which have high hits of the topic keyword. Blog feed: a group of blog posts which are written by the same blogger.
  • 27. Selecting blog posts by topic-related terms. Requirements: (1) Blog Feed: Topic Keyword Frequency in the Blog Feed, threshold 10; (2) Blog Post: at least one Topic-Related Term included in the Blog Post. [Example blog post: only the fragments "DS 2008 9 13" survive]
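The two requirements can be sketched as a filter. The comparison operator for the feed threshold is missing from the transcript, so this sketch assumes "at least 10"; that reading, the function name `select_posts`, and the use of substring matching are all assumptions.

```python
def select_posts(feed_posts, topic_keyword, related_terms, min_keyword_freq=10):
    """Keep the posts of one blog feed that satisfy both requirements.

    Feed-level test: the topic keyword occurs at least min_keyword_freq
    times across the whole feed (the ">=" direction is assumed).
    Post-level test: the post contains at least one topic-related term.
    """
    keyword_freq = sum(post.count(topic_keyword) for post in feed_posts)
    if keyword_freq < min_keyword_freq:
        return []
    return [
        post for post in feed_posts
        if any(term in post for term in related_terms)
    ]


feed = ["Nintendo " * 6 + "Wii", "Nintendo " * 4, "weather report"]
print(select_posts(feed, "Nintendo", ["Wii", "Pokémon"]))
```

Only the first post is kept in this usage: the feed as a whole reaches the keyword threshold, but the other two posts contain no related term.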
  • 28. Ranking Blog Posts with Topic-Related Terms. $\mathrm{WikiBlogScore}(e, b) = \sum_{t} \mathrm{weight}(\mathrm{type}(t)) \times \mathrm{freq}(t)$, where type(t) is the type of related term t and weight(type(t)) is the weight of the type type(t). Weights: Redirect = 3; Bold text = 2; Anchor text = 0.5. Example: for the entry "Nintendo", the related terms from Wikipedia are Super Mario Bros, Pokémon, Nintendo DS, Wii, Game, Sony, Role-playing game, Fighting game. The collected blog posts are ranked with WikiBlogScore to give the ranked blog posts.
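The ranking step can be sketched with the three weights from this slide. The assignment of a type to each example term below is hypothetical (the slide does not say which term is a redirect, bold text, or anchor text), as are the function names; freq(t) is again approximated by a substring count in the blog post b.

```python
# Weights as given on the slide: Redirect = 3, Bold text = 2, Anchor text = 0.5.
BLOG_WEIGHTS = {"redirect": 3.0, "bold": 2.0, "anchor": 0.5}


def wiki_blog_score(related_terms, post_text, weights=BLOG_WEIGHTS):
    """Sum weight(type(t)) * freq(t) over the related terms of one entry."""
    return sum(
        weights[term_type] * post_text.count(term)
        for term, term_type in related_terms.items()
    )


def rank_posts(posts, related_terms):
    """Order collected blog posts by descending WikiBlogScore."""
    return sorted(
        posts,
        key=lambda post: wiki_blog_score(related_terms, post),
        reverse=True,
    )


# Hypothetical type labels for illustration only.
terms = {"Nintendo Co.": "redirect", "Nintendo DS": "bold", "Wii": "anchor"}
posts = [
    "Wii and Wii again",
    "Nintendo DS review",
    "Nintendo Co. announced a Nintendo DS bundle",
]
for post in rank_posts(posts, terms):
    print(wiki_blog_score(terms, post), post)
```

With these weights a single redirect match outweighs several anchor-text matches, which is the effect the unequal weights are presumably meant to have.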
  • 29. Outline of the Talk • Purpose of the Work: Linking Topics of News and Blogs with Wikipedia • Mock-up: from News to closely related Blog Posts • Details of the Proposed Method – Ranking related Wikipedia Entries given a News Article – Ranking related Bloggers/Blog Posts given an Index Wikipedia Entry • Evaluation • Conclusion and Future Works
  • 30. Evaluation: before/after Manual Selection of Relevant Wikipedia Entries. Top 10 Ranked Wikipedia Entries for a News Article on the Kyoto Protocol: Post-Kyoto Protocol Negotiation, United Nations, Protocol, Carbon dioxide, United States, Debate, Kyoto, Greenhouse gas, Minister, Poland. [Diagram: News, Wikipedia, Blog]
  • 31. Evaluation: before/after Manual Selection of Relevant Wikipedia Entries. Manually Selected 3 Relevant Wikipedia Entries for a News Article on the Kyoto Protocol, chosen from the top 10: Post-Kyoto Protocol Negotiation, United Nations, Protocol, Carbon dioxide, United States, Debate, Kyoto, Greenhouse gas, Minister, Poland. [Diagram: the three selected entries are highlighted on the slide; the highlighting is not preserved in this transcript]
  • 32. Evaluation Results: Precision of Top Ranked Blog Posts. [Chart: precision (%) for the topics missile, Hillary Clinton and Kyoto Protocol under two conditions: (a) ranking blog posts with all of the top 10 Wikipedia entries, including general terms which damage the blog post ranking; (b) ranking blog posts after manually selecting relevant specific terms, excluding general terms which damage the blog post ranking]
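The precision measure used in this evaluation can be sketched as the standard precision at k. The function name `precision_at_k` and the relevance judgments in the usage lines are made up for illustration; they are not the talk's data. Dividing by k even when fewer than k posts are returned is one common convention and is an assumption here.

```python
def precision_at_k(relevance_flags, k=10):
    """Fraction of the top k ranked blog posts judged relevant.

    relevance_flags lists True/False judgments in rank order.
    The denominator is always k (an assumed convention).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    return sum(1 for flag in relevance_flags[:k] if flag) / k


# Illustrative judgments only, not results from the talk.
judged = [True, False, True, False, False, False, False, False, False, False]
print(precision_at_k(judged))
```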
  • 33. Conclusion and Future Works • Purpose of the Work: Linking Topics of News and Blogs with Wikipedia • Details of the Proposed Method – Ranking related Wikipedia Entries given a News Article – Ranking related Bloggers/Blog Posts given an Index Wikipedia Entry • Future Works: – Automatic Selection of Related Wikipedia Entries – Implementing Complementary Navigation System – Evaluation by Real Users