SlideShare uma empresa Scribd logo
1 de 12
ViBRANT
                                                                             Virtual Biodiversity Research




      ‘Wish you were here before!’
Who gains from collaboration between computer science
                 and social research?


 Daphne Duin, David King, Peter van den Besselaar

 Dep. of Organization Sciences & Network Institute, VU-University Amsterdam
        Department of Computing, The Open University, Milton Keynes




          Social Science and Digital Research: Interdisciplinary Insights,
                    March, 12, 2012, Oxford e-Research Centre
ViBRANT
                                                                                                                                         Virtual Biodiversity Research




Help! How is this social data?
 Time taken to serve the request (microseconds)                     Host name (equates to Scratchpad)                """Full URL"" (in quotes)"
                  Origin of request (IP address) F5                 Time the request was received (e#g# (01/Apr/2011:11:17:42 +0100)
                  """First line of request"" (in quotes)"           Status of final request (e#g# 200, 301, etc)     Size of the response in
      bytes       Remote logname (Almost always blank)              """Referer"" (in quotes)"
      able.myspecies.info          http://able.myspecies.info/favicon.ico            24.218.227.223 --               [14/Jul/2010:19:54:06
                  GET /favicon.ico HTTP/1.1        200              198              -               Mozilla/5.0 (Macintosh; U; Intel Mac OS X
      10.6; en-US; rv:1.9.2.6) Gecko/20100625 Firefox/3.6.6
      polychaetes.info             http://polychaetes.info/node/add/forum/forum/                     24.229.196.151 --
      [14/Jul/2010:20:16:48        GET /node/add/forum/forum/ HTTP/1.0               301             -
      http://polychaetes.info/node/add/forum/forum/                 Mozilla/4.0 (compatible; MSIE 6.0; Windows 98; Win 9x 4.90; Creative)
      ciliateguide.myspecies.info                  http://ciliateguide.myspecies.info/node/add/forum/forum/          24.229.196.151 --
                  [14/Jul/2010:20:39:14            GET /node/add/forum/forum/ HTTP/1.0               301             -
      http://ciliateguide.myspecies.info/node/add/forum/forum/                       Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1;
      MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW)
      ciliateguide.myspecies.info                  http://ciliateguide.myspecies.info/node/add/forum/forum           24.229.196.151 --
                  [14/Jul/2010:20:39:22            GET /node/add/forum/forum HTTP/1.0                200             25219
      http://ciliateguide.myspecies.info/node/add/forum/forum                        Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1;
      MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW)
      ciliateguide.myspecies.info                  http://ciliateguide.myspecies.info/node/add/forum/forum           24.229.196.151 --
                  [14/Jul/2010:20:39:37            POST /node/add/forum/forum HTTP/1.0               200             27128
      http://ciliateguide.myspecies.info/node/add/forum/forum                        Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1;
      MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW)
      ciliateguide.myspecies.info                  http://ciliateguide.myspecies.info/node/add/forum/forum           24.229.196.151 --
                  [14/Jul/2010:20:39:47            GET /node/add/forum/forum HTTP/1.0                200             25219
      http://ciliateguide.myspecies.info/node/add/forum/forum                        Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1;
      MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW)
 26141            wallacefund.info                 http://wallacefund.info/robots.txt                38.101.148.126 --
      [15/Jul/2010:03:48:42        GET /robots.txt HTTP/1.1         200              44              -               Mozilla/5.0 (compatible;
      discobot/1.1; +http://discoveryengine.com/discobot.html
      mhp.myspecies.info           http://mhp.myspecies.info/robots.txt              38.101.148.126 --               [15/Jul/2010:03:48:49
                  GET /robots.txt HTTP/1.1         200              44               -               Mozilla/5.0 (compatible; discobot/1.1; +
ViBRANT
                                                           Virtual Biodiversity Research




Interdisciplinary work for e-science
 E-science
 1. Application of an e-infrastructure to do science
 2. The study of the design, uptake and use of e-Science

 E-infrastructure: Scratchpads, online platform for
    biodiversity research

 Need: Developing alternative evaluation metrics for e-
   science

 Goal: Identification of different types of users

 Approach: Collaboration between social science and
   omputer science valuable for e-science
ViBRANT
                                                          Virtual Biodiversity Research




What is the impact of e-science?

  Question from e-science facility to social scientists


  Identification of different types of users
         Who are visiting Scratchpad platform?
         Web data (eg server log files)
         Identify Internet Service Providers visiting
         Scratchpads
         Cluster Internet Service Providers visiting
         Scratchpads, into meaningful categories
ViBRANT
                                                                       Virtual Biodiversity Research




Material
Standard web analytics report of Scratchpads
   >300 community sites
   > 5,000 registred users (unpaid)
   Public and closed content



Names of 6,728 unique Internet Service Providers
  (ISPs) (6 months)
  natural history museum               telstra internet   verizon online llc
  freie universitaet berlin
  queensland department of natural resources and water
  Gemeente maastricht
  national parks board (ministry of national development)
  agriculture and agrifood canada
  Commission europeenne
  u.s. fish and wildlife service irm/bfo hqstate of nebraska / office of
ViBRANT
                                                                  Virtual Biodiversity Research




Social scientists and computer scientists
 First trying alone…
 ….marina|marine|medical|medisch|microsoft|mineral|mining|ministerie|
   ministry|monsanto|museo|museum|national park|naval|navy|nerc|
   news|novartis|observatoire|office….


  Then question to computer scientist
  ...from social scientists: could you help us to better...
  • collect web data?
  • refine/cluster the data ?
  • develop tools/methods for measuring robustness of
      data?
ViBRANT
                                                           Virtual Biodiversity Research




Altmetrics for e-science: a social science and
computer science project

 “to what extent can we improve a human developed method
    with computational techniques, in order to cluster ISPs into
    meaningful categories representing the various audiences
    using Scratchpads? “
ViBRANT
                                                     Virtual Biodiversity Research




Method computer scientist
 Identify Internet Service Providers visiting
 Scratchpads, removing noise
        Inductive logic program, Aleph



 Cluster Internet Service Providers visiting Scratchpads
 into meaningful categories
       Bayesian classifier
ViBRANT
                                    Virtual Biodiversity Research




Results: Identification of ISPs
Manually build filter (181 terms)
- accuracy 94%
- precision 92%
- recall 97%
       Many hours of work

Computational filter (6 terms)
 - accuracy 84%
- precision 98%
- recall 73%
  c
     Couple of minutes
ViBRANT
                                         Virtual Biodiversity Research




Results: Clustering ISPs in meaningful
categories
Manual method: filter with key
words
“university” “research” “school”
“museum”
Problematic!
Computational method: classifiers
- 90% accuracy
       Couple of minutes!
ViBRANT
                                                   Virtual Biodiversity Research




Who gains from collaboration between
computer science and social research?

  •   E-science facilities, e-science uptake and
      implementation
  •   Social Science and
  •   Computer Science
ViBRANT
                                                          Virtual Biodiversity Research




Acknowledgments

  ViBRANT –http://vbrant.eu
  Scratchpads –http://scratchpads.eu/

  Laura Hollink for her help with the raw log files
  Simon Rycroft for his help with the web analytics reports
  Vince Smith for sharing presentation material

Mais conteúdo relacionado

Semelhante a Wish you were here before!

An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...Vince Smith
 
No specimen (software) left behind
No specimen (software) left behindNo specimen (software) left behind
No specimen (software) left behindVince Smith
 
Scripting Life: ViBRANT's Kickoff meeting
Scripting Life: ViBRANT's Kickoff meetingScripting Life: ViBRANT's Kickoff meeting
Scripting Life: ViBRANT's Kickoff meetingVince Smith
 
Scratchpad training
Scratchpad trainingScratchpad training
Scratchpad trainingVince Smith
 
Forethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital ScholarshipForethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital ScholarshipDavid De Roure
 
Web open standards for linked data and knowledge graphs as enablers of EU dig...
Web open standards for linked data and knowledge graphs as enablers of EU dig...Web open standards for linked data and knowledge graphs as enablers of EU dig...
Web open standards for linked data and knowledge graphs as enablers of EU dig...Fabien Gandon
 
Wimmics Research Team Overview 2017
Wimmics Research Team Overview 2017Wimmics Research Team Overview 2017
Wimmics Research Team Overview 2017Fabien Gandon
 
2020_12_11 «Opening Education with Artificial Intelligence» - Mitja Jermol
2020_12_11 «Opening Education with Artificial Intelligence» - Mitja Jermol2020_12_11 «Opening Education with Artificial Intelligence» - Mitja Jermol
2020_12_11 «Opening Education with Artificial Intelligence» - Mitja JermoleMadrid network
 
DISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloudDISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloudPaolo Nesi
 
Community web sites: small pieces loosely joined
Community web sites: small pieces loosely joinedCommunity web sites: small pieces loosely joined
Community web sites: small pieces loosely joinedVince Smith
 
Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...
Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...
Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...Cybera Inc.
 
Information Engineering in the Age of the Internet of Things
Information Engineering in the Age of the Internet of Things Information Engineering in the Age of the Internet of Things
Information Engineering in the Age of the Internet of Things PayamBarnaghi
 
Structural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsStructural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsAlexandreBonvin2
 
Michael Weber - Rechenkraft.net - From Volunteers to Scientists
Michael Weber - Rechenkraft.net - From Volunteers to ScientistsMichael Weber - Rechenkraft.net - From Volunteers to Scientists
Michael Weber - Rechenkraft.net - From Volunteers to ScientistsCitizenCyberlab
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
Web technologies training for development of library & information resources
Web technologies training for development of library & information resourcesWeb technologies training for development of library & information resources
Web technologies training for development of library & information resourcesJulius Cortez
 
Semantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream DataSemantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream DataOscar Corcho
 

Semelhante a Wish you were here before! (20)

IoT overview 2014
IoT overview 2014IoT overview 2014
IoT overview 2014
 
Sinnott Paper
Sinnott PaperSinnott Paper
Sinnott Paper
 
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
 
TDWG_ ViBRANT_301013
TDWG_ ViBRANT_301013TDWG_ ViBRANT_301013
TDWG_ ViBRANT_301013
 
No specimen (software) left behind
No specimen (software) left behindNo specimen (software) left behind
No specimen (software) left behind
 
Scripting Life: ViBRANT's Kickoff meeting
Scripting Life: ViBRANT's Kickoff meetingScripting Life: ViBRANT's Kickoff meeting
Scripting Life: ViBRANT's Kickoff meeting
 
Scratchpad training
Scratchpad trainingScratchpad training
Scratchpad training
 
Forethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital ScholarshipForethoughts (or Four Provocations) on Linked Data and Digital Scholarship
Forethoughts (or Four Provocations) on Linked Data and Digital Scholarship
 
Web open standards for linked data and knowledge graphs as enablers of EU dig...
Web open standards for linked data and knowledge graphs as enablers of EU dig...Web open standards for linked data and knowledge graphs as enablers of EU dig...
Web open standards for linked data and knowledge graphs as enablers of EU dig...
 
Wimmics Research Team Overview 2017
Wimmics Research Team Overview 2017Wimmics Research Team Overview 2017
Wimmics Research Team Overview 2017
 
2020_12_11 «Opening Education with Artificial Intelligence» - Mitja Jermol
2020_12_11 «Opening Education with Artificial Intelligence» - Mitja Jermol2020_12_11 «Opening Education with Artificial Intelligence» - Mitja Jermol
2020_12_11 «Opening Education with Artificial Intelligence» - Mitja Jermol
 
DISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloudDISIT Lab overview: smart city, big data, semantic computing, cloud
DISIT Lab overview: smart city, big data, semantic computing, cloud
 
Community web sites: small pieces loosely joined
Community web sites: small pieces loosely joinedCommunity web sites: small pieces loosely joined
Community web sites: small pieces loosely joined
 
Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...
Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...
Collaborative Science: Technologies & Examples - Cameron Kiddle, Grid Researc...
 
Information Engineering in the Age of the Internet of Things
Information Engineering in the Age of the Internet of Things Information Engineering in the Age of the Internet of Things
Information Engineering in the Age of the Internet of Things
 
Structural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 yearsStructural Biology in the Clouds: A Success Story of 10 years
Structural Biology in the Clouds: A Success Story of 10 years
 
Michael Weber - Rechenkraft.net - From Volunteers to Scientists
Michael Weber - Rechenkraft.net - From Volunteers to ScientistsMichael Weber - Rechenkraft.net - From Volunteers to Scientists
Michael Weber - Rechenkraft.net - From Volunteers to Scientists
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
Web technologies training for development of library & information resources
Web technologies training for development of library & information resourcesWeb technologies training for development of library & information resources
Web technologies training for development of library & information resources
 
Semantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream DataSemantic Sensor Networks and Linked Stream Data
Semantic Sensor Networks and Linked Stream Data
 

Último

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdfssuserdda66b
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - Englishneillewis46
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxDr. Sarita Anand
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxcallscotland1987
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 

Último (20)

Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 

Wish you were here before!

  • 1. ViBRANT Virtual Biodiversity Research ‘Wish you were here before!’ Who gains from collaboration between computer science and social research? Daphne Duin, David King, Peter van den Besselaar Dep. of Organization Sciences & Network Institute, VU-University Amsterdam Department of Computing, The Open University, Milton Keynes Social Science and Digital Research: Interdisciplinary Insights, March, 12, 2012, Oxford e-Research Centre
  • 2. ViBRANT Virtual Biodiversity Research Help! How is this social data? Time taken to serve the request (microseconds) Host name (equates to Scratchpad) """Full URL"" (in quotes)" Origin of request (IP address) F5 Time the request was received (e#g# (01/Apr/2011:11:17:42 +0100) """First line of request"" (in quotes)" Status of final request (e#g# 200, 301, etc) Size of the response in bytes Remote logname (Almost always blank) """Referer"" (in quotes)" able.myspecies.info http://able.myspecies.info/favicon.ico 24.218.227.223 -- [14/Jul/2010:19:54:06 GET /favicon.ico HTTP/1.1 200 198 - Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2.6) Gecko/20100625 Firefox/3.6.6 polychaetes.info http://polychaetes.info/node/add/forum/forum/ 24.229.196.151 -- [14/Jul/2010:20:16:48 GET /node/add/forum/forum/ HTTP/1.0 301 - http://polychaetes.info/node/add/forum/forum/ Mozilla/4.0 (compatible; MSIE 6.0; Windows 98; Win 9x 4.90; Creative) ciliateguide.myspecies.info http://ciliateguide.myspecies.info/node/add/forum/forum/ 24.229.196.151 -- [14/Jul/2010:20:39:14 GET /node/add/forum/forum/ HTTP/1.0 301 - http://ciliateguide.myspecies.info/node/add/forum/forum/ Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW) ciliateguide.myspecies.info http://ciliateguide.myspecies.info/node/add/forum/forum 24.229.196.151 -- [14/Jul/2010:20:39:22 GET /node/add/forum/forum HTTP/1.0 200 25219 http://ciliateguide.myspecies.info/node/add/forum/forum Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW) ciliateguide.myspecies.info http://ciliateguide.myspecies.info/node/add/forum/forum 24.229.196.151 -- [14/Jul/2010:20:39:37 POST /node/add/forum/forum HTTP/1.0 200 27128 http://ciliateguide.myspecies.info/node/add/forum/forum Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW) ciliateguide.myspecies.info http://ciliateguide.myspecies.info/node/add/forum/forum 24.229.196.151 -- [14/Jul/2010:20:39:47 GET /node/add/forum/forum HTTP/1.0 200 25219 http://ciliateguide.myspecies.info/node/add/forum/forum Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; MRA 4.6 (build 01425); MRSPUTNIK 1, 5, 0, 19 SW) 26141 wallacefund.info http://wallacefund.info/robots.txt 38.101.148.126 -- [15/Jul/2010:03:48:42 GET /robots.txt HTTP/1.1 200 44 - Mozilla/5.0 (compatible; discobot/1.1; +http://discoveryengine.com/discobot.html mhp.myspecies.info http://mhp.myspecies.info/robots.txt 38.101.148.126 -- [15/Jul/2010:03:48:49 GET /robots.txt HTTP/1.1 200 44 - Mozilla/5.0 (compatible; discobot/1.1; +
  • 3. ViBRANT Virtual Biodiversity Research Interdisciplinary work for e-science E-science 1. Application of an e-infrastructure to do science 2. The study of the design, uptake and use of e-Science E-infrastructure: Scratchpads, online platform for biodiversity research Need: Developing alternative evaluation metrics for e- science Goal: Identification of different types of users Approach: Collaboration between social science and omputer science valuable for e-science
  • 4. ViBRANT Virtual Biodiversity Research What is the impact of e-science? Question from e-science facility to social scientists Identification of different types of users Who are visiting Scratchpad platform? Web data (eg server log files) Identify Internet Service Providers visiting Scratchpads Cluster Internet Service Providers visiting Scratchpads, into meaningful categories
  • 5. ViBRANT Virtual Biodiversity Research Material Standard web analytics report of Scratchpads >300 community sites > 5,000 registred users (unpaid) Public and closed content Names of 6,728 unique Internet Service Providers (ISPs) (6 months) natural history museum telstra internet verizon online llc freie universitaet berlin queensland department of natural resources and water Gemeente maastricht national parks board (ministry of national development) agriculture and agrifood canada Commission europeenne u.s. fish and wildlife service irm/bfo hqstate of nebraska / office of
  • 6. ViBRANT Virtual Biodiversity Research Social scientists and computer scientists First trying alone… ….marina|marine|medical|medisch|microsoft|mineral|mining|ministerie| ministry|monsanto|museo|museum|national park|naval|navy|nerc| news|novartis|observatoire|office…. Then question to computer scientist ...from social scientists: could you help us to better... • collect web data? • refine/cluster the data ? • develop tools/methods for measuring robustness of data?
  • 7. ViBRANT Virtual Biodiversity Research Altmetrics for e-science: a social science and computer science project “to what extent can we improve a human developed method with computational techniques, in order to cluster ISPs into meaningful categories representing the various audiences using Scratchpads? “
  • 8. ViBRANT Virtual Biodiversity Research Method computer scientist Identify Internet Service Providers visiting Scratchpads, removing noise Inductive logic program, Aleph Cluster Internet Service Providers visiting Scratchpads into meaningful categories Bayesian classifier
  • 9. ViBRANT Virtual Biodiversity Research Results: Identification of ISPs Manually build filter (181 terms) - accuracy 94% - precision 92% - recall 97% Many hours of work Computational filter (6 terms) - accuracy 84% - precision 98% - recall 73% c Couple of minutes
  • 10. ViBRANT Virtual Biodiversity Research Results: Clustering ISPs in meaningful categories Manual method: filter with key words “university” “research” “school” “museum” Problematic! Computational method: classifiers - 90% accuracy Couple of minutes!
  • 11. ViBRANT Virtual Biodiversity Research Who gains from collaboration between computer science and social research? • E-science facilities, e-science uptake and implementation • Social Science and • Computer Science
  • 12. ViBRANT Virtual Biodiversity Research Acknowledgments ViBRANT –http://vbrant.eu Scratchpads –http://scratchpads.eu/ Laura Hollink for her help with the raw log files Simon Rycroft for his help with the web analytics reports Vince Smith for sharing presentation material

Notas do Editor

  1. So we had a question, who are the visitors of SPs? And a file with electronic use data ...the challenge then was how to analyse the data and to know how robust the data are. Identify:We decided to start with identifying “the users”. Web analytics packages can be used to generate information on the visitors (users), notably through identification of the names of the visiting Internet Service Providers (ISPs). Through the name of the ISP, (i.e. ‘Vrije Universiteit’) we may be able to identify the nature and activities of the users. Clusters: Additionally, and next to identification we also wanted to cluster the ISP into categories that make sense for evaluation purposes We were in particular interested to see the partition of academic users versus other educational users and sectors such as government and business as this could tell us something about the (societal) impact of the e-infrastructure.
  2. The social scientists produced a 181-term filter set after many hours of effort that gave 94% accuracy, whereas the computer scientist produced a 6-term filter set in a couple of minutes that gave 84% accuracy. The tested computer-aided filtering reached a higher precision than the manually‑developed filter (98% vs 92%) though for the recall in this initial test favored the manual approach (73% vs 97%).
  3. Meaningful categories in this context are categories that The manual process highlighted a problem with continuing to use keywords to categorize ISPs. Some categories are easily made up from words in the name of the full ISP such as “university” or “research” and could be grouped under the tier one category “research & education”. However, this approach is limited. For example, to simply categorize all ISPs who had within their name the terms “health” or “medic*” as “public health” meant that a range of research, educational, governmental and corporate affiliated ISPs were wrongly classified. Therefore, we were encouraged to categorize ISPs using classifiers rather than by extending our work with filters.
  4. Interdisciplinary work of CS and SS will bring to e-science enhanced insights on the actual use and usage of the e-science environment based on robust (log) data and analysis, in a relative short amount of time 2. Social science will benefit from working with CS because of increased scale and speed of data collection and analysis and for their insight in the technological boundaries/charateritics. 3. CS will benefit because collaboration provides opportunity to demonstrate their engineering insights (tool building for the e-science facility as well as tools for analyzing social science data sets); 2) access to large datasets with behavioral/user information which are nice cases to test computer science theories Possible costs: Above we listed several reasons for collaboration between e-science facilities, computer science and social sciences, nevertheless every collaboration does have costs: it requires time in planning and communication. Furthermore, collaborators support each other’s work often at the costs advancing their own research