SlideShare uma empresa Scribd logo
1 de 62
Big Data, Linked Data
                                            Paul Miller
                                     The Cloud of Data
                                paul.miller@cloudofdata.com




cloudofdata.com
Data

           Big Data

           Linked Data

           Opportunities for Audio-Visual Archives




cloudofdata.com
Data




cloudofdata.com
cloudofdata.com
cloudofdata.com
cloudofdata.com
Talis

                      Libraries



                                                  SemWeb
     Cultural                                     Research
     Heritage



                                                      SemTech



                  JISC-land
                                                  EC !


                                  Cloud
cloudofdata.com                           inmaps.linkedinlabs.com
cloudofdata.com   inmaps.linkedinlabs.com
“Every two days now we create
 as much information as we did
   from the dawn of civilisation
          up until 2003”




cloudofdata.com
cloudofdata.com
Cost per Gb of storage ($)
300,000




200,000




100,000




     0
      1981 1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009



cloudofdata.com
Cost per Gb of storage ($)
300,000




200,000




100,000




     0
      1981 1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009



cloudofdata.com                       isen.com/blog/2011/03/the-decline-and-fall-of-disk-storage-prices/
cloudofdata.com   www.datacenterknowledge.com/wp-content/uploads/2009/09/aerial-1000.jpg
Big Data




cloudofdata.com
Cnut knew trying to control the tide was silly




 cloudofdata.com                                 www.flickr.com/photos/30591976@N05/3402395112/
why not use language of opportunity?




 cloudofdata.com                       www.flickr.com/photos/73645804@N00/2712985768/
why not use language of opportunity?
                     “data-driven organisations
                           look at big data as a
                       solution, not a problem”
                                       Release 2.0, February 2009




 cloudofdata.com                             www.flickr.com/photos/73645804@N00/2712985768/
cloudofdata.com   r2.oreilly.com/
Massively Parallel Processing (MPP)

       Column stores

       MapReduce/ Hadoop

       NoSQL




cloudofdata.com                              r2.oreilly.com/
emerging from lots of places, and being combined




 cloudofdata.com                                   www.flickr.com/photos/98274023@N00/2102067531/

quantity, nature, expectations...
emerging from lots of places, and being combined




       More data, faster...




 cloudofdata.com                                   www.flickr.com/photos/98274023@N00/2102067531/

quantity, nature, expectations...
Real-Time Web




 cloudofdata.com
Sensor Web - 15 Petabytes per year ?




 cloudofdata.com                       www.flickr.com/photos/7702002@N08/2293062616/
Sensor Web - 15 Petabytes per year ?




 cloudofdata.com                       www.flickr.com/photos/7702002@N08/2293062616/
Sensor Web - 15 Petabytes per year ?




 15,000,000,000,000,000




 cloudofdata.com                       www.flickr.com/photos/7702002@N08/2293062616/
Sensor Web - 15 Petabytes per year ?




 15,000,000,000,000,000
                   125,000 iPods


 cloudofdata.com                       www.flickr.com/photos/7702002@N08/2293062616/
Sensor Web




 cloudofdata.com   www.flickr.com/photos/aroberts/3035796/
Sensor Web




                                                    Image © Apple
 cloudofdata.com   www.flickr.com/photos/aroberts/3035796/
cloudofdata.com
cloudofdata.com   gigaom.com/2011/02/01/mining-the-tar-sands-of-big-data/
Linked Data




cloudofdata.com
Today’s a “Web of Documents” ?




cloudofdata.com              www.flickr.com/photos/calliope/306564541/
Tomorrow’s “Semantic Web,” 2001-style!




 cloudofdata.com                         www.sciam.com/article.cfm?id=the-semantic-web
Tomorrow’s “Semantic Web,” 2001-style!




  http://tr.im/timbl
  http://tr.im/hendler
 cloudofdata.com                         www.sciam.com/article.cfm?id=the-semantic-web
Pipe Dream?




cloudofdata.com
Or are the pieces falling into place?




cloudofdata.com
W3C-driven effort. More used - and useful - than PR might imply

                                                                  URI - 1994
                                                                  XML - 1998
                                                                  RDF - 1999/ 2004
                                                                  OWL - 2004
                                                                  SPARQL - 2008
                                                                  Applications - 2007/8




 cloudofdata.com                                       Image © World Wide Web Consortium
“J.R.R. Tolkien wrote The Hobbit”




cloudofdata.com
J.R.R. Tolkien wrote The Hobbit




cloudofdata.com
J.R.R. Tolkien   wrote   The Hobbit




cloudofdata.com
J.R.R. Tolkien   wrote   The Hobbit




cloudofdata.com
J.R.R. Tolkien   wrote       The Hobbit


          subject   predicate     object




cloudofdata.com
J.R.R. Tolkien   wrote       The Hobbit


                    predicate




cloudofdata.com
J.R.R. Tolkien                 wrote                     The Hobbit


                    http://blah.org/ThingsAuthorsDo/write




cloudofdata.com
J.R.R. Tolkien
          http://dbpedia.org/page/
              J._R._R._Tolkien




                                      wrote
                        http://blah.org/ThingsAuthorsDo/write




                                                                The Hobbit
                                            http://dbpedia.org/page/The_Hobbit
cloudofdata.com
eg Wikipedia data
                  boxes to DBpedia




cloudofdata.com
eg Wikipedia data
                  boxes to DBpedia




cloudofdata.com
eg UK Gov




                                        bit.ly/ztOed

cloudofdata.com   www.flickr.com/photos/lorentey/1438477358/
eg BBC




cloudofdata.com   bbc.co.uk/music/
Data LINKED to other places outside firewall
eg BBC trusts and relies upon MusicBrainz




                                                               bit.ly/9tBJGH

 cloudofdata.com                              www.flickr.com/photos/foxypar4/2124673642/
harks back to TimBL’s original vision
 cloudofdata.com                        www.flickr.com/photos/tanaka/3212373419/

for a Read/Write Web
“the Web done
                                                 right”
                                        Sir Tim Berners-Lee, 2008




harks back to TimBL’s original vision
 cloudofdata.com                                 www.flickr.com/photos/tanaka/3212373419/

for a Read/Write Web
Use URIs to name things

           Use HTTP URIs so that they can be followed

           When someone follows a URI, provide useful information

           Include links to other URIs, so that more can be discovered.




cloudofdata.com                                                www.w3.org/DesignIssues/LinkedData.html
cloudofdata.com   richard.cyganiak.de/2007/10/lod/
Opportunities




cloudofdata.com
cloudofdata.com
Web-scale tools

             NoSQL data manipulation with Hadoop, Cassandra, etc




cloudofdata.com
Web-scale tools

             NoSQL data manipulation with Hadoop, Cassandra, etc

           Web-scale storage and compute

             Separate archival role from analysis, dissemination and use

             “too cheap to meter” may be measuring the wrong things




cloudofdata.com
Web-scale tools

             NoSQL data manipulation with Hadoop, Cassandra, etc

           Web-scale storage and compute

             Separate archival role from analysis, dissemination and use

             “too cheap to meter” may be measuring the wrong things

           Leverage connections

             between archives, and with the wider world

                  embrace the Web, and its architecture




cloudofdata.com
cloudofdata.com   www.flickr.com/photos/11962592@N00/4549097414/
cloud of data




  Thank you                                                                                                        Download this presentation
                                                                                                                   slideshare.net/cloudofdata


  Dr Paul Miller
  The Cloud of Data
  paul.miller@cloudofdata.com
  skype: cloudofdata                                                                                                                      Made on a

  phone: +44 7769 740083
                                                                                                                                          Mac

                                Except where otherwise noted, this work is licensed under the Creative Commons Attribution Licence.
                                      To view a copy of this licence, visit creativecommons.org/licenses/by/2.0/uk/ or send a letter to
cloudofdata.com                              Creative Commons, 171 Second St, San Francisco, CA 94105, United States of America
Big Data, Linked Data

Mais conteúdo relacionado

Destaque

The Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information IntegrationThe Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information Integration3 Round Stones
 
Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...
Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...
Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...Mark Birbeck
 
Gabriele Gattiglia - Progettare la pubblicazione dei dati
Gabriele Gattiglia - Progettare la pubblicazione dei datiGabriele Gattiglia - Progettare la pubblicazione dei dati
Gabriele Gattiglia - Progettare la pubblicazione dei datiOpenPompei
 
Big (open) data, big question?
Big (open) data, big question?Big (open) data, big question?
Big (open) data, big question?Salvatore Marras
 
Linked Data, Big Data, and User Science at Globo.com
Linked Data, Big Data, and User Science at Globo.comLinked Data, Big Data, and User Science at Globo.com
Linked Data, Big Data, and User Science at Globo.comÍcaro Medeiros
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataMuhammad Saleem
 
International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...
International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...
International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...Francesco Ciclosi
 
Gestione dei big data: Web 3.0, motori semantici, soft computing
Gestione dei big data: Web 3.0, motori semantici, soft computing Gestione dei big data: Web 3.0, motori semantici, soft computing
Gestione dei big data: Web 3.0, motori semantici, soft computing Valerio Eletti
 
Ottenere e visualizzare i dati. Open Data e Big Data
Ottenere e visualizzare i dati. Open Data e Big DataOttenere e visualizzare i dati. Open Data e Big Data
Ottenere e visualizzare i dati. Open Data e Big DataVincenzo Patruno
 
Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)EUCLID project
 
M. Scannapieco - Big Data e Open Data: Istruzioni (o quasi) per l’Uso
M. Scannapieco - Big Data e Open Data:  Istruzioni (o quasi) per l’Uso  M. Scannapieco - Big Data e Open Data:  Istruzioni (o quasi) per l’Uso
M. Scannapieco - Big Data e Open Data: Istruzioni (o quasi) per l’Uso Istituto nazionale di statistica
 
Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...
Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...
Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...Istituto nazionale di statistica
 
Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...
Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...
Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...Istituto nazionale di statistica
 
Il turismo nella politica di coesione dal 2000-06 al 2014-20
Il turismo nella politica di coesione dal 2000-06 al 2014-20Il turismo nella politica di coesione dal 2000-06 al 2014-20
Il turismo nella politica di coesione dal 2000-06 al 2014-20OpenCoesione
 
Slide 2.5 - Come la PA pubblica i dati - ASOC1617
Slide 2.5 - Come la PA pubblica i dati - ASOC1617Slide 2.5 - Come la PA pubblica i dati - ASOC1617
Slide 2.5 - Come la PA pubblica i dati - ASOC1617A Scuola di OpenCoesione
 
Open data in sanita' - Master Funzioni Direttive Gestione servizi sanitari
Open data in sanita' - Master Funzioni Direttive Gestione servizi sanitariOpen data in sanita' - Master Funzioni Direttive Gestione servizi sanitari
Open data in sanita' - Master Funzioni Direttive Gestione servizi sanitariUSAC Program
 

Destaque (20)

The Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information IntegrationThe Power of Linked Data for Government & Healthcare Information Integration
The Power of Linked Data for Government & Healthcare Information Integration
 
Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...
Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...
Monitoring Healthcare Innovation: A Case Study in Using OWL, Linked Data and ...
 
Gabriele Gattiglia - Progettare la pubblicazione dei dati
Gabriele Gattiglia - Progettare la pubblicazione dei datiGabriele Gattiglia - Progettare la pubblicazione dei dati
Gabriele Gattiglia - Progettare la pubblicazione dei dati
 
Big (open) data, big question?
Big (open) data, big question?Big (open) data, big question?
Big (open) data, big question?
 
Linked data in pharma R&D
Linked data in pharma R&DLinked data in pharma R&D
Linked data in pharma R&D
 
Linked Data, Big Data, and User Science at Globo.com
Linked Data, Big Data, and User Science at Globo.comLinked Data, Big Data, and User Science at Globo.com
Linked Data, Big Data, and User Science at Globo.com
 
Fostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked DataFostering Serendipity through Big Linked Data
Fostering Serendipity through Big Linked Data
 
3.7 Data visualization - strumenti
3.7 Data visualization - strumenti 3.7 Data visualization - strumenti
3.7 Data visualization - strumenti
 
International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...
International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...
International Open Data Day 2014 Marche by Unicam - Presentazione di Francesc...
 
Gestione dei big data: Web 3.0, motori semantici, soft computing
Gestione dei big data: Web 3.0, motori semantici, soft computing Gestione dei big data: Web 3.0, motori semantici, soft computing
Gestione dei big data: Web 3.0, motori semantici, soft computing
 
Ottenere e visualizzare i dati. Open Data e Big Data
Ottenere e visualizzare i dati. Open Data e Big DataOttenere e visualizzare i dati. Open Data e Big Data
Ottenere e visualizzare i dati. Open Data e Big Data
 
Interoperabilità e Big Data
Interoperabilità e Big DataInteroperabilità e Big Data
Interoperabilità e Big Data
 
Recommender Systems and Linked Open Data
Recommender Systems and Linked Open DataRecommender Systems and Linked Open Data
Recommender Systems and Linked Open Data
 
Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)Relational Database to RDF (RDB2RDF)
Relational Database to RDF (RDB2RDF)
 
M. Scannapieco - Big Data e Open Data: Istruzioni (o quasi) per l’Uso
M. Scannapieco - Big Data e Open Data:  Istruzioni (o quasi) per l’Uso  M. Scannapieco - Big Data e Open Data:  Istruzioni (o quasi) per l’Uso
M. Scannapieco - Big Data e Open Data: Istruzioni (o quasi) per l’Uso
 
Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...
Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...
Pubblica amministrazione: Smart Data e produzione di servizi pubblici innovat...
 
Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...
Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...
Big data, statistica pubblica, Internet of things e città intelligenti. Bari ...
 
Il turismo nella politica di coesione dal 2000-06 al 2014-20
Il turismo nella politica di coesione dal 2000-06 al 2014-20Il turismo nella politica di coesione dal 2000-06 al 2014-20
Il turismo nella politica di coesione dal 2000-06 al 2014-20
 
Slide 2.5 - Come la PA pubblica i dati - ASOC1617
Slide 2.5 - Come la PA pubblica i dati - ASOC1617Slide 2.5 - Come la PA pubblica i dati - ASOC1617
Slide 2.5 - Come la PA pubblica i dati - ASOC1617
 
Open data in sanita' - Master Funzioni Direttive Gestione servizi sanitari
Open data in sanita' - Master Funzioni Direttive Gestione servizi sanitariOpen data in sanita' - Master Funzioni Direttive Gestione servizi sanitari
Open data in sanita' - Master Funzioni Direttive Gestione servizi sanitari
 

Semelhante a Big Data, Linked Data

Introduction To Linked Data
Introduction To Linked DataIntroduction To Linked Data
Introduction To Linked DataLeigh Dodds
 
Web Archives at the Nexus of Good Fakes and Flawed Originals
Web Archives at the Nexus of Good Fakes and Flawed OriginalsWeb Archives at the Nexus of Good Fakes and Flawed Originals
Web Archives at the Nexus of Good Fakes and Flawed OriginalsMichael Nelson
 
Information Security and Cloud Computing
Information Security and Cloud ComputingInformation Security and Cloud Computing
Information Security and Cloud ComputingPaul Miller
 
Linked Data an Introduction
Linked Data an IntroductionLinked Data an Introduction
Linked Data an IntroductionTalis Consulting
 
Methodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataMethodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataBoris Villazón-Terrazas
 
Cloud Computing - Myths & Reality
Cloud Computing - Myths & RealityCloud Computing - Myths & Reality
Cloud Computing - Myths & RealityErik Riedel
 
Collaborating in the Clouds
Collaborating in the CloudsCollaborating in the Clouds
Collaborating in the CloudsTom Ipri
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA KeynoteAxel Polleres
 
Content Used to Be King - Now what?
Content Used to Be King - Now what?Content Used to Be King - Now what?
Content Used to Be King - Now what?Judy O'Connell
 
GIS in the Rockies Geospatial Revolution
GIS in the Rockies Geospatial RevolutionGIS in the Rockies Geospatial Revolution
GIS in the Rockies Geospatial RevolutionPeter Batty
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)net2-project
 
Deepak semantic web_iitd
Deepak semantic web_iitdDeepak semantic web_iitd
Deepak semantic web_iitdDeepak Shevani
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesRichard Wallis
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationRichard Wallis
 
Big Data Story - From An Engineer's Perspective
Big Data Story - From An Engineer's PerspectiveBig Data Story - From An Engineer's Perspective
Big Data Story - From An Engineer's PerspectiveHien Luu
 

Semelhante a Big Data, Linked Data (20)

Introduction To Linked Data
Introduction To Linked DataIntroduction To Linked Data
Introduction To Linked Data
 
Web Archives at the Nexus of Good Fakes and Flawed Originals
Web Archives at the Nexus of Good Fakes and Flawed OriginalsWeb Archives at the Nexus of Good Fakes and Flawed Originals
Web Archives at the Nexus of Good Fakes and Flawed Originals
 
Information Security and Cloud Computing
Information Security and Cloud ComputingInformation Security and Cloud Computing
Information Security and Cloud Computing
 
Linked Data an Introduction
Linked Data an IntroductionLinked Data an Introduction
Linked Data an Introduction
 
Methodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked DataMethodological Guidelines for Publishing Linked Data
Methodological Guidelines for Publishing Linked Data
 
Cloud Computing - Myths & Reality
Cloud Computing - Myths & RealityCloud Computing - Myths & Reality
Cloud Computing - Myths & Reality
 
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
 
Cloud based Web Intelligence
Cloud based Web IntelligenceCloud based Web Intelligence
Cloud based Web Intelligence
 
When?
When?When?
When?
 
Collaborating in the Clouds
Collaborating in the CloudsCollaborating in the Clouds
Collaborating in the Clouds
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA Keynote
 
Content Used to Be King - Now what?
Content Used to Be King - Now what?Content Used to Be King - Now what?
Content Used to Be King - Now what?
 
GIS in the Rockies Geospatial Revolution
GIS in the Rockies Geospatial RevolutionGIS in the Rockies Geospatial Revolution
GIS in the Rockies Geospatial Revolution
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)
 
Making sense out of things on the web
Making sense out of things on the webMaking sense out of things on the web
Making sense out of things on the web
 
Deepak semantic web_iitd
Deepak semantic web_iitdDeepak semantic web_iitd
Deepak semantic web_iitd
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
Contextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data FoundationContextual Computing: Laying a Global Data Foundation
Contextual Computing: Laying a Global Data Foundation
 
Big Data Story - From An Engineer's Perspective
Big Data Story - From An Engineer's PerspectiveBig Data Story - From An Engineer's Perspective
Big Data Story - From An Engineer's Perspective
 

Último

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Último (20)

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Big Data, Linked Data

  • 1. Big Data, Linked Data Paul Miller The Cloud of Data paul.miller@cloudofdata.com cloudofdata.com
  • 2. Data Big Data Linked Data Opportunities for Audio-Visual Archives cloudofdata.com
  • 7. Talis Libraries SemWeb Cultural Research Heritage SemTech JISC-land EC ! Cloud cloudofdata.com inmaps.linkedinlabs.com
  • 8. cloudofdata.com inmaps.linkedinlabs.com
  • 9. “Every two days now we create as much information as we did from the dawn of civilisation up until 2003” cloudofdata.com
  • 11. Cost per Gb of storage ($) 300,000 200,000 100,000 0 1981 1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 cloudofdata.com
  • 12. Cost per Gb of storage ($) 300,000 200,000 100,000 0 1981 1983 1985 1987 1989 1991 1993 1995 1997 1999 2001 2003 2005 2007 2009 cloudofdata.com isen.com/blog/2011/03/the-decline-and-fall-of-disk-storage-prices/
  • 13. cloudofdata.com www.datacenterknowledge.com/wp-content/uploads/2009/09/aerial-1000.jpg
  • 15. Cnut knew trying to control the tide was silly cloudofdata.com www.flickr.com/photos/30591976@N05/3402395112/
  • 16. why not use language of opportunity? cloudofdata.com www.flickr.com/photos/73645804@N00/2712985768/
  • 17. why not use language of opportunity? “data-driven organisations look at big data as a solution, not a problem” Release 2.0, February 2009 cloudofdata.com www.flickr.com/photos/73645804@N00/2712985768/
  • 18. cloudofdata.com r2.oreilly.com/
  • 19. Massively Parallel Processing (MPP) Column stores MapReduce/ Hadoop NoSQL cloudofdata.com r2.oreilly.com/
  • 20. emerging from lots of places, and being combined cloudofdata.com www.flickr.com/photos/98274023@N00/2102067531/ quantity, nature, expectations...
  • 21. emerging from lots of places, and being combined More data, faster... cloudofdata.com www.flickr.com/photos/98274023@N00/2102067531/ quantity, nature, expectations...
  • 23. Sensor Web - 15 Petabytes per year ? cloudofdata.com www.flickr.com/photos/7702002@N08/2293062616/
  • 24. Sensor Web - 15 Petabytes per year ? cloudofdata.com www.flickr.com/photos/7702002@N08/2293062616/
  • 25. Sensor Web - 15 Petabytes per year ? 15,000,000,000,000,000 cloudofdata.com www.flickr.com/photos/7702002@N08/2293062616/
  • 26. Sensor Web - 15 Petabytes per year ? 15,000,000,000,000,000 125,000 iPods cloudofdata.com www.flickr.com/photos/7702002@N08/2293062616/
  • 27. Sensor Web cloudofdata.com www.flickr.com/photos/aroberts/3035796/
  • 28. Sensor Web Image © Apple cloudofdata.com www.flickr.com/photos/aroberts/3035796/
  • 30. cloudofdata.com gigaom.com/2011/02/01/mining-the-tar-sands-of-big-data/
  • 32. Today’s a “Web of Documents” ? cloudofdata.com www.flickr.com/photos/calliope/306564541/
  • 33. Tomorrow’s “Semantic Web,” 2001-style! cloudofdata.com www.sciam.com/article.cfm?id=the-semantic-web
  • 34. Tomorrow’s “Semantic Web,” 2001-style! http://tr.im/timbl http://tr.im/hendler cloudofdata.com www.sciam.com/article.cfm?id=the-semantic-web
  • 36. Or are the pieces falling into place? cloudofdata.com
  • 37. W3C-driven effort. More used - and useful - than PR might imply URI - 1994 XML - 1998 RDF - 1999/ 2004 OWL - 2004 SPARQL - 2008 Applications - 2007/8 cloudofdata.com Image © World Wide Web Consortium
  • 38. “J.R.R. Tolkien wrote The Hobbit” cloudofdata.com
  • 39. J.R.R. Tolkien wrote The Hobbit cloudofdata.com
  • 40. J.R.R. Tolkien wrote The Hobbit cloudofdata.com
  • 41. J.R.R. Tolkien wrote The Hobbit cloudofdata.com
  • 42. J.R.R. Tolkien wrote The Hobbit subject predicate object cloudofdata.com
  • 43. J.R.R. Tolkien wrote The Hobbit predicate cloudofdata.com
  • 44. J.R.R. Tolkien wrote The Hobbit http://blah.org/ThingsAuthorsDo/write cloudofdata.com
  • 45. J.R.R. Tolkien http://dbpedia.org/page/ J._R._R._Tolkien wrote http://blah.org/ThingsAuthorsDo/write The Hobbit http://dbpedia.org/page/The_Hobbit cloudofdata.com
  • 46. eg Wikipedia data boxes to DBpedia cloudofdata.com
  • 47. eg Wikipedia data boxes to DBpedia cloudofdata.com
  • 48. eg UK Gov bit.ly/ztOed cloudofdata.com www.flickr.com/photos/lorentey/1438477358/
  • 49. eg BBC cloudofdata.com bbc.co.uk/music/
  • 50. Data LINKED to other places outside firewall eg BBC trusts and relies upon MusicBrainz bit.ly/9tBJGH cloudofdata.com www.flickr.com/photos/foxypar4/2124673642/
  • 51. harks back to TimBL’s original vision cloudofdata.com www.flickr.com/photos/tanaka/3212373419/ for a Read/Write Web
  • 52. “the Web done right” Sir Tim Berners-Lee, 2008 harks back to TimBL’s original vision cloudofdata.com www.flickr.com/photos/tanaka/3212373419/ for a Read/Write Web
  • 53. Use URIs to name things Use HTTP URIs so that they can be followed When someone follows a URI, provide useful information Include links to other URIs, so that more can be discovered. cloudofdata.com www.w3.org/DesignIssues/LinkedData.html
  • 54. cloudofdata.com richard.cyganiak.de/2007/10/lod/
  • 57. Web-scale tools NoSQL data manipulation with Hadoop, Cassandra, etc cloudofdata.com
  • 58. Web-scale tools NoSQL data manipulation with Hadoop, Cassandra, etc Web-scale storage and compute Separate archival role from analysis, dissemination and use “too cheap to meter” may be measuring the wrong things cloudofdata.com
  • 59. Web-scale tools NoSQL data manipulation with Hadoop, Cassandra, etc Web-scale storage and compute Separate archival role from analysis, dissemination and use “too cheap to meter” may be measuring the wrong things Leverage connections between archives, and with the wider world embrace the Web, and its architecture cloudofdata.com
  • 60. cloudofdata.com www.flickr.com/photos/11962592@N00/4549097414/
  • 61. cloud of data Thank you Download this presentation slideshare.net/cloudofdata Dr Paul Miller The Cloud of Data paul.miller@cloudofdata.com skype: cloudofdata Made on a phone: +44 7769 740083 Mac Except where otherwise noted, this work is licensed under the Creative Commons Attribution Licence. To view a copy of this licence, visit creativecommons.org/licenses/by/2.0/uk/ or send a letter to cloudofdata.com Creative Commons, 171 Second St, San Francisco, CA 94105, United States of America

Notas do Editor

  1. \n
  2. \n
  3. \n
  4. Data explosion. \nNot necessarily like this anymore. \nTables, and spreadsheets, and databases.\n
  5. or even - for you - this. \nNot just about STORING and SERVING streams.\nDetect and exploit CONNECTIONS - sometimes in near-real-time.\n
  6. 774 connections\nclusters are inferred. \nWhat does it mean?\n
  7. Automatic Clusters reflect my world with remarkable precision\nLabels are my own\nBigger dots (the ones you can see, at this scale!) = bigger influence in network\nMore than 50 contacts? Get your own. \n
  8. Explore...\nConnections shared with Seamus Ross;\nBlue Culture, Green JISC, 1 Orange Librarian\n
  9. Google CEO Eric Schmidt. Speaking at Techonomy, Lake Tahoe, in August 2010.\n\nPlenty to quibble with… data v. information, ‘dawn of civilisation,’ etc. But. HUGE shift. Autonomous sensors, 24 hour multi-channel tv, social networks, finance, commerce...\n
  10. Lewis Strauss, Chairman of US Atomic Energy Commission (and Pres. Eisenhower).\nPower “Too cheap to meter.” Reckoned in 1954 we’d get there. Have we?\n
  11. Storage too cheap to meter by mid 1990s?\nNot quite - question of resolution. $300,000 in 1981. $10,000 in 1990. $10 in 2000. $0.10 last year. \n\nSoon will be too cheap to meter. Changes the value proposition. In many domains, cheaper to keep everything than to selectively manage.\n\nBUT quantities increasing faster than costs are falling… and mechanics of storage a small fraction of the ‘cost’ of keeping data.\n\nTracked by a website in Nova Scotia, Canada. Data extracted by David Isenberg.\n\n
  12. COMPUTE gradually becoming too cheap to meter, too.\nCloud Computing - pay for the computers you need, for the time you need them. And then stop paying.\n\nSeparate STORAGE from PROCESSING from USE/DELIVERY/ACCESS. Not a bad thing to do anyway!\n\nMicrosoft, Dublin.\n
  13. The Economist, Feb/March 2010. Science and others also write about this. O’Reilly, GigaOM and others organise events around this.\nCompanies scrambling to ‘own’ this...\n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. \n
  22. \n
  23. \n
  24. \n
  25. \n
  26. \n
  27. \n
  28. \n
  29. Is A/V “Big Data” ? Maybe. Sometimes.\nGb and Gb of digitised film probably not.\nReal-time stream from 24 hour News… or the Big Brother House… could be. Analyse? Find patterns? Compare, Contrast, Explore.\n
  30. Mike Driscoll & Roger Ehrenberg spoke at recent Strata - and in GigaOM piece - and used tar sands/ oil sands analogy. Plenty of oil/value - but previously too expensive to extract.\n
  31. Now for something different. Linked Data rarely - so far - ‘Big’ Data.\nMight people in this room be the ones to change that?\n
  32. mostly human readable. mostly unconnected, except by hyperlinks that say nothing more structured than ‘see also…’\n
  33. been coming a long time. Heavily cited SciAm article is from 2001...\n
  34. expensive, unrealistic, academic, ultimately unattainable ?\n
  35. Plenty of people solving hard - focussed - problems with ‘semantics’\n
  36. W3C ‘Semantic Web Stack.’\nPerceived as complex, but contains powerful, flexible, elements...\n
  37. simple principles. simple power.\n
  38. \n
  39. \n
  40. subject, object, predicate.\n
  41. subject, object, predicate.\n
  42. subject, object, predicate.\n
  43. subject, object, predicate.\n
  44. subject, object, predicate.\n
  45. Add URIs and each is unambiguous. \n\nYou can also link, and link and link - ALL Tolkien’s books, etc.\n\nThe other stuff in the stack just makes this happen.\n
  46. Add URIs and each is unambiguous. \n\nYou can also link, and link and link - ALL Tolkien’s books, etc.\n\nThe other stuff in the stack just makes this happen.\n
  47. Add URIs and each is unambiguous. \n\nYou can also link, and link and link - ALL Tolkien’s books, etc.\n\nThe other stuff in the stack just makes this happen.\n
  48. Add URIs and each is unambiguous. \n\nYou can also link, and link and link - ALL Tolkien’s books, etc.\n\nThe other stuff in the stack just makes this happen.\n
  49. Add URIs and each is unambiguous. \n\nYou can also link, and link and link - ALL Tolkien’s books, etc.\n\nThe other stuff in the stack just makes this happen.\n
  50. \n
  51. \n
  52. data.gov.uk, data.gov, and many more\n
  53. also World Cup, Natural History, Programmes, and more…\nData-driven organisation. Record once, use in many places.\nBecome a nodal point on the web.\n
  54. linked and open is better.\nJISC Linked Data Horizon Scan.\n
  55. \n
  56. \n
  57. most recent version, from September 2010\nSize of the Cloud demonstrates interest… but you’ll rarely/ever use them all - look how sparse the connections are.\n
  58. \n
  59. \n
  60. \n
  61. \n
  62. Archive or Agora?\nPreservation or New Use?\nNOT mutually exclusive.\n
  63. \n
  64. \n