Presentation at Where 2.0 2008 where we discuss our rationale for building a NoSQL data store after reaching the limitations of the spatial SQL solutions that were available at the time.
Semantic Web Investigation within Big Data Context
Where 2.0 NoSQL Presentation 2008 - GeoIQ
1. From Data Chaos to Actionable Intelligence: How the Convergence of the GeoWeb and Semantic Web is Revolutionizing the Way We Process Information Where 2.0 2008 Sean Gorman FortiusOne Inc.
23. Node “Y” is connected to Node “X” by the link “User Action”: a user adds structured dataset “X” to their 3rd party URL string “Y”. Structured Dataset “X” Third Party URL “Y”
24. Structured Dataset “X” Third Party URL “Y” Third Party URL “Z”
Data Set | Degree
X | 2
Y | 1
Z | 1
25. Creates the Ability to Intelligently Serve Content. A user searches on a term and gets a result based on tags and a full-text query, then weighted by degree and user ratings. The system can now recommend data URLs “Y” and “Z” as data that may be useful context for dataset “X”, even though they may have no tags in common. The graph can be encapsulated in code and communicated to the outside world. Structured Dataset “X” Third Party URL “Y” Third Party URL “Z”
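The degree table and recommendation logic in the slides above can be sketched in a few lines of Python. This is an illustrative toy, not the actual GeoCommons implementation; the node names and the tie-breaking rule are assumptions.

```python
from collections import defaultdict

# Hypothetical mini-graph: each edge is a user action linking a
# structured dataset to a third-party URL (names are illustrative).
edges = [("X", "Y"), ("X", "Z")]

# Degree = number of links touching each node, as in the slide's table.
degree = defaultdict(int)
neighbors = defaultdict(set)
for a, b in edges:
    degree[a] += 1
    degree[b] += 1
    neighbors[a].add(b)
    neighbors[b].add(a)

def recommend(dataset):
    """Recommend neighboring content, highest-degree first
    (ties broken alphabetically), even when the items share
    no tags with the query dataset."""
    return sorted(neighbors[dataset], key=lambda n: (-degree[n], n))

print(degree["X"])     # 2, matching the Data Set / Degree table
print(recommend("X"))  # ['Y', 'Z']
```

A production system would combine this degree weighting with the tag and full-text scores mentioned in the slide; here only the graph component is shown.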
26. 1. Bring a new class of content/data to the Web 2. Intelligently mix that content with the rest of the Web 3. Enable the content to answer meaningful questions for users
27. If you would like to check out the Finder! beta, see us for a card or go here: http://finder.geocommons.com/ If you would like to talk about federating geodata, email me at: [email_address]
Editor's Notes
Our story started with trying to GeoHack while avoiding Johnny Law
Specifically, we started off researching the geography of the web: what is the structure of the plumbing of the Internet, the autonomous system graph, the router graph, IP addresses. It was a great collaboration with colleagues at UCL, Berkeley, NYU, and GMU.
Our specialty was really big data sets like the router and AS graphs. We’d use statistical mechanics and graph theory to model and analyze the data, then map those logical results to their physical realities.
This resulted in discovering little tidbits like how to take down the NYSE.
Resulting in the guys in dark suits showing up
Fortunately the folks at IQT – the intelligence communities venture fund – rescued us.
IQT has been part of several key technologies driving the GeoWeb. One of their early investments was MetaCarta, the folks behind OpenLayers, which many of us in the community benefit from. They were also the funding behind Keyhole, now commonly known as Google Earth, the iconic application of the GeoWeb. They continued to fund the leading edge with technologies like SketchUp. IQT and the government at large have been a big impetus and source of funding pushing GeoWeb innovation.
While we started by researching the “Geography of the Web”, it was “Geography on the Web” that had captured the world’s attention: the creation of mirror worlds on the web with amazing satellite imagery and 3D visualizations.
But… the majority of the data populating these mirror worlds was small and largely descriptive text and photos.
Leveraging our love for big data sets, we developed GeoCommons, a crowdsourced repository of large structured datasets with quantitative capabilities.
Yes, last year on this stage we launched GeoCommons. So what happened? Where did it go in one year?
We got lots of people contributing data; so much data, in fact, that we killed our database.
This chart shows why. The problem with big datasets is that they fill up databases very rapidly. 95% of the datasets in GeoCommons had over 100 features, over half had more than 5,000 features, and several had more than 100,000 features.
The problem was that we were trying to store and query heterogeneous data at scale. To do this we had to normalize the data, causing the tables to get huge. Then we had to optimize not only to manage the table size, but to do that for all query types and functions. It was a Sisyphean cycle: everything helped but nothing solved the problem, and the rock rolled to the bottom of the hill each time. So, you can keep fighting the battle…
Leveraging a lightweight object database construct, we were able to rapidly store, mine, access, and translate data; we were able to fit well over a billion features into 15 gigs of storage. The best illustration is to see a demo of the platform in action.
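To make the contrast with normalized SQL tables concrete, here is a minimal sketch of the object-database idea: each heterogeneous feature is serialized and compressed as one blob keyed by id, so differing schemas never force wide, sparse tables. This is an illustrative assumption about the approach, not the actual GeoCommons storage engine.

```python
import json
import zlib

class ObjectStore:
    """Toy lightweight object store: one compressed blob per feature,
    no shared table layout to normalize or optimize."""

    def __init__(self):
        self._blobs = {}

    def put(self, feature_id, feature):
        # Serialize the whole feature, whatever its attributes,
        # and compress it; heterogeneous schemas coexist freely.
        self._blobs[feature_id] = zlib.compress(json.dumps(feature).encode())

    def get(self, feature_id):
        return json.loads(zlib.decompress(self._blobs[feature_id]))

store = ObjectStore()
# Two features with completely different attribute sets:
store.put(1, {"lat": 38.9, "lon": -77.0, "crime_rate": 4.2})
store.put(2, {"lat": 40.7, "lon": -74.0, "tags": ["housing"]})
print(store.get(1)["crime_rate"])  # 4.2
```

The design trade-off: reads and writes of whole objects are cheap and compact, while ad-hoc relational queries across attributes require separate indexing, which is the bargain NoSQL-style stores accept.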
Introducing Finder!
We are going to leverage the GeoCommons platform to next build Maker, a cartographically empowered map creation tool, and Atlas, a collaborative application that allows users to tell multimedia stories around maps.
We are not the only folks bringing geo-content to the web. There are many clever people doing innovative work to bring more rich data to the masses, but how can we begin to interconnect it all?
Can we federate our data?
We’ve been having some early discussions on how we can create a network of data repositories to form a graph of content. Data stored securely and linked together – where the cloud and the data are owned by everyone.
Allowing a user to grab a URL from one source via web services and map it with a structured dataset from another source via object federation. By consciously mapping these two datasets together, the user has created semantic meaning between them, going beyond just the syntax of tags. For instance, a user may map crime rates from EveryBlock with housing prices from Zillow to solve the problem of where to buy a house. The two datasets could have no tags or syntax relating them, but now they have a far more meaningful semantic relationship.
These relations form a graph relating content semantically across the web. Whether you implement it with RDF, Atom, or any other practical approach, you have a highly efficient computational construct for relating content.
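The EveryBlock/Zillow example above can be sketched as RDF-style triples: each user mapping action emits one subject-predicate-object statement, and recommendations fall out of a simple graph walk. The dataset identifiers and the `geo:mappedWith` predicate are invented for illustration, not a real vocabulary.

```python
# Each user "mapping" action links two datasets with a triple.
# Identifiers and predicate are hypothetical placeholders.
triples = [
    ("everyblock:crime_rates", "geo:mappedWith", "zillow:housing_prices"),
]

def related(dataset):
    """Datasets semantically linked to `dataset`, in either
    direction, regardless of any shared tags."""
    linked = set()
    for subj, pred, obj in triples:
        if subj == dataset:
            linked.add(obj)
        elif obj == dataset:
            linked.add(subj)
    return linked

print(related("zillow:housing_prices"))  # {'everyblock:crime_rates'}
```

A real deployment would serialize these statements as RDF or Atom link elements, as the note suggests; the in-memory triple list just shows how little machinery the core idea needs.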
This allows you to make relevant recommendations to the user to solve meaningful problems.
Our goal is to deliver technology that accomplishes three big objectives. I believe that an ecosystem of companies striving for similar objectives will be what constitutes the next evolution of the Web.