Lessons Learned from LOD (Linked Open Data) Failure and Big Data: The Future Trend
Youngwhan Lee, Ph.D.
Phone: 010-7997-0345
Email: nicklee@konkuk.ac.kr
Facebook: Youngwhan Nick Lee
Twitter: nicklee002

Web Evolution and Big Data
Internet Today
2010:
• Estimated 10^11 Web pages in the world
2012:
• Social media: Facebook (1 billion monthly active users)
• From the invention of writing through 2003: 5 exabytes → as of 2012: 7 exabytes of data generated every day
Is “big data” a big pile of garbage?
Web Explosion and Big Data
• Number of Web users (Mar. 2012): 2.3 billion
• 10^11 Web pages in the world (est. 2010)
– The Web has existed for about 7,000 days (i.e. 20 years), so humans have created over 10 million pages a day on average.
• Digital information created in 2010: 1 zettabyte (10^21 bytes)
– “There were 5 exabytes of information created between the dawn of civilization through 2003, but that much information is now created every 2 days, and the pace is increasing.” (Eric Schmidt, 2010)
– In 2012, almost 7 exabytes are created every day.
• We call it “Big Data.”
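The arithmetic behind these figures can be checked with a short sketch. It uses only the slide's own estimates (10^11 pages, 7,000 days, 5 exabytes every 2 days in 2010, almost 7 exabytes a day in 2012). The variable names are illustrative, and decimal units (1 exabyte = 10^18 bytes) are assumed.

```python
# Back-of-the-envelope check; every input is one of the slide's estimates.
WEB_PAGES = 10**11        # estimated Web pages in the world (2010)
WEB_AGE_DAYS = 7_000      # roughly 20 years of Web history

pages_per_day = WEB_PAGES / WEB_AGE_DAYS

EXABYTE = 10**18          # bytes (decimal units assumed)
ZETTABYTE = 10**21        # bytes

rate_2010 = 5 * EXABYTE / 2   # 5 exabytes every 2 days
rate_2012 = 7 * EXABYTE       # almost 7 exabytes every day

growth = rate_2012 / rate_2010

print(f"pages per day: {pages_per_day:,.0f}")
print(f"2010 rate: {rate_2010 / EXABYTE:.1f} EB/day")
print(f"2012 rate is {growth:.1f}x the 2010 rate")
print(f"days to produce 1 ZB at the 2012 rate: {ZETTABYTE / rate_2012:.0f}")
```

The first figure comes out at roughly 14 million pages a day, which is consistent with the slide's "over 10 million".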

What does this mean?
[Figure: diagram adapted from the DIKW model. Labels: Aggregation; Understanding; data analysis; knowledge structuring; curation. Technologies shown: RIF, SPARQL, OWL, RDF, LOD; NoSQL, MapReduce, R-DBMS.]
Modified, based on Gene Bellinger, Durval Castro, Anthony Mills http://www.systems-thinking.org/dikw/dikw.htm , http://yjhyjh.egloos.com/39721
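Extracting Information and Knowledge from Big Data and the Web
• Information retrieval
– SEO (Search Engine Optimization), PageRank, EdgeRank
• Data mining: programs can extract information (knowledge)
– Statistical analysis, rule-based analysis, neural network analysis
– Visualization
Data science
• Using knowledge engineering
– Cumulative linking of ontologies using RDF/OWL
– Linking raw data and opening it up so that it can be analyzed (Linked Open Data; LOD)
– Programs can extract knowledge that supports logical analysis
• SPARQL
• RIF (Rule Interchange Format)
Knowledge engineering
• Using human effort: curation
– Filtering and synthesizing information with human eyes and human knowledge
• Examples: pinterest.com, videocooki.com, storify.com, scoop.it, curated.by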
Pareto’s Law
[Figure labels: Longtail, Bighead]
Long-Tail Phenomena in Information Domains
Adapted from The Long Tail by Chris Anderson (Wired, Oct. 2004)
[Figure: curve with a “Popularity” axis and regions labeled “Bighead Applications” and “Longtail Applications”. Application categories shown:]
• Mobile apps: iPhone apps, Android apps
• SNS apps: Facebook apps, Twitter apps
• LOD and others: medical apps, apps that use public information, …
Approaches in Knowledge Engineering
• Building ontologies
– Cyc
– WolframAlpha
– Siri
• Web of Data
– LOD → LOD2
Old “Layer Cake” of the Semantic Web
[Figure label: Information exchange]
RDF
OWL2
Linked Open Data (LOD) Principles
Linking Open Data (LOD) means connecting data and opening it to the public.
A little history of the LOD project
• Tim Berners-Lee proposed the LOD (Linking Open Data) project (2006).
• Since the proposal, numerous countries and organizations have participated, causing the amount of LOD data to explode.
• Wikipedia → DBpedia (www.dbpedia.org)
• The Bio2RDF project opened data in 27 fields of biology, genetics, and medicine, with data sets totaling about 2.3 billion items (Bio2RDF.org) (2008.10).
• The BBC announced that it would participate in the LOD project (www.bbc.org); it is now one of the institutions actively using the data.
• US Data.gov released 5 billion data triples.
• The US Library of Congress announced that it would join the LOD project (http://id.loc.gov/authorities/sh85042531#concept).
• The NY Times (data.nytimes.com) released data from 150 years of publication (2009.10).
• The US White House released a plan to open data in RDF (2009.11).
4 Principles of LOD
1. Use URIs as names for things
2. Use HTTP URIs
3. When someone looks up a URI, provide useful information
4. Include links to other URIs
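As a rough illustration of the four principles, the sketch below models a tiny web of data in memory. Everything in it is an assumption made for the example: the example.org URIs, the vocabulary terms, and the look_up and follow_links helpers are hypothetical, and a dictionary lookup stands in for a real HTTP request.

```python
# Toy in-memory "web of data": no network access, all URIs are made up.
# Each HTTP URI names a thing (principles 1 and 2) and maps to a description
# made of (predicate, object) pairs (principle 3); some objects are
# themselves URIs, which act as links to other things (principle 4).
WEB_OF_DATA = {
    "http://example.org/id/seoul": [
        ("http://example.org/vocab/label", "Seoul"),
        ("http://example.org/vocab/capitalOf", "http://example.org/id/south-korea"),
    ],
    "http://example.org/id/south-korea": [
        ("http://example.org/vocab/label", "South Korea"),
        ("http://example.org/vocab/capital", "http://example.org/id/seoul"),
    ],
}


def look_up(uri):
    """Stand-in for an HTTP GET on the URI: return its description, or an empty list."""
    return WEB_OF_DATA.get(uri, [])


def follow_links(start_uri):
    """Collect every URI reachable from start_uri by following links in descriptions."""
    seen, todo = set(), [start_uri]
    while todo:
        uri = todo.pop()
        if uri in seen:
            continue
        seen.add(uri)
        for _predicate, obj in look_up(uri):
            # Only objects that are themselves described here are followed as links.
            if obj in WEB_OF_DATA:
                todo.append(obj)
    return seen


reachable = follow_links("http://example.org/id/seoul")
print(sorted(reachable))
```

Starting from one URI, a program can discover related things simply by following links, which is the behavior the fourth principle is meant to enable.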
Advantages of LOD
• Elegant
• Expandable
• Flexible
• Powerful
• Decentralized
• Participatory
• Inclusive, and
• “Free” to use
Change of Web Structure
[Figure: two-panel diagram. First panel: user interface; web page links for humans; web page link bus. Second panel: the same three elements plus mashups; web data links for computers; web data link bus.]
[Figure: four snapshots dated May 2007, Mar. 2008, Sep. 2008, and July 2009]
SPARQL (SPARQL Protocol and RDF Query Language)
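The core idea of a SPARQL query is matching patterns that contain variables against RDF triples. The sketch below imitates that idea in plain Python rather than running real SPARQL: the ex: names, the toy data, and the match helper are all hypothetical, and only a single triple pattern is supported.

```python
# Toy triple store; the data and the "ex:" names are invented for illustration.
TRIPLES = [
    ("ex:TimBernersLee", "ex:proposed", "ex:LOD"),
    ("ex:TimBernersLee", "ex:proposed", "ex:WWW"),
    ("ex:DBpedia", "ex:derivedFrom", "ex:Wikipedia"),
    ("ex:DBpedia", "ex:partOf", "ex:LOD"),
]


def is_var(term):
    """Variables start with '?', as they do in SPARQL."""
    return term.startswith("?")


def match(pattern, triples=TRIPLES):
    """Return one {variable: value} binding per triple that fits the pattern."""
    results = []
    for triple in triples:
        binding = {}
        for p_term, t_term in zip(pattern, triple):
            if is_var(p_term):
                # A repeated variable must bind to the same value each time.
                if binding.setdefault(p_term, t_term) != t_term:
                    break
            elif p_term != t_term:
                break
        else:
            # No mismatch was found, so this triple satisfies the pattern.
            results.append(binding)
    return results


# Roughly: SELECT ?what WHERE { ex:TimBernersLee ex:proposed ?what }
answers = match(("ex:TimBernersLee", "ex:proposed", "?what"))
print(answers)
```

A real SPARQL engine joins many such patterns and adds filters, ordering, and aggregation, but each pattern is resolved in this spirit.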
Web 3.0: Merging the Two Perspectives
[Figure: diagram. Labels: Technology Innovation Perspective; Market Behavior Perspective; WWW Proposal (1989); Semantic Web; LOD Proposal (2006); “GGG” Proposal (2007); “WEB2” Proposal (2009); Knowledge-based Semantics; Data-based Semantics; Web 1.0; Web 2.0; Web 3.0; Next Generation Web; Technical Proposal Phase; Practical Use Phase.]
But no Champagne…
• Definition unclear
– Berners-Lee’s 4 principles are ambiguous
• Difficult to interpret
• Inconsistent
• Difficult both to learn and to use
• Difficult to build browsers and reasoners
• “Free” to use
– Full of incomplete and inconsistent RDF data, with no way to make it evolve
In short, “garbage in, garbage out” was the experience.
Solution to LOD problems: LOD2
• LOD2 Stack: A Technical Approach
– Linked Data Management
– Enrichment and Quality Improvement
– Various Tools to use
• Storage and Querying
• Revision and authoring
• Interlinking and fusing
• Classification and enrichment
• …
Q: Is this technical approach to LOD good enough?
A: A business approach is definitely needed.
Big Data
What did we do with big data in 2013?

What would we do with big data in 2014?
Big Data and Data Supremacy
“The End of Theory” by Chris Anderson
Implication
• Issue: the haves and the have-nots are divided
– E.g., in marketing
• 4Ps
– Price, product, place, promotion
• STP
– Segmentation, targeting, and positioning
Implication
• Is a technical approach needed?
Business Approach
• Data Markets
– Azure Data Marketplace
– Data.com
– Infochimps.com
– DataMarket.com
– Kaggle.com
Data Market: Azure Data Marketplace
Data Market: Data.com
Data Market: Infochimps.com
Data Market: DataMarket.com
Data Market: Kaggle.com
Conclusion
• Positioning for Korea
– Where are we?
– Where are we heading?
References
• Web 3.0 Is Changing the World (웹3.0 세상을 바꾸고 있다)
– Youngwhan Lee (이영환)

• A Semantic Web Primer (Cooperative Information Systems series)
– Grigoris Antoniou, Frank van Harmelen

• Semantic Web for the Working Ontologist, Second Edition: Effective Modeling in RDFS and OWL
– Dean Allemang, James Hendler

• Ontology: The Key to the Evolution of the Internet (온톨로지: 인터넷 진화의 열쇠)
– 노상규, 박진수

• World Wide Web (월드와이드웹)
– Tim Berners-Lee

• Curation (큐레이션)
– Steven Rosenbaum; translated by 이시은
Web sites
• Problems of Linked Data
– http://milicicvuk.com/blog/2011/07/26/problems-of-linked-data14-identity/

• LOD2
– http://lod2.eu/Welcome.html
– http://stack.lod2.eu/blog/

• How to Define Web 3.0
– http://howtosplitanatom.com/news/how-to-define-web-30-2/

• SPARQL by Example
– http://www.cambridgesemantics.com/semantic-university/sparqlby-example#(1)

• Practical P-P-P-Problems with Linked Data
– http://www.mkbergman.com/917/practical-p-p-p-problems-withlinked-data/

• Linked-Data-Api
– https://code.google.com/p/linked-data-api/
