SlideShare uma empresa Scribd logo
1 de 59
International Collaboration Networks in
the Emerging (Big) Data Science
HanWoo Park
Dept. of Media & Communication
YeungNam University
214-1 Dae-dong, Gyeongsan-si,
Gyeongsangbuk-do 712-749
Republic of Korea
www.hanpark.net
Loet Leydesdorff
Amsterdam School of Communication
Research (ASCoR)
University of Amsterdam
Kloveniersburgwal 48, 1012 CX
Amsterdam, The Netherlands
loet@leydesdorff.net
This presentation is based on Park, H.W., & Leydesdorff, L. (2013 forthcoming). Decomposing Social and Semantic Networks
in Emerging “Big Data” Research. Journal of Informetrics*.
빅데이터의 개념 및 특징
데이터 사이언스 배경
(빅)데이터 R&D 동향
사회적 이슈 및 시사점
1.
3.
4.
2.
[목차]
Big data
 The term “big data” refers to “analytical technologies that
have existed for years but can now be applied faster, on
a greater scale and are accessible to more users. (Miller,
2013).
 Big data sizes may vary per discipline.
 Characteristics: Garner’s 3Vs plus SAS’s VC and IBM’s
Veracity
- Volume (amount of data), Velocity (speed of data in and
out), Variety (range of data types and sources)
- Variability: Data flows can be highly inconsistent with
daily, seasonal, and event-triggered peak data loads
- Complexity: Multiple data sources requiring cleaning,
linking, and matching the data across system
- Veracity: 1 in 3 business leaders don’t trust the
information they use to make decisions.
http://en.wikipedia.org/wiki/Big_data
http://www-01.ibm.com/software/data/bigdata/
Data-driven Research that focuses
on extracting meaningful data from
techno-socio-economic systems to
discover some hidden patterns.
빅데이터의 개념 및 특징
데이터 사이언스 배경
(빅)데이터 R&D 동향
사회적 이슈 및 시사점
1.
3.
4.
2.
[목차]
“Data Science” refers to “a discipline that incorporates
varying elements and builds on techniques and theories
from many fields, including data visualization with the goal of
extracting meaning from data and creating data products.”
http://en.wikipedia.org/wiki/Data_science
Today’s “big” is probably tomorrow’s “medium” and
next week’s “small” and thus the most effective defini-
tion of “big data” may be derived when the size of data
itself becomes part of the research problem.
Loukides (2012)
Origin of Data Science
 One is Peter Naur’s 1974 book “Concise Survey of Computer Methods”,
a survey of contemporary data processing methods in a wide range of
applications (Gilpress, 2012).
 The other is when the term “big data” first appeared in 1970 in the
Scopus database (Halevi and Moed, 2012). There was no particular key
milestone since 1970s.
 During the 1990s period, the term had been usually related to computer
modeling and software development for large datasets. Knowledge
Discovery and Data Mining in 1997. Rousseau (2012) also regards the
1993 publication as the first documents indexed in the Web version of
Web of Science.
A more recent development was made with
the establishment of journals that included the
term “Data Science” in their titles:
• Data Science Journal in 2002
• Journal of Data Science in 2003
• EPJ Data Science in 2012
• Journal of Big Data in 2013
• GigaScience gigasciencejournal.com in 2012
Science published a special
issue (February 11, 2011) looking
broadly at increasingly data-driven
research efforts as a scientific
domain (Science staff, 2011).
Data Science is composed of interrelated
clusters of research tasks. For example, the
technologies on data collection, curation, and
access, and the unique skill sets have
increasingly been central to Data Science
(Science staff, 2011).
An international conference called “Data Science
Summit” (http://www.greenplum.com/datasciencesummit).
http://novaspivack.typepad.com/nova_spivacks_weblog/2007/02/steps_towards_a.html 에서 재인용
All models are wrong but some are useful
Emergence of data author on dataverse
Andersons claims
 Data is everything we need.
 We don't have to settle for models.
 Agnostic statistics.
 Out with every theory of human behavior.
 This approach to science — hypothesize, model,
test — is becoming obsolete.
 Petabytes allow us to say: "Correlation is enough."
We can stop looking for models.
 What can science learn from Google? E-Science.
Computational (Social) Science
Park, H.W., & Leydesdorff, L. (2013 Work-In-Progress). Decomposing a Data-Driven Science Using a Scientometric Method.
 Focus on the methodological perspective based on
the use of new digital tools to manage the data deluge.
 Development of e-science tools to automate
research process.
 Experimentation with new types of data
visualization.
http://participatorysociety.org/wiki/index.
php?title=Online_Research
Why Data Science?
Savage and Burrows (2007, p.
886) lament, “Fifty years ago,
academic social scientists might
be seen as occupying the apex
of the – generally limited – social
science research ‘apparatus’.
Now they occupy an increasingly
marginal position in the huge
research infrastructure”.
Bonacich, P. (2004).
The Invasion of the Physicists. Social Networks 26(3): 285-288
This approach to science is attributed to the late Jim Gray,
one of the most influential computer scientists, at Microsoft.
“The fourth paradigm”
Research purpose lies in handling huge
amounts of data from technological,
sociological, and economic systems to
discover some hidden patterns.
Jim Gray
Global Communication 2team
(빅) 데이터과학의 도전
이론의 종말-증거기반 경영
Jeffrey Pfeffer, Robert I. Sutton (2006)
How companies can bolster performance and trump the
competition through evidence-based management, an
approach to decision-making and action that is driven by
hard facts rather than half-truths or hype.
· 빅데이터의 등장으로 전통적인
과학 연구방법론 퇴색
· 인식의 한계치를 넘어선 데이
터 (팩트가아닌패턴)
The Signal and the Noise:
Why Most Predictions Fail but Some Don't. Nate Silver
I do not go as far as a Popper in asserting that such
theories are therefore unscientific or that they lack any
value. However, the fact that the few theories we can
test have produced quite poor results suggests that
many of the ideas we haven’t tested are very wrong as
well. We are undoubtedly living with many delusions
that we do not even realize.
page 15
OECD (2012). OECDTechnology Foresight Forum
2012 - Harnessing data as a new source of growth:
Big data analytics and policies. OECD Headquarters,
Paris, France 22 October 2012
Big data and the end of theory?
 Does big data have the answers? Maybe some, but not all, says -
Mark Graham
 In 2008, Chris Anderson, then editor of Wired, wrote a
provocative piece titled The End of Theory. Anderson was
referring to the ways that computers, algorithms, and big data can
potentially generate more insightful, useful, accurate, or true
results than specialists or domain experts who traditionally craft
carefully targeted hypotheses and research strategies.
 We may one day get to the point where sufficient quantities of big
data can be harvested to answer all of the social questions that
most concern us. I doubt it though. There will always be digital
divides; always be uneven data shadows; and always be biases in
how information and technology are used and produced.
 And so we shouldn't forget the important role of specialists to
contextualize and offer insights into what our data do, and maybe
more importantly, don't tell us.
http://www.guardian.co.uk/news/datablog/2012/mar/09/big-data-theory
빅데이터의 개념 및 특징
데이터 사이언스 배경
(빅)데이터 R&D 동향
사회적 이슈 및 시사점
1.
3.
4.
2.
[목차]
Number of “Big data” papers per year
Halevi, G., & Moed, H. F. (2012).
Rousseau (2012)
We performed a similar search in the WoS (TS=“Big data”) on October 2,
2012, leading to 142 articles. We removed the oldest one (1974), and
kept 141 published during the period 1993-2012). Halevi and Moed
observed an over-exponential growth over the period 1970-2011, while
we found a growth curve that could best be described by a cubic
polynomial (R2=0.963, with year 1992=0), which is illustrated in Fig. 1.
Subject areas researching Big Data
Halevi, G., & Moed, H. F. (2012).
Rousseau (2012)
Geographical Distribution of Big Data papers
Halevi, G., & Moed, H. F. (2012).
Rousseau (2012)
Phrase map of highly occurring keywords 1999-2005
Halevi, G., & Moed, H. F. (2012).
Phrase map of highly occurring keywords 2006-2012
Halevi, G., & Moed, H. F. (2012).
Park, H. W., & Leydesdorff, L. (2013 Work-In-Progress). Decomposing a Data-Driven Science Using a Scientometric Method.
 But, Halevi and Moed (2012), and Rousseau (2012) are
based on descriptive statistics. Therefore, we intend to add
the network perspective both in the social (in terms of co-
authorship) and semantic networks.
 Furthermore, we extend search queries to various
terminologies related to Data Science because the term
“big data” is regarded only as one among a list of policy
priority issues.
 We show where the research system in Data Science is
“hot” in terms of international collaborations and
prevailing semantics.
Problem Statement
Previous studies have not systematically
examined whether research efforts driven by
various sources of big data are really becoming
increasingly widespread across the world.
Further, the status of the literature based on big
data has not been extensively discussed or
sufficiently examined with respect to its
semantic variations, disciplinary scope,
institutional adoption, and international
collaboration.
 We employed a method rooted in the social network analysis
(SNA) (Hanneman & Riddle, 2005).
 Here the unit of analysis is often the node, which refers to a
point in a network where ties cross or connect nodes.
 A tie is a connection between parts (i.e., nodes) in a network.
 We considered countries as nodes and a tie as the number of
papers co-authored by a pair of researchers with different
addresses in terms of their country of origin.
 We considered papers published in SCI journals in 2011.
 we selected three types of documents: journal articles, letters,
and reviews.
 We obtained the data from the DVD version of the SCI data-
base by using several search terms based on titles, author key
words, and keyword-plus.
As expected, the global co-authorship network was far
denser than the subnetwork, that is, co-authorship in
big data research. Note that these were not really co-
authorship relationships between countries but
relationships between them measured in terms of co-
authorship relationships.The sum of ties in the global
network and that of the subnetwork were 1,073,764
and 10,798, respectively. In addition, the global network
was more centralized around hub countries than the
network of big data science in terms of all three
measures of centrality. However, the QAP correlation
between the whole 2011 co-authorship network and
big data research demonstrates their significant
relationship: this (Pearson) correlation was .740 (p
< .001).
Network Type Density (S.D.)
Centralization (%)
Degree Node Flow
Global 26.71 (245.70) 5.11 10.08 9.83
Big Data 0.01 (0.18) 4.37 2.70 2.28
N=201.
Comparison of Density and CentralizationValues
Rank Country Degree Rank Country Betweenness Rank Country FlowBet
1 U.S. 4.450 1 U.S. 2.734 1 USA 2.309
2 GERMANY 1.650 2 FRANCE 1.253 2 FRANCE 0.929
3 U.K. 1.600 3 U.K. 0.680 3 CANADA 0.537
4 FRANCE 1.400 4 CANADA 0.643 4 ITALY 0.510
5 AUSTRALIA 1.150 5 ITALY 0.620 5 UK 0.377
6 NETHERLANDS 1.150 6 AUSTRALIA 0.602 6
SOUTH_KORE
A
0.359
7 CHINA 1.100 7 SOUTH_KOREA 0.346 7 BELGIUM 0.331
8 DENMARK 0.950 8 GERMANY 0.291 8 AUSTRALIA 0.328
9 CANADA 0.900 9 BELGIUM 0.290 9 JAPAN 0.262
10 TAIWAN 0.850 10 PORTUGAL 0.266 10 SLOVENIA 0.200
11 ISRAEL 0.750 11 JAPAN 0.256 11 PORTUGAL 0.185
12 SOUTH_KOREA 0.750 12 CHINA 0.137 12 CHINA 0.132
13 SWEDEN 0.750 13 NETHERLAND 0.104 13 SPAIN 0.129
14 ITALY 0.700 14 DENMARK 0.099 14 GERMANY 0.108
15 PORTUGAL 0.700 15 SAUDI_ARABIA 0.088 15 MALAYSIA 0.103
16 IRELAND 0.650 16 SLOVENIA 0.068 16 TANZANIA 0.095
17 NORWAY 0.650 17 TAIWAN 0.057 17 VENEZUELA 0.095
18 SPAIN 0.650 18 SPAIN 0.055 18 NETHERLANDS 0.089
19 SINGAPORE 0.500 19 ISRAEL 0.037 19 SAUDI_ARABIA 0.071
20 SWITZERLAND 0.450 20 AUSTRIA 0.036 20 AUSTRIA 0.063
Table 4. CentralityValues for Countries
Rank Country Effectiveness Rank Country Efficiency Rank Country Constrain
1 U.K. 13.071 1 EGYPT 1.000 1 DENMARK 0.312
2 AUSTRALIA 12.879 2 INDIA 1.000 2 NETHERLAND 0.331
3 FRANCE 12.562 3 POLAND 1.000 3 PORTUGAL 0.338
4 U.S. 11.563 4 UZBEKISTAN 1.000 4 ISRAEL 0.343
5 GERMANY 10.746 5 GREECE 0.805 5 NORWAY 0.345
6 NETHERLANDS 8.873 6 JAPAN 0.789 6 IRELAND 0.352
7 DENMARK 8.530 7 AUSTRIA 0.725 7 UK 0.364
8 PORTUGAL 8.229 8 BRAZIL 0.722 8 SWEDEN 0.365
9 ISRAEL 8.208 9 NEW_ZEALAND 0.722 9 AUSTRALIA 0.381
10 CANADA 7.672 10 MALAYSIA 0.698 10 GERMANY 0.397
11 ITALY 7.554 11 AUSTRALIA 0.678 11 FRANCE 0.411
12 IRELAND 7.252 12 SAUDI_ARABIA 0.667 12 CANADA 0.532
13 NORWAY 7.214 13 IRAN 0.667 13 ITALY 0.535
14 SOUTH_KOREA 6.365 14 THAILAND 0.667 14 SAUDI_ARABIA 0.548
15 CHINA 6.057 15 SINGAPORE 0.659 15 SWITZERLAND 0.556
16 SWEDEN 5.978 16 CZECH_REPUBLIC 0.644 16 USA 0.573
17
JAPAN 5.520
17
CANADA 0.639
17
SOUTH_KORE
A
0.578
18 TAIWAN 5.490 18 SLOVENIA 0.638 18 BELGIUM 0.583
19 SPAIN 5.312 19 SOUTH_KOREA 0.636 19 SPAIN 0.625
20 SWITZERLAND 4.224 20 PORTUGAL 0.633 20 TAIWAN 0.627
Table 5. Structural HoleValues by Country
International Co-Authorship Network of Big Data Research
Semantic Network of Paper Titles in Big Data
(50 Most Frequently OccurringTerms with the Cosine ≥ 0.1)
Semantic Network of PaperTitles and Countries in Big Data
(50 Most Frequently OccurringTerms and theTop 20 Countries with the Cosine ≥ 0.2)
빅데이터의 개념 및 특징
데이터 사이언스 배경
(빅)데이터 R&D 동향
사회적 이슈 및 시사점
1.
3.
4.
2.
[목차]
 Internationally co-authored papers in the field of data science
have generally focused on primary technologies.
 SCI papers do not necessarily focus on conceptually new me-
thodologies for analyzing and synthesizing massive data sets.
The results suggest the emergence of some new subjects such
as MapReduce.
 The U.S. was central in various aspects because of its connec-
tions with E.U. member countries as well as individual Asian
countries.
 Various European countries are the second most central posi-
tions based on centrality measures.
 In terms of structural hole indicators, some smaller and less
advanced countries were more efficient than effective in terms
of controlling central positions.
 The results suggest that a combination of words and locations
in a two-mode network can provide a richer representation of
the emerging field of big data science than the sum of two re-
presentations.
Yet, there still are serious problems to overcome. A trenchant
critique concerning the big data field as it is nowadays came in
the form of six statements intending to temper unbridled
enthusiasm. [42] These six provocative statements are:
 Big data change the definition of knowledge;
 Claims to accuracy and objectivity are misleading;
 More data are not always better data;
 Taken out of context, big data loses its meaning;
 Just because it is accessible, it does not make it ethical; and
 (Limited) access to big data creates a new digital divide.
Rousseau (2012)
Global Communication 2team
빅데이터에 대한 부정적인 시각 등장
-빅데이터의 가치
-저장, 분석 및 해석기술 한계 존재
-현재의 붐은 호들갑스러운 측면 존재
빅데이터 갭: PromiseVS Capabilities
빅데이터의 도전
Global Communication 2team
빅데이터의 도전 빅데이터 ‘Gap’ 분석사례
· 151명 연방 정부 CIO및 IT관리자 대상 빅데이터갭 조사실
시 .
· 실질적으로 현재 데이터를 제대로 활용하고 있는 기관도
적으며, 데이터소유권 문제도 확립되지 않은 것으로 나타
[美정부 IT네트워크 ‘Meritalk’는 빅데이터의 가
능성과 현실에는 Gap이 존재한다고 분석]
http://www.forbes.com/sites/kashmirhill/2012/02/16/how-target-figured-out-a-teen-girl-was-pregnant-before-her-father-did/
어떤 실험을 하는지 우리는 알고 있는가?
http://www.nature.com/news/facebook-experiment-boosts-us-voter-turnout-1.11401
우리는 정확히 인지하지 못한 채 동의했다
User Content VS Site Content
대부분의 SNS 서비스는 “User Content”를 무력
하게 만드는 “Site Content” 규정이 있음 (p. 60).
Issues in “Big Data” Internet Research
Cugelman, B., Thelwall, M. & Dawes, P. (in press). The psychology of online behavioural influence interventions: a meta
analysis. Journal of Medical Internet Research.
 Health Information Privacy Protection Act (HIPPA) in U.S. put
strict limit on the sharing of an individual’s health information,
• 병원에서 수술 등을 생중계하는 것은 어떻게 해결:
트위터를 가장 활발하게 이용하고 있는 ‘헨리 포드 병원’ 외에
도 현재 미국에서 트위터, 페이스북, 유튜브 등 소셜 네트워
크 서비스를 적극 활용하는 병원이 늘어나고 있는 추세임
• 건강용 스마트폰 Application 개발
Global Communication 2team
3.결론및
시사점
기술+사회문화적 요소에 대한 면밀한 검토
- 빅데이터 및 AI 논의에서 빠지지 않는 것이 개인정보 유출 및 사생활
침해와 같은 역기능 문제
- 기술의 발전과 더불어 우리가 원하는 미래상에 대한 명확한 이해와,
이를 달성하기 위한 정치사회적 기반에 대한 근본적인 모색이 중요.
박한우 교수는 2012년 2월에 미국에서 벌어진
사건을 예로 들었다. 영국의 대학생 두 명이 미국에
입국하면서 로스앤젤레스 공항을 폭파하겠다는
말을 트위터에 썼는데 이것이 미국 정부에
적발됐다. 박 교수는 “이 경우 정부는 트위터
전체가 아니라 트위터에 글을 올린 사람을, 올린
것을 규제한 것인데 미국 정부가 일상적으로
트위터를 들여 다본다는 문제로 번졌다”고
설명했다.
Prof. Han Woo PARK
World Class University Webometrics Institute
CyberEmotions Research Center
Department of Media and Communication,
YeungNam University, Korea
hanpark@ynu.ac.kr www.hanpark.net
이 슬라이드 작성에 도움을 준 사이버감성연구소 연구원들과
학부 /대학원 강의 수강생에게 고마움을 표시합니다.
이 슬라이드는 개인적 목적으로 만든 비공개 자료입니다.
배포 및 복사를 금지합니다.

Mais conteúdo relacionado

Mais procurados

Prov-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationProv-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationRinke Hoekstra
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Paul Groth
 
End-to-End Learning for Answering Structured Queries Directly over Text
End-to-End Learning for  Answering Structured Queries Directly over Text End-to-End Learning for  Answering Structured Queries Directly over Text
End-to-End Learning for Answering Structured Queries Directly over Text Paul Groth
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper ProvenancePaul Groth
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Rinke Hoekstra
 
Internet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam UniversityInternet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam Universitymwe400
 
Big data divided (24 march2014)
Big data divided (24 march2014)Big data divided (24 march2014)
Big data divided (24 march2014)Han Woo PARK
 
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...Stefan Dietze
 
Internet Prospective Study
Internet Prospective StudyInternet Prospective Study
Internet Prospective StudyjournalBEEI
 
The Roots: Linked data and the foundations of successful Agriculture Data
The Roots: Linked data and the foundations of successful Agriculture DataThe Roots: Linked data and the foundations of successful Agriculture Data
The Roots: Linked data and the foundations of successful Agriculture DataPaul Groth
 
Wire Workshop: Overview slides for ArchiveHub Project
Wire Workshop: Overview slides for ArchiveHub ProjectWire Workshop: Overview slides for ArchiveHub Project
Wire Workshop: Overview slides for ArchiveHub Projectmwe400
 
Big Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesBig Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesKaran Deep Singh
 
Data Science - Poster - Kirk Borne - RDAP12
Data Science - Poster - Kirk Borne - RDAP12Data Science - Poster - Kirk Borne - RDAP12
Data Science - Poster - Kirk Borne - RDAP12ASIS&T
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platformsYannis Charalabidis
 
Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...
Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...
Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...Artificial Intelligence Institute at UofSC
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3varshakumar21
 

Mais procurados (20)

Prov-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance VisualizationProv-O-Viz: Interactive Provenance Visualization
Prov-O-Viz: Interactive Provenance Visualization
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 
End-to-End Learning for Answering Structured Queries Directly over Text
End-to-End Learning for  Answering Structured Queries Directly over Text End-to-End Learning for  Answering Structured Queries Directly over Text
End-to-End Learning for Answering Structured Queries Directly over Text
 
Democratizing Data Science in the Cloud
Democratizing Data Science in the CloudDemocratizing Data Science in the Cloud
Democratizing Data Science in the Cloud
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
 
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
 
Internet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam UniversityInternet Archives and Social Science Research - Yeungnam University
Internet Archives and Social Science Research - Yeungnam University
 
Big data divided (24 march2014)
Big data divided (24 march2014)Big data divided (24 march2014)
Big data divided (24 march2014)
 
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
 
Internet Prospective Study
Internet Prospective StudyInternet Prospective Study
Internet Prospective Study
 
The Roots: Linked data and the foundations of successful Agriculture Data
The Roots: Linked data and the foundations of successful Agriculture DataThe Roots: Linked data and the foundations of successful Agriculture Data
The Roots: Linked data and the foundations of successful Agriculture Data
 
Wire Workshop: Overview slides for ArchiveHub Project
Wire Workshop: Overview slides for ArchiveHub ProjectWire Workshop: Overview slides for ArchiveHub Project
Wire Workshop: Overview slides for ArchiveHub Project
 
Big Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesBig Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and Issues
 
10 problems 06
10 problems 0610 problems 06
10 problems 06
 
Data Science - Poster - Kirk Borne - RDAP12
Data Science - Poster - Kirk Borne - RDAP12Data Science - Poster - Kirk Borne - RDAP12
Data Science - Poster - Kirk Borne - RDAP12
 
Domain-specific Knowledge Extraction from the Web of Data
Domain-specific Knowledge Extraction from the Web of DataDomain-specific Knowledge Extraction from the Web of Data
Domain-specific Knowledge Extraction from the Web of Data
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platforms
 
Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...
Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...
Hemant Purohit PhD Defense: Mining Citizen Sensor Communities for Cooperation...
 
Data science.chapter-1,2,3
Data science.chapter-1,2,3Data science.chapter-1,2,3
Data science.chapter-1,2,3
 
Broad Data
Broad DataBroad Data
Broad Data
 

Destaque

온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호
온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호
온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호datasciencekorea
 
R의 이해와 활용_데이터사이언스학회
R의 이해와 활용_데이터사이언스학회R의 이해와 활용_데이터사이언스학회
R의 이해와 활용_데이터사이언스학회datasciencekorea
 
농업 빅데이터를 활용한 병해충 발생 예측 모형
농업 빅데이터를 활용한 병해충 발생 예측 모형농업 빅데이터를 활용한 병해충 발생 예측 모형
농업 빅데이터를 활용한 병해충 발생 예측 모형datasciencekorea
 
소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점
소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점
소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점datasciencekorea
 
텍스톰을 이용한 SNA 분석 -전채남
텍스톰을 이용한 SNA 분석 -전채남텍스톰을 이용한 SNA 분석 -전채남
텍스톰을 이용한 SNA 분석 -전채남datasciencekorea
 
데이터시장의 트렌드와 예측 - 이영환
데이터시장의 트렌드와 예측 - 이영환 데이터시장의 트렌드와 예측 - 이영환
데이터시장의 트렌드와 예측 - 이영환 datasciencekorea
 
Data-driven biomedical science: implications for human disease and public health
Data-driven biomedical science: implications for human disease and public healthData-driven biomedical science: implications for human disease and public health
Data-driven biomedical science: implications for human disease and public healthdatasciencekorea
 
Analyzing Big Data to Discover Honest Signals of Innovation
Analyzing Big Data to Discover Honest Signals of InnovationAnalyzing Big Data to Discover Honest Signals of Innovation
Analyzing Big Data to Discover Honest Signals of Innovationdatasciencekorea
 
Structures of Twitter Crowds and Conversations Six distinct types of crowds t...
Structures of Twitter Crowds and Conversations Six distinct types of crowds t...Structures of Twitter Crowds and Conversations Six distinct types of crowds t...
Structures of Twitter Crowds and Conversations Six distinct types of crowds t...datasciencekorea
 
DATA CENTRIC EDUCATION & LEARNING
 DATA CENTRIC EDUCATION & LEARNING DATA CENTRIC EDUCATION & LEARNING
DATA CENTRIC EDUCATION & LEARNINGdatasciencekorea
 
국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603
국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603
국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603datasciencekorea
 
데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석
데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석
데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석datasciencekorea
 
2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선
2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선
2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선datasciencekorea
 
도시의 마음, 그 발현 - Emergent Mind of City
도시의 마음, 그 발현 - Emergent Mind of City도시의 마음, 그 발현 - Emergent Mind of City
도시의 마음, 그 발현 - Emergent Mind of Citydatasciencekorea
 
Studying Social Selection vs Social Influence in Virtual Financial Communities
Studying Social Selection vs Social Influence in Virtual Financial CommunitiesStudying Social Selection vs Social Influence in Virtual Financial Communities
Studying Social Selection vs Social Influence in Virtual Financial Communitiesdatasciencekorea
 
빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원
빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원
빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원datasciencekorea
 
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중datasciencekorea
 
A Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of TagsA Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of Tagsdatasciencekorea
 
소셜미디어 분석방법론과 사례
소셜미디어 분석방법론과 사례소셜미디어 분석방법론과 사례
소셜미디어 분석방법론과 사례datasciencekorea
 
데이터 시각화의 글로벌 동향 20140819 - 고영혁
데이터 시각화의 글로벌 동향   20140819 - 고영혁데이터 시각화의 글로벌 동향   20140819 - 고영혁
데이터 시각화의 글로벌 동향 20140819 - 고영혁datasciencekorea
 

Destaque (20)

온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호
온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호
온라인 데이터 분석을 통한 선거예측- 김찬우, 조인호
 
R의 이해와 활용_데이터사이언스학회
R의 이해와 활용_데이터사이언스학회R의 이해와 활용_데이터사이언스학회
R의 이해와 활용_데이터사이언스학회
 
농업 빅데이터를 활용한 병해충 발생 예측 모형
농업 빅데이터를 활용한 병해충 발생 예측 모형농업 빅데이터를 활용한 병해충 발생 예측 모형
농업 빅데이터를 활용한 병해충 발생 예측 모형
 
소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점
소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점
소셜 텍스트 빅 테이터를 통해 분석한 화장품 유통구조 시사점
 
텍스톰을 이용한 SNA 분석 -전채남
텍스톰을 이용한 SNA 분석 -전채남텍스톰을 이용한 SNA 분석 -전채남
텍스톰을 이용한 SNA 분석 -전채남
 
데이터시장의 트렌드와 예측 - 이영환
데이터시장의 트렌드와 예측 - 이영환 데이터시장의 트렌드와 예측 - 이영환
데이터시장의 트렌드와 예측 - 이영환
 
Data-driven biomedical science: implications for human disease and public health
Data-driven biomedical science: implications for human disease and public healthData-driven biomedical science: implications for human disease and public health
Data-driven biomedical science: implications for human disease and public health
 
Analyzing Big Data to Discover Honest Signals of Innovation
Analyzing Big Data to Discover Honest Signals of InnovationAnalyzing Big Data to Discover Honest Signals of Innovation
Analyzing Big Data to Discover Honest Signals of Innovation
 
Structures of Twitter Crowds and Conversations Six distinct types of crowds t...
Structures of Twitter Crowds and Conversations Six distinct types of crowds t...Structures of Twitter Crowds and Conversations Six distinct types of crowds t...
Structures of Twitter Crowds and Conversations Six distinct types of crowds t...
 
DATA CENTRIC EDUCATION & LEARNING
 DATA CENTRIC EDUCATION & LEARNING DATA CENTRIC EDUCATION & LEARNING
DATA CENTRIC EDUCATION & LEARNING
 
국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603
국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603
국가의 신성장 동력으로서 공간정보의 가치와 활용 2016-0603
 
데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석
데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석
데이터사이언스학회 5월 세미나 데이터저널리즘과 트위터네트워크 분석
 
2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선
2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선
2015-4 혁신기술로서의 빅데이터 국내 기술수용 초기 특성연구- 김정선
 
도시의 마음, 그 발현 - Emergent Mind of City
도시의 마음, 그 발현 - Emergent Mind of City도시의 마음, 그 발현 - Emergent Mind of City
도시의 마음, 그 발현 - Emergent Mind of City
 
Studying Social Selection vs Social Influence in Virtual Financial Communities
Studying Social Selection vs Social Influence in Virtual Financial CommunitiesStudying Social Selection vs Social Influence in Virtual Financial Communities
Studying Social Selection vs Social Influence in Virtual Financial Communities
 
빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원
빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원
빅데이터 기술을 활용한 뉴스 큐레이션 서비스 - 온병원
 
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
Deep Learning - 인공지능 기계학습의 새로운 트랜드 :김인중
 
A Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of TagsA Unified Music Recommender System Using Listening Habits and Semantics of Tags
A Unified Music Recommender System Using Listening Habits and Semantics of Tags
 
소셜미디어 분석방법론과 사례
소셜미디어 분석방법론과 사례소셜미디어 분석방법론과 사례
소셜미디어 분석방법론과 사례
 
데이터 시각화의 글로벌 동향 20140819 - 고영혁
데이터 시각화의 글로벌 동향   20140819 - 고영혁데이터 시각화의 글로벌 동향   20140819 - 고영혁
데이터 시각화의 글로벌 동향 20140819 - 고영혁
 

Semelhante a International Collaboration Networks in the Emerging (Big) Data Science

Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생Han Woo PARK
 
Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)Han Woo PARK
 
Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyeroiisdp
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...The Higher Education Academy
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterOSTHUS
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeJosh Cowls
 
Data Science definition
Data Science definitionData Science definition
Data Science definitionCarloLauro1
 
Let's talk about Data Science
Let's talk about Data ScienceLet's talk about Data Science
Let's talk about Data ScienceCarlo Lauro
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextMurad Daryousse
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-ResearchEric Meyer
 
4th_paradigm_book_complete_lr
4th_paradigm_book_complete_lr4th_paradigm_book_complete_lr
4th_paradigm_book_complete_lrDominic A Ienco
 
“Big data” in human services organisations: Practical problems and ethical di...
“Big data” in human services organisations: Practical problems and ethical di...“Big data” in human services organisations: Practical problems and ethical di...
“Big data” in human services organisations: Practical problems and ethical di...husITa
 
Analíticas del aprendizaje: una perspectiva crítica
Analíticas del aprendizaje: una perspectiva críticaAnalíticas del aprendizaje: una perspectiva crítica
Analíticas del aprendizaje: una perspectiva críticaCENT
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)James Hendler
 
Dutch Cooking with xAPI Recipes, The Good, the Bad, and the Consistent
Dutch Cooking with xAPI Recipes, The Good, the Bad, and the ConsistentDutch Cooking with xAPI Recipes, The Good, the Bad, and the Consistent
Dutch Cooking with xAPI Recipes, The Good, the Bad, and the ConsistentHendrik Drachsler
 
Social Science Landscape for Web Observatories
Social Science Landscape for Web ObservatoriesSocial Science Landscape for Web Observatories
Social Science Landscape for Web ObservatoriesDavid De Roure
 
Big Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday LearningBig Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday LearningStefan Dietze
 
Bi(G) data: opportunities for BI Professionals
Bi(G) data: opportunities for BI ProfessionalsBi(G) data: opportunities for BI Professionals
Bi(G) data: opportunities for BI ProfessionalsAlbert Besselse
 

Semelhante a International Collaboration Networks in the Emerging (Big) Data Science (20)

Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생Mapping (big) data science (15 dec2014)대학(원)생
Mapping (big) data science (15 dec2014)대학(원)생
 
Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)Ict와 사회과학지식간 학제간 연구동향(23 march2013)
Ict와 사회과학지식간 학제간 연구동향(23 march2013)
 
Ralph schroeder and eric meyer
Ralph schroeder and eric meyerRalph schroeder and eric meyer
Ralph schroeder and eric meyer
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
Big Data Research Trend and Forecast (2005-2015): An Informetrics Perspective
Big Data Research Trend and Forecast (2005-2015): An Informetrics PerspectiveBig Data Research Trend and Forecast (2005-2015): An Informetrics Perspective
Big Data Research Trend and Forecast (2005-2015): An Informetrics Perspective
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Data Science definition
Data Science definitionData Science definition
Data Science definition
 
Let's talk about Data Science
Let's talk about Data ScienceLet's talk about Data Science
Let's talk about Data Science
 
Semantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data ContextSemantic Web Investigation within Big Data Context
Semantic Web Investigation within Big Data Context
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-Research
 
4th_paradigm_book_complete_lr
4th_paradigm_book_complete_lr4th_paradigm_book_complete_lr
4th_paradigm_book_complete_lr
 
Big data survey
Big data surveyBig data survey
Big data survey
 
“Big data” in human services organisations: Practical problems and ethical di...
“Big data” in human services organisations: Practical problems and ethical di...“Big data” in human services organisations: Practical problems and ethical di...
“Big data” in human services organisations: Practical problems and ethical di...
 
Analíticas del aprendizaje: una perspectiva crítica
Analíticas del aprendizaje: una perspectiva críticaAnalíticas del aprendizaje: una perspectiva crítica
Analíticas del aprendizaje: una perspectiva crítica
 
Broad Data (India 2015)
Broad Data (India 2015)Broad Data (India 2015)
Broad Data (India 2015)
 
Dutch Cooking with xAPI Recipes, The Good, the Bad, and the Consistent
Dutch Cooking with xAPI Recipes, The Good, the Bad, and the ConsistentDutch Cooking with xAPI Recipes, The Good, the Bad, and the Consistent
Dutch Cooking with xAPI Recipes, The Good, the Bad, and the Consistent
 
Social Science Landscape for Web Observatories
Social Science Landscape for Web ObservatoriesSocial Science Landscape for Web Observatories
Social Science Landscape for Web Observatories
 
Big Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday LearningBig Data in Learning Analytics - Analytics for Everyday Learning
Big Data in Learning Analytics - Analytics for Everyday Learning
 
Bi(G) data: opportunities for BI Professionals
Bi(G) data: opportunities for BI ProfessionalsBi(G) data: opportunities for BI Professionals
Bi(G) data: opportunities for BI Professionals
 

Mais de datasciencekorea

스마트 시티의 빅데이터 분석론 - 최준영
스마트 시티의 빅데이터 분석론 - 최준영스마트 시티의 빅데이터 분석론 - 최준영
스마트 시티의 빅데이터 분석론 - 최준영datasciencekorea
 
데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁
데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁
데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁datasciencekorea
 
Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석datasciencekorea
 
온라인 물가지수 분석을 위한 빅데이터 융합분석 방법
온라인 물가지수 분석을 위한 빅데이터 융합분석 방법온라인 물가지수 분석을 위한 빅데이터 융합분석 방법
온라인 물가지수 분석을 위한 빅데이터 융합분석 방법datasciencekorea
 
Use of Big Data Technology in the area of Video Analytics
Use of Big Data Technology in the area of Video AnalyticsUse of Big Data Technology in the area of Video Analytics
Use of Big Data Technology in the area of Video Analyticsdatasciencekorea
 
빅 데이터 비즈니스 모델
빅 데이터 비즈니스 모델빅 데이터 비즈니스 모델
빅 데이터 비즈니스 모델datasciencekorea
 

Mais de datasciencekorea (6)

스마트 시티의 빅데이터 분석론 - 최준영
스마트 시티의 빅데이터 분석론 - 최준영스마트 시티의 빅데이터 분석론 - 최준영
스마트 시티의 빅데이터 분석론 - 최준영
 
데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁
데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁
데이터에 포함된 동적 패턴의 탐색과 해석을 위한 협업적 탐험 플랫폼 -최진혁
 
Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석Bayesian Network 을 활용한 예측 분석
Bayesian Network 을 활용한 예측 분석
 
온라인 물가지수 분석을 위한 빅데이터 융합분석 방법
온라인 물가지수 분석을 위한 빅데이터 융합분석 방법온라인 물가지수 분석을 위한 빅데이터 융합분석 방법
온라인 물가지수 분석을 위한 빅데이터 융합분석 방법
 
Use of Big Data Technology in the area of Video Analytics
Use of Big Data Technology in the area of Video AnalyticsUse of Big Data Technology in the area of Video Analytics
Use of Big Data Technology in the area of Video Analytics
 
빅 데이터 비즈니스 모델
빅 데이터 비즈니스 모델빅 데이터 비즈니스 모델
빅 데이터 비즈니스 모델
 

Último

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 

International Collaboration Networks in the Emerging (Big) Data Science

  • 1. International Collaboration Networks in the Emerging (Big) Data Science HanWoo Park Dept. of Media & Communication YeungNam University 214-1 Dae-dong, Gyeongsan-si, Gyeongsangbuk-do 712-749 Republic of Korea www.hanpark.net Loet Leydesdorff Amsterdam School of Communication Research (ASCoR) University of Amsterdam Kloveniersburgwal 48, 1012 CX Amsterdam, The Netherlands loet@leydesdorff.net This presentation is based on Park, H.W., & Leydesdorff, L. (2013 forthcoming). Decomposing Social and Semantic Networks in Emerging “Big Data” Research. Journal of Informetrics*.
  • 2. 빅데이터의 개념 및 특징 데이터 사이언스 배경 (빅)데이터 R&D 동향 사회적 이슈 및 시사점 1. 3. 4. 2. [목차]
  • 3. Big data  The term “big data” refers to “analytical technologies that have existed for years but can now be applied faster, on a greater scale and are accessible to more users. (Miller, 2013).  Big data sizes may vary per discipline.  Characteristics: Garner’s 3Vs plus SAS’s VC and IBM’s Veracity - Volume (amount of data), Velocity (speed of data in and out), Variety (range of data types and sources) - Variability: Data flows can be highly inconsistent with daily, seasonal, and event-triggered peak data loads - Complexity: Multiple data sources requiring cleaning, linking, and matching the data across system - Veracity: 1 in 3 business leaders don’t trust the information they use to make decisions. http://en.wikipedia.org/wiki/Big_data http://www-01.ibm.com/software/data/bigdata/
  • 4. Data-driven Research that focuses on extracting meaningful data from techno-socio-economic systems to discover some hidden patterns.
  • 5.
  • 6. 빅데이터의 개념 및 특징 데이터 사이언스 배경 (빅)데이터 R&D 동향 사회적 이슈 및 시사점 1. 3. 4. 2. [목차]
  • 7. “Data Science” refers to “a discipline that incorporates varying elements and builds on techniques and theories from many fields, including data visualization with the goal of extracting meaning from data and creating data products.” http://en.wikipedia.org/wiki/Data_science
  • 8. Today’s “big” is probably tomorrow’s “medium” and next week’s “small” and thus the most effective defini- tion of “big data” may be derived when the size of data itself becomes part of the research problem. Loukides (2012)
  • 9. Origin of Data Science  One is Peter Naur’s 1974 book “Concise Survey of Computer Methods”, a survey of contemporary data processing methods in a wide range of applications (Gilpress, 2012).  The other is when the term “big data” first appeared in 1970 in the Scopus database (Halevi and Moed, 2012). There was no particular key milestone since 1970s.  During the 1990s period, the term had been usually related to computer modeling and software development for large datasets. Knowledge Discovery and Data Mining in 1997. Rousseau (2012) also regards the 1993 publication as the first documents indexed in the Web version of Web of Science.
  • 10. A more recent development was made with the establishment of journals that included the term “Data Science” in their titles: • Data Science Journal in 2002 • Journal of Data Science in 2003 • EPJ Data Science in 2012 • Journal of Big Data in 2013 • GigaScience gigasciencejournal.com in 2012
  • 11. Science published a special issue (February 11, 2011) looking broadly at increasingly data-driven research efforts as a scientific domain (Science staff, 2011). Data Science is composed of interrelated clusters of research tasks. For example, the technologies on data collection, curation, and access, and the unique skill sets have increasingly been central to Data Science (Science staff, 2011).
  • 12. An international conference called “Data Science Summit” (http://www.greenplum.com/datasciencesummit).
  • 14. All models are wrong but some are useful Emergence of data author on dataverse
  • 15. Andersons claims  Data is everything we need.  We don't have to settle for models.  Agnostic statistics.  Out with every theory of human behavior.  This approach to science — hypothesize, model, test — is becoming obsolete.  Petabytes allow us to say: "Correlation is enough." We can stop looking for models.  What can science learn from Google? E-Science.
  • 16. Computational (Social) Science Park, H.W., & Leydesdorff, L. (2013 Work-In-Progress). Decomposing a Data-Driven Science Using a Scientometric Method.  Focus on the methodological perspective based on the use of new digital tools to manage the data deluge.  Development of e-science tools to automate research process.  Experimentation with new types of data visualization.
  • 18.
  • 19. Why Data Science? Savage and Burrows (2007, p. 886) lament, “Fifty years ago, academic social scientists might be seen as occupying the apex of the – generally limited – social science research ‘apparatus’. Now they occupy an increasingly marginal position in the huge research infrastructure”. Bonacich, P. (2004). The Invasion of the Physicists. Social Networks 26(3): 285-288
  • 20. This approach to science is attributed to the late Jim Gray, one of the most influential computer scientists, at Microsoft.
  • 21. “The fourth paradigm” Research purpose lies in handling huge amounts of data from technological, sociological, and economic systems to discover some hidden patterns. Jim Gray
  • 22. Global Communication 2team (빅) 데이터과학의 도전 이론의 종말-증거기반 경영 Jeffrey Pfeffer, Robert I. Sutton (2006) How companies can bolster performance and trump the competition through evidence-based management, an approach to decision-making and action that is driven by hard facts rather than half-truths or hype. · 빅데이터의 등장으로 전통적인 과학 연구방법론 퇴색 · 인식의 한계치를 넘어선 데이 터 (팩트가아닌패턴)
  • 23. The Signal and the Noise: Why Most Predictions Fail but Some Don't. Nate Silver I do not go as far as a Popper in asserting that such theories are therefore unscientific or that they lack any value. However, the fact that the few theories we can test have produced quite poor results suggests that many of the ideas we haven’t tested are very wrong as well. We are undoubtedly living with many delusions that we do not even realize. page 15
  • 24. OECD (2012). OECDTechnology Foresight Forum 2012 - Harnessing data as a new source of growth: Big data analytics and policies. OECD Headquarters, Paris, France 22 October 2012
  • 25. Big data and the end of theory?  Does big data have the answers? Maybe some, but not all, says - Mark Graham  In 2008, Chris Anderson, then editor of Wired, wrote a provocative piece titled The End of Theory. Anderson was referring to the ways that computers, algorithms, and big data can potentially generate more insightful, useful, accurate, or true results than specialists or domain experts who traditionally craft carefully targeted hypotheses and research strategies.  We may one day get to the point where sufficient quantities of big data can be harvested to answer all of the social questions that most concern us. I doubt it though. There will always be digital divides; always be uneven data shadows; and always be biases in how information and technology are used and produced.  And so we shouldn't forget the important role of specialists to contextualize and offer insights into what our data do, and maybe more importantly, don't tell us. http://www.guardian.co.uk/news/datablog/2012/mar/09/big-data-theory
  • 26. 빅데이터의 개념 및 특징 데이터 사이언스 배경 (빅)데이터 R&D 동향 사회적 이슈 및 시사점 1. 3. 4. 2. [목차]
  • 27. Number of “Big data” papers per year Halevi, G., & Moed, H. F. (2012).
  • 28. Rousseau (2012) We performed a similar search in the WoS (TS=“Big data”) on October 2, 2012, leading to 142 articles. We removed the oldest one (1974), and kept 141 published during the period 1993-2012). Halevi and Moed observed an over-exponential growth over the period 1970-2011, while we found a growth curve that could best be described by a cubic polynomial (R2=0.963, with year 1992=0), which is illustrated in Fig. 1.
  • 29. Subject areas researching Big Data Halevi, G., & Moed, H. F. (2012).
  • 31. Geographical Distribution of Big Data papers Halevi, G., & Moed, H. F. (2012).
  • 33. Phrase map of highly occurring keywords 1999-2005 Halevi, G., & Moed, H. F. (2012).
  • 34. Phrase map of highly occurring keywords 2006-2012 Halevi, G., & Moed, H. F. (2012).
  • 35. Park, H. W., & Leydesdorff, L. (2013 Work-In-Progress). Decomposing a Data-Driven Science Using a Scientometric Method.  But, Halevi and Moed (2012), and Rousseau (2012) are based on descriptive statistics. Therefore, we intend to add the network perspective both in the social (in terms of co- authorship) and semantic networks.  Furthermore, we extend search queries to various terminologies related to Data Science because the term “big data” is regarded only as one among a list of policy priority issues.  We show where the research system in Data Science is “hot” in terms of international collaborations and prevailing semantics.
  • 36. Problem Statement Previous studies have not systematically examined whether research efforts driven by various sources of big data are really becoming increasingly widespread across the world. Further, the status of the literature based on big data has not been extensively discussed or sufficiently examined with respect to its semantic variations, disciplinary scope, institutional adoption, and international collaboration.
  • 37.  We employed a method rooted in the social network analysis (SNA) (Hanneman & Riddle, 2005).  Here the unit of analysis is often the node, which refers to a point in a network where ties cross or connect nodes.  A tie is a connection between parts (i.e., nodes) in a network.  We considered countries as nodes and a tie as the number of papers co-authored by a pair of researchers with different addresses in terms of their country of origin.
  • 38.  We considered papers published in SCI journals in 2011.  we selected three types of documents: journal articles, letters, and reviews.  We obtained the data from the DVD version of the SCI data- base by using several search terms based on titles, author key words, and keyword-plus.
  • 39. As expected, the global co-authorship network was far denser than the subnetwork, that is, co-authorship in big data research. Note that these were not really co- authorship relationships between countries but relationships between them measured in terms of co- authorship relationships.The sum of ties in the global network and that of the subnetwork were 1,073,764 and 10,798, respectively. In addition, the global network was more centralized around hub countries than the network of big data science in terms of all three measures of centrality. However, the QAP correlation between the whole 2011 co-authorship network and big data research demonstrates their significant relationship: this (Pearson) correlation was .740 (p < .001).
  • 40. Network Type Density (S.D.) Centralization (%) Degree Node Flow Global 26.71 (245.70) 5.11 10.08 9.83 Big Data 0.01 (0.18) 4.37 2.70 2.28 N=201. Comparison of Density and CentralizationValues
  • 41. Rank Country Degree Rank Country Betweenness Rank Country FlowBet 1 U.S. 4.450 1 U.S. 2.734 1 USA 2.309 2 GERMANY 1.650 2 FRANCE 1.253 2 FRANCE 0.929 3 U.K. 1.600 3 U.K. 0.680 3 CANADA 0.537 4 FRANCE 1.400 4 CANADA 0.643 4 ITALY 0.510 5 AUSTRALIA 1.150 5 ITALY 0.620 5 UK 0.377 6 NETHERLANDS 1.150 6 AUSTRALIA 0.602 6 SOUTH_KORE A 0.359 7 CHINA 1.100 7 SOUTH_KOREA 0.346 7 BELGIUM 0.331 8 DENMARK 0.950 8 GERMANY 0.291 8 AUSTRALIA 0.328 9 CANADA 0.900 9 BELGIUM 0.290 9 JAPAN 0.262 10 TAIWAN 0.850 10 PORTUGAL 0.266 10 SLOVENIA 0.200 11 ISRAEL 0.750 11 JAPAN 0.256 11 PORTUGAL 0.185 12 SOUTH_KOREA 0.750 12 CHINA 0.137 12 CHINA 0.132 13 SWEDEN 0.750 13 NETHERLAND 0.104 13 SPAIN 0.129 14 ITALY 0.700 14 DENMARK 0.099 14 GERMANY 0.108 15 PORTUGAL 0.700 15 SAUDI_ARABIA 0.088 15 MALAYSIA 0.103 16 IRELAND 0.650 16 SLOVENIA 0.068 16 TANZANIA 0.095 17 NORWAY 0.650 17 TAIWAN 0.057 17 VENEZUELA 0.095 18 SPAIN 0.650 18 SPAIN 0.055 18 NETHERLANDS 0.089 19 SINGAPORE 0.500 19 ISRAEL 0.037 19 SAUDI_ARABIA 0.071 20 SWITZERLAND 0.450 20 AUSTRIA 0.036 20 AUSTRIA 0.063 Table 4. CentralityValues for Countries
  • 42. Rank Country Effectiveness Rank Country Efficiency Rank Country Constrain 1 U.K. 13.071 1 EGYPT 1.000 1 DENMARK 0.312 2 AUSTRALIA 12.879 2 INDIA 1.000 2 NETHERLAND 0.331 3 FRANCE 12.562 3 POLAND 1.000 3 PORTUGAL 0.338 4 U.S. 11.563 4 UZBEKISTAN 1.000 4 ISRAEL 0.343 5 GERMANY 10.746 5 GREECE 0.805 5 NORWAY 0.345 6 NETHERLANDS 8.873 6 JAPAN 0.789 6 IRELAND 0.352 7 DENMARK 8.530 7 AUSTRIA 0.725 7 UK 0.364 8 PORTUGAL 8.229 8 BRAZIL 0.722 8 SWEDEN 0.365 9 ISRAEL 8.208 9 NEW_ZEALAND 0.722 9 AUSTRALIA 0.381 10 CANADA 7.672 10 MALAYSIA 0.698 10 GERMANY 0.397 11 ITALY 7.554 11 AUSTRALIA 0.678 11 FRANCE 0.411 12 IRELAND 7.252 12 SAUDI_ARABIA 0.667 12 CANADA 0.532 13 NORWAY 7.214 13 IRAN 0.667 13 ITALY 0.535 14 SOUTH_KOREA 6.365 14 THAILAND 0.667 14 SAUDI_ARABIA 0.548 15 CHINA 6.057 15 SINGAPORE 0.659 15 SWITZERLAND 0.556 16 SWEDEN 5.978 16 CZECH_REPUBLIC 0.644 16 USA 0.573 17 JAPAN 5.520 17 CANADA 0.639 17 SOUTH_KORE A 0.578 18 TAIWAN 5.490 18 SLOVENIA 0.638 18 BELGIUM 0.583 19 SPAIN 5.312 19 SOUTH_KOREA 0.636 19 SPAIN 0.625 20 SWITZERLAND 4.224 20 PORTUGAL 0.633 20 TAIWAN 0.627 Table 5. Structural HoleValues by Country
  • 43. International Co-Authorship Network of Big Data Research
  • 44. Semantic Network of Paper Titles in Big Data (50 Most Frequently OccurringTerms with the Cosine ≥ 0.1)
  • 45. Semantic Network of PaperTitles and Countries in Big Data (50 Most Frequently OccurringTerms and theTop 20 Countries with the Cosine ≥ 0.2)
  • 46. 빅데이터의 개념 및 특징 데이터 사이언스 배경 (빅)데이터 R&D 동향 사회적 이슈 및 시사점 1. 3. 4. 2. [목차]
  • 47.  Internationally co-authored papers in the field of data science have generally focused on primary technologies.  SCI papers do not necessarily focus on conceptually new me- thodologies for analyzing and synthesizing massive data sets. The results suggest the emergence of some new subjects such as MapReduce.
  • 48.  The U.S. was central in various aspects because of its connec- tions with E.U. member countries as well as individual Asian countries.  Various European countries are the second most central posi- tions based on centrality measures.  In terms of structural hole indicators, some smaller and less advanced countries were more efficient than effective in terms of controlling central positions.  The results suggest that a combination of words and locations in a two-mode network can provide a richer representation of the emerging field of big data science than the sum of two re- presentations.
  • 49. Yet, there still are serious problems to overcome. A trenchant critique concerning the big data field as it is nowadays came in the form of six statements intending to temper unbridled enthusiasm. [42] These six provocative statements are:  Big data change the definition of knowledge;  Claims to accuracy and objectivity are misleading;  More data are not always better data;  Taken out of context, big data loses its meaning;  Just because it is accessible, it does not make it ethical; and  (Limited) access to big data creates a new digital divide. Rousseau (2012)
  • 50. Global Communication 2team 빅데이터에 대한 부정적인 시각 등장 -빅데이터의 가치 -저장, 분석 및 해석기술 한계 존재 -현재의 붐은 호들갑스러운 측면 존재 빅데이터 갭: PromiseVS Capabilities 빅데이터의 도전
  • 51. Global Communication 2team 빅데이터의 도전 빅데이터 ‘Gap’ 분석사례 · 151명 연방 정부 CIO및 IT관리자 대상 빅데이터갭 조사실 시 . · 실질적으로 현재 데이터를 제대로 활용하고 있는 기관도 적으며, 데이터소유권 문제도 확립되지 않은 것으로 나타 [美정부 IT네트워크 ‘Meritalk’는 빅데이터의 가 능성과 현실에는 Gap이 존재한다고 분석]
  • 53. 어떤 실험을 하는지 우리는 알고 있는가? http://www.nature.com/news/facebook-experiment-boosts-us-voter-turnout-1.11401
  • 54. 우리는 정확히 인지하지 못한 채 동의했다
  • 55. User Content VS Site Content 대부분의 SNS 서비스는 “User Content”를 무력 하게 만드는 “Site Content” 규정이 있음 (p. 60).
  • 56. Issues in “Big Data” Internet Research Cugelman, B., Thelwall, M. & Dawes, P. (in press). The psychology of online behavioural influence interventions: a meta analysis. Journal of Medical Internet Research.  Health Information Privacy Protection Act (HIPPA) in U.S. put strict limit on the sharing of an individual’s health information, • 병원에서 수술 등을 생중계하는 것은 어떻게 해결: 트위터를 가장 활발하게 이용하고 있는 ‘헨리 포드 병원’ 외에 도 현재 미국에서 트위터, 페이스북, 유튜브 등 소셜 네트워 크 서비스를 적극 활용하는 병원이 늘어나고 있는 추세임 • 건강용 스마트폰 Application 개발
  • 57.
  • 58. Global Communication 2team 3.결론및 시사점 기술+사회문화적 요소에 대한 면밀한 검토 - 빅데이터 및 AI 논의에서 빠지지 않는 것이 개인정보 유출 및 사생활 침해와 같은 역기능 문제 - 기술의 발전과 더불어 우리가 원하는 미래상에 대한 명확한 이해와, 이를 달성하기 위한 정치사회적 기반에 대한 근본적인 모색이 중요. 박한우 교수는 2012년 2월에 미국에서 벌어진 사건을 예로 들었다. 영국의 대학생 두 명이 미국에 입국하면서 로스앤젤레스 공항을 폭파하겠다는 말을 트위터에 썼는데 이것이 미국 정부에 적발됐다. 박 교수는 “이 경우 정부는 트위터 전체가 아니라 트위터에 글을 올린 사람을, 올린 것을 규제한 것인데 미국 정부가 일상적으로 트위터를 들여 다본다는 문제로 번졌다”고 설명했다.
  • 59. Prof. Han Woo PARK World Class University Webometrics Institute CyberEmotions Research Center Department of Media and Communication, YeungNam University, Korea hanpark@ynu.ac.kr www.hanpark.net 이 슬라이드 작성에 도움을 준 사이버감성연구소 연구원들과 학부 /대학원 강의 수강생에게 고마움을 표시합니다. 이 슬라이드는 개인적 목적으로 만든 비공개 자료입니다. 배포 및 복사를 금지합니다.