Playing in the International Data Space
Mark A. Parsons
Rensselaer Polytechnic Institute
24 October 2013
Fundamentals of Data Curation
University of Illinois, Urbana-Champaign

Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
Stuff you should learn
• What international organizations are
• Who is organizing
• How they organize and how you can participate
• Why it matters to you and data science writ large and curation more specifically
Some history
• Biological classification or taxonomy
• Linnaeus’s Systema Naturae, 1735
• Darwin’s On the Origin of Species, 1859
• Encyclopedia of Life, 2007
• International Map of the World (standardized 1:1,000,000 map)
• Proposed 1891
• Begun 1913
• Never completed
• Metric system
• Introduced 1795
• Convention du Mètre, 1875
• International System of Units (SI) 1960
• One big holdout
• Time zones
• 1st use of standard (“railway”) time 1847
• International Meridian Conference 1884 established GMT but did not alter local times
• Final adoption of “standard offset” from GMT/UTC 1986
• Current number of time zones in China and in India: 1 each
Standards were envisioned and desired, but the evolution was slow, driven by nation states, and never entirely successful.
Some more history
1865 — International Telegraph Union
1871 — First International Geographical Congress
1873 — International Meteorological Organization (became WMO within UN in 1951)
1899 — International Association of Academies, later the International Council for Science (ICSU)
early 1900s — Many other physical science unions established
1926 — International Federation of Library Associations and Institutions (IFLA)
1945 — United Nations
1952 — International Social Science Council
1957 — World Data Centers
1966 — ICSU Committee on Data for Science and Technology (CODATA)
1994 — World Wide Web Consortium (W3C) (Web itself c.1990)
2013 — Research Data Alliance (RDA)
Scientific professional societies...
• Convene and build communities and thereby validate and promote their field.
• Create and assert community consensus.
• Create standards, ethical guidelines, best practices, and certifications.
• Educate the public and their members.
• Provide a record of the discipline through publications from gray to black.
• Pursue focussed initiatives to further scientific goals.
• Seek to maintain a privileged position—power.
• Seek to grow their membership.
• Can be self-perpetuating and conservative especially at international level.
Where I belong
Current

Past

• American Geophysical Union (AGU), primary affiliation Earth and Space Science Informatics $
• Association of American Geographers (AAG) $$
• IEEE (Institute of Electrical and Electronics Engineers) $$
• Research Data Alliance (RDA) free
• American Society for Information Science and Technology (ASIST) $$
• US Permafrost Association $
• International Permafrost Association*
• Federation of Earth Science Information Partners (ESIP) free
• Digital Curation Centre Associate free
• International Union of Geodesy and Geophysics (IUGG) Union Commission of Data and Information*
• CODATA*

*as an officer (organization does not have individual members)
Players in international (data) organizations
• Governments
• agencies—can act but not speak on policy (short term $)
• ambassadors—can influence policy but not programs (sustained $)
• Foundations and charities
• National Academies
• Universities and Research Institutes, especially their libraries
• Professional societies and other NGOs
• UN and other intergovernmental bodies
• Companies: tech. companies (databases, software, info services, etc.); commercial publishers; data re-adapters (weather companies, map makers in the broadest sense)
• Individuals
Managing Institutional Interplay
Slide courtesy Paul A. Berkman
Arctic Circle (map courtesy The Economist)
Managing Impacts Across Diverse Boundaries
Figure labels: Earth System, Law of the Sea, Meteorological, OSPAR, Navigational, NEAFC, Marine Ecosystem, Search and Rescue
Slide courtesy Paul A. Berkman
Categories of Organization
• Intergovernmental—UN, WMO, GEO, G8+5
• International
• “Unions”—ICSU (WDS, CODATA, ICSTI, IUGG/UCDI and other), ISSC
• Individual members—IEEE
• Organizational members—W3C
• Combined—RDA, IFLA
• Collaborative initiatives--Future Earth, GEOSS
Some international data organizations
• CODATA
• Mission: to strengthen international science for the benefit of society by promoting improved scientific and technical data management and use.
• Subunit of ICSU but has its own paid membership subscription for Academies and unions
• Individuals participate as representatives from an org. member, as a task force member, or by attending biennial meetings.
• WDS
• Mission: to ensure the long-term stewardship and provision of quality-assessed data and data services to the international science community and other stakeholders.
• Subunit of ICSU but members are certified data repositories and services. No fee but there is a certification process.
• Individuals participate as representatives from an org. member, as a working group member (jointly with RDA), or by attending biennial meetings (joint with CODATA?).
• Open Knowledge Foundation
• Mission: to promote open data and open content in all their forms.
• Non-profit with volunteer participation
• Individuals sign up and participate in working groups, local groups, and task forces. They also attend myriad conferences and “festivals”.
More international data organizations
• International Geospatial Society and Global Spatial Data Infrastructure Association
• Mission: to promote international cooperation and collaboration in support of local, national and international spatial data infrastructure developments that will allow nations to better address social, economic, and environmental issues of pressing importance.
• IGS is for individuals, GSDIA is for organizations with “at least a nation-wide influence.”
• Individuals participate in committees, conferences, and trainings.
• Open Geospatial Consortium
• Mission: to serve as a global forum for the collaboration of developers and users of spatial data products and services, and to advance the development of international standards for geospatial interoperability.
• Paid organizational membership for companies, government agencies, and universities.
• Individuals participate as representatives of their organizations, largely in standards development.
• DataCite
• Mission: to promote and facilitate data citation.
• Paid organizational membership for national library-type organizations.
• Individuals participate as representatives of their organization or attend the annual conference.
Some international data-related organizations
• W3C
• Mission: to lead the World Wide Web to its full potential by developing protocols and guidelines that ensure the long-term growth of the Web.
• Paid organizational membership. Many companies, some universities, some agencies.
• Individuals participate as representatives from an org. member, by participating in community groups and discussion fora.
• IEEE
• Mission: to foster technological innovation and excellence for the benefit of humanity.
• Professional society and standards body.
• Individual membership with many types of participation including local chapters addressing areas well beyond data.
• IFLA
• Mission: to further accessibility, protection, and preservation of documentary cultural heritage and to
promote and support libraries.
• Paid membership for associations, institutes, and individuals.
• Individuals participate in myriad specialty groups, sections, and programs and attend annual meetings.
Research Data Alliance
• An alliance of individuals, organizations, and associates.
• Mission: “RDA builds the social and technical bridges that enable open sharing of data.”
• A different sort of funding model: informally collaborating, hands-off agency support.
• A different sort of operating model recognizing the dynamics and tensions of developing infrastructure.
• Free individual membership, inexpensive organizational membership, affiliation with like-minded organizations.
• Grass roots driven.
• More tactical than strategic.
• Global and regional but independent of nations.
Data Citation Case Study
• Initial efforts in the late 90s - early 00s
• Right idea, little traction
• Partially conflated with the citing URLs issue
• A blossoming in the mid-late 00s.
• Multiple disciplines start developing approaches and guidelines
• DOI a big driver, esp. for DataCite, but other identifiers used too (including handles, LSIDs, UNFs, ARKs and good ol’ URI/Ls)
• A slightly competitive atmosphere
• Now in a consensus phase
• CODATA/ICSTI/National Academy report
• Force11 Manifesto
• RDA harmonization effort, which broadens and unites the community
• Implementation phase just started
• Happens locally
• Requires culture change so debates will continue
What should you do?
• Join RDA and participate in Interest and Working Groups.
• Watch for upcoming student internships and fellowships.
• Become a Digital Curation Centre Associate and attend one of their conferences (publishing opportunity).
• Attend a conference or two of some of the other organizations.
• Join a professional society in your scientific discipline.
• If you don’t have a scientific discipline, get one. Curation requires it.
• Attend their meeting and help develop a data section or focus group (if they don’t have one already).
Why you should do it and why it matters.
"Data Deluge," Brett Ryder, The Economist, Feb. 2010
Diverse snow crystal photos by Kenneth G. Libbrecht, snowcrystals.com
Distribution of NSF Awards by Dollar Value

© 2009 The Board of Trustees, University of Illinois

The long tail of science

Heidorn 2008
Surface-level diversity
(race, age, gender)
vs.
Deep-level diversity
(values, conceptual metaphors, personality)
Ashby’s Law of Requisite Variety

Only variety absorbs variety
One stop shop?
Or Grand Bazaar!


photo by Frank Kovalchek (CC-BY)
Metcalfe’s Law

The value of a network increases as the square of the number of nodes.
Map of the internet by the Opte Project [CC-BY] via Wikimedia Commons
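One common way to read Metcalfe's Law is to count the distinct pairwise links that n nodes could form, n(n-1)/2, which grows roughly as the square of n. The sketch below is only an illustration of that reading; the function name pairwise_links is hypothetical and does not come from the slides.

```python
def pairwise_links(nodes: int) -> int:
    """Count the distinct pairs that can be formed among `nodes` participants."""
    if nodes < 0:
        raise ValueError("nodes must be non-negative")
    # Each of the n nodes can pair with n - 1 others; halve to avoid double counting.
    return nodes * (nodes - 1) // 2


if __name__ == "__main__":
    for n in (2, 10, 100):
        print(f"{n} nodes -> {pairwise_links(n)} possible links")
```

Under this reading, a tenfold increase in nodes yields roughly a hundredfold increase in possible links, which is one argument for joining and connecting organizations rather than working alone.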
Networks or ecosystems often rely on “weak” links, so partner and build relationships
Increasing Complexity of Mediation
From: C. Borgman, 2008, NSF Cyberlearning Report
Themes from A. Tsing on Collaboration
Friction: An Ethnography of Global Connection

• “Actually existing universalisms are hybrid, transient, and involved in constant reformulation through dialogue.” They work out through friction.
• “There is no reason to think collaborators have common goals.”
• Unity and diversity cover each other up. Need to remember the local.
Where Good Ideas Come From
Steven Johnson
• The Adjacent Possible—the importance of local
• It’s often not “Eureka!” but rather a slow hunch fading into view over time.
• Hunches need to collide with other hunches, so create that environment. Don’t protect IP; share it. Connecting vs. protecting.
• Sharing of failures as well
• Create spaces for that to happen: virtual and real coffee shops
• “Chance favors the connected mind.”
Themes on Relationships
(I’m an introvert)
• The central challenge is diversity.
• We address it through variety and myriad interfaces and connections.
• Fostering relationships is central to community and data science.
• they build social capital—success through giving
• they uncover tacit knowledge
• they inform methods
Data Science Methods
• User-driven design is not just end user. Engage providers and funders too.
• Case studies not just use cases.
• Ethnography: study relationships, because data are often at the center of that interaction, as a boundary object.
• Agile is not just for software (courtesy Bruce Caron).
• Individuals and interactions over processes and tools
• Working volunteers over comprehensive documentation
• Member collaboration over contract negotiation
• Responding to change over following a plan.
Summary
• International (data) organizations grew out of the idealistic, deterministic blossoming of science.
• They are virtually infinite in their scope and number.
• They have many different forms and the best are highly adaptive and evolving (while retaining core principles).
• Only diversity absorbs diversity.
• Networking and interconnection are the way to solve complex problems.
• We are in a more global and democratic world, but also a more local world. This means coalition politics with new kinds of coalitions, because there are new kinds of identity.
• Data science and curation need to focus on relationships, connections, interfaces.
• You must participate “glocally” to succeed.

Ensuring Technical Readiness For Copilot in Microsoft 365
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 

Parsons on "Playing in the International Data Space"

  • 1. Playing in the International Data Space Mark A. Parsons Rensselaer Polytechnic Institute 24 October 2013 Fundamentals of Data Curation University of Illinois, Urbana-Champaign Unless otherwise noted, the slides in this presentation are licensed by Mark A. Parsons under a Creative Commons Attribution-Share Alike 3.0 License
  • 2. Stuff you should learn • What international organizations are • Who is organizing • How they organize and how you can participate • Why it matters to you and data science writ large and curation more specifically
  • 3. Some history • Biological classification or taxonomy • Linnaeus’s Systema Naturae, 1735 • Darwin’s On the Origin of Species, 1859 • Encyclopedia of Life, 2007 • International Map of the World (standardized 1:1,000,000 map) • Proposed 1891 • Begun 1913 • Never completed • Metric system • Introduced 1795 • Convention du Mètre, 1875 • International System of Units (SI) 1960 • One big holdout • Time zones • 1st use of standard (“railway”) time 1847 • International Meridian Conference 1884 established GMT but did not alter local times • Final adoption of “standard offset” from GMT/UTC 1986 • Current number of time zones in China and India: 1
  • 4. Some history • Biological classification or taxonomy • Linnaeus’s Systema Naturae, 1735 • Darwin’s On the Origin of Species, 1859 • Encyclopedia of Life, 2007 • International Map of the World (standardized 1:1,000,000 map) • Proposed 1891 • Begun 1913 • Never completed • Metric system • Introduced 1795 • Convention du Mètre, 1875 • International System of Units (SI) 1960 Standards were envisioned and desired, but the evolution was slow, driven by nation states, and never entirely successful. • One big holdout • Time zones • 1st use of standard (“railway”) time 1847 • International Meridian Conference 1884 established GMT but did not alter local times • Final adoption of “standard offset” from GMT/UTC 1986 • Current number of time zones in China and India: 1
  • 5. Some more history 1865 — International Telegraph Union 1871 — First International Geographical Congress 1873 — International Meteorological Organization (became WMO within UN in 1951) 1899 — International Association of Academies, later the International Council of Science (ICSU) early 1900s — Many other physical science unions established 1926 — International Federation of Library Associations and Institutions (IFLA) 1945 — United Nations 1952 — International Social Science Council 1957 — World Data Centers 1966 — ICSU Committee on Data for Science and Technology (CODATA) 1994 — World Wide Web Consortium (W3C) (Web itself c.1990) 2013 — Research Data Alliance (RDA)
  • 6. Scientific professional societies... • Convene and build communities and thereby validate and promote their field. • Create and assert community consensus. • Create standards, ethical guidelines, best practices, and certifications. • Educate the public and their members. • Provide a record of the discipline through publications from gray to black. • Pursue focussed initiatives to further scientific goals. • Seek to maintain a privileged position—power. • Seek to grow their membership. • Can be self-perpetuating and conservative especially at international level.
  • 7. Where I belong (current and past) • American Geophysical Union (AGU): primary affiliation, Earth and Space Science Informatics $ • Association of American Geographers (AAG) $$ • IEEE (Institute of Electrical and Electronics Engineers) $$ • Research Data Alliance (RDA) free • American Society for Information Science and Technology (ASIST) $$ • US Permafrost Association $ • International Permafrost Association* • Federation of Earth Science Information Partners (ESIP) free • Digital Curation Centre Associate free • International Union of Geodesy and Geophysics (IUGG) Union Commission on Data and Information* • CODATA* *as an officer (organization does not have individual members)
  • 8. Players in international (data) organizations • Governments • agencies—can act but not speak on policy (short term $) • ambassadors—can influence policy but not programs (sustained $) • Foundations and charities • National Academies • Universities and Research Institutes, especially their libraries • Professional societies and other NGOs • UN and other intergovernmental bodies • Companies—tech. companies (databases, software, info services, etc); commercial publishers; data re-adapters (weather companies, map makers in the broadest sense) • Individuals
  • 9. Managing Institutional Interplay Slide courtesy Paul A. Berkman
  • 12.
  • 13. Managing Impacts Across Diverse Boundaries Earth System Law of the Sea Meteorological OSPAR Navigational NEAFC Marine Ecosystem Search and Rescue Slide courtesy Paul A. Berkman
  • 14. Categories of Organization • Intergovernmental: UN, WMO, GEO, G8+5 • International • “Unions”: ICSU (WDS, CODATA, ICSTI, IUGG/UCDI, and others), ISSC • Individual members: IEEE • Organizational members: W3C • Combined: RDA, IFLA • Collaborative initiatives: Future Earth, GEOSS
  • 15. Some international data organizations • CODATA • Mission: to strengthen international science for the benefit of society by promoting improved scientific and technical data management and use. • Subunit of ICSU but has its own paid membership subscription for Academies and unions. • Individuals participate as representatives from an org. member, as a task force member, or by attending biennial meetings. • WDS • Mission: to ensure the long-term stewardship and provision of quality-assessed data and data services to the international science community and other stakeholders. • Subunit of ICSU but members are certified data repositories and services. No fee but there is a certification process. • Individuals participate as representatives from an org. member, as a working group member (jointly with RDA), or by attending biennial meetings (joint with CODATA?). • Open Knowledge Foundation • Mission: to promote open data and open content in all their forms. • Non-profit with volunteer participation. • Individuals sign up and participate in working groups, local groups, and task forces. They also attend myriad conferences and “festivals”.
  • 16. More international data organizations • International Geospatial Society and Global Spatial Data Infrastructure Association • Mission: to promote international cooperation and collaboration in support of local, national and international spatial data infrastructure developments that will allow nations to better address social, economic, and environmental issues of pressing importance. • IGS is for individuals, GSDIA is for Organizations with “at least a nation-wide influence.” • Individuals participate in committees, conferences, and trainings. • Open Geospatial Consortium • Mission: to serve as a global forum for the collaboration of developers and users of spatial data products and services, and to advance the development of international standards for geospatial interoperability. • Paid organizational membership for companies, government agencies, and universities. • Individuals participate as representatives of their organizations, largely in standards development. • DataCite • Mission: to promote and facilitate data citation. • Paid organizational membership for national library-type organizations. • Individuals participate as representatives of their organization or attend the annual conference.
  • 17. Some international data-related organizations • W3C • Mission: to lead the World Wide Web to its full potential by developing protocols and guidelines that ensure the long-term growth of the Web. • Paid organizational membership. Many companies, some universities, some agencies. • Individuals participate as representatives from an org. member, by participating in community groups and discussion fora. • IEEE • Mission: to foster technological innovation and excellence for the benefit of humanity. • Professional society and standards body. • Individual membership with many types of participation including local chapters addressing areas well beyond data • IFLA • Mission: to further accessibility, protection, and preservation of documentary cultural heritage and to promote and support libraries. • Paid membership for associations, institutes, and individuals. • Individuals participate in myriad specialty groups, sections, and programs and attend annual meetings.
  • 18. Research Data Alliance • An alliance of individuals, organizations, and associates. • Mission: “RDA builds the social and technical bridges that enable open sharing of data.” • A different sort of funding model: informally collaborating, hands-off agency support. • A different sort of operating model, recognizing the dynamics and tensions of developing infrastructure. • Free individual membership, inexpensive organizational membership, affiliation with like-minded organizations. • Grassroots-driven. • More tactical than strategic. • Global and regional but independent of nations.
  • 19. Data Citation Case Study • Initial efforts in the late 90s - early 00s • Right idea, little traction • Partially conflated with the citing URLs issue • A blossoming in the mid-late 00s. • Multiple disciplines start developing approaches and guidelines • DOI a big driver, esp. for DataCite, but other identifiers used too (including handles, LSIDs, UNFs, ARKs and good ol’ URI/Ls) • A slightly competitive atmosphere • Now in a consensus phase • CODATA/ICSTI/National Academy report • Force11 Manifesto • RDA harmonization effort— broadens and unites the community • Implementation phase just started • Happens locally • Requires culture change so debates will continue
  • 20. What should you do? • Join RDA and participate in Interest and Working Groups. • Watch for upcoming student internships and fellowships. • Become a Digital Curation Centre Associate and attend one of their conferences (publishing opportunity). • Attend a conference or two of some of the other organizations. • Join a professional society in your scientific discipline. • If you don’t have a scientific discipline, get one. Curation requires it. • Attend the society's meeting and help develop a data section or focus group (if they don’t have one already).
  • 21. Why you should do it and why it matters.
  • 22. "Data Deluge," Brett Ryder, The Economist, Feb. 2010
  • 23. Diverse snow crystal photos by Kenneth G. Libbrecht snowcrystals.com
  • 24. Distribution of NSF Awards by Dollar Value © 2009 The Board of Trustees, University of Illinois The long tail of science Heidorn 2008
  • 25. Surface-level diversity (race, age, gender) vs. Deep-level diversity (values, conceptual metaphors, personality)
  • 26. Ashby’s Law of Requisite Variety Only variety absorbs variety
  • 28. Or Grand Bazaar! photo by Frank Kovalchek (CC-BY)
  • 29. Metcalfe’s Law The value of a network increases as the square of the number of nodes.
  • 30. Map of the internet by the Opte Project [CC-BY] via Wikimedia Commons
  • 31. Networks or ecosystems often rely on “weak” links, so partner and build relationships
  • 32. Increasing Complexity of Mediation From: C. Borgman, 2008, NSF Cyberlearning Report
  • 33. Themes from A. Tsing on Collaboration Friction: An Ethnography of Global Connection • “Actually existing universalisms are hybrid, transient, and involved in constant reformulation through dialogue.” They work out through friction. • “There is no reason to think collaborators have common goals.” • Unity and diversity cover each other up. Need to remember the local.
  • 34. Where Good Ideas Come From, Steven Johnson • The Adjacent Possible: the importance of local • It’s often not “Eureka!” but rather a slow hunch fading into view over time. • Hunches need to collide with other hunches, so create that environment. Don’t protect IP; share it. Connecting vs. protecting • Sharing of failures as well • Create spaces for that to happen: virtual and real coffee shops • “Chance favors the connected mind.”
  • 35. Themes on Relationships (I’m an introvert) • The central challenge is diversity. • We address it through variety and myriad interfaces and connections. • Fostering relationships is central to community and data science. • they build social capital—success through giving • they uncover tacit knowledge • they inform methods
  • 36. Data Science Methods • User-driven design is not just end user. Engage providers and funders too. • Case studies not just use cases. • Ethnography—study relationships because data are often at the center of that interaction—a boundary object. • Agile is not just for software (courtesy Bruce Caron). • Individuals and interactions over processes and tools • Working volunteers over comprehensive documentation • Member collaboration over contract negotiation • Responding to change over following a plan.
  • 37. Summary • International (data) organizations grew out of the idealistic, deterministic blossoming of science. • They are virtually infinite in their scope and number. • They have many different forms and the best are highly adaptive and evolving (while retaining core principles). • Only diversity absorbs diversity. • Networking and interconnection are the way to solve complex problems. • We are in a more global and democratic world, but also a more local world. Coalition politics with new kinds of coalitions because there are new kinds of identity. • Data science and curation need to focus on relationships, connections, interfaces. • You must participate “glocally” to succeed.
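
A note on slide 29 (Metcalfe’s Law): one common way to motivate the "square of the number of nodes" claim is to count the distinct pairwise links that n nodes could form, which is n(n-1)/2 and so grows roughly as n squared. The sketch below is only an illustration of that arithmetic; the function name `pairwise_links` is made up for this example, and treating "value" as the count of possible links is an assumption, not something the slides state.

```python
def pairwise_links(n: int) -> int:
    """Return the number of distinct undirected links possible among n nodes."""
    if n < 0:
        raise ValueError("n must be non-negative")
    # Each of the n nodes can pair with n - 1 others; halve to avoid double counting.
    return n * (n - 1) // 2


if __name__ == "__main__":
    # Ten times the nodes gives roughly a hundred times the possible links.
    for nodes in (10, 100, 1000):
        print(nodes, pairwise_links(nodes))
```

Under that assumption, going from 10 to 100 participants raises the number of possible connections from 45 to 4950, which is the intuition behind the advice to partner and build relationships.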