SlideShare uma empresa Scribd logo
1 de 57
Ian Foster
Science Services and
Science Platforms
Using the cloud to accelerate
and democratize discovery
Talk at NorduGrid annual conference,
Košice, Slovakia, June 2nd, 2016
Thanks to co-authors and Globus team
Globus services (globus.org)
 Foster, I. Globus Online: Accelerating and democratizing science through
cloud-based services. IEEE Internet Computing(May/June):70-73, 2011.
 Chard, K., Tuecke, S. and Foster, I. Efficient and Secure Transfer,
Synchronization, and Sharing of Big Data. Cloud Computing, IEEE, 1(3):46-55,
2014.
 Chard, K., Foster, I. and Tuecke, S. Globus Platform-as-a-Service for
Collaborative Science Applications. Concurrency - Practice and Experience,
27(2):290-305, 2014.
Publication (globus.org/data-publication)
 Chard, K., Pruyne, J., Blaiszik, B., Ananthakrishnan, R., Tuecke, S. and Foster, I.,
Globus Data Publication as a Service: Lowering Barriers to Reproducible
Science. 11th IEEE International Conference on eScience Munich, Germany, 2015
Discovery engines
 Foster, I., Ananthakrishnan, R., Blaiszik, B., Chard, K., Osborn, R., Tuecke, S., Wilde,
M. and Wozniak, J. Networking materials data: Accelerating discovery at an
experimental facility. Big Data and High Performance Computing, 2015.
Thank you to our sponsors!
3
Civilization advances by extending
the number of important operations
which we can perform
without thinking about them
Alfred North Whitehead (1911)
Computation may someday be organized
as a public utility …
The computing utility could become the
basis for a new and important industry.
John
McCarthy
(1961)
The grid vision
Accelerate discovery and innovation by
providing on-demand access to computing
• “the average computing environment
remains inadequate for [many]
computationally sophisticated purposes”
• “if mechanisms are in place to allow
reliable, transparent, and instantaneous
access to high-end resources, then
it is as if those resources are
devoted to them”
[The Grid, Chapter 2, 1998]
Another pioneer: NorduGrid
Example grid scenarios
 “The application service providers, storage service
providers, cycle providers, and consultants
engaged by a car manufacturer to perform scenario
evaluation during planning for a new factory”
 “Members of an industrial consortium bidding on a
new aircraft”
 “A crisis management team and the databases and
simulation systems that they use to plan a
response to an emergency situation”
 “Members of a large, international, multiyear high-
energy physics collaboration”
From: The Anatomy of the Grid, 2001
Higgs discovery “only possible because
of the extraordinary achievements of
… grid computing”—Rolf Heuer,
CERN DG
10s of PB, 100s of institutions, 1000s of
scientists, 100Ks of CPUs, Bs of tasks
9
What has changed?
• Thousands of people learned about the joys of
large-scale distributed systems
• Virtual organization concepts and technologies
• Now routine to move 100s of terabytes (e.g.,
GridFTP moves >2 petabyte per day)
• High throughput computing is mainstream (e.g.,
Condor and Globus run millions of jobs per day)
• Large Hadron Collider found the Higgs
• Earth System Grid supports >25,000 users
• Commercial cloud computing has exploded
Looking forward
 Exploding data volumes and earlier successes
mean that many more people face challenges
of big data, big compute, big collaboration
 Networks are several orders of magnitude
faster than when Grid started
 Commercial cloud providers provide a
substrate on which powerful new capabilities
can be built with new economies of scale
Cloud has transformed how software
is developed and delivered
13
Infrastructure as a service: IaaS
Platform as a service: PaaS
Software as a service: SaaS
PaaS enables more rapid, cheap, and
scalable delivery of powerful apps—as SaaS
(web & mobile apps)
The right platform can do
the same for science
14
We can leverage cloud to provide solutions that
span the vast majority of researcher needs
σ
Big science
The right platform can do
the same for science
15
We can leverage cloud to provide solutions that
span the vast majority of researcher needs
σ
Big science
A science platform can spur a
discovery cloud ecosystem
16
Infrastructure as a service: IaaS
Platform as a service: PaaS
Software as a service: SaaS
(web and mobile apps)
In so doing, we can slash costs, improve quality, and
accelerate discovery across the sciences
A science platform can spur a
discovery cloud ecosystem
17
Infrastructure as a service: IaaS
Platform as a service: PaaS
Software as a service: SaaS
(web and mobile apps)
2010-
2014-
In so doing, we can slash costs, improve quality, and
accelerate discovery across the sciences
What can we automate and
outsource in science?
Run experiment
Collect data
Move data
Check data
Annotate data
Share data
Find similar data
Link to literature
Analyze data
Publish data
Automate
and
outsource
Discovery
Cloud
Data
challenges
Identity and authorization challenges
Data access: we have the highways
but not the delivery service
Our highways
encompass the Internet,
ultra-high-speed networks,
science DMZs, data
transfer nodes, high-speed
transport protocols
A good delivery service
automates, schedules,
accelerates, adapts.
It provides APIs for experts
and casual users.
Cuts costs and saves time.
Globus: Research data
management as a service
Essential research data
management services
 File transfer
 Data sharing
 Data publication
 Identity and groups
Builds on 15 years of research
Outsourced and automated
 High availability, reliability,
performance, scalability
 Convenient for
 Casual users: Web interfaces
 Power users: APIs
 Administrators: Install, manage
globus.org
Globus and the research data lifecycle
23
Researcher initiates
transfer request; or
requested automatically
by script, science
gateway
1
Instrument
Compute Facility
Globus transfers files
reliably, securely
2
Globus controls
access to shared
files on existing
storage; no need
to move files to
cloud storage!
4
Curator reviews and
approves; data set
published on campus
or other system
7
Researcher
selects files to
share, selects
user or group,
and sets access
permissions
3
Collaborator logs in to
Globus and accesses
shared files; no local
account required;
download via Globus
5
Researcher
assembles data set;
describes it using
metadata (Dublin
core and domain-
specific)
6
6
Peers, collaborators
search and discover
datasets; transfer and
share using Globus
8
Publication
Repository
Personal Computer
Transfer
Share
Publish
Discover
• SaaS  Only a web
browser required
• Use storage system
of your choice
• Access using your
campus credentials
Globus by the numbers
4
major services
13
national labs
use Globus
160 PB
transferred
10,000
active endpoints
25 billion
files processed
~400
active daily users
38,000
registered users
99.9%
uptime
35+
institutional
subscribers
1 PB
largest single
transfer to date
3 months
longest
continuously
managed transfer
130
federated
campus identities
Platforms can slash costs, simplify
access, increase
interoperability
For example, by
providing:
 Federated identity
system with fine-
grained authorization
 Data management
easily integrated
with application
workflows
Via RESTful APIs
Globus PaaS: Ecosystem
enabler
26
Auth & Groups
…
Globus Toolkit
GlobusAPIs
GlobusConnect
Data Publication & Discovery
File Sharing
File Transfer & Replication
Globus PaaS and Open
Science Grid
27
Simple web app server login
Jetstream cloud service
29
Serving a global community:
NCAR’s Research Data Archive
Serving a global community
 17+ PB virtual
processing
 45,000+ custom
orders, 4,000
users, 380 TB
served in 2014 Courtesy of Thomas Cram, NCAR (2014)
Fully automated
delivery using portal
developed w/ PaaS
PaaS enabled automated
workflow
User logs in with
NCAR or other
campus identity
Selected dataset
copied to staging
area (shared
endpoint)
User granted read permission for shared endpoint
User receives email with link to access files
ACLs deleted after five days
Desktop
Globus Cloud
Firewall
Science DMZ
Research data portal pattern
 Move portal storage into Science DMZ, with Globus endpoint
 Leave Portal Web server behind firewall
 Globus handles the security and data heavy lifting 34
Globus
Transfer
Service
Portal Web
Server (Client)
Globus Auth
Browser
User’s
Endpoint
(optional)
Portal
Endpoint
Other
Endpoints
HTTPS
GridFTP
REST Other
Services
Globus Web
Widgets
Desktop
Globus Cloud
Firewall
Science DMZ
Research data portal pattern
 Move portal storage into Science DMZ, with Globus endpoint
 Leave Portal Web server behind firewall
 Globus handles the security and data heavy lifting 35
Globus
Transfer
Service
Globus Auth
Browser
User’s
Endpoint
(optional)
Portal
Endpoint
Other
Endpoints
HTTPS
GridFTP
REST
Globus Web
Widgets
Portal Web
Server (Client)
Other
Services
https://github.com/globus/globus-sample-data-portal
Let us know if you’d
like to participate in
future workshops
Desktop
Globus Cloud
Firewall
Science DMZ
Research data portal pattern
 Move portal storage into Science DMZ, with Globus endpoint
 Leave Portal Web server behind firewall
 Globus handles the security and data heavy lifting 38
Globus
Transfer
Service
Portal Web
Server (Client)
Globus Auth
Browser
User’s
Endpoint
(optional)
Portal
Endpoint
Other
Endpoints
HTTPS
GridFTP
REST Other
Services
Globus Web
Widgets
Globus Auth
 Foundational identity and access management
(IAM) platform service
 Simplify creation and integration of advanced
apps and services
 Brokers authentication and authorization
interactions between:
 End-users
 Identity providers: XSEDE, InCommon, web apps
 Resource servers: services with REST APIs
 Clients: web, mobile, desktop, command line apps
 Resource servers acting as clients to other
resource servers
https://docs.globus.org/api/auth 39
Based on widely used web
standards
 OAuth 2.0 Authorization Framework
 aka OAuth2
 OpenID Connect Core 1.0
 aka OIDC
 Allows use of standard OAuth2 and OIDC
libraries
 E.g., Google OAuth Client Libraries (Java,
Python, etc.), Apache mod_auth_openidc
40
Globus Auth uses
 Login to web app
 “Log in with Globus”
 Mobile, desktop, command line apps coming
 Protect all REST API communications
 App  Globus service
 App  non-Globus service
 Service  service
41
Desktop
Globus Cloud
Firewall
Science DMZ
Research data portal pattern
 Move portal storage into Science DMZ, with Globus endpoint
 Leave Portal Web server behind firewall
 Globus handles the security and data heavy lifting 42
Globus
Transfer
Service
Portal Web
Server (Client)
Globus Auth
Browser
User’s
Endpoint
(optional)
Portal
Endpoint
Other
Endpoints
HTTPS
GridFTP
REST Other
Services
Globus Web
Widgets
Globus transfer API
Nearly all Globus Web App functionality
implemented via public Transfer API
https://docs.globus.org/api/transfer/
43
Globus Python SDK
Python client library for the Globus Auth and
Transfer REST APIs
http://globus.github.io/globus-sdk-python/
44
Jupyter (iPython) notebooks
https://github.com/globus/globus-jupyter-notebooks
45
Desktop
Globus Cloud
Firewall
Science DMZ
Research data portal pattern
 Move portal storage into Science DMZ, with Globus endpoint
 Leave Portal Web server behind firewall
 Globus handles the security and data heavy lifting 46
Globus
Transfer
Service
Portal Web
Server (Client)
Globus Auth
Browser
User’s
Endpoint
(optional)
Portal
Endpoint
Other
Endpoints
HTTPS
GridFTP
REST Other
Services
Globus Web
Widgets
Globus helper pages
Globus-provided web pages
designed for use
by your web apps
 Browse Endpoint
 Select Group
 Logout
https://docs.globus.org/api/helper-pages/
Globus helper pages
48
Branding
Can skin Globus Auth pages
Header
Text
Default IDP
49
Desktop
Globus Cloud
Firewall
Science DMZ
Research data portal pattern
 Move portal storage into Science DMZ, with Globus endpoint
 Leave Portal Web server behind firewall
 Globus handles the security and data heavy lifting 50
Globus
Transfer
Service
Portal Web
Server (Client)
Globus Auth
Browser
User’s
Endpoint
(optional)
Portal
Endpoint
Other
Endpoints
HTTPS
GridFTP
REST Other
Services
Globus Web
Widgets
Globus Connect HTTPS
 The future of research CI is … the web
 Globus Connect HTTPS unlocks all research
storage to the web
 Globus Auth provides security glue using
standard web security
 GridFTP doesn’t go away – async, bulk data
transfer is important, but its not the end-all,
be-all
Desktop
Globus Cloud
Firewall
Science DMZ
Research data portal pattern
 Move portal storage into Science DMZ, with Globus endpoint
 Leave Portal Web server behind firewall
 Globus handles the security and data heavy lifting 52
Globus
Transfer
Service
Portal Web
Server (Client)
Globus Auth
Browser
User’s
Endpoint
(optional)
Portal
Endpoint
Other
Endpoints
HTTPS
GridFTP
REST Other
Services
Globus Web
Widgets
Why create your own services?
 Front-end / back-end within your portal
 Remote backend for portal
 Backend for pure Javascript browser apps
 Extend your portal with a public REST API,
so that other app and service developers can
integrate with and extend your portal
53
Why Globus Auth for your
service?
 Outsource all identity management, authentication
 Federated identity with InCommon, Google, etc.
 Outsource your REST API security
 Consent, token issuance, validation, revocation
 You provide service-specific authorization
 Apps use your service like all others
 Its standard OAuth2 and OIDC
 Your service can seamlessly leverage other services
 Other services can leverage your service
Add your service to the international science platform
54
A science platform can spur a
discovery cloud ecosystem
55
Infrastructure as a service: IaaS
Platform as a service: PaaS
Software as a service: SaaS
(web and mobile apps)
2010-
2014-
In so doing, we can slash costs, improve quality, and
accelerate discovery across the sciences
The Discovery Platform
Research
Service
Research
Service
Research
Service Research
Service
Research
Service
Commercial IaaS/PaaS
Core Services
Authentication
Groups
Attributes
Data Services
Transfer
Sharing
Publication
Commercial
PaaS
Discovery
Exchange
Enabling the Discovery Cloud
Research facilitiesResearch facilities
Together we can create
an integrated ecosystem
of services and applications
for the research and education
community
Thank you!
foster@uchicago.edu
@ianfoster
57

Mais conteúdo relacionado

Mais procurados

Globus publication demo screenshots
Globus publication demo screenshotsGlobus publication demo screenshots
Globus publication demo screenshotsIan Foster
 
re:Invent 2013-foster-madduri
re:Invent 2013-foster-maddurire:Invent 2013-foster-madduri
re:Invent 2013-foster-madduriRavi Madduri
 
20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS WebinarBen Blaiszik
 
WoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific DataWoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific DataUniversity of Chicago
 
Cloud com foster december 2010
Cloud com foster december 2010Cloud com foster december 2010
Cloud com foster december 2010Ian Foster
 
Cloud Accelerated Genomics
Cloud Accelerated GenomicsCloud Accelerated Genomics
Cloud Accelerated GenomicsIdan Tohami
 
Computing Outside The Box June 2009
Computing Outside The Box June 2009Computing Outside The Box June 2009
Computing Outside The Box June 2009Ian Foster
 
Managing data in computational edge clouds
Managing data in computational edge cloudsManaging data in computational edge clouds
Managing data in computational edge cloudsNitinder Mohan
 
Benchmarking Cloud-based Tagging Services
Benchmarking Cloud-based Tagging ServicesBenchmarking Cloud-based Tagging Services
Benchmarking Cloud-based Tagging ServicesTanu Malik
 
Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...
Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...
Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...Amazon Web Services
 
GlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobus
 
Sgg crest-presentation-final
Sgg crest-presentation-finalSgg crest-presentation-final
Sgg crest-presentation-finalmarpierc
 
Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22marpierc
 
Globus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS AnalysisGlobus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS AnalysisRavi Madduri
 
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...Databricks
 
Toward a National Research Platform
Toward a National Research PlatformToward a National Research Platform
Toward a National Research PlatformLarry Smarr
 
Object Tagged RBAC Model for Hadoop Ecosystem
Object Tagged RBAC Model for Hadoop EcosystemObject Tagged RBAC Model for Hadoop Ecosystem
Object Tagged RBAC Model for Hadoop EcosystemMaanak Gupta, Ph.D.
 
Genomic Scale Big Data Pipelines
Genomic Scale Big Data PipelinesGenomic Scale Big Data Pipelines
Genomic Scale Big Data PipelinesLynn Langit
 
Assessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsAssessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsPeter van Heusden
 

Mais procurados (20)

Data automation 101
Data automation 101Data automation 101
Data automation 101
 
Globus publication demo screenshots
Globus publication demo screenshotsGlobus publication demo screenshots
Globus publication demo screenshots
 
re:Invent 2013-foster-madduri
re:Invent 2013-foster-maddurire:Invent 2013-foster-madduri
re:Invent 2013-foster-madduri
 
20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar20160922 Materials Data Facility TMS Webinar
20160922 Materials Data Facility TMS Webinar
 
WoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific DataWoSC19: Serverless Workflows for Indexing Large Scientific Data
WoSC19: Serverless Workflows for Indexing Large Scientific Data
 
Cloud com foster december 2010
Cloud com foster december 2010Cloud com foster december 2010
Cloud com foster december 2010
 
Cloud Accelerated Genomics
Cloud Accelerated GenomicsCloud Accelerated Genomics
Cloud Accelerated Genomics
 
Computing Outside The Box June 2009
Computing Outside The Box June 2009Computing Outside The Box June 2009
Computing Outside The Box June 2009
 
Managing data in computational edge clouds
Managing data in computational edge cloudsManaging data in computational edge clouds
Managing data in computational edge clouds
 
Benchmarking Cloud-based Tagging Services
Benchmarking Cloud-based Tagging ServicesBenchmarking Cloud-based Tagging Services
Benchmarking Cloud-based Tagging Services
 
Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...
Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...
Globus Genomics: How Science-as-a-Service is Accelerating Discovery (BDT310) ...
 
GlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening KeynoteGlobusWorld 2019 Opening Keynote
GlobusWorld 2019 Opening Keynote
 
Sgg crest-presentation-final
Sgg crest-presentation-finalSgg crest-presentation-final
Sgg crest-presentation-final
 
Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22Cyberinfrastructure and Applications Overview: Howard University June22
Cyberinfrastructure and Applications Overview: Howard University June22
 
Globus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS AnalysisGlobus Genomics: Democratizing NGS Analysis
Globus Genomics: Democratizing NGS Analysis
 
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
 
Toward a National Research Platform
Toward a National Research PlatformToward a National Research Platform
Toward a National Research Platform
 
Object Tagged RBAC Model for Hadoop Ecosystem
Object Tagged RBAC Model for Hadoop EcosystemObject Tagged RBAC Model for Hadoop Ecosystem
Object Tagged RBAC Model for Hadoop Ecosystem
 
Genomic Scale Big Data Pipelines
Genomic Scale Big Data PipelinesGenomic Scale Big Data Pipelines
Genomic Scale Big Data Pipelines
 
Assessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformaticsAssessing Galaxy's ability to express scientific workflows in bioinformatics
Assessing Galaxy's ability to express scientific workflows in bioinformatics
 

Semelhante a Science Services and Science Platforms: Using the Cloud to Accelerate and Democratize Discovery

A Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryA Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryIan Foster
 
Simplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformSimplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformGlobus
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New UsersGlobus
 
Webinar: Q&A on Globus Subscription Features
Webinar: Q&A on Globus Subscription FeaturesWebinar: Q&A on Globus Subscription Features
Webinar: Q&A on Globus Subscription FeaturesGlobus
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchRobert Grossman
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer OverlordsIan Foster
 
Introduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 TutorialIntroduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 TutorialGlobus
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New UsersGlobus
 
Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Globus
 
GlobusWorld 2020 Keynote
GlobusWorld 2020 KeynoteGlobusWorld 2020 Keynote
GlobusWorld 2020 KeynoteGlobus
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Blue BRIDGE
 
grid mining
grid mininggrid mining
grid miningARNOLD
 
Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17Mary Bass
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSEd Dodds
 
Fundamental question and answer in cloud computing quiz by animesh chaturvedi
Fundamental question and answer in cloud computing quiz by animesh chaturvediFundamental question and answer in cloud computing quiz by animesh chaturvedi
Fundamental question and answer in cloud computing quiz by animesh chaturvediAnimesh Chaturvedi
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011Ian Foster
 

Semelhante a Science Services and Science Platforms: Using the Cloud to Accelerate and Democratize Discovery (20)

A Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryA Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
 
Simplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformSimplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus Platform
 
Grid computing
Grid computingGrid computing
Grid computing
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New Users
 
Webinar: Q&A on Globus Subscription Features
Webinar: Q&A on Globus Subscription FeaturesWebinar: Q&A on Globus Subscription Features
Webinar: Q&A on Globus Subscription Features
 
Using the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science ResearchUsing the Open Science Data Cloud for Data Science Research
Using the Open Science Data Cloud for Data Science Research
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
 
Introduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 TutorialIntroduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 Tutorial
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New Users
 
Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)
 
GlobusWorld 2020 Keynote
GlobusWorld 2020 KeynoteGlobusWorld 2020 Keynote
GlobusWorld 2020 Keynote
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Grid1
Grid1Grid1
Grid1
 
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
Using e-infrastructures for biodiversity conservation - Gianpaolo Coro (CNR)
 
grid mining
grid mininggrid mining
grid mining
 
Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
 
Fundamental question and answer in cloud computing quiz by animesh chaturvedi
Fundamental question and answer in cloud computing quiz by animesh chaturvediFundamental question and answer in cloud computing quiz by animesh chaturvedi
Fundamental question and answer in cloud computing quiz by animesh chaturvedi
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011
 

Mais de Ian Foster

Global Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptxGlobal Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptxIan Foster
 
The Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, EvolutionThe Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, EvolutionIan Foster
 
Better Information Faster: Programming the Continuum
Better Information Faster: Programming the ContinuumBetter Information Faster: Programming the Continuum
Better Information Faster: Programming the ContinuumIan Foster
 
ESnet6 and Smart Instruments
ESnet6 and Smart InstrumentsESnet6 and Smart Instruments
ESnet6 and Smart InstrumentsIan Foster
 
Linking Scientific Instruments and Computation
Linking Scientific Instruments and ComputationLinking Scientific Instruments and Computation
Linking Scientific Instruments and ComputationIan Foster
 
Foster CRA March 2022.pptx
Foster CRA March 2022.pptxFoster CRA March 2022.pptx
Foster CRA March 2022.pptxIan Foster
 
Big Data, Big Computing, AI, and Environmental Science
Big Data, Big Computing, AI, and Environmental ScienceBig Data, Big Computing, AI, and Environmental Science
Big Data, Big Computing, AI, and Environmental ScienceIan Foster
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryIan Foster
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the ContinuumIan Foster
 
Data Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationData Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationIan Foster
 
Research Automation for Data-Driven Discovery
Research Automation for Data-Driven DiscoveryResearch Automation for Data-Driven Discovery
Research Automation for Data-Driven DiscoveryIan Foster
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterIan Foster
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for ScienceIan Foster
 
Team Argon Summary
Team Argon SummaryTeam Argon Summary
Team Argon SummaryIan Foster
 
Thoughts on interoperability
Thoughts on interoperabilityThoughts on interoperability
Thoughts on interoperabilityIan Foster
 
NIH Data Commons Architecture Ideas
NIH Data Commons Architecture IdeasNIH Data Commons Architecture Ideas
NIH Data Commons Architecture IdeasIan Foster
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFIan Foster
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
Software Infrastructure for a National Research Platform
Software Infrastructure for a National Research PlatformSoftware Infrastructure for a National Research Platform
Software Infrastructure for a National Research PlatformIan Foster
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Ian Foster
 

Mais de Ian Foster (20)

Global Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptxGlobal Services for Global Science March 2023.pptx
Global Services for Global Science March 2023.pptx
 
The Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, EvolutionThe Earth System Grid Federation: Origins, Current State, Evolution
The Earth System Grid Federation: Origins, Current State, Evolution
 
Better Information Faster: Programming the Continuum
Better Information Faster: Programming the ContinuumBetter Information Faster: Programming the Continuum
Better Information Faster: Programming the Continuum
 
ESnet6 and Smart Instruments
ESnet6 and Smart InstrumentsESnet6 and Smart Instruments
ESnet6 and Smart Instruments
 
Linking Scientific Instruments and Computation
Linking Scientific Instruments and ComputationLinking Scientific Instruments and Computation
Linking Scientific Instruments and Computation
 
Foster CRA March 2022.pptx
Foster CRA March 2022.pptxFoster CRA March 2022.pptx
Foster CRA March 2022.pptx
 
Big Data, Big Computing, AI, and Environmental Science
Big Data, Big Computing, AI, and Environmental ScienceBig Data, Big Computing, AI, and Environmental Science
Big Data, Big Computing, AI, and Environmental Science
 
AI at Scale for Materials and Chemistry
AI at Scale for Materials and ChemistryAI at Scale for Materials and Chemistry
AI at Scale for Materials and Chemistry
 
Coding the Continuum
Coding the ContinuumCoding the Continuum
Coding the Continuum
 
Data Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud AutomationData Tribology: Overcoming Data Friction with Cloud Automation
Data Tribology: Overcoming Data Friction with Cloud Automation
 
Research Automation for Data-Driven Discovery
Research Automation for Data-Driven DiscoveryResearch Automation for Data-Driven Discovery
Research Automation for Data-Driven Discovery
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and Jupyter
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
Team Argon Summary
Team Argon SummaryTeam Argon Summary
Team Argon Summary
 
Thoughts on interoperability
Thoughts on interoperabilityThoughts on interoperability
Thoughts on interoperability
 
NIH Data Commons Architecture Ideas
NIH Data Commons Architecture IdeasNIH Data Commons Architecture Ideas
NIH Data Commons Architecture Ideas
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCF
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
Software Infrastructure for a National Research Platform
Software Infrastructure for a National Research PlatformSoftware Infrastructure for a National Research Platform
Software Infrastructure for a National Research Platform
 
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
Accelerating the Experimental Feedback Loop: Data Streams and the Advanced Ph...
 

Último

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Último (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Science Services and Science Platforms: Using the Cloud to Accelerate and Democratize Discovery

  • 1. Ian Foster Science Services and Science Platforms Using the cloud to accelerate and democratize discovery Talk at NorduGrid annual conference, Košice, Slovakia, June 2nd, 2016
  • 2. Thanks to co-authors and Globus team Globus services (globus.org)  Foster, I. Globus Online: Accelerating and democratizing science through cloud-based services. IEEE Internet Computing(May/June):70-73, 2011.  Chard, K., Tuecke, S. and Foster, I. Efficient and Secure Transfer, Synchronization, and Sharing of Big Data. Cloud Computing, IEEE, 1(3):46-55, 2014.  Chard, K., Foster, I. and Tuecke, S. Globus Platform-as-a-Service for Collaborative Science Applications. Concurrency - Practice and Experience, 27(2):290-305, 2014. Publication (globus.org/data-publication)  Chard, K., Pruyne, J., Blaiszik, B., Ananthakrishnan, R., Tuecke, S. and Foster, I., Globus Data Publication as a Service: Lowering Barriers to Reproducible Science. 11th IEEE International Conference on eScience Munich, Germany, 2015 Discovery engines  Foster, I., Ananthakrishnan, R., Blaiszik, B., Chard, K., Osborn, R., Tuecke, S., Wilde, M. and Wozniak, J. Networking materials data: Accelerating discovery at an experimental facility. Big Data and High Performance Computing, 2015.
  • 3. Thank you to our sponsors! 3
  • 4. Civilization advances by extending the number of important operations which we can perform without thinking about them Alfred North Whitehead (1911)
  • 5. Computation may someday be organized as a public utility … The computing utility could become the basis for a new and important industry. John McCarthy (1961)
  • 6. The grid vision Accelerate discovery and innovation by providing on-demand access to computing • “the average computing environment remains inadequate for [many] computationally sophisticated purposes” • “if mechanisms are in place to allow reliable, transparent, and instantaneous access to high-end resources, then it is as if those resources are devoted to them” [The Grid, Chapter 2, 1998]
  • 8. Example grid scenarios  “The application service providers, storage service providers, cycle providers, and consultants engaged by a car manufacturer to perform scenario evaluation during planning for a new factory”  “Members of an industrial consortium bidding on a new aircraft”  “A crisis management team and the databases and simulation systems that they use to plan a response to an emergency situation”  “Members of a large, international, multiyear high- energy physics collaboration” From: The Anatomy of the Grid, 2001
  • 9. Higgs discovery “only possible because of the extraordinary achievements of … grid computing”—Rolf Heuer, CERN DG 10s of PB, 100s of institutions, 1000s of scientists, 100Ks of CPUs, Bs of tasks 9
  • 10. What has changed? • Thousands of people learned about the joys of large-scale distributed systems • Virtual organization concepts and technologies • Now routine to move 100s of terabytes (e.g., GridFTP moves >2 petabyte per day) • High throughput computing is mainstream (e.g., Condor and Globus run millions of jobs per day) • Large Hadron Collider found the Higgs • Earth System Grid supports >25,000 users • Commercial cloud computing has exploded
  • 11. Looking forward  Exploding data volumes and earlier successes mean that many more people face challenges of big data, big compute, big collaboration  Networks are several orders of magnitude faster than when Grid started  Commercial cloud providers provide a substrate on which powerful new capabilities can be built with new economies of scale
  • 12.
  • 13. Cloud has transformed how software is developed and delivered 13 Infrastructure as a service: IaaS Platform as a service: PaaS Software as a service: SaaS PaaS enables more rapid, cheap, and scalable delivery of powerful apps—as SaaS (web & mobile apps)
  • 14. The right platform can do the same for science 14 We can leverage cloud to provide solutions that span the vast majority of researcher needs σ Big science
  • 15. The right platform can do the same for science 15 We can leverage cloud to provide solutions that span the vast majority of researcher needs σ Big science
  • 16. A science platform can spur a discovery cloud ecosystem 16 Infrastructure as a service: IaaS Platform as a service: PaaS Software as a service: SaaS (web and mobile apps) In so doing, we can slash costs, improve quality, and accelerate discovery across the sciences
  • 17. A science platform can spur a discovery cloud ecosystem 17 Infrastructure as a service: IaaS Platform as a service: PaaS Software as a service: SaaS (web and mobile apps) 2010- 2014- In so doing, we can slash costs, improve quality, and accelerate discovery across the sciences
  • 18. What can we automate and outsource in science? Run experiment Collect data Move data Check data Annotate data Share data Find similar data Link to literature Analyze data Publish data Automate and outsource Discovery Cloud
  • 21. Data access: we have the highways but not the delivery service Our highways encompass the Internet, ultra-high-speed networks, science DMZs, data transfer nodes, high-speed transport protocols A good delivery service automates, schedules, accelerates, adapts. It provides APIs for experts and casual users. Cuts costs and saves time.
  • 22. Globus: Research data management as a service Essential research data management services  File transfer  Data sharing  Data publication  Identity and groups Builds on 15 years of research Outsourced and automated  High availability, reliability, performance, scalability  Convenient for  Casual users: Web interfaces  Power users: APIs  Administrators: Install, manage globus.org
  • 23. Globus and the research data lifecycle 23 Researcher initiates transfer request; or requested automatically by script, science gateway 1 Instrument Compute Facility Globus transfers files reliably, securely 2 Globus controls access to shared files on existing storage; no need to move files to cloud storage! 4 Curator reviews and approves; data set published on campus or other system 7 Researcher selects files to share, selects user or group, and sets access permissions 3 Collaborator logs in to Globus and accesses shared files; no local account required; download via Globus 5 Researcher assembles data set; describes it using metadata (Dublin core and domain- specific) 6 6 Peers, collaborators search and discover datasets; transfer and share using Globus 8 Publication Repository Personal Computer Transfer Share Publish Discover • SaaS  Only a web browser required • Use storage system of your choice • Access using your campus credentials
  • 24. Globus by the numbers 4 major services 13 national labs use Globus 160 PB transferred 10,000 active endpoints 25 billion files processed ~400 active daily users 38,000 registered users 99.9% uptime 35+ institutional subscribers 1 PB largest single transfer to date 3 months longest continuously managed transfer 130 federated campus identities
  • 25. Platforms can slash costs, simplify access, increase interoperability For example, by providing:  Federated identity system with fine- grained authorization  Data management easily integrated with application workflows Via RESTful APIs
  • 26. Globus PaaS: Ecosystem enabler 26 Auth & Groups … Globus Toolkit GlobusAPIs GlobusConnect Data Publication & Discovery File Sharing File Transfer & Replication
  • 27. Globus PaaS and Open Science Grid 27
  • 28. Simple web app server login
  • 30. Serving a global community: NCAR’s Research Data Archive
  • 31. Serving a global community  17+ PB virtual processing  45,000+ custom orders, 4,000 users, 380 TB served in 2014 Courtesy of Thomas Cram, NCAR (2014) Fully automated delivery using portal developed w/ PaaS
  • 32. PaaS enabled automated workflow User logs in with NCAR or other campus identity Selected dataset copied to staging area (shared endpoint) User granted read permission for shared endpoint User receives email with link to access files ACLs deleted after five days
  • 33.
  • 34. Desktop Globus Cloud Firewall Science DMZ Research data portal pattern  Move portal storage into Science DMZ, with Globus endpoint  Leave Portal Web server behind firewall  Globus handles the security and data heavy lifting 34 Globus Transfer Service Portal Web Server (Client) Globus Auth Browser User’s Endpoint (optional) Portal Endpoint Other Endpoints HTTPS GridFTP REST Other Services Globus Web Widgets
  • 35. Desktop Globus Cloud Firewall Science DMZ Research data portal pattern  Move portal storage into Science DMZ, with Globus endpoint  Leave Portal Web server behind firewall  Globus handles the security and data heavy lifting 35 Globus Transfer Service Globus Auth Browser User’s Endpoint (optional) Portal Endpoint Other Endpoints HTTPS GridFTP REST Globus Web Widgets Portal Web Server (Client) Other Services
  • 37. Let us know if you’d like to participate in future workshops
  • 38. Desktop Globus Cloud Firewall Science DMZ Research data portal pattern  Move portal storage into Science DMZ, with Globus endpoint  Leave Portal Web server behind firewall  Globus handles the security and data heavy lifting 38 Globus Transfer Service Portal Web Server (Client) Globus Auth Browser User’s Endpoint (optional) Portal Endpoint Other Endpoints HTTPS GridFTP REST Other Services Globus Web Widgets
  • 39. Globus Auth  Foundational identity and access management (IAM) platform service  Simplify creation and integration of advanced apps and services  Brokers authentication and authorization interactions between:  End-users  Identity providers: XSEDE, InCommon, web apps  Resource servers: services with REST APIs  Clients: web, mobile, desktop, command line apps  Resource servers acting as clients to other resource servers https://docs.globus.org/api/auth 39
  • 40. Based on widely used web standards  OAuth 2.0 Authorization Framework  aka OAuth2  OpenID Connect Core 1.0  aka OIDC  Allows use of standard OAuth2 and OIDC libraries  E.g., Google OAuth Client Libraries (Java, Python, etc.), Apache mod_auth_openidc 40
  • 41. Globus Auth uses  Login to web app  “Log in with Globus”  Mobile, desktop, command line apps coming  Protect all REST API communications  App  Globus service  App  non-Globus service  Service  service 41
  • 42. Desktop Globus Cloud Firewall Science DMZ Research data portal pattern  Move portal storage into Science DMZ, with Globus endpoint  Leave Portal Web server behind firewall  Globus handles the security and data heavy lifting 42 Globus Transfer Service Portal Web Server (Client) Globus Auth Browser User’s Endpoint (optional) Portal Endpoint Other Endpoints HTTPS GridFTP REST Other Services Globus Web Widgets
  • 43. Globus transfer API Nearly all Globus Web App functionality implemented via public Transfer API https://docs.globus.org/api/transfer/ 43
  • 44. Globus Python SDK Python client library for the Globus Auth and Transfer REST APIs http://globus.github.io/globus-sdk-python/ 44
  • 46. Desktop Globus Cloud Firewall Science DMZ Research data portal pattern  Move portal storage into Science DMZ, with Globus endpoint  Leave Portal Web server behind firewall  Globus handles the security and data heavy lifting 46 Globus Transfer Service Portal Web Server (Client) Globus Auth Browser User’s Endpoint (optional) Portal Endpoint Other Endpoints HTTPS GridFTP REST Other Services Globus Web Widgets
  • 47. Globus helper pages Globus-provided web pages designed for use by your web apps  Browse Endpoint  Select Group  Logout https://docs.globus.org/api/helper-pages/
  • 49. Branding Can skin Globus Auth pages Header Text Default IDP 49
  • 50. Desktop Globus Cloud Firewall Science DMZ Research data portal pattern  Move portal storage into Science DMZ, with Globus endpoint  Leave Portal Web server behind firewall  Globus handles the security and data heavy lifting 50 Globus Transfer Service Portal Web Server (Client) Globus Auth Browser User’s Endpoint (optional) Portal Endpoint Other Endpoints HTTPS GridFTP REST Other Services Globus Web Widgets
  • 51. Globus Connect HTTPS  The future of research CI is … the web  Globus Connect HTTPS unlocks all research storage to the web  Globus Auth provides security glue using standard web security  GridFTP doesn’t go away – async, bulk data transfer is important, but its not the end-all, be-all
  • 52. Desktop Globus Cloud Firewall Science DMZ Research data portal pattern  Move portal storage into Science DMZ, with Globus endpoint  Leave Portal Web server behind firewall  Globus handles the security and data heavy lifting 52 Globus Transfer Service Portal Web Server (Client) Globus Auth Browser User’s Endpoint (optional) Portal Endpoint Other Endpoints HTTPS GridFTP REST Other Services Globus Web Widgets
  • 53. Why create your own services?  Front-end / back-end within your portal  Remote backend for portal  Backend for pure Javascript browser apps  Extend your portal with a public REST API, so that other app and service developers can integrate with and extend your portal 53
  • 54. Why Globus Auth for your service?  Outsource all identity management, authentication  Federated identity with InCommon, Google, etc.  Outsource your REST API security  Consent, token issuance, validation, revocation  You provide service-specific authorization  Apps use your service like all others  Its standard OAuth2 and OIDC  Your service can seamlessly leverage other services  Other services can leverage your service Add your service to the international science platform 54
  • 55. A science platform can spur a discovery cloud ecosystem 55 Infrastructure as a service: IaaS Platform as a service: PaaS Software as a service: SaaS (web and mobile apps) 2010- 2014- In so doing, we can slash costs, improve quality, and accelerate discovery across the sciences
  • 56. The Discovery Platform Research Service Research Service Research Service Research Service Research Service Commercial IaaS/PaaS Core Services Authentication Groups Attributes Data Services Transfer Sharing Publication Commercial PaaS Discovery Exchange Enabling the Discovery Cloud Research facilitiesResearch facilities
  • 57. Together we can create an integrated ecosystem of services and applications for the research and education community Thank you! foster@uchicago.edu @ianfoster 57

Notas do Editor

  1. One useful thing One exciting initiative Science Services and Science Platforms: Using the Cloud to Democratize Discovery
  2. Talk about the Globus as being part of UChicago + ANL, as well as other context setting about how this work came about and is funded
  3. Need for entirely new instruments, computing infrastructure, organizational structures 173 TB/day
  4. Amazon VPC Microsoft one Add consumers Consumers, SMBs, large enterprises
  5. From the UCI cyberinfrastructure plan
  6. Amazon VPC Microsoft one Add consumers Consumers, SMBs, large enterprises
  7. Consumers, SMBs, large enterprises
  8. Highlight XSEDE’s planned adoption of user, group and profile management
  9. kBase: Outsource identity and group management
  10. Consumers, SMBs, large enterprises