SlideShare a Scribd company logo
1 of 43
“The Pacific Research Platform:
a Science-Driven Big-Data Freeway System.”
NCSA Colloquium
National Center for Supercomputing Applications
University of Illinois, Urbana-Champaign
September 18, 2015
Dr. Larry Smarr
Director, California Institute for Telecommunications and Information Technology
Harry E. Gruber Professor,
Dept. of Computer Science and Engineering
Jacobs School of Engineering, UCSD
http://lsmarr.calit2.net
1
Abstract
Research in data-intensive fields is increasingly multi-investigator and multi-institutional, depending on ever more rapid
access to ultra-large heterogeneous and widely distributed datasets. The Pacific Research Platform (PRP) is a multi-
institutional extensible deployment that establishes a science-driven high-capacity data-centric “freeway system.” The
PRP spans all 10 campuses of the University of California, as well as the major California private research universities,
four supercomputer centers, and several universities outside California. Fifteen multi-campus data-intensive application
teams act as drivers of the PRP, providing feedback over the five years to the technical design staff. These application
areas include particle physics, astronomy/astrophysics, earth sciences, biomedicine, and scalable multimedia, providing
models for many other applications. The PRP partnership extends the NSF-funded campus Science DMZs to a regional
model that allows high-speed data-intensive networking, facilitating researchers moving data between their labs and
their collaborators’ sites, supercomputer centers or data repositories, and enabling that data to traverse multiple
heterogeneous networks without performance degradation over campus, regional, national, and international distances
Vision: Creating a West Coast “Big Data Freeway”
Connected by CENIC/Pacific Wave to Internet2 & GLIF
Use Lightpaths to Connect
All Data Generators and Consumers,
Creating a “Big Data” Plane
Integrated With High Performance Global Networks
“The Bisection Bandwidth of a Cluster Interconnect,
but Deployed on a 10-Campus Scale.”
This Vision Has Been Building for Over Two Decades
NCSA Telnet--“Hide the Cray”
Paradigm That We Still Use Today
• NCSA Telnet -- Interactive Access
– From Macintosh or PC Computer
– To Telnet Hosts on TCP/IP Networks
• Allows for Simultaneous
Connections
– To Numerous Computers on The Net
– Standard File Transfer Server (FTP)
– Lets You Transfer Files to and from
Remote Machines and Other Users
John Kogut Simulating
Quantum Chromodynamics
He Uses a Mac—The Mac Uses the Cray
Source: Larry Smarr 1985
Data
Generator
Data
Portal
Data
Transmission
Interactive Supercomputing Collaboratory Prototype:
Using Analog Communications to Prototype the Fiber Optic Future
“We’re using satellite technology…
to demo what It might be like to have
high-speed fiber-optic links between
advanced computers
in two different geographic locations.”
― Al Gore, Senator
Chair, US Senate Subcommittee on Science, Technology and Space
Illinois
Boston
SIGGRAPH 1989
“What we really have to do is eliminate distance between
individuals who want to interact with other people and
with other computers.”
― Larry Smarr, Director, NCSA
I-WAY: Information Wide Area Year
Supercomputing ‘95
• The First National 155 Mbps Research Network
– 65 Science Projects
– Into the San Diego Convention Center
• I-Way Featured:
– Networked Visualization Application Demonstrations
– Large-Scale Immersive Displays
– I-Soft Programming Environment
UIC
http://archive.ncsa.uiuc.edu/General/Training/SC95/GII.HPCC.html
CitySpace
Cellular Semiotics
PACI is Prototyping America’s 21st Century
Information Infrastructure
The PACI Grid Testbed
National Computational Science
1997
Chesapeake Bay Simulation Collaboratory:
vBNS Linked CAVE, ImmersaDesk, Power Wall, and Workstation
Alliance Project: Collaborative Video Production
via Tele-Immersion and Virtual Director
UIC
Donna Cox, Robert Patterson, Stuart Levy, NCSA Virtual Director Team
Glenn Wheless, Old Dominion Univ.
Alliance Application Technologies
Environmental Hydrology Team
4 MPixel PowerWall
Alliance 1997
UIC
ANL
NCSA/UIUC
UC
NU
MREN
IIT
True Grid Project
Started March 1999
State Commits
$7.5M over 4 years
Illinois is Positioned to Seize National Optical Networking Leadership
with I-WIRE Infrastructure Investment
• State-Funded Infrastructure
–Application Driven
–High Definition Streaming Media
–Telepresence and Media
–Computational Grids
–Cloud Computing
–Data Grids
–Search & Information Analysis
–EmergingTech Proving Ground
–Optical Switching
–Dense Wave Division Multiplexing
–Advanced Middleware Infrastructure
–Wireless Extensions
Source: Charlie Catlett, ANL1999 LS Slide
Two New Calit2 Buildings Provide
New Laboratories for “Living in the Future”
• “Convergence” Laboratory Facilities
– Nanotech, BioMEMS, Chips, Radio, Photonics
– Virtual Reality, Digital Cinema, HDTV, Gaming
• Over 1000 Researchers in Two Buildings
– Linked via Dedicated Optical Networks
UC Irvine
www.calit2.net
Preparing for a World in Which
Distance is Eliminated…
Linking the Calit2 Auditoriums at UCSD and UCI
With HD Streams
September 8, 2009
Photo by Erik Jepsen, UC San Diego
Sept. 8, 2009
NSF’s OptIPuter Project: Using Supernetworks
to Meet the Needs of Data-Intensive Researchers
OptIPortal–
Termination
Device
for the
OptIPuter
Global
Backplane
Calit2 (UCSD, UCI), SDSC, and UIC Leads—Larry Smarr PI
Univ. Partners: NCSA, USC, SDSU, NW, TA&M, UvA, SARA, KISTI, AIST
Industry: IBM, Sun, Telcordia, Chiaro, Calient, Glimmerglass, Lucent
2003-2009
$13,500,000
In August 2003,
Jason Leigh and his
students used
RBUDP to blast data
from NCSA to SDSC
over the
TeraGrid DTFnet,
achieving18Gbps file
transfer out of the
available 20Gbps
High Resolution Uncompressed HD Streams
Require Multi-Gigabit/s Lambdas
U. Washington
JGN II Workshop
Osaka, Japan
Jan 2005
Prof.Osaka Prof. Aoyama
Prof. Smarr
Source: U Washington Research Channel
Telepresence Using Uncompressed 1.5 Gbps
HDTV Streaming Over IP on Fiber Optics--
75x Home Cable “HDTV” Bandwidth!
“I can see every hair on your head!”—Prof. Aoyama
First Trans-Pacific Super High Definition Telepresence Meeting
Using Digital Cinema 4k Streams
Keio University
President Anzai
UCSD
Chancellor Fox
Lays
Technical
Basis for
Global
Digital
Cinema
Sony
NTT
SGI
Streaming 4k
with JPEG 2000
Compression
½ gigabit/sec
100 Times
the Resolution
of YouTube!
4k = 4000x2000 Pixels = 4xHD
21 Countries Driving 50 Demonstrations
1 or 10Gbps to Calit2@UCSD Building Sept 2005September 26-30, 2005
Globally 10Gbp Optically Connected
Digital Cinema Collaboratory
Streaming 4K Live From NCSA Servers to Calit2@UCSD Auditorium
Content: Donna Cox, Robert Patterson, NCSA
Project StarGate Goals:
Combining Supercomputers and Supernetworks
• Create an “End-to-End”
10Gbps Workflow
• Explore Use of OptIPortals as
Petascale Supercomputer
“Scalable Workstations”
• Exploit Dynamic 10Gbps
Circuits on ESnet
• Connect Hardware Resources
at ORNL, ANL, SDSC
• Show that Data Need Not be
Trapped by the Network
“Event Horizon”
OptIPortal@SDSC
Rick Wagner Mike Norman
• ANL * Calit2 * LBNL * NICS * ORNL * SDSC
Source: Michael Norman, SDSC, UCSD
NICS
ORNL
NSF TeraGrid Kraken
Cray XT5
8,256 Compute Nodes
99,072 Compute Cores
129 TB RAM
simulation
Argonne NL
DOE Eureka
100 Dual Quad Core Xeon Servers
200 NVIDIA Quadro FX GPUs in 50
Quadro Plex S4 1U enclosures
3.2 TB RAM rendering
SDSC
Calit2/SDSC OptIPortal1
20 30” (2560 x 1600 pixel) LCD panels
10 NVIDIA Quadro FX 4600 graphics
cards > 80 megapixels
10 Gb/s network throughout
visualization
ESnet
10 Gb/s fiber optic network
*ANL * Calit2 * LBNL * NICS * ORNL * SDSC
www.calit2.net/newsroom/release.php?id=1624
Using Supernetworks to Couple End User
to Remote Supercomputers and Visualization Servers
Source: Mike Norman,
Rick Wagner, SDSC
Real-Time Interactive
Volume Rendering Streamed
from ANL to SDSC
Demoed
SC09
Integrated “OptIPlatform” Cyberinfrastructure System:
A 10Gbps Lightpath Cloud
National LambdaRail
Campus
Optical
Switch
Data Repositories & Clusters
HPC
HD/4k Video Images
HD/4k Video Cams
End User
OptIPortal
10G
Lightpath
HD/4k Telepresence
Instruments
LS 2009
Slide
So Why Don’t We Have a National
Big Data Cyberinfrastructure?
How Do You Get From Your Lab
to the Regional Optical Networks?
www.ctwatch.org
“Research is being stalled by ‘information overload,’ Mr. Bement said, because
data from digital instruments are piling up far faster than researchers can study.
In particular, he said, campus networks need to be improved. High-speed data
lines crossing the nation are the equivalent of six-lane superhighways, he said.
But networks at colleges and universities are not so capable. “Those massive
conduits are reduced to two-lane roads at most college and university
campuses,” he said. Improving cyberinfrastructure, he said, “will transform the
capabilities of campus-based scientists.”
-- Arden Bement, the director of the National Science Foundation
May 2005
DOE Esnet’s Science DMZ: A Scalable Network
Design Model for Optimizing Science Data Transfers
• A Science DMZ integrates 4 key concepts into a unified whole:
– A network architecture designed for high-performance applications,
with the science network distinct from the general-purpose network
– The use of dedicated systems for data transfer
– Performance measurement and network testing systems that are
regularly used to characterize and troubleshoot the network
– Security policies and enforcement mechanisms that are tailored for
high performance science environments
http://fasterdata.es.net/science-dmz/
Science DMZ
Coined 2010
The National Science Foundation
Has Funded Over 100 Campuses to Build Local Data Freeways
134 awards,
128 projects
- All but 4 states
- 120+ institutions
Creating a “Big Data” Plane on Campus:
NSF CC-NIE Funded Prism@UCSD and CHeruB
Prism@UCSD, Phil Papadopoulos, SDSC, Calit2, PI
CHERuB, Mike Norman, SDSC PI
CHERuB
Science DMZ Data Transfer Nodes - Optical Network Termination Devices:
Inexpensive PCs Optimized for Big Data
• FIONA – Flash I/O Network Appliance
– Combination of Desktop and Server Building Blocks
– US$5K - US$7K
– Desktop Flash up to 16TB
– RAID Drives up to 48TB
– 10GbE/40GbE Adapter
– Tested speed 40Gbs
– Developed Under
UCSD CC-NIE Prism Award
by UCSD’s
– Phil Papadopoulos
– Tom DeFanti
– Joe Keefe
FIONA
Data Appliance
9 X 256GB
510MB/sec
8 X 3TB
125MB/sec
2 x 40GbE
2 TB Cache
24TB Disk
Integrated Digital Cyberinfrastructure
Supporting Knight Lab
FIONA
12 Cores/GPU
128 GB RAM
3.5 TB SSD
48TB Disk
10Gbps NIC
Knight Lab
10Gbps
Gordon
Prism@UCSD
Data Oasis
7.5PB,
100GB/s
Knight 1024 Cluster
In SDSC Co-Lo
CHERuB
100Gbps
Emperor & Other Vis Tools
64Mpixel Data Analysis Wall
120Gbps
40Gbps
Interactively Exploring Microscope Images of Brains:
40Gbps From NCMIR to Calit2 64Mpixel Wall
Why Now?
Federating the Six UC CC-NIE Grants
• 2011 ACCI Strategic Recommendation to the NSF #3:
– NSF should create a new program funding high-speed (currently
10 Gbps) connections from campuses to the nearest landing point
for a national network backbone. The design of these connections
must include support for dynamic network provisioning services
and must be engineered to support rapid movement of large
scientific data sets."
– - pg. 6, NSF Advisory Committee for Cyberinfrastructure Task
Force on Campus Bridging, Final Report, March 2011
– www.nsf.gov/od/oci/taskforces/TaskForceReport_CampusBridging.pdf
– Led to Office of Cyberinfrastructure RFP March 1, 2012
• NSF’s Campus Cyberinfrastructure –
Network Infrastructure & Engineering (CC-NIE) Program
– 85 Grants Awarded So Far (NSF Summit Last Week)
– 6 Are in UC
UC Must Move Rapidly or Lose a Ten-Year Advantage!
UC IT Leadership Council
Oakland, CA
May 19, 2014
CENIC is Rapidly Moving to Connect
at 100 Gbps Across the State and Nation
DOE
Internet2
The Pacific Wave Platform
Creates an End-to-End Regional Science Big Data Freeway
Source:
John Hess, CENIC
Ten Week Sprint to Demonstrate the West Coast
Big Data Freeway System: PRPv0
Presented at CENIC 2015
March 9, 2015
FIONA DTNs Now Deployed to All UC Campuses
And Most PRP Sites
Pacific Research Platform
Multi-Campus Science Driver Teams
• Particle Physics
• Astronomy and Astrophysics
– Telescope Surveys
– Galaxy Evolution
– Gravitational Wave Astronomy
• Biomedical
– Cancer Genomics Hub/Browser
– Microbiome and Integrative ‘Omics
– Integrative Structural Biology
• Earth Sciences
– Data Analysis and Simulation for Earthquakes and Natural Disasters
– Climate Modeling: NCAR/UCAR
– California/Nevada Regional Climate Data Analysis
– CO2 Subsurface Modeling
• Scalable Visualization, Virtual Reality, and Ultra-Resolution Video
31
Particle Physics: Creating a 10-100 Gbps LambdaGrid
to Support LHC Researchers
ATLASCMS
U.S. Institutions
Participating in LHC
LHC Data
Generated by
CMS & ATLAS
Detectors
Analyzed
on OSG
Maps from www.uslhc.us
Two Automated Telescope Surveys
Creating Huge Datasets Will Drive PRP
300 images per night.
100MB per raw image
30GB per night
120GB per night
250 images per night.
530MB per raw image
150 GB per night
800GB per night
When processed
at NERSC
Increased by 4x
Source: Peter Nugent, Division Deputy for Scientific Engagement, LBL
Professor of Astronomy, UC Berkeley
Precursors to
LSST
And
NCSA
Cancer Genomics Hub (UCSC) is Housed in SDSC CoLo:
Large Data Flows to End Users
1G
8G
15G
Cumulative TBs of CGH
Files Downloaded
Data Source: David Haussler,
Brad Smith, UCSC
30 PB
To Map Out the Dynamics of Autoimmune Microbiome Ecology
Couples Next Generation Genome Sequencers to Big Data Supercomputers
Source: Weizhong Li, UCSD
Our Team Used 25 CPU-years
to Compute
Comparative Gut Microbiomes
Starting From
2.7 Trillion DNA Bases
of My Samples
and Healthy and IBD Controls
SDSC Gordon Data Supercomputer
Computing on Data: Complex Software Pipelines -
From Sequence to Taxonomy and Function
PI: (Weizhong Li, CRBS, UCSD):
NIH R01HG005978 (2010-2013, $1.1M)
Dan Cayan
USGS Water Resources Discipline
Scripps Institution of Oceanography, UC San Diego
much support from Mary Tyree, Mike Dettinger, Guido Franco and
other colleagues
Sponsors:
California Energy Commission
NOAA RISA program
California DWR, DOE, NSF
Planning for climate change in California
substantial shifts on top of already high climate variability
SIO Campus Climate Researchers Need to Download
Results from Remote Supercomputer Simulations
to Make Regional Climate Change Forecasts
Earth Sciences: Pacific Earthquake
Engineering Research Center
Enabling
Real-Time Coupling
Between
Shake Tables
and
Supercomputer
Simulations
Collaboration Between EVL’s CAVE2
and Calit2’s VROOM Over 10Gb Wavelength
EVL
Calit2
Source: NTT Sponsored ON*VECTOR Workshop at Calit2 March 6, 2013
Optical Fibers Link Australian and US
Big Data Researchers
Next Step: Use AARnet/PRP to Set Up
Planetary-Scale Shared Virtual Worlds
Digital Arena, UTS Sydney
CAVE2, Monash U, Melbourne
CAVE2, EVL, Chicago
PRP Timeline
• PRPv1
– A Layer 2 and Layer 3 System
– Completed In 2 Years
– Tested, Measured, Optimized, With Multi-domain Science Data
– Bring Many Of Our Science Teams Up
– Each Community Thus Will Have Its Own Certificate-Based Access
To its Specific Federated Data Infrastructure.
• PRPv2
– Advanced Ipv6-Only Version with Robust Security Features
– E.G., Trusted Platform Module Hardware and SDN/SDX Software
– Support Rates up to 100Gb/s in Bursts And Streams
– Develop Means to Operate a Shared Federation of Caches
The Pacific Wave Platform
Creates an End-to-End Regional Science Big Data Freeway
Source:
John Hess, CENIC
Opportunity:
Connect NCSA
to End Users
on PRP Campuses
@10Gbps

More Related Content

What's hot

What's hot (20)

PRP, CHASE-CI, TNRP and OSG
PRP, CHASE-CI, TNRP and OSGPRP, CHASE-CI, TNRP and OSG
PRP, CHASE-CI, TNRP and OSG
 
Peering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains NetworkPeering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains Network
 
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-LearningThe Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
 
National Federated Compute Platforms: The Pacific Research Platform
National Federated Compute Platforms: The Pacific Research PlatformNational Federated Compute Platforms: The Pacific Research Platform
National Federated Compute Platforms: The Pacific Research Platform
 
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained WorldInternet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
 
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
An Integrated Science Cyberinfrastructure for Data-Intensive Research
An Integrated Science Cyberinfrastructure for Data-Intensive ResearchAn Integrated Science Cyberinfrastructure for Data-Intensive Research
An Integrated Science Cyberinfrastructure for Data-Intensive Research
 
The Pacific Research Platform:a Science-Driven Big-Data Freeway System
The Pacific Research Platform:a Science-Driven Big-Data Freeway SystemThe Pacific Research Platform:a Science-Driven Big-Data Freeway System
The Pacific Research Platform:a Science-Driven Big-Data Freeway System
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Toward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing CyberinfrastructureToward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing Cyberinfrastructure
 
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean SciencesThe Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
 
The Transformational Nature of NSF Long-Term Funding on Santa Ana Wildfires i...
The Transformational Nature of NSF Long-Term Fundingon Santa Ana Wildfires i...The Transformational Nature of NSF Long-Term Fundingon Santa Ana Wildfires i...
The Transformational Nature of NSF Long-Term Funding on Santa Ana Wildfires i...
 
Advanced Cyberinfrastructure Enabled Services and Applications in 2021
Advanced Cyberinfrastructure Enabled Services and Applications in 2021Advanced Cyberinfrastructure Enabled Services and Applications in 2021
Advanced Cyberinfrastructure Enabled Services and Applications in 2021
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Calit2 as a Model for Collaborative Innovation
Calit2 as a Model for Collaborative InnovationCalit2 as a Model for Collaborative Innovation
Calit2 as a Model for Collaborative Innovation
 
The Creation of Calit2
The Creation of Calit2The Creation of Calit2
The Creation of Calit2
 
What I’ve Learned About “Green”
What I’ve Learned About “Green”What I’ve Learned About “Green”
What I’ve Learned About “Green”
 
CENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big DataCENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big Data
 
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
The Pacific Research Platform: A Science-Driven Big-Data Freeway SystemThe Pacific Research Platform: A Science-Driven Big-Data Freeway System
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
 

Similar to The Pacific Research Platform: a Science-Driven Big-Data Freeway System

Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2
Dan Taylor
 

Similar to The Pacific Research Platform: a Science-Driven Big-Data Freeway System (20)

Building a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration InfrastructureBuilding a Regional 100G Collaboration Infrastructure
Building a Regional 100G Collaboration Infrastructure
 
An Integrated West Coast Science DMZ for Data-Intensive Research
An Integrated West Coast Science DMZ for Data-Intensive ResearchAn Integrated West Coast Science DMZ for Data-Intensive Research
An Integrated West Coast Science DMZ for Data-Intensive Research
 
OptIPuter-A High Performance SOA LambdaGrid Enabling Scientific Applications
OptIPuter-A High Performance SOA LambdaGrid Enabling Scientific ApplicationsOptIPuter-A High Performance SOA LambdaGrid Enabling Scientific Applications
OptIPuter-A High Performance SOA LambdaGrid Enabling Scientific Applications
 
Physics Research in an Era of Global Cyberinfrastructure
Physics Research in an Era of Global CyberinfrastructurePhysics Research in an Era of Global Cyberinfrastructure
Physics Research in an Era of Global Cyberinfrastructure
 
High Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive ResearchHigh Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive Research
 
Positioning University of California Information Technology for the Future: S...
Positioning University of California Information Technology for the Future: S...Positioning University of California Information Technology for the Future: S...
Positioning University of California Information Technology for the Future: S...
 
Blowing up the Box--the Emergence of the Planetary Computer
Blowing up the Box--the Emergence of the Planetary ComputerBlowing up the Box--the Emergence of the Planetary Computer
Blowing up the Box--the Emergence of the Planetary Computer
 
High Performance Collaboration – The Jump to Light Speed
High Performance Collaboration – The Jump to Light SpeedHigh Performance Collaboration – The Jump to Light Speed
High Performance Collaboration – The Jump to Light Speed
 
Toward a National Research Platform
Toward a National Research PlatformToward a National Research Platform
Toward a National Research Platform
 
Coupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyCoupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation Economy
 
The OptIPuter Project: From the Grid to the LambdaGrid
The OptIPuter Project: From the Grid to the LambdaGridThe OptIPuter Project: From the Grid to the LambdaGrid
The OptIPuter Project: From the Grid to the LambdaGrid
 
Envisioning the Future
Envisioning the FutureEnvisioning the Future
Envisioning the Future
 
Pronet Public Presentation v1 2
Pronet Public Presentation v1 2Pronet Public Presentation v1 2
Pronet Public Presentation v1 2
 
Science and Cyberinfrastructure in the Data-Dominated Era
Science and Cyberinfrastructure in the Data-Dominated EraScience and Cyberinfrastructure in the Data-Dominated Era
Science and Cyberinfrastructure in the Data-Dominated Era
 
Coupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyCoupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation Economy
 
Coupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation EconomyCoupling Australia’s Researchers to the Global Innovation Economy
Coupling Australia’s Researchers to the Global Innovation Economy
 
Calit2: a View Into the Future of the Wired and Unwired Internet
Calit2: a View Into the Future of the Wired and Unwired InternetCalit2: a View Into the Future of the Wired and Unwired Internet
Calit2: a View Into the Future of the Wired and Unwired Internet
 
Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2Internet2 Bio IT 2016 v2
Internet2 Bio IT 2016 v2
 
Using OptIPuter Innovations to Enable LambdaGrid Applications
Using OptIPuter Innovations to Enable LambdaGrid ApplicationsUsing OptIPuter Innovations to Enable LambdaGrid Applications
Using OptIPuter Innovations to Enable LambdaGrid Applications
 
Cal-(IT)2 Projects with Sun Microsystems
Cal-(IT)2 Projects with Sun MicrosystemsCal-(IT)2 Projects with Sun Microsystems
Cal-(IT)2 Projects with Sun Microsystems
 

More from Larry Smarr

More from Larry Smarr (20)

My Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsMy Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 Years
 
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated Systems
 
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
 
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An Overview
 
Panel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksPanel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical Networks
 
Global Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownGlobal Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine Brown
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questions
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish Parashar
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
 
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Frank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardFrank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forward
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

The Pacific Research Platform: a Science-Driven Big-Data Freeway System

  • 1. “The Pacific Research Platform: a Science-Driven Big-Data Freeway System.” NCSA Colloquium National Center for Supercomputing Applications University of Illinois, Urbana-Champaign September 18, 2015 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD http://lsmarr.calit2.net 1
  • 2. Abstract Research in data-intensive fields is increasingly multi-investigator and multi-institutional, depending on ever more rapid access to ultra-large heterogeneous and widely distributed datasets. The Pacific Research Platform (PRP) is a multi- institutional extensible deployment that establishes a science-driven high-capacity data-centric “freeway system.” The PRP spans all 10 campuses of the University of California, as well as the major California private research universities, four supercomputer centers, and several universities outside California. Fifteen multi-campus data-intensive application teams act as drivers of the PRP, providing feedback over the five years to the technical design staff. These application areas include particle physics, astronomy/astrophysics, earth sciences, biomedicine, and scalable multimedia, providing models for many other applications. The PRP partnership extends the NSF-funded campus Science DMZs to a regional model that allows high-speed data-intensive networking, facilitating researchers moving data between their labs and their collaborators’ sites, supercomputer centers or data repositories, and enabling that data to traverse multiple heterogeneous networks without performance degradation over campus, regional, national, and international distances
  • 3. Vision: Creating a West Coast “Big Data Freeway” Connected by CENIC/Pacific Wave to Internet2 & GLIF Use Lightpaths to Connect All Data Generators and Consumers, Creating a “Big Data” Plane Integrated With High Performance Global Networks “The Bisection Bandwidth of a Cluster Interconnect, but Deployed on a 10-Campus Scale.” This Vision Has Been Building for Over Two Decades
  • 4. NCSA Telnet--“Hide the Cray” Paradigm That We Still Use Today • NCSA Telnet -- Interactive Access – From Macintosh or PC Computer – To Telnet Hosts on TCP/IP Networks • Allows for Simultaneous Connections – To Numerous Computers on The Net – Standard File Transfer Server (FTP) – Lets You Transfer Files to and from Remote Machines and Other Users John Kogut Simulating Quantum Chromodynamics He Uses a Mac—The Mac Uses the Cray Source: Larry Smarr 1985 Data Generator Data Portal Data Transmission
  • 5. Interactive Supercomputing Collaboratory Prototype: Using Analog Communications to Prototype the Fiber Optic Future “We’re using satellite technology… to demo what It might be like to have high-speed fiber-optic links between advanced computers in two different geographic locations.” ― Al Gore, Senator Chair, US Senate Subcommittee on Science, Technology and Space Illinois Boston SIGGRAPH 1989 “What we really have to do is eliminate distance between individuals who want to interact with other people and with other computers.” ― Larry Smarr, Director, NCSA
  • 6. I-WAY: Information Wide Area Year Supercomputing ‘95 • The First National 155 Mbps Research Network – 65 Science Projects – Into the San Diego Convention Center • I-Way Featured: – Networked Visualization Application Demonstrations – Large-Scale Immersive Displays – I-Soft Programming Environment UIC http://archive.ncsa.uiuc.edu/General/Training/SC95/GII.HPCC.html CitySpace Cellular Semiotics
  • 7. PACI is Prototyping America’s 21st Century Information Infrastructure The PACI Grid Testbed National Computational Science 1997
  • 8. Chesapeake Bay Simulation Collaboratory: vBNS Linked CAVE, ImmersaDesk, Power Wall, and Workstation Alliance Project: Collaborative Video Production via Tele-Immersion and Virtual Director UIC Donna Cox, Robert Patterson, Stuart Levy, NCSA Virtual Director Team Glenn Wheless, Old Dominion Univ. Alliance Application Technologies Environmental Hydrology Team 4 MPixel PowerWall Alliance 1997
  • 9. UIC ANL NCSA/UIUC UC NU MREN IIT True Grid Project Started March 1999 State Commits $7.5M over 4 years Illinois is Positioned to Seize National Optical Networking Leadership with I-WIRE Infrastructure Investment • State-Funded Infrastructure –Application Driven –High Definition Streaming Media –Telepresence and Media –Computational Grids –Cloud Computing –Data Grids –Search & Information Analysis –EmergingTech Proving Ground –Optical Switching –Dense Wave Division Multiplexing –Advanced Middleware Infrastructure –Wireless Extensions Source: Charlie Catlett, ANL1999 LS Slide
  • 10. Two New Calit2 Buildings Provide New Laboratories for “Living in the Future” • “Convergence” Laboratory Facilities – Nanotech, BioMEMS, Chips, Radio, Photonics – Virtual Reality, Digital Cinema, HDTV, Gaming • Over 1000 Researchers in Two Buildings – Linked via Dedicated Optical Networks UC Irvine www.calit2.net Preparing for a World in Which Distance is Eliminated…
  • 11. Linking the Calit2 Auditoriums at UCSD and UCI With HD Streams September 8, 2009 Photo by Erik Jepsen, UC San Diego Sept. 8, 2009
  • 12. NSF’s OptIPuter Project: Using Supernetworks to Meet the Needs of Data-Intensive Researchers OptIPortal– Termination Device for the OptIPuter Global Backplane Calit2 (UCSD, UCI), SDSC, and UIC Leads—Larry Smarr PI Univ. Partners: NCSA, USC, SDSU, NW, TA&M, UvA, SARA, KISTI, AIST Industry: IBM, Sun, Telcordia, Chiaro, Calient, Glimmerglass, Lucent 2003-2009 $13,500,000 In August 2003, Jason Leigh and his students used RBUDP to blast data from NCSA to SDSC over the TeraGrid DTFnet, achieving18Gbps file transfer out of the available 20Gbps
  • 13. High Resolution Uncompressed HD Streams Require Multi-Gigabit/s Lambdas U. Washington JGN II Workshop Osaka, Japan Jan 2005 Prof.Osaka Prof. Aoyama Prof. Smarr Source: U Washington Research Channel Telepresence Using Uncompressed 1.5 Gbps HDTV Streaming Over IP on Fiber Optics-- 75x Home Cable “HDTV” Bandwidth! “I can see every hair on your head!”—Prof. Aoyama
  • 14. First Trans-Pacific Super High Definition Telepresence Meeting Using Digital Cinema 4k Streams Keio University President Anzai UCSD Chancellor Fox Lays Technical Basis for Global Digital Cinema Sony NTT SGI Streaming 4k with JPEG 2000 Compression ½ gigabit/sec 100 Times the Resolution of YouTube! 4k = 4000x2000 Pixels = 4xHD 21 Countries Driving 50 Demonstrations 1 or 10Gbps to Calit2@UCSD Building Sept 2005September 26-30, 2005
  • 15. Globally 10Gbp Optically Connected Digital Cinema Collaboratory Streaming 4K Live From NCSA Servers to Calit2@UCSD Auditorium Content: Donna Cox, Robert Patterson, NCSA
  • 16. Project StarGate Goals: Combining Supercomputers and Supernetworks • Create an “End-to-End” 10Gbps Workflow • Explore Use of OptIPortals as Petascale Supercomputer “Scalable Workstations” • Exploit Dynamic 10Gbps Circuits on ESnet • Connect Hardware Resources at ORNL, ANL, SDSC • Show that Data Need Not be Trapped by the Network “Event Horizon” OptIPortal@SDSC Rick Wagner Mike Norman • ANL * Calit2 * LBNL * NICS * ORNL * SDSC Source: Michael Norman, SDSC, UCSD
  • 17. NICS ORNL NSF TeraGrid Kraken Cray XT5 8,256 Compute Nodes 99,072 Compute Cores 129 TB RAM simulation Argonne NL DOE Eureka 100 Dual Quad Core Xeon Servers 200 NVIDIA Quadro FX GPUs in 50 Quadro Plex S4 1U enclosures 3.2 TB RAM rendering SDSC Calit2/SDSC OptIPortal1 20 30” (2560 x 1600 pixel) LCD panels 10 NVIDIA Quadro FX 4600 graphics cards > 80 megapixels 10 Gb/s network throughout visualization ESnet 10 Gb/s fiber optic network *ANL * Calit2 * LBNL * NICS * ORNL * SDSC www.calit2.net/newsroom/release.php?id=1624 Using Supernetworks to Couple End User to Remote Supercomputers and Visualization Servers Source: Mike Norman, Rick Wagner, SDSC Real-Time Interactive Volume Rendering Streamed from ANL to SDSC Demoed SC09
  • 18. Integrated “OptIPlatform” Cyberinfrastructure System: A 10Gbps Lightpath Cloud National LambdaRail Campus Optical Switch Data Repositories & Clusters HPC HD/4k Video Images HD/4k Video Cams End User OptIPortal 10G Lightpath HD/4k Telepresence Instruments LS 2009 Slide
  • 19. So Why Don’t We Have a National Big Data Cyberinfrastructure?
  • 20. How Do You Get From Your Lab to the Regional Optical Networks? www.ctwatch.org “Research is being stalled by ‘information overload,’ Mr. Bement said, because data from digital instruments are piling up far faster than researchers can study. In particular, he said, campus networks need to be improved. High-speed data lines crossing the nation are the equivalent of six-lane superhighways, he said. But networks at colleges and universities are not so capable. “Those massive conduits are reduced to two-lane roads at most college and university campuses,” he said. Improving cyberinfrastructure, he said, “will transform the capabilities of campus-based scientists.” -- Arden Bement, the director of the National Science Foundation May 2005
  • 21. DOE Esnet’s Science DMZ: A Scalable Network Design Model for Optimizing Science Data Transfers • A Science DMZ integrates 4 key concepts into a unified whole: – A network architecture designed for high-performance applications, with the science network distinct from the general-purpose network – The use of dedicated systems for data transfer – Performance measurement and network testing systems that are regularly used to characterize and troubleshoot the network – Security policies and enforcement mechanisms that are tailored for high performance science environments http://fasterdata.es.net/science-dmz/ Science DMZ Coined 2010
  • 22. The National Science Foundation Has Funded Over 100 Campuses to Build Local Data Freeways 134 awards, 128 projects - All but 4 states - 120+ institutions
  • 23. Creating a “Big Data” Plane on Campus: NSF CC-NIE Funded Prism@UCSD and CHeruB Prism@UCSD, Phil Papadopoulos, SDSC, Calit2, PI CHERuB, Mike Norman, SDSC PI CHERuB
  • 24. Science DMZ Data Transfer Nodes - Optical Network Termination Devices: Inexpensive PCs Optimized for Big Data • FIONA – Flash I/O Network Appliance – Combination of Desktop and Server Building Blocks – US$5K - US$7K – Desktop Flash up to 16TB – RAID Drives up to 48TB – 10GbE/40GbE Adapter – Tested speed 40Gbs – Developed Under UCSD CC-NIE Prism Award by UCSD’s – Phil Papadopoulos – Tom DeFanti – Joe Keefe FIONA Data Appliance 9 X 256GB 510MB/sec 8 X 3TB 125MB/sec 2 x 40GbE 2 TB Cache 24TB Disk
  • 25. Integrated Digital Cyberinfrastructure Supporting Knight Lab FIONA 12 Cores/GPU 128 GB RAM 3.5 TB SSD 48TB Disk 10Gbps NIC Knight Lab 10Gbps Gordon Prism@UCSD Data Oasis 7.5PB, 100GB/s Knight 1024 Cluster In SDSC Co-Lo CHERuB 100Gbps Emperor & Other Vis Tools 64Mpixel Data Analysis Wall 120Gbps 40Gbps
  • 26. Interactively Exploring Microscope Images of Brains: 40Gbps From NCMIR to Calit2 64Mpixel Wall
  • 27. Why Now? Federating the Six UC CC-NIE Grants • 2011 ACCI Strategic Recommendation to the NSF #3: – NSF should create a new program funding high-speed (currently 10 Gbps) connections from campuses to the nearest landing point for a national network backbone. The design of these connections must include support for dynamic network provisioning services and must be engineered to support rapid movement of large scientific data sets." – - pg. 6, NSF Advisory Committee for Cyberinfrastructure Task Force on Campus Bridging, Final Report, March 2011 – www.nsf.gov/od/oci/taskforces/TaskForceReport_CampusBridging.pdf – Led to Office of Cyberinfrastructure RFP March 1, 2012 • NSF’s Campus Cyberinfrastructure – Network Infrastructure & Engineering (CC-NIE) Program – 85 Grants Awarded So Far (NSF Summit Last Week) – 6 Are in UC UC Must Move Rapidly or Lose a Ten-Year Advantage! UC IT Leadership Council Oakland, CA May 19, 2014
  • 28. CENIC is Rapidly Moving to Connect at 100 Gbps Across the State and Nation DOE Internet2
  • 29. The Pacific Wave Platform Creates an End-to-End Regional Science Big Data Freeway Source: John Hess, CENIC
  • 30. Ten Week Sprint to Demonstrate the West Coast Big Data Freeway System: PRPv0 Presented at CENIC 2015 March 9, 2015 FIONA DTNs Now Deployed to All UC Campuses And Most PRP Sites
  • 31. Pacific Research Platform Multi-Campus Science Driver Teams • Particle Physics • Astronomy and Astrophysics – Telescope Surveys – Galaxy Evolution – Gravitational Wave Astronomy • Biomedical – Cancer Genomics Hub/Browser – Microbiome and Integrative ‘Omics – Integrative Structural Biology • Earth Sciences – Data Analysis and Simulation for Earthquakes and Natural Disasters – Climate Modeling: NCAR/UCAR – California/Nevada Regional Climate Data Analysis – CO2 Subsurface Modeling • Scalable Visualization, Virtual Reality, and Ultra-Resolution Video 31
  • 32. Particle Physics: Creating a 10-100 Gbps LambdaGrid to Support LHC Researchers ATLASCMS U.S. Institutions Participating in LHC LHC Data Generated by CMS & ATLAS Detectors Analyzed on OSG Maps from www.uslhc.us
  • 33. Two Automated Telescope Surveys Creating Huge Datasets Will Drive PRP 300 images per night. 100MB per raw image 30GB per night 120GB per night 250 images per night. 530MB per raw image 150 GB per night 800GB per night When processed at NERSC Increased by 4x Source: Peter Nugent, Division Deputy for Scientific Engagement, LBL Professor of Astronomy, UC Berkeley Precursors to LSST And NCSA
  • 34. Cancer Genomics Hub (UCSC) is Housed in SDSC CoLo: Large Data Flows to End Users 1G 8G 15G Cumulative TBs of CGH Files Downloaded Data Source: David Haussler, Brad Smith, UCSC 30 PB
  • 35. To Map Out the Dynamics of Autoimmune Microbiome Ecology Couples Next Generation Genome Sequencers to Big Data Supercomputers Source: Weizhong Li, UCSD Our Team Used 25 CPU-years to Compute Comparative Gut Microbiomes Starting From 2.7 Trillion DNA Bases of My Samples and Healthy and IBD Controls SDSC Gordon Data Supercomputer
  • 36. Computing on Data: Complex Software Pipelines - From Sequence to Taxonomy and Function PI: (Weizhong Li, CRBS, UCSD): NIH R01HG005978 (2010-2013, $1.1M)
  • 37. Dan Cayan USGS Water Resources Discipline Scripps Institution of Oceanography, UC San Diego much support from Mary Tyree, Mike Dettinger, Guido Franco and other colleagues Sponsors: California Energy Commission NOAA RISA program California DWR, DOE, NSF Planning for climate change in California substantial shifts on top of already high climate variability SIO Campus Climate Researchers Need to Download Results from Remote Supercomputer Simulations to Make Regional Climate Change Forecasts
  • 38. Earth Sciences: Pacific Earthquake Engineering Research Center Enabling Real-Time Coupling Between Shake Tables and Supercomputer Simulations
  • 39. Collaboration Between EVL’s CAVE2 and Calit2’s VROOM Over 10Gb Wavelength EVL Calit2 Source: NTT Sponsored ON*VECTOR Workshop at Calit2 March 6, 2013
  • 40. Optical Fibers Link Australian and US Big Data Researchers
  • 41. Next Step: Use AARnet/PRP to Set Up Planetary-Scale Shared Virtual Worlds Digital Arena, UTS Sydney CAVE2, Monash U, Melbourne CAVE2, EVL, Chicago
  • 42. PRP Timeline • PRPv1 – A Layer 2 and Layer 3 System – Completed In 2 Years – Tested, Measured, Optimized, With Multi-domain Science Data – Bring Many Of Our Science Teams Up – Each Community Thus Will Have Its Own Certificate-Based Access To its Specific Federated Data Infrastructure. • PRPv2 – Advanced Ipv6-Only Version with Robust Security Features – E.G., Trusted Platform Module Hardware and SDN/SDX Software – Support Rates up to 100Gb/s in Bursts And Streams – Develop Means to Operate a Shared Federation of Caches
  • 43. The Pacific Wave Platform Creates an End-to-End Regional Science Big Data Freeway Source: John Hess, CENIC Opportunity: Connect NCSA to End Users on PRP Campuses @10Gbps