SlideShare uma empresa Scribd logo
1 de 61
HDF Update
Mike Folk
The HDF Group
HDF and HDF-EOS Workshop XII
Aurora, Colorado
October 16, 2008
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

1
Topics

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

2
Topics

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

3
What’s up with The HDF
Group?

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

4
Announcement!
NASA Commits $3.1M to
The HDF Group to
Support Earth System
Science

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

5
NASA Commits …
• “The HDF Group has received a 3-year contract from
NASA to provide ongoing development and support for
the HDF technologies used by NASA’s Earth Observing
System.
• The project continues the relationship that was first
established in 1994, when HDF was selected as the
standard format for the EOS Data and Information
System (EOSDIS).
• Since that time, over 4 petabytes of mission data and
derived data products have been stored in HDF4 and
HDF5, with an estimated 1.6 million users.

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

6
• Under the new contract, The HDF Group will
support NASA’s EOS program in five critical
areas:
− Provide user support to EOS data providers and
data consumers
− Perform software development and quality
assurance
− Assure long-term access to HDF data
− Integrate with complementary technologies and
applications
− Advise follow-on earth systems projects
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

7
What is
The HDF Group
And why does it exist?

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

8
History of The HDF Group
• 18 Years at University of Illinois National Center
for Supercomputing Applications
• Spun-off from University July 2006
• Non-profit
• 20+ scientific, technology, professional staff
• Intellectual property:
− The HDF Group owns HDF4 and HDF5
− HDF formats and libraries to remain open
− BSD-type license

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

9
The HDF Group Mission
To ensure long-term
accessibility of HDF data
through sustainable
development and support of
HDF technologies.
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

10
Goals
• Maintain, evolve HDF for sponsors and
communities that depend on it
• Provide consulting, training, tuning,
development, research
• Sustain the group for long term to assure data
access over time

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

11
The HDF Group Services
•

Helpdesk and Mailing Lists

− Available to all users as a first level of support

•

Standard Support

− Rapid issue resolution support

•

Consulting

− Needs assessment, troubleshooting, design reviews, etc.

•

Enterprise Support

− Coordinating HDF activities across departments

•

Special Projects

− Adapting customer applications to HDF
− New features and tools, with changes normally incorporated into open
source product
− Research and Development

•

Training

− Tutorials and hands-on practical experience

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

12
Members of the HDF support community
•
•
•
•
•
•

NASA
Sandia National Laboratory (2)
University of Illinois/NCSA
A leading U.S. aerospace company
NOAA Science Data Stewardship
New projects and partners

− A major product lifecycle management company
− A bioinformatics software company
− Engineering Research and Development Center –
Topographic Engineering Center
− NPOESS
− ITT VIS

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

13
Initiatives and areas of increased interest
•
•
•
•
•
•
•

Bioinformatics
High performance computing (HPC)
Microsoft products (HPC, .NET, others)
Database integration
Improving concurrency
Performance and storage efficiency
Improving high level language support

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

14
Topics

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

15
Basic Library Releases
F5
HD

HDF4

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

16

4
HDF
Overview of basic library releases

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

17
HDF5 1.8.0 (Feb 08)
• Major release with file format changes and
features.
• File format changes affect backward/forward
compatibility with previous releases.
• See "New Features in Release 1.8.0 and Format
Compatibility Considerations”
http://hdfgroup.org/HDF5/doc/ADGuide/CompatFormat180.html

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

18
HDF5 1.8 minor releases
• 1.8.1 (May 08)
− A minor release with bug fixes
− Provided 1.8 full support for Fortran applications
− Enhanced tools with 1.8.0 features

• HDF5 1.8.2 coming Nov 08
− Minor bug fixes
− Tool enhancements

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

19
HDF5 1.6 minor releases
• 1.6.7 (Feb 08)
− Modification to address Aura issue

• 1.6.8 coming Nov 08
− Minor bug fixes

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

20
Future HDF5 releases (highlights)
• Release HDF5 1.10.0
−
−
−
−

Performance improvements
Some new features
Support for Fortran 2003 features
Target date November 2009

• When to drop support for 1.6.* ?

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

21
HDF 4 minor releases
• 4.2r3 (Feb 08)
− Improved support for apps using HDF4 and NetCDF3
− Improved support for data sets and coordinate
variable with the same names

• Release HDF4r2.4 coming Nov 08
− Minor bug fixing, tools enhancements
− Support for C shared libraries
− Support for 32-bit version on Mac Intel

• http://hdfgroup.org/products/hdf4/
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

22
H4-H5 Conversion Software 2.0 (May)

• Re-built with HDF5 1.8.1 and HDF 4.2r3.
• Conversion tool h4toh5 enhanced
− Converts HDF-EOS2 files to HDF5 files
− Makes HDF5 files readable by NetCDF4

http://hdfgroup.org/h4toh5/

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

23
HDF-EOS library

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

24
HDF-EOS2 and HDF-EOS5
• Auto configuration for HDF-EOS2 and HDF-EOS5
− Compile and test libraries with automatic
configuration tools
− Thank you, Abe!

• Testing of EOS2 and EOS5
− Test daily with HDF4 and HDF5 development code
− Periodically test on EOS-critical platforms

• EOS website support

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

25
Topics

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

26
h5check 1.0 (March 2008)
• A validation tool to verify whether an HDF5 file
is encoded according to the HDF5 File Format
Specification.
• To ensure format integrity and long-term
compatibility between versions of the HDF5
library.
• By default, the file is verified against 1.8.x.
Can also verify against 1.6.x.

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

27
Major Improvements for Existing Tools
• Improved handling of large datasets by h5diff,
h5repack, hdiff, and hrepack
• Other added capabilities
−
−
−
−

H5import: to import strings
H5diff: to deal with NaN values
H5dump: to dump objects in requested order
H5repack:
• To apply multiple filters to all objects
• To add a userblock
• To align datasets in file at byte offsets that support
efficient access

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

28
In the works: h52jpeg
• Converts datasets in an HDF5 file to a jpeg image.
• Prototype available, if you are interested.

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

29
Please send us your
comments and requests
regarding the HDF4 and
HDF5 library and tools

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

30
Topics

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

31
HDF Java
• HDF-Java 2.5 release
− Beta 1 Release Feb 08
− Full release planned for Dec. 2008

• HDF5 JNI updated for HDF5 1.8.x with 1.6 flag
• Binary for 32-bit Linux and 64-bit Solaris
• Also added daily testing added for hdf-java
products

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

32
Also in the pipeline
• Full Java Support for HDF5 1.8.x
− Add and test new functions in Java wrapper
− Implement and test new functions in C JNI
− Use new functions in HDF-Java objects

• Add many new features
• Improve performance
• Revise HDFView User’s Guide

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

33
Topics

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

34
Surviving a System Failure

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

35
Surviving a System Failure in HDF5
• Problem:
− In the event of an application or system crash, data
in HDF5 files are susceptible to corruption
− Corruption can occur if structural metadata is being
written when the crash occurs

• Initial Objective:
− Guarantee an HDF5 file with consistent metadata
can be reconstructed in the event of a crash
− No guarantee on state of raw data – contains
whatever data made it to disk prior to crash
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

36
HDF5 Metadata Journaling Recovery
Application
crashes

H5recover Tool

Restored
HDF5
File

Corrupted HDF5 File

Companion Journal File

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

38
Faster HDF5 Data Appends

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

40
Fast Data Appends
• Problem: Metadata operations limit the rate at
which HDF5 can append data to datasets.
• Solution: new data structure for indexing chunks:
− Allows constant time extend, shrink and lookup of
chunks in datasets with single unlimited dimension
− # of metadata I/O operations to append to dataset
is independent of # of chunks
− Also allows single-writer/multiple-reader access

• Details at:
http://hdfgroup.uiuc.edu/RFC/HDF5/ReviseChunks/

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

41
HDF Performance
Framework
A framework for performance
regression testing

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

42
HDF Performance Framework
• A tool for
−
−
−
−

Testing on multiple platforms
Testing different versions
Long term regression testing
Assistance in debugging

• New for 1.8:
− API and format versioning
− Improved reporting interfaces

• Future related work
− Quality monitoring of the software, such as code
coverage, memory usage
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

43
Other library work

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

44
Library Features
• Improved external link support
− External link: link to HDF5 object in another file
− Can more easily specify path lookup of external
files
− Adding external link support for h5ls and h5dump

• Time datatype improvements
− Expand time type to support native formats better
− Adapt tools to display them properly

• Port to OpenVMS (limited support)

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

45
Improving performance
• Faster file free-space management while file open
• Many transactions can create many holes
• Free space management recovers unused space
• Up to 38x improvement in experiments

• Direct I/O: file I/O goes directly between
application and storage, bypassing operating
system read and write caches
• Disabling automatic metadata cache flushing
− In experiments, direct I/O combined with metadata
cache disabling improved I/O speed by about 2x.
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

46
Topics

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

47
Remote access

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

48
Three “remote access” projects
• HDF5-OPeNDAP handler
− See talk by Kent Yang: “HDF5 OPeNDAP project
update and demo”

• HDF5-iRODS integration
− See Peter Cao’s talk Thursday: “HDF5 iRODS”

• Accessing HDF5 through SSHFS-FUSE

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

49
Accessing HDF5 through SSHFS-FUSE
• Access to files on remote NFS system limited
• Combining FUSE (Filesystem in Userspace) with SSHFS
(Secure Shell File System)

− FUSE provides application with local view of remote file system
• Another way to mount remote file system

− SSHFS allows the local file system to access parts of remote
file.
• e.g., “read” operation on the remote filesystem can be served
through SSH
• Subsetting can be efficiently done with SSHFS

• Extract a dataset (5 MB) from a 96 MB HDF5 file

− Download whole file + subset locally: 9.85 seconds
− Subset with SSHFS: 0.47 seconds

• Technical report in the works

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

50
HDF4 Layout Map Project
• Problem
− Long-term readability of HDF data dependent on
long-term availability of HDF software

• Proposed solution
− Create a map of the layout of data objects in an
HDF file, allowing a simple reader to be written to
access the data

• See today’s talk by Folk and Duerr: “Ensuring
Long Term Access to Remotely Sensed HDF4
Data with Layout Maps.”
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

51
HDF and .NET Framework
• Prototype .NET wrappers for HDF5 1.8.0
− Based on subset of HDF5 C routines

• Released in March, 2008
• Unsupported
− Considerable interest, but currently no funding to
support or maintain
− Use hdf-forum email list for questions

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

52
netCDF-4
Released June 2008!!

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

53
Investigation of HDF
Support in Some Open
Source Software Packages

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

54
Five open source packages
•

PyHDF

− Python interface to HDF4
− http://pysclint.sourceforge.net/pyhdf/

•

Geospatial Data Abstraction Library (GDAL)

− Translator library for Raster Geospatial Data Formats
− Supports about 100 file formats
− http://gdal.org/

•

NCAR Common Language (NCL)

− Interpreted Language for Data Analysis and Visualization
− http://ncl.ucar.edu/

•

Grid Analysis and Display System (GrADS)

− Interpreted Language for Data Analysis and Visualization
− http://iges.org/grads/

•

GNU Data Language (GDL)

− Interpreted Language for Data Analysis and Visualization
− Data Analysis and Visualization
− http://gnudatalanguage.sourceforge.net/

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

55
Evaluation criteria
• Formats
− HDF4, HDF5, netCDF
− Objects supported in each language

• Installation
− Availability of binaries
− Other requirements

• Adequacy of documentation
• Technical report available soon.
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

56
Windows Virtualization
Motivation: high cost of
maintaining many different
Windows configurations
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

57
Maintenance & Testing with VMWare
•
•
•
•
•

Multiple virtual machines run in parallel
Only relevant software installed
Each represents a supported configuration
Run nightly tests of HDF4, HDF5
Each is powered on, tested, cleaned
automatically

• Technical report available soon.

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

58
HDF5 Data Transform Pilot Study
• Tools for Flight Test Data
• Framework to define and apply transformations
to data being read
• Transformations specified in Python

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

59
Science Data Stewardship
• Goal: migrate data to a single standards-based archive
format.
• Approach: investigate how to store NASA ECS data and
metadata in HDF5 Archival Information Packages (AIP).
• See talk by Yang, Duerr et al: “Using HDF5 Archive
Information Package to preserve HDF-EOS2 data”

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

60
Thank You All
and
Thank You NASA!

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

61
Acknowledgements
This report is based upon work supported in part
by a Cooperative Agreement with the National
Aeronautics and Space Administration (NASA)
under NASA Awards NNX06AC83A and
NNX08AO77A.
Any opinions, findings, and conclusions or
recommendations expressed in this material are
those of the author(s) and do not necessarily
reflect the views of the National Aeronautics and
Space Administration.
Oct. 16, 2008

HDF and HDF-EOS Workshop XII

62
Questions/comments?

Oct. 16, 2008

HDF and HDF-EOS Workshop XII

63

Mais conteúdo relacionado

Mais procurados

Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 productsInteroperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 productsThe HDF-EOS Tools and Information Center
 

Mais procurados (20)

Archive Information Packages for NASA HDF-EOS Data
Archive Information Packages for NASA HDF-EOS DataArchive Information Packages for NASA HDF-EOS Data
Archive Information Packages for NASA HDF-EOS Data
 
HDF4 Mapping Project Update
HDF4 Mapping Project UpdateHDF4 Mapping Project Update
HDF4 Mapping Project Update
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
Welcome to HDF Workshop V
Welcome to HDF Workshop VWelcome to HDF Workshop V
Welcome to HDF Workshop V
 
HDF Group Support for NPP/NPOESS/JPSS
HDF Group Support for NPP/NPOESS/JPSSHDF Group Support for NPP/NPOESS/JPSS
HDF Group Support for NPP/NPOESS/JPSS
 
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 productsInteroperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
Interoperability with netCDF-4 - Experience with NPP and HDF-EOS5 products
 
HDF OPeNDAP Project Update and Demo
HDF OPeNDAP Project Update and DemoHDF OPeNDAP Project Update and Demo
HDF OPeNDAP Project Update and Demo
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
The HDF Group: Community models and outreach
The HDF Group: Community models and outreachThe HDF Group: Community models and outreach
The HDF Group: Community models and outreach
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
Using IDL with Suomi NPP VIIRS Data
Using IDL with Suomi NPP VIIRS DataUsing IDL with Suomi NPP VIIRS Data
Using IDL with Suomi NPP VIIRS Data
 
HDF and netCDF Data Support in ArcGIS
HDF and netCDF Data Support in ArcGISHDF and netCDF Data Support in ArcGIS
HDF and netCDF Data Support in ArcGIS
 
HDF5 and The HDF Group
HDF5 and The HDF GroupHDF5 and The HDF Group
HDF5 and The HDF Group
 
HDF Project Update
HDF Project UpdateHDF Project Update
HDF Project Update
 
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
 
GES DISC Eexperiences with HDF Formats for MEaSUREs Projects
GES DISC Eexperiences with HDF Formats for MEaSUREs ProjectsGES DISC Eexperiences with HDF Formats for MEaSUREs Projects
GES DISC Eexperiences with HDF Formats for MEaSUREs Projects
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
Digital Object Identifiers for EOSDIS data
Digital Object Identifiers for EOSDIS dataDigital Object Identifiers for EOSDIS data
Digital Object Identifiers for EOSDIS data
 
Images of HDF5
Images of HDF5Images of HDF5
Images of HDF5
 
Status of HDF-EOS, Related Software and Tools
Status of HDF-EOS, Related Software and ToolsStatus of HDF-EOS, Related Software and Tools
Status of HDF-EOS, Related Software and Tools
 

Destaque

Hdf5 current future
Hdf5 current futureHdf5 current future
Hdf5 current futuremfolk
 

Destaque (20)

Profile of HDF-EOS5 Files
Profile of HDF-EOS5 FilesProfile of HDF-EOS5 Files
Profile of HDF-EOS5 Files
 
Status of HDF-EOS, Related Software, and Tools
Status of HDF-EOS, Related Software, and ToolsStatus of HDF-EOS, Related Software, and Tools
Status of HDF-EOS, Related Software, and Tools
 
Support for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF GroupSupport for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF Group
 
ENVI/IDL for HDF
ENVI/IDL for HDFENVI/IDL for HDF
ENVI/IDL for HDF
 
Profile of NPOESS HDF5 Files
Profile of NPOESS HDF5 FilesProfile of NPOESS HDF5 Files
Profile of NPOESS HDF5 Files
 
Shifting the Burden from the User to the Data Provider
Shifting the Burden from the User to the Data ProviderShifting the Burden from the User to the Data Provider
Shifting the Burden from the User to the Data Provider
 
Proposal for adding Named Dimensions to HDF5 Arrays
Proposal for adding Named Dimensions to HDF5 ArraysProposal for adding Named Dimensions to HDF5 Arrays
Proposal for adding Named Dimensions to HDF5 Arrays
 
HDFView and HDF Java Products
HDFView and HDF Java ProductsHDFView and HDF Java Products
HDFView and HDF Java Products
 
The CFD General Notation System transition to HDF5
The CFD General Notation System transition to HDF5The CFD General Notation System transition to HDF5
The CFD General Notation System transition to HDF5
 
What will be new in HDF5?
What will be new in HDF5?What will be new in HDF5?
What will be new in HDF5?
 
HDF and HDF-EOS Experiences and Applications
HDF and HDF-EOS Experiences and ApplicationsHDF and HDF-EOS Experiences and Applications
HDF and HDF-EOS Experiences and Applications
 
EOSDIS Status
EOSDIS StatusEOSDIS Status
EOSDIS Status
 
HDF5 OPeNDAP project update and demo
HDF5 OPeNDAP project update and demoHDF5 OPeNDAP project update and demo
HDF5 OPeNDAP project update and demo
 
Workshop Discussion: HDF & HDF-EOS Future Direction
Workshop Discussion: HDF & HDF-EOS Future DirectionWorkshop Discussion: HDF & HDF-EOS Future Direction
Workshop Discussion: HDF & HDF-EOS Future Direction
 
The MATLAB Low-Level HDF5 Interface
The MATLAB Low-Level HDF5 InterfaceThe MATLAB Low-Level HDF5 Interface
The MATLAB Low-Level HDF5 Interface
 
Reading HDF family of formats via NetCDF-Java / CDM
Reading HDF family of formats via NetCDF-Java / CDMReading HDF family of formats via NetCDF-Java / CDM
Reading HDF family of formats via NetCDF-Java / CDM
 
ORNL DAAC MODIS Land Product Subsets
ORNL DAAC MODIS Land Product SubsetsORNL DAAC MODIS Land Product Subsets
ORNL DAAC MODIS Land Product Subsets
 
Using HDF5 Archive Information Package to preserve HDF-EOS2 data
Using HDF5 Archive Information Package to preserve HDF-EOS2 dataUsing HDF5 Archive Information Package to preserve HDF-EOS2 data
Using HDF5 Archive Information Package to preserve HDF-EOS2 data
 
Advanced HDF5 Features
Advanced HDF5 FeaturesAdvanced HDF5 Features
Advanced HDF5 Features
 
Hdf5 current future
Hdf5 current futureHdf5 current future
Hdf5 current future
 

Semelhante a HDF Update

Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...The HDF-EOS Tools and Information Center
 

Semelhante a HDF Update (20)

Hierarchical Data Formats (HDF) Update
Hierarchical Data Formats (HDF) UpdateHierarchical Data Formats (HDF) Update
Hierarchical Data Formats (HDF) Update
 
Support for NPP/NPOESS/JPSS by The HDF Group
 Support for NPP/NPOESS/JPSS by The HDF Group Support for NPP/NPOESS/JPSS by The HDF Group
Support for NPP/NPOESS/JPSS by The HDF Group
 
Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...Improving long-term preservation of EOS data by independently mapping HDF4 da...
Improving long-term preservation of EOS data by independently mapping HDF4 da...
 
HDF5 Backward and Forward Compatibility Issues
HDF5 Backward and Forward Compatibility IssuesHDF5 Backward and Forward Compatibility Issues
HDF5 Backward and Forward Compatibility Issues
 
Parallel HDF5 Developments
Parallel HDF5 DevelopmentsParallel HDF5 Developments
Parallel HDF5 Developments
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)
 
HDF Project Update
HDF Project UpdateHDF Project Update
HDF Project Update
 
HDF OPeNDAP update
HDF OPeNDAP updateHDF OPeNDAP update
HDF OPeNDAP update
 
Support for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF GroupSupport for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF Group
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
ESDIS Status (2002)
ESDIS Status (2002)ESDIS Status (2002)
ESDIS Status (2002)
 
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout MapsEnsuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps
 
HDF Updae
HDF UpdaeHDF Updae
HDF Updae
 
HDF OPeNDAP project update and demo
HDF OPeNDAP project update and demoHDF OPeNDAP project update and demo
HDF OPeNDAP project update and demo
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
Data Interoperability
Data InteroperabilityData Interoperability
Data Interoperability
 
Introduction to HDF5 Data and Programming Models
Introduction to HDF5 Data and Programming ModelsIntroduction to HDF5 Data and Programming Models
Introduction to HDF5 Data and Programming Models
 
Easy Remote Access Via OPeNDAP
Easy Remote Access Via OPeNDAPEasy Remote Access Via OPeNDAP
Easy Remote Access Via OPeNDAP
 
HDF5 iRODS
HDF5 iRODSHDF5 iRODS
HDF5 iRODS
 

Mais de The HDF-EOS Tools and Information Center

STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 

Mais de The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 
Leveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software TestingLeveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software Testing
 

Último

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 

Último (20)

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 

HDF Update

  • 1. HDF Update Mike Folk The HDF Group HDF and HDF-EOS Workshop XII Aurora, Colorado October 16, 2008 Oct. 16, 2008 HDF and HDF-EOS Workshop XII 1
  • 2. Topics Oct. 16, 2008 HDF and HDF-EOS Workshop XII 2
  • 3. Topics Oct. 16, 2008 HDF and HDF-EOS Workshop XII 3
  • 4. What’s up with The HDF Group? Oct. 16, 2008 HDF and HDF-EOS Workshop XII 4
  • 5. Announcement! NASA Commits $3.1M to The HDF Group to Support Earth System Science Oct. 16, 2008 HDF and HDF-EOS Workshop XII 5
  • 6. NASA Commits … • “The HDF Group has received a 3-year contract from NASA to provide ongoing development and support for the HDF technologies used by NASA’s Earth Observing System. • The project continues the relationship that was first established in 1994, when HDF was selected as the standard format for the EOS Data and Information System (EOSDIS). • Since that time, over 4 petabytes of mission data and derived data products have been stored in HDF4 and HDF5, with an estimated 1.6 million users. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 6
  • 7. • Under the new contract, The HDF Group will support NASA’s EOS program in five critical areas: − Provide user support to EOS data providers and data consumers − Perform software development and quality assurance − Assure long-term access to HDF data − Integrate with complementary technologies and applications − Advise follow-on earth systems projects Oct. 16, 2008 HDF and HDF-EOS Workshop XII 7
  • 8. What is The HDF Group And why does it exist? Oct. 16, 2008 HDF and HDF-EOS Workshop XII 8
  • 9. History of The HDF Group • 18 Years at University of Illinois National Center for Supercomputing Applications • Spun-off from University July 2006 • Non-profit • 20+ scientific, technology, professional staff • Intellectual property: − The HDF Group owns HDF4 and HDF5 − HDF formats and libraries to remain open − BSD-type license Oct. 16, 2008 HDF and HDF-EOS Workshop XII 9
  • 10. The HDF Group Mission To ensure long-term accessibility of HDF data through sustainable development and support of HDF technologies. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 10
  • 11. Goals • Maintain, evolve HDF for sponsors and communities that depend on it • Provide consulting, training, tuning, development, research • Sustain the group for long term to assure data access over time Oct. 16, 2008 HDF and HDF-EOS Workshop XII 11
  • 12. The HDF Group Services • Helpdesk and Mailing Lists − Available to all users as a first level of support • Standard Support − Rapid issue resolution support • Consulting − Needs assessment, troubleshooting, design reviews, etc. • Enterprise Support − Coordinating HDF activities across departments • Special Projects − Adapting customer applications to HDF − New features and tools, with changes normally incorporated into open source product − Research and Development • Training − Tutorials and hands-on practical experience Oct. 16, 2008 HDF and HDF-EOS Workshop XII 12
  • 13. Members of the HDF support community • • • • • • NASA Sandia National Laboratory (2) University of Illinois/NCSA A leading U.S. aerospace company NOAA Science Data Stewardship New projects and partners − A major product lifecycle management company − A bioinformatics software company − Engineering Research and Development Center – Topographic Engineering Center − NPOESS − ITT VIS Oct. 16, 2008 HDF and HDF-EOS Workshop XII 13
  • 14. Initiatives and areas of increased interest • • • • • • • Bioinformatics High performance computing (HPC) Microsoft products (HPC, .NET, others) Database integration Improving concurrency Performance and storage efficiency Improving high level language support Oct. 16, 2008 HDF and HDF-EOS Workshop XII 14
  • 15. Topics Oct. 16, 2008 HDF and HDF-EOS Workshop XII 15
  • 16. Basic Library Releases F5 HD HDF4 Oct. 16, 2008 HDF and HDF-EOS Workshop XII 16 4 HDF
  • 17. Overview of basic library releases Oct. 16, 2008 HDF and HDF-EOS Workshop XII 17
  • 18. HDF5 1.8.0 (Feb 08) • Major release with file format changes and features. • File format changes affect backward/forward compatibility with previous releases. • See "New Features in Release 1.8.0 and Format Compatibility Considerations” http://hdfgroup.org/HDF5/doc/ADGuide/CompatFormat180.html Oct. 16, 2008 HDF and HDF-EOS Workshop XII 18
  • 19. HDF5 1.8 minor releases • 1.8.1 (May 08) − A minor release with bug fixes − Provided 1.8 full support for Fortran applications − Enhanced tools with 1.8.0 features • HDF5 1.8.2 coming Nov 08 − Minor bug fixes − Tool enhancements Oct. 16, 2008 HDF and HDF-EOS Workshop XII 19
  • 20. HDF5 1.6 minor releases • 1.6.7 (Feb 08) − Modification to address Aura issue • 1.6.8 coming Nov 08 − Minor bug fixes Oct. 16, 2008 HDF and HDF-EOS Workshop XII 20
  • 21. Future HDF5 releases (highlights) • Release HDF5 1.10.0 − − − − Performance improvements Some new features Support for Fortran 2003 features Target date November 2009 • When to drop support for 1.6.* ? Oct. 16, 2008 HDF and HDF-EOS Workshop XII 21
  • 22. HDF 4 minor releases • 4.2r3 (Feb 08) − Improved support for apps using HDF4 and NetCDF3 − Improved support for data sets and coordinate variable with the same names • Release HDF4r2.4 coming Nov 08 − Minor bug fixing, tools enhancements − Support for C shared libraries − Support for 32-bit version on Mac Intel • http://hdfgroup.org/products/hdf4/ Oct. 16, 2008 HDF and HDF-EOS Workshop XII 22
  • 23. H4-H5 Conversion Software 2.0 (May) • Re-built with HDF5 1.8.1 and HDF 4.2r3. • Conversion tool h4toh5 enhanced − Converts HDF-EOS2 files to HDF5 files − Makes HDF5 files readable by NetCDF4 http://hdfgroup.org/h4toh5/ Oct. 16, 2008 HDF and HDF-EOS Workshop XII 23
  • 24. HDF-EOS library Oct. 16, 2008 HDF and HDF-EOS Workshop XII 24
  • 25. HDF-EOS2 and HDF-EOS5 • Auto configuration for HDF-EOS2 and HDF-EOS5 − Compile and test libraries with automatic configuration tools − Thank you, Abe! • Testing of EOS2 and EOS5 − Test daily with HDF4 and HDF5 development code − Periodically test on EOS-critical platforms • EOS website support Oct. 16, 2008 HDF and HDF-EOS Workshop XII 25
  • 26. Topics Oct. 16, 2008 HDF and HDF-EOS Workshop XII 26
  • 27. h5check 1.0 (March 2008) • A validation tool to verify whether an HDF5 file is encoded according to the HDF5 File Format Specification. • To ensure format integrity and long-term compatibility between versions of the HDF5 library. • By default, the file is verified against 1.8.x. Can also verify against 1.6.x. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 27
  • 28. Major Improvements for Existing Tools • Improved handling of large datasets by h5diff, h5repack, hdiff, and hrepack • Other added capabilities − − − − H5import: to import strings H5diff: to deal with NaN values H5dump: to dump objects in requested order H5repack: • To apply multiple filters to all objects • To add a userblock • To align datasets in file at byte offsets that support efficient access Oct. 16, 2008 HDF and HDF-EOS Workshop XII 28
  • 29. In the works: h52jpeg • Converts datasets in an HDF5 file to a jpeg image. • Prototype available, if you are interested. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 29
  • 30. Please send us your comments and requests regarding the HDF4 and HDF5 library and tools Oct. 16, 2008 HDF and HDF-EOS Workshop XII 30
  • 31. Topics Oct. 16, 2008 HDF and HDF-EOS Workshop XII 31
  • 32. HDF Java • HDF-Java 2.5 release − Beta 1 Release Feb 08 − Full release planned for Dec. 2008 • HDF5 JNI updated for HDF5 1.8.x with 1.6 flag • Binary for 32-bit Linux and 64-bit Solaris • Also added daily testing added for hdf-java products Oct. 16, 2008 HDF and HDF-EOS Workshop XII 32
  • 33. Also in the pipeline • Full Java Support for HDF5 1.8.x − Add and test new functions in Java wrapper − Implement and test new functions in C JNI − Use new functions in HDF-Java objects • Add many new features • Improve performance • Revise HDFView User’s Guide Oct. 16, 2008 HDF and HDF-EOS Workshop XII 33
  • 34. Topics Oct. 16, 2008 HDF and HDF-EOS Workshop XII 34
  • 35. Surviving a System Failure Oct. 16, 2008 HDF and HDF-EOS Workshop XII 35
  • 36. Surviving a System Failure in HDF5 • Problem: − In the event of an application or system crash, data in HDF5 files are susceptible to corruption − Corruption can occur if structural metadata is being written when the crash occurs • Initial Objective: − Guarantee an HDF5 file with consistent metadata can be reconstructed in the event of a crash − No guarantee on state of raw data – contains whatever data made it to disk prior to crash Oct. 16, 2008 HDF and HDF-EOS Workshop XII 36
  • 37. HDF5 Metadata Journaling Recovery Application crashes H5recover Tool Restored HDF5 File Corrupted HDF5 File Companion Journal File Oct. 16, 2008 HDF and HDF-EOS Workshop XII 38
  • 38. Faster HDF5 Data Appends Oct. 16, 2008 HDF and HDF-EOS Workshop XII 40
  • 39. Fast Data Appends • Problem: Metadata operations limit the rate at which HDF5 can append data to datasets. • Solution: new data structure for indexing chunks: − Allows constant time extend, shrink and lookup of chunks in datasets with single unlimited dimension − # of metadata I/O operations to append to dataset is independent of # of chunks − Also allows single-writer/multiple-reader access • Details at: http://hdfgroup.uiuc.edu/RFC/HDF5/ReviseChunks/ Oct. 16, 2008 HDF and HDF-EOS Workshop XII 41
  • 40. HDF Performance Framework A framework for performance regression testing Oct. 16, 2008 HDF and HDF-EOS Workshop XII 42
  • 41. HDF Performance Framework • A tool for − − − − Testing on multiple platforms Testing different versions Long term regression testing Assistance in debugging • New for 1.8: − API and format versioning − Improved reporting interfaces • Future related work − Quality monitoring of the software, such as code coverage, memory usage Oct. 16, 2008 HDF and HDF-EOS Workshop XII 43
  • 42. Other library work Oct. 16, 2008 HDF and HDF-EOS Workshop XII 44
  • 43. Library Features • Improved external link support − External link: link to HDF5 object in another file − Can more easily specify path lookup of external files − Adding external link support for h5ls and h5dump • Time datatype improvements − Expand time type to support native formats better − Adapt tools to display them properly • Port to OpenVMS (limited support) Oct. 16, 2008 HDF and HDF-EOS Workshop XII 45
  • 44. Improving performance • Faster file free-space management while file open • Many transactions can create many holes • Free space management recovers unused space • Up to 38x improvement in experiments • Direct I/O: file I/O goes directly between application and storage, bypassing operating system read and write caches • Disabling automatic metadata cache flushing − In experiments, direct I/O combined with metadata cache disabling improved I/O speed by about 2x. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 46
  • 45. Topics Oct. 16, 2008 HDF and HDF-EOS Workshop XII 47
  • 46. Remote access Oct. 16, 2008 HDF and HDF-EOS Workshop XII 48
  • 47. Three “remote access” projects • HDF5-OPeNDAP handler − See talk by Kent Yang: “HDF5 OPeNDAP project update and demo” • HDF5-iRODS integration − See Peter Cao’s talk Thursday: “HDF5 iRODS” • Accessing HDF5 through SSHFS-FUSE Oct. 16, 2008 HDF and HDF-EOS Workshop XII 49
  • 48. Accessing HDF5 through SSHFS-FUSE • Access to files on remote NFS system limited • Combining FUSE (Filesystem in Userspace) with SSHFS (Secure Shell File System) − FUSE provides application with local view of remote file system • Another way to mount remote file system − SSHFS allows the local file system to access parts of remote file. • e.g., “read” operation on the remote filesystem can be served through SSH • Subsetting can be efficiently done with SSHFS • Extract a dataset (5 MB) from a 96 MB HDF5 file − Download whole file + subset locally: 9.85 seconds − Subset with SSHFS: 0.47 seconds • Technical report in the works Oct. 16, 2008 HDF and HDF-EOS Workshop XII 50
  • 49. HDF4 Layout Map Project • Problem − Long-term readability of HDF data dependent on long-term availability of HDF software • Proposed solution − Create a map of the layout of data objects in an HDF file, allowing a simple reader to be written to access the data • See today’s talk by Folk and Duerr: “Ensuring Long Term Access to Remotely Sensed HDF4 Data with Layout Maps.” Oct. 16, 2008 HDF and HDF-EOS Workshop XII 51
  • 50. HDF and .NET Framework • Prototype .NET wrappers for HDF5 1.8.0 − Based on subset of HDF5 C routines • Released in March, 2008 • Unsupported − Considerable interest, but currently no funding to support or maintain − Use hdf-forum email list for questions Oct. 16, 2008 HDF and HDF-EOS Workshop XII 52
  • 51. netCDF-4 Released June 2008!! Oct. 16, 2008 HDF and HDF-EOS Workshop XII 53
  • 52. Investigation of HDF Support in Some Open Source Software Packages Oct. 16, 2008 HDF and HDF-EOS Workshop XII 54
  • 53. Five open source packages • PyHDF − Python interface to HDF4 − http://pysclint.sourceforge.net/pyhdf/ • Geospatial Data Abstraction Library (GDAL) − Translator library for Raster Geospatial Data Formats − Supports about 100 file formats − http://gdal.org/ • NCAR Common Language (NCL) − Interpreted Language for Data Analysis and Visualization − http://ncl.ucar.edu/ • Grid Analysis and Display System (GrADS) − Interpreted Language for Data Analysis and Visualization − http://iges.org/grads/ • GNU Data Language (GDL) − Interpreted Language for Data Analysis and Visualization − Data Analysis and Visualization − http://gnudatalanguage.sourceforge.net/ Oct. 16, 2008 HDF and HDF-EOS Workshop XII 55
  • 54. Evaluation criteria • Formats − HDF4, HDF5, netCDF − Objects supported in each language • Installation − Availability of binaries − Other requirements • Adequacy of documentation • Technical report available soon. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 56
  • 55. Windows Virtualization Motivation: high cost of maintaining many different Windows configurations Oct. 16, 2008 HDF and HDF-EOS Workshop XII 57
  • 56. Maintenance & Testing with VMWare • • • • • Multiple virtual machines run in parallel Only relevant software installed Each represents a supported configuration Run nightly tests of HDF4, HDF5 Each is powered on, tested, cleaned automatically • Technical report available soon. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 58
  • 57. HDF5 Data Transform Pilot Study • Tools for Flight Test Data • Framework to define and apply transformations to data being read • Transformations specified in Python Oct. 16, 2008 HDF and HDF-EOS Workshop XII 59
  • 58. Science Data Stewardship • Goal: migrate data to a single standards-based archive format. • Approach: investigate how to store NASA ECS data and metadata in HDF5 Archival Information Packages (AIP). • See talk by Yang, Duerr et al: “Using HDF5 Archive Information Package to preserve HDF-EOS2 data” Oct. 16, 2008 HDF and HDF-EOS Workshop XII 60
  • 59. Thank You All and Thank You NASA! Oct. 16, 2008 HDF and HDF-EOS Workshop XII 61
  • 60. Acknowledgements This report is based upon work supported in part by a Cooperative Agreement with the National Aeronautics and Space Administration (NASA) under NASA Awards NNX06AC83A and NNX08AO77A. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Aeronautics and Space Administration. Oct. 16, 2008 HDF and HDF-EOS Workshop XII 62
  • 61. Questions/comments? Oct. 16, 2008 HDF and HDF-EOS Workshop XII 63

Notas do Editor

  1. Mike Folk, president and CEO of The HDF Group, comments:  “Our close collaboration with NASA’s EOS program has been a model for the kind of partnership we strive to establish with all HDF customers—one where the users depend on and support a quality product and influence future developments.  We are pleased and proud to continue our work with NASA in their mission to serve the Earth Science community.”
  2. Why Increasing need for support, services, quick response Not a good model for a University R&amp;D project Who 11 software engineers and several students: develop, maintain HDF software, work on special projects, manage projects 3 tech support staff: helpdesk, doc, sysadmin. Management team President Director of Technical Services and Operations Director of Software Development Director of Business Operations Managers responsible for tools, applications Other THG staff include seven full-time software engineers who develop and maintain the HDF software, as well as working on special projects, and three technical support staff who provide helpdesk support, documentation, and system administration. The HDF group also generally employs students from the University Computer Science and Engineering departments.
  3. The R&amp;D mission Maintain and evolve HDF for high end science apps Maintain HDF4 and HDF5 and tools at supercomputing centers, TeraGrid Support academic science Cutting edge data management research Adapt to leading edge, experimental architectures Integrate with new middleware technologies, parallel file systems The “Support and Sustain” mission Maintain, evolve for communities, sponsors Provide proprietary consulting, tuning, development Sustain for long term, maintain data access over time
  4. Goal: help HDF users who rely on IDL get timely access to improved HDF libraries. The HDF Group and ITTVIS collaborate to improve the process of integrating the new versions of HDF with IDL. ITT VIS has provided the HDF Group with IDL software and licenses. Also let us enable IDL clients to access HDF5 files on remote servers via OPeNDAP. *ITT Visual Information Solutions (makers of IDL and ENVI).
  5. Please mention here that HDF5 maintenance releases are on a half year basis and HDF4 maintenance releases are on yearly basis, i.e., next maintenance release of HDF5 1.6 and 1.8 will be May 2009, and HDF4 in November 2009
  6. Possible performance improvements in 1.10. Free space management (non-persisting; persisting possible, not certain) Revised chunking Fast append ? From Quincey
  7. Testing not only helps find bugs inside HDF library but also finds bugs in EOS test programs
  8. Approach: Metadata Journaling When an HDF5 file is opened with Metadata Journaling enabled, a companion Journal file is created. When an HDF5 API function that modifies metadata is completed, a transaction is recorded in the Journal file.  If the application crashes, a recovery program can replay the journal by applying in order all metadata writes until the end of the last completed transaction written to the journal file.
  9. Serial HDF5 with synchronous write mode Finalize User interface definitions and file format Serial HDF5 with asynchronous write mode To mprove Journal file write speed More features (need funding) Make raw data operations atomic Allow &quot;super‐transactions&quot; to be created by applications Enable journaling for Parallel HDF5
  10. Is it only limited for unlimited / chunked datasets? Or is it that way for all but we’re just fixing it for limited / unchunked cases? Contrasts with B-tree index: - B-tree has O(log n) extend, shrink and lookup of chunks - B-tree has ~logarithmic # of metadata I/O operations as chunks appended Will be optimizing chunked dataset indexing for datasets with no unlimited dimensions (with array index) and multiple unlimited dimensions (with v2 B-tree) as part of project in the next year also.
  11. Say why external links are useful.
  12. Direct I/O is a feature of the file system whereby file reads and writes go directly from the applications to the storage device, bypassing the operating system read and write caches. Direct I/O is used by only a few applications that manage their own caches, such as databases.