Amit Sheth, Keynote: International Conference on Interoperating Geographic Systems (Interop’97), Santa Barbara, December 3-4 1997.
Related technical paper: http://knoesis.org/library/resource.php?id=00230
Semantic Interoperability in Infocosm: Beyond Infrastructural and Data Interoperability in Federated Information Systems
1. Semantic Interoperability in Infocosm: Beyond Infrastructural and Data Interoperability in Federated Information Systems Keynote Talk International Conference on Interoperating Geographic Systems (Interop’97), Santa Barbara, December 3-4 1997 Amit Sheth Large Scale Distributed Information Systems Lab University of Georgia http://lsdis.cs.uga.edu Thanks: Vipul Kashyap, Kshitij Shah
2.
3. Evolving targets and approaches in integrating data and information: a personal perspective Infocosm Generation 1 Generation 2 Generation 3 Mermaid DDTS Multibase, MRDSM, ADDS, IISS, Omnibase, ... Early 80s Infoscopes, HERMES, SIMS, ... TSIMMIS,Harvest, RUFUS,... VisualHarness InfoHarness 1990 Digital Library Projects, .. InfoQuilt 1997
4.
5.
6.
7.
8.
9. Generation I and Lessons from the Federated Database Systems Research
10. Dimensions for interoperability and integration: Perspective used for Federated Databases Distribution Autonomy Heterogeneity
11.
12.
13. Characterization of Schematic Conflicts in Multidatabase Systems Schematic Conflicts Domain Definition Incompatibility Naming Conflicts Data Representation Conflicts Data Scaling Conflicts Data Precision Conflicts Default Value Conflicts Attribute Integrity Constraint Conflicts Data Value Incompatibility Known Inconsistency Temporal Inconsistency Acceptable Inconsistency Abstraction Level Incompatibility Generalization Conflicts Aggregation Conflicts Schematic Discrepancies Data Value Attribute Conflict Entity Attribute Conflict Data Value Entity Conflict Entity Definition Incompatibility Naming Conflicts Database Identifier Conflicts Schema Isomorphism Conflicts Missing Data Items Conflicts Sheth & Kashyap, Kim & Seo
14.
15.
16.
17. Generation 1 concern: So far (schematically), yet so near (semantically)! Generation 3 concern: So near (schematically), yet so far (semantically)!
19. Information Brokering: A Three-Level Approach Ontology Content Representation used-by abstracted-into Semantic (Domain, Application specific) Metadata (content descriptions, intentional) Data (heterogeneous types, media) used-by abstracted-into Top Down Bottom Up Emphasis from Gen.I to Gen.III
20. An Architecture for Information Brokering Information System 1 Information System N INFORMATION BROKERING Data Brokering (CORBA, HTTP, IIOP) User Query/ Information Request User Query/ Information Request User Query/ Information Request ... DATA REPOSITORIES ... DATA REPOSITORIES Inter-Vocabulary Relationships Manager Vocabulary Broker Vocabulary Broker Vocabulary Brokering Metadata Broker Metadata Repository Metadata System Metadata Broker Metadata Repository Metadata System Metadata Brokering
23. Junglee Gen.2 Data Integration Data Publishing Publishing Rule Publisher Extraction Rules Extractor Mapping Rules Mapper Internet Wrappers (SDL Description) Text IDT Application RDBMS
24. Find Marketing Manager positions in a company that is within 15 miles of San Francisco and whose stock price has been growing at a rate of at least 25% per year over the last three years Junglee, SIGMOD Record, Dec. 1997
25.
26.
27.
28. VisualHarness . . Image Data Color Comp Texture Structure Other Attributes VIR Extraction Null Image Metadata for combined access User Query VH Results
36. MREF Metadata Reference Link -- complementing HREF Creating “logical web” through Media Independent Metadata based Correlation
37.
38. Correlation based on Content-descriptive Metadata Some interesting <A MREF KEYWORDS=“scenic waterfall mountain”; THRESH = 0.9 > information on scenic waterfalls </A> is available here. Content Descriptive Metadata Marina wonderland You are seeing the nature’s beauty of marina wonderland situated in the coastal region of the southern part of India. It consists of huge mountains and water flowing in between the mountains. WAIS LSI Glimpse SMART … . … . waterfall.gif (Data) Full Text Indexing
39. Correlation based on Content-based Metadata height, width and size Some interesting <A MREF KEYWORDS= “scenic waterfalls”; THRESH = 0.9; ATTRIBUTES (major-color = ‘blue’) > information on scenic waterfalls </A> is available here. waterflow.gif (Data) Metadata Storage waterflow.gif …… gif …… ppm Major component(RGB) Blue Content based Metadata Content Dependent Metadata
40. Metadata, Domain Specific Ontologies Get the titles , authors , documents , maps published by the United States Geological Service (USGS) about regions having a population greater than 5000, area greater than 1000 acres having a low density urban area land cover domain specific metadata: terms chosen from domain specific ontologies What is Metadata ? - data/information about data - useful/derived properties of media - properties/relationships between objects What are Ontologies ? - collection of terms, definitions and their interrelationships - specification of a representational vocabulary for a shared domain of discourse
41. TIGER/Line DB Population: Area: Boundaries : Land cover: Relief: Census DB Image/Map DB Regions (SQL) Boundaries Image Features (image processing routines) Repositories and the Media Types
42.
43.
44.
45. InfoQuilt Architecture (partial) Media Independent Information Requests [Browsing Collections, Keyword-based queries, Attribute-based queries] Correlation Server Media and Domain specific Extractor Agents ... IQR: Metadata & Domain Knowledge Repository and Registry loc, type, author Attr. Metadata Parameterized Routines InfoQuilt Server KnowledgeBase Other InfoQuilt Servers Domain Knowledge Indices Text, Image, Audio, Video media repositories Wrapper Wrapper Wrapper
50. Computing Communication Information Knowledge Data Decision Connectivity Interoperability Cooperation Interoperability in the ‘80s System level interoperability like TCP/IP. Standard communication channels, data exchange formats, etc. Basic infrastructural work for higher level interoperability . HTTP, IIOP, TCP/IP
51. Computing Communication Information Knowledge Data Decision Connectivity Interoperability Cooperation Interoperability in the ‘90s Information level interoperability. Standards evolve that go beyond connectivity and define information standards. Systems start exchanging metadata (MCF,RDF,..). Business Objects, CORBA, DCOM, EDI
52. Computing Communication Information Knowledge Data Connectivity Interoperability Cooperation Where we are headed Semantic interoperability where systems share ontologies and knowledge. Systems and human can cooperate in decision making and can generate new knowledge as a collective entity.