SlideShare uma empresa Scribd logo
1 de 27
ON	
  DEMAND	
  ACCESS	
  TO	
  BIG	
  DATA	
  
THROUGH	
  SEMANTIC	
  TECHNOLOGIES	
  
	
  
Peter	
  Haase	
  
fluid	
  Opera/ons	
  AG!
fluid	
  Opera/ons	
  (fluidOps)	
  
	
  
Linked	
  Data	
  &	
  Seman;c	
  Technologies	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Enterprise	
  Cloud	
  Compu;ng	
  
	
  
	
  
	
  
So7ware	
  company	
  founded	
  Q1/2008	
  by	
  team	
  of	
  serial	
  entrepreneurs,	
  privately	
  held,	
  
VC	
  funded	
  
Headquarters	
  in	
  Walldorf	
  /	
  Germany,	
  SAP	
  Partner	
  Port	
  
Currently	
  45	
  employees	
  
Named	
  “Cool	
  Vendor”	
  by	
  Gartner	
  Mar	
  2010	
  
Global	
  reseller	
  agreement	
  with	
  EMC	
  focus	
  large	
  	
  
enterprise	
  customers	
  Apr	
  2010	
  
NetApp	
  Advantage	
  Alliance	
  Partner	
  Oct	
  2010	
  
Outline	
  

•  Big	
  Data	
  Challenges:	
  Beyond	
  Volume	
  

•  Seman;c	
  Technologies	
  for	
  Big	
  Data	
  Challenges	
  

•  On	
  Demand	
  Data	
  Access	
  in	
  a	
  Self-­‐service	
  Process	
  

•  Outlook:	
  Op;que	
  -­‐	
  Scalable	
  End	
  User	
  Access	
  to	
  Big	
  Data	
  
Big	
  Data	
  

	
  
	
  
	
  
	
  
	
  
	
  
	
  
“Big	
  data	
  consists	
  of	
  data	
  sets	
  that	
  grow	
  so	
  large	
  that	
  they	
  become	
  awkward	
  
to	
  work	
  with	
  using	
  on-­‐hand	
  database	
  management	
  tools.”	
  	
  (Wikipedia)	
  
	
  
•  12	
  terabytes	
  of	
  Tweets	
  created	
  daily	
  
•  30	
  terabytes	
  of	
  telescope	
  data	
  each	
  night	
  
•  350	
  billion	
  meter	
  readings	
  	
  
•  ...	
  
Op/que	
  Case	
  Study:	
  Statoil	
  Explora/on	
  

Experts	
  in	
  geology	
  and	
  geophysics	
  develop	
  
stra/graphic	
  models	
  of	
  unexplored	
  areas.	
  
•  Based	
  on	
  produc/on	
  and	
  explora/on	
  
   data	
  from	
  nearby	
  loca/ons	
  
•  Analy/cs	
  on:	
  
     •    1,000	
  TB	
  of	
  rela/onal	
  data	
  
     •    using	
  diverse	
  schemata	
  
     •    spread	
  over	
  2,000	
  tables	
  
     •    spread	
  over	
  mul/ple	
  individual	
  data	
  bases	
  


•  900	
  experts	
  in	
  Statoil	
  Explora/on	
  
•  up	
  to	
  4	
  days	
  for	
  new	
  data	
  access	
  queries	
  
•  assistance	
  from	
  IT-­‐experts	
  required	
  
LOD	
  as	
  an	
  Example	
  of	
  Horizontal	
  Big	
  Data	
  
Semantic Technologies for
Horizontal Big Data
Linked	
  Data	
  
•  Set	
  of	
  standards,	
  principles	
  for	
  publishing,	
  sharing	
  and	
  interrela/ng	
  
   structured	
  data:	
  RDF	
  as	
  data	
  model,	
  SPARQL	
  for	
  querying	
  
•  Graph-­‐based	
  data	
  model	
  for	
  achieving	
  higher	
  degree	
  of	
  variety	
  
•  Seman/cally	
  interlink	
  data	
  scabered	
  among	
  different	
  informa/on	
  
   spaces:	
  from	
  data	
  silos	
  to	
  a	
  Web	
  of	
  Data	
  

Ontologies	
  
  •    For	
  describing	
  the	
  seman;cs	
  of	
  the	
  data	
  	
  
  •    As	
  conceptual	
  models	
  for	
  end-­‐user	
  oriented	
  access	
  
  •    For	
  the	
  integra;on	
  of	
  heterogeneous	
  sources	
  
  •    For	
  (light-­‐weight)	
  reasoning	
  
	
  
                                                                                                  8
On	
  Demand	
  Access	
  to	
  Big	
  Data	
  




   Enabling	
  on	
  demand	
  data	
  access	
  
   1.  discovery	
  of	
  relevant	
  data	
  sources	
  
   2.  automated	
  integra/on	
  and	
  interlinking	
  of	
  sources,	
  and	
  	
  
   3.  interac/ve	
  explora/on	
  and	
  ad	
  hoc	
  analysis	
  of	
  data	
  

   =>	
  Linked	
  Data	
  as	
  a	
  Service	
  
Everything	
  as	
  a	
  Service	
  

•  Abstract	
  from	
  physical	
  implementa;on	
  details	
  and	
  loca;on	
  
   of	
  resources	
  
•  Regardless	
  of	
  geographic	
  or	
  organiza/onal	
  separa;on	
  of	
  
   provider	
  and	
  consumer	
  
	
  
•      “In	
  the	
  cloud”	
       Data as a Service
•      Web	
  based	
  
•      Virtualized	
                Software as a Service
•      On-­‐demand	
  
•      Self-­‐service	
             Platform as a Service
•      Scalable	
  
•      Pay	
  as	
  you	
  go	
     Infrastructure as a Service
Linked	
  Data	
  as	
  a	
  Service	
  
   “Like	
   all	
   members	
   of	
   the	
   "as	
   a	
   Service”	
   family,	
   DaaS	
   is	
   based	
   on	
   the	
  
   concept	
  that	
  the	
  product,	
  data	
  in	
  this	
  case,	
  can	
  be	
  provided	
  on	
  demand	
  
   to	
   the	
   user	
   regardless	
   of	
   geographic	
   or	
   organiza/onal	
   separa/on	
   of	
  
   provider	
  and	
  consumer.”	
  
                                                                                                                              Source:	
  Wikipedia	
  
	
  
•  Data	
  virtualiza;on	
  supported	
  by	
  Linked	
  Data	
  principles	
  
     1.     Use	
  URIs	
  as	
  names	
  for	
  things	
  	
  
     2.     Use	
  HTTP	
  URIs	
  so	
  that	
  people	
  can	
  look	
  up	
  those	
  names.	
  	
  
     3.     When	
  someone	
  looks	
  up	
  a	
  URI,	
  provide	
  useful	
  informa/on,	
  using	
  the	
  standards:	
  RDF,	
  SPARQL	
  	
  
     4.     Include	
  links	
  to	
  other	
  URIs,	
  to	
  discover	
  more	
  things.	
  


•  Linked	
  Data	
  as	
  abstrac/on	
  layer	
  for	
  virtualized	
  data	
  access	
  across	
  data	
  spaces	
  
•  Enables	
  data	
  portability	
  across	
  current	
  data	
  silos	
  	
  
•  Plaform	
  independent	
  data	
  access	
  

•  Basis	
  for	
  enabling	
  automa;on	
  of	
  discovery,	
  composi;on,	
  and	
  use	
  of	
  datasets	
  

                                                                         11	
  
Informa/on	
  Workbench	
  -­‐	
  Linked	
  Data	
  Plaform	
  

                                                          	
  	
  	
  	
  	
  	
  Informa;on	
  Workbench:	
  
                                                                  §  Seman/cs-­‐	
  &	
  Linked	
  Data-­‐based	
  
                                                                      integra;on	
  of	
  private	
  and	
  public	
  
                                                                      data	
  sources	
  

                                                                  §  Intelligent	
  Data	
  Access	
  and	
  
                                                                      Analy;cs	
  
                                                                        §  Visual	
  Explora/on	
  
                                                                        §  Seman/c	
  Search	
  
                                                                        §  Dashboarding	
  and	
  Repor/ng	
  

                                                                  §  Collabora;on	
  and	
  knowledge	
  
                                                                      management	
  plaform	
  
                                                                        §  Wiki-­‐based	
  cura/on	
  &	
  
                                                                            authoring	
  of	
  data	
  
                             Seman/c	
  Web	
  Data	
                   §  Collabora/ve	
  workflows	
  

                                              12	
  
Enabling	
  Data	
  Discovery:	
  
Metadata	
  about	
  Data	
  Sets	
  
•    Metadata	
  about	
  data	
  sources	
  essen/al	
  for	
  dynamic	
  discovery	
  
•    Based	
  on	
  metadata	
  vocabularies	
  (VoID,	
  DCAT)	
  
•    Access	
  to	
  data	
  registered	
  at	
  global	
  registries,	
  e.g.	
  ckan.org,	
  data.gov,	
  …	
  
•    Sort/filter	
  data	
  sets	
  by	
  topic,	
  license,	
  size	
  and	
  many	
  more	
  facets	
  to	
  iden/fy	
  
     relevant	
  data	
  
•    Visually	
  explore	
  data	
  sets	
  
FedX	
  Federated	
  Query	
  Processing	
  

1)  Involve	
  only	
  relevant	
  sources	
  in	
  the	
  evalua/on	
  
          	
  Problem:	
  Subqueries	
  are	
  sent	
  to	
  all	
  sources,	
  although	
  poten/ally	
  irrelevant	
  	
  
2)  Compute	
  joins	
  close	
  to	
  the	
  data	
  
          	
  Problem:	
  All	
  joins	
  are	
  executed	
  locally	
  in	
  a	
  nested	
  loop	
  fashion	
  
3)  Reduce	
  remote	
  communica/on	
  
          	
  Problem:	
  Nested	
  loop	
  join	
  causes	
  many	
  remote	
  requests	
  




                                                                                                                               14	
  
Enabling	
  Data	
  Composi/on:	
  
FederaAon	
  of	
  Virtualized	
  Data	
  Sources	
  


  Applica'on	
  Layer	
  




 Virtualiza'on	
  Layer	
  




   Data	
  Layer	
  
                               SPARQL	
           SPARQL	
                     SPARQL	
                   SPARQL	
  
                              Endpoint	
         Endpoint	
                   Endpoint	
                 Endpoint	
  
                                                                                                                              Metadata	
  	
  
                                                                                                                              Registry	
  
                         Data	
  Source	
      Data	
  Source	
             Data	
  Source	
            Data	
  Source	
  




 See	
  also:	
  FedX:	
  Op'miza'on	
  Techniques	
  for	
  Federated	
  Query	
  Processing	
  on	
  Linked	
  Data	
  (ISWC2011)	
  
 	
  
Enabling	
  On	
  Demand	
  Use:	
  
Self-­‐service	
  Linked	
  Data	
  Frontend	
  
•  Seman/c	
  Wiki	
  as	
  user	
  frontend	
  
•  Declara;ve	
  specifica;on	
  of	
  the	
  UI	
  based	
  
   on	
  available	
  pool	
  of	
  widgets	
  and	
  
   declara/ve	
  wiki-­‐based	
  syntax	
  
•  Widgets	
  have	
  direct	
  access	
  to	
  the	
  DB	
  
•  Type-­‐based	
  template	
  mechanism	
  
•  Ad	
  hoc	
  data	
  explora/on,	
  visualiza/on,	
  
   analy/cs,	
  dashboards,	
  ...	
  




              Wiki	
  Page	
  in	
  Edit	
  Mode	
  …	
         …	
  and	
  Displayed	
  Result	
  Page	
  
Informa/on	
  Workbench	
  –	
  Linked	
  Data	
  as	
  a	
  Service	
  
ApplicaAon	
  Areas	
  

Knowledge	
  Management	
  in	
  the	
  
Life	
  Sciences	
  
	
  
	
  
	
  

Digital	
  Libraries,	
  Media	
  and	
  
Content	
  Management	
  
	
  
	
  
	
  

Intelligent	
  Data	
  Center	
  
Management	
  
	
  
Example:	
  Linked	
  Data	
  in	
  Pharma	
  

                                                                                                                            Main	
  Use	
  Cases	
  
                                                                                                                            	
  

                                                                                                                            •  Integrate	
  data	
  from	
  
                                                                                                                               company-­‐internal	
  
Search,	
  Interrogate	
  and	
              Visualize,	
  Analyze	
  and	
                 Capture	
  and	
  Augment	
  
           Reason	
                                  Explore	
                                   Knowledge	
                   data	
  silos	
  
                                                                                                                            •  Augment	
  company-­‐
                             Integrated	
  data	
  graph	
  over	
  all	
  data	
  sources	
  
                                                                                                                               internal	
  data	
  with	
  
                                                       Integ	
                                                                 Linked	
  Open	
  Data	
  
                                                                                                                            •  Collabora/ve	
  
                                                                                                                               knowledge	
  
                                                                                                                               management	
  
                                                                                                                            •  Support	
  of	
  internal	
  
                                                                                                                               processes	
  (drug	
  
                                                                                                                               development)	
  	
  



Private	
  Data	
  Sources	
                                                 Public	
  Data	
  Sources	
  
Example:	
  	
  
  Data	
  Center	
  Management	
  
•  Support	
  collabora;ve	
  
   opera;ons	
  management	
  
   in	
  the	
  data	
  center	
  
   •    Link	
  business	
  data	
  to	
  technical	
  data	
  
   •    Technical	
  Documenta/on	
  
   •    Analy/cs	
  and	
  Repor/ng	
  
   •    Performance	
  and	
  Capacity	
  Monitoring	
  
   •    Responsibility	
  Management	
  
   •    Resource	
  Management	
  
   •    Change	
  Management	
  
   •    Technical	
  Ticke/ng	
  System	
  




                                                                  19	
  
Example:	
  A	
  Cloud	
  Portal	
  for	
  Access	
  to	
  Open	
  Data	
  
with	
  the	
  Informa/on	
  Workbench	
  
Goal	
  
                                                                                       ...	
  using	
  the	
  	
  
•  Collect	
  meta	
  data	
  from	
  global	
  data	
  markets	
  (LOD	
              fluid	
  OperaAons	
  
     Cloud,	
  WorldBank,	
  CKAN,	
  …)	
  
•  Allow	
  integrated	
  search	
  and	
  ad	
  hoc	
  integra/on	
  of	
  data	
     Technology	
  Stack	
  
     sources	
  from	
  different	
  repositories	
                                     	
  
•  Link	
  data	
  with	
  private/internal	
  data	
  sources,	
  if	
  desired	
     	
  
•  Support	
  semi-­‐automated	
  linking	
  between	
  data	
  sets	
                 	
  
•  Provide	
  visualiza/on,	
  explora/on,	
  and	
  analy/cs	
                        	
  
     func/onality	
  on	
  top	
  of	
  integrated	
  data	
  sources	
                	
  
	
                                                                                     	
  
	
  
Realiza;on	
  
•  Currently	
  running	
  project	
  with	
  the	
  Hasso	
  Plabner	
  
     Ins/tute	
  (Potsdam,	
  Germany)	
  
•  Create	
  local	
  repository	
  containing	
  data	
  market	
  
     metadata	
  
•  Use	
  self-­‐service	
  technology	
  to	
  make	
  services	
  publicly	
  
     available	
  +	
  Informa/on	
  Workbench	
  for	
  analy/cs	
  
Informa/on	
  Workbench:	
  
Linked	
  Data	
  as	
  a	
  Service	
  in	
  a	
  Cloud	
  Plaform	
  Architecture	
  

                                                                                                                                Applica'on	
  Layer	
  (SaaS)	
  
 Provisioning,	
  Monitoring	
  	
  and	
  	
  Management
                                                        	
  




                                                                                                                                             Virtualiza'on	
  Layer	
  




                                                                                            Infrastructure	
  Layer	
  (IaaS)	
                                                                          Data	
  Layer	
  (DaaS)	
  
                                                               Netw.-­‐Ab.	
  Storage	
       Network	
                Compu/ng	
  Resources
                                                                                                                                           	
                                                     	
  
                                                                                                                                                                      Enterprise	
  Data	
  Sources                             Open	
  Data	
  Sources
                                                                                                                                                                                                                                                      	
  
Provisioning,	
  Monitoring	
  	
  and	
  	
  Management
                                                              	
  
                                                                      Applica'on	
  Layer	
  (SaaS)	
  




                                                                                                                                                        Virtualiza'on	
  Layer	
  




                                                                                                     Infrastructure	
  Layer	
  (IaaS)	
                                                                 Data	
  Layer	
  (DaaS)	
  
                                                                        Netw.-­‐Ab.	
  Storage	
       Network	
                  Compu/ng	
  Resources
                                                                                                                                                      	
                                                   	
  
                                                                                                                                                                               Enterprise	
  Data	
  Sources                  Open	
  Data	
  Sources
                                                                                                                                                                                                                                                    	
  




                                                                     Self-­‐service	
                                                                                     Data	
  Integra/on	
                                    Self-­‐service	
  UI	
  &	
  
                                                                                                                           Data	
  Discovery	
  
                                                                     Deployment	
                                                                                         &	
  Federa/on	
                                        Analy/cs	
  


•    Self-­‐service	
  deployment	
                                                                       •          On	
  demand	
  access	
  to	
   •         Virtualized	
  data	
          •                      Living	
  UI,	
  composed	
  
     of	
  the	
  InformaAon	
                                                                                       private	
  and	
  public	
                 access	
                                              from	
  seman/cs-­‐aware	
  
     Workbench	
  in	
  the	
  cloud	
                                                                               data	
  sources	
                •         Dynamic	
  integra/on	
  &	
                          widgets	
  
•    Pay-­‐per-­‐use	
                                                                                    •          Dynamic	
  Discovery	
                     federa/on	
  of	
  data	
      •                      Ad	
  hoc	
  data	
  
•    Scalability	
  on	
  demand	
                                                                                                                              sources	
                                             explora/on,	
  
                                                                                                                                                                                                                      visualiza/on,	
  analy/cs	
  
Op/que	
  Components	
  
Op/que	
  
Real-­‐/me	
  stream	
                  Query	
  Evalua/on	
  
   processing	
                         with	
  Elas/c	
  Clouds	
  




 Scalable	
  Query	
                       End-­‐user-­‐oriented	
  
   Rewri/ng	
                               Query	
  Interfaces	
  
Summary	
  

•  Big	
  Data	
  means	
  more	
  than	
  volume	
  and	
  ver/cal	
  scale	
  

•  Seman/c	
  Technologies	
  for	
  Big	
  Data	
  management	
  
    •    Linked	
  Data	
  as	
  adequate	
  data	
  model	
  
    •    Ontologies	
  as	
  conceptual	
  models	
  to	
  access	
  big	
  data	
  
    •    Integra/on	
  of	
  diverse,	
  heterogeneus	
  data	
  sources	
  

•  Linked	
  Data	
  as	
  a	
  Service	
  for	
  enabling	
  on	
  demand	
  data	
  access	
  
    1.  discovery	
  of	
  relevant	
  data	
  sources	
  
    2.  automated	
  integra/on	
  and	
  interlinking	
  of	
  sources,	
  and	
  	
  
    3.  interac/ve	
  explora/on	
  and	
  ad	
  hoc	
  analysis	
  of	
  data	
  



•  Outlook:	
  Op/que	
  –	
  Scalable	
  end-­‐user	
  access	
  to	
  Big	
  Data	
  
CONTACT:	
  
fluid	
  Opera;ons	
  
Altrostr.	
  31	
  
Walldorf,	
  Germany	
  
	
  
Email:	
  peter.haase@fluidops.com	
  

                         	
  
website:	
  www.fluidops.com	
  
Tel.:	
  +49	
  6227	
  3846-­‐527

Mais conteúdo relacionado

Mais procurados

Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
Chalitha Perera | Cross Media Concept and Entity Driven Search for EnterpriseChalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprisesemanticsconference
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Cambridge Semantics
 
David Kuilman | Creating a Semantic Enterprise Content model to support conti...
David Kuilman | Creating a Semantic Enterprise Content model to support conti...David Kuilman | Creating a Semantic Enterprise Content model to support conti...
David Kuilman | Creating a Semantic Enterprise Content model to support conti...semanticsconference
 
Six Ways to Simplify Metadata Management
Six Ways to Simplify Metadata ManagementSix Ways to Simplify Metadata Management
Six Ways to Simplify Metadata ManagementEnterprise Knowledge
 
Boost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentBoost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentOntotext
 
Gaining Advantage in e-Learning with Semantic Adaptive Technology
Gaining Advantage in e-Learning with Semantic Adaptive TechnologyGaining Advantage in e-Learning with Semantic Adaptive Technology
Gaining Advantage in e-Learning with Semantic Adaptive TechnologyOntotext
 
Linked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to ActionLinked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to ActionAndreas Blumauer
 
Webinar - Risky Business: How to Balance Innovation & Risk in Big Data
Webinar - Risky Business: How to Balance Innovation & Risk in Big DataWebinar - Risky Business: How to Balance Innovation & Risk in Big Data
Webinar - Risky Business: How to Balance Innovation & Risk in Big DataZaloni
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata managementOpen Data Support
 
Zen and the Art of Datanauting
Zen and the Art of DatanautingZen and the Art of Datanauting
Zen and the Art of DatanautingOntologySystems
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
 
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...DataWorks Summit
 
MongoDB Case Study in Healthcare
MongoDB Case Study in HealthcareMongoDB Case Study in Healthcare
MongoDB Case Study in HealthcareMongoDB
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataEUDAT
 
How to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsHow to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsOntotext
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Usedmurph4
 
Semantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesSemantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesCambridge Semantics
 
Five creative search solutions using text analytics
Five creative search solutions using text analyticsFive creative search solutions using text analytics
Five creative search solutions using text analyticsEnterprise Knowledge
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformSanjay Padhi, Ph.D
 

Mais procurados (20)

Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
Chalitha Perera | Cross Media Concept and Entity Driven Search for EnterpriseChalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
 
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
Transforming Data Management and Time to Insight with Anzo Smart Data Lake®
 
David Kuilman | Creating a Semantic Enterprise Content model to support conti...
David Kuilman | Creating a Semantic Enterprise Content model to support conti...David Kuilman | Creating a Semantic Enterprise Content model to support conti...
David Kuilman | Creating a Semantic Enterprise Content model to support conti...
 
Six Ways to Simplify Metadata Management
Six Ways to Simplify Metadata ManagementSix Ways to Simplify Metadata Management
Six Ways to Simplify Metadata Management
 
Boost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentBoost your data analytics with open data and public news content
Boost your data analytics with open data and public news content
 
Gaining Advantage in e-Learning with Semantic Adaptive Technology
Gaining Advantage in e-Learning with Semantic Adaptive TechnologyGaining Advantage in e-Learning with Semantic Adaptive Technology
Gaining Advantage in e-Learning with Semantic Adaptive Technology
 
Linked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to ActionLinked data the next 5 years - From Hype to Action
Linked data the next 5 years - From Hype to Action
 
Webinar - Risky Business: How to Balance Innovation & Risk in Big Data
Webinar - Risky Business: How to Balance Innovation & Risk in Big DataWebinar - Risky Business: How to Balance Innovation & Risk in Big Data
Webinar - Risky Business: How to Balance Innovation & Risk in Big Data
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata management
 
Zen and the Art of Datanauting
Zen and the Art of DatanautingZen and the Art of Datanauting
Zen and the Art of Datanauting
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
Big Data: Big Issues for IP
Big Data: Big Issues for IPBig Data: Big Issues for IP
Big Data: Big Issues for IP
 
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
Promote the Good of the People of the United Kingdom by Maintaining Monetary ...
 
MongoDB Case Study in Healthcare
MongoDB Case Study in HealthcareMongoDB Case Study in Healthcare
MongoDB Case Study in Healthcare
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
How to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk AnalyticsHow to Reveal Hidden Relationships in Data and Risk Analytics
How to Reveal Hidden Relationships in Data and Risk Analytics
 
Metadata Use Cases You Can Use
Metadata Use Cases You Can UseMetadata Use Cases You Can Use
Metadata Use Cases You Can Use
 
Semantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesSemantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational Databases
 
Five creative search solutions using text analytics
Five creative search solutions using text analyticsFive creative search solutions using text analytics
Five creative search solutions using text analytics
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 

Semelhante a On-Demand Access to Big Data Through Semantic Technologies

Everything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchEverything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchPeter Haase
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesBDLSS
 
Linked Data as a Service
Linked Data as a ServiceLinked Data as a Service
Linked Data as a ServicePeter Haase
 
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...Peter Haase
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amirydatastack
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Metadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesMetadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesKerstin Forsberg
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterprisePeter Haase
 
One Large Data Lake, Hold the Hype
One Large Data Lake, Hold the HypeOne Large Data Lake, Hold the Hype
One Large Data Lake, Hold the HypeJared Winick
 
One Large Data Lake, Hold the Hype
One Large Data Lake, Hold the HypeOne Large Data Lake, Hold the Hype
One Large Data Lake, Hold the HypeKoverse, Inc.
 
Hadoop & Complex Systems Research
Hadoop & Complex Systems ResearchHadoop & Complex Systems Research
Hadoop & Complex Systems ResearchDr. Mirko Kämpf
 
20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinalDeborah McGuinness
 
Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)Denodo
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Jian Qin
 
Teradata Loom Introductory Presentation
Teradata Loom Introductory PresentationTeradata Loom Introductory Presentation
Teradata Loom Introductory Presentationmlang222
 
data-mesh-101.pptx
data-mesh-101.pptxdata-mesh-101.pptx
data-mesh-101.pptxTarekHamdi8
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudPeter Haase
 
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeBig Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeDenodo
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGGRatko Mutavdzic
 

Semelhante a On-Demand Access to Big Data Through Semantic Technologies (20)

Everything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information WorkbenchEverything Self-Service:Linked Data Applications with the Information Workbench
Everything Self-Service:Linked Data Applications with the Information Workbench
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferies
 
Linked Data as a Service
Linked Data as a ServiceLinked Data as a Service
Linked Data as a Service
 
Linked Data and Semantic Web Application Development by Peter Haase
Linked Data and Semantic Web Application Development by Peter HaaseLinked Data and Semantic Web Application Development by Peter Haase
Linked Data and Semantic Web Application Development by Peter Haase
 
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...The Information Workbench as a Self-Service Platform for Linked Data Applicat...
The Information Workbench as a Self-Service Platform for Linked Data Applicat...
 
Data lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiryData lake-itweekend-sharif university-vahid amiry
Data lake-itweekend-sharif university-vahid amiry
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Metadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiencesMetadata in general and Dublin Core in specific; some experiences
Metadata in general and Dublin Core in specific; some experiences
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
 
One Large Data Lake, Hold the Hype
One Large Data Lake, Hold the HypeOne Large Data Lake, Hold the Hype
One Large Data Lake, Hold the Hype
 
One Large Data Lake, Hold the Hype
One Large Data Lake, Hold the HypeOne Large Data Lake, Hold the Hype
One Large Data Lake, Hold the Hype
 
Hadoop & Complex Systems Research
Hadoop & Complex Systems ResearchHadoop & Complex Systems Research
Hadoop & Complex Systems Research
 
20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal20120411 travelalliancemcguinnessfinal
20120411 travelalliancemcguinnessfinal
 
Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)Why Data Mesh Needs Data Virtualization (ASEAN)
Why Data Mesh Needs Data Virtualization (ASEAN)
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
 
Teradata Loom Introductory Presentation
Teradata Loom Introductory PresentationTeradata Loom Introductory Presentation
Teradata Loom Introductory Presentation
 
data-mesh-101.pptx
data-mesh-101.pptxdata-mesh-101.pptx
data-mesh-101.pptx
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeBig Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
 
(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG(PROJEKTURA) Big Data Open Data story for TGG
(PROJEKTURA) Big Data Open Data story for TGG
 

Mais de Peter Haase

Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryPeter Haase
 
Hybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsHybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsPeter Haase
 
Ephedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationEphedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationPeter Haase
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsPeter Haase
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge GraphsPeter Haase
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphPeter Haase
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsPeter Haase
 
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked DataMapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked DataPeter Haase
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingPeter Haase
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentPeter Haase
 
Semantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementSemantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementPeter Haase
 

Mais de Peter Haase (11)

Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
 
Hybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge GraphsHybrid Enterprise Knowledge Graphs
Hybrid Enterprise Knowledge Graphs
 
Ephedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationEphedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federation
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge Graphs
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge Graph
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data Portals
 
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked DataMapping, Interlinking and Exposing MusicBrainz as Linked Data
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
 
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data ProcessingFedbench - A Benchmark Suite for Federated Semantic Data Processing
Fedbench - A Benchmark Suite for Federated Semantic Data Processing
 
Cloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application DevelopmentCloud-based Linked Data Management for Self-service Application Development
Cloud-based Linked Data Management for Self-service Application Development
 
Semantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud ManagementSemantic Technologies for Enterprise Cloud Management
Semantic Technologies for Enterprise Cloud Management
 

Último

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 

Último (20)

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 

On-Demand Access to Big Data Through Semantic Technologies

  • 1. ON  DEMAND  ACCESS  TO  BIG  DATA   THROUGH  SEMANTIC  TECHNOLOGIES     Peter  Haase   fluid  Opera/ons  AG!
  • 2. fluid  Opera/ons  (fluidOps)     Linked  Data  &  Seman;c  Technologies                                  Enterprise  Cloud  Compu;ng         So7ware  company  founded  Q1/2008  by  team  of  serial  entrepreneurs,  privately  held,   VC  funded   Headquarters  in  Walldorf  /  Germany,  SAP  Partner  Port   Currently  45  employees   Named  “Cool  Vendor”  by  Gartner  Mar  2010   Global  reseller  agreement  with  EMC  focus  large     enterprise  customers  Apr  2010   NetApp  Advantage  Alliance  Partner  Oct  2010  
  • 3. Outline   •  Big  Data  Challenges:  Beyond  Volume   •  Seman;c  Technologies  for  Big  Data  Challenges   •  On  Demand  Data  Access  in  a  Self-­‐service  Process   •  Outlook:  Op;que  -­‐  Scalable  End  User  Access  to  Big  Data  
  • 4. Big  Data                 “Big  data  consists  of  data  sets  that  grow  so  large  that  they  become  awkward   to  work  with  using  on-­‐hand  database  management  tools.”    (Wikipedia)     •  12  terabytes  of  Tweets  created  daily   •  30  terabytes  of  telescope  data  each  night   •  350  billion  meter  readings     •  ...  
  • 5. Op/que  Case  Study:  Statoil  Explora/on   Experts  in  geology  and  geophysics  develop   stra/graphic  models  of  unexplored  areas.   •  Based  on  produc/on  and  explora/on   data  from  nearby  loca/ons   •  Analy/cs  on:   •  1,000  TB  of  rela/onal  data   •  using  diverse  schemata   •  spread  over  2,000  tables   •  spread  over  mul/ple  individual  data  bases   •  900  experts  in  Statoil  Explora/on   •  up  to  4  days  for  new  data  access  queries   •  assistance  from  IT-­‐experts  required  
  • 6.
  • 7. LOD  as  an  Example  of  Horizontal  Big  Data  
  • 8. Semantic Technologies for Horizontal Big Data Linked  Data   •  Set  of  standards,  principles  for  publishing,  sharing  and  interrela/ng   structured  data:  RDF  as  data  model,  SPARQL  for  querying   •  Graph-­‐based  data  model  for  achieving  higher  degree  of  variety   •  Seman/cally  interlink  data  scabered  among  different  informa/on   spaces:  from  data  silos  to  a  Web  of  Data   Ontologies   •  For  describing  the  seman;cs  of  the  data     •  As  conceptual  models  for  end-­‐user  oriented  access   •  For  the  integra;on  of  heterogeneous  sources   •  For  (light-­‐weight)  reasoning     8
  • 9. On  Demand  Access  to  Big  Data   Enabling  on  demand  data  access   1.  discovery  of  relevant  data  sources   2.  automated  integra/on  and  interlinking  of  sources,  and     3.  interac/ve  explora/on  and  ad  hoc  analysis  of  data   =>  Linked  Data  as  a  Service  
  • 10. Everything  as  a  Service   •  Abstract  from  physical  implementa;on  details  and  loca;on   of  resources   •  Regardless  of  geographic  or  organiza/onal  separa;on  of   provider  and  consumer     •  “In  the  cloud”   Data as a Service •  Web  based   •  Virtualized   Software as a Service •  On-­‐demand   •  Self-­‐service   Platform as a Service •  Scalable   •  Pay  as  you  go   Infrastructure as a Service
  • 11. Linked  Data  as  a  Service   “Like   all   members   of   the   "as   a   Service”   family,   DaaS   is   based   on   the   concept  that  the  product,  data  in  this  case,  can  be  provided  on  demand   to   the   user   regardless   of   geographic   or   organiza/onal   separa/on   of   provider  and  consumer.”   Source:  Wikipedia     •  Data  virtualiza;on  supported  by  Linked  Data  principles   1.  Use  URIs  as  names  for  things     2.  Use  HTTP  URIs  so  that  people  can  look  up  those  names.     3.  When  someone  looks  up  a  URI,  provide  useful  informa/on,  using  the  standards:  RDF,  SPARQL     4.  Include  links  to  other  URIs,  to  discover  more  things.   •  Linked  Data  as  abstrac/on  layer  for  virtualized  data  access  across  data  spaces   •  Enables  data  portability  across  current  data  silos     •  Plaform  independent  data  access   •  Basis  for  enabling  automa;on  of  discovery,  composi;on,  and  use  of  datasets   11  
  • 12. Informa/on  Workbench  -­‐  Linked  Data  Plaform              Informa;on  Workbench:   §  Seman/cs-­‐  &  Linked  Data-­‐based   integra;on  of  private  and  public   data  sources   §  Intelligent  Data  Access  and   Analy;cs   §  Visual  Explora/on   §  Seman/c  Search   §  Dashboarding  and  Repor/ng   §  Collabora;on  and  knowledge   management  plaform   §  Wiki-­‐based  cura/on  &   authoring  of  data   Seman/c  Web  Data   §  Collabora/ve  workflows   12  
  • 13. Enabling  Data  Discovery:   Metadata  about  Data  Sets   •  Metadata  about  data  sources  essen/al  for  dynamic  discovery   •  Based  on  metadata  vocabularies  (VoID,  DCAT)   •  Access  to  data  registered  at  global  registries,  e.g.  ckan.org,  data.gov,  …   •  Sort/filter  data  sets  by  topic,  license,  size  and  many  more  facets  to  iden/fy   relevant  data   •  Visually  explore  data  sets  
  • 14. FedX  Federated  Query  Processing   1)  Involve  only  relevant  sources  in  the  evalua/on    Problem:  Subqueries  are  sent  to  all  sources,  although  poten/ally  irrelevant     2)  Compute  joins  close  to  the  data    Problem:  All  joins  are  executed  locally  in  a  nested  loop  fashion   3)  Reduce  remote  communica/on    Problem:  Nested  loop  join  causes  many  remote  requests   14  
  • 15. Enabling  Data  Composi/on:   FederaAon  of  Virtualized  Data  Sources   Applica'on  Layer   Virtualiza'on  Layer   Data  Layer   SPARQL   SPARQL   SPARQL   SPARQL   Endpoint   Endpoint   Endpoint   Endpoint   Metadata     Registry   Data  Source   Data  Source   Data  Source   Data  Source   See  also:  FedX:  Op'miza'on  Techniques  for  Federated  Query  Processing  on  Linked  Data  (ISWC2011)    
  • 16. Enabling  On  Demand  Use:   Self-­‐service  Linked  Data  Frontend   •  Seman/c  Wiki  as  user  frontend   •  Declara;ve  specifica;on  of  the  UI  based   on  available  pool  of  widgets  and   declara/ve  wiki-­‐based  syntax   •  Widgets  have  direct  access  to  the  DB   •  Type-­‐based  template  mechanism   •  Ad  hoc  data  explora/on,  visualiza/on,   analy/cs,  dashboards,  ...   Wiki  Page  in  Edit  Mode  …   …  and  Displayed  Result  Page  
  • 17. Informa/on  Workbench  –  Linked  Data  as  a  Service   ApplicaAon  Areas   Knowledge  Management  in  the   Life  Sciences         Digital  Libraries,  Media  and   Content  Management         Intelligent  Data  Center   Management    
  • 18. Example:  Linked  Data  in  Pharma   Main  Use  Cases     •  Integrate  data  from   company-­‐internal   Search,  Interrogate  and   Visualize,  Analyze  and   Capture  and  Augment   Reason   Explore   Knowledge   data  silos   •  Augment  company-­‐ Integrated  data  graph  over  all  data  sources   internal  data  with   Integ   Linked  Open  Data   •  Collabora/ve   knowledge   management   •  Support  of  internal   processes  (drug   development)     Private  Data  Sources   Public  Data  Sources  
  • 19. Example:     Data  Center  Management   •  Support  collabora;ve   opera;ons  management   in  the  data  center   •  Link  business  data  to  technical  data   •  Technical  Documenta/on   •  Analy/cs  and  Repor/ng   •  Performance  and  Capacity  Monitoring   •  Responsibility  Management   •  Resource  Management   •  Change  Management   •  Technical  Ticke/ng  System   19  
  • 20. Example:  A  Cloud  Portal  for  Access  to  Open  Data   with  the  Informa/on  Workbench   Goal   ...  using  the     •  Collect  meta  data  from  global  data  markets  (LOD   fluid  OperaAons   Cloud,  WorldBank,  CKAN,  …)   •  Allow  integrated  search  and  ad  hoc  integra/on  of  data   Technology  Stack   sources  from  different  repositories     •  Link  data  with  private/internal  data  sources,  if  desired     •  Support  semi-­‐automated  linking  between  data  sets     •  Provide  visualiza/on,  explora/on,  and  analy/cs     func/onality  on  top  of  integrated  data  sources           Realiza;on   •  Currently  running  project  with  the  Hasso  Plabner   Ins/tute  (Potsdam,  Germany)   •  Create  local  repository  containing  data  market   metadata   •  Use  self-­‐service  technology  to  make  services  publicly   available  +  Informa/on  Workbench  for  analy/cs  
  • 21. Informa/on  Workbench:   Linked  Data  as  a  Service  in  a  Cloud  Plaform  Architecture   Applica'on  Layer  (SaaS)   Provisioning,  Monitoring    and    Management   Virtualiza'on  Layer   Infrastructure  Layer  (IaaS)   Data  Layer  (DaaS)   Netw.-­‐Ab.  Storage   Network   Compu/ng  Resources     Enterprise  Data  Sources Open  Data  Sources  
  • 22. Provisioning,  Monitoring    and    Management   Applica'on  Layer  (SaaS)   Virtualiza'on  Layer   Infrastructure  Layer  (IaaS)   Data  Layer  (DaaS)   Netw.-­‐Ab.  Storage   Network   Compu/ng  Resources     Enterprise  Data  Sources Open  Data  Sources   Self-­‐service   Data  Integra/on   Self-­‐service  UI  &   Data  Discovery   Deployment   &  Federa/on   Analy/cs   •  Self-­‐service  deployment   •  On  demand  access  to   •  Virtualized  data   •  Living  UI,  composed   of  the  InformaAon   private  and  public   access   from  seman/cs-­‐aware   Workbench  in  the  cloud   data  sources   •  Dynamic  integra/on  &   widgets   •  Pay-­‐per-­‐use   •  Dynamic  Discovery   federa/on  of  data   •  Ad  hoc  data   •  Scalability  on  demand   sources   explora/on,   visualiza/on,  analy/cs  
  • 23.
  • 25. Op/que   Real-­‐/me  stream   Query  Evalua/on   processing   with  Elas/c  Clouds   Scalable  Query   End-­‐user-­‐oriented   Rewri/ng   Query  Interfaces  
  • 26. Summary   •  Big  Data  means  more  than  volume  and  ver/cal  scale   •  Seman/c  Technologies  for  Big  Data  management   •  Linked  Data  as  adequate  data  model   •  Ontologies  as  conceptual  models  to  access  big  data   •  Integra/on  of  diverse,  heterogeneus  data  sources   •  Linked  Data  as  a  Service  for  enabling  on  demand  data  access   1.  discovery  of  relevant  data  sources   2.  automated  integra/on  and  interlinking  of  sources,  and     3.  interac/ve  explora/on  and  ad  hoc  analysis  of  data   •  Outlook:  Op/que  –  Scalable  end-­‐user  access  to  Big  Data  
  • 27. CONTACT:   fluid  Opera;ons   Altrostr.  31   Walldorf,  Germany     Email:  peter.haase@fluidops.com     website:  www.fluidops.com   Tel.:  +49  6227  3846-­‐527