SlideShare uma empresa Scribd logo
1 de 20
Delivering a Linked Data warehouse and integrating
across the wider enterprise
Ben Gardner – Linklaters LLP
Semantics
September 2016
Summary
• Information discovery requirements
• What we did
• Linked Data in Action
• Conclusion
Accessing the right information is challenging
Diverse Range of Specialisations
Information Seeking Behaviour
Information is Silo’ed
Information Hierarchy
What we did
Building a Linked Data Warehouse demo
Excel Reports
XML File
RDF
Management
Triple
Store
Model
UI
S  O
ETL Platform
OData
+
OData4Sparql
Sparql
+
Linked Data Warehouse Data Access Exploration
Linked Data and Model
• Traditional approaches try to identify how the data is to be “captured”
upfront.
• You can do this with the linked data model
• But we don’t…..Why?
• Always leads to “Paralysis by Analysis”
• You will miss so much.
• And take a huge amount of time doing it.
• You will find that there is a huge amount of
information and relationships you never would
of thought if starting from the model.
• Then there are tricks you can do to add huge
value
• The data model evolves very rapidly from the
data and can be further tweaked at anytime.
Let the data express itself
• Source by source, row by row let the data tell
you what it is describing.
• What it is, what relationships and metadata it
has.
• You’ll find a lot more information that you
simply couldn’t describe in a RDMS
• Another source can add to an existing item
without you even having to think
Degree
Person
Matter
Jurisdic
tion
Jurisdic
tion
College
Sector
Person
Person
Client
Manager
Partner
Client Area
Client
Person
Manager Area
Linked Data and Model : Individual Model
Fragments
Degree
Matter
Jurisdic
tion
College
Sector
Person
Client
Manager
Partner
Client
Area
Client
Manager
Area
Linked Data and Model: Fragments automatically
align
ETL & Linked Data Creation & Management
In4mium Talend modules
• Semantic modules ready to use through
configuration in Talend
• No API knowledge required by users
• Range of modules (over 60 ) for all
aspects of linked data creation and
management
• Create fully semantic apps
• Or pick and mix with traditional
aspects
• Works seamlessly with existing Talend
environment and modules
• Model driven behaviours are now
possible
• Easily add sematic technologies into
existing service architectures
• All the benefits without the hassle
OData4Sparql – Simplifying integration
+
• Brings together the strength of a ubiquitous RESTful
interface standard (OData) with the flexibility, federation
ability of RDF/SPARQL.
• SPARQL/OData Interop proposed W3C interoperation proxy
between OData and SPARQL (Kal Ahmed, 2013)
• Opens up many popular user-interface development
frameworks and tools such as Kendo UI, SAPUI5, etc.
• Acts as a Janus-point between application development and
data-sources.
• User interface developers are not, and do not want to be,
database developers. Therefore they want to use a
standardized interface that abstracts away the database,
even to the extent of what type of database: RDBMS,
NoSQL, or RDF/SPARQL
• By providing an OData4SPARQL server, it opens up any
SPARQL data-source to the C#/LINQ development world.
• Opens up many productivity tools such as
Excel/PowerQuery, and SharePoint to be consumers of
SPARQL data such as Dbpedia, Chembl, Chebi, BioPax
and any of the Linked Open Data endpoints!
• Microsoft has been joined by IBM and SAP using OData as
their primary interface method which means there will many
application developers familiar with OData as the means to
communicate with a backend data source.
Model Driven UI
Linklaters Data Model Northwind Data Model
Things
Sample Query Sample Query
Relationships
between
Things
Things
Relationships
between
Things
Demo of Linked Data in action
Strings to Things to Facts
Click on a ‘thing’
displays a ‘Lens’
about that ‘thing’
that shows different
fragments that
displays facts about
the thing
The ‘About’
fragment shows
most relevant
information.
Compare with the
Google
knowledge graph
The ‘Person
Involved’
fragment list all
persons involved
with the matter
The ‘Financial
Summary’
calculates a
financial
summary
… and we can find
associated deal
‘things’. If we want
more details about
any ‘thing’ we can
now navigate to its
‘lens’
Lens Discovery
Navigating through
‘Gerald Grant’, the
managing partner
for the Matter, takes
us to his Lens
Navigating through
the associated deal
takes us to that
deal’s Lens
Or show the Lens
on the client of the
matter
One is not limited to
facts within the
application. In the
case of a client we
can navigate to their
Companies House
page (or it could
have been D&B,
LinkDocs etc)
Composing Questions
Advanced Searches can
be selected from the list
which then displays a
query in a different format
that allows better control
over the search
Advanced Searches can
be selected from the list
which then displays a
query in a different format
that allows better control
over the search
The advanced search
allows conditions to be
added that link to other
‘things’ or limit the values
of ‘facts’ about the
associated ‘thing’. This
allows much more precise
searches to be executed
OData integration with Excel Power Query/Pivot
OData
OData4Sparql
Power Query Data Grabber/Shaper
• Build queries and utilise expand to traverse graph
• Limited data transformation can be incorporated into
the queries
• Create multiple views
Power Pivot Self Service BI
• Integrate across Power Queries and
other sources to build ROLAP models
• Explore model with Pivot tables
Power
View
Power
Map
Pivots, Charts
& Grids
Tableau,
etc.
Power Query
Power Pivot
Conclusion
Linked Data has delivered
• Elimination of silos through creation of logical
data warehouse that is extensible across internal
and external data sources
• Enabled “find and explore” information seeking
behaviours
• Separation of data modelling from integration
provides for easy addition of internal & external
data
• Ability to support diverse range of specialised
domain views onto data
• Introduces a Service Orientated Data
Architecture simplifying application
development
• Based on W3C web standards providing future
proofing and protection of firms IP (data
models)
Building a Linked Data Warehouse pilot
RDF
Management
Triple
Store
Model
UI
S  O
ETL Platform
OData
+
OData4Sparql
Sparql
+







Matter
Time
People
Financials
Deal
Finder
Client
Book
Client
Engage
K_Docs
SAP


One FTE (2x0.5) and nine months delivered
• Integrated 3 years and 9 months of data from 9 sources
• 24 million triples
• 62 Things (People, Projects, Clients, etc.)
• 127 Relationships between Things
• 223 Data attributes
Questions?

Mais conteúdo relacionado

Mais procurados

AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New PrecisionAI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
Dr. Haxel Consult
 
II-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data ExplorationII-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data Exploration
Dr. Haxel Consult
 

Mais procurados (20)

Conclusions - Linked Data
Conclusions - Linked DataConclusions - Linked Data
Conclusions - Linked Data
 
AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New PrecisionAI-SDV 2021: Francisco Webber - Efficiency is the New Precision
AI-SDV 2021: Francisco Webber - Efficiency is the New Precision
 
How would AI shape Future Integrations?
How would AI shape Future Integrations?How would AI shape Future Integrations?
How would AI shape Future Integrations?
 
3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning3. Relationships Matter: Using Connected Data for Better Machine Learning
3. Relationships Matter: Using Connected Data for Better Machine Learning
 
Creating a Data Distribution Knowledge Base using Neo4j, UBS
Creating a Data Distribution Knowledge Base using Neo4j, UBSCreating a Data Distribution Knowledge Base using Neo4j, UBS
Creating a Data Distribution Knowledge Base using Neo4j, UBS
 
Translating the Human Analog to Digital with Graphs
Translating the Human Analog to Digital with GraphsTranslating the Human Analog to Digital with Graphs
Translating the Human Analog to Digital with Graphs
 
lawTechCamp - Knowledge Management Panel
lawTechCamp - Knowledge Management PanellawTechCamp - Knowledge Management Panel
lawTechCamp - Knowledge Management Panel
 
Technologies and Innovation – The Internet of Value
Technologies and Innovation – The Internet of ValueTechnologies and Innovation – The Internet of Value
Technologies and Innovation – The Internet of Value
 
Big Data: Big Issues for IP
Big Data: Big Issues for IPBig Data: Big Issues for IP
Big Data: Big Issues for IP
 
PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked Data
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
 
Structured Document Search and Retrieval
Structured Document Search and RetrievalStructured Document Search and Retrieval
Structured Document Search and Retrieval
 
II-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data ExplorationII-SDV 2012 Towards Unified Access Systems for Data Exploration
II-SDV 2012 Towards Unified Access Systems for Data Exploration
 
Linked Open Data in the World of Patents
Linked Open Data in the World of Patents Linked Open Data in the World of Patents
Linked Open Data in the World of Patents
 
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
AI is Not Magic: It’s Time to Demystify and Apply Srinivasan Parthiban (VINGY...
 
Decentralized AI for the Rest of Us
Decentralized AI for the Rest of UsDecentralized AI for the Rest of Us
Decentralized AI for the Rest of Us
 
Graph Data Science DEMO for fraud analysis
Graph Data Science DEMO for fraud analysisGraph Data Science DEMO for fraud analysis
Graph Data Science DEMO for fraud analysis
 
LEI.INFO and The ideas for LEI system
LEI.INFO and The ideas for LEI systemLEI.INFO and The ideas for LEI system
LEI.INFO and The ideas for LEI system
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data Management
 
4. Document Discovery with Graph Data Science
 4. Document Discovery with Graph Data Science 4. Document Discovery with Graph Data Science
4. Document Discovery with Graph Data Science
 

Destaque

OData and SharePoint
OData and SharePointOData and SharePoint
OData and SharePoint
Sanjay Patel
 
Tegl 19 16 guidance on services through adult and dw programs under wioa 030117
Tegl 19 16 guidance on services through adult and dw programs under wioa 030117Tegl 19 16 guidance on services through adult and dw programs under wioa 030117
Tegl 19 16 guidance on services through adult and dw programs under wioa 030117
Christine Chandler
 

Destaque (20)

Semantic blockchain
Semantic blockchainSemantic blockchain
Semantic blockchain
 
What AI is and examples of how it is used in legal
What AI is and examples of how it is used in legalWhat AI is and examples of how it is used in legal
What AI is and examples of how it is used in legal
 
Strategies for integrating semantic and blockchain technologies
Strategies for integrating semantic and blockchain technologiesStrategies for integrating semantic and blockchain technologies
Strategies for integrating semantic and blockchain technologies
 
meet Jessica
meet Jessicameet Jessica
meet Jessica
 
Setting Your Data Free With OData
Setting Your Data Free With ODataSetting Your Data Free With OData
Setting Your Data Free With OData
 
OData for iOS developers
OData for iOS developersOData for iOS developers
OData for iOS developers
 
OData
ODataOData
OData
 
Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...
Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...
Consuming Data From Many Platforms: The Benefits of OData - St. Louis Day of ...
 
OData and SharePoint
OData and SharePointOData and SharePoint
OData and SharePoint
 
Moni jaiswal resume
Moni jaiswal resumeMoni jaiswal resume
Moni jaiswal resume
 
jQuery and OData - Perfect Together
jQuery and OData - Perfect TogetherjQuery and OData - Perfect Together
jQuery and OData - Perfect Together
 
Daniel Ridder How to RESTify your ABAP backend
Daniel Ridder How to RESTify your ABAP backendDaniel Ridder How to RESTify your ABAP backend
Daniel Ridder How to RESTify your ABAP backend
 
OData Fundamental
OData FundamentalOData Fundamental
OData Fundamental
 
A Look at OData
A Look at ODataA Look at OData
A Look at OData
 
!!!Faszination blech kapitel2[1]
!!!Faszination blech kapitel2[1]!!!Faszination blech kapitel2[1]
!!!Faszination blech kapitel2[1]
 
Tegl 19 16 guidance on services through adult and dw programs under wioa 030117
Tegl 19 16 guidance on services through adult and dw programs under wioa 030117Tegl 19 16 guidance on services through adult and dw programs under wioa 030117
Tegl 19 16 guidance on services through adult and dw programs under wioa 030117
 
WHY M-BANKING FAILS AND HOW TO FIX IT
WHY M-BANKING FAILS AND HOW TO FIX ITWHY M-BANKING FAILS AND HOW TO FIX IT
WHY M-BANKING FAILS AND HOW TO FIX IT
 
Summertime safety tips
Summertime safety tipsSummertime safety tips
Summertime safety tips
 
FICHA DE CITAÇÕES TRISTES TRÓPICOS - LÉVI-STRAUSS
FICHA DE CITAÇÕES TRISTES TRÓPICOS - LÉVI-STRAUSSFICHA DE CITAÇÕES TRISTES TRÓPICOS - LÉVI-STRAUSS
FICHA DE CITAÇÕES TRISTES TRÓPICOS - LÉVI-STRAUSS
 
Cubigo zorg op afstand
Cubigo   zorg op afstandCubigo   zorg op afstand
Cubigo zorg op afstand
 

Semelhante a Delivering a Linked Data warehouse and realising the power of graphs

Introduction to Advanced Analytics with SharePoint Composites
Introduction to Advanced Analytics with SharePoint CompositesIntroduction to Advanced Analytics with SharePoint Composites
Introduction to Advanced Analytics with SharePoint Composites
Mark Tabladillo
 
SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...
SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...
SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...
David J Rosenthal
 

Semelhante a Delivering a Linked Data warehouse and realising the power of graphs (20)

Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San Jose
 
Introduction to Advanced Analytics with SharePoint Composites
Introduction to Advanced Analytics with SharePoint CompositesIntroduction to Advanced Analytics with SharePoint Composites
Introduction to Advanced Analytics with SharePoint Composites
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
 
Introduction to Advanced Analytics with SharePoint Composites
Introduction to Advanced Analytics with SharePoint CompositesIntroduction to Advanced Analytics with SharePoint Composites
Introduction to Advanced Analytics with SharePoint Composites
 
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
Turn Data into Business Value – Starting with Data Analytics on Oracle Cloud ...
 
Connected development data
Connected development dataConnected development data
Connected development data
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
 
SPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDSSPS Vancouver 2018 - What is CDM and CDS
SPS Vancouver 2018 - What is CDM and CDS
 
An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018
 
SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...
SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...
SQL Server 2014 Faster Insights from Any Data -Level 300 Presentation from At...
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data Visualization
 
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is EssentialBig Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
 
Next Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data WarehouseNext Gen Analytics Going Beyond Data Warehouse
Next Gen Analytics Going Beyond Data Warehouse
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 

Mais de Ben Gardner

Mais de Ben Gardner (6)

FAIR Data-centric Information Architecture.pptx
FAIR Data-centric Information Architecture.pptxFAIR Data-centric Information Architecture.pptx
FAIR Data-centric Information Architecture.pptx
 
From Search to Semantics
From Search to SemanticsFrom Search to Semantics
From Search to Semantics
 
Practical semantics - An introduction
Practical semantics - An introductionPractical semantics - An introduction
Practical semantics - An introduction
 
From the Unknown to the Known
From the Unknown to the KnownFrom the Unknown to the Known
From the Unknown to the Known
 
Enterprise wiki's: Does one size fit all?
Enterprise wiki's: Does one size fit all?Enterprise wiki's: Does one size fit all?
Enterprise wiki's: Does one size fit all?
 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

Delivering a Linked Data warehouse and realising the power of graphs

  • 1. Delivering a Linked Data warehouse and integrating across the wider enterprise Ben Gardner – Linklaters LLP Semantics September 2016
  • 2. Summary • Information discovery requirements • What we did • Linked Data in Action • Conclusion
  • 3. Accessing the right information is challenging Diverse Range of Specialisations Information Seeking Behaviour Information is Silo’ed Information Hierarchy
  • 5. Building a Linked Data Warehouse demo Excel Reports XML File RDF Management Triple Store Model UI S  O ETL Platform OData + OData4Sparql Sparql + Linked Data Warehouse Data Access Exploration
  • 6. Linked Data and Model • Traditional approaches try to identify how the data is to be “captured” upfront. • You can do this with the linked data model • But we don’t…..Why? • Always leads to “Paralysis by Analysis” • You will miss so much. • And take a huge amount of time doing it. • You will find that there is a huge amount of information and relationships you never would of thought if starting from the model. • Then there are tricks you can do to add huge value • The data model evolves very rapidly from the data and can be further tweaked at anytime. Let the data express itself • Source by source, row by row let the data tell you what it is describing. • What it is, what relationships and metadata it has. • You’ll find a lot more information that you simply couldn’t describe in a RDMS • Another source can add to an existing item without you even having to think
  • 9. ETL & Linked Data Creation & Management In4mium Talend modules • Semantic modules ready to use through configuration in Talend • No API knowledge required by users • Range of modules (over 60 ) for all aspects of linked data creation and management • Create fully semantic apps • Or pick and mix with traditional aspects • Works seamlessly with existing Talend environment and modules • Model driven behaviours are now possible • Easily add sematic technologies into existing service architectures • All the benefits without the hassle
  • 10. OData4Sparql – Simplifying integration + • Brings together the strength of a ubiquitous RESTful interface standard (OData) with the flexibility, federation ability of RDF/SPARQL. • SPARQL/OData Interop proposed W3C interoperation proxy between OData and SPARQL (Kal Ahmed, 2013) • Opens up many popular user-interface development frameworks and tools such as Kendo UI, SAPUI5, etc. • Acts as a Janus-point between application development and data-sources. • User interface developers are not, and do not want to be, database developers. Therefore they want to use a standardized interface that abstracts away the database, even to the extent of what type of database: RDBMS, NoSQL, or RDF/SPARQL • By providing an OData4SPARQL server, it opens up any SPARQL data-source to the C#/LINQ development world. • Opens up many productivity tools such as Excel/PowerQuery, and SharePoint to be consumers of SPARQL data such as Dbpedia, Chembl, Chebi, BioPax and any of the Linked Open Data endpoints! • Microsoft has been joined by IBM and SAP using OData as their primary interface method which means there will many application developers familiar with OData as the means to communicate with a backend data source.
  • 11. Model Driven UI Linklaters Data Model Northwind Data Model Things Sample Query Sample Query Relationships between Things Things Relationships between Things
  • 12. Demo of Linked Data in action
  • 13. Strings to Things to Facts Click on a ‘thing’ displays a ‘Lens’ about that ‘thing’ that shows different fragments that displays facts about the thing The ‘About’ fragment shows most relevant information. Compare with the Google knowledge graph The ‘Person Involved’ fragment list all persons involved with the matter The ‘Financial Summary’ calculates a financial summary … and we can find associated deal ‘things’. If we want more details about any ‘thing’ we can now navigate to its ‘lens’
  • 14. Lens Discovery Navigating through ‘Gerald Grant’, the managing partner for the Matter, takes us to his Lens Navigating through the associated deal takes us to that deal’s Lens Or show the Lens on the client of the matter One is not limited to facts within the application. In the case of a client we can navigate to their Companies House page (or it could have been D&B, LinkDocs etc)
  • 15. Composing Questions Advanced Searches can be selected from the list which then displays a query in a different format that allows better control over the search Advanced Searches can be selected from the list which then displays a query in a different format that allows better control over the search The advanced search allows conditions to be added that link to other ‘things’ or limit the values of ‘facts’ about the associated ‘thing’. This allows much more precise searches to be executed
  • 16. OData integration with Excel Power Query/Pivot OData OData4Sparql Power Query Data Grabber/Shaper • Build queries and utilise expand to traverse graph • Limited data transformation can be incorporated into the queries • Create multiple views Power Pivot Self Service BI • Integrate across Power Queries and other sources to build ROLAP models • Explore model with Pivot tables Power View Power Map Pivots, Charts & Grids Tableau, etc. Power Query Power Pivot
  • 18. Linked Data has delivered • Elimination of silos through creation of logical data warehouse that is extensible across internal and external data sources • Enabled “find and explore” information seeking behaviours • Separation of data modelling from integration provides for easy addition of internal & external data • Ability to support diverse range of specialised domain views onto data • Introduces a Service Orientated Data Architecture simplifying application development • Based on W3C web standards providing future proofing and protection of firms IP (data models)
  • 19. Building a Linked Data Warehouse pilot RDF Management Triple Store Model UI S  O ETL Platform OData + OData4Sparql Sparql +        Matter Time People Financials Deal Finder Client Book Client Engage K_Docs SAP   One FTE (2x0.5) and nine months delivered • Integrated 3 years and 9 months of data from 9 sources • 24 million triples • 62 Things (People, Projects, Clients, etc.) • 127 Relationships between Things • 223 Data attributes

Notas do Editor

  1. In this picture we show just two In4mium modules being used alongside standard Talend modules. This workflow is showing filters, transformations and lookup joins before the data is converted to RDF. It is the Rdfiser that converts the standard data on the flow to RDF. The RDf can then be managed in triple stores or as in this case written to files. The RDFizer is itself model driven as it uses an RDF r2rml configuration file. The talend job can be deployed as a stand alone java executable or deployed as a web service within your architecture. Foundation Platform: Talend Gartner Magic Quadrant Open Studio and enterprise versions Composable visual java development environment Solution frameworks for Integration, BPM, MDM, ESB, Data Quality, Big data Configuration 1000’s of module to configure into applications ETL, Amazon Cloud, Hadoop, BI Modules are java injection routines Well supported community Highly scalable efficient code generation Deployable as within service architectures Adds to your existing architecture Not a rip and replace! BUT Lacks any knowledge of Semantic data handling and management