SlideShare uma empresa Scribd logo
1 de 35
Ben O’Steen, Getty Digital - Feb 20, 2019
What is “Arches”?
Background
• Developed jointly by the Getty Conservation Institute (GCI) and World
Monuments Fund (WMF), with code development carried out by
Farallon (SF based company)
• Arches is an Open Source web-based platform (using Django web
framework, Postgres database and Elasticsearch search service).
• Arches can create digital inventories that describe types, locations,
extent, cultural periods, materials, and conditions of heritage
resources and establish the numerous and complex relationships
between those resources.
• Uses the “CIDOC Conceptual Reference Model” at its core, with the
option to use other ontologies such as the Linked Art ontology.
Linked Open Usable Data – Linked Art
A Linked Open Usable Data model, collaboratively designed to work
across cultural heritage organizations, that is easy to publish and
enables a variety of consuming applications.
Design Principles:
• Focused on Usability, not 100% precision / completeness
• Consistently solves actual challenges from real data
• Development is iterative, as new use cases are found
• Solve 90% of use cases, with 10% of the effort
(Thanks go to Rob Sanderson & David Newbury for slide text)
What is Arches good at?
• Allows users to both create complex models to define their data using
a web user interface (UI).
• Creates web forms based on these complex models, which also can
be customized using the web UI.
• Allows users an easy way to enter, find and edit data, again using the
web UI.
• It does not require detailed knowledge of RDF, Linked Data, SPARQL,
Triplestores, and RDF representational formats, paradigms which
other Linked Data tools often implicitly require users to ‘just know’.
What is Arches not good at?
• Currently, Arches does not provide any graph-based query services,
such as GraphQL or SPARQL.
• The Reference Data Manager (that holds and manages Concepts and
Authoratative terms) requires more development better to suit our
needs.
• Arches has not been tested at huge scales.
• The default form ‘widgets’ can be verbose.
• No bulk editing as yet.
A Note on Concepts and Controlled Terms
• These are not handled as Resources in Arches (ie there is no Concept
model)
• They are managed by the Reference Data Manager (RDM) side of
Arches and imported and exported as SKOS representations, and
require a textual label as well as an identifier to be used within
Arches.
• Currently, Resources cannot refer to concepts* that are not already
present in the Arches RDM.
* Or other Resources that are not in Arches
Application Structure
• Arches consists of a web-application layer,
bound to a client-side javascript layer.
• The javascript layer relies on a REST-based
API
• Stores data in a Postgres database, utilizing
Postgres’s native JSON and GIS handling.
• Search is performed using an Elasticsearch
service, that indexes key information as
determined by the application and the
configuration.
Working with other systems
• As of Arches version 4.4 (releasing this week, or next), it will have
acceptable JSON-LD support for import and export.
• JSON-LD can be thought of a means to represent and transfer the complex
information in a way that developers can make use of.
• Arches will be used for what it is good at, with other suitable systems
or applications being used where sensible. A valid, semantically
accurate export and import is a key component for this.
Models in Arches
An Arches Model can be thought of like a template or pattern for a
given type of Resource. It is akin to a database or XML schema in
concept.
The data held in Arches is best thought of as a graph of data rather
than as tabular data (like an Excel or CSV file). A graph is a collection of
points (nodes) connected by relationships (edges).
Prepare yourself for RDF
Models are Hierarchical Trees
The compromise that Arches makes is that Models
are defined as hierarchical trees of information.
They are still a graph of information, but a
resource will ‘own’ all of the new information in its
tree, and reference other resources rather than
duplicate the information they own.
Example: An artist resource will hold information
about the artist’s name, but will only point to the
resources in the system that represent their
paintings.
Web forms for data entry/editing in Arches
They are generated from
• ‘Models’ (that structure data for a given Resource you want to describe),
• The Model’s ‘Cards’* (that hold that data within Arches)
* (I’ll come back to what cards are. They are an Arches application concept, not a Linked Data one)
Web forms for data entry/editing in Arches
They are generated from
• ‘Models’ (that structure data for a given Resource you want to describe),
• The Model’s ‘Cards’* (that hold that data within Arches)
• ‘Branches’ are reusable structures that you use when building Models.
• For example, you might make a ‘Place’ model, and add a Branch that defines how to
describe a contemporary geographical location.
• Branches are reusable templates. They help avoid multiple differing structures
creeping in for basic things like the geographical location, or a record of the creation
of an artwork.
The top node of a model
Some Nodes
Datatypes
A model can have this sort of structure,
which can be bespoke, and also pull in
pre-made branches (like templates).
Datatypes are how you define where in
the model data is stored (or ’collected’),
and what sort of data this must be.
More datatypes can be developed,
added, and have their own behavior.
Some examples: some plain text, a date,
a true or false, a number, a link to
another Resource in Arches, or a link to
a Concept.
Model (‘Graph’ view) Model (Card view)
How does Arches present this?
• As a Form view (for data entry)
• By default, relies on the structure Model to structure the form (1:1 mapping)
• A Datatype in the model is linked to some code/html called a Widget
• A Datatype (python class) maintains the data within the system, a Widget is the
HTML/JS component that lets people edit or enter data for it.
• As a Report (to view/interpret the data)
• ‘Preview’ report (for admins)
• ‘Report’ (for everyone else)
This is coordinated by some code called a ‘Card Component’.
Sorry for the jargon!
Aspects to Customize?
• A model can specify:
• What datatypes (text, date, Concept, Resource, etc) are used to hold data
• Whether a branch (eg Place, Name, etc) is searchable, is required,
permissions and how it is represented as a Card (or potentially, Cards)
• A Card can specify:
• What Card Component to use (currently there is only a default one)
• Big functional changes possible but equally, big development required
• What Widget to use to gather data for a datatype
• (including configuration options for the widget)
• Whether a Card can hold multiple values (eg multiple Places, Names, etc)
• What Report template to use for a Model.
What we can develop currently (for v 4.3.x):
Widgets and Datatypes are the easiest targets for development
• Datatypes:
• Pro: We can write our own datatypes for holding more complex data
• Pro: We can change how the data is stored and indexed
• Pro: We can code how this data should be represented in or imported from
JSON-LD
• Con: We lose the ability to structure data in the Model in the user interface.
For example, a new datatype for a Name (holding first name, middle
names, etc) would be specified in code, not in the user interface. To
change it, would require changing the code.
What we can develop currently (for v 4.3.x):
Widgets and Datatypes are the easiest targets for development
• Widgets
• Pro: A datatype can have multiple widgets written for it, allowing for easy and
safe customization.
• Pro: What widget is applied to a datatype is set in the card view for a model.
Doesn’t require development time once the widget is written and added!
• Con: Doesn’t affect how the data is held or represented in the search. It is the
interface only.
What we can develop currently (for v 4.3.x):
• Card Component:
• This component orchestrates a lot of things in the form interface.
• Pro: Big possibilities
• Con: Big commitment to rewrite, and does not affect search indexing
• Report:
• Defines how people can view a Resource.
• Can swap report views without unduly affecting anything else.
Forecast of developments
‘Mobile Survey Manager’ (for Arches version 4.4 Q1)
• Allow data entry and upload from mobile devices out in the field and
away from a direct connection to an Arches instance.
• Suited for cataloguing finds and features that have important
geographical data requiring GPS and an internet connection is either
unavailable, or undesired.
• This work will be carried out by Farallon.
Forecast of developments
‘Workflows’ (for Arches version 4.5, Q4 2019)
• Much more complex and controllable ‘wizards’ for form entry or data
editing.
• Will allow for a more purposed and efficient interface to be
developed for our needs. It is currently difficult to edit or create more
than one resource at a time, and this is labourious for some tasks
• This work will be carried out by Farallon.
Forecast of developments
Scaling and Performance testing and metrics (Q2-Q3 2019)
• This will be a key factor in informing how we use and deploy Arches
applications and is a priority.
• Adding more integration tests and functional tests, connecting to a
data metrics dashboard and measuring changes in scaling,
performance and load when code or deployment choices are
changed.
Forecast of developments
‘Notification Hooks’ (by Q3 2019)
• The ability for Arches to notify other users and services that
information it maintains has changed in some way.
• This is essential to keep other non-Arches systems in sync, and to
allow other business processes to be activated when necessary.
• Getty Digital and Farallon have already reached a consensus that this
is necessary and will collaborate with the rest of the user community.
Summary:
• Arches has good potential for allowing complex data to be edited and
curated through simple web form interfaces.
• JSON-LD import and export allow for other systems to make better
use of data held within an Arches instance.
• Models in Arches are hierarchical trees of related fields, and Branches
are reusable patterns of fields and relationships.
• Data is collected in Datatypes (Arches components that defines and
manages a type of data) through Widgets (HTML/JS code that provide
the web UI for a given datatype).
Questions?

Mais conteúdo relacionado

Mais procurados

moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016
Richard (Rick) Nelson
 

Mais procurados (20)

Metadata Mapping & Crosswalks
Metadata Mapping & CrosswalksMetadata Mapping & Crosswalks
Metadata Mapping & Crosswalks
 
RDF and Java
RDF and JavaRDF and Java
RDF and Java
 
Solution architecture for big data projects
Solution architecture for big data projectsSolution architecture for big data projects
Solution architecture for big data projects
 
Graph Databases
Graph DatabasesGraph Databases
Graph Databases
 
Metadata mapping
Metadata mappingMetadata mapping
Metadata mapping
 
Expressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDLExpressing Concept Schemes & Competency Frameworks in CTDL
Expressing Concept Schemes & Competency Frameworks in CTDL
 
GraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph DatabasesGraphTech Ecosystem - part 1: Graph Databases
GraphTech Ecosystem - part 1: Graph Databases
 
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactoryVisual Ontology Modeling for Domain Experts and Business Users with metaphactory
Visual Ontology Modeling for Domain Experts and Business Users with metaphactory
 
No sql – rise of the clusters
No sql – rise of the clustersNo sql – rise of the clusters
No sql – rise of the clusters
 
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
Experimental transformation of  ABS data into Data Cube Vocabulary (DCV) form...Experimental transformation of  ABS data into Data Cube Vocabulary (DCV) form...
Experimental transformation of ABS data into Data Cube Vocabulary (DCV) form...
 
NoSql
NoSqlNoSql
NoSql
 
HDL - Towards A Harmonized Dataset Model for Open Data Portals
HDL - Towards A Harmonized Dataset Model for Open Data PortalsHDL - Towards A Harmonized Dataset Model for Open Data Portals
HDL - Towards A Harmonized Dataset Model for Open Data Portals
 
Metadata crosswalks
Metadata crosswalksMetadata crosswalks
Metadata crosswalks
 
Semantic Web Nature
Semantic Web NatureSemantic Web Nature
Semantic Web Nature
 
moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016moving_from_relational_to_nosql_couchbase_2016
moving_from_relational_to_nosql_couchbase_2016
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 
Data(base) taxonomy
Data(base) taxonomyData(base) taxonomy
Data(base) taxonomy
 
Data-driven Applications with conStruct
Data-driven Applications with conStructData-driven Applications with conStruct
Data-driven Applications with conStruct
 
Kerstin Diwisch | Towards a holistic visualization management for knowledge g...
Kerstin Diwisch | Towards a holistic visualization management for knowledge g...Kerstin Diwisch | Towards a holistic visualization management for knowledge g...
Kerstin Diwisch | Towards a holistic visualization management for knowledge g...
 
The Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge GraphThe Business Case for Semantic Web Ontology & Knowledge Graph
The Business Case for Semantic Web Ontology & Knowledge Graph
 

Semelhante a Arches Getty Brownbag Talk

Document Based Data Modeling Technique
Document Based Data Modeling TechniqueDocument Based Data Modeling Technique
Document Based Data Modeling Technique
Carmen Sanborn
 
An architecture for federated data discovery and lineage over on-prem datasou...
An architecture for federated data discovery and lineage over on-prem datasou...An architecture for federated data discovery and lineage over on-prem datasou...
An architecture for federated data discovery and lineage over on-prem datasou...
DataWorks Summit
 
Creating Effective Data Visualizations in Excel 2016: Some Basics
Creating Effective Data Visualizations in Excel 2016:  Some BasicsCreating Effective Data Visualizations in Excel 2016:  Some Basics
Creating Effective Data Visualizations in Excel 2016: Some Basics
Shalin Hai-Jew
 
Next-Generation Completeness and Consistency Management in the Digital Threa...
Next-Generation Completeness and Consistency Management in the Digital Threa...Next-Generation Completeness and Consistency Management in the Digital Threa...
Next-Generation Completeness and Consistency Management in the Digital Threa...
Ákos Horváth
 

Semelhante a Arches Getty Brownbag Talk (20)

Document Based Data Modeling Technique
Document Based Data Modeling TechniqueDocument Based Data Modeling Technique
Document Based Data Modeling Technique
 
Etosha - Data Asset Manager : Status and road map
Etosha - Data Asset Manager : Status and road mapEtosha - Data Asset Manager : Status and road map
Etosha - Data Asset Manager : Status and road map
 
An architecture for federated data discovery and lineage over on-prem datasou...
An architecture for federated data discovery and lineage over on-prem datasou...An architecture for federated data discovery and lineage over on-prem datasou...
An architecture for federated data discovery and lineage over on-prem datasou...
 
Delivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsDelivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphs
 
Creating Effective Data Visualizations in Excel 2016: Some Basics
Creating Effective Data Visualizations in Excel 2016:  Some BasicsCreating Effective Data Visualizations in Excel 2016:  Some Basics
Creating Effective Data Visualizations in Excel 2016: Some Basics
 
Next-Generation Completeness and Consistency Management in the Digital Threa...
Next-Generation Completeness and Consistency Management in the Digital Threa...Next-Generation Completeness and Consistency Management in the Digital Threa...
Next-Generation Completeness and Consistency Management in the Digital Threa...
 
ER/Studio Data Architect Datasheet
ER/Studio Data Architect DatasheetER/Studio Data Architect Datasheet
ER/Studio Data Architect Datasheet
 
CouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big DataCouchBase The Complete NoSql Solution for Big Data
CouchBase The Complete NoSql Solution for Big Data
 
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
How To Model and Construct Graphs with Oracle Database (AskTOM Office Hours p...
 
Architecture as Linked Data
Architecture as Linked DataArchitecture as Linked Data
Architecture as Linked Data
 
Data Wrangling and Visualization Using Python
Data Wrangling and Visualization Using PythonData Wrangling and Visualization Using Python
Data Wrangling and Visualization Using Python
 
A machine learning and data science pipeline for real companies
A machine learning and data science pipeline for real companiesA machine learning and data science pipeline for real companies
A machine learning and data science pipeline for real companies
 
Semantic framework for web scraping.
Semantic framework for web scraping.Semantic framework for web scraping.
Semantic framework for web scraping.
 
ACM TechTalks : Apache Arrow and the Future of Data Frames
ACM TechTalks : Apache Arrow and the Future of Data FramesACM TechTalks : Apache Arrow and the Future of Data Frames
ACM TechTalks : Apache Arrow and the Future of Data Frames
 
Hackolade Tutorial - part 1 - What is a data model
Hackolade Tutorial - part 1 - What is a data modelHackolade Tutorial - part 1 - What is a data model
Hackolade Tutorial - part 1 - What is a data model
 
IncQuery_presentation_Incose_EMEA_WSEC.pptx
IncQuery_presentation_Incose_EMEA_WSEC.pptxIncQuery_presentation_Incose_EMEA_WSEC.pptx
IncQuery_presentation_Incose_EMEA_WSEC.pptx
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Nosql
NosqlNosql
Nosql
 
So You Want to Build a Data Lake?
So You Want to Build a Data Lake?So You Want to Build a Data Lake?
So You Want to Build a Data Lake?
 

Mais de benosteen

Mechanical curator - Technical notes
Mechanical curator - Technical notesMechanical curator - Technical notes
Mechanical curator - Technical notes
benosteen
 
Apache pig as a researcher’s stepping stone
Apache pig as a researcher’s stepping stoneApache pig as a researcher’s stepping stone
Apache pig as a researcher’s stepping stone
benosteen
 
New methods of access and discoverability bring new affordances for digital r...
New methods of access and discoverability bring new affordances for digital r...New methods of access and discoverability bring new affordances for digital r...
New methods of access and discoverability bring new affordances for digital r...
benosteen
 
Visualising Knowledge: Why? What? How?
Visualising Knowledge: Why? What? How?Visualising Knowledge: Why? What? How?
Visualising Knowledge: Why? What? How?
benosteen
 

Mais de benosteen (20)

Bl labs ucl-services
Bl labs ucl-servicesBl labs ucl-services
Bl labs ucl-services
 
Bl labs what is british library labs
Bl labs   what is british library labsBl labs   what is british library labs
Bl labs what is british library labs
 
British Library Labs - Overview Talk 2017
British Library Labs - Overview Talk 2017British Library Labs - Overview Talk 2017
British Library Labs - Overview Talk 2017
 
Uses of Library Collections
Uses of Library CollectionsUses of Library Collections
Uses of Library Collections
 
CityLIS talk, Feb 1st 2016
CityLIS talk, Feb 1st 2016CityLIS talk, Feb 1st 2016
CityLIS talk, Feb 1st 2016
 
NDF,Te Papa, New Zealand 2015 - Keynote
NDF,Te Papa, New Zealand 2015 - KeynoteNDF,Te Papa, New Zealand 2015 - Keynote
NDF,Te Papa, New Zealand 2015 - Keynote
 
British library labs - What? Why?
British library labs - What? Why?British library labs - What? Why?
British library labs - What? Why?
 
UKSG 2015 Mechanical curator and British Library labs
UKSG 2015  Mechanical curator and British Library labsUKSG 2015  Mechanical curator and British Library labs
UKSG 2015 Mechanical curator and British Library labs
 
Lightning Talk - LDCX 2015 Stanford
Lightning Talk - LDCX 2015 StanfordLightning Talk - LDCX 2015 Stanford
Lightning Talk - LDCX 2015 Stanford
 
104 Communicating our Collections Online
104 Communicating our Collections Online104 Communicating our Collections Online
104 Communicating our Collections Online
 
Sharing and Serendipity
Sharing and SerendipitySharing and Serendipity
Sharing and Serendipity
 
Mechanical Curator (@ CREATE PUBLIC DOMAIN WORKSHOP FOR CREATIVE BUSINESSES)
Mechanical Curator (@ CREATE PUBLIC DOMAIN WORKSHOP FOR CREATIVE BUSINESSES)Mechanical Curator (@ CREATE PUBLIC DOMAIN WORKSHOP FOR CREATIVE BUSINESSES)
Mechanical Curator (@ CREATE PUBLIC DOMAIN WORKSHOP FOR CREATIVE BUSINESSES)
 
BL Labs 2014 Symposium: The Mechanical Curator
BL Labs 2014 Symposium: The Mechanical CuratorBL Labs 2014 Symposium: The Mechanical Curator
BL Labs 2014 Symposium: The Mechanical Curator
 
The surprising adventures of the mechanical curator
The surprising adventures of the mechanical curatorThe surprising adventures of the mechanical curator
The surprising adventures of the mechanical curator
 
Mechanical curator - Technical notes
Mechanical curator - Technical notesMechanical curator - Technical notes
Mechanical curator - Technical notes
 
Apache pig as a researcher’s stepping stone
Apache pig as a researcher’s stepping stoneApache pig as a researcher’s stepping stone
Apache pig as a researcher’s stepping stone
 
New methods of access and discoverability bring new affordances for digital r...
New methods of access and discoverability bring new affordances for digital r...New methods of access and discoverability bring new affordances for digital r...
New methods of access and discoverability bring new affordances for digital r...
 
Visualising Knowledge: Why? What? How?
Visualising Knowledge: Why? What? How?Visualising Knowledge: Why? What? How?
Visualising Knowledge: Why? What? How?
 
Mashspa
MashspaMashspa
Mashspa
 
Postscript, books and binding
Postscript, books and bindingPostscript, books and binding
Postscript, books and binding
 

Último

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Último (20)

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 

Arches Getty Brownbag Talk

  • 1. Ben O’Steen, Getty Digital - Feb 20, 2019 What is “Arches”?
  • 2. Background • Developed jointly by the Getty Conservation Institute (GCI) and World Monuments Fund (WMF), with code development carried out by Farallon (SF based company) • Arches is an Open Source web-based platform (using Django web framework, Postgres database and Elasticsearch search service). • Arches can create digital inventories that describe types, locations, extent, cultural periods, materials, and conditions of heritage resources and establish the numerous and complex relationships between those resources. • Uses the “CIDOC Conceptual Reference Model” at its core, with the option to use other ontologies such as the Linked Art ontology.
  • 3. Linked Open Usable Data – Linked Art A Linked Open Usable Data model, collaboratively designed to work across cultural heritage organizations, that is easy to publish and enables a variety of consuming applications. Design Principles: • Focused on Usability, not 100% precision / completeness • Consistently solves actual challenges from real data • Development is iterative, as new use cases are found • Solve 90% of use cases, with 10% of the effort (Thanks go to Rob Sanderson & David Newbury for slide text)
  • 4.
  • 5. What is Arches good at? • Allows users to both create complex models to define their data using a web user interface (UI). • Creates web forms based on these complex models, which also can be customized using the web UI. • Allows users an easy way to enter, find and edit data, again using the web UI. • It does not require detailed knowledge of RDF, Linked Data, SPARQL, Triplestores, and RDF representational formats, paradigms which other Linked Data tools often implicitly require users to ‘just know’.
  • 6.
  • 7.
  • 8. What is Arches not good at? • Currently, Arches does not provide any graph-based query services, such as GraphQL or SPARQL. • The Reference Data Manager (that holds and manages Concepts and Authoratative terms) requires more development better to suit our needs. • Arches has not been tested at huge scales. • The default form ‘widgets’ can be verbose. • No bulk editing as yet.
  • 9. A Note on Concepts and Controlled Terms • These are not handled as Resources in Arches (ie there is no Concept model) • They are managed by the Reference Data Manager (RDM) side of Arches and imported and exported as SKOS representations, and require a textual label as well as an identifier to be used within Arches. • Currently, Resources cannot refer to concepts* that are not already present in the Arches RDM. * Or other Resources that are not in Arches
  • 10. Application Structure • Arches consists of a web-application layer, bound to a client-side javascript layer. • The javascript layer relies on a REST-based API • Stores data in a Postgres database, utilizing Postgres’s native JSON and GIS handling. • Search is performed using an Elasticsearch service, that indexes key information as determined by the application and the configuration.
  • 11. Working with other systems • As of Arches version 4.4 (releasing this week, or next), it will have acceptable JSON-LD support for import and export. • JSON-LD can be thought of a means to represent and transfer the complex information in a way that developers can make use of. • Arches will be used for what it is good at, with other suitable systems or applications being used where sensible. A valid, semantically accurate export and import is a key component for this.
  • 12. Models in Arches An Arches Model can be thought of like a template or pattern for a given type of Resource. It is akin to a database or XML schema in concept. The data held in Arches is best thought of as a graph of data rather than as tabular data (like an Excel or CSV file). A graph is a collection of points (nodes) connected by relationships (edges).
  • 13.
  • 15.
  • 16. Models are Hierarchical Trees The compromise that Arches makes is that Models are defined as hierarchical trees of information. They are still a graph of information, but a resource will ‘own’ all of the new information in its tree, and reference other resources rather than duplicate the information they own. Example: An artist resource will hold information about the artist’s name, but will only point to the resources in the system that represent their paintings.
  • 17.
  • 18. Web forms for data entry/editing in Arches They are generated from • ‘Models’ (that structure data for a given Resource you want to describe), • The Model’s ‘Cards’* (that hold that data within Arches) * (I’ll come back to what cards are. They are an Arches application concept, not a Linked Data one)
  • 19. Web forms for data entry/editing in Arches They are generated from • ‘Models’ (that structure data for a given Resource you want to describe), • The Model’s ‘Cards’* (that hold that data within Arches) • ‘Branches’ are reusable structures that you use when building Models. • For example, you might make a ‘Place’ model, and add a Branch that defines how to describe a contemporary geographical location. • Branches are reusable templates. They help avoid multiple differing structures creeping in for basic things like the geographical location, or a record of the creation of an artwork.
  • 20.
  • 21. The top node of a model Some Nodes Datatypes A model can have this sort of structure, which can be bespoke, and also pull in pre-made branches (like templates). Datatypes are how you define where in the model data is stored (or ’collected’), and what sort of data this must be. More datatypes can be developed, added, and have their own behavior. Some examples: some plain text, a date, a true or false, a number, a link to another Resource in Arches, or a link to a Concept.
  • 22. Model (‘Graph’ view) Model (Card view)
  • 23. How does Arches present this? • As a Form view (for data entry) • By default, relies on the structure Model to structure the form (1:1 mapping) • A Datatype in the model is linked to some code/html called a Widget • A Datatype (python class) maintains the data within the system, a Widget is the HTML/JS component that lets people edit or enter data for it. • As a Report (to view/interpret the data) • ‘Preview’ report (for admins) • ‘Report’ (for everyone else) This is coordinated by some code called a ‘Card Component’.
  • 24. Sorry for the jargon!
  • 25. Aspects to Customize? • A model can specify: • What datatypes (text, date, Concept, Resource, etc) are used to hold data • Whether a branch (eg Place, Name, etc) is searchable, is required, permissions and how it is represented as a Card (or potentially, Cards) • A Card can specify: • What Card Component to use (currently there is only a default one) • Big functional changes possible but equally, big development required • What Widget to use to gather data for a datatype • (including configuration options for the widget) • Whether a Card can hold multiple values (eg multiple Places, Names, etc) • What Report template to use for a Model.
  • 26. What we can develop currently (for v 4.3.x): Widgets and Datatypes are the easiest targets for development • Datatypes: • Pro: We can write our own datatypes for holding more complex data • Pro: We can change how the data is stored and indexed • Pro: We can code how this data should be represented in or imported from JSON-LD • Con: We lose the ability to structure data in the Model in the user interface. For example, a new datatype for a Name (holding first name, middle names, etc) would be specified in code, not in the user interface. To change it, would require changing the code.
  • 27. What we can develop currently (for v 4.3.x): Widgets and Datatypes are the easiest targets for development • Widgets • Pro: A datatype can have multiple widgets written for it, allowing for easy and safe customization. • Pro: What widget is applied to a datatype is set in the card view for a model. Doesn’t require development time once the widget is written and added! • Con: Doesn’t affect how the data is held or represented in the search. It is the interface only.
  • 28.
  • 29. What we can develop currently (for v 4.3.x): • Card Component: • This component orchestrates a lot of things in the form interface. • Pro: Big possibilities • Con: Big commitment to rewrite, and does not affect search indexing • Report: • Defines how people can view a Resource. • Can swap report views without unduly affecting anything else.
  • 30. Forecast of developments ‘Mobile Survey Manager’ (for Arches version 4.4 Q1) • Allow data entry and upload from mobile devices out in the field and away from a direct connection to an Arches instance. • Suited for cataloguing finds and features that have important geographical data requiring GPS and an internet connection is either unavailable, or undesired. • This work will be carried out by Farallon.
  • 31. Forecast of developments ‘Workflows’ (for Arches version 4.5, Q4 2019) • Much more complex and controllable ‘wizards’ for form entry or data editing. • Will allow for a more purposed and efficient interface to be developed for our needs. It is currently difficult to edit or create more than one resource at a time, and this is labourious for some tasks • This work will be carried out by Farallon.
  • 32. Forecast of developments Scaling and Performance testing and metrics (Q2-Q3 2019) • This will be a key factor in informing how we use and deploy Arches applications and is a priority. • Adding more integration tests and functional tests, connecting to a data metrics dashboard and measuring changes in scaling, performance and load when code or deployment choices are changed.
  • 33. Forecast of developments ‘Notification Hooks’ (by Q3 2019) • The ability for Arches to notify other users and services that information it maintains has changed in some way. • This is essential to keep other non-Arches systems in sync, and to allow other business processes to be activated when necessary. • Getty Digital and Farallon have already reached a consensus that this is necessary and will collaborate with the rest of the user community.
  • 34. Summary: • Arches has good potential for allowing complex data to be edited and curated through simple web form interfaces. • JSON-LD import and export allow for other systems to make better use of data held within an Arches instance. • Models in Arches are hierarchical trees of related fields, and Branches are reusable patterns of fields and relationships. • Data is collected in Datatypes (Arches components that defines and manages a type of data) through Widgets (HTML/JS code that provide the web UI for a given datatype).