SlideShare uma empresa Scribd logo
1 de 9
Mustafa Jarrar
Lecture Notes, Web Data Management (MCOM7348)
University of Birzeit, Palestine
1st Semester, 2013

Web Data Management
Introduction and Course Outline

Dr. Mustafa Jarrar
University of Birzeit
mjarrar@birzeit.edu
www.jarrar.info
Jarrar © 2013

1
Watch this lecture and download the slides from
http://jarrar-courses.blogspot.com/2014/01/web-data-management.html

Jarrar © 2013

2
Web Data Management, Search, and Retrieval
(MCOM7348)

Jarrar © 2013

3
Course Description:

This course aims to enrich students with methodologies and
techniques needed for gathering, storing, mining, analyzing, and
providing access to data; to help them develop modern search engines,
no-SQL databases, web information systems, and integrated services.
The main topics that will be addressed in this course includes:
(i) Tree data models: XML, XML Scheme and DTD, XML parsing, style
sheets, XPath and XQuery;
(ii) Graph data models and semantics: RDF, RDFS, OWL, SPARQL,
graph databases and RDF stores;
(iii) Data integration and retrieval scenarios: architectural solutions for the
integration issues, schema-based integration, GAV and LAV integration,
data fusion, web data mashups applications, data identity management,
Data Web, Linked Data, RDFa, and social open-graphs, modern search
engines, and indexing huge datasets.

Jarrar © 2013

4
(OWL)

(XML)
(DTD, XML Schema)
(RDFS, RDF)
(SPARQL).

(XPath, XQuery)
(RDF Stores)

.
Jarrar © 2013

5
Pre-requites: (knowledge or some maturity in) Database Systems, Web
Programming, Object-Oriented Programming.
Suggested Text Book: Web Data Management, by Serge Abitebouland
others. ISBN 1107012430, Cambridge University Press (November 28,
2011).
http://webdam.inria.fr/Jorge/
Lecture Notes and other reading material will be supplied for each lecture.

Jarrar © 2013

6
Course Outline:

Part I: Tree Data Models, by Dr. Hanna Bullata
Introduction and XML basics
DTD
XML Schema

XPath and XQuery
XML parsing, and XSLT style sheets
Project 1 and Assignments (practical session)

Jarrar © 2013

7
Course Outline:
Part II: Graph data models and semantics, Dr. Mustafa Jarrar
Introduction graph data models and RDF
RDFS (RDF Schema)
OWL (Web Ontology Language)
SPARQL (RDF Query languages)
RDF Stores, and Oracle Semantic Technology

Jarrar © 2013

8
Course Outline:
Part III: Data integration & retrieval, Dr. Mustafa Jarrar
The problem of data integration
Architectural solutions for the integration issues
Schema-based integration
GAV and LAV integration
Data Integration and Fusion using RDF
The Data Web and Linked Data
Web data Integration and RDFa and Mashups
Web-data Identity Management
Social open-graphs
Modern search engines
Indexing of huge datasets and no-SQL databases
Selected Topics (Term Papers on modern applications)

Jarrar © 2013

9

Mais conteúdo relacionado

Mais de Mustafa Jarrar

Habash: Arabic Natural Language Processing
Habash: Arabic Natural Language ProcessingHabash: Arabic Natural Language Processing
Habash: Arabic Natural Language Processing
Mustafa Jarrar
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing
Mustafa Jarrar
 
Jarrar: Sparql Project
Jarrar: Sparql ProjectJarrar: Sparql Project
Jarrar: Sparql Project
Mustafa Jarrar
 
Jarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesJarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing Ontologies
Mustafa Jarrar
 

Mais de Mustafa Jarrar (20)

BPMN 2.0 Analytical Constructs
BPMN 2.0 Analytical ConstructsBPMN 2.0 Analytical Constructs
BPMN 2.0 Analytical Constructs
 
BPMN 2.0 Descriptive Constructs
BPMN 2.0 Descriptive Constructs  BPMN 2.0 Descriptive Constructs
BPMN 2.0 Descriptive Constructs
 
Introduction to Business Process Management
Introduction to Business Process ManagementIntroduction to Business Process Management
Introduction to Business Process Management
 
Customer Complaint Ontology
Customer Complaint Ontology Customer Complaint Ontology
Customer Complaint Ontology
 
Subset, Equality, and Exclusion Rules
Subset, Equality, and Exclusion RulesSubset, Equality, and Exclusion Rules
Subset, Equality, and Exclusion Rules
 
Schema Modularization in ORM
Schema Modularization in ORMSchema Modularization in ORM
Schema Modularization in ORM
 
On Computer Science Trends and Priorities in Palestine
On Computer Science Trends and Priorities in PalestineOn Computer Science Trends and Priorities in Palestine
On Computer Science Trends and Priorities in Palestine
 
Lessons from Class Recording & Publishing of Eight Online Courses
Lessons from Class Recording & Publishing of Eight Online CoursesLessons from Class Recording & Publishing of Eight Online Courses
Lessons from Class Recording & Publishing of Eight Online Courses
 
Presentation curras paper-emnlp2014-final
Presentation curras paper-emnlp2014-finalPresentation curras paper-emnlp2014-final
Presentation curras paper-emnlp2014-final
 
Jarrar: Future Internet in Horizon 2020 Calls
Jarrar: Future Internet in Horizon 2020 CallsJarrar: Future Internet in Horizon 2020 Calls
Jarrar: Future Internet in Horizon 2020 Calls
 
Habash: Arabic Natural Language Processing
Habash: Arabic Natural Language ProcessingHabash: Arabic Natural Language Processing
Habash: Arabic Natural Language Processing
 
Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing Adnan: Introduction to Natural Language Processing
Adnan: Introduction to Natural Language Processing
 
Riestra: How to Design and engineer Competitive Horizon 2020 Proposals
Riestra: How to Design and engineer Competitive Horizon 2020 ProposalsRiestra: How to Design and engineer Competitive Horizon 2020 Proposals
Riestra: How to Design and engineer Competitive Horizon 2020 Proposals
 
Bouquet: SIERA Workshop on The Pillars of Horizon2020
Bouquet: SIERA Workshop on The Pillars of Horizon2020Bouquet: SIERA Workshop on The Pillars of Horizon2020
Bouquet: SIERA Workshop on The Pillars of Horizon2020
 
Jarrar: Sparql Project
Jarrar: Sparql ProjectJarrar: Sparql Project
Jarrar: Sparql Project
 
Jarrar: Logical Foundation of Ontology Engineering
Jarrar: Logical Foundation of Ontology EngineeringJarrar: Logical Foundation of Ontology Engineering
Jarrar: Logical Foundation of Ontology Engineering
 
Jarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing OntologiesJarrar: Stepwise Methodologies for Developing Ontologies
Jarrar: Stepwise Methodologies for Developing Ontologies
 
Jarrar: Ontology Modeling using OntoClean Methodology
Jarrar: Ontology Modeling using OntoClean MethodologyJarrar: Ontology Modeling using OntoClean Methodology
Jarrar: Ontology Modeling using OntoClean Methodology
 
Jarrar: Games
Jarrar: GamesJarrar: Games
Jarrar: Games
 
Jarrar: Informed Search
Jarrar: Informed Search  Jarrar: Informed Search
Jarrar: Informed Search
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 

Web Data Management Course outline

  • 1. Mustafa Jarrar Lecture Notes, Web Data Management (MCOM7348) University of Birzeit, Palestine 1st Semester, 2013 Web Data Management Introduction and Course Outline Dr. Mustafa Jarrar University of Birzeit mjarrar@birzeit.edu www.jarrar.info Jarrar © 2013 1
  • 2. Watch this lecture and download the slides from http://jarrar-courses.blogspot.com/2014/01/web-data-management.html Jarrar © 2013 2
  • 3. Web Data Management, Search, and Retrieval (MCOM7348) Jarrar © 2013 3
  • 4. Course Description: This course aims to enrich students with methodologies and techniques needed for gathering, storing, mining, analyzing, and providing access to data; to help them develop modern search engines, no-SQL databases, web information systems, and integrated services. The main topics that will be addressed in this course includes: (i) Tree data models: XML, XML Scheme and DTD, XML parsing, style sheets, XPath and XQuery; (ii) Graph data models and semantics: RDF, RDFS, OWL, SPARQL, graph databases and RDF stores; (iii) Data integration and retrieval scenarios: architectural solutions for the integration issues, schema-based integration, GAV and LAV integration, data fusion, web data mashups applications, data identity management, Data Web, Linked Data, RDFa, and social open-graphs, modern search engines, and indexing huge datasets. Jarrar © 2013 4
  • 5. (OWL) (XML) (DTD, XML Schema) (RDFS, RDF) (SPARQL). (XPath, XQuery) (RDF Stores) . Jarrar © 2013 5
  • 6. Pre-requites: (knowledge or some maturity in) Database Systems, Web Programming, Object-Oriented Programming. Suggested Text Book: Web Data Management, by Serge Abitebouland others. ISBN 1107012430, Cambridge University Press (November 28, 2011). http://webdam.inria.fr/Jorge/ Lecture Notes and other reading material will be supplied for each lecture. Jarrar © 2013 6
  • 7. Course Outline: Part I: Tree Data Models, by Dr. Hanna Bullata Introduction and XML basics DTD XML Schema XPath and XQuery XML parsing, and XSLT style sheets Project 1 and Assignments (practical session) Jarrar © 2013 7
  • 8. Course Outline: Part II: Graph data models and semantics, Dr. Mustafa Jarrar Introduction graph data models and RDF RDFS (RDF Schema) OWL (Web Ontology Language) SPARQL (RDF Query languages) RDF Stores, and Oracle Semantic Technology Jarrar © 2013 8
  • 9. Course Outline: Part III: Data integration & retrieval, Dr. Mustafa Jarrar The problem of data integration Architectural solutions for the integration issues Schema-based integration GAV and LAV integration Data Integration and Fusion using RDF The Data Web and Linked Data Web data Integration and RDFa and Mashups Web-data Identity Management Social open-graphs Modern search engines Indexing of huge datasets and no-SQL databases Selected Topics (Term Papers on modern applications) Jarrar © 2013 9