SlideShare uma empresa Scribd logo
1 de 16
Royal Netherlands Academy of Arts and Sciences
                      (KNAW)
  International Institute of Social History (IISG)


      Library Applications Workflow
             Vyacheslav Tykhonov

               mailto: vty@iisg.nl
               October 18, 2012
Software Tools Overview

    Evergreen library system (core) with external
    applications developed in IISG

    Digital Repository to store metadata and files
    (images, video, audio, etc)

    OCR service to convert images to text

    VisualMets Viewer to browse scans

    HiTIME project for Named Entity Recognition

    Search (VuFind) as interface to access linked
    metadata
Evergreen applications overview

    Charts Builder

    GeoLocator

    Visual Timelines

    Custom Reports

    Open Archives Initiative Protocol (OAI) for
    Metadata Harvesting (for VuFind/Wordcat,...)

    ISBN reader

    Related bibliographic records finder

    Authority linking application
Evergreen Charts Builder
    http://evergreen.iisg.nl/charts/report.1900.html
Charts Builder with filtering by
  country/language/dates
            Open website link
Evergreen GeoLocator
        Example
OCR Service
                       Website Link



    Optical character recognition (OCR) application
    for conversion of scanned images of
    handwritten, typewritten or printed text into
    machine-encoded text

    Texts can be stored in Digital Repository as
    separate layer and used for further analysis

    OCR service can recognize more than 40
    languages with high accuracy

    Can be trained to work in other languages too

    High speed of recognition (1-2 second/page)
OCR Example (Dora Russel Archive)
               Example
HiTiME Project
                       Go to website


HiTiME is text analysis system for the recognition
 and extraction of historical events and facts
 from historical sources and archives.
         Named Entity Recognition process:

    Persons (Dora Russel, Karl Marx, ...)

    Locations (Amsterdam, the Netherlands, ...)

    Dates (October 18, 2012,...)
All named entities will be stored in Knowledge
  Base and can be linked, persons can create
  social networks.
IISG resources for HiTiME
            (Machine Learning)

    Training on Authority Records from Evergreen
    can improve accuracy and recall of Named
    Entity Recognition (NER)

    Evergreen marc21 records for Topic Detection
    and Tracking (for example, 6XX Subject Access
    Fields, etc..)

    IISG archives and collections can be used to
    create corpus of related documents
HiTiME - BWSA 2.0 Demo
      http://ilk.uvt.nl/hitime/bwsa_tmp/
HiTiME – Example
       Demo
Combining of Tools: OCR + HiTIME
             Open Application
Visual Mets + OCR + NER
          Demo
Visual Timeline Application
            Example
Questions?
  International Institute of Social History (IISG)
Royal Netherlands Academy of Arts and Sciences
                    (KNAW)
      Digital Infrastructure Department (DI)

              Vyacheslav Tykhonov
           Library Systems Developer
                mailto: vty@iisg.nl

Mais conteúdo relacionado

Destaque

The New Cost To Value Curve
The New Cost To Value CurveThe New Cost To Value Curve
The New Cost To Value Curve
Wim Rampen
 
Active Magazine Najaar2011
Active Magazine Najaar2011Active Magazine Najaar2011
Active Magazine Najaar2011
michellethart
 

Destaque (6)

Bredeschool
BredeschoolBredeschool
Bredeschool
 
Overzicht beroeps- en andere verenigingen
Overzicht beroeps- en andere verenigingenOverzicht beroeps- en andere verenigingen
Overzicht beroeps- en andere verenigingen
 
Feature report "Rail risk - Stay on Track" | Aon NL
Feature report "Rail risk - Stay on Track" | Aon NLFeature report "Rail risk - Stay on Track" | Aon NL
Feature report "Rail risk - Stay on Track" | Aon NL
 
Active Magazine Najaar2011
Active Magazine Najaar2011Active Magazine Najaar2011
Active Magazine Najaar2011
 
The New Cost To Value Curve
The New Cost To Value CurveThe New Cost To Value Curve
The New Cost To Value Curve
 
Active Magazine Najaar2011
Active Magazine Najaar2011Active Magazine Najaar2011
Active Magazine Najaar2011
 

Semelhante a IISG applications overview

Hypatia for dlf 2011
Hypatia for dlf 2011Hypatia for dlf 2011
Hypatia for dlf 2011
DLFCLIR
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSW
Sören Auer
 

Semelhante a IISG applications overview (20)

The JISC Information Environment and collection description
The JISC Information Environment and collection descriptionThe JISC Information Environment and collection description
The JISC Information Environment and collection description
 
Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries
 
ABCD Open Source Software for managing ETD repositories
ABCD Open Source Software for managing ETD repositoriesABCD Open Source Software for managing ETD repositories
ABCD Open Source Software for managing ETD repositories
 
A02 matthew adams_keynote_paperless
A02 matthew adams_keynote_paperlessA02 matthew adams_keynote_paperless
A02 matthew adams_keynote_paperless
 
lodlam summit session browsable linked data
lodlam summit session browsable linked datalodlam summit session browsable linked data
lodlam summit session browsable linked data
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
A02 matthew adams_keynote_paperless
A02 matthew adams_keynote_paperlessA02 matthew adams_keynote_paperless
A02 matthew adams_keynote_paperless
 
8. (Semantic Interoperability in the CLARIN infrastructure. Menzo Windhouwer....
8. (Semantic Interoperability in the CLARIN infrastructure. Menzo Windhouwer....8. (Semantic Interoperability in the CLARIN infrastructure. Menzo Windhouwer....
8. (Semantic Interoperability in the CLARIN infrastructure. Menzo Windhouwer....
 
Gbrds Tech Issues Op
Gbrds Tech Issues OpGbrds Tech Issues Op
Gbrds Tech Issues Op
 
Hypatia for dlf 2011
Hypatia for dlf 2011Hypatia for dlf 2011
Hypatia for dlf 2011
 
Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Project
 
Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-
 
Intro to Digitization Projects
Intro to Digitization ProjectsIntro to Digitization Projects
Intro to Digitization Projects
 
Web Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineWeb Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search Engine
 
Dspace
DspaceDspace
Dspace
 
Dspace
DspaceDspace
Dspace
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSW
 
A Service-Oriented National E-Theses Information System And Repository
A Service-Oriented National E-Theses Information System And RepositoryA Service-Oriented National E-Theses Information System And Repository
A Service-Oriented National E-Theses Information System And Repository
 
Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.Data integration with a façade. The case of knowledge graph construction.
Data integration with a façade. The case of knowledge graph construction.
 
New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...New challenges for digital scholarship and curation in the era of ubiquitous ...
New challenges for digital scholarship and curation in the era of ubiquitous ...
 

Mais de vty

Mais de vty (20)

Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs
 
Decentralisation and knowledge graphs
Decentralisation and knowledge graphs Decentralisation and knowledge graphs
Decentralisation and knowledge graphs
 
Decentralised identifiers for CLARIAH infrastructure
Decentralised identifiers for CLARIAH infrastructure Decentralised identifiers for CLARIAH infrastructure
Decentralised identifiers for CLARIAH infrastructure
 
Dataverse repository for research data in the COVID-19 Museum
Dataverse repository for research data  in the COVID-19 MuseumDataverse repository for research data  in the COVID-19 Museum
Dataverse repository for research data in the COVID-19 Museum
 
Metaverse for Dataverse
Metaverse for DataverseMetaverse for Dataverse
Metaverse for Dataverse
 
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
Flexibility in Metadata Schemes and Standardisation: the Case of CMDI and DAN...
 
External CV support in Dataverse 5.7
External CV support in Dataverse 5.7External CV support in Dataverse 5.7
External CV support in Dataverse 5.7
 
Building COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhyBuilding COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhy
 
CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes CLARIN CMDI use case and flexible metadata schemes
CLARIN CMDI use case and flexible metadata schemes
 
Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21Flexible metadata schemes for research data repositories - CLARIN Conference'21
Flexible metadata schemes for research data repositories - CLARIN Conference'21
 
Controlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repositoryControlled vocabularies and ontologies in Dataverse data repository
Controlled vocabularies and ontologies in Dataverse data repository
 
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
Automated CI/CD testing, installation and deployment of Dataverse infrastruct...
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligence
 
External controlled vocabularies support in Dataverse
External controlled vocabularies support in DataverseExternal controlled vocabularies support in Dataverse
External controlled vocabularies support in Dataverse
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in DataverseClariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
Clariah Tech Day: Controlled Vocabularies and Ontologies in Dataverse
 
5 years of Dataverse evolution
5 years of Dataverse evolution 5 years of Dataverse evolution
5 years of Dataverse evolution
 
Ontologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and DataverseOntologies, controlled vocabularies and Dataverse
Ontologies, controlled vocabularies and Dataverse
 
CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse CLARIN CMDI support in Dataverse
CLARIN CMDI support in Dataverse
 
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC,  Service QA and DataverseIntegration of WORSICA’s thematic service in EOSC,  Service QA and Dataverse
Integration of WORSICA’s thematic service in EOSC, Service QA and Dataverse
 

Último

Powerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost LoverPowerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost Lover
PsychicRuben LoveSpells
 
THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...
THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...
THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...
Faga1939
 

Último (20)

Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's DevelopmentNara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
Nara Chandrababu Naidu's Visionary Policies For Andhra Pradesh's Development
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 46 (Gurgaon)
 
06052024_First India Newspaper Jaipur.pdf
06052024_First India Newspaper Jaipur.pdf06052024_First India Newspaper Jaipur.pdf
06052024_First India Newspaper Jaipur.pdf
 
declarationleaders_sd_re_greens_theleft_5.pdf
declarationleaders_sd_re_greens_theleft_5.pdfdeclarationleaders_sd_re_greens_theleft_5.pdf
declarationleaders_sd_re_greens_theleft_5.pdf
 
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreie
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreieGujarat-SEBCs.pdf pfpkoopapriorjfperjreie
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreie
 
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopkoEmbed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
Embed-2 (1).pdfb[k[k[[k[kkkpkdpokkdpkopko
 
04052024_First India Newspaper Jaipur.pdf
04052024_First India Newspaper Jaipur.pdf04052024_First India Newspaper Jaipur.pdf
04052024_First India Newspaper Jaipur.pdf
 
KING VISHNU BHAGWANON KA BHAGWAN PARAMATMONKA PARATOMIC PARAMANU KASARVAMANVA...
KING VISHNU BHAGWANON KA BHAGWAN PARAMATMONKA PARATOMIC PARAMANU KASARVAMANVA...KING VISHNU BHAGWANON KA BHAGWAN PARAMATMONKA PARATOMIC PARAMANU KASARVAMANVA...
KING VISHNU BHAGWANON KA BHAGWAN PARAMATMONKA PARATOMIC PARAMANU KASARVAMANVA...
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 48 (Gurgaon)
 
Embed-4.pdf lkdiinlajeklhndklheduhuekjdh
Embed-4.pdf lkdiinlajeklhndklheduhuekjdhEmbed-4.pdf lkdiinlajeklhndklheduhuekjdh
Embed-4.pdf lkdiinlajeklhndklheduhuekjdh
 
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
Transformative Leadership: N Chandrababu Naidu and TDP's Vision for Innovatio...
 
*Navigating Electoral Terrain: TDP's Performance under N Chandrababu Naidu's ...
*Navigating Electoral Terrain: TDP's Performance under N Chandrababu Naidu's ...*Navigating Electoral Terrain: TDP's Performance under N Chandrababu Naidu's ...
*Navigating Electoral Terrain: TDP's Performance under N Chandrababu Naidu's ...
 
Powerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost LoverPowerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost Lover
Powerful Love Spells in Phoenix, AZ (310) 882-6330 Bring Back Lost Lover
 
Politician uddhav thackeray biography- Full Details
Politician uddhav thackeray biography- Full DetailsPolitician uddhav thackeray biography- Full Details
Politician uddhav thackeray biography- Full Details
 
Busty Desi⚡Call Girls in Sector 62 Noida Escorts >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Sector 62 Noida Escorts >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Sector 62 Noida Escorts >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Sector 62 Noida Escorts >༒8448380779 Escort Service
 
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
America Is the Target; Israel Is the Front Line _ Andy Blumenthal _ The Blogs...
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 47 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 47 (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 47 (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Gurgaon Sector 47 (Gurgaon)
 
THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...
THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...
THE OBSTACLES THAT IMPEDE THE DEVELOPMENT OF BRAZIL IN THE CONTEMPORARY ERA A...
 
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort ServiceBusty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
Busty Desi⚡Call Girls in Vasundhara Ghaziabad >༒8448380779 Escort Service
 
Enjoy Night ≽ 8448380779 ≼ Call Girls In Palam Vihar (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Palam Vihar (Gurgaon)Enjoy Night ≽ 8448380779 ≼ Call Girls In Palam Vihar (Gurgaon)
Enjoy Night ≽ 8448380779 ≼ Call Girls In Palam Vihar (Gurgaon)
 

IISG applications overview

  • 1. Royal Netherlands Academy of Arts and Sciences (KNAW) International Institute of Social History (IISG) Library Applications Workflow Vyacheslav Tykhonov mailto: vty@iisg.nl October 18, 2012
  • 2. Software Tools Overview  Evergreen library system (core) with external applications developed in IISG  Digital Repository to store metadata and files (images, video, audio, etc)  OCR service to convert images to text  VisualMets Viewer to browse scans  HiTIME project for Named Entity Recognition  Search (VuFind) as interface to access linked metadata
  • 3. Evergreen applications overview  Charts Builder  GeoLocator  Visual Timelines  Custom Reports  Open Archives Initiative Protocol (OAI) for Metadata Harvesting (for VuFind/Wordcat,...)  ISBN reader  Related bibliographic records finder  Authority linking application
  • 4. Evergreen Charts Builder http://evergreen.iisg.nl/charts/report.1900.html
  • 5. Charts Builder with filtering by country/language/dates Open website link
  • 7. OCR Service Website Link  Optical character recognition (OCR) application for conversion of scanned images of handwritten, typewritten or printed text into machine-encoded text  Texts can be stored in Digital Repository as separate layer and used for further analysis  OCR service can recognize more than 40 languages with high accuracy  Can be trained to work in other languages too  High speed of recognition (1-2 second/page)
  • 8. OCR Example (Dora Russel Archive) Example
  • 9. HiTiME Project Go to website HiTiME is text analysis system for the recognition and extraction of historical events and facts from historical sources and archives. Named Entity Recognition process:  Persons (Dora Russel, Karl Marx, ...)  Locations (Amsterdam, the Netherlands, ...)  Dates (October 18, 2012,...) All named entities will be stored in Knowledge Base and can be linked, persons can create social networks.
  • 10. IISG resources for HiTiME (Machine Learning)  Training on Authority Records from Evergreen can improve accuracy and recall of Named Entity Recognition (NER)  Evergreen marc21 records for Topic Detection and Tracking (for example, 6XX Subject Access Fields, etc..)  IISG archives and collections can be used to create corpus of related documents
  • 11. HiTiME - BWSA 2.0 Demo http://ilk.uvt.nl/hitime/bwsa_tmp/
  • 13. Combining of Tools: OCR + HiTIME Open Application
  • 14. Visual Mets + OCR + NER Demo
  • 16. Questions? International Institute of Social History (IISG) Royal Netherlands Academy of Arts and Sciences (KNAW) Digital Infrastructure Department (DI) Vyacheslav Tykhonov Library Systems Developer mailto: vty@iisg.nl