Creating Knowledge Out of Interlinked Data
KAIST Project
Mun Y. Yi
19-09-2011

Agenda
  • Introduction of KAIST
  • KAIST LOD Team
  • Description of Work
      • Tasks
      • Deliverables
  • Current Status

Introduction of KAIST
  • KAIST (Korea Advanced Institute of Science and Technology) is the first and top science and technology research university in Korea.
  • Founded in 1971 to educate elite scientists and engineers.
  • Located in the Daedeok Research Complex in the city of Daejeon, 150 kilometers south of Seoul.
  • For the 2009 academic year, over 8,000 students were enrolled: 3,452 in the bachelor's, 2,197 in the master's, and 2,357 in the doctoral program.
  • KAIST had 842 professors and 334 staff members as of January 2009.
  • In the QS World University Rankings 2011, KAIST ranked 90th in the world and 2nd in Korea.

KAIST LOD Team
  • Key-Sun Choi
      • Director of Semantic Web Research Center
      • Head of the Computer Science Department
      • Expertise in ontology, NLP, and the Semantic Web
  • Mun Y. Yi
      • Director of Knowledge Systems Lab
      • Associate professor in the Knowledge Service Engineering Department
      • Expertise in knowledge engineering, recommender systems, e-learning, and MIS/HCI
  • In-Young Ko
      • Director of WebEng Lab
      • Associate professor in the Computer Science Department
      • Expertise in software engineering and Web engineering, including Web services, Web-based information management, and the Semantic Web
  • Ying Liu
      • Director of Intelligent System and Service Lab
      • Assistant professor in the Knowledge Service Engineering Department
      • Expertise in TableSeer, information retrieval, and text mining

Work Description: Tasks

Task 3.2: Provenance-Aware Linked Data Extraction from Unstructured and Semi-Structured Sources
  • KAIST will contribute its experience in extracting Linked Data from Korean resources. KAIST has the most advanced technology for processing Korean natural language resources and data. One example of such a resource is CoreNet, which contains a taxonomic hierarchy, concept definitions, and frame sets for Korean, Japanese, and Chinese words.
  • KAIST will build a Korean version of NLP2RDF by integrating various Korean natural language tools and providing the output of those toolkits in RDF format.
  • KAIST will also facilitate the standardization of NLP2RDF through its involvement in the ISO group TC37/SC4 (Language Resources Management).

Task 4.1: Semi-Automatic Data Interlinking
  • KAIST will contribute to this task by providing a platform for automatic linking with Korean, Chinese, and Japanese RDF resources.
  • CoreNet contains a hierarchical concept structure for Korean, Chinese, and Japanese words. WordNet is already integrated into LOD, so once the concepts of CoreNet are mapped to WordNet synsets, KAIST can provide a Korean, Chinese, and Japanese RDF data integration platform for Linked Data by offering a mechanism that maps such data to CoreNet, thus addressing multilingual issues for these Asian languages.
  • KAIST has taken the initial step of the CoreNet-WordNet mapping and has already made some progress.

Task 4.5a: Multilingual Linked Data Fusion
  • KAIST will choose the DBpedia dataset as the pivot multilingual dataset, since it has been extracted for many languages.
  • KAIST will work on the multilingual fusion of these DBpedia datasets, so that other multilingual resources only need to be fused with the DBpedia of their own language.
  • As a first step, KAIST is working on the bilingual fusion between the Korean DBpedia and the English DBpedia, and has already obtained some results.
  • By the end of the project, these results will be extended to the fusion of the Chinese and Japanese DBpedia with the Korean and English DBpedia. We aim to reach more than 90% precision and recall with this multilingual fusion approach.

Task 6.4: Development of application scenarios and testing of the LOD2 stack configurator
  • The stack configurator will enable potential users to create their own personalized version of the LOD2 Stack, containing only those functions relevant to their usage scenarios.
  • In this task, LOD2 partners will conduct an in-depth analysis of different application scenarios and identify the LOD2 functional components that adequately respond to specific application requirements.
  • The results of the study will be used to assist the development of the stack configurator and to prepare comprehensive LOD2 documentation from both the engineer's and the user's viewpoint.

Task 10.2d: Training and Dissemination in Korea (KAIST)
  • KAIST will ensure the penetration of LOD2 results in a dynamic Asian country by organizing a number of events and outreach activities, such as:
      • Two research-oriented Data Web symposia to bring together relevant researchers in Asia with the LOD2 consortium
      • Two industry workshops to disseminate LOD2 results to Korean and Japanese companies and to facilitate cooperation and market entry of industrial LOD2 partners
      • One Asian Data Web summer school to reach out to PhD students and young researchers
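
The 90% precision and recall target in Task 4.5a can be made concrete with a small calculation. The following is a minimal sketch, assuming fusion output is represented as a set of (Korean resource, English resource) pairs and compared against a hand-checked gold standard. The function name, the pair representation, and the romanized placeholder identifiers are assumptions for illustration; the slides do not describe KAIST's actual evaluation procedure.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) of predicted links against gold-standard links."""
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Precision: share of predicted links that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of gold-standard links that were found.
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical identifiers, not real DBpedia URIs.
gold_links = {
    ("ko:Seoul", "en:Seoul"),
    ("ko:Daejeon", "en:Daejeon"),
    ("ko:Busan", "en:Busan"),
    ("ko:KAIST", "en:KAIST"),
}
predicted_links = {
    ("ko:Seoul", "en:Seoul"),
    ("ko:Daejeon", "en:Daejeon"),
    ("ko:Busan", "en:Pusan_National_University"),  # a wrong link
    ("ko:KAIST", "en:KAIST"),
}

precision, recall = precision_recall(predicted_links, gold_links)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy example three of the four predicted links are correct and three of the four gold links are found, so both measures fall short of the 90% target.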

Work Description: Deliverables

Deliverable 3.2.4 Korean NLP2RDF (KAIST, M32)
  • Initial release of the NLP2RDF framework for Korean text. This will include various Korean NLP tools and data, including CoreNet.
  • Compared to English, Korean NLP toolkits are less developed and less openly available; hence, most of the time will be devoted to developing new Korean NLP tools that will contribute to LOD.

Deliverable 4.1.3 Korean Resource Linking Assist Release (M24)
  • The first version of the Korean resource linking assistant for DBpedia will intelligently recommend and order the possible mappings for the knowledge engineer.
  • This will be implemented as an extension of Deliverable 4.1.1.

Deliverable 4.1.4 Asian Resource Linking Assist Release (M30)
  • This tool will help the knowledge engineer link Korean, Chinese, and Japanese language resources to Linked Data by recommending and ordering appropriate mappings.

Deliverable 4.5.3 Korean Data Fusion Assistant (M30)
  • The component will support the fusion of Korean data into English LOD by combining Deliverable 4.5.1 with the fused dataset of the English and Korean DBpedia.
  • More precisely, the component will first fuse the new Korean dataset into the Korean DBpedia using D4.5.1, and the result will then be fused into the English DBpedia by applying the fusion result of the Korean and English DBpedia.

Deliverable 4.5.4 Asian Data Fusion Assistant (M36)
  • The component is an extension of Deliverable 4.5.3 and will support the data fusion of Korean, Japanese, and Chinese datasets.
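
The two-stage fusion described for Deliverable 4.5.3 amounts to chaining two sets of links through the Korean DBpedia as a pivot. The following is a minimal sketch, assuming each stage can be reduced to a dictionary from source resource to target resource. The function name, the dictionary representation, and all identifiers are hypothetical; the real component would operate on RDF data and on the output of D4.5.1, whose interface is not described in the slides.

```python
def compose_links(new_to_ko, ko_to_en):
    """Chain links: new Korean dataset -> Korean DBpedia -> English DBpedia."""
    fused = {}
    unresolved = []
    for source, ko_resource in new_to_ko.items():
        en_resource = ko_to_en.get(ko_resource)
        if en_resource is None:
            # The pivot resource has no English counterpart yet.
            unresolved.append(source)
        else:
            fused[source] = en_resource
    return fused, unresolved


# Stage 1 (stand-in for D4.5.1 output): new dataset fused into Korean DBpedia.
new_to_ko = {
    "trad:item_001": "ko:Dongui_Bogam",
    "trad:item_002": "ko:Acupuncture",
    "trad:item_003": "ko:Unmapped_Article",
}
# Stage 2 (stand-in for the Korean-English DBpedia fusion result).
ko_to_en = {
    "ko:Dongui_Bogam": "en:Dongui_Bogam",
    "ko:Acupuncture": "en:Acupuncture",
}

fused, unresolved = compose_links(new_to_ko, ko_to_en)
print(fused)
print(unresolved)
```

Keeping the unresolved resources separate shows which items would still need attention from a knowledge engineer, for example through the linking assistants of Deliverables 4.1.3 and 4.1.4.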

Current Status
  • In preparation for a proposal to Korea MKE (Korea Ministry of Knowledge and Economy)
      • Need to involve industry partners
  • Potential projects/applications
      • CoreNet to LOD
      • Korean NLP2RDF
      • Multilingual DBpedia matching and expansion
      • Link Korea Traditional Knowledge DB to LOD
          • Have similar work done in China and Japan
      • Wiki History and Wiki Q&A
      • Korean Wiki annotation

LOD2 Plenary Meeting 2011: KAIST – Partner Introduction
