An approach to semantics word extraction by applying language phonology in rakuten

•

0 gostou•11,689 visualizações

Rakuten Group, Inc.

In its daily business activities, Rakuten has experienced massive and diverse increases in the quantity of data, for example, product information, merchant information, and customer activities (e.g., queries, emails, reviews, helpdesk, etc.). In view of this, the identification of each entity is vitally important for an e-commerce search engine to find those exactly what the users want. An approach of matching semantic information is very useful in solving the variety of representations and meanings of those entities since the variations of user-created entities cannot be resolved in all cases. One very basic but powerful approach is therefore to consolidate a dictionary and to update this dictionary through the data-driven extraction of entities. A phonetic-based approach is a useful method of identifying these entities. Our method consists of three parts: preprocessing, measuring phonetic similarity,and postprocessing. Our model is trained on phonetic representations of unlabeled data using an unsupervised EM algorithm and is tested on the extraction of entity-word pairs. On average, the results of testing achieved the highest level of performance.

3
ヘアーアイロン
ヘアーアイロン

4
ヘアーアイロンヘアアイロン
ヘアーアイロ
ンヘアア
イロン

5
 籠篭
バスケットヘアーアイロン


6


7
ワンピース
デザインボタニカル柄
ミセスファッション
キャロルグレイ
… …
… …
… …

8
… …
ワンピース
デザインボタニカル柄
ミセスファッション
キャロルグレイ



… …



9
•
•

10
•
•
•

11
https://search.rakuten.co.jp/search/mall/hair+iron/ (Access
Date: 2018/10/24)

ありがとうございました。
ohnmar.htun@rakuten.com

13
ʋ
A part of CLSG

Mais conteúdo relacionado

Mais de Rakuten Group, Inc.

楽天の規模とクラウドプラットフォーム統括部の役割

楽天の規模とクラウドプラットフォーム統括部の役割

楽天の規模とクラウドプラットフォーム統括部の役割Rakuten Group, Inc.

Rakuten Services and Infrastructure Team.pdf

Rakuten Services and Infrastructure Team.pdf

Rakuten Services and Infrastructure Team.pdfRakuten Group, Inc.

The Data Platform Administration Handling the 100 PB.pdf

The Data Platform Administration Handling the 100 PB.pdf

The Data Platform Administration Handling the 100 PB.pdfRakuten Group, Inc.

Supporting Internal Customers as Technical Account Managers.pdf

Supporting Internal Customers as Technical Account Managers.pdf

Supporting Internal Customers as Technical Account Managers.pdfRakuten Group, Inc.

Making Cloud Native CI_CD Services.pdf

Making Cloud Native CI_CD Services.pdf

Making Cloud Native CI_CD Services.pdfRakuten Group, Inc.

How We Defined Our Own Cloud.pdf

How We Defined Our Own Cloud.pdf

How We Defined Our Own Cloud.pdfRakuten Group, Inc.

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech infoRakuten Group, Inc.

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech infoRakuten Group, Inc.

OWASPTop10_Introduction

OWASPTop10_Introduction

OWASPTop10_IntroductionRakuten Group, Inc.

Introduction of GORA API Group technology

Introduction of GORA API Group technology

Introduction of GORA API Group technologyRakuten Group, Inc.

100PBを越えるデータプラットフォームの実情

100PBを越えるデータプラットフォームの実情

100PBを越えるデータプラットフォームの実情Rakuten Group, Inc.

社内エンジニアを支えるテクニカルアカウントマネージャー

社内エンジニアを支えるテクニカルアカウントマネージャー

社内エンジニアを支えるテクニカルアカウントマネージャーRakuten Group, Inc.

モニタリングプラットフォーム開発の裏側

モニタリングプラットフォーム開発の裏側

モニタリングプラットフォーム開発の裏側Rakuten Group, Inc.

楽天のインフラ事情 2022Rakuten Group, Inc.

楽天サービスとインフラ部隊Rakuten Group, Inc.

Rakuten Platform

Rakuten Platform

Rakuten PlatformRakuten Group, Inc.

Kafka & Hadoop in Rakuten

Kafka & Hadoop in Rakuten

Kafka & Hadoop in RakutenRakuten Group, Inc.

Unclouding Container Challenges

Unclouding Container Challenges

Unclouding Container ChallengesRakuten Group, Inc.

Functional Programming in Pattern-Match-Oriented Programming Style <Programmi...

Functional Programming in Pattern-Match-Oriented Programming Style <Programmi...

Functional Programming in Pattern-Match-Oriented Programming Style <Programmi...Rakuten Group, Inc.

アジャイル開発とメトリクスRakuten Group, Inc.

Mais de Rakuten Group, Inc. (20)

楽天の規模とクラウドプラットフォーム統括部の役割

楽天の規模とクラウドプラットフォーム統括部の役割

楽天の規模とクラウドプラットフォーム統括部の役割

Rakuten Services and Infrastructure Team.pdf

Rakuten Services and Infrastructure Team.pdf

Rakuten Services and Infrastructure Team.pdf

The Data Platform Administration Handling the 100 PB.pdf

The Data Platform Administration Handling the 100 PB.pdf

The Data Platform Administration Handling the 100 PB.pdf

Supporting Internal Customers as Technical Account Managers.pdf

Supporting Internal Customers as Technical Account Managers.pdf

Supporting Internal Customers as Technical Account Managers.pdf

Making Cloud Native CI_CD Services.pdf

Making Cloud Native CI_CD Services.pdf

Making Cloud Native CI_CD Services.pdf

How We Defined Our Own Cloud.pdf

How We Defined Our Own Cloud.pdf

How We Defined Our Own Cloud.pdf

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech info

Travel & Leisure Platform Department's tech info

OWASPTop10_Introduction

OWASPTop10_Introduction

OWASPTop10_Introduction

Introduction of GORA API Group technology

Introduction of GORA API Group technology

Introduction of GORA API Group technology

100PBを越えるデータプラットフォームの実情

100PBを越えるデータプラットフォームの実情

100PBを越えるデータプラットフォームの実情

社内エンジニアを支えるテクニカルアカウントマネージャー

社内エンジニアを支えるテクニカルアカウントマネージャー

社内エンジニアを支えるテクニカルアカウントマネージャー

モニタリングプラットフォーム開発の裏側

モニタリングプラットフォーム開発の裏側

モニタリングプラットフォーム開発の裏側

楽天のインフラ事情 2022

楽天サービスとインフラ部隊

Rakuten Platform

Rakuten Platform

Rakuten Platform

Kafka & Hadoop in Rakuten

Kafka & Hadoop in Rakuten

Kafka & Hadoop in Rakuten

Unclouding Container Challenges

Unclouding Container Challenges

Unclouding Container Challenges

Functional Programming in Pattern-Match-Oriented Programming Style <Programmi...

Functional Programming in Pattern-Match-Oriented Programming Style <Programmi...

Functional Programming in Pattern-Match-Oriented Programming Style <Programmi...

アジャイル開発とメトリクス

Último

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays

How to Troubleshoot Apps for the Modern Connected Worker

How to Troubleshoot Apps for the Modern Connected Worker

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

DBX First Quarter 2024 Investor Presentation

DBX First Quarter 2024 Investor Presentation

DBX First Quarter 2024 Investor PresentationDropbox

Artificial Intelligence Chap.5 : Uncertainty

Artificial Intelligence Chap.5 : Uncertainty

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

FWD Group - Insurer Innovation Award 2024

FWD Group - Insurer Innovation Award 2024

FWD Group - Insurer Innovation Award 2024The Digital Insurer

2024: Domino Containers - The Next Step. News from the Domino Container commu...

2024: Domino Containers - The Next Step. News from the Domino Container commu...

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

A Beginners Guide to Building a RAG App Using Open Source Milvus

A Beginners Guide to Building a RAG App Using Open Source Milvus

A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz

Corporate and higher education May webinar.pptx

Corporate and higher education May webinar.pptx

Corporate and higher education May webinar.pptxRustici Software

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc

Why Teams call analytics are critical to your entire business

Why Teams call analytics are critical to your entire business

Why Teams call analytics are critical to your entire businesspanagenda

Architecting Cloud Native Applications

Architecting Cloud Native Applications

Architecting Cloud Native ApplicationsWSO2

Real Time Object Detection Using Open CV

Real Time Object Detection Using Open CV

Real Time Object Detection Using Open CVKhem

MS Copilot expands with MS Graph connectors

MS Copilot expands with MS Graph connectors

MS Copilot expands with MS Graph connectorsNanddeep Nachan

Exploring the Future Potential of AI-Enabled Smartphone Processors

Exploring the Future Potential of AI-Enabled Smartphone Processors

Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

MINDCTI Revenue Release Quarter One 2024

MINDCTI Revenue Release Quarter One 2024

MINDCTI Revenue Release Quarter One 2024MIND CTI

Último (20)

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...

How to Troubleshoot Apps for the Modern Connected Worker

How to Troubleshoot Apps for the Modern Connected Worker

How to Troubleshoot Apps for the Modern Connected Worker

DBX First Quarter 2024 Investor Presentation

DBX First Quarter 2024 Investor Presentation

DBX First Quarter 2024 Investor Presentation

Artificial Intelligence Chap.5 : Uncertainty

Artificial Intelligence Chap.5 : Uncertainty

Artificial Intelligence Chap.5 : Uncertainty

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

FWD Group - Insurer Innovation Award 2024

FWD Group - Insurer Innovation Award 2024

FWD Group - Insurer Innovation Award 2024

2024: Domino Containers - The Next Step. News from the Domino Container commu...

2024: Domino Containers - The Next Step. News from the Domino Container commu...

2024: Domino Containers - The Next Step. News from the Domino Container commu...

A Beginners Guide to Building a RAG App Using Open Source Milvus

A Beginners Guide to Building a RAG App Using Open Source Milvus

A Beginners Guide to Building a RAG App Using Open Source Milvus

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

Corporate and higher education May webinar.pptx

Corporate and higher education May webinar.pptx

Corporate and higher education May webinar.pptx

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery

Why Teams call analytics are critical to your entire business

Why Teams call analytics are critical to your entire business

Why Teams call analytics are critical to your entire business

Architecting Cloud Native Applications

Architecting Cloud Native Applications

Architecting Cloud Native Applications

Real Time Object Detection Using Open CV

Real Time Object Detection Using Open CV

Real Time Object Detection Using Open CV

MS Copilot expands with MS Graph connectors

MS Copilot expands with MS Graph connectors

MS Copilot expands with MS Graph connectors

Exploring the Future Potential of AI-Enabled Smartphone Processors

Exploring the Future Potential of AI-Enabled Smartphone Processors

Exploring the Future Potential of AI-Enabled Smartphone Processors

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

MINDCTI Revenue Release Quarter One 2024

MINDCTI Revenue Release Quarter One 2024

MINDCTI Revenue Release Quarter One 2024

An approach to semantics word extraction by applying language phonology in rakuten

1.

3. 3 ヘアーアイロンヘアーアイロン

4. 4 ヘアーアイロンヘアアイロンヘアーアイロンヘアアイロン

5. 5  籠篭バスケットヘアーアイロン 

7. 7 ワンピースデザインボタニカル柄ミセスファッションキャロルグレイ … … … … … …

8. 8 … … ワンピースデザインボタニカル柄ミセスファッションキャロルグレイ    … …  

10. 10 • • •

11. 11 https://search.rakuten.co.jp/search/mall/hair+iron/ (Access Date: 2018/10/24)

12. ありがとうございました。 ohnmar.htun@rakuten.com

13. 13 ʋ A part of CLSG