SlideShare uma empresa Scribd logo
1 de 1
Baixar para ler offline
Intent Classification of Voice Queries on Mobile DevicesIntent Classification of Voice Queries on Mobile DevicesIntent Classification of Voice Queries on Mobile DevicesIntent Classification of Voice Queries on Mobile Devices
Subhabrata Mukherjee†, Ashish Verma†, Kenneth W. Church‡
†IBM India Research Lab, ‡ IBM T.J. Watson Research Center
• Mobile query classification is made difficult by short, noisy queries and the presence of inter-active and personalized queries like map (where is the closest restaurant),
command-and-control (open facebook), dialogue (how are you?), joke (will you marry me?) etc.
• Voice queries are more difficult than typed queries due to the errors introduced by the automatic speech recognizer
• In this work we bring the complexities of voice search and intent classification together in proposing a multi-stage classifier for intent classification
Introduction
• 52,282 unique queries, with total 1,04,950 impressions, are collected from an android voice search application and manually tagged. Avg. number of words per
query is 2.3. Avg. word error rate of ASR engine is 20%. The figure below shows the F1 score comparison of different models over 13 most frequent query classes.
Evaluation
• Stage 1 - Multi-class Naïve Bayes classifier uses bag-of-words, part-of-speech tag information and domain words of
the query as features and predicts top K probable classes for a query.
• Domain words (DW) of a query are extracted from url’s of top ranked retrieved pages from search engine. For
example, the search engine retrieves the DW’s imdb marvel en.wikipedia youtube etc. for the query the avengers.
• DW introduce external knowledge in classifier (Movies, Music, Sports etc.) and keeps online data requirement low
• POS tags detect patterns for Command-and-Control, Map, Knowledge, Dialogue, broken queries (Endpoint) etc.
• Stage 2 - Top K ranked classes are taken with some additional features in a logistic regression classifier.
• Features like DW and url ranking, DW and query overlap (Navigational ), substrings having maximum information
gain for a class help differentiate between close query categories from Stage 1
• Stage 3 – Prediction of the independent binary classifiers are combined into a single prediction
Intent
Classification
0
0.2
0.4
0.6
0.8
1
1.2
w eather map nav command know ledge movies dialogue restaurant w eb music sports acceptor endpoint
Reg+Vocab+POS+Yahoo Vocab+POS+Yahoo Vocab+Yahoo Vocab+POS Vocab
•Navigational - Query DW overlap
•Map - Presence of substrings find,
close, direction, near, where in query
•Movie - Presence of IMDB as the
topmost DW
•Command-and-Control - Query
starting with a Verb
•Websearch - Presence of Wikipedia
in the topmost two DW’s
KnowledgeWhat is pi
EndpointHow to spell
NavigationalBest buy com
Command-and-ControlCall mom
MapFind nearest SubwayExample Queries

Mais conteúdo relacionado

Mais de Subhabrata Mukherjee

Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Subhabrata Mukherjee
 

Mais de Subhabrata Mukherjee (16)

XtremeDistil: Multi-stage Distillation for Massive Multilingual Models
XtremeDistil: Multi-stage Distillation for Massive Multilingual ModelsXtremeDistil: Multi-stage Distillation for Massive Multilingual Models
XtremeDistil: Multi-stage Distillation for Massive Multilingual Models
 
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
 
Fact Checking from Text
Fact Checking from TextFact Checking from Text
Fact Checking from Text
 
OpenTag: Open Attribute Value Extraction From Product Profiles
OpenTag: Open Attribute Value Extraction From Product ProfilesOpenTag: Open Attribute Value Extraction From Product Profiles
OpenTag: Open Attribute Value Extraction From Product Profiles
 
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
Probabilistic Graphical Models for Credibility Analysis in Evolving Online Co...
 
Leveraging Joint Interactions for Credibility Analysis in News Communities
Leveraging Joint Interactions for Credibility Analysis in News CommunitiesLeveraging Joint Interactions for Credibility Analysis in News Communities
Leveraging Joint Interactions for Credibility Analysis in News Communities
 
People on Drugs: Credibility of User Statements in Health Forums
People on Drugs: Credibility of User Statements in Health ForumsPeople on Drugs: Credibility of User Statements in Health Forums
People on Drugs: Credibility of User Statements in Health Forums
 
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
Author-Specific Hierarchical Sentiment Aggregation for Rating Prediction of R...
 
Joint Author Sentiment Topic Model
Joint Author Sentiment Topic ModelJoint Author Sentiment Topic Model
Joint Author Sentiment Topic Model
 
TwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
TwiSent: A Multi-Stage System for Analyzing Sentiment in TwitterTwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
TwiSent: A Multi-Stage System for Analyzing Sentiment in Twitter
 
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
Adaptation of Sentiment Analysis to New Linguistic Features, Informal Languag...
 
Leveraging Sentiment to Compute Word Similarity
Leveraging Sentiment to Compute Word SimilarityLeveraging Sentiment to Compute Word Similarity
Leveraging Sentiment to Compute Word Similarity
 
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarizat...
 
Feature specific analysis of reviews
Feature specific analysis of reviewsFeature specific analysis of reviews
Feature specific analysis of reviews
 
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
YouCat : Weakly Supervised Youtube Video Categorization System from Meta Data...
 
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse AnalysisSentiment Analysis in Twitter with Lightweight Discourse Analysis
Sentiment Analysis in Twitter with Lightweight Discourse Analysis
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Intent Classification of Voice Queries on Mobile Devices

  • 1. Intent Classification of Voice Queries on Mobile DevicesIntent Classification of Voice Queries on Mobile DevicesIntent Classification of Voice Queries on Mobile DevicesIntent Classification of Voice Queries on Mobile Devices Subhabrata Mukherjee†, Ashish Verma†, Kenneth W. Church‡ †IBM India Research Lab, ‡ IBM T.J. Watson Research Center • Mobile query classification is made difficult by short, noisy queries and the presence of inter-active and personalized queries like map (where is the closest restaurant), command-and-control (open facebook), dialogue (how are you?), joke (will you marry me?) etc. • Voice queries are more difficult than typed queries due to the errors introduced by the automatic speech recognizer • In this work we bring the complexities of voice search and intent classification together in proposing a multi-stage classifier for intent classification Introduction • 52,282 unique queries, with total 1,04,950 impressions, are collected from an android voice search application and manually tagged. Avg. number of words per query is 2.3. Avg. word error rate of ASR engine is 20%. The figure below shows the F1 score comparison of different models over 13 most frequent query classes. Evaluation • Stage 1 - Multi-class Naïve Bayes classifier uses bag-of-words, part-of-speech tag information and domain words of the query as features and predicts top K probable classes for a query. • Domain words (DW) of a query are extracted from url’s of top ranked retrieved pages from search engine. For example, the search engine retrieves the DW’s imdb marvel en.wikipedia youtube etc. for the query the avengers. • DW introduce external knowledge in classifier (Movies, Music, Sports etc.) and keeps online data requirement low • POS tags detect patterns for Command-and-Control, Map, Knowledge, Dialogue, broken queries (Endpoint) etc. • Stage 2 - Top K ranked classes are taken with some additional features in a logistic regression classifier. • Features like DW and url ranking, DW and query overlap (Navigational ), substrings having maximum information gain for a class help differentiate between close query categories from Stage 1 • Stage 3 – Prediction of the independent binary classifiers are combined into a single prediction Intent Classification 0 0.2 0.4 0.6 0.8 1 1.2 w eather map nav command know ledge movies dialogue restaurant w eb music sports acceptor endpoint Reg+Vocab+POS+Yahoo Vocab+POS+Yahoo Vocab+Yahoo Vocab+POS Vocab •Navigational - Query DW overlap •Map - Presence of substrings find, close, direction, near, where in query •Movie - Presence of IMDB as the topmost DW •Command-and-Control - Query starting with a Verb •Websearch - Presence of Wikipedia in the topmost two DW’s KnowledgeWhat is pi EndpointHow to spell NavigationalBest buy com Command-and-ControlCall mom MapFind nearest SubwayExample Queries