SlideShare a Scribd company logo
1 of 38
Download to read offline
WIN-WIN SEARCH: DUAL-AGENT
STOCHASTIC GAME IN SESSION
SEARCH
Jiyun Luo Sicong Zhang Grace Hui Yang
Department of Computer Science
Georgetown University
{jl1749, sz303}@georgetown.edu
huiyang@cs.georgetown.edu1
2
AGE OF EMPIRE
2
A NEW PERSPECTIVE TO LOOK AT SEARCH
3
Documents
to explore
Information
need
Observed
documents
User
Devise a strategy
for helping the
user explore the
information
space in order to
learn which
documents are
relevant and
which aren’t,
and satisfy their
information
need.
3
WHY USERS MAKE CERTAIN MOVES?
4
Markov Chain of Decision Making States
RELATED WORK
! queries suitable for personalization [Teevan et al. SIGIR’08]
! task types [Kanoulas et al. TREC’12]
! roles of task stage and task type [Liu et al. SIGIR’10]
! session query changes [Guan et al. SIGIR’13]
! user intensions and attention [Carterette et al. CIKM’11]
! user click model [Craswell et al. SIGIR’07]
! page re-ranking [Jin et al. WWW’13]
! Search topics [Jones et al. CIKM’08]
! Ads selection using pomdp[Yuan et al. CIKM’12]
!Our work is a retrieval model
! not a user study
5
OUR SOLUTION
6
Try to find an optimal solution
through a sequence of dynamic
interactions
Trial and Error:
learn from repeated, varied attempts
which are continued until success
6
TRIAL AND ERROR
7
! q1 – "dulles hotels"
! q2 – "dulles airport"
! q3 – "dulles airport
location"
! q4 – "dulles metrostop"
7
8
! Rich interactions
Query formulation, Document clicks, Document
examination, eye movement, mouse movements, etc.
! Temporal dependency
! Overall goal
RECAP – CHARACTERISTICS OF
DYNAMIC IR
8
9
! Model interactions, which means it needs to have place
holders for actions;
! Model information need hidden behind user queries and
other interactions;
! Set up a reward mechanism to guide the entire search
algorithm to adjust its retrieval strategies;
! Represent Markov properties to handle the temporal
dependency.
WHAT IS A DESIRABLE MODEL FOR
DYNAMIC IR
A model in Trial and Error setting will do!
A Markov Model will do!
9
10
! Two agents work together to fulfill the information need
! Dual-agent stochastic game
! Partially Observable Markov Decision Process
! Joint Optimization
!To achieve Win-win
WIN-WIN SEARCH
WIN-WIN SEARCH
11
! A tuple (S, T, A, R, γ, O, Θ, B)
! S : state space
! T: transition matrix
! A: action space(Au, Ase, Σu, Σse)
! R: reward function(Ru, Rse)
! γ: discount factor, 0< γ ≤1
! O: observation set(Ωu, Ωse)
an observation is a symbol emitted according to a hidden state.
! Θ: observation function
Θ(s,a,o) is the probability that o is observed when the system
transitions into state s after taking action a, i.e. P(o|s,a).
! B: belief space
Belief is a probability distribution over hidden states.
12
Name Symbol Meanings
state S the four hidden decision states
user action Au add/remove/keep query terms
search engine
action
Ase increase/decrease/keep term weights, adjust
search techniques, etc.
message from user
to search engine
Σu clicked and SAT clicked documents
message from
search engine to
user
Σse top k returned documents
user's observation Ωu observations that the user makes from the world
search engine's
observation
Ωse observations that the search engine makes from
the world and from the user
user reward Ru relevant information the user gains from
reading the documents
search engine
reward
Rse nDCG that the search gains by returning
documents
belief state B belief states generated from the belief updater
and shared by both agents
STATES (S)
13
SRT
Relevant &
Exploitation
SRR
Relevant &
Exploration
SNRT
Non-Relevant
& Exploitation
SNRR
Non-Relevant
& Exploration
! scooter price ⟶ scooter
stores
! collecting old US coins⟶
selling old US coins
! Philadelphia NYC travel ⟶
Philadelphia NYC train
! Boston tourism ⟶ NYC
tourism
q0
ACTIONS (AU, ASE, ΣU, ΣSE)
! User Action (Au)
! add query terms (+Δq)
! remove query terms (-Δq)
! keep query terms (qtheme)
! Search Engine Action(Ase)
! increase term weights
! decrease term weights
! keep term weights
! adjust search techniques, etc.
! Message from the user(Σu)
! clicked documents
! SAT clicked documents
! Message from search engine(Σse)
! top k returned documents 14
1. At iteration t,
the user agent
takes action *+
,
(query change).
15
2. The search engine picks the best action *-.
, to search
DUAL-AGENT STOCHASTIC GAME
3. Search engine
returns
document set Dt
as message 4-.
, .
16
4. The user agent
examines Dt
and sends clicks
as feedback
messages 4+
, .
34
DUAL-AGENT STOCHASTIC GAME
Messages are essentially documents that an agent
thinks they are relevant.
DUAL-AGENT STOCHASTIC GAME
5. The user agent
again makes
action 5+
,67
(query changes).
6. The world
moves into
iteration t + 1.
7. The loop
continues
17
4 3
OBSERVATION FUNCTION (O)
18
Probability of making observation ω after
taking action a and landing in state s
e.g., Prob. of making observation ω after
taking action a and landing in state
SRT=O(SREL, a, ω)O(SEXPLOITATION, a, ω)
OBSERVATION FUNCTION (O)
! Intuition """" Relevant or Non-relevant?
! Observation function
89:, ; Re=, 4+, ?, ; Re=) ∝ A9:, ; Re=|?, ; Re=)A9?, ; Re=|4+)
! A :, ; Re= ?, ; Re= and A9?, ; CD=|4+) are estimated from
! log data
! TREC ground truth. 19
st is likely to be
Relevant
Non-Relevant
If ∃d ∈ D∃d ∈ D∃d ∈ D∃d ∈ Dtttt----1111 andandandand
d is SAT Clickedd is SAT Clickedd is SAT Clickedd is SAT Clicked
otherwise
# TU TV:DWXDY WD=DX5Z[D
# TU TV:DWX5]TZ:
# TU ob:DWXDY W_D WD=DX5Z[D
# TUTV:DWXDY WD=DX5Z[D
! Intuition """" Exploration or Exploitation˛˛˛˛
! Observation Function
89:, ; `ab=TW5]TZ, 5+ ; cde,, 4-. ; f,g7, ?, ; `ab=TW5]TZ)
∝ A9:, ; `ab=TW5]TZ|?, ; `ab=TW5]TZ)A9?, ; `ab=TW5]TZ| c de,, f,g7)
! A9:, ; `ab=TW5]TZ|?, ; `ab=TW5]TZ) 5ZY A9?, ; `ab=TW5]TZ| c de,, f,g7)
are estimated
! log data
! human judgment.
20
st is likely to be
Exploration
Exploitation
if 9c9c9c9cΔΔΔΔqqqqtttt≠∅ and c≠∅ and c≠∅ and c≠∅ and cΔΔΔΔqqqqtttt∉D∉D∉D∉Dtttt----1111))))
oooor 9r 9r 9r 9ccccΔΔΔΔqqqqtttt;;;;∅∅∅∅ andandandand ----ΔΔΔΔqqqqtttt≠∅≠∅≠∅≠∅ ))))
if 9c9c9c9cΔΔΔΔqqqqtttt≠∅ and c≠∅ and c≠∅ and c≠∅ and cΔΔΔΔqqqqtttt∈∈∈∈DDDDtttt----1111))))
oooor 9r 9r 9r 9ccccΔΔΔΔqqqqtttt;;;;∅∅∅∅ andandandand ––––ΔΔΔΔqqqqtttt;∅ );∅ );∅ );∅ )
OBSERVATION FUNCTION (O)
# TU TV:DWXDY Dab=TW5]TZ Y_D T 5YY DWl:
# TU TV:DWX5]TZ: Y_D T 5YY DWl:
# TU TV:DWXDY W_D Dab=TW5]TZ
# TU TV:DWXDY Dab=TW5]TZ
! At every search iteration the belief state b is updated
when a new observation is obtained.
21
V,679:m) ; A9:m|?,, 5,, V,n
;
A9?,|:m, 5,, V,) o A9:m|:p, 5,, V,)V,9:pn
-q∈r
A9?,|5,, V,)
;
89:m, 5,, ?,) o A9:m|:p, 5,, V,)V,9:pn
-q∈r
A9?,|5,, V,)
BELIEF UPDATES (B)
22
! q1=“best US destinations”
observation= NRR
SRT
Relevant &
Exploitation
0.1784
SRR
Relevant &
Exploration
0.1135
SNRT
Non-Relevant
& Exploitation
0.2838
SNRR
Non-Relevant
& Exploration
0.4243
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
BELIEF UPDATES (B)
q0
23
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
SRT
Relevant &
Exploitation
0.0005
SRR
Relevant &
Exploration
0.0068
SNRT
Non-Relevant
& Exploitation
0.0715
SNRR
Non-Relevant
& Exploration
0.9212
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
BELIEF UPDATES (B)
q0
24
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
SRT
Relevant &
Exploitation
0.0005
SRR
Relevant &
Exploration
0.0068
SNRT
Non-Relevant
& Exploitation
0.0715
SNRR
Non-Relevant
& Exploration
0.9212
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
BELIEF UPDATES (B)
q0
25
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
! q3=“maps.bing.com”
observation = NRT
SRT
Relevant &
Exploitation
0.0151
SRR
Relevant &
Exploration
0.4347
SNRT
Non-Relevant
& Exploitation
0.0276
SNRR
Non-Relevant
& Exploration
0.5226
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
BELIEF UPDATES (B)
q0
26
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
! q3=“maps.bing.com”
observation = NRT
SRT
Relevant &
Exploitation
0.0151
SRR
Relevant &
Exploration
0.4347
SNRT
Non-Relevant
& Exploitation
0.0276
SNRR
Non-Relevant
& Exploration
0.5226
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
BELIEF UPDATES (B)
q0
27
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
! q3=“maps.bing.com”
observation = NRT
SRT
Relevant &
Exploitation
0.0291
SRR
Relevant &
Exploration
0.7837
SNRT
Non-Relevant
& Exploitation
0.0081
SNRR
Non-Relevant
& Exploration
0.1790
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
! q20=“Philadelphia NYC
train”
observation = NRT
……
BELIEF UPDATES (B)
q0
28
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
! q3=“maps.bing.com”
observation = NRT
SRT
Relevant &
Exploitation
0.0291
SRR
Relevant &
Exploration
0.7837
SNRT
Non-Relevant
& Exploitation
0.0081
SNRR
Non-Relevant
& Exploration
0.1790
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
! q20=“Philadelphia NYC
train”
observation = NRT
……
BELIEF UPDATES (B)
q0
29
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
! q3=“maps.bing.com”
observation = NRT
SRT
Relevant &
Exploitation
0.0304
SRR
Relevant &
Exploration
0.8126
SNRT
Non-Relevant
& Exploitation
0.0066
SNRR
Non-Relevant
& Exploration
0.1505
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
……
! q20=“Philadelphia NYC
train”
observation = NRT
! q21=“Philadelphia NYC
bus”
observation = NRT
BELIEF UPDATES (B)
q0
30
! q1=“best US destinations”
observation= NRR
! q2=“distance New York
Boston”
observation = RT
! q3=“maps.bing.com”
observation = NRT
SRT
Relevant &
Exploitation
0.0304
SRR
Relevant &
Exploration
0.8126
SNRT
Non-Relevant
& Exploitation
0.0066
SNRR
Non-Relevant
& Exploration
0.1505
TREC’13 session #87 topic: planning a trip to the United States. You will be there for
a month and able to travel within a 150-mile radius of your destination. What are the
best cities to visit?
……
! q20=“Philadelphia NYC
train”
observation = NRT
! q21=“Philadelphia NYC
bus”
observation = NRT
BELIEF UPDATES (B)
q0
! The long term reward function for the search engine agent
! The long tern reward function for the user agent
! Joint optimization
31
s-.9V, 5) ; oV9:)C9:, 5)
-∈r
c t o A9?|V, 5+, 4-.)A9?|V, 4+)l5a
u
s-.9Vv, 5w
x∈y
s+9V, 5+) ; C9:, 5+) c t z {9:,|:,g7, f,g7)u|
max-~•€
s+9:,g7, 5+)
= P(qt|d) +t z P9e,|e,g7, f,g7, 5)u
max‚~•€
A 9e,g7|f,g7)
5-. ; argmax
u
9s-.9V, 5) c s+9V, 5+))
JOINT OPTIMIZATION — WIN-WIN
EXPERIMENTS
! Evaluate on TREC 2012 and 2013 Session Tracks
! The session logs contain
! session topic
! user queries
! previously retrieved URLs, snippets
! user clicks, and dwell time etc.
! Task: retrieve 2,000 documents for the last query in each
session
! The evaluation is based on the whole session.
! A document related to any query in the session is a good document
32
! Datasets
! ClueWeb09 CatB
! ClueWeb12 CatB
! spam documents are
removed
! duplicated documents
are removed
ACTIONS
! increasing weights of the added terms by a factor of
x={1.05, 1.10, 1.15, 1.20, 1.25, 1.5, 1.75 or 2};
! decreasing weights of the added terms by a factor of
y={0.5, 0.57, 0.67, 0.8, 0.83, 0.87, 0.9 or 0.95};
! QCM proposed in Guan et. al SIGIR’13;
! Pseudo Relevance Feedback which assumes the top
20 retrieved documents are relevant;
! directly uses the query in current iteration to perform
retrieval;
! combines all queries in a session weights them
equally.
33
SEARCH ACCURACY
! Search accuracy on TREC 2012 Session Track
34
TREC 2012 Session Track
# Win-win outperforms most retrieval algorithms on TREC 2012.
35
# Systems in TREC 2012 perform better than in TREC 2013.
# many relevant documents are not included in ClueWeb12 CatB collection
# Win-win outperforms all retrieval algorithms on TREC 2013.
# It is highly effective in Session Search.
SEARCH ACCURACY
! Search accuracy on TREC 2013 Session Track
TREC 2013 Session Track
IMMEDIATE SEARCH ACCURACY
36
# Original run: top returned documents provided by TREC log data
# win-win’s immediate search accuracy is better than the Original at
every iteration
# win-win's immediate search accuracy increases while the number of
search iterations increases
TREC 2012 Session Track TREC 2013 Session Track
Conclusions
37
! A novel session search framework
! Model the interactions between user and search
engine as a dual-agent stochastic game
! Able to perform efficient optimization
! a finite discrete set of states and actions
! Jointly search for the goal in a trial-and-error
manner
THANK YOU
huiyang@cs.georgetown.edu
38

More Related Content

Similar to Win-Win Search: Dual-Agent Stochastic Game in Session Search (SIGIR 2014)

10. Getting Spatial
10. Getting Spatial10. Getting Spatial
10. Getting SpatialFAO
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics EnvironmentIan Foster
 
RではじめるTwitter解析
RではじめるTwitter解析RではじめるTwitter解析
RではじめるTwitter解析Takeshi Arabiki
 
10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRoll
10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRoll10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRoll
10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRollBrian Troutwine
 
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfcookie1969
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with RYanchang Zhao
 
NORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docx
NORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docxNORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docx
NORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docxcurwenmichaela
 
AiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdfAiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdfAjayRawat829497
 
HON4D (O. Oreifej et al., CVPR2013)
HON4D (O. Oreifej et al., CVPR2013)HON4D (O. Oreifej et al., CVPR2013)
HON4D (O. Oreifej et al., CVPR2013)Mitsuru Nakazawa
 
Is your excel production code?
Is your excel production code?Is your excel production code?
Is your excel production code?ProCogia
 
Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...butest
 
方策勾配型強化学習の基礎と応用
方策勾配型強化学習の基礎と応用方策勾配型強化学習の基礎と応用
方策勾配型強化学習の基礎と応用Ryo Iwaki
 
Introduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdfIntroduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdfTulasiramKandula1
 
A Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.ioA Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.ioRandall Hunt
 
텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법
텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법
텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법Seungwoo Yeom
 
R getting spatial
R getting spatialR getting spatial
R getting spatialFAO
 
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RFinding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RRevolution Analytics
 

Similar to Win-Win Search: Dual-Agent Stochastic Game in Session Search (SIGIR 2014) (20)

10. Getting Spatial
10. Getting Spatial10. Getting Spatial
10. Getting Spatial
 
Open Analytics Environment
Open Analytics EnvironmentOpen Analytics Environment
Open Analytics Environment
 
RではじめるTwitter解析
RではじめるTwitter解析RではじめるTwitter解析
RではじめるTwitter解析
 
10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRoll
10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRoll10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRoll
10 Billion a Day, 100 Milliseconds Per: Monitoring Real-Time Bidding at AdRoll
 
Piano rubyslava final
Piano rubyslava finalPiano rubyslava final
Piano rubyslava final
 
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with R
 
NORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docx
NORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docxNORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docx
NORTHERN ILLINOIS UNIVERSITY PHYSICS DEPARTMENT .docx
 
R meetup talk
R meetup talkR meetup talk
R meetup talk
 
AiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdfAiCore Brochure 27-Mar-2023-205529.pdf
AiCore Brochure 27-Mar-2023-205529.pdf
 
HON4D (O. Oreifej et al., CVPR2013)
HON4D (O. Oreifej et al., CVPR2013)HON4D (O. Oreifej et al., CVPR2013)
HON4D (O. Oreifej et al., CVPR2013)
 
Is your excel production code?
Is your excel production code?Is your excel production code?
Is your excel production code?
 
Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...Use of data mining techniques in the discovery of spatial and ...
Use of data mining techniques in the discovery of spatial and ...
 
方策勾配型強化学習の基礎と応用
方策勾配型強化学習の基礎と応用方策勾配型強化学習の基礎と応用
方策勾配型強化学習の基礎と応用
 
10. R getting spatial
10.  R getting spatial10.  R getting spatial
10. R getting spatial
 
Introduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdfIntroduction to computing Processing and performance.pdf
Introduction to computing Processing and performance.pdf
 
A Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.ioA Century Of Weather Data - Midwest.io
A Century Of Weather Data - Midwest.io
 
텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법
텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법
텍스트 랭크 알고리즘을 이용한 실시간 이슈 데이터 분석법
 
R getting spatial
R getting spatialR getting spatial
R getting spatial
 
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RFinding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
 

Recently uploaded

Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 

Recently uploaded (20)

Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 

Win-Win Search: Dual-Agent Stochastic Game in Session Search (SIGIR 2014)

  • 1. WIN-WIN SEARCH: DUAL-AGENT STOCHASTIC GAME IN SESSION SEARCH Jiyun Luo Sicong Zhang Grace Hui Yang Department of Computer Science Georgetown University {jl1749, sz303}@georgetown.edu huiyang@cs.georgetown.edu1
  • 3. A NEW PERSPECTIVE TO LOOK AT SEARCH 3 Documents to explore Information need Observed documents User Devise a strategy for helping the user explore the information space in order to learn which documents are relevant and which aren’t, and satisfy their information need. 3
  • 4. WHY USERS MAKE CERTAIN MOVES? 4 Markov Chain of Decision Making States
  • 5. RELATED WORK ! queries suitable for personalization [Teevan et al. SIGIR’08] ! task types [Kanoulas et al. TREC’12] ! roles of task stage and task type [Liu et al. SIGIR’10] ! session query changes [Guan et al. SIGIR’13] ! user intensions and attention [Carterette et al. CIKM’11] ! user click model [Craswell et al. SIGIR’07] ! page re-ranking [Jin et al. WWW’13] ! Search topics [Jones et al. CIKM’08] ! Ads selection using pomdp[Yuan et al. CIKM’12] !Our work is a retrieval model ! not a user study 5
  • 6. OUR SOLUTION 6 Try to find an optimal solution through a sequence of dynamic interactions Trial and Error: learn from repeated, varied attempts which are continued until success 6
  • 7. TRIAL AND ERROR 7 ! q1 – "dulles hotels" ! q2 – "dulles airport" ! q3 – "dulles airport location" ! q4 – "dulles metrostop" 7
  • 8. 8 ! Rich interactions Query formulation, Document clicks, Document examination, eye movement, mouse movements, etc. ! Temporal dependency ! Overall goal RECAP – CHARACTERISTICS OF DYNAMIC IR 8
  • 9. 9 ! Model interactions, which means it needs to have place holders for actions; ! Model information need hidden behind user queries and other interactions; ! Set up a reward mechanism to guide the entire search algorithm to adjust its retrieval strategies; ! Represent Markov properties to handle the temporal dependency. WHAT IS A DESIRABLE MODEL FOR DYNAMIC IR A model in Trial and Error setting will do! A Markov Model will do! 9
  • 10. 10 ! Two agents work together to fulfill the information need ! Dual-agent stochastic game ! Partially Observable Markov Decision Process ! Joint Optimization !To achieve Win-win WIN-WIN SEARCH
  • 11. WIN-WIN SEARCH 11 ! A tuple (S, T, A, R, γ, O, Θ, B) ! S : state space ! T: transition matrix ! A: action space(Au, Ase, Σu, Σse) ! R: reward function(Ru, Rse) ! γ: discount factor, 0< γ ≤1 ! O: observation set(Ωu, Ωse) an observation is a symbol emitted according to a hidden state. ! Θ: observation function Θ(s,a,o) is the probability that o is observed when the system transitions into state s after taking action a, i.e. P(o|s,a). ! B: belief space Belief is a probability distribution over hidden states.
  • 12. 12 Name Symbol Meanings state S the four hidden decision states user action Au add/remove/keep query terms search engine action Ase increase/decrease/keep term weights, adjust search techniques, etc. message from user to search engine Σu clicked and SAT clicked documents message from search engine to user Σse top k returned documents user's observation Ωu observations that the user makes from the world search engine's observation Ωse observations that the search engine makes from the world and from the user user reward Ru relevant information the user gains from reading the documents search engine reward Rse nDCG that the search gains by returning documents belief state B belief states generated from the belief updater and shared by both agents
  • 13. STATES (S) 13 SRT Relevant & Exploitation SRR Relevant & Exploration SNRT Non-Relevant & Exploitation SNRR Non-Relevant & Exploration ! scooter price ⟶ scooter stores ! collecting old US coins⟶ selling old US coins ! Philadelphia NYC travel ⟶ Philadelphia NYC train ! Boston tourism ⟶ NYC tourism q0
  • 14. ACTIONS (AU, ASE, ΣU, ΣSE) ! User Action (Au) ! add query terms (+Δq) ! remove query terms (-Δq) ! keep query terms (qtheme) ! Search Engine Action(Ase) ! increase term weights ! decrease term weights ! keep term weights ! adjust search techniques, etc. ! Message from the user(Σu) ! clicked documents ! SAT clicked documents ! Message from search engine(Σse) ! top k returned documents 14
  • 15. 1. At iteration t, the user agent takes action *+ , (query change). 15 2. The search engine picks the best action *-. , to search DUAL-AGENT STOCHASTIC GAME
  • 16. 3. Search engine returns document set Dt as message 4-. , . 16 4. The user agent examines Dt and sends clicks as feedback messages 4+ , . 34 DUAL-AGENT STOCHASTIC GAME Messages are essentially documents that an agent thinks they are relevant.
  • 17. DUAL-AGENT STOCHASTIC GAME 5. The user agent again makes action 5+ ,67 (query changes). 6. The world moves into iteration t + 1. 7. The loop continues 17 4 3
  • 18. OBSERVATION FUNCTION (O) 18 Probability of making observation ω after taking action a and landing in state s e.g., Prob. of making observation ω after taking action a and landing in state SRT=O(SREL, a, ω)O(SEXPLOITATION, a, ω)
  • 19. OBSERVATION FUNCTION (O) ! Intuition """" Relevant or Non-relevant? ! Observation function 89:, ; Re=, 4+, ?, ; Re=) ∝ A9:, ; Re=|?, ; Re=)A9?, ; Re=|4+) ! A :, ; Re= ?, ; Re= and A9?, ; CD=|4+) are estimated from ! log data ! TREC ground truth. 19 st is likely to be Relevant Non-Relevant If ∃d ∈ D∃d ∈ D∃d ∈ D∃d ∈ Dtttt----1111 andandandand d is SAT Clickedd is SAT Clickedd is SAT Clickedd is SAT Clicked otherwise # TU TV:DWXDY WD=DX5Z[D # TU TV:DWX5]TZ: # TU ob:DWXDY W_D WD=DX5Z[D # TUTV:DWXDY WD=DX5Z[D
  • 20. ! Intuition """" Exploration or Exploitation˛˛˛˛ ! Observation Function 89:, ; `ab=TW5]TZ, 5+ ; cde,, 4-. ; f,g7, ?, ; `ab=TW5]TZ) ∝ A9:, ; `ab=TW5]TZ|?, ; `ab=TW5]TZ)A9?, ; `ab=TW5]TZ| c de,, f,g7) ! A9:, ; `ab=TW5]TZ|?, ; `ab=TW5]TZ) 5ZY A9?, ; `ab=TW5]TZ| c de,, f,g7) are estimated ! log data ! human judgment. 20 st is likely to be Exploration Exploitation if 9c9c9c9cΔΔΔΔqqqqtttt≠∅ and c≠∅ and c≠∅ and c≠∅ and cΔΔΔΔqqqqtttt∉D∉D∉D∉Dtttt----1111)))) oooor 9r 9r 9r 9ccccΔΔΔΔqqqqtttt;;;;∅∅∅∅ andandandand ----ΔΔΔΔqqqqtttt≠∅≠∅≠∅≠∅ )))) if 9c9c9c9cΔΔΔΔqqqqtttt≠∅ and c≠∅ and c≠∅ and c≠∅ and cΔΔΔΔqqqqtttt∈∈∈∈DDDDtttt----1111)))) oooor 9r 9r 9r 9ccccΔΔΔΔqqqqtttt;;;;∅∅∅∅ andandandand ––––ΔΔΔΔqqqqtttt;∅ );∅ );∅ );∅ ) OBSERVATION FUNCTION (O) # TU TV:DWXDY Dab=TW5]TZ Y_D T 5YY DWl: # TU TV:DWX5]TZ: Y_D T 5YY DWl: # TU TV:DWXDY W_D Dab=TW5]TZ # TU TV:DWXDY Dab=TW5]TZ
  • 21. ! At every search iteration the belief state b is updated when a new observation is obtained. 21 V,679:m) ; A9:m|?,, 5,, V,n ; A9?,|:m, 5,, V,) o A9:m|:p, 5,, V,)V,9:pn -q∈r A9?,|5,, V,) ; 89:m, 5,, ?,) o A9:m|:p, 5,, V,)V,9:pn -q∈r A9?,|5,, V,) BELIEF UPDATES (B)
  • 22. 22 ! q1=“best US destinations” observation= NRR SRT Relevant & Exploitation 0.1784 SRR Relevant & Exploration 0.1135 SNRT Non-Relevant & Exploitation 0.2838 SNRR Non-Relevant & Exploration 0.4243 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? BELIEF UPDATES (B) q0
  • 23. 23 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT SRT Relevant & Exploitation 0.0005 SRR Relevant & Exploration 0.0068 SNRT Non-Relevant & Exploitation 0.0715 SNRR Non-Relevant & Exploration 0.9212 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? BELIEF UPDATES (B) q0
  • 24. 24 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT SRT Relevant & Exploitation 0.0005 SRR Relevant & Exploration 0.0068 SNRT Non-Relevant & Exploitation 0.0715 SNRR Non-Relevant & Exploration 0.9212 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? BELIEF UPDATES (B) q0
  • 25. 25 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT ! q3=“maps.bing.com” observation = NRT SRT Relevant & Exploitation 0.0151 SRR Relevant & Exploration 0.4347 SNRT Non-Relevant & Exploitation 0.0276 SNRR Non-Relevant & Exploration 0.5226 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? BELIEF UPDATES (B) q0
  • 26. 26 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT ! q3=“maps.bing.com” observation = NRT SRT Relevant & Exploitation 0.0151 SRR Relevant & Exploration 0.4347 SNRT Non-Relevant & Exploitation 0.0276 SNRR Non-Relevant & Exploration 0.5226 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? BELIEF UPDATES (B) q0
  • 27. 27 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT ! q3=“maps.bing.com” observation = NRT SRT Relevant & Exploitation 0.0291 SRR Relevant & Exploration 0.7837 SNRT Non-Relevant & Exploitation 0.0081 SNRR Non-Relevant & Exploration 0.1790 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? ! q20=“Philadelphia NYC train” observation = NRT …… BELIEF UPDATES (B) q0
  • 28. 28 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT ! q3=“maps.bing.com” observation = NRT SRT Relevant & Exploitation 0.0291 SRR Relevant & Exploration 0.7837 SNRT Non-Relevant & Exploitation 0.0081 SNRR Non-Relevant & Exploration 0.1790 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? ! q20=“Philadelphia NYC train” observation = NRT …… BELIEF UPDATES (B) q0
  • 29. 29 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT ! q3=“maps.bing.com” observation = NRT SRT Relevant & Exploitation 0.0304 SRR Relevant & Exploration 0.8126 SNRT Non-Relevant & Exploitation 0.0066 SNRR Non-Relevant & Exploration 0.1505 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? …… ! q20=“Philadelphia NYC train” observation = NRT ! q21=“Philadelphia NYC bus” observation = NRT BELIEF UPDATES (B) q0
  • 30. 30 ! q1=“best US destinations” observation= NRR ! q2=“distance New York Boston” observation = RT ! q3=“maps.bing.com” observation = NRT SRT Relevant & Exploitation 0.0304 SRR Relevant & Exploration 0.8126 SNRT Non-Relevant & Exploitation 0.0066 SNRR Non-Relevant & Exploration 0.1505 TREC’13 session #87 topic: planning a trip to the United States. You will be there for a month and able to travel within a 150-mile radius of your destination. What are the best cities to visit? …… ! q20=“Philadelphia NYC train” observation = NRT ! q21=“Philadelphia NYC bus” observation = NRT BELIEF UPDATES (B) q0
  • 31. ! The long term reward function for the search engine agent ! The long tern reward function for the user agent ! Joint optimization 31 s-.9V, 5) ; oV9:)C9:, 5) -∈r c t o A9?|V, 5+, 4-.)A9?|V, 4+)l5a u s-.9Vv, 5w x∈y s+9V, 5+) ; C9:, 5+) c t z {9:,|:,g7, f,g7)u| max-~•€ s+9:,g7, 5+) = P(qt|d) +t z P9e,|e,g7, f,g7, 5)u max‚~•€ A 9e,g7|f,g7) 5-. ; argmax u 9s-.9V, 5) c s+9V, 5+)) JOINT OPTIMIZATION — WIN-WIN
  • 32. EXPERIMENTS ! Evaluate on TREC 2012 and 2013 Session Tracks ! The session logs contain ! session topic ! user queries ! previously retrieved URLs, snippets ! user clicks, and dwell time etc. ! Task: retrieve 2,000 documents for the last query in each session ! The evaluation is based on the whole session. ! A document related to any query in the session is a good document 32 ! Datasets ! ClueWeb09 CatB ! ClueWeb12 CatB ! spam documents are removed ! duplicated documents are removed
  • 33. ACTIONS ! increasing weights of the added terms by a factor of x={1.05, 1.10, 1.15, 1.20, 1.25, 1.5, 1.75 or 2}; ! decreasing weights of the added terms by a factor of y={0.5, 0.57, 0.67, 0.8, 0.83, 0.87, 0.9 or 0.95}; ! QCM proposed in Guan et. al SIGIR’13; ! Pseudo Relevance Feedback which assumes the top 20 retrieved documents are relevant; ! directly uses the query in current iteration to perform retrieval; ! combines all queries in a session weights them equally. 33
  • 34. SEARCH ACCURACY ! Search accuracy on TREC 2012 Session Track 34 TREC 2012 Session Track # Win-win outperforms most retrieval algorithms on TREC 2012.
  • 35. 35 # Systems in TREC 2012 perform better than in TREC 2013. # many relevant documents are not included in ClueWeb12 CatB collection # Win-win outperforms all retrieval algorithms on TREC 2013. # It is highly effective in Session Search. SEARCH ACCURACY ! Search accuracy on TREC 2013 Session Track TREC 2013 Session Track
  • 36. IMMEDIATE SEARCH ACCURACY 36 # Original run: top returned documents provided by TREC log data # win-win’s immediate search accuracy is better than the Original at every iteration # win-win's immediate search accuracy increases while the number of search iterations increases TREC 2012 Session Track TREC 2013 Session Track
  • 37. Conclusions 37 ! A novel session search framework ! Model the interactions between user and search engine as a dual-agent stochastic game ! Able to perform efficient optimization ! a finite discrete set of states and actions ! Jointly search for the goal in a trial-and-error manner