Temporal information conveyed by language describes how the world around us changes through time. Events, durations and times are all temporal elements that can be viewed as intervals. These intervals are sometimes temporally related in text. Automatically determining the nature of such relations is a complex and unsolved problem. Some words can act as “signals” which suggest a temporal ordering between intervals. We use these signal words to improve the accuracy of a recent approach to classification of temporal links.
Using signals to improve automatic classification of temporal relations
1. Using Signals to Improve Automatic
Classification of Temporal Relations
2. Time in language
● In natural languages, significant effort is devoted to
describing time (tense, aspect, adverbials).
● We have occurrences and states which we can temporally
“position” or ask about.
● Events, points and periods in time can all be related with
language.
● Understanding and processing this information is difficult.
● How can we formally describe time in language?
3. Temporal Annotation
What to annotate?
● Events and time expressions (intervals)
● Temporal, aspectual and subordinate links between intervals
● Signals that indicate recurrence or temporal ordering
TimeML is a formal specification for annotating these kinds of
entity
TimeBank is an annotated corpus of ~65 000 tokens
5. Baseline and corpus
The TLINK classification task is difficult:
● TempEval1: 59%
● TempEval2: 61%
● Mostcommonclass: 5055%
We replicated established recent work, using a merge of
TimeBank v1.2 and the AQUAINT TimeML corpus.
● Eventevent TLINK classification: 60%
6. Feature set
To augment the baseline's features with information about signals:
● Signal phrase
Textual position of event word and signal can affect temporal
interpretation of a relation:
● “I run before I sleep”
● “Before I run I sleep”
We capture ordering using these features:
● Arg1 / signal order
● Signal / Arg2 order
● Token distance between arg1 / signal / arg2