Slides of the second paper on the ULiS project, availiable at http://maxime-lefrancois.info/Publications
We are interested in bridging the world of natural language and the world of the semantic web in particular to support multilingual access to the web of data. In this paper we introduce the ULiS project, that aims at designing a pivot-based NLP technique called Universal Linguistic System, 100% using the semantic web formalisms, and being compliant with the Meaning-Text theory. Through the ULiS, a user could interact with an interlingual knowledge base (IKB) in controlled natural language. Linguistic resources themselves are part of a specific IKB: The Universal Lexical Knowledge base (ULK), so that actors may enhance their controlled natural language, through requests in controlled natural language. We describe a basic interaction scenario at the system level, and provide an overview of the architecture of ULiS. We then introduce the core of the ULiS: the interlingual lexical ontology (ILexicOn), in which each interlingual lexical unit class (ILUc) supports the projection of its semantic decomposition on itself. We validate our model with a standalone ILexicOn, and introduce and explain a concise human-readable notation for it.
1. ULiS: An Expert System on Linguistics
to Support Multilingual Management
of Interlingual Semantic Web Knowledge bases
Maxime Lefrançois & Fabien Gandon
Edelweiss – INRIA Sophia-Antipolis – France
{Maxime.Lefrancois|Fabien.Gandon}@inria.fr
MSW 2011
2. 3000 < natural languages < 7000 >31 billion RDF triples
295 data sets
Interlinked by >503 million RDF links
?
multilingual access to the web of data
The ULiS project
Lefrançois & Gandon, ULiS. MSW 2011 - 2
3. OUTLINE
The Meaning-Text Theory
The ULiS project: multilingual access to web of data
Proposal 1:
An architecture that unlocks motivating scenarios
The ILexicOn : defining interlingual lexical units
Proposal 2:
A formal way to represent lexicographic definitions
Lefrançois & Gandon, ULiS. MSW 2011 - 3
5. The Meaning-Text Theory
A theoretical linguistic framework to
construct models of natural language
Mel’čuk et. al., late 70’s > today
Lefrançois & Gandon, ULiS. MSW 2011 - 5
6. The Meaning-Text Theory
Systems of explicit rules
that express correspondences
between meaning and text
SemRs
7 levels of linguistic representation
For a set of synonymous utterances DSynRs
SSynRs
DMorphRs
SMorphRs
DPhonRs
SPhonRs
Lefrançois & Gandon, ULiS. MSW 2011 - 6
7. The Meaning-Text Theory
The masterpiece:
The Explanatory Combinatorial Dictionary
Meta- Ontology Data
SemRs
Explanatory
Combinatorial
Dictionary DSynRs
Description of the SSynRs
Explanatory
Combinatorial Explantory: DMorphRs
Dictionary Formal semantic explanations
SMorphRs
Combinatorial: DPhonRs
Lexical combinations
SPhonRs
Lefrançois & Gandon, ULiS. MSW 2011 - 7
8. Structure of the ECD entry
Semantic zone:
(formal?) (text or SemR?) lexical definition of lexical units
Semantic actants: arguments of the semantic predicate
denotational vs. connotational semantic elements (ex: EN hot air, FR wind, RU water).
Phonological/Graphematic zone:
orthograph + prosody
The cooccurrence zone
Lefrançois & Gandon, ULiS. MSW 2011 - 8
9. Structure of the ECD entry
- The cooccurrence zone
Morphological sub-zone:
inflections, derivations.
Syntactic sub-zone:
Government pattern
Correspondance semantic – syntax (based on Dependency grammars)
ways to express a semantic predicate in the language
How the lexical unit participates in specific constructions as a dependent
Lexical sub-zone
semantic derivations and collocations
Stylistic sub-zone
Usage labels
Pragmatic sub-zone
real-life situations where expressions are appropriate.
Lefrançois & Gandon, ULiS. MSW 2011 - 9
10. Structure of the ECD entry
- Examples of lexical functions
lexical functions :
paradigmatic
Syn(telephoneV) = phoneV
Anti
Conv21(preced) = follow
Gener(republic) = state
Figur(rain) = curtain (a curtain of rain)
S0 (analyse) = analysis
A0(city) = urban
V0(analysis) = analyse
Adv0(slow) = slowly
S1(teach) = teacher ; S2(teach) = subject ; S3(teach) = pupil;
… 19 in total
Lefrançois & Gandon, ULiS. MSW 2011 - 10
11. Structure of the ECD entry
- Examples of lexical functions
lexical functions :
syntagmatic
Magn(laughV) = one’s head off
Bon(service) = first-class
Locin(height) = at [a height of…]
Oper1(supportN) = [to] lend [~ to N]
Oper2(supportN) = [to] receive [~ to N]
Func1(blownN) = comes [from ~]
… 34 in total
+ non-standard lexical functions
WithNoSugar(coffee) = black
Lefrançois & Gandon, ULiS. MSW 2011 - 11
12. 2. The ULiS project:
Multilingual access to the web of data
?
Lefrançois & Gandon, ULiS. MSW 2011
14. The ULiS project
a Universal Linguistic System
Interlanguage
Convert
Deconvert
Lefrançois & Gandon, ULiS. MSW 2011 - 14
15. The ULiS project
a Universal Linguistic System
to redesign
Pivot-based NLP
technique
100% using
ULiS
Semantic Web
formalisms
Meaning-Text
Theory
compliant with
Lefrançois & Gandon, ULiS. MSW 2011 - 15
17. The ULiS project - scenario
- Machine Translation
-
-
The RDF-World
RDF interlingual
representations
IR RDF
RDF situational John01 kill@past Mary01. John01 tuer@past Mary01.
representations SR RDF
John01 kill@past Mary01.
SR RDF
John01 tuer@past Mary01
John EN Mary
killed John FR Mary
a tué
InputTEXT
John killed Mary. John a tué TEXT
Output1 Mary.
Lefrançois & Gandon, ULiS. MSW 2011 - 17
18. The ULiS project - scenario
- Machine Translation
-
-
The RDF-World
RDF interlingual
representations
RDF situational John01 kill@past Mary01. John01 tuer@past Mary01.
representations John01 kill@past Mary01. John01 tuer@past Mary01
John killed Mary John a tué Mary
John killed Mary. John a tué Mary.
Lefrançois & Gandon, ULiS. MSW 2011 - 18
19. The ULiS project - scenario
- Machine Translation
- Management of Interlingual Knowledge Bases
-
The RDF-World
SELECT ?person
SPARQLRDF + X RDF
SPARQL WHERE {
Request ?person ikb:kill ikb:Mary01. IKB
IDBpedia
RDF X RDF
ikb:John01 RDF
Output
}
RDF interlingual
representations
IR RDF IR RDF
RDF situational Who kill@past@? Mary01 John01 kill@past Mary01.
representations SR RDF
Who kill@past@? Mary01 SR RDF
John01 kill@past Mary01.
EN
Who killed Mary ? John EN Mary
killed
InputTEXT
Who killed Mary? John killed TEXT
Output2 Mary.
Lefrançois & Gandon, ULiS. MSW 2011 - 19
20. The ULiS project - scenario
- Machine Translation
- Management of Interlingual Knowledge Bases
-
The RDF-World
SELECT ?person
SPARQL WHERE {
ikb:John01 RDF
Request ?person ikb:kill ikb:Mary01.
}
IDBpedia Output
RDF interlingual
representations
RDF situational Who kill@past@? Mary01 John01 kill@past Mary01.
representations Who kill@past@? Mary01 John01 kill@past Mary01.
Who killed Mary ? John killed Mary
Who killed Mary? John killed Mary.
Lefrançois & Gandon, ULiS. MSW 2011 - 20
21. The ULiS project - scenario
- Machine Translation
- Management of Interlingual Knowledge Bases
- Management of the Universal Linguistic Knowledge base
The RDF-World
SPARQLRDF + X RDF X RDF
SPARQL
RDF
Request ULKRDF
IKB RDF Output
RDF interlingual
representations
IR RDF IR RDF
RDF situational
representations SRFR RDF SRFR RDF SRFR RDF
InputTEXT Output1TEXT Output2TEXT
Lefrançois & Gandon, ULiS. MSW 2011 - 21
32. With the Semantic Web formalisms,
We designed a simple ILexiMOn…
Interlingual Lexical Units Classes may
be formally defined in the ILexicOn…
…by supporting the projection
of their semantic decomposition
on themselves
Lefrançois & Gandon, ILexicOn.
33. The three layers
owl:Class owl:ObjectProperty xsd:boolean
OWL owl:intersectionOf is-a
is-a
owl:propertyChainAxiom
owl:unionOf owl:hasSelf
subClassOf is-a subClassOf
core-ILexiMOn layer
:ILexicalUnit :ISemanticRelation
is-a :onISemanticRelation
subClassOf
:allValuesFrom
:ILexicalPrimitive
is-a :isObligatory range
:Entity :hasEntity
ILexicOn layer
:State true
:Person
class/instance A B
A is an instance of B
intersectionOf
property
A B
:Alive A is a subClass of B
Data-layer
B
A p
hasEntity C A B
:Mary01 :Alive01 A is the intersection A is linked to B
of B and C through property p
Lefrançois & Gandon, ULiS. MSW 2011 - 33
53. Conceptual participants
Named ConP slots
(an infinite number of) lexicalized conceptual relations
ConP inheritance
ConP partial inheritance
ConP composition
ConP merging
Optional / obligatory ConP
Relation between two ConPs
Formal definitions of ILUCs
Lefrançois & Gandon, ULiS. MSW 2011
54. Three layers
owl:Class owl:ObjectProperty xsd:boolean
OWL owl:intersectionOf is-a
is-a
owl:propertyChainAxiom
owl:unionOf owl:hasSelf
subClassOf is-a subClassOf
core-ILexiMOn layer
:ILexicalUnit :ISemanticRelation
is-a :onISemanticRelation
subClassOf
:allValuesFrom
:ILexicalPrimitive
is-a :isObligatory range
:Entity :hasEntity
ILexicOn layer
:State true
:Person
class/instance A B
A is an instance of B
intersectionOf
property
A B
:Alive A is a subClass of B
Data-layer
B
A p
hasEntity C A B
:Mary01 :Alive01 A is the intersection A is linked to B
of B and C through property p
Lefrançois & Gandon, ULiS. MSW 2011 - 55
55. ULiS: A Universal Linguistic System
- to redesign Pivot-based NLP techniques
- 100% using the Semantic Web Formalisms
- compliant with the Meaning-Text Theory
ILexicOn: The Interlingual Lexical Ontology
- formal lexicographic definitions
- Lexical functions
- Populate: SUMO, FrameNet…
- SLexicOn
- Rules, SPIN & VISL-CG3
Thank You
Maxime Lefrançois & Fabien Gandon
Edelweiss – INRIA Sophia-Antipolis – France
{Maxime.Lefrancois|Fabien.Gandon}@inria.fr
This presentation on
http://maxime-lefrancois.info