SlideShare uma empresa Scribd logo
1 de 4
Baixar para ler offline
Poster Paper
Proc. of Int. Conf. on Advances in Computer Science and Application 2013

Identification of Rhetorical Structure Relation from
Discourse Marker in Bengali Language Understanding
Abhishek Sarkar, Pinaki Sankar Chatterjee
School of Computer Engineering
KIIT University, Bhubaneswar, Odisha
{abhishek690 , pinaki.sankar.chatterjee}@gmail.com
Abstract: Rhetorical Structure Theory (RST) has been in
constant research for a number of years. The different parts
of a text are called text spans. RST helps in organizing these
text spans. These text spans are connected with each other by
Discourse markers. In this paper, we are exploring the
possibility of realizing some RST relations (CONTRAST,
SEQUENCE and PARALLEL) from multi-nuclear sentences
in Bengali Language with the help of the semantic structure
of the sentences and the discourse markers. We present a rule
based approach for comprehending the RST relations from
the discourse markers in Bengali Language.
Keywords – RST, Discourse Marker, Compound Sentence,
Syntactic Aggregation, Semantic Representation.

A. Objective
The work in this paper has been done in Bengali
Language. Since the formation of Bengali Language is free
order, so Ambiguity occurs in mapping of Discourse markers
to the RST relations. Some Discourse Marker can be mapped
to more than one RST relation hence there exist some
ambiguity in mapping function. So to resolve this ambiguity
we have tried to formulate some algorithms for each of the
RST relations. For a given Semantic representation of
compound sentence, we can identify the RST relation of a
sentence with utmost accuracy.

I. INTRODUCTION
Rhetorical Structure Theory is a theory of organization of the
text spans, the working of those text spans and how those
text spans are related and connected to each other [2]. Each
text span can have one or two roles in a relation: it can be a
nucleus or a satellite. Nucleus represents the central unit in
the sentence, they exist independently. They are considered
essential to the understanding of the text. Satellites is less
central and can never exist independently, they are actually
dependent on the nucleus. Satellites contribute additional
information to the nucleus [3].
Discourse marker is a word or phrase that helps in
logically connecting two text spans which are coherent to
each other [2]. Examples of Discourse Markers are:- but, then,
and, although, yet, with, etc
In this paper, we are dealing with only compound
sentences, i.e. sentences having more than one nucleus.
Compound sentences are those sentences which are formed
by combining two or more simple sentences having one
nucleus. Some example are given as:- (1) CONTRAST:- Ram
is very happy with his marks but his sister is very upset with
her marks. Here the discourse marker is “but” and the RST
relation is CONTRAST.
(2) SEQUENCE:- Bumba is going to the shop then he will go
to visit his grandmother.
Here the discourse marker is “then” and the RST relation is
SEQUENCE.
(3) PARALLEL:- Dipshika is eating while watching the
television.
Here the discourse marker is “while” and the RST relation is
PARALLEL.
There are many more like EXPRESSION, CONJUNCTION,
CONTINUATION, LIST, etc.
42
© 2013 ACEEE
DOI: 03.LSCS.2013.3.53

Figure 1: Inputs of the function

The Figure 1 explains the system we are building. Our
formulated algorithm or function is going to receive the
semantic representations of simple sentence 1 and simple
sentence 2 along with the discourse marker and is going to
give the corresponding RST relation as the output. So our
objective is to identify the RST relation from discourse marker
in Bengali Language.
II. RELATED WORKS
One of the well known tasks in Discourse marker and
RST has been done by Maite Taboada(Discourse markers
as signals (or not) of rhetorical relations) [1],where he
discusses the idea of discourse markers or connectives
signaling in the presence of a particular relationship. He
addressed the relationship between discourse markers and
rhetorical relations, and, more generally, the signaling of
rhetorical relations. He along with William C. Mann (Rhetorical
Structure Theory: Looking Back and Moving Ahead) [5]
reviewed some of the discussions about RST, especially
addressing issues of the reliability of analyses and
psychological validity, together with a discussion of the nature
Poster Paper
Proc. of Int. Conf. on Advances in Computer Science and Application 2013
of text relations.
William C. Mann and Sandra A. Thompson (Rhetorical
Structure Theory: Towards a functional theory of text
organization) [3] extended the idea of what actually Rhetorical
Structure Theory is. This paper marks the origin of RST. It
describes RST as descriptive theory of a major aspect of
organization of natural text. This paper establishes a new
definitional foundation for RST.
Daniel Marcu and Abdessamad Echihabi( An
Unsupervised Approach to Recognizing Discourse
Relations) [9] present an unsupervised approach to recognize
discourse relations of contrast, explanation-evidence,
condition and elaboration that hold between arbitrary spans
of texts.
Sumit Das, Anupam Basu, Sudeshna Sarkar (Discourse
Marker Generation and Syntactic Aggregation in Bengali
Text Generation) [2], proposes the idea about the prevalent
syntactic aggregation constructs in Bengali. The paper
presented a rule based approach towards generating Bengali
compound sentences using the identified constructs.

This compound sentence can be divided into 2 simple sentences like:- Simple sentence1Simple sentence 2 The discourse marker is
Figure 2 shows the semantic representation of the
compound sentence[3], where the clause count is given as 2,
the type of the sentence is represented as composite, the
discourse marker is identified as
and rhetoric relation
is given as question mark implying the objective of our paper,
which is to identify the RST relation. We have designed
algorithms for RST relations CONTRAST, SEQUENCE and
PARALLEL.
The algorithm for CONTRAST:-

III. THE PROPOSED SYSTEM
We are only considering compound sentences. There are
many types of multi-nuclear RST relations, but in this paper
we are only dealing with CONTRAST, SEQUENCE and PARALLEL. In Bengali Language some of the Discourse markers
are: comma (,),
etc.
We have collected many Bengali compound sentences
and divided them manually into 2 different simple sentences.
Suppose if we take a Bengali sentence like:[10] -

Figure 2: After Syntactic Aggregation semantic representation of compound sentence[2] [4].

© 2013 ACEEE
DOI: 03.LSCS.2013.3.53

43
Poster Paper
Proc. of Int. Conf. on Advances in Computer Science and Application 2013
Let us take some examples for Contrast:Example 1:-

Let us take some example for Sequence :-

“[10]- Here
and according to our

the discourse marker is
algorithm, if the discourse marker is

[10] – Here we can see that the discourse marker is
and we can see the polarity of the t wo
simple sentences are same. So, according to our algorithm
RST is Sequence.
Example 2:[10] - Here the two simple
sentences are in different tense. So according to our algorithm RST is Sequence.
Example 3:-

RST is Contrast.

Example 2:[10]- Here the discourse
marker is

Example 1:-

then we need to check the semantic

structure of the simple sentences, here the polarities of the
two sentences are different, then RST is Contrast.
Example 3:[10]–Here the discourse marker is
then
we check the semantic structure of the simple sentences,
here the polarities of the two sentences are same but the
verb
in the simple sentence 2 is an antonym of the
in the simple sentence 1, according to our
verb
algorithm RST is Contrast.
Algorithm for Sequence:-

[11] - Here the two simple sentences implies about different
time in a single day. We have defined a function PRIORITY(
) where we have divided a single day into 8 different spans
and given each of them different priority values. Since the
priority value of
is less than
so RST
is Sequence.
Example 4:gets
[10] – Here the verb
changed to
when the two simple sentences are
joined to get the composite sentence and thus gives us a
non-finite form of a verb and it’s non-repetitive and thus RST
is Sequence.
Algorithm for PARALLEL RST:-

Let us take some example for Parallel:Example 1:Algorithm for function PRIORITY() :-

[10] – Here the actor (“Ke”) in both
the sentences are same and both the sentences are in same
tense. So RST is Parallel.
Example 2:[10]–
is the continuous form of the verb
Here the verb
and gets changed into a repetitive form in the
and also it is non-finite
composite sentence
form of the verb. So RST is Parallel.
IV. RESULT AND ANALYSIS
We have collected 1000 compound or composite sentences from short stories “KHIRER PUTUL” and “CHADER
PAHAR”, out of these 1000 sentences, we have randomly

© 2013 ACEEE
DOI: 03.LSCS.2013.3.53

44
Poster Paper
Proc. of Int. Conf. on Advances in Computer Science and Application 2013
FUTURE WORKS

taken 600 sentences as our training data set and the rest 400
sentences as our testing data set.
With the training data set, we have manually categorized
sentences into Contrast, Sequence and Parallel depending
on the definition of these RST relations, then we have split
the composite sentences into 2 simple sentences manually
and then create the semantic structure of each of the
sentences, after which we manually identify the Discourse
marker from each of the sentences. Then we categorized the
sentences into Contrast, Sequence and Parallel. we have
developed some rules for each of the RST relation from their
semantic representation. These rules were then formulated
into an algorithm for each of the RST relations. Then we have
created a system with the help of our algorithm.
The testing data set i.e. the remaining 400 sentences are
first manually categorized into Contrast, Sequence and Parallel
depending on the definition of these RST relations and then
to check the accuracy of our system, we put the semantic
representation of these 400 sentences into our system.

There are many more RST relations whose algorithms can
be derived and then a system can be built using the functions
of all the RST relations.
REFERENCES
[1] Maite Taboada : “Discourse markers as signals (or not) of
rhetorical relations” in Journal of Pragmatics 38 (2006) 567–
592
[2] Sumit Das, Anupam Basu, Sudeshna Sarkar: “Discourse Marker
Generation and Syntactic Aggregation in Bengali Text
Generation” in Proceedings of the 2010 IEEE Students’
Technology Symposium, 3-4 April 2010, IIT Kharagpur
[3] William C. Mann and Sandra A. Thompson: “Rhetorical
structure theory: Toward a functional theory of text
organization”. In Text, 8(3):243- 281, 1988.
[4] S amit Bhattacharya, Sanyog: “An iconic system for multilingual
communication for people with speech and motor
impairments”. M.S. Thesis, IIT, Kharagpur, Supervisor Anupam Basu and Sudeshna Sarkar, 2004.
[5] Maite Taboada and William C. Mann: “Rhetorical Structure
Theory: Looking Back and Moving Ahead” in Discourse
Studies 8 (3) by Sage Publicat, January 24, 2006.
[6] Caroline Sporleder, Alex Lascarides:”Using Automatically
Labelled Examples to Classify Rhetorical Relations: An
Assessment”, Printed in the United Kingdom Cambridge
University Press, Received 26 August 2005; revised 16 March
2006.
[7] Farhi Marir and Kame1 Haouam: “Rhetorical Structure Theory
for content-based indexing and retrieval of Web documents”
in Proc. of 2nd International Conference on Information
Technology: Research and Education, 2004, ITRE 2004.
Pages:160 - 164
[8] Maite Taboada and Manfred Stede: “Introduction to RST
Rhetorical Structure Theory” created as part of a project on
Natural Language Generation at the Information Sciences, May
2009.
[9] Daniel Marcu and Abdessamad Echihabi: “An Unsupervised
Approach to Recognizing Discourse Relations” in Proceedings
of the 40th Annual Meeting of the Association for
Computational Linguistics (ACL), Philadelphia, July 2002,
pp. 368-375.
[10] Abanindranath Thakur: “Khirer Putul” Published by Ananda
Publications, 1896. (For Corpus use)
[11] Bibhutibhushon Bondopoddhay: “Chander Pahar” Published
by Bornayon, 1937 (For Corpus use)

Figure.3:- Line Chart to show the accuracy of our system

The figure 3 shows the accuracy of the system. The
accuracy is measured on the testing data set. The horizontal
axis represents the Testing Data set, i.e. the number of
sentences tested and the vertical axis represents the number
of correct identification of RST relation by our system. We
can see that for the first 100 sentences, 91 sentences were
identified correctly by our system and so on. When all the
400 sentences are tested, 383 RST Relations are correctly
identified by our system.
CONCLUSION
In this paper we have discussed about the identification
of the Rhetorical Structure Theory Relation from Discourse
Marker in Bengali Language Understanding. We have derived algorithms for CONTRAST, SEQUENCE and PARALLEL RST relation and explained them with few examples. We
have given user based evaluation to validate our approach.
We have built a small system and a small corpus to check our
result.

© 2013 ACEEE
DOI: 03.LSCS.2013.3.53

45

Mais conteúdo relacionado

Mais procurados

A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...ijnlc
 
Improvement of Text Summarization using Fuzzy Logic Based Method
Improvement of Text Summarization using Fuzzy Logic Based  MethodImprovement of Text Summarization using Fuzzy Logic Based  Method
Improvement of Text Summarization using Fuzzy Logic Based MethodIOSR Journals
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
Dil.7.1.13.doc
Dil.7.1.13.docDil.7.1.13.doc
Dil.7.1.13.docbutest
 
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box RuleML
 
[Emnlp] what is glo ve part iii - towards data science
[Emnlp] what is glo ve  part iii - towards data science[Emnlp] what is glo ve  part iii - towards data science
[Emnlp] what is glo ve part iii - towards data scienceNikhil Jaiswal
 
Sentence similarity-based-text-summarization-using-clusters
Sentence similarity-based-text-summarization-using-clustersSentence similarity-based-text-summarization-using-clusters
Sentence similarity-based-text-summarization-using-clustersMOHDSAIFWAJID1
 
Deep Contextualized Word Representations - SNLP2018
Deep Contextualized Word Representations - SNLP2018Deep Contextualized Word Representations - SNLP2018
Deep Contextualized Word Representations - SNLP2018Shun Kiyono
 
A SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHOD
A SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHODA SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHOD
A SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHODIJwest
 
A Self-Supervised Tibetan-Chinese Vocabulary Alignment Method
A Self-Supervised Tibetan-Chinese Vocabulary Alignment MethodA Self-Supervised Tibetan-Chinese Vocabulary Alignment Method
A Self-Supervised Tibetan-Chinese Vocabulary Alignment Methoddannyijwest
 
Chapter 6 relational data model and relational
Chapter  6  relational data model and relationalChapter  6  relational data model and relational
Chapter 6 relational data model and relationalJafar Nesargi
 
Cohesion In English Wasee
Cohesion In English  WaseeCohesion In English  Wasee
Cohesion In English WaseeDr. Cupid Lucid
 
Rule-based Prosody Calculation for Marathi Text-to-Speech Synthesis
Rule-based Prosody Calculation for Marathi Text-to-Speech SynthesisRule-based Prosody Calculation for Marathi Text-to-Speech Synthesis
Rule-based Prosody Calculation for Marathi Text-to-Speech SynthesisIJERA Editor
 

Mais procurados (20)

Pxc3898474
Pxc3898474Pxc3898474
Pxc3898474
 
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
A COMPUTATIONAL APPROACH FOR ANALYZING INTER-SENTENTIAL ANAPHORIC PRONOUNS IN...
 
Improvement of Text Summarization using Fuzzy Logic Based Method
Improvement of Text Summarization using Fuzzy Logic Based  MethodImprovement of Text Summarization using Fuzzy Logic Based  Method
Improvement of Text Summarization using Fuzzy Logic Based Method
 
ijcai05_srl
ijcai05_srlijcai05_srl
ijcai05_srl
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
Dil.7.1.13.doc
Dil.7.1.13.docDil.7.1.13.doc
Dil.7.1.13.doc
 
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
RuleML2015 The Herbrand Manifesto - Thinking Inside the Box
 
I5 geetha4 suraiya
I5 geetha4 suraiyaI5 geetha4 suraiya
I5 geetha4 suraiya
 
Structural semantics
Structural semanticsStructural semantics
Structural semantics
 
I3 madankarky2 karthika
I3 madankarky2 karthikaI3 madankarky2 karthika
I3 madankarky2 karthika
 
[Emnlp] what is glo ve part iii - towards data science
[Emnlp] what is glo ve  part iii - towards data science[Emnlp] what is glo ve  part iii - towards data science
[Emnlp] what is glo ve part iii - towards data science
 
Sentence similarity-based-text-summarization-using-clusters
Sentence similarity-based-text-summarization-using-clustersSentence similarity-based-text-summarization-using-clusters
Sentence similarity-based-text-summarization-using-clusters
 
Deep Contextualized Word Representations - SNLP2018
Deep Contextualized Word Representations - SNLP2018Deep Contextualized Word Representations - SNLP2018
Deep Contextualized Word Representations - SNLP2018
 
A SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHOD
A SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHODA SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHOD
A SELF-SUPERVISED TIBETAN-CHINESE VOCABULARY ALIGNMENT METHOD
 
A Self-Supervised Tibetan-Chinese Vocabulary Alignment Method
A Self-Supervised Tibetan-Chinese Vocabulary Alignment MethodA Self-Supervised Tibetan-Chinese Vocabulary Alignment Method
A Self-Supervised Tibetan-Chinese Vocabulary Alignment Method
 
Lexical sets
Lexical setsLexical sets
Lexical sets
 
Chapter 6 relational data model and relational
Chapter  6  relational data model and relationalChapter  6  relational data model and relational
Chapter 6 relational data model and relational
 
Cohesion In English Wasee
Cohesion In English  WaseeCohesion In English  Wasee
Cohesion In English Wasee
 
Cohesion Final
Cohesion FinalCohesion Final
Cohesion Final
 
Rule-based Prosody Calculation for Marathi Text-to-Speech Synthesis
Rule-based Prosody Calculation for Marathi Text-to-Speech SynthesisRule-based Prosody Calculation for Marathi Text-to-Speech Synthesis
Rule-based Prosody Calculation for Marathi Text-to-Speech Synthesis
 

Semelhante a Identification of Rhetorical Structure Relation from Discourse Marker in Bengali Language Understanding

Application of rhetorical
Application of rhetoricalApplication of rhetorical
Application of rhetoricalcsandit
 
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...ijctcm
 
EXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATION
EXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATIONEXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATION
EXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATIONIJNSA Journal
 
Exploiting rhetorical relations to
Exploiting rhetorical relations toExploiting rhetorical relations to
Exploiting rhetorical relations toIJNSA Journal
 
Sanskrit parser Project Report
Sanskrit parser Project ReportSanskrit parser Project Report
Sanskrit parser Project ReportLaxmi Kant Yadav
 
IRJET - Automatic Text Summarization of News Articles
IRJET -  	  Automatic Text Summarization of News ArticlesIRJET -  	  Automatic Text Summarization of News Articles
IRJET - Automatic Text Summarization of News ArticlesIRJET Journal
 
6 shallow parsing introduction
6 shallow parsing introduction6 shallow parsing introduction
6 shallow parsing introductionThennarasuSakkan
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESkevig
 
BanglaDocAnalyzer
BanglaDocAnalyzerBanglaDocAnalyzer
BanglaDocAnalyzerSamina Azad
 
A comparative analysis of particle swarm optimization and k means algorithm f...
A comparative analysis of particle swarm optimization and k means algorithm f...A comparative analysis of particle swarm optimization and k means algorithm f...
A comparative analysis of particle swarm optimization and k means algorithm f...ijnlc
 
[Emnlp] what is glo ve part ii - towards data science
[Emnlp] what is glo ve  part ii - towards data science[Emnlp] what is glo ve  part ii - towards data science
[Emnlp] what is glo ve part ii - towards data scienceNikhil Jaiswal
 
Expansion Teaching Technique
Expansion Teaching TechniqueExpansion Teaching Technique
Expansion Teaching Techniqueflattsph
 
Towards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian LanguagesTowards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian LanguagesAlgoscale Technologies Inc.
 
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTIONTRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTIONijnlc
 
Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...ijnlc
 
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...acijjournal
 

Semelhante a Identification of Rhetorical Structure Relation from Discourse Marker in Bengali Language Understanding (20)

Marcu 2000 presentation
Marcu 2000 presentationMarcu 2000 presentation
Marcu 2000 presentation
 
Application of rhetorical
Application of rhetoricalApplication of rhetorical
Application of rhetorical
 
P13 corley
P13 corleyP13 corley
P13 corley
 
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...
An Approach To Automatic Text Summarization Using Simplified Lesk Algorithm A...
 
EXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATION
EXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATIONEXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATION
EXPLOITING RHETORICAL RELATIONS TO MULTIPLE DOCUMENTS TEXT SUMMARIZATION
 
Exploiting rhetorical relations to
Exploiting rhetorical relations toExploiting rhetorical relations to
Exploiting rhetorical relations to
 
Sanskrit parser Project Report
Sanskrit parser Project ReportSanskrit parser Project Report
Sanskrit parser Project Report
 
IRJET - Automatic Text Summarization of News Articles
IRJET -  	  Automatic Text Summarization of News ArticlesIRJET -  	  Automatic Text Summarization of News Articles
IRJET - Automatic Text Summarization of News Articles
 
6 shallow parsing introduction
6 shallow parsing introduction6 shallow parsing introduction
6 shallow parsing introduction
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 
BanglaDocAnalyzer
BanglaDocAnalyzerBanglaDocAnalyzer
BanglaDocAnalyzer
 
A comparative analysis of particle swarm optimization and k means algorithm f...
A comparative analysis of particle swarm optimization and k means algorithm f...A comparative analysis of particle swarm optimization and k means algorithm f...
A comparative analysis of particle swarm optimization and k means algorithm f...
 
[Emnlp] what is glo ve part ii - towards data science
[Emnlp] what is glo ve  part ii - towards data science[Emnlp] what is glo ve  part ii - towards data science
[Emnlp] what is glo ve part ii - towards data science
 
Expansion Teaching Technique
Expansion Teaching TechniqueExpansion Teaching Technique
Expansion Teaching Technique
 
Towards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian LanguagesTowards Building Semantic Role Labeler for Indian Languages
Towards Building Semantic Role Labeler for Indian Languages
 
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTIONTRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
 
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTIONTRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
TRANSLATING LEGAL SENTENCE BY SEGMENTATION AND RULE SELECTION
 
Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...Developing links of compound sentences for parsing through marathi link gramm...
Developing links of compound sentences for parsing through marathi link gramm...
 
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...
A COMPARATIVE STUDY OF ROOT-BASED AND STEM-BASED APPROACHES FOR MEASURING THE...
 

Mais de idescitation (20)

65 113-121
65 113-12165 113-121
65 113-121
 
69 122-128
69 122-12869 122-128
69 122-128
 
71 338-347
71 338-34771 338-347
71 338-347
 
72 129-135
72 129-13572 129-135
72 129-135
 
74 136-143
74 136-14374 136-143
74 136-143
 
80 152-157
80 152-15780 152-157
80 152-157
 
82 348-355
82 348-35582 348-355
82 348-355
 
84 11-21
84 11-2184 11-21
84 11-21
 
62 328-337
62 328-33762 328-337
62 328-337
 
46 102-112
46 102-11246 102-112
46 102-112
 
47 292-298
47 292-29847 292-298
47 292-298
 
49 299-305
49 299-30549 299-305
49 299-305
 
57 306-311
57 306-31157 306-311
57 306-311
 
60 312-318
60 312-31860 312-318
60 312-318
 
5 1-10
5 1-105 1-10
5 1-10
 
11 69-81
11 69-8111 69-81
11 69-81
 
14 284-291
14 284-29114 284-291
14 284-291
 
15 82-87
15 82-8715 82-87
15 82-87
 
29 88-96
29 88-9629 88-96
29 88-96
 
43 97-101
43 97-10143 97-101
43 97-101
 

Último

Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxMichelleTuguinay1
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 

Último (20)

Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 

Identification of Rhetorical Structure Relation from Discourse Marker in Bengali Language Understanding

  • 1. Poster Paper Proc. of Int. Conf. on Advances in Computer Science and Application 2013 Identification of Rhetorical Structure Relation from Discourse Marker in Bengali Language Understanding Abhishek Sarkar, Pinaki Sankar Chatterjee School of Computer Engineering KIIT University, Bhubaneswar, Odisha {abhishek690 , pinaki.sankar.chatterjee}@gmail.com Abstract: Rhetorical Structure Theory (RST) has been in constant research for a number of years. The different parts of a text are called text spans. RST helps in organizing these text spans. These text spans are connected with each other by Discourse markers. In this paper, we are exploring the possibility of realizing some RST relations (CONTRAST, SEQUENCE and PARALLEL) from multi-nuclear sentences in Bengali Language with the help of the semantic structure of the sentences and the discourse markers. We present a rule based approach for comprehending the RST relations from the discourse markers in Bengali Language. Keywords – RST, Discourse Marker, Compound Sentence, Syntactic Aggregation, Semantic Representation. A. Objective The work in this paper has been done in Bengali Language. Since the formation of Bengali Language is free order, so Ambiguity occurs in mapping of Discourse markers to the RST relations. Some Discourse Marker can be mapped to more than one RST relation hence there exist some ambiguity in mapping function. So to resolve this ambiguity we have tried to formulate some algorithms for each of the RST relations. For a given Semantic representation of compound sentence, we can identify the RST relation of a sentence with utmost accuracy. I. INTRODUCTION Rhetorical Structure Theory is a theory of organization of the text spans, the working of those text spans and how those text spans are related and connected to each other [2]. Each text span can have one or two roles in a relation: it can be a nucleus or a satellite. Nucleus represents the central unit in the sentence, they exist independently. They are considered essential to the understanding of the text. Satellites is less central and can never exist independently, they are actually dependent on the nucleus. Satellites contribute additional information to the nucleus [3]. Discourse marker is a word or phrase that helps in logically connecting two text spans which are coherent to each other [2]. Examples of Discourse Markers are:- but, then, and, although, yet, with, etc In this paper, we are dealing with only compound sentences, i.e. sentences having more than one nucleus. Compound sentences are those sentences which are formed by combining two or more simple sentences having one nucleus. Some example are given as:- (1) CONTRAST:- Ram is very happy with his marks but his sister is very upset with her marks. Here the discourse marker is “but” and the RST relation is CONTRAST. (2) SEQUENCE:- Bumba is going to the shop then he will go to visit his grandmother. Here the discourse marker is “then” and the RST relation is SEQUENCE. (3) PARALLEL:- Dipshika is eating while watching the television. Here the discourse marker is “while” and the RST relation is PARALLEL. There are many more like EXPRESSION, CONJUNCTION, CONTINUATION, LIST, etc. 42 © 2013 ACEEE DOI: 03.LSCS.2013.3.53 Figure 1: Inputs of the function The Figure 1 explains the system we are building. Our formulated algorithm or function is going to receive the semantic representations of simple sentence 1 and simple sentence 2 along with the discourse marker and is going to give the corresponding RST relation as the output. So our objective is to identify the RST relation from discourse marker in Bengali Language. II. RELATED WORKS One of the well known tasks in Discourse marker and RST has been done by Maite Taboada(Discourse markers as signals (or not) of rhetorical relations) [1],where he discusses the idea of discourse markers or connectives signaling in the presence of a particular relationship. He addressed the relationship between discourse markers and rhetorical relations, and, more generally, the signaling of rhetorical relations. He along with William C. Mann (Rhetorical Structure Theory: Looking Back and Moving Ahead) [5] reviewed some of the discussions about RST, especially addressing issues of the reliability of analyses and psychological validity, together with a discussion of the nature
  • 2. Poster Paper Proc. of Int. Conf. on Advances in Computer Science and Application 2013 of text relations. William C. Mann and Sandra A. Thompson (Rhetorical Structure Theory: Towards a functional theory of text organization) [3] extended the idea of what actually Rhetorical Structure Theory is. This paper marks the origin of RST. It describes RST as descriptive theory of a major aspect of organization of natural text. This paper establishes a new definitional foundation for RST. Daniel Marcu and Abdessamad Echihabi( An Unsupervised Approach to Recognizing Discourse Relations) [9] present an unsupervised approach to recognize discourse relations of contrast, explanation-evidence, condition and elaboration that hold between arbitrary spans of texts. Sumit Das, Anupam Basu, Sudeshna Sarkar (Discourse Marker Generation and Syntactic Aggregation in Bengali Text Generation) [2], proposes the idea about the prevalent syntactic aggregation constructs in Bengali. The paper presented a rule based approach towards generating Bengali compound sentences using the identified constructs. This compound sentence can be divided into 2 simple sentences like:- Simple sentence1Simple sentence 2 The discourse marker is Figure 2 shows the semantic representation of the compound sentence[3], where the clause count is given as 2, the type of the sentence is represented as composite, the discourse marker is identified as and rhetoric relation is given as question mark implying the objective of our paper, which is to identify the RST relation. We have designed algorithms for RST relations CONTRAST, SEQUENCE and PARALLEL. The algorithm for CONTRAST:- III. THE PROPOSED SYSTEM We are only considering compound sentences. There are many types of multi-nuclear RST relations, but in this paper we are only dealing with CONTRAST, SEQUENCE and PARALLEL. In Bengali Language some of the Discourse markers are: comma (,), etc. We have collected many Bengali compound sentences and divided them manually into 2 different simple sentences. Suppose if we take a Bengali sentence like:[10] - Figure 2: After Syntactic Aggregation semantic representation of compound sentence[2] [4]. © 2013 ACEEE DOI: 03.LSCS.2013.3.53 43
  • 3. Poster Paper Proc. of Int. Conf. on Advances in Computer Science and Application 2013 Let us take some examples for Contrast:Example 1:- Let us take some example for Sequence :- “[10]- Here and according to our the discourse marker is algorithm, if the discourse marker is [10] – Here we can see that the discourse marker is and we can see the polarity of the t wo simple sentences are same. So, according to our algorithm RST is Sequence. Example 2:[10] - Here the two simple sentences are in different tense. So according to our algorithm RST is Sequence. Example 3:- RST is Contrast. Example 2:[10]- Here the discourse marker is Example 1:- then we need to check the semantic structure of the simple sentences, here the polarities of the two sentences are different, then RST is Contrast. Example 3:[10]–Here the discourse marker is then we check the semantic structure of the simple sentences, here the polarities of the two sentences are same but the verb in the simple sentence 2 is an antonym of the in the simple sentence 1, according to our verb algorithm RST is Contrast. Algorithm for Sequence:- [11] - Here the two simple sentences implies about different time in a single day. We have defined a function PRIORITY( ) where we have divided a single day into 8 different spans and given each of them different priority values. Since the priority value of is less than so RST is Sequence. Example 4:gets [10] – Here the verb changed to when the two simple sentences are joined to get the composite sentence and thus gives us a non-finite form of a verb and it’s non-repetitive and thus RST is Sequence. Algorithm for PARALLEL RST:- Let us take some example for Parallel:Example 1:Algorithm for function PRIORITY() :- [10] – Here the actor (“Ke”) in both the sentences are same and both the sentences are in same tense. So RST is Parallel. Example 2:[10]– is the continuous form of the verb Here the verb and gets changed into a repetitive form in the and also it is non-finite composite sentence form of the verb. So RST is Parallel. IV. RESULT AND ANALYSIS We have collected 1000 compound or composite sentences from short stories “KHIRER PUTUL” and “CHADER PAHAR”, out of these 1000 sentences, we have randomly © 2013 ACEEE DOI: 03.LSCS.2013.3.53 44
  • 4. Poster Paper Proc. of Int. Conf. on Advances in Computer Science and Application 2013 FUTURE WORKS taken 600 sentences as our training data set and the rest 400 sentences as our testing data set. With the training data set, we have manually categorized sentences into Contrast, Sequence and Parallel depending on the definition of these RST relations, then we have split the composite sentences into 2 simple sentences manually and then create the semantic structure of each of the sentences, after which we manually identify the Discourse marker from each of the sentences. Then we categorized the sentences into Contrast, Sequence and Parallel. we have developed some rules for each of the RST relation from their semantic representation. These rules were then formulated into an algorithm for each of the RST relations. Then we have created a system with the help of our algorithm. The testing data set i.e. the remaining 400 sentences are first manually categorized into Contrast, Sequence and Parallel depending on the definition of these RST relations and then to check the accuracy of our system, we put the semantic representation of these 400 sentences into our system. There are many more RST relations whose algorithms can be derived and then a system can be built using the functions of all the RST relations. REFERENCES [1] Maite Taboada : “Discourse markers as signals (or not) of rhetorical relations” in Journal of Pragmatics 38 (2006) 567– 592 [2] Sumit Das, Anupam Basu, Sudeshna Sarkar: “Discourse Marker Generation and Syntactic Aggregation in Bengali Text Generation” in Proceedings of the 2010 IEEE Students’ Technology Symposium, 3-4 April 2010, IIT Kharagpur [3] William C. Mann and Sandra A. Thompson: “Rhetorical structure theory: Toward a functional theory of text organization”. In Text, 8(3):243- 281, 1988. [4] S amit Bhattacharya, Sanyog: “An iconic system for multilingual communication for people with speech and motor impairments”. M.S. Thesis, IIT, Kharagpur, Supervisor Anupam Basu and Sudeshna Sarkar, 2004. [5] Maite Taboada and William C. Mann: “Rhetorical Structure Theory: Looking Back and Moving Ahead” in Discourse Studies 8 (3) by Sage Publicat, January 24, 2006. [6] Caroline Sporleder, Alex Lascarides:”Using Automatically Labelled Examples to Classify Rhetorical Relations: An Assessment”, Printed in the United Kingdom Cambridge University Press, Received 26 August 2005; revised 16 March 2006. [7] Farhi Marir and Kame1 Haouam: “Rhetorical Structure Theory for content-based indexing and retrieval of Web documents” in Proc. of 2nd International Conference on Information Technology: Research and Education, 2004, ITRE 2004. Pages:160 - 164 [8] Maite Taboada and Manfred Stede: “Introduction to RST Rhetorical Structure Theory” created as part of a project on Natural Language Generation at the Information Sciences, May 2009. [9] Daniel Marcu and Abdessamad Echihabi: “An Unsupervised Approach to Recognizing Discourse Relations” in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), Philadelphia, July 2002, pp. 368-375. [10] Abanindranath Thakur: “Khirer Putul” Published by Ananda Publications, 1896. (For Corpus use) [11] Bibhutibhushon Bondopoddhay: “Chander Pahar” Published by Bornayon, 1937 (For Corpus use) Figure.3:- Line Chart to show the accuracy of our system The figure 3 shows the accuracy of the system. The accuracy is measured on the testing data set. The horizontal axis represents the Testing Data set, i.e. the number of sentences tested and the vertical axis represents the number of correct identification of RST relation by our system. We can see that for the first 100 sentences, 91 sentences were identified correctly by our system and so on. When all the 400 sentences are tested, 383 RST Relations are correctly identified by our system. CONCLUSION In this paper we have discussed about the identification of the Rhetorical Structure Theory Relation from Discourse Marker in Bengali Language Understanding. We have derived algorithms for CONTRAST, SEQUENCE and PARALLEL RST relation and explained them with few examples. We have given user based evaluation to validate our approach. We have built a small system and a small corpus to check our result. © 2013 ACEEE DOI: 03.LSCS.2013.3.53 45