IOSR Journal of Computer Engineering (IOSRJCE)
ISSN: 2278-0661, ISBN: 2278-8727 Volume 5, Issue 6 (Sep-Oct. 2012), PP 05-10
www.iosrjournals.org
Improvement of Text Summarization using Fuzzy Logic Based Method
Rucha S. Dixit (1), Prof. Dr. S. S. Apte (2)
(1) Computer Science & Engg, W.I.T. Solapur / Solapur University, India
(2) Head, Computer Science & Engg, W.I.T. Solapur / Solapur University, India
ABSTRACT: Automatic text summarization is being widely researched and is gaining importance as the
amount of information available online increases. Its aim is to compress a longer original text into a
shorter text called a summary. Abstraction and extraction are the two main methods of text summarization;
our approach uses the extractive method. Summarization by extraction involves identifying important
features and extracting sentences based on their scores. Thirty documents from news-based URLs are used
as input. After the input document is preprocessed, eight features are used to calculate a score for each
sentence. In this paper a fuzzy logic method is proposed to improve the extraction of summary sentences.
Keywords – fuzzy logic, feature-based, fuzzy centroid, sentence score, text summarization.
I. Introduction
With the exponential growth in the quantity and complexity of information sources on the internet, it
has become increasingly important to provide users with better mechanisms for finding exact information
in the available documents. Text summarization has become an important and timely tool for helping users
interpret the large volumes of text available in documents.
Automatic text summarization is the machine creation of a summary that presents the most important
information of the source text in a shorter version while keeping its main content, helping the user to
quickly understand large volumes of information. Text summarization addresses both the problem of
selecting the most important portions of a text and the problem of generating coherent summaries.
Automatic text summarization differs significantly from human summarization, since humans can capture
and relate the deep meanings and themes of text documents, while automating such a skill is very
difficult.
Automatic text summarization can be carried out mainly by two methods: extraction and abstraction.
The extraction method selects the sentences or phrases with the highest scores from the original text
and puts them together into a new, shorter text without changing the source text. The abstraction method
uses linguistic methods to examine and interpret the text. Automatic text summarization works best on
well-structured documents such as news, reports, articles and scientific papers. The extraction method
involves identifying features such as sentence length, sentence location, term frequency, number of
words occurring in the title, number of proper nouns, number of numerical data items and thematic words.
Our approach uses a feature fusion technique to decide which of the available features are actually
useful. This paper proposes text summarization based on fuzzy logic to extract important sentences as a
summary.
II. Summarization Approaches
In early classic summarization systems, summaries were created according to the most frequent words
in the text. Luhn created the first summarization system [1] in 1958. Rath et al. [2] in 1961 provided
empirical evidence of the difficulties inherent in the notion of an ideal summary. Both studies used one
thematic feature, term frequency, and are therefore characterized as surface-level approaches. In the
early 1960s new approaches called entity-level approaches appeared; the first approach of this kind used
syntactic analysis [3]. Location features were used in [4], where the key phrases used dealt with three
additional components: pragmatic words, title and heading words, and structural indicators.
In this paper the proposed method uses fuzzy rules and fuzzy sets to select sentences based on their
features. Fuzzy logic techniques, in the form of approximate reasoning, provide decision support and
expert systems with powerful reasoning capabilities. The permissiveness of fuzziness in human thought
processes suggests that much of the logic behind human reasoning is not traditional two-valued or even
multi-valued logic, but logic with fuzzy truths, fuzzy connectives, and fuzzy rules of inference [5].
The fuzzy set, proposed by Zadeh [6], is a mathematical tool for dealing with uncertainty, imprecision,
vagueness and ambiguity. Fuzzy logic in text summarization needs more investigation; only a few studies
have been done in this area. Witte and Bergler [7] presented a fuzzy-theory-based approach to
co-reference resolution and its application to text summarization, since automatic determination of
co-reference between noun phrases is fraught with uncertainty. Kiani and
Akbarzadeh [8] proposed a technique for summarizing text using a combination of Genetic Algorithm (GA)
and Genetic Programming (GP) to optimize the rule sets and membership functions of fuzzy systems.
Feature extraction techniques are used to obtain the important sentences in the text. Some features
are more important than others, so they should be given balanced weights in the computation; we use
fuzzy logic to solve this problem by defining a membership function for each feature.
III. Experiment
3.1. Input Data and Preprocessing
We used news articles from news-based URLs as input to the summarization system. The text portion of
the news article fetched from a URL is saved in a plain-text document that acts as the input document
to the summarizer. In this paper, preprocessing involves four main activities: sentence segmentation,
tokenization, stop word removal and word stemming. Sentence segmentation detects sentence boundaries and
separates the source text into sentences. Tokenization separates the input document into individual
words. Stop word removal removes words that appear frequently in the document but contribute little to
identifying its important content, such as 'a', 'an', 'the', etc. The last preprocessing step is word
stemming, the process of removing the prefixes and suffixes of each word.
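The four preprocessing activities can be illustrated with a minimal Python sketch. It is not the authors' implementation: the stop word list, the suffix list, the regular expressions and all function names are assumptions chosen for illustration, and the toy stemmer strips suffixes only, where a real system would more likely use an established stemmer.

```python
import re

# Hypothetical stop word and suffix lists, chosen only for illustration.
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "in", "on", "to", "and"}
SUFFIXES = ("ing", "ed", "es", "s")


def segment_sentences(text):
    """Split text into sentences at '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Lower-case a sentence and return its alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", sentence.lower())


def stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Return one list of stemmed content words per sentence."""
    result = []
    for sentence in segment_sentences(text):
        tokens = [t for t in tokenize(sentence) if t not in STOP_WORDS]
        result.append([stem(t) for t in tokens])
    return result
```

Each inner list returned by `preprocess` corresponds to one sentence and can serve as input when sentence features are computed.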
3.2. Sentence Features
After preprocessing, each sentence of the document is represented by a vector of eight features, each
taking a value between '0' and '1'. The eight features are as follows.
3.2.1. Title Feature
This feature measures the similarity between the title sentence and every other sentence of the
document. It is determined by counting the number of matches between the content words in a sentence
and the words in the title. The score for this feature is the ratio of the number of matches between a
sentence and the title sentence to the number of words in the title.
$$\text{Score}_{f1}(S_i) = \frac{\text{number of title words in } S_i}{\text{number of words in the title}} \qquad (1)$$
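As a small illustration of the title feature, the sketch below counts how many title words also occur in a sentence and divides by the title length. The function name and the decision to count each title word at most once are assumptions, not details given in the paper.

```python
def title_score(sentence_words, title_words):
    """Ratio of title words found in the sentence to the number of title words."""
    if not title_words:
        return 0.0
    sentence_set = set(sentence_words)
    matches = sum(1 for word in title_words if word in sentence_set)
    return matches / len(title_words)


# Two of the four title words occur in the sentence.
example = title_score(
    ["fuzzy", "logic", "rocks"],
    ["fuzzy", "logic", "summary", "method"],
)
```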
3.2.2. Sentence Length
We use this feature to filter out short sentences such as datelines and author names, which are not
expected to belong to the summary. The length of a sentence is first calculated by counting the number
of words in it and is then normalized. The score for this feature is the ratio of the length of the
sentence to the length of the longest sentence in the document.
$$\text{Score}_{f2}(S_i) = \frac{\text{number of words in } S_i}{\text{number of words in the longest sentence}} \qquad (2)$$
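The length normalization can be sketched in a few lines of Python. The function name is hypothetical, and sentences are assumed to be given as lists of words.

```python
def length_scores(sentences):
    """Length of each sentence divided by the length of the longest sentence."""
    longest = max((len(words) for words in sentences), default=0)
    if longest == 0:
        return [0.0 for _ in sentences]
    return [len(words) / longest for words in sentences]
```

Under this sketch the longest sentence scores 1, while a two-word dateline in a document whose longest sentence has 40 words scores 0.05.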
3.2.3. Term Weight
This feature uses the concept of term frequency [10], which has often been used to calculate the
importance of a sentence. Here term frequency means the number of occurrences of a term within the
document. We sum the term frequencies of all the terms in a sentence. The score for this feature is the
ratio of this sum to the maximum such sum over all sentences in the document.
The weight of sentence 'i' is calculated by the following equation.
$$W_i = \sum_{j=1}^{k} TF_j \qquad (3)$$
where 'k' is the number of words in the sentence and $TF_j$ is the term frequency of its j-th word.
The score for this feature is calculated as follows.
$$\text{Score}_{f3}(S_i) = \frac{W_i}{\max_{n} W_n} \qquad (4)$$
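A minimal sketch of the term weight feature follows, assuming sentences are lists of preprocessed words and that term frequency is counted over the whole document. The function name is hypothetical.

```python
from collections import Counter


def term_weight_scores(sentences):
    """Sum of document-level term frequencies per sentence, divided by the maximum sum."""
    term_freq = Counter(word for words in sentences for word in words)
    weights = [sum(term_freq[word] for word in words) for words in sentences]
    top = max(weights, default=0)
    if top == 0:
        return [0.0 for _ in sentences]
    return [weight / top for weight in weights]


# "fuzzy" occurs twice in the document, every other word once.
scores = term_weight_scores([["fuzzy", "logic"], ["fuzzy", "summary"], ["news"]])
```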
3.2.4. Sentence Position
The position of a sentence in the text determines its importance. This feature can involve several
items, such as the position of a sentence in the document, section, paragraph, etc. For the score of
this feature we consider the first 5 sentences in the document.
The feature score is calculated as follows.
(5)
3.2.5. Sentence to Sentence Similarity
This feature is the similarity between sentences. For each sentence S, the similarity between S and
every other sentence is computed by the method of token matching. An [N][N] matrix is formed, where N is
the total number of sentences in the document. The diagonal elements of the matrix are set to zero, as a
sentence should not be compared with itself. The similarity of each sentence pair is calculated as in (6).
(6)
where 'TM' stands for the token matching method. The score for this feature is the ratio of the sum of
the similarities of sentence S with every other sentence to the maximum such sum, and is calculated as
in (7), where $\text{Sim}(S_i, S_j)$ denotes the pairwise similarity from (6).
$$\text{Score}_{f5}(S_i) = \frac{\sum_{j=1}^{N} \text{Sim}(S_i, S_j)}{\max_{n} \sum_{j=1}^{N} \text{Sim}(S_n, S_j)} \qquad (7)$$
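The similarity matrix and its normalization can be sketched as below. The paper does not spell out the token matching method here, so this sketch assumes, purely for illustration, that the similarity of two sentences is the number of distinct tokens they share. The function name is hypothetical.

```python
def similarity_scores(sentences):
    """Row sums of a token-overlap similarity matrix, divided by the largest row sum."""
    n = len(sentences)
    token_sets = [set(words) for words in sentences]
    matrix = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:  # the diagonal stays zero: no self-comparison
                # Assumed token matching: count of shared distinct tokens.
                matrix[i][j] = len(token_sets[i] & token_sets[j])
    row_sums = [sum(row) for row in matrix]
    top = max(row_sums, default=0)
    if top == 0:
        return [0.0] * n
    return [total / top for total in row_sums]
```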
3.2.6. Proper Noun
A sentence that contains more proper nouns (named entities) is important and is most probably
included in the document summary. The score for this feature is the ratio of the number of proper nouns
that occur in the sentence to the sentence length.
$$\text{Score}_{f6}(S_i) = \frac{\text{number of proper nouns in } S_i}{\text{length of } S_i} \qquad (8)$$
3.2.7. Thematic Word
A thematic word is a word that occurs frequently in a document and is probably related to its topic;
it is a word with maximum possible relativity. We computed the frequency of occurrence of each word in
the document and took the top 5 most frequent content words as thematic words. The score for this
feature is the ratio of the number of thematic words that occur in a sentence to the maximum number of
thematic words in a sentence.
$$\text{Score}_{f7}(S_i) = \frac{\text{number of thematic words in } S_i}{\max_{n} (\text{number of thematic words in } S_n)} \qquad (9)$$
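The thematic word feature can be sketched as follows, assuming sentences are lists of content words after preprocessing. The function name and the `top_n` parameter are hypothetical, and counting every occurrence of a thematic word rather than each distinct word once is an assumption.

```python
from collections import Counter


def thematic_scores(sentences, top_n=5):
    """Count of thematic-word occurrences per sentence, divided by the maximum count."""
    freq = Counter(word for words in sentences for word in words)
    thematic = {word for word, _ in freq.most_common(top_n)}
    counts = [sum(1 for word in words if word in thematic) for words in sentences]
    top = max(counts, default=0)
    if top == 0:
        return [0.0] * len(sentences)
    return [count / top for count in counts]
```

When several words share the same frequency, `Counter.most_common` keeps them in first-seen order, so the choice among tied words in this sketch depends on the order of the text.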
3.2.8. Numerical Data
This feature counts the numerical data in a sentence. A sentence that contains numerical data is
important and is most probably included in the summary. The score for this feature is the ratio of the
number of numerical data items that occur in the sentence to the sentence length.
$$\text{Score}_{f8}(S_i) = \frac{\text{number of numerical data items in } S_i}{\text{length of } S_i} \qquad (10)$$
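A minimal sketch of the numerical data feature is given below. What counts as numerical data is not defined in detail in the paper, so the regular expression (digits with optional decimal or thousands separators and an optional percent sign) is an assumption, as are the names. Tokens are assumed to be split on whitespace so that forms such as `12.5%` survive as one token.

```python
import re

# Assumed pattern for a numeric token: 2012, 12.5, 1,000, 40%
NUMBER = re.compile(r"^\d+(?:[.,]\d+)*%?$")


def numerical_score(tokens):
    """Fraction of the tokens in a sentence that look like numbers."""
    if not tokens:
        return 0.0
    numeric = sum(1 for token in tokens if NUMBER.match(token))
    return numeric / len(tokens)
```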
3.3. Text Summarization Based on Fuzzy Logic
The goal of text summarization based on extraction is sentence selection. Our system consists of the
following main steps.
a. Read the source document into the system;
b. In the preprocessing step, the system extracts the individual sentences of the original document,
then separates the input document into individual words, and next removes stop words. The last
preprocessing step is word stemming;
c. Each sentence is associated with a vector of the eight features described in Section 3.2, whose
values are derived from the content of the sentence;
d. The score for each sentence is derived from its features using the fuzzy logic method;
e. A set of the highest-scoring sentences is extracted as the document summary based on the compression
rate.
To extract important sentences we used the fuzzy logic method. Fuzzy logic usually involves selecting
fuzzy rules and membership functions. The selection of fuzzy rules and membership functions directly
affects the performance of the fuzzy logic system.
The fuzzy logic system consists of four components: fuzzifier, inference engine, defuzzifier and the
fuzzy knowledge base. In the fuzzifier, crisp inputs are translated into linguistic values using a
membership function. After fuzzification, the inference engine refers to the rule base containing fuzzy
IF-THEN rules to derive the linguistic values. In the last step, the output linguistic variables from
the inference engine are converted to the final crisp values by the defuzzifier, using a membership
function, to represent the final sentence score. Fig. 1 shows the architecture of text summarization
based on the fuzzy logic system.
In order to implement text summarization based on fuzzy logic, the eight features extracted in the
previous section are first used as input to the fuzzifier. We used triangular membership functions and
fuzzy logic to summarize the document. The input triangular membership function for each feature is
divided into five fuzzy sets, which are composed of unimportant values (low (L) and very low (VL)),
average values (medium (M)) and important values (high (H) and very high (VH)). The generalized
triangular membership function depends on the three parameters a, b, and c, where 'a' and 'c' are the
left and right feet of the triangle and 'b' is its peak, as shown in Fig. 2. We used the fuzzy centroid
method to calculate a score for each sentence of a document. A value from zero to one is obtained for
each sentence in the output, based on the sentence features and the knowledge base. The obtained value
determines the degree of importance of the sentence in the final summary. The simplified fuzzy centroid
calculation is given by formula (11).
(11)
where a, b, c are the standard values of low, medium and high respectively, and l, m, n are the
calculated values of low, medium and high respectively.
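The two ingredients of this paragraph can be sketched in Python. The triangular function follows the usual definition with feet a, c and peak b. For the simplified centroid, the sketch assumes a weighted average of the standard values, weighted by the calculated values; this is a common reading of a simplified centroid and is an assumption here, not a formula taken from the paper. All names are hypothetical.

```python
def triangular(x, a, b, c):
    """Triangular membership degree of x, with feet a and c and peak b (a < b < c)."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def simplified_centroid(standard, calculated):
    """Assumed form: average of the standard values weighted by the calculated values."""
    total = sum(calculated)
    if total == 0:
        return 0.0
    return sum(s * d for s, d in zip(standard, calculated)) / total


# Standard values (a, b, c) for low, medium, high and calculated values (l, m, n).
score = simplified_centroid((0.0, 0.5, 1.0), (0.0, 0.5, 0.5))
```

With these inputs the crisp score is 0.75, halfway between the medium and high standard values because both fire equally.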
In the inference engine, the most important part is the definition of the fuzzy IF-THEN rules. The
important sentences are extracted using these rules according to our feature criteria. A sample of these
IF-THEN rules is shown below.
Figure 1: Text summarization based on fuzzy logic system architecture.
Figure 2: Fuzzy centroid calculation, showing the points (a, l), (b, m), (c, n) and the centroid C(x, y).
The last step in the fuzzy logic system is defuzzification. We used an output membership function
divided into three membership functions, Output {Unimportant, Average, Important}, to convert the fuzzy
results from the inference engine into a crisp output for the final score of each sentence.
3.4. Extraction of Sentences
In the fuzzy logic method described above, each sentence of the document is represented by its
sentence score. All the sentences of a document are then ranked in descending order of their scores. A
set of the highest-scoring sentences is extracted as the document summary based on a 20% compression
rate. Finally, the summary sentences are arranged in their original order.
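The extraction step can be sketched as below. How the 20% rate is rounded is not stated in the paper, so rounding up and always keeping at least one sentence are assumptions, as are the function and parameter names.

```python
import math


def extract_summary(sentences, scores, rate=0.2):
    """Keep the top-scoring fraction of sentences, returned in original order."""
    if not sentences:
        return []
    keep = max(1, math.ceil(rate * len(sentences)))  # assumed rounding rule
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    chosen = sorted(ranked[:keep])  # restore document order
    return [sentences[i] for i in chosen]


sentences = ["s0", "s1", "s2", "s3", "s4", "s5", "s6", "s7", "s8", "s9"]
scores = [0.1, 0.8, 0.3, 0.2, 0.9, 0.4, 0.5, 0.6, 0.7, 0.0]
summary = extract_summary(sentences, scores)
```

Here `s4` has the highest score and `s1` the second highest, but the summary lists `s1` before `s4` because the selected sentences are put back in document order.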
4. Evaluation and Results
The evaluation of the summaries is based on the two factors shown in Fig. 5. We used 30 news
documents from news-based URLs as input to the system. Human-generated summaries are used as reference
summaries for evaluating our results. A human-generated summary acts as a gold-standard summary, since
humans can capture and relate the deep meanings of a text better than machines can. We received
human-generated summaries for our input documents from five different experts. We call the summaries of
the Fuzzy summarizer, Copernic summarizer and MS Word summarizer the candidate summaries and the
human-generated summaries the reference summaries.
Figure 3: Content-wise sentence similarity between candidate summary and reference summary
Figure 4: Position-wise sentence similarity between candidate summary and reference summary
Sample IF-THEN rule:
IF (NoWordInTitle is VH) and (SentenceLength is H) and
(TermFreq is VH) and (SentencePosition is H) and
(SentenceSimilarity is VH) and (NoProperNoun is M) and
(NoThematicWord is VH) and (NumericalData is M) THEN
(Sentence is important)
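To show how a rule of this shape could be evaluated, the sketch below computes its firing strength. The paper does not state which operator implements "and"; taking the minimum of the antecedent degrees is a common choice in Mamdani-style systems and is assumed here. The membership degrees are invented for illustration only.

```python
def rule_strength(antecedents):
    """Firing strength of a conjunctive rule: the minimum antecedent degree (assumed AND)."""
    return min(antecedents.values(), default=0.0)


# Hypothetical membership degrees of one sentence in the fuzzy sets the rule names.
degrees = {
    "NoWordInTitle is VH": 0.7,
    "SentenceLength is H": 0.6,
    "TermFreq is VH": 0.9,
    "SentencePosition is H": 0.8,
    "SentenceSimilarity is VH": 0.5,
    "NoProperNoun is M": 0.4,
    "NoThematicWord is VH": 0.7,
    "NumericalData is M": 0.6,
}
strength = rule_strength(degrees)
```

Under this assumption the sentence belongs to the output set "important" with degree 0.4, the weakest of the eight conditions.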
Figure 5: Average similarity between candidate summary and reference summary
The first factor in the evaluation, shown in Fig. 3, is the number of sentences from the candidate
summary that are similar to sentences in the reference summary. For this factor the fuzzy summarizer
exhibits the highest average resemblance to the reference summary, 81%, as shown in Fig. 5. The second
factor, shown in Fig. 4, is the similarity between the positions of the sentences in the candidate
summary and the reference summary. For this factor the fuzzy summarizer has an average 79% resemblance
to the reference summary, which is the highest among the three, as shown in Fig. 5. The results show
that the fuzzy summarizer approach under consideration performs better than the Copernic summarizer and
the MS Word 2007 summarizer.
IV. Conclusion and Future Work
In this paper we implemented automatic text summarization involving feature-based extraction of
important sentences using the fuzzy logic method. The system was tested with 30 news documents and
compared with the Copernic summarizer and the MS Word 2007 summarizer. The results show that the use of
fuzzy logic in text summarization improves the quality of summaries. We applied our method to
single-document summarization; it could be extended to multi-document summarization. The news documents
fall into categories such as sports, politics, weather, etc. To generate the best summaries, it is
necessary to incorporate automatic selection of fuzzy inference rules based on the type of input news,
and our method could be extended in this direction.
V. Acknowledgement
We would like to thank Solapur University and WIT Faculty for supporting us.
References
Journal Papers:
[1] H. P. Luhn, “The Automatic Creation of Literature Abstracts,” IBM Journal of Research and Development, vol. 2, pp.159-165. 1958.
[2] G. J. Rath, A. Resnick, and T. R. Savage, “The formation of abstracts by the selection of sentences,” American Documentation, vol. 12, pp.139-143. 1961.
[4] H. P. Edmundson, “New methods in automatic extracting,” Journal of the Association for Computing Machinery 16 (2), pp.264-285. 1969.
[5] A. D. Kulkarni and D. C. Cavanaugh, “Fuzzy Neural Network Models for Classification,” Applied Intelligence 12, pp.207-215. 2000.
[6] L. Zadeh, “Fuzzy sets,” Information and Control, vol. 8, pp.338-353. 1965.
[10] G. Salton and C. Buckley, “Term-weighting approaches in automatic text retrieval,” Information Processing and Management 24, pp.513-523. 1988. Reprinted in: Sparck-Jones, K.; Willett, P. (eds.) Readings in Information Retrieval. Morgan Kaufmann. pp.323-328. 1997.
Books:
[3] Inderjeet Mani and Mark T. Maybury, editors, Advances in automatic text summarization (MIT Press. 1999)
Proceedings Papers:
[7] R. Witte and S. Bergler, “Fuzzy coreference resolution for summarization,” In Proceedings of 2003 International Symposium on Reference Resolution and Its Applications to Question Answering and Summarization (ARQAS). Venice, Italy: Università Ca' Foscari. pp.43-50. 2003.
[8] Arman Kiani and M. R. Akbarzadeh, “Automatic Text Summarization Using: Hybrid Fuzzy GA-GP,” In Proceedings of 2006 IEEE International Conference on Fuzzy Systems, Sheraton Vancouver Wall Center Hotel, Vancouver, BC, Canada. pp.977-983. 2006.
[9] J. Kupiec, J. Pedersen, and F. Chen, “A Trainable Document Summarizer,” In Proceedings of the Eighteenth Annual International ACM Conference on Research and Development in Information Retrieval (SIGIR), Seattle, WA, pp.68-73. 1995.

Mais conteúdo relacionado

Mais procurados

IRJET- A Survey Paper on Text Summarization Methods
IRJET- A Survey Paper on Text Summarization MethodsIRJET- A Survey Paper on Text Summarization Methods
IRJET- A Survey Paper on Text Summarization MethodsIRJET Journal
 
Optimal approach for text summarization
Optimal approach for text summarizationOptimal approach for text summarization
Optimal approach for text summarizationIAEME Publication
 
8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural network8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural networkINFOGAIN PUBLICATION
 
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESGENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESijnlc
 
ONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVAL
ONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVALONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVAL
ONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVALijaia
 
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATIONSEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATIONijnlc
 
A domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logicA domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logicIAEME Publication
 
Regression model focused on query for multi documents summarization based on ...
Regression model focused on query for multi documents summarization based on ...Regression model focused on query for multi documents summarization based on ...
Regression model focused on query for multi documents summarization based on ...TELKOMNIKA JOURNAL
 
Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents ijsc
 
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION cscpconf
 
Query Answering Approach Based on Document Summarization
Query Answering Approach Based on Document SummarizationQuery Answering Approach Based on Document Summarization
Query Answering Approach Based on Document SummarizationIJMER
 
Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...
Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...
Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...TELKOMNIKA JOURNAL
 
Legal Document
Legal DocumentLegal Document
Legal Documentlegal4
 
IRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text DocumentIRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text DocumentIRJET Journal
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsWaqas Tariq
 

Mais procurados (18)

I6 mala3 sowmya
I6 mala3 sowmyaI6 mala3 sowmya
I6 mala3 sowmya
 
IRJET- A Survey Paper on Text Summarization Methods
IRJET- A Survey Paper on Text Summarization MethodsIRJET- A Survey Paper on Text Summarization Methods
IRJET- A Survey Paper on Text Summarization Methods
 
Optimal approach for text summarization
Optimal approach for text summarizationOptimal approach for text summarization
Optimal approach for text summarization
 
8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural network8 efficient multi-document summary generation using neural network
8 efficient multi-document summary generation using neural network
 
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURESGENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
GENERATING SUMMARIES USING SENTENCE COMPRESSION AND STATISTICAL MEASURES
 
ONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVAL
ONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVALONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVAL
ONTOLOGICAL TREE GENERATION FOR ENHANCED INFORMATION RETRIEVAL
 
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATIONSEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
SEMI-AUTOMATIC SIMULTANEOUS INTERPRETING QUALITY EVALUATION
 
A domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logicA domain specific automatic text summarization using fuzzy logic
A domain specific automatic text summarization using fuzzy logic
 
Regression model focused on query for multi documents summarization based on ...
Regression model focused on query for multi documents summarization based on ...Regression model focused on query for multi documents summarization based on ...
Regression model focused on query for multi documents summarization based on ...
 
Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents Keyword Extraction Based Summarization of Categorized Kannada Text Documents
Keyword Extraction Based Summarization of Categorized Kannada Text Documents
 
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
DOCUMENT SUMMARIZATION IN KANNADA USING KEYWORD EXTRACTION
 
N15-1013
N15-1013N15-1013
N15-1013
 
A4 elanjceziyan
A4 elanjceziyanA4 elanjceziyan
A4 elanjceziyan
 
Query Answering Approach Based on Document Summarization
Query Answering Approach Based on Document SummarizationQuery Answering Approach Based on Document Summarization
Query Answering Approach Based on Document Summarization
 
Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...
Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...
Sentence Extraction Based on Sentence Distribution and Part of Speech Tagging...
 
Legal Document
Legal DocumentLegal Document
Legal Document
 
IRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text DocumentIRJET- Automatic Recapitulation of Text Document
IRJET- Automatic Recapitulation of Text Document
 
Semantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with IdiomsSemantic Based Model for Text Document Clustering with Idioms
Semantic Based Model for Text Document Clustering with Idioms
 

Destaque

Design and MATLAB Simulation of a Fuel Cell Based Interleaved Buck Converter ...
Design and MATLAB Simulation of a Fuel Cell Based Interleaved Buck Converter ...Design and MATLAB Simulation of a Fuel Cell Based Interleaved Buck Converter ...
Design and MATLAB Simulation of a Fuel Cell Based Interleaved Buck Converter ...IOSR Journals
 
Development and Validation of prediction for estimating resting energy expend...
Development and Validation of prediction for estimating resting energy expend...Development and Validation of prediction for estimating resting energy expend...
Development and Validation of prediction for estimating resting energy expend...IOSR Journals
 
Презентация Ямпольского РРЦГ "Результаты работы ІІ Фазы Проекта МРГ в Ямпольс...
Презентация Ямпольского РРЦГ "Результаты работы ІІ Фазы Проекта МРГ в Ямпольс...Презентация Ямпольского РРЦГ "Результаты работы ІІ Фазы Проекта МРГ в Ямпольс...
Презентация Ямпольского РРЦГ "Результаты работы ІІ Фазы Проекта МРГ в Ямпольс...Anastasia Lanina
 
A New Theoretical Approach to Location Based Power Aware Routing
A New Theoretical Approach to Location Based Power Aware RoutingA New Theoretical Approach to Location Based Power Aware Routing
A New Theoretical Approach to Location Based Power Aware RoutingIOSR Journals
 
Wireless Micro-Sensor Network Models
Wireless Micro-Sensor Network ModelsWireless Micro-Sensor Network Models
Wireless Micro-Sensor Network ModelsIOSR Journals
 
Comparative study of IPv4 & IPv6 Point to Point Architecture on various OS pl...
Comparative study of IPv4 & IPv6 Point to Point Architecture on various OS pl...Comparative study of IPv4 & IPv6 Point to Point Architecture on various OS pl...
Comparative study of IPv4 & IPv6 Point to Point Architecture on various OS pl...IOSR Journals
 
Belgium's Social Media and Netlog
Belgium's Social Media and NetlogBelgium's Social Media and Netlog
Belgium's Social Media and NetlogKatherine Witsman
 
AACT: Anonymous and Accountable communication topology for Wireless Mesh Net...
AACT: Anonymous and Accountable communication topology  for Wireless Mesh Net...AACT: Anonymous and Accountable communication topology  for Wireless Mesh Net...
AACT: Anonymous and Accountable communication topology for Wireless Mesh Net...IOSR Journals
 
Heart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means ClassifierHeart Attack Prediction System Using Fuzzy C Means Classifier
Heart Attack Prediction System Using Fuzzy C Means ClassifierIOSR Journals
 
Mobile Networking and Mobile Ad Hoc Routing Protocol Modeling
Mobile Networking and Mobile Ad Hoc Routing Protocol ModelingMobile Networking and Mobile Ad Hoc Routing Protocol Modeling
Mobile Networking and Mobile Ad Hoc Routing Protocol ModelingIOSR Journals
 
Comparison of the Formal Specification Languages Based Upon Various Parameters
Improvement of Text Summarization using Fuzzy Logic Based Method

IOSR Journal of Computer Engineering (IOSRJCE), ISSN: 2278-0661, ISBN: 2278-8727, Volume 5, Issue 6 (Sep-Oct. 2012), PP 05-10, www.iosrjournals.org

Rucha S. Dixit (1), Prof. Dr. S. S. Apte (2)
(1) Computer Science & Engg, W.I.T. Solapur / Solapur University, India
(2) Head, Computer Science & Engg, W.I.T. Solapur / Solapur University, India

ABSTRACT: Automatic text summarization is the subject of wide research and is gaining importance as the availability of online information increases. Automatic text summarization compresses a larger original text into a shorter text called a summary. Abstraction and extraction are the two main methods of carrying out text summarization. Our approach uses the extractive method. Summarization by extraction involves identifying important features and extracting sentences based on their scores. 30 documents from news based URLs are used as input. After preprocessing the input document, eight features are used to calculate a score for each sentence. In this paper a fuzzy logic method is proposed to improve the extraction of summary sentences.

Keywords: fuzzy logic, feature-based, fuzzy centroid, sentence score, text summarization.

I. Introduction

With the exponential growth in the quantity and complexity of information sources on the internet, it has become increasingly important to provide users with improved mechanisms for finding exact information in available documents. Text summarization has become an important and timely tool for helping users interpret the large volumes of text available in documents.

Automatic text summarization is the creation, by machine, of a summary of the source text that presents the most important information in a shorter version of the original while keeping its main content, helping the user to quickly understand large volumes of information.
Text summarization can handle the problem of selecting the most important portions of text as well as the problem of generating coherent summaries. Automatic text summarization differs significantly from human text summarization, since humans can capture and relate the deep meaning and themes of text documents, while automating such a skill is very difficult.

Automatic text summarization can be carried out mainly by two methods: extraction and abstraction. The extraction method selects the sentences or phrases with the highest scores from the original text and puts them together into a new, shorter text without changing the source text. The abstraction method uses linguistic methods to examine and interpret the text. Automatic text summarization works best on well-structured documents such as news, reports, articles and scientific papers.

The extraction method for summarization involves identifying features such as sentence length, sentence location, term frequency, number of words occurring in the title, number of proper nouns, number of numerical data and thematic words. Our approach uses a feature fusion technique to decide which of the available features are actually useful. This paper proposes text summarization based on fuzzy logic to extract important sentences as a summary.

II. Summarization Approaches

In early classic summarization systems, summaries were created according to the most frequent words in the text. Luhn created the first summarization system [1] in 1958. Rath et al. [2] in 1961 presented empirical evidence of the difficulties inherent in the notion of an ideal summary. Both studies used one thematic feature, term frequency, and are therefore characterized as surface level approaches. In the early 1960s new approaches called entity level approaches appeared; the first approach of this kind used syntactic analysis [3].
Location features were used in [4], where the key phrases dealt with three additional components: pragmatic words, title and heading words, and structural indicators.

In this paper the proposed method uses fuzzy rules and fuzzy sets for selecting sentences based on their features. Fuzzy logic, in the form of approximate reasoning, provides decision support and expert systems with powerful reasoning capabilities. The permissiveness of fuzziness in human thought processes suggests that much of the logic behind human reasoning is not only traditional two-valued or multi-valued logic, but also logic with fuzzy truths, fuzzy connectives, and fuzzy rules of inference [5]. Fuzzy set theory, proposed by Zadeh [6], is a mathematical tool for dealing with uncertainty, imprecision, vagueness and ambiguity.

Fuzzy logic in text summarization needs more investigation, and only a few studies have been done in this area. Witte and Bergler [7] presented a fuzzy-theory based approach to co-reference resolution and its application to text summarization; automatic determination of co-reference between noun phrases is fraught with uncertainty. Kiani and Akbarzadeh [8] proposed a technique for summarizing text that uses a combination of Genetic Algorithm (GA) and Genetic Programming (GP) to optimize the rule sets and membership functions of fuzzy systems.

Feature extraction techniques are used to obtain the important sentences in the text. Some features are more important than others, so their weights in the computation should be balanced. We use fuzzy logic to solve this problem by defining a membership function for each feature.

III. Experiment

3.1. Input Data and Preprocessing

We used news articles from news based URLs as input to the summarization system. The text portion of a news article fetched from a URL is saved in a plain text document that acts as the input document to the summarizer.

In this paper, preprocessing involves four main activities: sentence segmentation, tokenization, stop word removal and word stemming. Sentence segmentation detects sentence boundaries and separates the source text into sentences. Tokenization separates the input document into individual words. Stop word removal removes words that appear frequently in the document but contribute little to identifying its important content, such as 'a', 'an' and 'the'. The last preprocessing step is word stemming, the process of removing the prefixes and suffixes of each word.

3.2. Sentence Features

After preprocessing, each sentence of the document is represented by a vector of features. We use eight features for each sentence, and each feature is given a value between 0 and 1. The eight features are as follows.

3.2.1 Title Feature

This feature measures the similarity between the title sentence and every other sentence of the document.
This is determined by counting the number of matches between the content words in a sentence and the words in the title. The score for this feature is the ratio of the number of matches between a sentence and the title over the number of words in the title.

$$\text{Title}(S) = \frac{\text{number of title words in } S}{\text{number of words in the title}} \tag{1}$$

3.2.2. Sentence Length

We use this feature to filter out short sentences such as datelines and author names, which are not expected to belong to the summary. The length of a sentence is first calculated by counting its words and is then normalized: the score is the ratio of the length of the sentence over the length of the longest sentence in the document.

$$\text{Length}(S) = \frac{\text{number of words in } S}{\text{number of words in the longest sentence}} \tag{2}$$

3.2.3. Term Weight

This feature uses the concept of term frequency [10], which has often been used to calculate the importance of a sentence. By term frequency we mean the number of occurrences of a term within a document. We sum the term frequencies of all the terms in a sentence. The score of this feature is the ratio of this sum for a sentence over the maximum of the sums over all sentences in the document. The weight of sentence i is calculated by the following equation.

$$W_i = \sum_{j=1}^{k} TF_j \tag{3}$$

Here $TF_j$ is the term frequency of the j-th word of sentence i and k is the number of words in the sentence. The score for this feature is calculated as follows.

$$\text{TermWeight}(S_i) = \frac{W_i}{\max(W)} \tag{4}$$

3.2.4. Sentence Position
The position of a sentence in the text determines its importance. This feature can involve several items, such as the position of a sentence in the document, section or paragraph. For the score of this feature we consider the first 5 sentences in a document. The feature score is calculated as follows. (5)

3.2.5. Sentence to Sentence Similarity

This feature is the similarity between sentences. For each sentence S, the similarity between S and every other sentence is computed by the method of token matching. An N × N matrix is formed, where N is the total number of sentences in the document. The diagonal elements of the matrix are set to zero, as a sentence should not be compared with itself. The similarity of each sentence pair is calculated as in (6), where TM stands for the token matching method. (6)

The score of this feature is the ratio of the sum of the similarities of sentence S with every other sentence over the maximum of these sums, and is calculated as follows, writing $\text{Sim}(S_i, S_j)$ for the similarity of a sentence pair.

$$\text{Similarity}(S_i) = \frac{\sum_{j=1}^{N} \text{Sim}(S_i, S_j)}{\max_{1 \le n \le N} \sum_{j=1}^{N} \text{Sim}(S_n, S_j)} \tag{7}$$

3.2.6. Proper Noun

A sentence that contains more proper nouns (named entities) is important and is most probably included in the document summary. The score for this feature is calculated as the ratio of the number of proper nouns that occur in the sentence over the sentence length.

$$\text{ProperNoun}(S) = \frac{\text{number of proper nouns in } S}{\text{length of } S} \tag{8}$$

3.2.7. Thematic Word

A thematic word is a word that occurs frequently in a document and is probably related to its topic. The thematic word is the word with maximum possible relativity. We computed the frequency of occurrence of each word in a document and treated the top 5 most frequent content words as thematic. The score for this feature is calculated as the ratio of the number of thematic words that occur in a sentence over the maximum number of thematic words in a sentence.

$$\text{Thematic}(S) = \frac{\text{number of thematic words in } S}{\text{maximum number of thematic words in a sentence}} \tag{9}$$

3.2.8. Numerical Data

This feature counts the numerical data in a sentence.
A sentence that contains numerical data is important and is most probably included in the summary. The score for this feature is the ratio of the number of numerical data that occur in the sentence over the sentence length.

$$\text{Numerical data score} = \frac{\text{number of numerical data in the sentence}}{\text{sentence length}} \tag{10}$$

3.3 Text Summarization based on Fuzzy logic
The goal of text summarization based on extraction is sentence selection. Our system consists of the following main steps.
a. Read the source document into the system;
b. In the preprocessing step, the system extracts the individual sentences of the original document, separates the input document into individual words, removes stop words, and finally performs word stemming;
c. Each sentence is associated with a vector of the eight features described in Section 3.2, whose values are derived from the content of the sentence;
d. The score for each sentence is derived from its features using the fuzzy logic method;
e. A set of the highest-scoring sentences is extracted as the document summary, based on the compression rate.

To extract important sentences we used the fuzzy logic method. Fuzzy logic usually involves selecting fuzzy rules and membership functions, and this selection directly affects the performance of the fuzzy logic system.

The fuzzy logic system consists of four components: fuzzifier, inference engine, defuzzifier, and the fuzzy knowledge base. In the fuzzifier, crisp inputs are translated into linguistic values by using a membership function. After fuzzification, the inference engine refers to the rule base containing fuzzy IF-THEN rules to derive the linguistic values. In the last step, the output linguistic variables from the inference engine are converted by the defuzzifier, using a membership function, into the final crisp values that represent the final sentence score. Fig. 1 shows the architecture of text summarization based on the fuzzy logic system.

In order to implement text summarization based on fuzzy logic, the eight features extracted in the previous section are first used as input to the fuzzifier. We used triangular membership functions and fuzzy logic to summarize the document. The input triangular membership function for each feature is divided into five fuzzy sets: unimportant values (very low (VL) and low (L)), average values (medium (M)), and important values (high (H) and very high (VH)). The generalized triangular membership function depends on three parameters a, b, and c, where 'a' and 'c' are the left and right feet of the triangle and 'b' is its peak, as shown in Fig. 2. We used the fuzzy centroid method to calculate a score for each sentence of a document.
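The fuzzification step described above can be illustrated with a minimal Python sketch. The function names `triangular` and `fuzzify` and the table `FUZZY_SETS` are hypothetical, and the breakpoints of the five sets (evenly spaced peaks at 0, 0.25, 0.5, 0.75 and 1) are an assumption made for illustration only; the paper does not state the parameters actually used.

```python
def triangular(x: float, a: float, b: float, c: float) -> float:
    """Triangular membership with feet at a and c and peak at b (a < b < c)."""
    if x == b:
        return 1.0
    if x <= a or x >= c:
        return 0.0
    if x < b:
        return (x - a) / (b - a)  # rising edge
    return (c - x) / (c - b)      # falling edge


# ASSUMPTION: evenly spaced sets over the normalized score range [0, 1].
# The feet of VL and VH extend beyond the range so that scores of exactly
# 0 and 1 receive full membership.
FUZZY_SETS = {
    "VL": (-0.25, 0.00, 0.25),
    "L": (0.00, 0.25, 0.50),
    "M": (0.25, 0.50, 0.75),
    "H": (0.50, 0.75, 1.00),
    "VH": (0.75, 1.00, 1.25),
}


def fuzzify(score: float) -> dict:
    """Map a crisp feature score in [0, 1] to a membership degree per fuzzy set."""
    return {label: triangular(score, a, b, c)
            for label, (a, b, c) in FUZZY_SETS.items()}
```

Under these assumed breakpoints, a feature score of 0.6 belongs partly to M and partly to H and has zero membership in the other three sets, which is the kind of linguistic input the inference engine works with.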
A value from zero to one is obtained for each sentence in the output, based on the sentence features and the knowledge base. This value determines the degree of importance of the sentence in the final summary. The simplified fuzzy centroid is calculated by equation (11), where a, b, c are the standard values of low, medium and high respectively, and l, m, n are the calculated values of low, medium and high respectively.

In the inference engine, the most important part is the definition of the fuzzy IF-THEN rules. The important sentences are extracted by these rules according to our feature criteria. A sample IF-THEN rule is given below.

Figure 1: Text summarization based on fuzzy logic system architecture.

Figure 2: Fuzzy Centroid Calculation, showing the points (a, l), (b, m), (c, n) and the centroid C(x, y).
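The centroid step can be sketched in Python under a stated assumption: the body of equation (11) is not available here, so the sketch uses the common weighted-average form of a simplified centroid, C = (a·l + b·m + c·n) / (l + m + n). The function name `simplified_centroid` and the default standard values a = 0.0, b = 0.5, c = 1.0 are hypothetical and are not taken from the paper.

```python
def simplified_centroid(l: float, m: float, n: float,
                        a: float = 0.0, b: float = 0.5, c: float = 1.0) -> float:
    """Weighted average of the standard values a, b, c (low, medium, high),
    weighted by the calculated values l, m, n.

    ASSUMPTION: this weighted-average form stands in for equation (11);
    the defaults for a, b, c are illustrative only.
    """
    total = l + m + n
    if total == 0:
        return 0.0  # no output set was activated; treat the sentence as unimportant
    return (a * l + b * m + c * n) / total
```

With these assumed standard values the result always lies between zero and one, matching the output range described above, and a sentence whose calculated values lean toward "high" receives a score closer to one.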
Sample IF-THEN rule:
IF (NoWordInTitle is VH) and (SentenceLength is H) and (TermFreq is VH) and (SentencePosition is H) and (SentenceSimilarity is VH) and (NoProperNoun is M) and (NoThematicWord is VH) and (NumericalData is M) THEN (Sentence is important)

The last step in the fuzzy logic system is defuzzification. We used an output membership function divided into three membership functions, Output {Unimportant, Average, Important}, to convert the fuzzy results from the inference engine into a crisp output for the final score of each sentence.

3.4. Extraction of Sentences
In the fuzzy logic method described above, each sentence of the document is represented by its sentence score. All the sentences of a document are then ranked in descending order of their scores. A set of the highest-scoring sentences is extracted as the document summary based on a 20% compression rate. Finally, the summary sentences are arranged in their original order.

4. Evaluation and Results
The evaluation of the summaries is based on the two factors shown in Fig. 5. We used 30 news documents from news based URLs as input to the system. Human generated summaries are used as reference summaries for the evaluation of our results. The human generated summary acts as a gold standard, since humans can capture and relate deep meanings of the text better than machines. We received human generated summaries for our input documents from five different experts. We call the summaries of the Fuzzy summarizer, Copernic summarizer and MS Word summarizer the candidate summaries, and the human generated summaries the reference summaries.

Figure 3: Content-wise sentence similarity between candidate summary and reference summary

Figure 4: Position-wise sentence similarity between candidate summary and reference summary
Figure 5: Average similarity between candidate summary and reference summary

The first factor in the evaluation, shown in Fig. 3, is the number of sentences from the candidate summary that are similar to the sentences in the reference summary. For this factor the fuzzy summarizer exhibits the highest average resemblance to the reference summary, 81%, as shown in Fig. 5. The second factor, shown in Fig. 4, is the similarity between the positions of the sentences in the candidate summary and the reference summary. For this factor the fuzzy summarizer has an average 79% resemblance to the reference summary, which is the highest among the summarizers compared, as shown in Fig. 5. The results show that the fuzzy summarizer approach under consideration performs better on both factors than the Copernic summarizer and the MS Word 2007 summarizer.

IV. Conclusion and Future Work
In this paper we implemented automatic text summarization that involves feature based extraction of important sentences using the fuzzy logic method. The system was tested with 30 news documents and compared with the Copernic summarizer and the MS Word 2007 summarizer. The results show that the use of fuzzy logic in text summarization improves the quality of summaries. We applied our method to single document summarization; it could be extended to multi-document summarization.

The input consisted of 30 news documents in categories such as sports, politics, and weather. To generate the best summaries, it is necessary to incorporate automatic selection of fuzzy inference rules based on the type of input news. Our method could be extended with such automatic rule selection.

V. Acknowledgement
We would like to thank Solapur University and WIT Faculty for supporting us.

References

Journal Papers:
[1] H. P. Luhn, “The Automatic Creation of Literature Abstracts,” IBM Journal of Research and Development, vol. 2, pp. 159-165, 1958.
[2] G. J. Rath, A. Resnick, and T. R. Savage, “The formation of abstracts by the selection of sentences,” American Documentation, vol. 12, pp. 139-143, 1961.
[4] H. P. Edmundson, “New methods in automatic extracting,” Journal of the Association for Computing Machinery, vol. 16, no. 2, pp. 264-285, 1969.
[5] A. D. Kulkarni and D. C. Cavanaugh, “Fuzzy Neural Network Models for Classification,” Applied Intelligence, vol. 12, pp. 207-215, 2000.
[6] L. Zadeh, “Fuzzy sets,” Information and Control, vol. 8, pp. 338-353, 1965.
[10] G. Salton and C. Buckley, “Term-weighting approaches in automatic text retrieval,” Information Processing and Management, vol. 24, pp. 513-523, 1988. Reprinted in: K. Sparck-Jones and P. Willet (eds.), Readings in Information Retrieval, Morgan Kaufmann, pp. 323-328, 1997.

Books:
[3] Inderjeet Mani and Mark T. Maybury (eds.), Advances in Automatic Text Summarization (MIT Press, 1999).

Proceedings Papers:
[7] R. Witte and S. Bergler, “Fuzzy coreference resolution for summarization,” in Proceedings of 2003 International Symposium on Reference Resolution and Its Applications to Question Answering and Summarization (ARQAS), Venice, Italy: Università Ca' Foscari, pp. 43-50, 2003.
[8] Arman Kiani and M. R. Akbarzadeh, “Automatic Text Summarization Using: Hybrid Fuzzy GA-GP,” in Proceedings of 2006 IEEE International Conference on Fuzzy Systems, Sheraton Vancouver Wall Center Hotel, Vancouver, BC, Canada, pp. 977-983, 2006.
[9] J. Kupiec, J. Pedersen, and F. Chen, “A Trainable Document Summarizer,” in Proceedings of the Eighteenth Annual International ACM Conference on Research and Development in Information Retrieval (SIGIR), Seattle, WA, pp. 68-73, 1995.