SlideShare uma empresa Scribd logo
1 de 4
Baixar para ler offline
Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006)



A Rule Based System for Speech Language Context
Understanding
Imran Sarwar Bajwa1, Muhammad Abbas Choudhary2
1 CISUC –Department of Informatics Engineering, University of Coimbra, 3030 Coimbra, Portugal
2 Balochistan University of Information Technology and Management Sciences, Quetta, Pakistan

imran@dei.uc.pt, abbas@buitms.edu.pk


                              Abstract                                         learning, understanding and using. On the other hand it is
                                                                               quite complicated to model natural languages for a
Speech or Natural language contents are major tools of                         computer to understand [3]. NLP includes techniques like
communication. This research paper presents a                                  word stemming (removing suffixes), lemmatization
natural language processing based automated system                             (replacing an inflected word with its base form), multiword
for understanding speech language text. A new rule                             phrase grouping, synonym normalization, part-of-speech
based model has been presented for analyzing the                               (POS) tagging (elaborations on noun, verb, preposition
natural languages and extracting the relative meanings                         etc.), word-sense disambiguation, anaphora resolution, and
from the given text. User writes the natural language                          role determination (subject and object), etc [1].
text in simple English in a few paragraphs and the
designed system has a sound ability of analyzing the                                 In modern era, computer machines have attained the
given script by the user. After composite analysis and                         ability of solving complex mathematical and statistical
extraction of associated information, the designed                             problems with speed and grace but still they are inefficient
system gives particular meanings to an assortment of                           to comprehend the basics of spoken and written languages
speech language text on the basis of its context. The                          [4]
                                                                                   . The designed automated system can be useful in various
designed system uses standard speech language rules                            business and technical software by only acquiring the
that are clearly defined for all speech languages as                           requirements from the user. The designed system will
English, Urdu, Chinese, Arabic, French, etc. The                               extract the required information from the given text and
designed system provides a quick and reliable way to                           provide to the computer for further automated processing.
comprehend speech language context and generate                                Applications can be automated generation of UML
respective meanings.                                                           diagrams, query processing, web mining, web template
Keywords: automatic text understanding, speech language                        designing, user interface designing, etc.
processing, information extraction, language engineering.
                                                                               2. Related Work
1. Introduction
                                                                                    Natural languages have been an area of interest from
    In daily life, natural languages or speech languages are                   last one century. In the late nineteen sixties and seventies,
main source of commutation. Particular and typical set of                      so many researchers as Noam Chomsky (1965) [5], Maron,
words and vocabulary collections are used for each field of                    M. E. and Kuhns, J. L (1960) [6], Chow, C., & Liu, C
life and they are quiet exclusive to their use. In computer                    (1968) [7] contributed in the area of information retrieval
science, Natural Language Processing (NLP) is the field                        from natural languages. They contributed for analysis and
used for the analysis of speech language text [1]. NLP                         understanding of the natural languages, but still there was
introduces many new concepts and new terminologies to                          lot of effort required for better understanding and analysis.
describe its models, techniques and processes. Some of                         Some authors concentrated in this area in eighties and
these terms come directly from linguistics and the science                     nineties as Losee, R. M (1998) [8], Salton, G., & McGill, M
of perception, while others were invented to describe                          (1995) [9], Krovetz, R., & Croft, W. B (1992) [10]. These
discoveries that did not fit into any previous category [2].                   authors worked for lexical ambiguity and information
Natural Language understanding is a hefty field to grasp.                      retrieval [7], probabilistic indexing [9], data bases handling
Understanding and comprehending a speech language                              [10]
                                                                                    and so many other related areas.
means to know that what concepts a word or a particular
phrase stands under a particular perspective. To give
                                                                               In this research paper, a newly designed rule based
meanings to a particular sentence a system should know
                                                                               framework has been proposed that is able to read the
how to link the concepts together in a meaningful way.
                                                                               English language text and extract its meanings after
Speech languages are easy for human beings in terms of
                                                                               analyzing and extracting related information. The

——————————————
ISSN 1672-5220 - CN 31-1920/N©2006                                        39
Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006)


conducted research provides a robust solution to the                 scenario provided by the user. This system draws in five
addressed problem. The functionality of the conducted                modules: Text input acquisition, text understanding,
research was domain specific but it can be enhanced easily           knowledge extraction, generation of speech language
in the future according to the requirements. Current                 contents and finally multi-lingual code generation as shown
designed system incorporate the capability of mapping user           in following fig.
requirements after reading the given requirements in plain
text and drawing the set of speech language contents.
                                                                                             Text

3. Speech Language Analysis                                                        Text Input Acquisition
Natural Language processing is field of interest for the
computer science researchers from last few decades.
Various statistical and non-statistical methods have been                          Contents Interpretation
used to design a robust algorithm to understand true
semantics of a natural language text. Neural networks can
be a great help in this context, especially imply the power
of Natural Language processing (NLP) in complex, real                               Knowledge Retrieval
world problems. NLP bases systems have abilities: part of
Speech (POS) tagging, error detection in annotated corpora
and self organization of semantic maps [9]. Human
Language Technology (HLT) helps in processing human                                  Meaning Extraction
languages for various applications: Speech Recognition,
Machine Translation, Text Generation and Text Mining.                                     Meanings
Text understanding and mining is one of the major
applications of natural language processing (NLP).
                                                                     Figure 1- Architecture of the designed Speech Language Context
 In Natural language processing various linguistic                   Understanding Model
techniques i.e. lexical and semantic analysis, have been
developed [10]. Text parsing in NLP can be a detailed                1. Text input acquisition
process or trivial process. Some applications e.g. text
                                                                         This module helps to acquire input text scenario. User
summarization and text generation needs detailed text
                                                                     provides the business scenario in from of paragraphs of the
processing. In detailed process every part of each sentence
                                                                     text. This module reads the input text in the form characters
is analyzed. Some applications just need trivial processing
                                                                     and generates the words by concatenating the input
e.g. text mining and web mining. In trivial processing of
                                                                     characters. This module is the implementation of the lexical
text, only certain passages or phrases within a sentence are
                                                                     phase. Lexicons and tokens are generated in this module.
processed [11].
Natural Language Processing require a series of actions to           2. Parts of Speech Tagging
process the text and retrieve required information. In this              Part of speech tagging is the process of assigning a part-
information, first of all knowledge about phonetics and              of-speech or other lexical speech marker to each word in a
phonology is required. These tasks further demand                    corpus. Rule-based taggers are the earliest taggers and they
recognition of variations in words that is called                    used hand-written disambiguation rules to assign a single
morphology. Afterwards, syntax of the text is understood             part-of-speech to each word [12]. This module categorizes
which requires the knowledge needed to order and group               tokens into various classes as verbs, nouns, pronouns,
words together. After syntax, semantical analysis is                 adjectives, prepositions, conjunctions, etc.
performed, in which the meanings of the sentences are
understood, mainly due to compositional semantics [12]. To           3. Information Repossession
have a more compound understanding of the sentence,
pragmatics are required, where the sentence is analyzed                  In information repossession module, distinct objects
according to the context. Discourse analysis is the last part        and classes and their respective attributes are extracted on
of NLP, where the semantic analysis of the linguistic                the basses of the input provided by the prior module. Nouns
structure is performed beyond the sentence level [13].               are symbolized as classes and objects and their associated
                                                                     characteristics are termed as attributes. In pattern analysis
4. Designed System Architecture                                      phase, irrelative rules and patterns are eliminated for
                                                                     efficient pattern discovery process [13].
The designed speech language contents system has ability
to draw speech language contents after reading the text



                                                                40
Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006)


4. Meaning extraction                                                 together for example "Ahmed played tennis with Ali." In
                                                                      this example Ahmed is actor and Ali is co-actor as Ahmed
    This module finally uses speech language contents
                                                                      is playing with Ali. Nouns and pronouns coming after the
symbols to constitute the various speech language contents
                                                                      verb phrase are normally co-actors.
by combining available symbols according to the
information extracted of the previous module. Distinct                c. Recipient object
representations are created and assigned to various
linguistic inputs. The process of verifying the meanings of           The recipient is the person for whom an action has been
a sentence is performed by matching the input                         performed. Recipient is that particular part of a sentence,
representations with the representation in the knowledge              about which that sentence is. "Ahmed brought the balls for
base [14].                                                            Ali." In this sentence, Ali is the recipient, where Ahmed is
                                                                      actor that has brought the ball for Ali.
5. Used Methodology                                                   d. Thematic object
    Conventional methods for natural language processing              The thematic object is an entity in the sentence, on which
use rule based techniques besides Hidden Markov Method                the action is being performed. Generally, ‘object’ part in a
(HMM), Neural Networks (NN), probabilistic theory and                 sentence is represented as the thematic object. Often the
statistical methods. Agents are another way to address this           thematic object is the same as the syntactic direct object, as
problem [8]. In the research, a rule-based algorithm has been         "Ahmed hit the ball." Here the ball is thematic object.
used which has robust ability to read, understand and
extract the desired information. First of all basic elements          e. Conveyance object
of the language grammar are extracted as verbs, nouns,                The conveyance is something in which or on which
adjectives, etc then on the basis of this extracted                   somebody travels. For example in the sentence “Aslam
information further processing is performed. In linguistic            always goes by train”, train is conveyance object.
terms, verbs often specify actions, and noun phrases the              Conveyance object normally appears after by or via
objects that participate in the action [14]. Each noun phrase         prepositions.
specifies that how an object participates in the action.
                                                                      f. Route object
A robust method that has robust ability to understand a
natural language sentence must discover the actor. Actor              Motion from source to destination takes place over at route
performs the action in a sentence. To manifest the                    or trajectory. Several prepositions can serve to introduce
meanings of a sentence some categorization has been                   trajectory noun phrases: through, from, to "Ahmed and
employed. In a sentence different words represent different           Aslam went in through the front door."
course of actions. Some are doing work and for some one a             g. Site object
work is being done. Some tool can be used to perform a
certain task on a certain place on a specific time. This type         The location is where an action occurs. As in the trajectory
of labeling can make the process of meaning representation            role, several prepositions are possible, which conveys
easy and simple. Following are major labels that are                  meanings in addition to serve as a signal that a location
applied to the different parts of the sentences: actor, co-           noun phrase is "Ahmed and Ali studied in the library, at a
actor, recipient, conveyance, route, site, time, duration.            desk, by the wall, near the door."
These labels can be identified by the help of prepositions.
                                                                      h. Time object
a. Actor object                                                       Time specifies the occasion of the occurrence of an action.
                                                                      Prepositions like at, before, and after introduce noun
The actor represents the person that causes an action in a            phrases serving as time. For example, "Ahmed and Ali left
sentence. For example in sentence "Ahmed hits the ball,"              before noon", before is the time object.
Ahmed is agent who performs the task. But in this example
a passive sentence, the agent also may appear after a                 i. Duration object
preposition for example: "The ball was hit by Ahmed.'' The
“subject” part in a sentence; that are Nouns and pronouns             Duration specifies how long an action takes. Preposition
appearing before the verb phrase in a sentence are                    such as for, since indicate duration. "Ali and Zia jogged for
nominated as actors.                                                  an hour.”

b. Co-Actor object                                                    6. Conclusion
                                                                         The designed system for speech language context
Co-actor is the joining phrase that serves as a partner of the
                                                                      understanding using a rule based algorithm is a robust
principal actor. Actor and co-actor carry out the action


                                                                 41
Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006)


framework for inferring the appropriate meanings from a                          [8]    Losee, R. M. (1988) “Parameter estimation for probabilistic
given text. The accomplished research is related to the                                 document retrieval models”. Journal of the American Society for
                                                                                        Information Science, 39(1), 1988, pp. 8–16.
understanding of the human languages. Human being needs
specific linguistic knowledge generating and understanding                       [9]    Salton, G., & McGill, M. (1995) “Introduction to Modern
speech language contents. It is difficult for computers to                              Information Retrieval” McGraw-Hill, New York., 1995
perform this task. The speech language context                                   [10]   Krovetz, R., & Croft, W. B. (1992) “Lexical ambiguity and
understanding using a rule based framework has ability to                               information retrieval”, ACM Transactions on Information Systems,
read user provided text, extract related information and                                10, 1992, pp. 115–141
ultimately give meanings to the extracted contents. The
                                                                                 [11]   Partee, B. H., Meulen, A. t., &Wall, R. E. (1999) “Mathematical
designed system very effective and have high accuracy up                                Methods in Linguistics”. Kluwer, Dordrecht, The Netherlands.
to 90 %. The designed system depicts the meanings of the                                1999.
given sentence or paragraph efficiently. An elegant
                                                                                 [12]   Voutilainen, A. (1995), “Constraint Grammar: A language
graphical user interface has also been provided to the user                             Independent system for parsing Unrestricted Text”, Morphological
for entering the Input scenario in a proper way and                                     disambiguation, pp. 165-284
generating speech language contents.
                                                                                 [13]   Jurafsky, Daniel, and James H. Martin. (2000). “Speech and
                                                                                        Language Processing: An Introduction to Natural Language
7. Future Work                                                                          Processing, Speech Recognition, and Computational Linguistics”.
                                                                                        Prentice-Hall
    The designed system analyzes and extracts the contents                       [14]   WangBin, LiuZhijing, (2003), “Web Mining Research”,
of a speech language as English. Current designed system                                Proceedings of Fifth International Conference on Computational
only works for the active-vice sentences. Passive-vice                                  Intelligence and Multimedia Applications, 2003. ICCIMA 2003,
                                                                                        Xi'an, CHINA. Page(s):84 - 89
sentences are still to work for to make the system more
efficient and effective for various business applications.
There is also periphery of improvements in the algorithms
for the better extraction of language parts and
understanding the meanings of the speech language
context. Current accuracy of generating meanings from a
text is about 85% to 90%. It can be enhanced up to 95% or
even more by improving the algorithms and inducing the
ability of learning in the system.

8. References

[1]   Anne Kao & Steve Poteet, (2005) “Text Mining and Natural
      Language Processing – Introduction for the Special Issue”, ACM
      Press New York, NY, USA

[2]   Blaschke C, Andrade M. A, Ouzounis C. and Valencia,A. (1999)
      “Automatic extraction of biological information from scientific
      text: protein–protein interactions”. In the proceedings of ISMB,
      1999, Heidelberg, Germany, pp. 60–67.

[3]   C. A. Thompson, R. J. Mooney and L. R. Tang, (1997) “Learning
      to parse natural language database queries into logical form”, in:
      Workshop on Automata Induction, Grammatical Inference and
      Language Acquisition, Nashville, Tennessee,1997.

[4]   Pustejovsky J, Castaño J., Zhang J, Kotecki M, Cochran B. (2002)
      “Robust relational parsing over biomedical literature: Extracting
      inhibit relations”. In proc. of Pacific Symposium of Bio-computing,
      362-373, 2002.

[5]   Maron, M. E. & Kuhns, J. L. (1997) “On relevance, probabilistic
      indexing, and information retrieval” Journal of the ACM, 1997, 7,
      216–244.

[6]   Chomsky, N. (1965) “Aspects of the Theory of Syntax. MIT Press,
      Cambridge, Mass, 1965.

[7]   Chow, C., & Liu, C. (1968) “Approximating discrete probability
      distributions with dependence trees”. IEEE Transactions on
      Information Theory, 1968, IT-14(3), 462–467.



                                                                            42

Mais conteúdo relacionado

Mais procurados

MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORKijitcs
 
Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...
Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...
Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...ijtsrd
 
Teachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificialTeachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificialCamillaTonanzi
 
Effective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic AmbiguityEffective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic AmbiguityIDES Editor
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...
Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...
Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...DR.P.S.JAGADEESH KUMAR
 
An Improved Approach for Word Ambiguity Removal
An Improved Approach for Word Ambiguity RemovalAn Improved Approach for Word Ambiguity Removal
An Improved Approach for Word Ambiguity RemovalWaqas Tariq
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingiosrjce
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Christoph Lange
 
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...ijnlc
 
A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...
A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...
A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...Editor IJARCET
 
Natural Language Processing and Machine Learning
Natural Language Processing and Machine LearningNatural Language Processing and Machine Learning
Natural Language Processing and Machine LearningKarthik Sankar
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Christoph Lange
 
A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...
A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...
A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...IJNSA Journal
 
Marathi Isolated Word Recognition System using MFCC and DTW Features
Marathi Isolated Word Recognition System using MFCC and DTW FeaturesMarathi Isolated Word Recognition System using MFCC and DTW Features
Marathi Isolated Word Recognition System using MFCC and DTW FeaturesIDES Editor
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 

Mais procurados (18)

MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
MULTILINGUAL SPEECH IDENTIFICATION USING ARTIFICIAL NEURAL NETWORK
 
Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...
Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...
Suggestion Generation for Specific Erroneous Part in a Sentence using Deep Le...
 
Teachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificialTeachbot teaching robot_using_artificial
Teachbot teaching robot_using_artificial
 
Effective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic AmbiguityEffective Approach for Disambiguating Chinese Polyphonic Ambiguity
Effective Approach for Disambiguating Chinese Polyphonic Ambiguity
 
10.1.1.35.8376
10.1.1.35.837610.1.1.35.8376
10.1.1.35.8376
 
NLPinAAC
NLPinAACNLPinAAC
NLPinAAC
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...
Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...
Speech Processing and Audio feedback for Tamil Alphabets using Artificial Neu...
 
An Improved Approach for Word Ambiguity Removal
An Improved Approach for Word Ambiguity RemovalAn Improved Approach for Word Ambiguity Removal
An Improved Approach for Word Ambiguity Removal
 
Speech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law compandingSpeech to text conversion for visually impaired person using µ law companding
Speech to text conversion for visually impaired person using µ law companding
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
 
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
 
A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...
A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...
A Combined Approach to Part-of-Speech Tagging Using Features Extraction and H...
 
Natural Language Processing and Machine Learning
Natural Language Processing and Machine LearningNatural Language Processing and Machine Learning
Natural Language Processing and Machine Learning
 
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
Ontology Integration and Interoperability (OntoIOp) – Part 1: The Distributed...
 
A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...
A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...
A MULTI-LAYER HYBRID TEXT STEGANOGRAPHY FOR SECRET COMMUNICATION USING WORD T...
 
Marathi Isolated Word Recognition System using MFCC and DTW Features
Marathi Isolated Word Recognition System using MFCC and DTW FeaturesMarathi Isolated Word Recognition System using MFCC and DTW Features
Marathi Isolated Word Recognition System using MFCC and DTW Features
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 

Destaque

Yanjiang22052010
Yanjiang22052010Yanjiang22052010
Yanjiang22052010yphs
 
Eye workshop
Eye workshopEye workshop
Eye workshopGoei
 
Web Layout Mining - JECS 29(2)
Web Layout Mining - JECS 29(2)Web Layout Mining - JECS 29(2)
Web Layout Mining - JECS 29(2)IT Industry
 
Diacritice romanesti
Diacritice romanestiDiacritice romanesti
Diacritice romanestinicoletta
 
Web Layout Generation (IC-SCCE 2006)
Web Layout Generation (IC-SCCE 2006)Web Layout Generation (IC-SCCE 2006)
Web Layout Generation (IC-SCCE 2006)IT Industry
 
Web User Forms (ICOMMS 2006)
Web User Forms (ICOMMS 2006)Web User Forms (ICOMMS 2006)
Web User Forms (ICOMMS 2006)IT Industry
 
Yanjiang22052010
Yanjiang22052010Yanjiang22052010
Yanjiang22052010yphs
 
Scientific Research Workshop
Scientific Research WorkshopScientific Research Workshop
Scientific Research WorkshopGoei
 
Optometry Workshop
Optometry WorkshopOptometry Workshop
Optometry WorkshopGoei
 
Basic Latex Typesetting - Session 2
Basic Latex Typesetting - Session 2Basic Latex Typesetting - Session 2
Basic Latex Typesetting - Session 2Kiel Granada
 
Basic Latex Typesetting - Session 1
Basic Latex Typesetting - Session 1Basic Latex Typesetting - Session 1
Basic Latex Typesetting - Session 1Kiel Granada
 
Philippe Violier - Comment un territoire devient-il une destination touristiq...
Philippe Violier - Comment un territoire devient-il une destination touristiq...Philippe Violier - Comment un territoire devient-il une destination touristiq...
Philippe Violier - Comment un territoire devient-il une destination touristiq...Offices de Tourisme de France
 
Expérience 10 - Retour sur l'opération "tous accueillants"
Expérience 10 - Retour sur l'opération "tous accueillants"Expérience 10 - Retour sur l'opération "tous accueillants"
Expérience 10 - Retour sur l'opération "tous accueillants"Offices de Tourisme de France
 

Destaque (13)

Yanjiang22052010
Yanjiang22052010Yanjiang22052010
Yanjiang22052010
 
Eye workshop
Eye workshopEye workshop
Eye workshop
 
Web Layout Mining - JECS 29(2)
Web Layout Mining - JECS 29(2)Web Layout Mining - JECS 29(2)
Web Layout Mining - JECS 29(2)
 
Diacritice romanesti
Diacritice romanestiDiacritice romanesti
Diacritice romanesti
 
Web Layout Generation (IC-SCCE 2006)
Web Layout Generation (IC-SCCE 2006)Web Layout Generation (IC-SCCE 2006)
Web Layout Generation (IC-SCCE 2006)
 
Web User Forms (ICOMMS 2006)
Web User Forms (ICOMMS 2006)Web User Forms (ICOMMS 2006)
Web User Forms (ICOMMS 2006)
 
Yanjiang22052010
Yanjiang22052010Yanjiang22052010
Yanjiang22052010
 
Scientific Research Workshop
Scientific Research WorkshopScientific Research Workshop
Scientific Research Workshop
 
Optometry Workshop
Optometry WorkshopOptometry Workshop
Optometry Workshop
 
Basic Latex Typesetting - Session 2
Basic Latex Typesetting - Session 2Basic Latex Typesetting - Session 2
Basic Latex Typesetting - Session 2
 
Basic Latex Typesetting - Session 1
Basic Latex Typesetting - Session 1Basic Latex Typesetting - Session 1
Basic Latex Typesetting - Session 1
 
Philippe Violier - Comment un territoire devient-il une destination touristiq...
Philippe Violier - Comment un territoire devient-il une destination touristiq...Philippe Violier - Comment un territoire devient-il une destination touristiq...
Philippe Violier - Comment un territoire devient-il une destination touristiq...
 
Expérience 10 - Retour sur l'opération "tous accueillants"
Expérience 10 - Retour sur l'opération "tous accueillants"Expérience 10 - Retour sur l'opération "tous accueillants"
Expérience 10 - Retour sur l'opération "tous accueillants"
 

Semelhante a NL Context Understanding 23(6)

Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)IT Industry
 
Meaning Extraction - IJCTE 2(1)
Meaning Extraction - IJCTE 2(1)Meaning Extraction - IJCTE 2(1)
Meaning Extraction - IJCTE 2(1)IT Industry
 
A Comprehensive Study On Natural Language Processing And Natural Language Int...
A Comprehensive Study On Natural Language Processing And Natural Language Int...A Comprehensive Study On Natural Language Processing And Natural Language Int...
A Comprehensive Study On Natural Language Processing And Natural Language Int...Scott Bou
 
An Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingAn Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingScott Faria
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challengesantonellarose
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguisticsAdnanBaloch15
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUEJournal For Research
 
Automatic text summarization of konkani texts using pre-trained word embeddin...
Automatic text summarization of konkani texts using pre-trained word embeddin...Automatic text summarization of konkani texts using pre-trained word embeddin...
Automatic text summarization of konkani texts using pre-trained word embeddin...IJECEIAES
 
INTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMINTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMijcsa
 
Natural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdfNatural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdfMoar Digital 360
 
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKSENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKijnlc
 
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural NetworkSentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Networkkevig
 
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert SystemModeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert Systemcsandit
 
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...IJERA Editor
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingIOSR Journals
 
Cross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interfaceCross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interfacepathsproject
 
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...kevig
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET Journal
 
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...PhD Assistance
 

Semelhante a NL Context Understanding 23(6) (20)

Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)Domain Specific Terminology Extraction (ICICT 2006)
Domain Specific Terminology Extraction (ICICT 2006)
 
Meaning Extraction - IJCTE 2(1)
Meaning Extraction - IJCTE 2(1)Meaning Extraction - IJCTE 2(1)
Meaning Extraction - IJCTE 2(1)
 
A Comprehensive Study On Natural Language Processing And Natural Language Int...
A Comprehensive Study On Natural Language Processing And Natural Language Int...A Comprehensive Study On Natural Language Processing And Natural Language Int...
A Comprehensive Study On Natural Language Processing And Natural Language Int...
 
An Overview Of Natural Language Processing
An Overview Of Natural Language ProcessingAn Overview Of Natural Language Processing
An Overview Of Natural Language Processing
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challenges
 
Computational linguistics
Computational linguisticsComputational linguistics
Computational linguistics
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
 
Automatic text summarization of konkani texts using pre-trained word embeddin...
Automatic text summarization of konkani texts using pre-trained word embeddin...Automatic text summarization of konkani texts using pre-trained word embeddin...
Automatic text summarization of konkani texts using pre-trained word embeddin...
 
INTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAMINTELLIGENT QUERY PROCESSING IN MALAYALAM
INTELLIGENT QUERY PROCESSING IN MALAYALAM
 
Natural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdfNatural Language Processing (NLP).pdf
Natural Language Processing (NLP).pdf
 
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORKSENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
SENTIMENT ANALYSIS IN MYANMAR LANGUAGE USING CONVOLUTIONAL LSTM NEURAL NETWORK
 
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural NetworkSentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
Sentiment Analysis In Myanmar Language Using Convolutional Lstm Neural Network
 
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert SystemModeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
 
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
 
Efficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And RecordingEfficient Intralingual Text To Speech Web Podcasting And Recording
Efficient Intralingual Text To Speech Web Podcasting And Recording
 
Cross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interfaceCross-lingual event-mining using wordnet as a shared knowledge interface
Cross-lingual event-mining using wordnet as a shared knowledge interface
 
Com ling
Com lingCom ling
Com ling
 
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
 
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival FrameworkIRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
 
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
Conversational AI:An Overview of Techniques, Applications & Future Scope - Ph...
 

Mais de IT Industry

The News Today 24 (https://thenewstoday24.com/)
The News Today 24 (https://thenewstoday24.com/)The News Today 24 (https://thenewstoday24.com/)
The News Today 24 (https://thenewstoday24.com/)IT Industry
 
NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)IT Industry
 
NL Interface for Database - EJSR 20(4)
NL Interface for Database - EJSR 20(4)NL Interface for Database - EJSR 20(4)
NL Interface for Database - EJSR 20(4)IT Industry
 
Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)IT Industry
 
Virtual Telemedicine (IJITWE 5(1))
Virtual Telemedicine (IJITWE 5(1))Virtual Telemedicine (IJITWE 5(1))
Virtual Telemedicine (IJITWE 5(1))IT Industry
 
NL to OCL Transformation (EDOC 2010)
NL to OCL Transformation (EDOC 2010)NL to OCL Transformation (EDOC 2010)
NL to OCL Transformation (EDOC 2010)IT Industry
 
BPM & SOA for Small Business Enterprises (ICIME 2009)
BPM & SOA for Small Business Enterprises (ICIME 2009)BPM & SOA for Small Business Enterprises (ICIME 2009)
BPM & SOA for Small Business Enterprises (ICIME 2009)IT Industry
 
UCD Generator (ICIET 2007)
UCD Generator (ICIET 2007)UCD Generator (ICIET 2007)
UCD Generator (ICIET 2007)IT Industry
 
UML Generator (NCC18)
UML Generator (NCC18)UML Generator (NCC18)
UML Generator (NCC18)IT Industry
 
Image Classification (icast 2006)
Image Classification  (icast 2006)Image Classification  (icast 2006)
Image Classification (icast 2006)IT Industry
 
Reuse Software Components (IMS 2006)
Reuse Software Components (IMS 2006)Reuse Software Components (IMS 2006)
Reuse Software Components (IMS 2006)IT Industry
 
GIS for Quetta (ICAST 2006)
GIS for Quetta (ICAST 2006)GIS for Quetta (ICAST 2006)
GIS for Quetta (ICAST 2006)IT Industry
 
Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)IT Industry
 
PCA Clouds (ICET 2005)
PCA Clouds (ICET 2005)PCA Clouds (ICET 2005)
PCA Clouds (ICET 2005)IT Industry
 
Feature Based Image Classification by using Principal Component Analysis
Feature Based Image Classification by using Principal Component AnalysisFeature Based Image Classification by using Principal Component Analysis
Feature Based Image Classification by using Principal Component AnalysisIT Industry
 

Mais de IT Industry (15)

The News Today 24 (https://thenewstoday24.com/)
The News Today 24 (https://thenewstoday24.com/)The News Today 24 (https://thenewstoday24.com/)
The News Today 24 (https://thenewstoday24.com/)
 
NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)
 
NL Interface for Database - EJSR 20(4)
NL Interface for Database - EJSR 20(4)NL Interface for Database - EJSR 20(4)
NL Interface for Database - EJSR 20(4)
 
Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)Requirement Analysis - ijcee 2(3)
Requirement Analysis - ijcee 2(3)
 
Virtual Telemedicine (IJITWE 5(1))
Virtual Telemedicine (IJITWE 5(1))Virtual Telemedicine (IJITWE 5(1))
Virtual Telemedicine (IJITWE 5(1))
 
NL to OCL Transformation (EDOC 2010)
NL to OCL Transformation (EDOC 2010)NL to OCL Transformation (EDOC 2010)
NL to OCL Transformation (EDOC 2010)
 
BPM & SOA for Small Business Enterprises (ICIME 2009)
BPM & SOA for Small Business Enterprises (ICIME 2009)BPM & SOA for Small Business Enterprises (ICIME 2009)
BPM & SOA for Small Business Enterprises (ICIME 2009)
 
UCD Generator (ICIET 2007)
UCD Generator (ICIET 2007)UCD Generator (ICIET 2007)
UCD Generator (ICIET 2007)
 
UML Generator (NCC18)
UML Generator (NCC18)UML Generator (NCC18)
UML Generator (NCC18)
 
Image Classification (icast 2006)
Image Classification  (icast 2006)Image Classification  (icast 2006)
Image Classification (icast 2006)
 
Reuse Software Components (IMS 2006)
Reuse Software Components (IMS 2006)Reuse Software Components (IMS 2006)
Reuse Software Components (IMS 2006)
 
GIS for Quetta (ICAST 2006)
GIS for Quetta (ICAST 2006)GIS for Quetta (ICAST 2006)
GIS for Quetta (ICAST 2006)
 
Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)Automated Java Code Generation (ICDIM 2006)
Automated Java Code Generation (ICDIM 2006)
 
PCA Clouds (ICET 2005)
PCA Clouds (ICET 2005)PCA Clouds (ICET 2005)
PCA Clouds (ICET 2005)
 
Feature Based Image Classification by using Principal Component Analysis
Feature Based Image Classification by using Principal Component AnalysisFeature Based Image Classification by using Principal Component Analysis
Feature Based Image Classification by using Principal Component Analysis
 

NL Context Understanding 23(6)

  • 1. Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006) A Rule Based System for Speech Language Context Understanding Imran Sarwar Bajwa1, Muhammad Abbas Choudhary2 1 CISUC –Department of Informatics Engineering, University of Coimbra, 3030 Coimbra, Portugal 2 Balochistan University of Information Technology and Management Sciences, Quetta, Pakistan imran@dei.uc.pt, abbas@buitms.edu.pk Abstract learning, understanding and using. On the other hand it is quite complicated to model natural languages for a Speech or Natural language contents are major tools of computer to understand [3]. NLP includes techniques like communication. This research paper presents a word stemming (removing suffixes), lemmatization natural language processing based automated system (replacing an inflected word with its base form), multiword for understanding speech language text. A new rule phrase grouping, synonym normalization, part-of-speech based model has been presented for analyzing the (POS) tagging (elaborations on noun, verb, preposition natural languages and extracting the relative meanings etc.), word-sense disambiguation, anaphora resolution, and from the given text. User writes the natural language role determination (subject and object), etc [1]. text in simple English in a few paragraphs and the designed system has a sound ability of analyzing the In modern era, computer machines have attained the given script by the user. After composite analysis and ability of solving complex mathematical and statistical extraction of associated information, the designed problems with speed and grace but still they are inefficient system gives particular meanings to an assortment of to comprehend the basics of spoken and written languages speech language text on the basis of its context. The [4] . The designed automated system can be useful in various designed system uses standard speech language rules business and technical software by only acquiring the that are clearly defined for all speech languages as requirements from the user. The designed system will English, Urdu, Chinese, Arabic, French, etc. The extract the required information from the given text and designed system provides a quick and reliable way to provide to the computer for further automated processing. comprehend speech language context and generate Applications can be automated generation of UML respective meanings. diagrams, query processing, web mining, web template Keywords: automatic text understanding, speech language designing, user interface designing, etc. processing, information extraction, language engineering. 2. Related Work 1. Introduction Natural languages have been an area of interest from In daily life, natural languages or speech languages are last one century. In the late nineteen sixties and seventies, main source of commutation. Particular and typical set of so many researchers as Noam Chomsky (1965) [5], Maron, words and vocabulary collections are used for each field of M. E. and Kuhns, J. L (1960) [6], Chow, C., & Liu, C life and they are quiet exclusive to their use. In computer (1968) [7] contributed in the area of information retrieval science, Natural Language Processing (NLP) is the field from natural languages. They contributed for analysis and used for the analysis of speech language text [1]. NLP understanding of the natural languages, but still there was introduces many new concepts and new terminologies to lot of effort required for better understanding and analysis. describe its models, techniques and processes. Some of Some authors concentrated in this area in eighties and these terms come directly from linguistics and the science nineties as Losee, R. M (1998) [8], Salton, G., & McGill, M of perception, while others were invented to describe (1995) [9], Krovetz, R., & Croft, W. B (1992) [10]. These discoveries that did not fit into any previous category [2]. authors worked for lexical ambiguity and information Natural Language understanding is a hefty field to grasp. retrieval [7], probabilistic indexing [9], data bases handling Understanding and comprehending a speech language [10] and so many other related areas. means to know that what concepts a word or a particular phrase stands under a particular perspective. To give In this research paper, a newly designed rule based meanings to a particular sentence a system should know framework has been proposed that is able to read the how to link the concepts together in a meaningful way. English language text and extract its meanings after Speech languages are easy for human beings in terms of analyzing and extracting related information. The —————————————— ISSN 1672-5220 - CN 31-1920/N©2006 39
  • 2. Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006) conducted research provides a robust solution to the scenario provided by the user. This system draws in five addressed problem. The functionality of the conducted modules: Text input acquisition, text understanding, research was domain specific but it can be enhanced easily knowledge extraction, generation of speech language in the future according to the requirements. Current contents and finally multi-lingual code generation as shown designed system incorporate the capability of mapping user in following fig. requirements after reading the given requirements in plain text and drawing the set of speech language contents. Text 3. Speech Language Analysis Text Input Acquisition Natural Language processing is field of interest for the computer science researchers from last few decades. Various statistical and non-statistical methods have been Contents Interpretation used to design a robust algorithm to understand true semantics of a natural language text. Neural networks can be a great help in this context, especially imply the power of Natural Language processing (NLP) in complex, real Knowledge Retrieval world problems. NLP bases systems have abilities: part of Speech (POS) tagging, error detection in annotated corpora and self organization of semantic maps [9]. Human Language Technology (HLT) helps in processing human Meaning Extraction languages for various applications: Speech Recognition, Machine Translation, Text Generation and Text Mining. Meanings Text understanding and mining is one of the major applications of natural language processing (NLP). Figure 1- Architecture of the designed Speech Language Context In Natural language processing various linguistic Understanding Model techniques i.e. lexical and semantic analysis, have been developed [10]. Text parsing in NLP can be a detailed 1. Text input acquisition process or trivial process. Some applications e.g. text This module helps to acquire input text scenario. User summarization and text generation needs detailed text provides the business scenario in from of paragraphs of the processing. In detailed process every part of each sentence text. This module reads the input text in the form characters is analyzed. Some applications just need trivial processing and generates the words by concatenating the input e.g. text mining and web mining. In trivial processing of characters. This module is the implementation of the lexical text, only certain passages or phrases within a sentence are phase. Lexicons and tokens are generated in this module. processed [11]. Natural Language Processing require a series of actions to 2. Parts of Speech Tagging process the text and retrieve required information. In this Part of speech tagging is the process of assigning a part- information, first of all knowledge about phonetics and of-speech or other lexical speech marker to each word in a phonology is required. These tasks further demand corpus. Rule-based taggers are the earliest taggers and they recognition of variations in words that is called used hand-written disambiguation rules to assign a single morphology. Afterwards, syntax of the text is understood part-of-speech to each word [12]. This module categorizes which requires the knowledge needed to order and group tokens into various classes as verbs, nouns, pronouns, words together. After syntax, semantical analysis is adjectives, prepositions, conjunctions, etc. performed, in which the meanings of the sentences are understood, mainly due to compositional semantics [12]. To 3. Information Repossession have a more compound understanding of the sentence, pragmatics are required, where the sentence is analyzed In information repossession module, distinct objects according to the context. Discourse analysis is the last part and classes and their respective attributes are extracted on of NLP, where the semantic analysis of the linguistic the basses of the input provided by the prior module. Nouns structure is performed beyond the sentence level [13]. are symbolized as classes and objects and their associated characteristics are termed as attributes. In pattern analysis 4. Designed System Architecture phase, irrelative rules and patterns are eliminated for efficient pattern discovery process [13]. The designed speech language contents system has ability to draw speech language contents after reading the text 40
  • 3. Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006) 4. Meaning extraction together for example "Ahmed played tennis with Ali." In this example Ahmed is actor and Ali is co-actor as Ahmed This module finally uses speech language contents is playing with Ali. Nouns and pronouns coming after the symbols to constitute the various speech language contents verb phrase are normally co-actors. by combining available symbols according to the information extracted of the previous module. Distinct c. Recipient object representations are created and assigned to various linguistic inputs. The process of verifying the meanings of The recipient is the person for whom an action has been a sentence is performed by matching the input performed. Recipient is that particular part of a sentence, representations with the representation in the knowledge about which that sentence is. "Ahmed brought the balls for base [14]. Ali." In this sentence, Ali is the recipient, where Ahmed is actor that has brought the ball for Ali. 5. Used Methodology d. Thematic object Conventional methods for natural language processing The thematic object is an entity in the sentence, on which use rule based techniques besides Hidden Markov Method the action is being performed. Generally, ‘object’ part in a (HMM), Neural Networks (NN), probabilistic theory and sentence is represented as the thematic object. Often the statistical methods. Agents are another way to address this thematic object is the same as the syntactic direct object, as problem [8]. In the research, a rule-based algorithm has been "Ahmed hit the ball." Here the ball is thematic object. used which has robust ability to read, understand and extract the desired information. First of all basic elements e. Conveyance object of the language grammar are extracted as verbs, nouns, The conveyance is something in which or on which adjectives, etc then on the basis of this extracted somebody travels. For example in the sentence “Aslam information further processing is performed. In linguistic always goes by train”, train is conveyance object. terms, verbs often specify actions, and noun phrases the Conveyance object normally appears after by or via objects that participate in the action [14]. Each noun phrase prepositions. specifies that how an object participates in the action. f. Route object A robust method that has robust ability to understand a natural language sentence must discover the actor. Actor Motion from source to destination takes place over at route performs the action in a sentence. To manifest the or trajectory. Several prepositions can serve to introduce meanings of a sentence some categorization has been trajectory noun phrases: through, from, to "Ahmed and employed. In a sentence different words represent different Aslam went in through the front door." course of actions. Some are doing work and for some one a g. Site object work is being done. Some tool can be used to perform a certain task on a certain place on a specific time. This type The location is where an action occurs. As in the trajectory of labeling can make the process of meaning representation role, several prepositions are possible, which conveys easy and simple. Following are major labels that are meanings in addition to serve as a signal that a location applied to the different parts of the sentences: actor, co- noun phrase is "Ahmed and Ali studied in the library, at a actor, recipient, conveyance, route, site, time, duration. desk, by the wall, near the door." These labels can be identified by the help of prepositions. h. Time object a. Actor object Time specifies the occasion of the occurrence of an action. Prepositions like at, before, and after introduce noun The actor represents the person that causes an action in a phrases serving as time. For example, "Ahmed and Ali left sentence. For example in sentence "Ahmed hits the ball," before noon", before is the time object. Ahmed is agent who performs the task. But in this example a passive sentence, the agent also may appear after a i. Duration object preposition for example: "The ball was hit by Ahmed.'' The “subject” part in a sentence; that are Nouns and pronouns Duration specifies how long an action takes. Preposition appearing before the verb phrase in a sentence are such as for, since indicate duration. "Ali and Zia jogged for nominated as actors. an hour.” b. Co-Actor object 6. Conclusion The designed system for speech language context Co-actor is the joining phrase that serves as a partner of the understanding using a rule based algorithm is a robust principal actor. Actor and co-actor carry out the action 41
  • 4. Journal of Donghua University (English Edition) Vol. 23 No. 06 (2006) framework for inferring the appropriate meanings from a [8] Losee, R. M. (1988) “Parameter estimation for probabilistic given text. The accomplished research is related to the document retrieval models”. Journal of the American Society for Information Science, 39(1), 1988, pp. 8–16. understanding of the human languages. Human being needs specific linguistic knowledge generating and understanding [9] Salton, G., & McGill, M. (1995) “Introduction to Modern speech language contents. It is difficult for computers to Information Retrieval” McGraw-Hill, New York., 1995 perform this task. The speech language context [10] Krovetz, R., & Croft, W. B. (1992) “Lexical ambiguity and understanding using a rule based framework has ability to information retrieval”, ACM Transactions on Information Systems, read user provided text, extract related information and 10, 1992, pp. 115–141 ultimately give meanings to the extracted contents. The [11] Partee, B. H., Meulen, A. t., &Wall, R. E. (1999) “Mathematical designed system very effective and have high accuracy up Methods in Linguistics”. Kluwer, Dordrecht, The Netherlands. to 90 %. The designed system depicts the meanings of the 1999. given sentence or paragraph efficiently. An elegant [12] Voutilainen, A. (1995), “Constraint Grammar: A language graphical user interface has also been provided to the user Independent system for parsing Unrestricted Text”, Morphological for entering the Input scenario in a proper way and disambiguation, pp. 165-284 generating speech language contents. [13] Jurafsky, Daniel, and James H. Martin. (2000). “Speech and Language Processing: An Introduction to Natural Language 7. Future Work Processing, Speech Recognition, and Computational Linguistics”. Prentice-Hall The designed system analyzes and extracts the contents [14] WangBin, LiuZhijing, (2003), “Web Mining Research”, of a speech language as English. Current designed system Proceedings of Fifth International Conference on Computational only works for the active-vice sentences. Passive-vice Intelligence and Multimedia Applications, 2003. ICCIMA 2003, Xi'an, CHINA. Page(s):84 - 89 sentences are still to work for to make the system more efficient and effective for various business applications. There is also periphery of improvements in the algorithms for the better extraction of language parts and understanding the meanings of the speech language context. Current accuracy of generating meanings from a text is about 85% to 90%. It can be enhanced up to 95% or even more by improving the algorithms and inducing the ability of learning in the system. 8. References [1] Anne Kao & Steve Poteet, (2005) “Text Mining and Natural Language Processing – Introduction for the Special Issue”, ACM Press New York, NY, USA [2] Blaschke C, Andrade M. A, Ouzounis C. and Valencia,A. (1999) “Automatic extraction of biological information from scientific text: protein–protein interactions”. In the proceedings of ISMB, 1999, Heidelberg, Germany, pp. 60–67. [3] C. A. Thompson, R. J. Mooney and L. R. Tang, (1997) “Learning to parse natural language database queries into logical form”, in: Workshop on Automata Induction, Grammatical Inference and Language Acquisition, Nashville, Tennessee,1997. [4] Pustejovsky J, Castaño J., Zhang J, Kotecki M, Cochran B. (2002) “Robust relational parsing over biomedical literature: Extracting inhibit relations”. In proc. of Pacific Symposium of Bio-computing, 362-373, 2002. [5] Maron, M. E. & Kuhns, J. L. (1997) “On relevance, probabilistic indexing, and information retrieval” Journal of the ACM, 1997, 7, 216–244. [6] Chomsky, N. (1965) “Aspects of the Theory of Syntax. MIT Press, Cambridge, Mass, 1965. [7] Chow, C., & Liu, C. (1968) “Approximating discrete probability distributions with dependence trees”. IEEE Transactions on Information Theory, 1968, IT-14(3), 462–467. 42