SlideShare uma empresa Scribd logo
1 de 3
Integrated Intelligent Research(IIR) International Journal of Business Intelligent
Volume: 01 Issue: 01 June 2012,Pages No.15-17
ISSN: 2278-2400
15
Pos Tagging for Classical Tamil Texts
R.Akilan, E.R.Naganathan,
Central Institute of Classical Tamil, Chennai.
Velammal Engineering College, Chennai - 600053.
E-mail: akilan.rp@gmail.com,ernindia@gmail.com
Abstract-This paper presents a rule based model of parts of
speech (POS) tagset for Classical Tamil Texts (CTT). The noun
forms are type pattern, verb forms are token pattern. This is
based on form agreement method. This is a very efficient and
novel approach because Tamil Language has a build-in system of
agreement/concord of the sentence. Classical Tamil Tagset is
divided into two basic classifications, noun morphology and verb
morphology.
Keywords- parts of speech; tagset; token; type; insert
I. INTRODUCTION
Natural Language Processing (NLP) is a computerized approach
to analyze the text based on a set of theories and set of
technologies. And, being a very active area of research and
development, the basic objective of Natural Language
Processing is to facilitate human-machine interaction through
the means of natural human language. Research on NLP has
focused on various intermediate tasks that make partial sense of
language structure without requiring complete understanding
which, in turn, contributes to develop a successful system. Part-
Of-Speech tagging is one of the processes in which grammatical
categories are assigned to each token in the context from a given
set of tags called POS Tagset. It serves wide number of
applications like parser, morphology, speech synthesis and
recognition, information retrieval, information extraction,
partial parsing, machine translation, lexicography, Word
Sense Disambiguation (WSD), Corpus based learning and
teaching etc. There are a few POS Tagsets for Modern Tamil
Language such as IL-ILMT, AU-KBC, IL-POST Microsoft,
LDC-IL Mysore, etc. There are various methods in POS tagsets
like linguistic rules, stochastic models and hybrid approaches,
and each approach has its own merits and demerits
(Rajendran. 2007). In this context, Classical Tamil further
presents a challenge in developing an automatic POS tagger
as the language is highly inflectional and morphologically rich.
Hence, it is essential to consider text processing prior to POS
tagging in order to achieve high performance and more
reliability.
II. POS TAGGER FOR CLASSICAL TAMIL: AN
OVERVIEW
In the last few decades of NLP initiatives on Indian languages in
India, different research groups working on various languages
have developed POS tagger. In this paper, a brief summary of
POS tagger for Classical Tamil is being analyzed.Parts of
speech tagging assigns grammatical category of the language.
There are noun, verbs, adjective, adverbs, postposition, etc. A
POS tagset is developed on the basis of the information from a
particular language. The tagset consist finite tags and the
information extraction should be infinite. Each word assigns a
unique category. Tamil is an agglutinative language. In Tamil,
root or stem can be added with some suffixes that derivate new
words. Those words are divided into two parts namely lexical
word and grammatical word. A word can function
independently; but grammatical word cannot function
independently. Based on the above criteria, all the languages
make a distinction between open classes and close classes. Thus,
basically word classes may be grouped into two viz., open
classes and closed classes. In relation with this, Robin (a
linguist) quotes ”we can describe open classes as those, whose
membership is in principle unlimited, varying from time to time
and between one speaker and another’ and closed classes as
those that ‘contain a fixed and usually small number of member
words, which are (essentially) the same for all the speakers of the
language or dialect’ ” (Robins, 1964 quoted in Paul and
Shopen:1985). Thus nouns, verbs, adjectives and adverbs
belongs to open classes and pronouns, conjunction, etc., belong
to closed classes.
III. METHODOLOGY FOR THE WORD TAGGING
The noun form carries plural marker, case markers, clitics,
postposition, etc., The noun form may be a root or a stem or an
oblique base form. The root form means that there is no future
segmentation ( kāl ‘leg’, kaṇ ’eye’ ) and the base form means
that the root word can be added with any suffix for example
(kaṇṇaṉ (kaṇ+aṉ) ‘male person’, kuṉṟam(kuṉṟu+am) ‘hill’ ), the
oblique means the form that carries suffix or equivalent another
form (kiṇaṟṟu (kiṇaṟu ) ‘well’ , āṟṟu (āṟu) ‘river’ , eṉ (nāṉ) ‘I’ ,
taṉ (tāṉ)) ’my’. In such a way, the verb forms consist of verb
root and conjugated. There are seven major types of verb
conjugated is as follows Verbal base, Infinitive, Verbal noun,
Verbal participle, Relative participle, Participial noun and Finite
verb.
3.1 Plural marker
Plural means a grammatical form that designates more than one
of the things specified.
மலர்+கள் = மலர்கள் (NC)
flower+pl = flowers
Integrated Intelligent Research(IIR) International Journal of Business Intelligent
Volume: 01 Issue: 01 June 2012,Pages No.15-17
ISSN: 2278-2400
16
3.2
Case marker
Case marker is a suffix, it occurs after noun and pronoun.
(i) Accusative Case marker –ஐ (ai)
மலர்+ஐ = மலரர (NC)
flower+acc. =flower
(ii) Instrumental case marker –ஆல் (āl)
மலர்+ஆல் = மலரால் (NC)
flower + ins. = with flower
(iii) Sociative case marker– ஓடு (ōṭu)
மலர் + ஓடு = மலரராடு (NC)
flower + soc. =with flower
(iv) Dative case marker – க்கு (kku)
மலர் + க்கு = மலருக்கு (NC)
flower +dat.= to flower
(v) Genitive case marker – இன் (iṉ)
மலர் + இன் = மலரின் (NC)
flower +gen.= of flower
(vi) Locative case marker – கண
் (kaṇ)
மலர்+கண
் = மலர்க்கண
் (NC)
flower – flower + loc.
3.3 Postposition
Postposition is a word that occurs after noun which may or may
not be appendix with case marker.
மலர் + உள் (NC) = மலருள்
3.4 Verb
In general, verb form takes tense marker, person, number, gender
markers (PNG). There are no multiple meaning features of tense
markers and PNG markers. But it provides many conjugated
forms form the verbs. It is well known that in almost all natural
languages, verbs are considered to be the most important part of
speech. Verbs play a very important role in any languages. As
Tamil verbs are inflected to various grammatical categories the
bulk of Tamil parts of speech dealt with verb is necessary.
3.5 Adjective (ADJ)
A word or phrase naming an attribute, added to or grammatically
related to a noun to modify or describe it. There are two types of
adjectives. They are inherent and derivation. Both are marked as
single tag. Ex. எல்லா, பல (ellā, pala)
3.6 Adverb (ADV)
An adverb is a part of speech. It is a verb that modifies any part
of language, there are two types of adverbs. They are inherent
and derivation.
Both are marked as single tag (அங்கு, ஆங்கனம் (aṅku,
āṅkaṉam)).
3.7 Pronoun (PRO)
A pronoun is a word that takes the place of a oun. (அவன் ,
அவள் (avaṉ, avaḷ)).
3.8 Particle (PRT)
A particle is a function word that does not belong to any of the
inflected grammatical word classes (such as nouns, pronouns,
verbs, or articles). It is a catch-all term for a heterogeneous set of
words and terms that lack a precise lexical definition. It is mostly
used for words that help to encode grammatical categories (such
as negation, mood or case), or fillers or discourse markers that
facilitate discourse such as
well, ah, anyway, etc. என்று, என் பது (eṉṟu, eṉpatu)
3.9 Clitics
Clitics may belong to any grammatical category, though they are
commonly determiners, or adpositions.
மலர் +ஏ = மலரர (NC)
3.10 Numerals (NUM)
Numeral is a noun, divided into two types cardinal and ordinal.
Both the types can be marked as a single tag (ஒன்று,
இரண
் டு, ஒன் றாம், ஒன் றாவது (oṉṟu, iraṇṭu, oṉṟām,
oṉṟāvatu))
IV. POS TAGGER PROCEDURE AND
ARCHITECTURE
4.1 Procedure
POS Tagger has two tasks: Training task and tagging task. In tha
training task we have trained the validated tagged data into base-
level training module. This module generates a database that
validates the root word. It searches the form of the word in the
root dictionary if it can find out the root dictionary it assignd the
appropriate tag, if it fails, it goes to the root dictionary in order to
find out the rules of the suffix list to match the word. If the word
is available, it assigns the appropriate tag and if it fails, it names
it as an unknown category. The tagging process follows the
following procedural steps
Integrated Intelligent Research(IIR) International Journal of Business Intelligent
Volume: 01 Issue: 01 June 2012,Pages No.15-17
ISSN: 2278-2400
17
4.2 Architecture of POS
The architecture of POS Tagger consists of three layers such as
Data Layer (DL),Process Layer (BL) and Presentation Layer
(PL). The system of architecture is shown below
The Data layer is developed at the time of Training; it
encapsulates all the information related to data from the tagging
module like electronic text, hibernated text and Extensible
Markup Language Database. The Process layer contains logic
for retrieving persistent data from the DL and placing it into
process objects. The PL gives graphical user interface (GUI)
application for end user application
V. CONCLUSION
A significant part of the development of any Natural
Language Processing system, Parts Of Speech system has to be
framed. Tagged corpus is of basic resource for all NLP research.
The Parts of Speech tagger is mainly used for morphology
analyzer, generator, Parser , etc., The tool provide category of
the text in this case ambiguities are common, to avoid such
ambiguities, there is a need for syntactic and semantic
information of a particular word. Hence POS for Classical Tamil
plays major role in NLP.
REFERENCES
[1] Andronov, M. 1969. The Standard Grammar of Modern and Classical
Tamil. Madras: New Century Book House, Pvt. Ltd.
[2] Arulmozhi. P, Sobha. L, Kumara Shanmugam. B. Parts of Speech Tagger
for Tamil. In the Proceedings of the Symposium on Indian Morphology,
Phonology & Language Engineering, Indian Institute of Technology,
Kharagpur. pp 55- 57, 2004
[3] Arulmozhi. P, Sobha L, A Hybrid POS Tagger for a Relatively Free Word
Order Language. In proceedings of MSPIL-2006, Indian Institute of
Technology , Bombay. pp 79-85, 2006
[4] Anandan, P. K. Saravanan, Ranjani Parthasarathi and T. V. Geetha.
Morphological Analyzer for Tamil. In: International Conference on
Natural language Processing, 2002.
[5] Jesse Liberty and Donald Xie, ”Programming C# 3.0”, O’Reilly 5th
edition, 2007.
[6] Lakshmana Pandian S. and T.V. Geetha, “Morpheme Components based
Part of Speech tagging”, International Conference on Natural language
processing, 2008
[7] Rajendran S, (2007) Complexity of Tamil in POS tagging, Language in
India, Jan 2007.
[8] Rajan,K. Corpus analysis and tagging for Tamil. In: Proceeding of
symposium on Translation support system STRANS-2002
[9] Thomas Lehman. A grammar of modern Tamil, Pondicherry Institute of
Linguistic and culture, Pondicherry.1989.

Mais conteúdo relacionado

Semelhante a Pos Tagging for Classical Tamil Texts

NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflowseungwoo kim
 
Ijartes v1-i1-002
Ijartes v1-i1-002Ijartes v1-i1-002
Ijartes v1-i1-002IJARTES
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingMariana Soffer
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challengesantonellarose
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGEScsandit
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESLinda Garcia
 
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHODPART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHODijait
 
Difficulties in processing malayalam verbs
Difficulties in processing malayalam verbsDifficulties in processing malayalam verbs
Difficulties in processing malayalam verbsijaia
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUEJournal For Research
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorWaqas Tariq
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...ijnlc
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerijnlc
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShashank Shisodia
 
Classification of serialverb constructions
Classification of serialverb constructionsClassification of serialverb constructions
Classification of serialverb constructionsijaia
 

Semelhante a Pos Tagging for Classical Tamil Texts (20)

NLP Deep Learning with Tensorflow
NLP Deep Learning with TensorflowNLP Deep Learning with Tensorflow
NLP Deep Learning with Tensorflow
 
Ijartes v1-i1-002
Ijartes v1-i1-002Ijartes v1-i1-002
Ijartes v1-i1-002
 
551 466-472
551 466-472551 466-472
551 466-472
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Nlp (1)
Nlp (1)Nlp (1)
Nlp (1)
 
Ny3424442448
Ny3424442448Ny3424442448
Ny3424442448
 
Nlp
NlpNlp
Nlp
 
Ijetcas14 458
Ijetcas14 458Ijetcas14 458
Ijetcas14 458
 
Natural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and ChallengesNatural Language Processing: State of The Art, Current Trends and Challenges
Natural Language Processing: State of The Art, Current Trends and Challenges
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
 
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGESA SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
A SURVEY OF GRAMMAR CHECKERS FOR NATURAL LANGUAGES
 
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHODPART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
PART OF SPEECH TAGGING OFMARATHI TEXT USING TRIGRAMMETHOD
 
REPORT.doc
REPORT.docREPORT.doc
REPORT.doc
 
Difficulties in processing malayalam verbs
Difficulties in processing malayalam verbsDifficulties in processing malayalam verbs
Difficulties in processing malayalam verbs
 
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUECOMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
 
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text EditorDynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
Dynamic Construction of Telugu Speech Corpus for Voice Enabled Text Editor
 
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
International Journal on Natural Language Computing (IJNLC) Vol. 4, No.2,Apri...
 
An implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzerAn implementation of apertium based assamese morphological analyzer
An implementation of apertium based assamese morphological analyzer
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 
Classification of serialverb constructions
Classification of serialverb constructionsClassification of serialverb constructions
Classification of serialverb constructions
 

Mais de ijcnes

A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...ijcnes
 
Economic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian EconomyEconomic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian Economyijcnes
 
An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...ijcnes
 
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...ijcnes
 
Challenges of E-government in Oman
Challenges of E-government in OmanChallenges of E-government in Oman
Challenges of E-government in Omanijcnes
 
Power Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage SystemPower Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage Systemijcnes
 
Holistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining TechniquesHolistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining Techniquesijcnes
 
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...ijcnes
 
Feature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image RetrievalFeature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image Retrievalijcnes
 
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud ComputingChallenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud Computingijcnes
 
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...ijcnes
 
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...ijcnes
 
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor NetworksAn Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networksijcnes
 
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic VehiclesSecured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic Vehiclesijcnes
 
Virtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor NetworksVirtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor Networksijcnes
 
Mitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 FactorizationMitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 Factorizationijcnes
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniquesijcnes
 
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...ijcnes
 
Priority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSNPriority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSNijcnes
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based Systemijcnes
 

Mais de ijcnes (20)

A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...
 
Economic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian EconomyEconomic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian Economy
 
An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...
 
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
 
Challenges of E-government in Oman
Challenges of E-government in OmanChallenges of E-government in Oman
Challenges of E-government in Oman
 
Power Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage SystemPower Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage System
 
Holistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining TechniquesHolistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining Techniques
 
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
 
Feature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image RetrievalFeature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image Retrieval
 
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud ComputingChallenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
 
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
 
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
 
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor NetworksAn Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
 
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic VehiclesSecured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
 
Virtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor NetworksVirtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor Networks
 
Mitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 FactorizationMitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 Factorization
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniques
 
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
 
Priority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSNPriority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSN
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based System
 

Último

Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesPrabhanshu Chaturvedi
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Christo Ananth
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)simmis5
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSrknatarajan
 

Último (20)

Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
Glass Ceramics: Processing and Properties
Glass Ceramics: Processing and PropertiesGlass Ceramics: Processing and Properties
Glass Ceramics: Processing and Properties
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICSUNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
UNIT-IFLUID PROPERTIES & FLOW CHARACTERISTICS
 

Pos Tagging for Classical Tamil Texts

  • 1. Integrated Intelligent Research(IIR) International Journal of Business Intelligent Volume: 01 Issue: 01 June 2012,Pages No.15-17 ISSN: 2278-2400 15 Pos Tagging for Classical Tamil Texts R.Akilan, E.R.Naganathan, Central Institute of Classical Tamil, Chennai. Velammal Engineering College, Chennai - 600053. E-mail: akilan.rp@gmail.com,ernindia@gmail.com Abstract-This paper presents a rule based model of parts of speech (POS) tagset for Classical Tamil Texts (CTT). The noun forms are type pattern, verb forms are token pattern. This is based on form agreement method. This is a very efficient and novel approach because Tamil Language has a build-in system of agreement/concord of the sentence. Classical Tamil Tagset is divided into two basic classifications, noun morphology and verb morphology. Keywords- parts of speech; tagset; token; type; insert I. INTRODUCTION Natural Language Processing (NLP) is a computerized approach to analyze the text based on a set of theories and set of technologies. And, being a very active area of research and development, the basic objective of Natural Language Processing is to facilitate human-machine interaction through the means of natural human language. Research on NLP has focused on various intermediate tasks that make partial sense of language structure without requiring complete understanding which, in turn, contributes to develop a successful system. Part- Of-Speech tagging is one of the processes in which grammatical categories are assigned to each token in the context from a given set of tags called POS Tagset. It serves wide number of applications like parser, morphology, speech synthesis and recognition, information retrieval, information extraction, partial parsing, machine translation, lexicography, Word Sense Disambiguation (WSD), Corpus based learning and teaching etc. There are a few POS Tagsets for Modern Tamil Language such as IL-ILMT, AU-KBC, IL-POST Microsoft, LDC-IL Mysore, etc. There are various methods in POS tagsets like linguistic rules, stochastic models and hybrid approaches, and each approach has its own merits and demerits (Rajendran. 2007). In this context, Classical Tamil further presents a challenge in developing an automatic POS tagger as the language is highly inflectional and morphologically rich. Hence, it is essential to consider text processing prior to POS tagging in order to achieve high performance and more reliability. II. POS TAGGER FOR CLASSICAL TAMIL: AN OVERVIEW In the last few decades of NLP initiatives on Indian languages in India, different research groups working on various languages have developed POS tagger. In this paper, a brief summary of POS tagger for Classical Tamil is being analyzed.Parts of speech tagging assigns grammatical category of the language. There are noun, verbs, adjective, adverbs, postposition, etc. A POS tagset is developed on the basis of the information from a particular language. The tagset consist finite tags and the information extraction should be infinite. Each word assigns a unique category. Tamil is an agglutinative language. In Tamil, root or stem can be added with some suffixes that derivate new words. Those words are divided into two parts namely lexical word and grammatical word. A word can function independently; but grammatical word cannot function independently. Based on the above criteria, all the languages make a distinction between open classes and close classes. Thus, basically word classes may be grouped into two viz., open classes and closed classes. In relation with this, Robin (a linguist) quotes ”we can describe open classes as those, whose membership is in principle unlimited, varying from time to time and between one speaker and another’ and closed classes as those that ‘contain a fixed and usually small number of member words, which are (essentially) the same for all the speakers of the language or dialect’ ” (Robins, 1964 quoted in Paul and Shopen:1985). Thus nouns, verbs, adjectives and adverbs belongs to open classes and pronouns, conjunction, etc., belong to closed classes. III. METHODOLOGY FOR THE WORD TAGGING The noun form carries plural marker, case markers, clitics, postposition, etc., The noun form may be a root or a stem or an oblique base form. The root form means that there is no future segmentation ( kāl ‘leg’, kaṇ ’eye’ ) and the base form means that the root word can be added with any suffix for example (kaṇṇaṉ (kaṇ+aṉ) ‘male person’, kuṉṟam(kuṉṟu+am) ‘hill’ ), the oblique means the form that carries suffix or equivalent another form (kiṇaṟṟu (kiṇaṟu ) ‘well’ , āṟṟu (āṟu) ‘river’ , eṉ (nāṉ) ‘I’ , taṉ (tāṉ)) ’my’. In such a way, the verb forms consist of verb root and conjugated. There are seven major types of verb conjugated is as follows Verbal base, Infinitive, Verbal noun, Verbal participle, Relative participle, Participial noun and Finite verb. 3.1 Plural marker Plural means a grammatical form that designates more than one of the things specified. மலர்+கள் = மலர்கள் (NC) flower+pl = flowers
  • 2. Integrated Intelligent Research(IIR) International Journal of Business Intelligent Volume: 01 Issue: 01 June 2012,Pages No.15-17 ISSN: 2278-2400 16 3.2 Case marker Case marker is a suffix, it occurs after noun and pronoun. (i) Accusative Case marker –ஐ (ai) மலர்+ஐ = மலரர (NC) flower+acc. =flower (ii) Instrumental case marker –ஆல் (āl) மலர்+ஆல் = மலரால் (NC) flower + ins. = with flower (iii) Sociative case marker– ஓடு (ōṭu) மலர் + ஓடு = மலரராடு (NC) flower + soc. =with flower (iv) Dative case marker – க்கு (kku) மலர் + க்கு = மலருக்கு (NC) flower +dat.= to flower (v) Genitive case marker – இன் (iṉ) மலர் + இன் = மலரின் (NC) flower +gen.= of flower (vi) Locative case marker – கண ் (kaṇ) மலர்+கண ் = மலர்க்கண ் (NC) flower – flower + loc. 3.3 Postposition Postposition is a word that occurs after noun which may or may not be appendix with case marker. மலர் + உள் (NC) = மலருள் 3.4 Verb In general, verb form takes tense marker, person, number, gender markers (PNG). There are no multiple meaning features of tense markers and PNG markers. But it provides many conjugated forms form the verbs. It is well known that in almost all natural languages, verbs are considered to be the most important part of speech. Verbs play a very important role in any languages. As Tamil verbs are inflected to various grammatical categories the bulk of Tamil parts of speech dealt with verb is necessary. 3.5 Adjective (ADJ) A word or phrase naming an attribute, added to or grammatically related to a noun to modify or describe it. There are two types of adjectives. They are inherent and derivation. Both are marked as single tag. Ex. எல்லா, பல (ellā, pala) 3.6 Adverb (ADV) An adverb is a part of speech. It is a verb that modifies any part of language, there are two types of adverbs. They are inherent and derivation. Both are marked as single tag (அங்கு, ஆங்கனம் (aṅku, āṅkaṉam)). 3.7 Pronoun (PRO) A pronoun is a word that takes the place of a oun. (அவன் , அவள் (avaṉ, avaḷ)). 3.8 Particle (PRT) A particle is a function word that does not belong to any of the inflected grammatical word classes (such as nouns, pronouns, verbs, or articles). It is a catch-all term for a heterogeneous set of words and terms that lack a precise lexical definition. It is mostly used for words that help to encode grammatical categories (such as negation, mood or case), or fillers or discourse markers that facilitate discourse such as well, ah, anyway, etc. என்று, என் பது (eṉṟu, eṉpatu) 3.9 Clitics Clitics may belong to any grammatical category, though they are commonly determiners, or adpositions. மலர் +ஏ = மலரர (NC) 3.10 Numerals (NUM) Numeral is a noun, divided into two types cardinal and ordinal. Both the types can be marked as a single tag (ஒன்று, இரண ் டு, ஒன் றாம், ஒன் றாவது (oṉṟu, iraṇṭu, oṉṟām, oṉṟāvatu)) IV. POS TAGGER PROCEDURE AND ARCHITECTURE 4.1 Procedure POS Tagger has two tasks: Training task and tagging task. In tha training task we have trained the validated tagged data into base- level training module. This module generates a database that validates the root word. It searches the form of the word in the root dictionary if it can find out the root dictionary it assignd the appropriate tag, if it fails, it goes to the root dictionary in order to find out the rules of the suffix list to match the word. If the word is available, it assigns the appropriate tag and if it fails, it names it as an unknown category. The tagging process follows the following procedural steps
  • 3. Integrated Intelligent Research(IIR) International Journal of Business Intelligent Volume: 01 Issue: 01 June 2012,Pages No.15-17 ISSN: 2278-2400 17 4.2 Architecture of POS The architecture of POS Tagger consists of three layers such as Data Layer (DL),Process Layer (BL) and Presentation Layer (PL). The system of architecture is shown below The Data layer is developed at the time of Training; it encapsulates all the information related to data from the tagging module like electronic text, hibernated text and Extensible Markup Language Database. The Process layer contains logic for retrieving persistent data from the DL and placing it into process objects. The PL gives graphical user interface (GUI) application for end user application V. CONCLUSION A significant part of the development of any Natural Language Processing system, Parts Of Speech system has to be framed. Tagged corpus is of basic resource for all NLP research. The Parts of Speech tagger is mainly used for morphology analyzer, generator, Parser , etc., The tool provide category of the text in this case ambiguities are common, to avoid such ambiguities, there is a need for syntactic and semantic information of a particular word. Hence POS for Classical Tamil plays major role in NLP. REFERENCES [1] Andronov, M. 1969. The Standard Grammar of Modern and Classical Tamil. Madras: New Century Book House, Pvt. Ltd. [2] Arulmozhi. P, Sobha. L, Kumara Shanmugam. B. Parts of Speech Tagger for Tamil. In the Proceedings of the Symposium on Indian Morphology, Phonology & Language Engineering, Indian Institute of Technology, Kharagpur. pp 55- 57, 2004 [3] Arulmozhi. P, Sobha L, A Hybrid POS Tagger for a Relatively Free Word Order Language. In proceedings of MSPIL-2006, Indian Institute of Technology , Bombay. pp 79-85, 2006 [4] Anandan, P. K. Saravanan, Ranjani Parthasarathi and T. V. Geetha. Morphological Analyzer for Tamil. In: International Conference on Natural language Processing, 2002. [5] Jesse Liberty and Donald Xie, ”Programming C# 3.0”, O’Reilly 5th edition, 2007. [6] Lakshmana Pandian S. and T.V. Geetha, “Morpheme Components based Part of Speech tagging”, International Conference on Natural language processing, 2008 [7] Rajendran S, (2007) Complexity of Tamil in POS tagging, Language in India, Jan 2007. [8] Rajan,K. Corpus analysis and tagging for Tamil. In: Proceeding of symposium on Translation support system STRANS-2002 [9] Thomas Lehman. A grammar of modern Tamil, Pondicherry Institute of Linguistic and culture, Pondicherry.1989.