IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Reinforced Concrete

IRJET Journal
IRJET JournalFast Track Publications

https://irjet.net/archives/V5/i8/IRJET-V5I8236.pdf

International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 08 | Aug 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 781
Tweet Segmentation and Its Application to Named Entity Recognition
Ramya
1Ramya , Address: Thanjavur
2Professor: Mrs. Hemalatha, M.Sc., M.Phil., M.C.A., Dept. of Computer Science, Shrimathi Indira Gandhi College,
Tamil Nadu, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Twitter has attracted millions of users to
share and disseminate most up-to-date information, resulting
in large volumes of data produced every day. However, many
applications in Information Retrieval (IR) and Natural
Language Processing (NLP) suffer severely from the noisyand
short nature of tweets. In this project, propose a novel
framework for tweet segmentation in a batch mode, called
Hybrid Segmentation. By splitting tweets into meaningful
segments, the semantic or context information is well
preserved and easily extracted by the downstream
applications. HybridSeg finds the optimal segmentation of a
tweet by maximizing the sum of the stickiness scores of its
candidate segments. The stickiness score considers the
probability of a segment being a phrase in English (i.e., global
context) and the probability of a segment being a phrase
within the batch of tweets (i.e., local context). For the latter,
propose and evaluate two models to derive local context by
considering the linguistic features and term-dependency in a
batch of tweets, respectively. HybridSeg is also designed to
iteratively learn from confident segments as pseudo feedback.
Experiments on two tweet data sets show that tweet
segmentation quality is significantly improved by learning
both global and local contexts compared with using global
context alone. Through analysis and comparison, show that
local linguistic features are more reliable for learning local
context compared with term-dependency. As an application,
show that high accuracy is achieved in named entity
recognition by applying segment-based part-of-speech (POS)
tagging.
Key Words: Twitter stream, tweet segmentation,named
entity recognition, linguistic processing, Wikipedia
1. INTRODUCTION
Tweets are posted for information sharing and
communication. The named entities and semantic phrases
are well preserved in tweets. The method realizing the
proposed framework that solely relies on global context is
denoted by HybridSeg. Tweets are highly time-sensitive.The
well preserved linguistic features in these tweets facilitate
named entity recognition with high accuracy. Each named
entity is a valid segment. The method utilizinglocallinguistic
features is denoted by HybridSeg NER. It obtains confident
segments based on the voting resultsofmultipleoff-the-shelf
NER tools. Another method utilizing local collocation
knowledge, denoted by HybridSeg N-Gram, is proposed
based on the observation that many tweets publishedwithin
a short time period are about the same topic. HybridSeg N-
Gram segments tweets by estimating the term-dependency
within a batch of tweets. To improve POS tagging on tweets,
Ritter et al. train a POS tagger by using CRF model with
conventional and tweet-specific features. Even thoughshort
text strings might be a problem, sentiment analysis
within micro blogging hasshown that Twitter canbeseenas
a valid online indicator of political sentiment. A lot of
advertising and branding is now tied to Twitter.
Furthermore, and most importantly, the first place one goes
to find any breaking news is to search it on Twitter. The
focus of this project is clustering with the TF-IDF weighted
mechanism of daily technology news tweets of prominent
bloggers and news sites using Apache Mahout and to
evaluate the effectsof introducing andremovinganonymous
wordson the quality of clustering. This projectrestrictsitself
to only tweets in the English language.
2. Existing System
Nowadays people are using so many social
networking sitesare. But the people using allsitesforupdate
statesand sharing photos, videosand so on. There is noalert
for bad weather situation, earthquakes and so on. The
problem is that lot of time consumed to the people for
checking the message submitted in the social networking
and file a case in cyber crime. After verification cyber crime
defense commission will remove the post according to the
cyber crime rules in a span of time. This will not solve the
problem only to remove the thingsafter the issuearises. Due
to ignorance and lack of understanding of Social Media
privacy feature, people make many mistakes. Another
situation to consider has to do with the availability of
information too personal, whether in pictures or text. You
cannot give out too much personal information. As
throughout the Internet. The User can send their post with
some illegal words and letters.
2.1 Survey
A. User tracking using tweet segmentation and word
Many organizations have been reported to create and
monitoring targeted Twitter streams to collect a bunch of
information and understand according to user's view.
Targeted Twitter stream is main usually constructed by
filtering tweets and that abused words with predefined
selection criteria. Due to its invaluable business value of
timely information from these tweets, it's a necessary to
understand that abused word's language for a large body of
downstream applications, such as named entity recognition
(NER), event detecting and summarizing that particular
word, opinion mining, sentiment analysis, and etc. In these
proposed system application is developed which take tweet
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 08 | Aug 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 782
is input and search semantic negative or illegal words from
database. Generate report of that abusing word and send to
cyber crime's site. Then depending on the tweet, track the
related information of the person. Track allthetweetsofthat
particular person and track that person through
identification databases. Then take a action by cyber crime.
And after that prevent the tweet.
B. Measuring the controversy level of Arabic trending
topics on Twitter
Social micro-blogging systemslike Twitter are usedtodayas
a platform that enables its users to write down about
different topics. One important aspect of such human
interactions is the existence of debateanddisagreement.The
most heated debates are found on controversial topics.
Detecting such topics can be very beneficial in
understanding the behavior of online social networks users
and the dynamics of their interactions. Such an
understanding leads to better ways of handling and
predicting how the "online crowds" will act. Several
approaches have been proposed for detectingcontroversyin
online communication. Some of them represent the
interactions in the form of graphsand study their properties
in order to determine whether the topic of interaction is
controversial or not. Other approachesrely onthecontentof
the exchanged messages. In this study, we focus on the
former approach in identifying the controversy level of the
trending topics on Twitter. Unlike many previous works,
then do not limit ourselves to a certain domain. Moreover,
the main focus on social content written in Arabic about hot
events occurring in the Middle East. To the best of our
knowledge, ours is the first work to undertake thisapproach
in studying controversy in general topics written in Arabic.
Collect a large dataset of tweets on different trending topics
from different domains. First apply several approaches for
controversy detection and compare their outcomes to
determine which one is the most consistent measure.
C. Event Detection and Key Posts Discovering in Social
Media Data Streams
Microblogging, as a popular social mediaservice,hasbecome
a new information channel for usersto receiveandexchange
the most up-to-date information on current events.
Consequently, it is crucial to detect newemergingeventsand
to discover the key posts which have the potentialtoactively
disseminate the events through microblogging. However,
traditional event detection models require human
intervention for detecting the number of topics to be
explored, which significantly reduces the efficiency and
accuracy of event detection. Most of the existing methods
focus only on event detection and are unable to discover the
key posts related to the events, making itchallengingtotrack
momentous events in timely manner. To tackle these
problems, a HITS (Hypertext Induced Topic Search) based
topic decision method, named TD-HITSisproposed.TD-HITS
can automatically detect the number of topics as well as
discovering associated key posts from a large number of
posts. The experimental results are based on a Twitter
dataset to demonstrate the effectiveness of our proposed
methodsfor both detecting eventsand identifyingkeyposts.
D. Tweet modeling with LSTM recurrent neural
networks for hash tag recommendation
The hash symbol, called a hash tag, is used to mark the
keyword or topic in a tweet. It was created organically by
users as a way to categorize messages. Hash tags also
provide valuable information for manyresearchapplications
such as sentiment classification and topic analysis.However,
only a small number of tweets are manually annotated.
Therefore, an automatic hashtagrecommendationmethodis
needed to help userstag their new tweets.Previousmethods
mostly use conventional machine learning classifierssuchas
SVM or utilize collaborative filtering technique. A bottleneck
of these approachesisthat they all use the TF-IDF schemeto
represent tweets and ignore the semantic information in
tweets. In this paper, we also regard hash tag
recommendation asa classification task but propose a novel
recurrent neural network model tolearn vector-basedtweet
representations to recommend hashtags. More precisely,we
use a skip-gram model to generate distributed word
representations and then apply a convolution neural
network to learn semantic sentence vectors. Afterwards,we
make use of the sentence vectors to train a long short-term
memory recurrent neural network (LSTM-RNN).Wedirectly
use the produced tweet vectors as features to classify
hashtags without any feature engineering. Experiments on
real world data from Twitter to recommend hashtags show
that our proposed LSTM-RNN model outperforms state-of-
the-art methods and LSTM unit also obtains the best
performance compared to standard RNN and gated
recurrent unit (GRU).
E. Exploring human emotion via Twitter
Sentiment analysis or opinion mining on twitter data is an
emerging topic in research. In this paper, we have described
a system for emotion analysis of tweets using only the core
text. Tweets are usually short, more ambiguousandcontains
a huge amount of noisy data, sometimes it is difficult to
understand the user's opinion. The main challenge is to
feature extraction for the purpose of classification and
feature extraction depends on the perfection of
preprocessing of a tweet. The preprocessing is the most
difficult task, since it can be done in various ways and the
methods or steps applied in preprocessing are not distinct.
Most of the researches in this topic, have been focused on
binary (positive and negative) and 3-way (positive, negative
and neutral) classifications. In this paper, we have focused
on emotion classification of tweets as multi-class
classification. We have chosen basic human emotions
(happiness, sadness, surprise, disgust) and neutral as our
emotion classes. According to the experimental results, our
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 08 | Aug 2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 783
approach improved the performance of multi-class
classification of twitter data.
3. Proposed System
Tweets are posted for information sharing and
communication. The named entities and semantic phrases
are well preserved in tweets. The global context derived
from Web pages or Wikipedia therefore helps identifying
the meaningful segments in tweets. The method realizing
the proposed framework that solely relies on global
context is denoted by HybridSegWeb. Tweets are highly
time-sensitive so that many emerging phrases like “She
Dancing” cannot be found in external knowledge bases.
However, considering a large number of tweets published
within a short time period containing the phrase, it is not
difficult to recognize “She Dancing” as a valid and
meaningful segment. Therefore investigate two local
contexts, namely local linguistic features and local
collocation. Observe that tweets from many official
accounts of news agencies, organizations, and advertisers
are likely well written. The well preserved linguistic
features in these tweets facilitate named entity recognition
with high accuracy. Each named entity is a valid segment.
The method utilizing local linguistic features is denoted by
HybridSegNER. It obtains confident segments based on the
voting results of multiple off-the-shelf NER tools.
Fig1- Sample Twitter Segmentation
Text Mining Algorithm
Text mining, also referred to as text data mining, roughly
equivalent to text analytics, is the process of deriving high-
quality information from text. High-quality information is
typically derived through the devising of patternsandtrends
through means such as statistical pattern learning. Text
mining usually involves the process of structuring the input
text (usually parsing, along with the addition of some
derived linguistic features and the removal of others, and
subsequent insertion into a database), deriving patterns
within the structured data, and finally evaluation and
interpretation of the output. 'High quality' in text mining
usually refers to some combination of relevance, novelty,
and interestingness. Typical text mining tasks include text
categorization, text clustering, concept/entity extraction,
production of granular taxonomies, sentiment analysis,
document summarization, and entityrelationmodeling.Text
analysis involves information retrieval, lexical analysis to
study word frequency distributions, pattern recognition,
tagging/annotation, information extraction, data mining
techniques including link and association analysis,
visualization, and predictiveanalytics. The overarchinggoal
is, essentially, to turn text into data for analysis, via
application of natural language processing and analytical
methods.
4. Conclusion
The HybridSeg framework which segments tweets
into meaningful phrases called segments using both global
and local context. Through our framework, we demonstrate
that local linguistic features are more reliable than term
dependency in guiding the segmentation process. This
finding opens opportunities for tools developed for formal
text to be applied to tweets which are believed to be much
more noise than formal text. Tweet segmentation helps to
preserve the semantic meaning of tweets, which
subsequently benefits many downstream applications, e.g.
named entity recognition. Through experiments, we show
that segment based named entity recognition methods
achieves much better accuracy than the word-based
alternative.
REFERENCES
1. Twiner: Named entity recognition in targeted
twitter stream- C. Li, J. Weng, Q. He, Y. Yao, A. Datta,
A. Sun, and B.-S. Lee.
2. Augmenting naive bayes classifiers with statistical
language models- F. Peng, D. Schuurmans, and S.
Wang.
3. Topic sentiment analysis in twitter: a graph-based
hash tag sentiment classification approach- X.
Wang, F. Wei, X. Liu, M. Zhou, and M. Zhang.
4. Twevent: segment-based event detection from
tweets- C. Li, A. Sun, and A. Datta-
5. Incorporating nonlocal information into
information extraction systems by gibbs sampling-
J. R. Finkel, T. Grenager, and C. Manning.

Recomendados

Detection and Analysis of Twitter Trending Topics via Link-Anomaly Detection por
Detection and Analysis of Twitter Trending Topics via Link-Anomaly DetectionDetection and Analysis of Twitter Trending Topics via Link-Anomaly Detection
Detection and Analysis of Twitter Trending Topics via Link-Anomaly DetectionIJERA Editor
277 visualizações3 slides
News construction from microblogging post using open data por
News construction from microblogging post using open dataNews construction from microblogging post using open data
News construction from microblogging post using open dataFrancisco Berrizbeitia
966 visualizações7 slides
Analyzing-Threat-Levels-of-Extremists-using-Tweets por
Analyzing-Threat-Levels-of-Extremists-using-TweetsAnalyzing-Threat-Levels-of-Extremists-using-Tweets
Analyzing-Threat-Levels-of-Extremists-using-TweetsRESHAN FARAZ
89 visualizações6 slides
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci... por
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...IRJET Journal
21 visualizações4 slides
IRJET- Twitter Spammer Detection por
IRJET- Twitter Spammer DetectionIRJET- Twitter Spammer Detection
IRJET- Twitter Spammer DetectionIRJET Journal
73 visualizações6 slides
Retrieving Hidden Friends a Collusion Privacy Attack against Online Friend Se... por
Retrieving Hidden Friends a Collusion Privacy Attack against Online Friend Se...Retrieving Hidden Friends a Collusion Privacy Attack against Online Friend Se...
Retrieving Hidden Friends a Collusion Privacy Attack against Online Friend Se...ijtsrd
93 visualizações5 slides

Mais conteúdo relacionado

Mais procurados

IRJET- Fake Profile Identification using Machine Learning por
IRJET-  	  Fake Profile Identification using Machine LearningIRJET-  	  Fake Profile Identification using Machine Learning
IRJET- Fake Profile Identification using Machine LearningIRJET Journal
2.2K visualizações6 slides
Detection of cyber-bullying por
Detection of cyber-bullying Detection of cyber-bullying
Detection of cyber-bullying Ziar Khan
4.2K visualizações15 slides
IRJET - Social Network Question and Answer System por
IRJET - Social Network Question and Answer SystemIRJET - Social Network Question and Answer System
IRJET - Social Network Question and Answer SystemIRJET Journal
9 visualizações3 slides
Detecting the presence of cyberbullying using computer software por
Detecting the presence of cyberbullying using computer softwareDetecting the presence of cyberbullying using computer software
Detecting the presence of cyberbullying using computer softwareAshish Arora
5.2K visualizações40 slides
IRJET- Fake News Detection and Rumour Source Identification por
IRJET- Fake News Detection and Rumour Source IdentificationIRJET- Fake News Detection and Rumour Source Identification
IRJET- Fake News Detection and Rumour Source IdentificationIRJET Journal
75 visualizações4 slides
FAKE NEWS DETECTION PPT por
FAKE NEWS DETECTION PPT FAKE NEWS DETECTION PPT
FAKE NEWS DETECTION PPT VaishaliSrigadhi
26.3K visualizações25 slides

Mais procurados(20)

IRJET- Fake Profile Identification using Machine Learning por IRJET Journal
IRJET-  	  Fake Profile Identification using Machine LearningIRJET-  	  Fake Profile Identification using Machine Learning
IRJET- Fake Profile Identification using Machine Learning
IRJET Journal2.2K visualizações
Detection of cyber-bullying por Ziar Khan
Detection of cyber-bullying Detection of cyber-bullying
Detection of cyber-bullying
Ziar Khan4.2K visualizações
IRJET - Social Network Question and Answer System por IRJET Journal
IRJET - Social Network Question and Answer SystemIRJET - Social Network Question and Answer System
IRJET - Social Network Question and Answer System
IRJET Journal9 visualizações
Detecting the presence of cyberbullying using computer software por Ashish Arora
Detecting the presence of cyberbullying using computer softwareDetecting the presence of cyberbullying using computer software
Detecting the presence of cyberbullying using computer software
Ashish Arora5.2K visualizações
IRJET- Fake News Detection and Rumour Source Identification por IRJET Journal
IRJET- Fake News Detection and Rumour Source IdentificationIRJET- Fake News Detection and Rumour Source Identification
IRJET- Fake News Detection and Rumour Source Identification
IRJET Journal75 visualizações
FAKE NEWS DETECTION PPT por VaishaliSrigadhi
FAKE NEWS DETECTION PPT FAKE NEWS DETECTION PPT
FAKE NEWS DETECTION PPT
VaishaliSrigadhi26.3K visualizações
IRJET - Fake News Detection using Machine Learning por IRJET Journal
IRJET -  	  Fake News Detection using Machine LearningIRJET -  	  Fake News Detection using Machine Learning
IRJET - Fake News Detection using Machine Learning
IRJET Journal134 visualizações
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ... por IRJET Journal
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...
IRJET- Big Data Driven Information Diffusion Analytics and Control on Social ...
IRJET Journal41 visualizações
IRJET - Real-Time Cyberbullying Analysis on Social Media using Machine Learni... por IRJET Journal
IRJET - Real-Time Cyberbullying Analysis on Social Media using Machine Learni...IRJET - Real-Time Cyberbullying Analysis on Social Media using Machine Learni...
IRJET - Real-Time Cyberbullying Analysis on Social Media using Machine Learni...
IRJET Journal52 visualizações
INFORMATION RETRIEVAL TOPICS IN TWITTER USING WEIGHTED PREDICTION NETWORK por IAEME Publication
INFORMATION RETRIEVAL TOPICS IN TWITTER USING WEIGHTED PREDICTION NETWORKINFORMATION RETRIEVAL TOPICS IN TWITTER USING WEIGHTED PREDICTION NETWORK
INFORMATION RETRIEVAL TOPICS IN TWITTER USING WEIGHTED PREDICTION NETWORK
IAEME Publication113 visualizações
IRJET- Fake News Detection por IRJET Journal
IRJET- Fake News DetectionIRJET- Fake News Detection
IRJET- Fake News Detection
IRJET Journal278 visualizações
A Paper on Web Data Segmentation for Terrorism Detection using Named Entity R... por IRJET Journal
A Paper on Web Data Segmentation for Terrorism Detection using Named Entity R...A Paper on Web Data Segmentation for Terrorism Detection using Named Entity R...
A Paper on Web Data Segmentation for Terrorism Detection using Named Entity R...
IRJET Journal42 visualizações
IRJET - Suicidal Text Detection using Machine Learning por IRJET Journal
IRJET -  	  Suicidal Text Detection using Machine LearningIRJET -  	  Suicidal Text Detection using Machine Learning
IRJET - Suicidal Text Detection using Machine Learning
IRJET Journal19 visualizações
IRJET - Detecting Spiteful Accounts in Social Network por IRJET Journal
IRJET - Detecting Spiteful Accounts in Social NetworkIRJET - Detecting Spiteful Accounts in Social Network
IRJET - Detecting Spiteful Accounts in Social Network
IRJET Journal21 visualizações
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING por ijnlc
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MININGFAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING
FAKE NEWS DETECTION WITH SEMANTIC FEATURES AND TEXT MINING
ijnlc45 visualizações
Seminar Report Mine por sachin narang
Seminar Report MineSeminar Report Mine
Seminar Report Mine
sachin narang2.8K visualizações
Social networks protection against fake profiles and social bots attacks por Aboul Ella Hassanien
Social networks protection against  fake profiles and social bots attacksSocial networks protection against  fake profiles and social bots attacks
Social networks protection against fake profiles and social bots attacks
Aboul Ella Hassanien1.2K visualizações
Tweet segmentation and its application to named entity recognition por Shakas Technologies
Tweet segmentation and its application to named entity recognitionTweet segmentation and its application to named entity recognition
Tweet segmentation and its application to named entity recognition
Shakas Technologies613 visualizações

Similar a IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Reinforced Concrete

IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec por
IRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET Journal
40 visualizações5 slides
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag por
 IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET - Implementation of Twitter Sentimental Analysis According to Hash TagIRJET Journal
14 visualizações5 slides
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK por
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORKDETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORKIRJET Journal
91 visualizações7 slides
F017433947 por
F017433947F017433947
F017433947IOSR Journals
153 visualizações9 slides
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY por
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYFRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYcscpconf
63 visualizações20 slides
IRJET - Detection of Drug Abuse using Social Media Mining por
IRJET - Detection of Drug Abuse using Social Media MiningIRJET - Detection of Drug Abuse using Social Media Mining
IRJET - Detection of Drug Abuse using Social Media MiningIRJET Journal
12 visualizações4 slides

Similar a IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Reinforced Concrete(20)

IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec por IRJET Journal
IRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2VecIRJET-  	  Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET- Improved Real-Time Twitter Sentiment Analysis using ML & Word2Vec
IRJET Journal40 visualizações
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag por IRJET Journal
 IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET Journal14 visualizações
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK por IRJET Journal
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORKDETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK
DETECTION OF MALICIOUS SOCIAL BOTS USING ML TECHNIQUE IN TWITTER NETWORK
IRJET Journal91 visualizações
F017433947 por IOSR Journals
F017433947F017433947
F017433947
IOSR Journals153 visualizações
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY por cscpconf
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITYFRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
FRAMEWORK FOR ANALYZING TWITTER TO DETECT COMMUNITY SUSPICIOUS CRIME ACTIVITY
cscpconf63 visualizações
IRJET - Detection of Drug Abuse using Social Media Mining por IRJET Journal
IRJET - Detection of Drug Abuse using Social Media MiningIRJET - Detection of Drug Abuse using Social Media Mining
IRJET - Detection of Drug Abuse using Social Media Mining
IRJET Journal12 visualizações
Dynamic feature selection for spam detection (1).pptx por RivikaJain
Dynamic feature selection for spam detection (1).pptxDynamic feature selection for spam detection (1).pptx
Dynamic feature selection for spam detection (1).pptx
RivikaJain21 visualizações
IRJET- An Improved Machine Learning for Twitter Breaking News Extraction ... por IRJET Journal
IRJET-  	  An Improved Machine Learning for Twitter Breaking News Extraction ...IRJET-  	  An Improved Machine Learning for Twitter Breaking News Extraction ...
IRJET- An Improved Machine Learning for Twitter Breaking News Extraction ...
IRJET Journal33 visualizações
IRJET- Identification of Prevalent News from Twitter and Traditional Media us... por IRJET Journal
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
IRJET- Identification of Prevalent News from Twitter and Traditional Media us...
IRJET Journal16 visualizações
BUS 625 Week 4 Response to Discussion 2Guided Response Your.docx por jasoninnes20
BUS 625 Week 4 Response to Discussion 2Guided Response Your.docxBUS 625 Week 4 Response to Discussion 2Guided Response Your.docx
BUS 625 Week 4 Response to Discussion 2Guided Response Your.docx
jasoninnes202 visualizações
BUS 625 Week 4 Response to Discussion 2Guided Response Your.docx por curwenmichaela
BUS 625 Week 4 Response to Discussion 2Guided Response Your.docxBUS 625 Week 4 Response to Discussion 2Guided Response Your.docx
BUS 625 Week 4 Response to Discussion 2Guided Response Your.docx
curwenmichaela2 visualizações
A REVIEW ON MACHINE LEARNING TECHNIQUES ON SOCIAL MEDIA DATA FOR POLICY MAKING por Deja Lewis
A REVIEW ON MACHINE LEARNING TECHNIQUES ON SOCIAL MEDIA DATA FOR POLICY MAKINGA REVIEW ON MACHINE LEARNING TECHNIQUES ON SOCIAL MEDIA DATA FOR POLICY MAKING
A REVIEW ON MACHINE LEARNING TECHNIQUES ON SOCIAL MEDIA DATA FOR POLICY MAKING
Deja Lewis2 visualizações
Twitter_Hashtag_Prediction.pptx por SayaliKawale2
Twitter_Hashtag_Prediction.pptxTwitter_Hashtag_Prediction.pptx
Twitter_Hashtag_Prediction.pptx
SayaliKawale26 visualizações
A Baseline Based Deep Learning Approach of Live Tweets por ijtsrd
A Baseline Based Deep Learning Approach of Live TweetsA Baseline Based Deep Learning Approach of Live Tweets
A Baseline Based Deep Learning Approach of Live Tweets
ijtsrd81 visualizações
IRJET- Review Analyser with Bot por IRJET Journal
IRJET- Review Analyser with BotIRJET- Review Analyser with Bot
IRJET- Review Analyser with Bot
IRJET Journal35 visualizações
E017433538 por IOSR Journals
E017433538E017433538
E017433538
IOSR Journals109 visualizações
IRJET- Event Detection and Text Summary by Disaster Warning por IRJET Journal
IRJET- Event Detection and Text Summary by Disaster WarningIRJET- Event Detection and Text Summary by Disaster Warning
IRJET- Event Detection and Text Summary by Disaster Warning
IRJET Journal24 visualizações
IRJET - Fake News Detection: A Survey por IRJET Journal
IRJET -  	  Fake News Detection: A SurveyIRJET -  	  Fake News Detection: A Survey
IRJET - Fake News Detection: A Survey
IRJET Journal60 visualizações

Mais de IRJET Journal

SOIL STABILIZATION USING WASTE FIBER MATERIAL por
SOIL STABILIZATION USING WASTE FIBER MATERIALSOIL STABILIZATION USING WASTE FIBER MATERIAL
SOIL STABILIZATION USING WASTE FIBER MATERIALIRJET Journal
20 visualizações7 slides
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles... por
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...IRJET Journal
8 visualizações7 slides
Identification, Discrimination and Classification of Cotton Crop by Using Mul... por
Identification, Discrimination and Classification of Cotton Crop by Using Mul...Identification, Discrimination and Classification of Cotton Crop by Using Mul...
Identification, Discrimination and Classification of Cotton Crop by Using Mul...IRJET Journal
8 visualizações5 slides
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula... por
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...IRJET Journal
13 visualizações11 slides
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR... por
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...IRJET Journal
14 visualizações6 slides
Performance Analysis of Aerodynamic Design for Wind Turbine Blade por
Performance Analysis of Aerodynamic Design for Wind Turbine BladePerformance Analysis of Aerodynamic Design for Wind Turbine Blade
Performance Analysis of Aerodynamic Design for Wind Turbine BladeIRJET Journal
7 visualizações5 slides

Mais de IRJET Journal(20)

SOIL STABILIZATION USING WASTE FIBER MATERIAL por IRJET Journal
SOIL STABILIZATION USING WASTE FIBER MATERIALSOIL STABILIZATION USING WASTE FIBER MATERIAL
SOIL STABILIZATION USING WASTE FIBER MATERIAL
IRJET Journal20 visualizações
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles... por IRJET Journal
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...
Sol-gel auto-combustion produced gamma irradiated Ni1-xCdxFe2O4 nanoparticles...
IRJET Journal8 visualizações
Identification, Discrimination and Classification of Cotton Crop by Using Mul... por IRJET Journal
Identification, Discrimination and Classification of Cotton Crop by Using Mul...Identification, Discrimination and Classification of Cotton Crop by Using Mul...
Identification, Discrimination and Classification of Cotton Crop by Using Mul...
IRJET Journal8 visualizações
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula... por IRJET Journal
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...
“Analysis of GDP, Unemployment and Inflation rates using mathematical formula...
IRJET Journal13 visualizações
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR... por IRJET Journal
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...
MAXIMUM POWER POINT TRACKING BASED PHOTO VOLTAIC SYSTEM FOR SMART GRID INTEGR...
IRJET Journal14 visualizações
Performance Analysis of Aerodynamic Design for Wind Turbine Blade por IRJET Journal
Performance Analysis of Aerodynamic Design for Wind Turbine BladePerformance Analysis of Aerodynamic Design for Wind Turbine Blade
Performance Analysis of Aerodynamic Design for Wind Turbine Blade
IRJET Journal7 visualizações
Heart Failure Prediction using Different Machine Learning Techniques por IRJET Journal
Heart Failure Prediction using Different Machine Learning TechniquesHeart Failure Prediction using Different Machine Learning Techniques
Heart Failure Prediction using Different Machine Learning Techniques
IRJET Journal7 visualizações
Experimental Investigation of Solar Hot Case Based on Photovoltaic Panel por IRJET Journal
Experimental Investigation of Solar Hot Case Based on Photovoltaic PanelExperimental Investigation of Solar Hot Case Based on Photovoltaic Panel
Experimental Investigation of Solar Hot Case Based on Photovoltaic Panel
IRJET Journal3 visualizações
Metro Development and Pedestrian Concerns por IRJET Journal
Metro Development and Pedestrian ConcernsMetro Development and Pedestrian Concerns
Metro Development and Pedestrian Concerns
IRJET Journal2 visualizações
Mapping the Crashworthiness Domains: Investigations Based on Scientometric An... por IRJET Journal
Mapping the Crashworthiness Domains: Investigations Based on Scientometric An...Mapping the Crashworthiness Domains: Investigations Based on Scientometric An...
Mapping the Crashworthiness Domains: Investigations Based on Scientometric An...
IRJET Journal3 visualizações
Data Analytics and Artificial Intelligence in Healthcare Industry por IRJET Journal
Data Analytics and Artificial Intelligence in Healthcare IndustryData Analytics and Artificial Intelligence in Healthcare Industry
Data Analytics and Artificial Intelligence in Healthcare Industry
IRJET Journal3 visualizações
DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC... por IRJET Journal
DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC...DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC...
DESIGN AND SIMULATION OF SOLAR BASED FAST CHARGING STATION FOR ELECTRIC VEHIC...
IRJET Journal52 visualizações
Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur... por IRJET Journal
Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur...Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur...
Efficient Design for Multi-story Building Using Pre-Fabricated Steel Structur...
IRJET Journal10 visualizações
Development of Effective Tomato Package for Post-Harvest Preservation por IRJET Journal
Development of Effective Tomato Package for Post-Harvest PreservationDevelopment of Effective Tomato Package for Post-Harvest Preservation
Development of Effective Tomato Package for Post-Harvest Preservation
IRJET Journal4 visualizações
“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION” por IRJET Journal
“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION”“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION”
“DYNAMIC ANALYSIS OF GRAVITY RETAINING WALL WITH SOIL STRUCTURE INTERACTION”
IRJET Journal5 visualizações
Understanding the Nature of Consciousness with AI por IRJET Journal
Understanding the Nature of Consciousness with AIUnderstanding the Nature of Consciousness with AI
Understanding the Nature of Consciousness with AI
IRJET Journal12 visualizações
Augmented Reality App for Location based Exploration at JNTUK Kakinada por IRJET Journal
Augmented Reality App for Location based Exploration at JNTUK KakinadaAugmented Reality App for Location based Exploration at JNTUK Kakinada
Augmented Reality App for Location based Exploration at JNTUK Kakinada
IRJET Journal6 visualizações
Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba... por IRJET Journal
Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba...Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba...
Smart Traffic Congestion Control System: Leveraging Machine Learning for Urba...
IRJET Journal18 visualizações
Enhancing Real Time Communication and Efficiency With Websocket por IRJET Journal
Enhancing Real Time Communication and Efficiency With WebsocketEnhancing Real Time Communication and Efficiency With Websocket
Enhancing Real Time Communication and Efficiency With Websocket
IRJET Journal5 visualizações
Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ... por IRJET Journal
Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ...Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ...
Textile Industrial Wastewater Treatability Studies by Soil Aquifer Treatment ...
IRJET Journal4 visualizações

Último

Effect of deep chemical mixing columns on properties of surrounding soft clay... por
Effect of deep chemical mixing columns on properties of surrounding soft clay...Effect of deep chemical mixing columns on properties of surrounding soft clay...
Effect of deep chemical mixing columns on properties of surrounding soft clay...AltinKaradagli
6 visualizações10 slides
Update 42 models(Diode/General ) in SPICE PARK(DEC2023) por
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)Update 42 models(Diode/General ) in SPICE PARK(DEC2023)
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)Tsuyoshi Horigome
28 visualizações16 slides
MSA Website Slideshow (16).pdf por
MSA Website Slideshow (16).pdfMSA Website Slideshow (16).pdf
MSA Website Slideshow (16).pdfmsaucla
68 visualizações8 slides
Control Systems Feedback.pdf por
Control Systems Feedback.pdfControl Systems Feedback.pdf
Control Systems Feedback.pdfLGGaming5
6 visualizações39 slides
Pull down shoulder press final report docx (1).pdf por
Pull down shoulder press final report docx (1).pdfPull down shoulder press final report docx (1).pdf
Pull down shoulder press final report docx (1).pdfComsat Universal Islamabad Wah Campus
13 visualizações25 slides
Generative AI Models & Their Applications por
Generative AI Models & Their ApplicationsGenerative AI Models & Their Applications
Generative AI Models & Their ApplicationsSN
8 visualizações1 slide

Último(20)

Effect of deep chemical mixing columns on properties of surrounding soft clay... por AltinKaradagli
Effect of deep chemical mixing columns on properties of surrounding soft clay...Effect of deep chemical mixing columns on properties of surrounding soft clay...
Effect of deep chemical mixing columns on properties of surrounding soft clay...
AltinKaradagli6 visualizações
Update 42 models(Diode/General ) in SPICE PARK(DEC2023) por Tsuyoshi Horigome
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)Update 42 models(Diode/General ) in SPICE PARK(DEC2023)
Update 42 models(Diode/General ) in SPICE PARK(DEC2023)
Tsuyoshi Horigome28 visualizações
MSA Website Slideshow (16).pdf por msaucla
MSA Website Slideshow (16).pdfMSA Website Slideshow (16).pdf
MSA Website Slideshow (16).pdf
msaucla68 visualizações
Control Systems Feedback.pdf por LGGaming5
Control Systems Feedback.pdfControl Systems Feedback.pdf
Control Systems Feedback.pdf
LGGaming56 visualizações
Generative AI Models & Their Applications por SN
Generative AI Models & Their ApplicationsGenerative AI Models & Their Applications
Generative AI Models & Their Applications
SN8 visualizações
Instrumentation & Control Lab Manual.pdf por NTU Faisalabad
Instrumentation & Control Lab Manual.pdfInstrumentation & Control Lab Manual.pdf
Instrumentation & Control Lab Manual.pdf
NTU Faisalabad 5 visualizações
Introduction to CAD-CAM.pptx por suyogpatil49
Introduction to CAD-CAM.pptxIntroduction to CAD-CAM.pptx
Introduction to CAD-CAM.pptx
suyogpatil495 visualizações
_MAKRIADI-FOTEINI_diploma thesis.pptx por fotinimakriadi
_MAKRIADI-FOTEINI_diploma thesis.pptx_MAKRIADI-FOTEINI_diploma thesis.pptx
_MAKRIADI-FOTEINI_diploma thesis.pptx
fotinimakriadi8 visualizações
sam_software_eng_cv.pdf por sammyigbinovia
sam_software_eng_cv.pdfsam_software_eng_cv.pdf
sam_software_eng_cv.pdf
sammyigbinovia5 visualizações
START Newsletter 3 por Start Project
START Newsletter 3START Newsletter 3
START Newsletter 3
Start Project5 visualizações
Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ... por AltinKaradagli
Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ...Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ...
Investigation of Physicochemical Changes of Soft Clay around Deep Geopolymer ...
AltinKaradagli9 visualizações
fakenews_DBDA_Mar23.pptx por deepmitra8
fakenews_DBDA_Mar23.pptxfakenews_DBDA_Mar23.pptx
fakenews_DBDA_Mar23.pptx
deepmitra814 visualizações
What is Unit Testing por Sadaaki Emura
What is Unit TestingWhat is Unit Testing
What is Unit Testing
Sadaaki Emura23 visualizações
DevOps-ITverse-2023-IIT-DU.pptx por Anowar Hossain
DevOps-ITverse-2023-IIT-DU.pptxDevOps-ITverse-2023-IIT-DU.pptx
DevOps-ITverse-2023-IIT-DU.pptx
Anowar Hossain9 visualizações
Design_Discover_Develop_Campaign.pptx por ShivanshSeth6
Design_Discover_Develop_Campaign.pptxDesign_Discover_Develop_Campaign.pptx
Design_Discover_Develop_Campaign.pptx
ShivanshSeth614 visualizações
Design of machine elements-UNIT 3.pptx por gopinathcreddy
Design of machine elements-UNIT 3.pptxDesign of machine elements-UNIT 3.pptx
Design of machine elements-UNIT 3.pptx
gopinathcreddy32 visualizações
CHEMICAL KINETICS.pdf por AguedaGutirrez
CHEMICAL KINETICS.pdfCHEMICAL KINETICS.pdf
CHEMICAL KINETICS.pdf
AguedaGutirrez12 visualizações
GDSC Mikroskil Members Onboarding 2023.pdf por gdscmikroskil
GDSC Mikroskil Members Onboarding 2023.pdfGDSC Mikroskil Members Onboarding 2023.pdf
GDSC Mikroskil Members Onboarding 2023.pdf
gdscmikroskil51 visualizações
Searching in Data Structure por raghavbirla63
Searching in Data StructureSearching in Data Structure
Searching in Data Structure
raghavbirla637 visualizações

IRJET- An Experimental Evaluation of Mechanical Properties of Bamboo Fiber Reinforced Concrete

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 08 | Aug 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 781 Tweet Segmentation and Its Application to Named Entity Recognition Ramya 1Ramya , Address: Thanjavur 2Professor: Mrs. Hemalatha, M.Sc., M.Phil., M.C.A., Dept. of Computer Science, Shrimathi Indira Gandhi College, Tamil Nadu, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Twitter has attracted millions of users to share and disseminate most up-to-date information, resulting in large volumes of data produced every day. However, many applications in Information Retrieval (IR) and Natural Language Processing (NLP) suffer severely from the noisyand short nature of tweets. In this project, propose a novel framework for tweet segmentation in a batch mode, called Hybrid Segmentation. By splitting tweets into meaningful segments, the semantic or context information is well preserved and easily extracted by the downstream applications. HybridSeg finds the optimal segmentation of a tweet by maximizing the sum of the stickiness scores of its candidate segments. The stickiness score considers the probability of a segment being a phrase in English (i.e., global context) and the probability of a segment being a phrase within the batch of tweets (i.e., local context). For the latter, propose and evaluate two models to derive local context by considering the linguistic features and term-dependency in a batch of tweets, respectively. HybridSeg is also designed to iteratively learn from confident segments as pseudo feedback. Experiments on two tweet data sets show that tweet segmentation quality is significantly improved by learning both global and local contexts compared with using global context alone. Through analysis and comparison, show that local linguistic features are more reliable for learning local context compared with term-dependency. As an application, show that high accuracy is achieved in named entity recognition by applying segment-based part-of-speech (POS) tagging. Key Words: Twitter stream, tweet segmentation,named entity recognition, linguistic processing, Wikipedia 1. INTRODUCTION Tweets are posted for information sharing and communication. The named entities and semantic phrases are well preserved in tweets. The method realizing the proposed framework that solely relies on global context is denoted by HybridSeg. Tweets are highly time-sensitive.The well preserved linguistic features in these tweets facilitate named entity recognition with high accuracy. Each named entity is a valid segment. The method utilizinglocallinguistic features is denoted by HybridSeg NER. It obtains confident segments based on the voting resultsofmultipleoff-the-shelf NER tools. Another method utilizing local collocation knowledge, denoted by HybridSeg N-Gram, is proposed based on the observation that many tweets publishedwithin a short time period are about the same topic. HybridSeg N- Gram segments tweets by estimating the term-dependency within a batch of tweets. To improve POS tagging on tweets, Ritter et al. train a POS tagger by using CRF model with conventional and tweet-specific features. Even thoughshort text strings might be a problem, sentiment analysis within micro blogging hasshown that Twitter canbeseenas a valid online indicator of political sentiment. A lot of advertising and branding is now tied to Twitter. Furthermore, and most importantly, the first place one goes to find any breaking news is to search it on Twitter. The focus of this project is clustering with the TF-IDF weighted mechanism of daily technology news tweets of prominent bloggers and news sites using Apache Mahout and to evaluate the effectsof introducing andremovinganonymous wordson the quality of clustering. This projectrestrictsitself to only tweets in the English language. 2. Existing System Nowadays people are using so many social networking sitesare. But the people using allsitesforupdate statesand sharing photos, videosand so on. There is noalert for bad weather situation, earthquakes and so on. The problem is that lot of time consumed to the people for checking the message submitted in the social networking and file a case in cyber crime. After verification cyber crime defense commission will remove the post according to the cyber crime rules in a span of time. This will not solve the problem only to remove the thingsafter the issuearises. Due to ignorance and lack of understanding of Social Media privacy feature, people make many mistakes. Another situation to consider has to do with the availability of information too personal, whether in pictures or text. You cannot give out too much personal information. As throughout the Internet. The User can send their post with some illegal words and letters. 2.1 Survey A. User tracking using tweet segmentation and word Many organizations have been reported to create and monitoring targeted Twitter streams to collect a bunch of information and understand according to user's view. Targeted Twitter stream is main usually constructed by filtering tweets and that abused words with predefined selection criteria. Due to its invaluable business value of timely information from these tweets, it's a necessary to understand that abused word's language for a large body of downstream applications, such as named entity recognition (NER), event detecting and summarizing that particular word, opinion mining, sentiment analysis, and etc. In these proposed system application is developed which take tweet
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 08 | Aug 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 782 is input and search semantic negative or illegal words from database. Generate report of that abusing word and send to cyber crime's site. Then depending on the tweet, track the related information of the person. Track allthetweetsofthat particular person and track that person through identification databases. Then take a action by cyber crime. And after that prevent the tweet. B. Measuring the controversy level of Arabic trending topics on Twitter Social micro-blogging systemslike Twitter are usedtodayas a platform that enables its users to write down about different topics. One important aspect of such human interactions is the existence of debateanddisagreement.The most heated debates are found on controversial topics. Detecting such topics can be very beneficial in understanding the behavior of online social networks users and the dynamics of their interactions. Such an understanding leads to better ways of handling and predicting how the "online crowds" will act. Several approaches have been proposed for detectingcontroversyin online communication. Some of them represent the interactions in the form of graphsand study their properties in order to determine whether the topic of interaction is controversial or not. Other approachesrely onthecontentof the exchanged messages. In this study, we focus on the former approach in identifying the controversy level of the trending topics on Twitter. Unlike many previous works, then do not limit ourselves to a certain domain. Moreover, the main focus on social content written in Arabic about hot events occurring in the Middle East. To the best of our knowledge, ours is the first work to undertake thisapproach in studying controversy in general topics written in Arabic. Collect a large dataset of tweets on different trending topics from different domains. First apply several approaches for controversy detection and compare their outcomes to determine which one is the most consistent measure. C. Event Detection and Key Posts Discovering in Social Media Data Streams Microblogging, as a popular social mediaservice,hasbecome a new information channel for usersto receiveandexchange the most up-to-date information on current events. Consequently, it is crucial to detect newemergingeventsand to discover the key posts which have the potentialtoactively disseminate the events through microblogging. However, traditional event detection models require human intervention for detecting the number of topics to be explored, which significantly reduces the efficiency and accuracy of event detection. Most of the existing methods focus only on event detection and are unable to discover the key posts related to the events, making itchallengingtotrack momentous events in timely manner. To tackle these problems, a HITS (Hypertext Induced Topic Search) based topic decision method, named TD-HITSisproposed.TD-HITS can automatically detect the number of topics as well as discovering associated key posts from a large number of posts. The experimental results are based on a Twitter dataset to demonstrate the effectiveness of our proposed methodsfor both detecting eventsand identifyingkeyposts. D. Tweet modeling with LSTM recurrent neural networks for hash tag recommendation The hash symbol, called a hash tag, is used to mark the keyword or topic in a tweet. It was created organically by users as a way to categorize messages. Hash tags also provide valuable information for manyresearchapplications such as sentiment classification and topic analysis.However, only a small number of tweets are manually annotated. Therefore, an automatic hashtagrecommendationmethodis needed to help userstag their new tweets.Previousmethods mostly use conventional machine learning classifierssuchas SVM or utilize collaborative filtering technique. A bottleneck of these approachesisthat they all use the TF-IDF schemeto represent tweets and ignore the semantic information in tweets. In this paper, we also regard hash tag recommendation asa classification task but propose a novel recurrent neural network model tolearn vector-basedtweet representations to recommend hashtags. More precisely,we use a skip-gram model to generate distributed word representations and then apply a convolution neural network to learn semantic sentence vectors. Afterwards,we make use of the sentence vectors to train a long short-term memory recurrent neural network (LSTM-RNN).Wedirectly use the produced tweet vectors as features to classify hashtags without any feature engineering. Experiments on real world data from Twitter to recommend hashtags show that our proposed LSTM-RNN model outperforms state-of- the-art methods and LSTM unit also obtains the best performance compared to standard RNN and gated recurrent unit (GRU). E. Exploring human emotion via Twitter Sentiment analysis or opinion mining on twitter data is an emerging topic in research. In this paper, we have described a system for emotion analysis of tweets using only the core text. Tweets are usually short, more ambiguousandcontains a huge amount of noisy data, sometimes it is difficult to understand the user's opinion. The main challenge is to feature extraction for the purpose of classification and feature extraction depends on the perfection of preprocessing of a tweet. The preprocessing is the most difficult task, since it can be done in various ways and the methods or steps applied in preprocessing are not distinct. Most of the researches in this topic, have been focused on binary (positive and negative) and 3-way (positive, negative and neutral) classifications. In this paper, we have focused on emotion classification of tweets as multi-class classification. We have chosen basic human emotions (happiness, sadness, surprise, disgust) and neutral as our emotion classes. According to the experimental results, our
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 08 | Aug 2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 783 approach improved the performance of multi-class classification of twitter data. 3. Proposed System Tweets are posted for information sharing and communication. The named entities and semantic phrases are well preserved in tweets. The global context derived from Web pages or Wikipedia therefore helps identifying the meaningful segments in tweets. The method realizing the proposed framework that solely relies on global context is denoted by HybridSegWeb. Tweets are highly time-sensitive so that many emerging phrases like “She Dancing” cannot be found in external knowledge bases. However, considering a large number of tweets published within a short time period containing the phrase, it is not difficult to recognize “She Dancing” as a valid and meaningful segment. Therefore investigate two local contexts, namely local linguistic features and local collocation. Observe that tweets from many official accounts of news agencies, organizations, and advertisers are likely well written. The well preserved linguistic features in these tweets facilitate named entity recognition with high accuracy. Each named entity is a valid segment. The method utilizing local linguistic features is denoted by HybridSegNER. It obtains confident segments based on the voting results of multiple off-the-shelf NER tools. Fig1- Sample Twitter Segmentation Text Mining Algorithm Text mining, also referred to as text data mining, roughly equivalent to text analytics, is the process of deriving high- quality information from text. High-quality information is typically derived through the devising of patternsandtrends through means such as statistical pattern learning. Text mining usually involves the process of structuring the input text (usually parsing, along with the addition of some derived linguistic features and the removal of others, and subsequent insertion into a database), deriving patterns within the structured data, and finally evaluation and interpretation of the output. 'High quality' in text mining usually refers to some combination of relevance, novelty, and interestingness. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, production of granular taxonomies, sentiment analysis, document summarization, and entityrelationmodeling.Text analysis involves information retrieval, lexical analysis to study word frequency distributions, pattern recognition, tagging/annotation, information extraction, data mining techniques including link and association analysis, visualization, and predictiveanalytics. The overarchinggoal is, essentially, to turn text into data for analysis, via application of natural language processing and analytical methods. 4. Conclusion The HybridSeg framework which segments tweets into meaningful phrases called segments using both global and local context. Through our framework, we demonstrate that local linguistic features are more reliable than term dependency in guiding the segmentation process. This finding opens opportunities for tools developed for formal text to be applied to tweets which are believed to be much more noise than formal text. Tweet segmentation helps to preserve the semantic meaning of tweets, which subsequently benefits many downstream applications, e.g. named entity recognition. Through experiments, we show that segment based named entity recognition methods achieves much better accuracy than the word-based alternative. REFERENCES 1. Twiner: Named entity recognition in targeted twitter stream- C. Li, J. Weng, Q. He, Y. Yao, A. Datta, A. Sun, and B.-S. Lee. 2. Augmenting naive bayes classifiers with statistical language models- F. Peng, D. Schuurmans, and S. Wang. 3. Topic sentiment analysis in twitter: a graph-based hash tag sentiment classification approach- X. Wang, F. Wei, X. Liu, M. Zhou, and M. Zhang. 4. Twevent: segment-based event detection from tweets- C. Li, A. Sun, and A. Datta- 5. Incorporating nonlocal information into information extraction systems by gibbs sampling- J. R. Finkel, T. Grenager, and C. Manning.