Web Mining
1.
2. Contents
The Web
Web mining
Data mining vs web mining
Why mine the web
Web mining taxonomy
Applications of web mining
Conclusion
3. The Web
The Web is a collection of inter-related files on one or
more Web servers.
Wealth of information: present everywhere.
Structure: a graph, with links between pages.
Access: hundreds of millions of requests per day.
4. Web Mining
Web mining is the use of data mining techniques to
automatically discover and extract information from Web
documents.
It covers discovering useful information from the World Wide Web
and from its usage patterns.
5. Data Mining vs Web Mining
Traditional data mining
Data is structured and relational.
Well-defined tables, columns, rows, keys, and
constraints.
Web data
Semi-structured and unstructured.
Rich in features and patterns.
6. Enormous wealth of information on the Web
Financial information
Book/CD/Video stores
Restaurant information
Car prices
Lots of data on user access patterns
Web logs contain the sequence of URLs accessed by
users
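The idea that Web logs hold the sequence of URLs each user accessed can be sketched as follows. The record layout (user id, timestamp, URL) and the sample values are assumptions made for illustration, not a format given in the slides.

```python
from collections import defaultdict

# Hypothetical log records: (user id, timestamp in seconds, URL).
records = [
    ("u1", 10, "/home"),
    ("u2", 12, "/home"),
    ("u1", 25, "/cars"),
    ("u2", 30, "/books"),
    ("u1", 40, "/cars/prices"),
]


def url_sequences(records):
    """Group URLs by user, ordered by access time."""
    by_user = defaultdict(list)
    for user, _timestamp, url in sorted(records, key=lambda r: r[1]):
        by_user[user].append(url)
    return dict(by_user)


sequences = url_sequences(records)
for user, urls in sequences.items():
    print(user, " -> ".join(urls))
```

Per-user sequences like these are the raw material for discovering access patterns.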
7. The Web is a huge collection of documents, plus
Hyperlink information
Access and usage information
The Web is very dynamic
New pages are constantly being generated
9. Web Content Mining
Web content mining is the process of mining useful
information from the contents of Web pages and Web
documents, which are mostly text, images, and
audio/video files.
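As a minimal sketch of content mining on the text part of a page, the example below strips markup with Python's standard html.parser module and counts terms. The sample page, the TextExtractor class, and the term_counts helper are hypothetical names chosen for illustration; real content mining would add steps such as stop-word removal.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.chunks.append(data)


def term_counts(html):
    """Return a Counter of lowercase alphabetic terms in the page text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    words = re.findall(r"[a-z]+", " ".join(parser.chunks).lower())
    return Counter(words)


# Hypothetical sample page.
page = (
    "<html><head><style>p{color:red}</style></head><body>"
    "<h1>Car prices</h1><p>Used car prices and new car prices.</p>"
    "</body></html>"
)
counts = term_counts(page)
print(counts.most_common(2))
```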
10. Web structure mining
Web structure mining is the process of
discovering structure information from the
Web.
This type of mining can be performed
either at the document level or at the
hyperlink level.
11. Web structure mining can be divided into two
kinds:
1. Hyperlinks: A hyperlink is a structural unit
that connects a location in a Web page to a
different location, either within the same Web
page or on a different Web page.
2. Document structure: The content within a
Web page can also be organized in a tree-structured
format, based on the various
HTML and XML tags within the page.
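Both kinds of structure can be read from a single pass over a page. The sketch below, again using the standard html.parser module, collects hyperlinks and tracks how deeply tags nest. The StructureParser class, the VOID_TAGS set, and the sample page are assumptions for illustration, and the depth count assumes reasonably well-formed HTML.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag; they do not add a tree level.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class StructureParser(HTMLParser):
    """Collect hyperlinks and the maximum tag nesting depth of a page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.depth = 0
        self.max_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)
        if tag not in VOID_TAGS:
            self.depth += 1
            self.max_depth = max(self.max_depth, self.depth)

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.depth > 0:
            self.depth -= 1


# Hypothetical sample page with one same-page and one cross-page link.
page = (
    '<html><body><h1 id="top">Title</h1>'
    '<p>See <a href="#top">top</a> or '
    '<a href="http://example.com/page">another page</a>.<br></p>'
    "</body></html>"
)
parser = StructureParser()
parser.feed(page)
parser.close()

same_page = [h for h in parser.links if h.startswith("#")]
other_page = [h for h in parser.links if not h.startswith("#")]
print("same-page links:", same_page)
print("other-page links:", other_page)
print("maximum tag depth:", parser.max_depth)
```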
12. Web usage mining
Web usage mining is the application of data
mining techniques to discover interesting usage
patterns from Web data.
Usage data captures the identity or origin of Web
users along with their browsing behavior at a Web
site.
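Usage data of this kind is often stored as Web server access log lines. The sketch below assumes the widely used Common Log Format, which the slides do not name; the sample line and the parse_log_line helper are hypothetical.

```python
import re

# Assumed layout (Common Log Format):
# host ident authuser [date] "method url protocol" status bytes
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) \S+" (?P<status>\d{3}) (?P<size>\S+)'
)


def parse_log_line(line):
    """Return the fields of one log line as a dict, or None if it does not match."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None
    entry = match.groupdict()
    entry["status"] = int(entry["status"])
    return entry


# Hypothetical sample line.
line = (
    '192.0.2.1 - - [10/Oct/2000:13:55:36 -0700] '
    '"GET /index.html HTTP/1.0" 200 2326'
)
entry = parse_log_line(line)
print(entry["ip"], entry["url"], entry["status"])
```

The IP address gives the origin of the user, and the requested URLs, ordered by time, describe the browsing behavior.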
13. Web usage mining itself can be classified
further depending on the kind of usage data
considered:
Web Server Data: The user logs are
collected by the Web server. Typical data
includes IP address
14. Application Server Data: Commercial
application servers have significant features
to enable e-commerce applications to be
built on top of them with little effort.
Application Level Data: New kinds of
events can be defined in an application, and
logging can be turned on for them, thus
generating histories of these specially
defined events.
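Application-level logging can be pictured with a small sketch in which an application defines its own event types and records only those with logging turned on. The EventLog class and the event names are hypothetical; they do not correspond to any particular application server.

```python
class EventLog:
    """Record application-defined events whose logging is turned on."""

    def __init__(self):
        self.enabled = set()
        self.history = []

    def enable(self, event_type):
        """Turn logging on for one event type."""
        self.enabled.add(event_type)

    def record(self, event_type, user, **details):
        """Append the event to the history if its type is enabled."""
        if event_type in self.enabled:
            self.history.append({"event": event_type, "user": user, **details})


log = EventLog()
log.enable("add_to_cart")  # hypothetical event defined by the application

log.record("add_to_cart", "u1", item="book-42")
log.record("search", "u1", query="car prices")  # not enabled, so not stored
log.record("add_to_cart", "u2", item="cd-7")

for event in log.history:
    print(event)
```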
16. Conclusion
The Web and its usage continue to grow.
The past five years have seen the emergence of
Web mining as a rapidly growing area, due to the
efforts of the research community as well as various
organizations that are practicing it.