SlideShare uma empresa Scribd logo
1 de 31
Recommendation
Engine
Outlines
 Introduction
 Objectives
 Scope
 Problem with existing system
 Purpose of new system
 Proposed architecture
 Technologies to be used
 Modules of system
 Integration of technologies
 Implementation Issues to be solved
 Application
 Future Enhancement
Objectives
 Information Filtering System
 Recommendation engine recommends
- User based
- Item based
- Slop based
 Run On Cloud Environment
Introduction
 Engine - Gives Suggestion Based on
movies,songs,videos,websites,books,images and also
social elements.
 Applicable for E-business.
 Useful for both Customers and online Retailers
 Recommendation engine is being used at
Amazon, Youtube, Facebook,Twitter
Scope
 Our system will only provide Recommendation service
only.
 Recommendation will be genrated based on user’s
historical activity like purchase pattern as well as
rating and like.
 Recommendation will be either stored on database
,file or directly retrieved to retailers web application.
Problems with existing System
 Take more Time to generate recommendations
 No real time recommendation for large data
Purpose of new System
 Less time for generating recommendations
 Applicable for Bigdata
 Recommendations be several algorithms
 User based
 Item based
 Slop based
 Association rule mining
 Evaluation of recommendation
Recommendations-Type
 User Based Recommendation
Recommendations-Type
 Item Based Recommendation
Proposed System Architecture
Technologies to be used
 Hadoop
 Mahout
 Graphlab
 Google prediction
 Google Storage
 Google App engine
Modules of System
 User Module
 Admin Module
 Recommendation Module
 File management Module
 Search Module
Integration of Technologies
 Mahout based Recommendation
 Graph based Recommendation
 Google prediction Based Recommendation
Technology: HADOOP
 Hadoop is a top-level Apache project being built
and used by a global community of contributors.
 Hadoop project develops open-source software for
reliable, scalable, distributed computing.
 It enables applications to work with thousands of
nodes and peta bytes of data.
 Hadoop also support Map/Reduce Algorithm.
 It provides HDFS file system that stores data on
the compute nodes.
Hadoop
Graphlab
 It is New Parallel Framework for Machine
Learning Algorithm .
 Now a day ,Designing and implementing efficient
and correct parallel machine learning (ML)
algorithms can be very challenging.
 Designed specifically for ML needs
 Automatic data synchronization.
 Map phase like – Update Function .
 Reduce phase like – Sync Operation .
17
Data Graph
Shared Data Table
Scheduling
Update Functions and
Scopes
GraphLab
Model
CPU 1 CPU 2 CPU 3 CPU 4
MapReduce – Map Phase
18
Embarrassingly Parallel independent computation
1
2
.
9
4
2
.
3
2
1
.
3
2
5
.
8
No Communication needed
CPU 1 CPU 2 CPU 3 CPU 4
MapReduce – Map Phase
19
Embarrassingly Parallel independent computation
1
2
.
9
4
2
.
3
2
1
.
3
2
5
.
8
2
4
.
1
8
4
.
3
1
8
.
4
8
4
.
4
No Communication needed
CPU 1 CPU 2
MapReduce – Reduce Phase
20
1
2
.
9
4
2
.
3
2
1
.
3
2
5
.
8
2
4
.
1
8
4
.
3
1
8
.
4
8
4
.
4
1
7
.
5
6
7
.
5
1
4
.
9
3
4
.
3
22
26
.
26
17
26
.
31
Fold/Aggregation
Graphlab in Recommendation
 Graphlab provide better way in recommendation
engine.
 Its just first load fits simple dataset file.
 In graphlab we can also implement various algortihm
like k-means clustering ,fuzzy logic, pagerank and etc.
 Its first translated dataset into Matrix form.
 And then according to different algorithm it
generated recommendated output.
Google Prediction Service
 Google cloud service used for Building smart
Application.
 Having Machine learning Algorithms.
 Related to Artificial Intelligence.
Google Prediction Service
 Google Prediction API :
 Set of Methods for Data Analysis.
 Libraries support multiple languages.
 Google App Engine :
 Enable Application to Cloud environment Application
server
 Google Cloud Storage :
 Enable Data to store on Google Cloud database.
Google Prediction Service
Technology : MAHOUT
• Apache Mahout is open source project by the Apache
Software Foundation (ASF).
• The primary goal of Mahout is creating scalable
machine-learning algorithms.
• Several Map-Reduce in Mahout enabled clustering
implementations, including k-Means, fuzzy k-Means,
Canopy, Dirichlet, and Mean-Shift.
• Mahout have fix datasets which generally take as data
input.
• Amzon EC2 are working with Hadoop and Mahout.
Implementation Issues to solved
 Lack of knowledge about hadoop,mahout,hive
 Memory issue
 Operating system support
 Load Balancing
 Configuration
 Data normalization
 Developing Clustering algorithm
 Configuring mahout with hadoop
Application of recommendation
 Yahoo!
 Facebook
 Twitter
 Baidu
 eBay
 LinkedIn
 New York Times
 Rackspace
 eHarmony
 Powerset
Recommendation
Engine
Future enhancement
 Integration with Web Application like Jsp , Servlet
 Integration with Database like
Hive, Hbase, Mongodb, Couch db
 Cloud based recommendation Service
 Integration of Mahout , Graphlab and Google prediction
based recommendation services.
 Mobile application integration
Thank You
Recommendation engine
Recommendation engine

Mais conteúdo relacionado

Mais procurados

UX Academy 18th 롯데온 UX/UI 개선 프로젝트
UX Academy 18th  롯데온 UX/UI 개선 프로젝트UX Academy 18th  롯데온 UX/UI 개선 프로젝트
UX Academy 18th 롯데온 UX/UI 개선 프로젝트
RightBrain inc.
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
NYC Predictive Analytics
 
자동차 업계를 중심으로 한 Omni Channel 트렌드
자동차 업계를 중심으로 한 Omni Channel 트렌드자동차 업계를 중심으로 한 Omni Channel 트렌드
자동차 업계를 중심으로 한 Omni Channel 트렌드
RightBrain inc.
 

Mais procurados (20)

Website Personalization by WebEngage
Website Personalization by WebEngageWebsite Personalization by WebEngage
Website Personalization by WebEngage
 
CRM Engagement Strategy With Data Driven Optimisation.
CRM Engagement Strategy With Data Driven Optimisation. CRM Engagement Strategy With Data Driven Optimisation.
CRM Engagement Strategy With Data Driven Optimisation.
 
Personalizing the Customer Experience with a Customer Data Platform Master Cl...
Personalizing the Customer Experience with a Customer Data Platform Master Cl...Personalizing the Customer Experience with a Customer Data Platform Master Cl...
Personalizing the Customer Experience with a Customer Data Platform Master Cl...
 
The Role of CDP in Data-Driven Marketing
The Role of CDP in Data-Driven MarketingThe Role of CDP in Data-Driven Marketing
The Role of CDP in Data-Driven Marketing
 
Case Study : How CleverTap helped BookMyShow increase User Retention
Case Study : How CleverTap helped BookMyShow increase User RetentionCase Study : How CleverTap helped BookMyShow increase User Retention
Case Study : How CleverTap helped BookMyShow increase User Retention
 
Gain Competitive Advantage With Personalization
Gain Competitive Advantage With PersonalizationGain Competitive Advantage With Personalization
Gain Competitive Advantage With Personalization
 
Shopify
ShopifyShopify
Shopify
 
Ir 발표 가이드라인_writing a business plan
Ir 발표 가이드라인_writing a business planIr 발표 가이드라인_writing a business plan
Ir 발표 가이드라인_writing a business plan
 
Group 3 slide presentation
Group 3 slide presentationGroup 3 slide presentation
Group 3 slide presentation
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
How to build a recommender system?
How to build a recommender system?How to build a recommender system?
How to build a recommender system?
 
How to Improve Customer Loyalty in Retail
How to Improve Customer Loyalty in RetailHow to Improve Customer Loyalty in Retail
How to Improve Customer Loyalty in Retail
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
 
Workshop: Make the Most of Customer Data Platforms - David Raab
Workshop: Make the Most of Customer Data Platforms - David RaabWorkshop: Make the Most of Customer Data Platforms - David Raab
Workshop: Make the Most of Customer Data Platforms - David Raab
 
Progressive Web Apps
Progressive Web AppsProgressive Web Apps
Progressive Web Apps
 
[Rightbrain UX Academy] Megabox UX/UI개선
[Rightbrain UX Academy] Megabox UX/UI개선 [Rightbrain UX Academy] Megabox UX/UI개선
[Rightbrain UX Academy] Megabox UX/UI개선
 
UX Academy 18th 롯데온 UX/UI 개선 프로젝트
UX Academy 18th  롯데온 UX/UI 개선 프로젝트UX Academy 18th  롯데온 UX/UI 개선 프로젝트
UX Academy 18th 롯데온 UX/UI 개선 프로젝트
 
Subscription Business Model
Subscription Business ModelSubscription Business Model
Subscription Business Model
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
 
자동차 업계를 중심으로 한 Omni Channel 트렌드
자동차 업계를 중심으로 한 Omni Channel 트렌드자동차 업계를 중심으로 한 Omni Channel 트렌드
자동차 업계를 중심으로 한 Omni Channel 트렌드
 

Semelhante a Recommendation engine

Vipul divyanshu mahout_documentation
Vipul divyanshu mahout_documentationVipul divyanshu mahout_documentation
Vipul divyanshu mahout_documentation
Vipul Divyanshu
 
Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
Yahoo Developer Network
 
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Cloudera, Inc.
 
Hadoop training kit from lcc infotech
Hadoop   training kit from lcc infotechHadoop   training kit from lcc infotech
Hadoop training kit from lcc infotech
lccinfotech
 

Semelhante a Recommendation engine (20)

Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...Big data: Descoberta de conhecimento em ambientes de big data e computação na...
Big data: Descoberta de conhecimento em ambientes de big data e computação na...
 
Architecting the Future of Big Data and Search
Architecting the Future of Big Data and SearchArchitecting the Future of Big Data and Search
Architecting the Future of Big Data and Search
 
Vipul divyanshu mahout_documentation
Vipul divyanshu mahout_documentationVipul divyanshu mahout_documentation
Vipul divyanshu mahout_documentation
 
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
Big-Data Hadoop Tutorials - MindScripts Technologies, Pune
 
Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...Apache Hadoop India Summit 2011 talk  "Making Hadoop Enterprise Ready with Am...
Apache Hadoop India Summit 2011 talk "Making Hadoop Enterprise Ready with Am...
 
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
Hadoop World 2011: Building Web Analytics Processing on Hadoop at CBS Interac...
 
Hadoop Ecosystem at a Glance
Hadoop Ecosystem at a GlanceHadoop Ecosystem at a Glance
Hadoop Ecosystem at a Glance
 
JIT Borawan Cloud computing part 2
JIT Borawan Cloud computing part 2JIT Borawan Cloud computing part 2
JIT Borawan Cloud computing part 2
 
The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?
 
Machine Learning Hadoop
Machine Learning HadoopMachine Learning Hadoop
Machine Learning Hadoop
 
Hadoop training kit from lcc infotech
Hadoop   training kit from lcc infotechHadoop   training kit from lcc infotech
Hadoop training kit from lcc infotech
 
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
Big Data with hadoop, Spark and BigQuery (Google cloud next Extended 2017 Kar...
 
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
 
Hadoop in a Nutshell
Hadoop in a NutshellHadoop in a Nutshell
Hadoop in a Nutshell
 
Hadoop Master Class : A concise overview
Hadoop Master Class : A concise overviewHadoop Master Class : A concise overview
Hadoop Master Class : A concise overview
 
Hadoop map reduce
Hadoop map reduceHadoop map reduce
Hadoop map reduce
 
Hadoop live online training
Hadoop live online trainingHadoop live online training
Hadoop live online training
 
Big data Question bank.pdf
Big data Question bank.pdfBig data Question bank.pdf
Big data Question bank.pdf
 
Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14Hadoop_Its_Not_Just_Internal_Storage_V14
Hadoop_Its_Not_Just_Internal_Storage_V14
 
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

Recommendation engine

  • 2. Outlines  Introduction  Objectives  Scope  Problem with existing system  Purpose of new system  Proposed architecture  Technologies to be used  Modules of system  Integration of technologies  Implementation Issues to be solved  Application  Future Enhancement
  • 3. Objectives  Information Filtering System  Recommendation engine recommends - User based - Item based - Slop based  Run On Cloud Environment
  • 4. Introduction  Engine - Gives Suggestion Based on movies,songs,videos,websites,books,images and also social elements.  Applicable for E-business.  Useful for both Customers and online Retailers  Recommendation engine is being used at Amazon, Youtube, Facebook,Twitter
  • 5. Scope  Our system will only provide Recommendation service only.  Recommendation will be genrated based on user’s historical activity like purchase pattern as well as rating and like.  Recommendation will be either stored on database ,file or directly retrieved to retailers web application.
  • 6. Problems with existing System  Take more Time to generate recommendations  No real time recommendation for large data
  • 7. Purpose of new System  Less time for generating recommendations  Applicable for Bigdata  Recommendations be several algorithms  User based  Item based  Slop based  Association rule mining  Evaluation of recommendation
  • 11. Technologies to be used  Hadoop  Mahout  Graphlab  Google prediction  Google Storage  Google App engine
  • 12. Modules of System  User Module  Admin Module  Recommendation Module  File management Module  Search Module
  • 13. Integration of Technologies  Mahout based Recommendation  Graph based Recommendation  Google prediction Based Recommendation
  • 14. Technology: HADOOP  Hadoop is a top-level Apache project being built and used by a global community of contributors.  Hadoop project develops open-source software for reliable, scalable, distributed computing.  It enables applications to work with thousands of nodes and peta bytes of data.  Hadoop also support Map/Reduce Algorithm.  It provides HDFS file system that stores data on the compute nodes.
  • 16. Graphlab  It is New Parallel Framework for Machine Learning Algorithm .  Now a day ,Designing and implementing efficient and correct parallel machine learning (ML) algorithms can be very challenging.  Designed specifically for ML needs  Automatic data synchronization.  Map phase like – Update Function .  Reduce phase like – Sync Operation .
  • 17. 17 Data Graph Shared Data Table Scheduling Update Functions and Scopes GraphLab Model
  • 18. CPU 1 CPU 2 CPU 3 CPU 4 MapReduce – Map Phase 18 Embarrassingly Parallel independent computation 1 2 . 9 4 2 . 3 2 1 . 3 2 5 . 8 No Communication needed
  • 19. CPU 1 CPU 2 CPU 3 CPU 4 MapReduce – Map Phase 19 Embarrassingly Parallel independent computation 1 2 . 9 4 2 . 3 2 1 . 3 2 5 . 8 2 4 . 1 8 4 . 3 1 8 . 4 8 4 . 4 No Communication needed
  • 20. CPU 1 CPU 2 MapReduce – Reduce Phase 20 1 2 . 9 4 2 . 3 2 1 . 3 2 5 . 8 2 4 . 1 8 4 . 3 1 8 . 4 8 4 . 4 1 7 . 5 6 7 . 5 1 4 . 9 3 4 . 3 22 26 . 26 17 26 . 31 Fold/Aggregation
  • 21. Graphlab in Recommendation  Graphlab provide better way in recommendation engine.  Its just first load fits simple dataset file.  In graphlab we can also implement various algortihm like k-means clustering ,fuzzy logic, pagerank and etc.  Its first translated dataset into Matrix form.  And then according to different algorithm it generated recommendated output.
  • 22. Google Prediction Service  Google cloud service used for Building smart Application.  Having Machine learning Algorithms.  Related to Artificial Intelligence.
  • 23. Google Prediction Service  Google Prediction API :  Set of Methods for Data Analysis.  Libraries support multiple languages.  Google App Engine :  Enable Application to Cloud environment Application server  Google Cloud Storage :  Enable Data to store on Google Cloud database.
  • 25. Technology : MAHOUT • Apache Mahout is open source project by the Apache Software Foundation (ASF). • The primary goal of Mahout is creating scalable machine-learning algorithms. • Several Map-Reduce in Mahout enabled clustering implementations, including k-Means, fuzzy k-Means, Canopy, Dirichlet, and Mean-Shift. • Mahout have fix datasets which generally take as data input. • Amzon EC2 are working with Hadoop and Mahout.
  • 26. Implementation Issues to solved  Lack of knowledge about hadoop,mahout,hive  Memory issue  Operating system support  Load Balancing  Configuration  Data normalization  Developing Clustering algorithm  Configuring mahout with hadoop
  • 27. Application of recommendation  Yahoo!  Facebook  Twitter  Baidu  eBay  LinkedIn  New York Times  Rackspace  eHarmony  Powerset Recommendation Engine
  • 28. Future enhancement  Integration with Web Application like Jsp , Servlet  Integration with Database like Hive, Hbase, Mongodb, Couch db  Cloud based recommendation Service  Integration of Mahout , Graphlab and Google prediction based recommendation services.  Mobile application integration

Notas do Editor

  1. The GraphLab model is defined in 4 parts. The Data Graph which is used to express sparse data dependencies in your computation.And the Shared Data Table which is used to express global data as well as global computationIn addition, we also have the scheduler which determines the order of computationAnd the scope system which provides thread safety and consistency.
  2. 2 parts. A Map stage and a Reduce stage. The Map stage represents embarassingly parallel computation. That is, each computation is independent and can performed on different macheina without any communciation.
  3. For instance, we could use MapReduce to perform feature extraction on a large number of pictures. For instance, .. To compute an attractiveness score.
  4. The Reduce stage is essentially a “fold” or an aggregation operation over the results. This for instance can be used to compile summary statistics.