Microsoft Neural Network and Logistic Regression
Overview
Microsoft Neural Network and Logistic Regression overview
DMX Queries
Model Content
Principles of the Microsoft Neural Network Algorithm
Algorithm Parameters
Microsoft Neural Network overview
The analysis performed by the Microsoft Neural Network algorithm rests on two factors. First, any or all of the inputs may be related in some way to any or all of the outputs, and the network must take this into account during training. Second, different combinations of inputs may be related differently to the outputs.
Microsoft Neural Network overview
The relationships detected by the Microsoft Neural Network algorithm may span up to two levels. In the single-level case, input facts are connected directly to the outputs. In the two-level case, input combinations effectively become new inputs, which are then connected to the outputs. The level that transforms certain input combinations into new inputs is referred to as a hidden layer.
Microsoft Logistic Regression overview
The Microsoft Logistic Regression algorithm uses a single level of relationships to predict the probability of events based on inputs. It is implemented by forcing the hidden layer of a neural network to have zero nodes, a change that is manifest only in the internal structure of the algorithm.
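The single-level case can be sketched in a few lines of Python. This is a minimal illustration rather than Microsoft's implementation: it assumes a sigmoid output neuron, and the weights, bias, and input values are hypothetical numbers picked by hand.

```python
import math

def sigmoid(z):
    """Logistic function: maps any real number into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def predict_probability(inputs, weights, bias):
    """Single-level network: every input connects directly to one output neuron."""
    z = bias + sum(w * x for w, x in zip(weights, inputs))
    return sigmoid(z)

# Hypothetical weights and bias, chosen for illustration only (not learned).
example_weights = [0.8, -0.5]
example_bias = 0.1
p = predict_probability([1.0, 2.0], example_weights, example_bias)
```

With no hidden layer, the output is just the logistic function applied to a weighted sum of the inputs, which is why the result can be read as a probability.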
DMX Queries
The Microsoft Neural Network algorithm supports most of the tasks that Microsoft Decision Trees can do, including classification and regression. The following queries create and train a mining structure for employee information data.
DMX Queries
CREATE MINING STRUCTURE EmployeeStructure (
    EmployeeID LONG KEY,
    Gender TEXT DISCRETE,
    [Marital Status] TEXT DISCRETE,
    Age LONG CONTINUOUS,
    [Education Level] TEXT DISCRETE,
    [Home Ownership] TEXT DISCRETE,
    TechnologyUsage TABLE (
        [Technology] TEXT KEY
    )
)
GO
A mining structure holding employee data and technology usage information.
DMX Queries
INSERT INTO MINING STRUCTURE [EmployeeStructure] (
    [EmployeeID], [Gender], [Marital Status], [Age],
    [Education Level], [Home Ownership],
    [TechnologyUsage] ( SKIP, [Technology] )
)
SHAPE {
    OPENQUERY ([Chapter 12],
        'SELECT [EmployeeID], [Gender], [Marital Status], [Age],
                [Education Level], [Home Ownership]
         FROM [Customers] ORDER BY [EmployeeID]')
}
APPEND (
    {
        OPENQUERY ([Chapter 12],
            'SELECT [EmployeeID], [Technology]
             FROM [Technology] ORDER BY [EmployeeID]')
    }
    RELATE [EmployeeID] TO [EmployeeID]
) AS [TechUsage]
GO
Query to train the mining structure with demographic data and technology usage information.
DMX Queries
ALTER MINING STRUCTURE EmployeeStructure
ADD MINING MODEL VariousPredictions (
    EmployeeID,
    Gender,
    [Marital Status],
    [Age] PREDICT,
    [Education Level] PREDICT,
    [Home Ownership] PREDICT
) USING Microsoft_Neural_Network
GO
INSERT INTO VariousPredictions
GO
Query to build a Neural Network mining model that predicts both discrete targets (Education Level, Home Ownership) and a continuous target (Age).
DMX Queries
ALTER MINING STRUCTURE EmployeeStructure
ADD MINING MODEL NestedTableInput (
    EmployeeID,
    Gender,
    [Marital Status],
    [Age] PREDICT,
    [Education Level],
    [Home Ownership],
    TechnologyUsage ( Technology )
) USING Microsoft_Neural_Network
GO
INSERT INTO NestedTableInput
GO
You can also include a nested table in a neural network model, as long as it is not marked as predictable. This query predicts Age based on the employee's demographic data, as well as the technology items that the employee is currently using.
Model Content
A Neural Network model has one or more subnets. The model content describes the topologies of these subnets. It also stores the weights of each edge of the neural network.
Model Content
Understanding the Structure of a Neural Network Model
Each neural network model has a single parent node that represents the model and its metadata, and a marginal statistics node that provides descriptive statistics about the input attributes. Underneath these two nodes there are at least two more nodes, and possibly many more, depending on how many predictable attributes the model has.
The first node always represents the top node of the input layer. Beneath this top node, you can find input nodes that contain the actual input attributes and their values.
Successive nodes each contain a different subnetwork. Each subnetwork always contains a hidden layer and an output layer for that subnetwork.
Principles of the Microsoft Neural Network Algorithm
The origin of the Neural Network algorithm can be traced to the 1940s, when two researchers, Warren McCulloch and Walter Pitts, tried to build a model to simulate how biological neurons work. Neural networks mainly address the classification and regression tasks of data mining. Like decision trees, neural networks can find nonlinear relationships among input attributes and predictable attributes. Neural networks support both discrete and continuous outputs.
How the algorithm works
The Microsoft Neural Network algorithm creates a network that is composed of up to three layers of neurons.
Input layer: Input neurons define all the input attribute values for the data mining model, and their probabilities.
Hidden layer: Hidden neurons receive inputs from input neurons and provide outputs to output neurons. The hidden layer is where the various probabilities of the inputs are assigned weights. The greater the weight that is assigned to an input, the more important the value of that input is.
Output layer: Output neurons represent predictable attribute values for the data mining model.
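The flow of values through the three layers can be sketched as a forward pass. The sketch below is illustrative only: the slides do not name an activation function, so the sigmoid used here is an assumption, and the network shape and all weights are hypothetical.

```python
import math

def sigmoid(z):
    """Assumed activation function; the slides do not specify one."""
    return 1.0 / (1.0 + math.exp(-z))

def layer_output(inputs, weight_rows, biases):
    """Each row of weight_rows holds the incoming weights of one neuron."""
    return [
        sigmoid(b + sum(w * x for w, x in zip(row, inputs)))
        for row, b in zip(weight_rows, biases)
    ]

def forward(inputs, hidden_weights, hidden_biases, output_weights, output_biases):
    """Input layer -> hidden layer -> output layer."""
    hidden = layer_output(inputs, hidden_weights, hidden_biases)
    return layer_output(hidden, output_weights, output_biases)

# Hypothetical network: 2 inputs, 2 hidden neurons, 1 output neuron.
outputs = forward(
    [1.0, 0.0],
    hidden_weights=[[0.5, -0.4], [0.3, 0.8]],
    hidden_biases=[0.0, -0.1],
    output_weights=[[1.2, -0.7]],
    output_biases=[0.05],
)
```

A larger weight makes the corresponding input contribute more to the neuron's weighted sum, which is the sense in which weight indicates importance.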
Neural Network Model
Backpropagation
Backpropagation, which is considered the core process of the algorithm, involves the following steps:
1. The algorithm randomly assigns values to all the weights in the network at the initial stage (usually ranging from -1.0 to 1.0).
2. For each training example, the algorithm calculates the outputs based on the current weights in the network.
3. The algorithm calculates the errors for each output and hidden neuron in the network, and the weights in the network are updated.
4. Steps 2 and 3 are repeated until the stopping condition is satisfied.
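The four steps can be sketched for a network with one hidden layer and a single output. This is a simplified illustration, not the Microsoft implementation: the sigmoid activation, squared-error measure, learning rate, fixed epoch count used as the stopping condition, and the tiny OR data set are all assumptions made for the example.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(examples, n_hidden=2, epochs=2000, rate=0.5, seed=0):
    """Return the total squared error recorded after each pass over the data."""
    rng = random.Random(seed)
    n_in = len(examples[0][0])
    # Step 1: random initial weights in [-1.0, 1.0]; the last weight of each
    # neuron acts as a bias attached to a constant input of 1.0.
    w_hid = [[rng.uniform(-1.0, 1.0) for _ in range(n_in + 1)] for _ in range(n_hidden)]
    w_out = [rng.uniform(-1.0, 1.0) for _ in range(n_hidden + 1)]
    history = []
    for _ in range(epochs):  # Step 4: repeat (assumed stop: fixed epoch count)
        total_error = 0.0
        for x, target in examples:
            # Step 2: compute outputs from the current weights.
            xb = list(x) + [1.0]
            hid = [sigmoid(sum(w * v for w, v in zip(row, xb))) for row in w_hid]
            hb = hid + [1.0]
            out = sigmoid(sum(w * v for w, v in zip(w_out, hb)))
            total_error += 0.5 * (target - out) ** 2
            # Step 3: error terms for the output neuron and each hidden neuron,
            # computed before any weight changes.
            d_out = (target - out) * out * (1.0 - out)
            d_hid = [d_out * w_out[j] * hid[j] * (1.0 - hid[j]) for j in range(n_hidden)]
            # Step 3 (continued): update the weights.
            w_out = [w + rate * d_out * v for w, v in zip(w_out, hb)]
            w_hid = [
                [w + rate * d_hid[j] * v for w, v in zip(w_hid[j], xb)]
                for j in range(n_hidden)
            ]
        history.append(total_error)
    return history

# Hypothetical training set: the logical OR of two binary inputs.
or_examples = [([0.0, 0.0], 0.0), ([0.0, 1.0], 1.0), ([1.0, 0.0], 1.0), ([1.0, 1.0], 1.0)]
errors = train(or_examples)
```

Comparing `errors[0]` with `errors[-1]` shows whether repeated weight updates reduced the error on the training examples.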
Algorithm Parameters
The Microsoft Neural Network algorithm supports several parameters that affect the behavior, performance, and accuracy of the resulting mining model.
MAXIMUM_INPUT_ATTRIBUTES: The default value is 255.
MAXIMUM_OUTPUT_ATTRIBUTES: The default value is 255.
Algorithm Parameters
MAXIMUM_STATES: If the number of states that an attribute has is greater than the maximum number of states, the algorithm uses the attribute's most popular states and treats the remaining states as Missing. The default value is 100.
SAMPLE_SIZE: The default value is 10000.
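The state-capping rule described above can be illustrated with a short sketch. The helper name `cap_states`, the sample values, and the choice of the string "Missing" as the replacement label are assumptions for illustration; how the real algorithm breaks ties between equally frequent states is not stated in the slides.

```python
from collections import Counter

def cap_states(values, maximum_states):
    """Keep the most frequent states; relabel every other state as Missing."""
    counts = Counter(values)
    keep = {state for state, _ in counts.most_common(maximum_states)}
    return [v if v in keep else "Missing" for v in values]

# Hypothetical attribute with four states, capped at two.
capped = cap_states(["a", "a", "a", "b", "b", "c", "d"], maximum_states=2)
```

Here the two most frequent states, "a" and "b", survive, while "c" and "d" are folded into Missing.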
Algorithm Parameters
The holdout data is used to validate the accuracy during training. The default value is 0.1.
HIDDEN_NODE_RATIO specifies the ratio of hidden neurons to input and output neurons. The following formula determines the initial number of neurons in the hidden layer:
    HIDDEN_NODE_RATIO * SQRT(Total input neurons * Total output neurons)
The default value is 4.0.
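As a worked example of the formula, the sketch below evaluates it for a hypothetical network with 16 input neurons and 4 output neurons. The function name is an assumption, and because the slides do not say how the result is rounded, the value is left as a float.

```python
import math

def initial_hidden_neurons(total_inputs, total_outputs, hidden_node_ratio=4.0):
    """HIDDEN_NODE_RATIO * SQRT(total input neurons * total output neurons)."""
    return hidden_node_ratio * math.sqrt(total_inputs * total_outputs)

# Hypothetical sizes: 4.0 * sqrt(16 * 4) = 4.0 * 8.0 = 32.0
n = initial_hidden_neurons(16, 4)
```

A ratio of 0 yields zero hidden neurons, which matches the zero-node hidden layer described for logistic regression.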
Summary
Microsoft Neural Network and Logistic Regression overview
DMX Queries
Model Content
Principles of the Microsoft Neural Network Algorithm
Algorithm Parameters

Visit more self help tutorials
Pick a tutorial of your choice and browse through it at your own pace. The tutorials section is free, self-guiding and will not involve any additional support. Visit us at www.dataminingtools.net