Deep Learning With Neural Networks

Aniket Maurya
Aniket MauryaSoftware Engineer em Kalinga Institute Of Industrial Technology
Deep Learning With Neural
Networks
Aniket Maurya
aniketmaurya@outlook.com
Deep Learning
The term Deep Learning refers
to training Neural Networks and
sometime very large Neural
Networks.
Deep Learning algorithms can
learn very complex patterns, if
provided enough data.
Source: https://www.analyticsvidhya.com/blog/2017/04/comparison-between-deep-learning-machine-learning/
What is Neural Networks?
● Neural Networks are computing systems inspired
by Biological Neurons. They try to mimic the
functionality of the brain.
● It is constructed of neurons which learns different
patterns and each neuron together produce the
correct output.
● Each Neurons has weight which are initialized with
random value and the values are updated to catch
a pattern by a series of propagations through the
network.
❏ Layers in a neural networks architecture:
● Input layer: The external information or data is fed in the units of input layer.
● Output layer: This layer gives the final output or prediction of the data fed into input
layer.
● Hidden layer: Layers except the input and output layers are called Hidden layers. These
are intermediate layers. There can be many hidden layers in a Network.
❏ Propagations in Neural Networks
Forward Propagation:
● As the name suggest, a series of calculations are
performed in neuralNets from first to the last
layer.
● The weights are multiplied with input and passed
through an activation function, the value obtained
is passed to the next layer and the same step is
repeated until the output layer.
● The value of output layer is used to calculate the
error in the model.
❏ Back-Propagation
● It is the main reason a neural network work.
● The randomly initialized weights are updated to minimize the error in the prediction. The
updation starts from last layer to the first hidden layer.
● The weight updation is done using the Gradient Descent algorithm.
❏ Gradient Descent Algorithm
The gradient descent algorithm calculates the error derivative w.r.t. weight and it is multiplied by the learning rate
and then subtracted from the old weight. The gradient descent algorithm has been optimized and modified in
recent years by many researchers. Some of the optimized learning algorithms are RMSProp and Adam.
❏ Bias-Variance Problem
During the training of Neural Network algorithms the
model can suffer overfitting and underfitting.
● Overfitting: When the model performs well on
training set but fails to generalise on the validation
set, is called Overfitting. This usually happens when
the architecture is large and the data is not
sufficient. It is also called High Variance.
● Underfitting: It is the poor performance of model
on the training set itself. It is also called High Bias.
❏ Controlling Overfitting and underfitting
Two main techniques to avoid overfitting problems are:
● Regularization: It is a technique used to reduce overfitting. Its basic concept is to
penalize the weights. It comes in action with Gradient Descent algo.
● Dropout: In this technique the random neurons are disabled. It improves the model
accuracy significantly.
To control underfitting:
● Number of hidden layers can be increased. Number of units of neurons in a layer can
also be increased.
Convolutional Neural Networks (CNN)
● CNNs are widely used to solve computer-vision related problems.
● It can be used for variety of tasks like image classification, face recognition, style
transfer etc.
● CNNs are formed of mainly 3 components: Convolution layer, Pooling layer and Fully
connected layer.
● In 1989, Yann LeCun used back-propagation to train a CNN to classify digits. It became
a foundation for modern computer-vision.
❏ Applications of CNN
Image Processing: CNN has changed the traditional image processing methods. Using
CNN face recognition and pattern matching can be done more accurately and with less
computation power. It can be used for lossless compression of images.
Autonomous Systems: CNN has powered autonomous systems. Autonomous car, security
cameras etc are not just part of sci-fi movies.
Style Transfer: CNNs are being used to create art.
Medical Image Computing: CNNs are being used in medical image computing to diagnose
diseases. Recently, a model has been developed that performed better than Radiologists to
diagnose Pneumonia from chest X-ray images.
Recurrent Neural Networks (RNN)
● RNNs are used to train sequential data like text and audio.
● RNNs is mostly used in Natural Language processing for recommendation
systems and sentiment analysis.
● The main advantage of RNN is that it can remember contexts.
❏ Conclusion
● With availability of huge amount of data Neural Networks can be used to train complex
patterns.
● Many groundbreaking algorithms have been implemented in recent years which can be
used to solve many problems which the traditional methods could not.
● There are many applications of Neural Network algorithms which can help solve real life
problems like in health sector and electric power consumption.
● Sequential models like RNN, LSTM and GRU are used to create chatbots.
References
● Geoffrey Hinton, “How Neural Networks Learn from Experience”, Scientific American, pp. 145-151, 1992.
● George Babis and Michael Georgiopoulas, “Feed-forward neural networks: Why network size is so important”, IEEE potentials, pp. 27-31, 1994.
● Paul J. Werbos, “Back-propagation Through Time: What It Does and How to Do It”, Proceedings of the IEEE, Vol. 78, No. 10, pp. 1550-1560,
1990.
● Katsuyuki Hagiwara and Kazuhiro Kuno, “Regularization learning and early stopping in linear networks”, Proceedings of the IEEE-INNS-ENNS
International Joint Conference on Neural Networks, Neural Computing: New Challenges and Perspectives for the New Millennium, pp. 511-516,
2000.
● Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan Salakhutdinov, “Dropout: A Simple Way to Prevent Neural Networks
from Over-fitting”, Journal of Machine Learning Research, Vol. 15, pp. 1929-1958, 2014.
● Salman Khan, Hossein Rahmani, Syed Afaq Ali Shah, Mohammed Bennamoun, Gerard Medioni, Sven Dickinson, “A guide to Convolutional
Network for computer vision” pp. 36-39, 43-60.
● Xie Chen, Xunying Liu, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland, “Efficient training and Evaluation of Recurrent Neural Network
Language Models for Automatic Speech Recognition”, IEEE/ACM Transactions on audio, speech and language processing Vol. 24, No. 11, pp.
2146-2157, 2016.
● Takaaki Hori, Chiori Hori, Shinji Watanabe, John R. Hershey, “Minimum word and training of long short - term memory recurrent neural network
language models for speech recognition”, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing.
● Saleh Ali Alshehri, “Neural Network Technique for Image Compression” IET Image Processing, pp. 222-226.
● Nadia Jmour, Sehla Zayen, Afef Abdelkrim, “Convolutional Neural Networks for Image Classification”, IC-ASET, 2018.
● Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, “A Neural Algorithm of Artistic Style”.
● Marcin Andrychowicz, Misha Denil, Sergio Gómez Colmenarejo, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, Nando
de Freitas, “Learning to Learn by Gradient Descent by gradient descent”, NIPS 2016.
1 de 14

Recomendados

Introduction to Deep Learning por
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep LearningOswald Campesato
3.8K visualizações66 slides
Recurrent Neural Network (RNN) | RNN LSTM Tutorial | Deep Learning Course | S... por
Recurrent Neural Network (RNN) | RNN LSTM Tutorial | Deep Learning Course | S...Recurrent Neural Network (RNN) | RNN LSTM Tutorial | Deep Learning Course | S...
Recurrent Neural Network (RNN) | RNN LSTM Tutorial | Deep Learning Course | S...Simplilearn
18.7K visualizações63 slides
Deep learning por
Deep learning Deep learning
Deep learning Rajgupta258
1.6K visualizações20 slides
Introduction to Deep learning por
Introduction to Deep learningIntroduction to Deep learning
Introduction to Deep learningleopauly
1.7K visualizações48 slides
Deep learning por
Deep learningDeep learning
Deep learningRatnakar Pandey
11.5K visualizações21 slides
Introduction of Deep Learning por
Introduction of Deep LearningIntroduction of Deep Learning
Introduction of Deep LearningMyungjin Lee
6K visualizações30 slides

Mais conteúdo relacionado

Mais procurados

Deep learning por
Deep learningDeep learning
Deep learningBenha University
8.5K visualizações36 slides
Deep learning - A Visual Introduction por
Deep learning - A Visual IntroductionDeep learning - A Visual Introduction
Deep learning - A Visual IntroductionLukas Masuch
57.5K visualizações53 slides
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori... por
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...Simplilearn
3.6K visualizações54 slides
Convolutional Neural Networks (CNN) por
Convolutional Neural Networks (CNN)Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)Gaurav Mittal
58.5K visualizações70 slides
Hyperparameter Tuning por
Hyperparameter TuningHyperparameter Tuning
Hyperparameter TuningJon Lederman
2.9K visualizações87 slides
Deep learning presentation por
Deep learning presentationDeep learning presentation
Deep learning presentationTunde Ajose-Ismail
33.3K visualizações79 slides

Mais procurados(20)

Deep learning por Benha University
Deep learningDeep learning
Deep learning
Benha University8.5K visualizações
Deep learning - A Visual Introduction por Lukas Masuch
Deep learning - A Visual IntroductionDeep learning - A Visual Introduction
Deep learning - A Visual Introduction
Lukas Masuch57.5K visualizações
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori... por Simplilearn
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
What Is Deep Learning? | Introduction to Deep Learning | Deep Learning Tutori...
Simplilearn3.6K visualizações
Convolutional Neural Networks (CNN) por Gaurav Mittal
Convolutional Neural Networks (CNN)Convolutional Neural Networks (CNN)
Convolutional Neural Networks (CNN)
Gaurav Mittal58.5K visualizações
Hyperparameter Tuning por Jon Lederman
Hyperparameter TuningHyperparameter Tuning
Hyperparameter Tuning
Jon Lederman2.9K visualizações
Deep learning presentation por Tunde Ajose-Ismail
Deep learning presentationDeep learning presentation
Deep learning presentation
Tunde Ajose-Ismail33.3K visualizações
Introduction to Recurrent Neural Network por Knoldus Inc.
Introduction to Recurrent Neural NetworkIntroduction to Recurrent Neural Network
Introduction to Recurrent Neural Network
Knoldus Inc.1.9K visualizações
Deep Learning - Convolutional Neural Networks por Christian Perone
Deep Learning - Convolutional Neural NetworksDeep Learning - Convolutional Neural Networks
Deep Learning - Convolutional Neural Networks
Christian Perone71.4K visualizações
Deep Learning por Shaikh Shahzad
Deep LearningDeep Learning
Deep Learning
Shaikh Shahzad1.2K visualizações
Introduction to Recurrent Neural Network por Yan Xu
Introduction to Recurrent Neural NetworkIntroduction to Recurrent Neural Network
Introduction to Recurrent Neural Network
Yan Xu4.8K visualizações
Introduction to Neural Networks por Databricks
Introduction to Neural NetworksIntroduction to Neural Networks
Introduction to Neural Networks
Databricks21.6K visualizações
Convolutional Neural Networks por Ashray Bhandare
Convolutional Neural NetworksConvolutional Neural Networks
Convolutional Neural Networks
Ashray Bhandare16.2K visualizações
Feed forward ,back propagation,gradient descent por Muhammad Rasel
Feed forward ,back propagation,gradient descentFeed forward ,back propagation,gradient descent
Feed forward ,back propagation,gradient descent
Muhammad Rasel609 visualizações
Deep Learning Tutorial por Amr Rashed
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
Amr Rashed5.4K visualizações
Machine Learning por Bhupender Sharma
Machine LearningMachine Learning
Machine Learning
Bhupender Sharma1.9K visualizações
Machine learning por Amit Rathi
Machine learningMachine learning
Machine learning
Amit Rathi1.9K visualizações
Deep learning por Pratap Dangeti
Deep learningDeep learning
Deep learning
Pratap Dangeti8.8K visualizações
Recurrent Neural Networks, LSTM and GRU por ananth
Recurrent Neural Networks, LSTM and GRURecurrent Neural Networks, LSTM and GRU
Recurrent Neural Networks, LSTM and GRU
ananth29.8K visualizações
Deep Feed Forward Neural Networks and Regularization por Yan Xu
Deep Feed Forward Neural Networks and RegularizationDeep Feed Forward Neural Networks and Regularization
Deep Feed Forward Neural Networks and Regularization
Yan Xu1.9K visualizações

Similar a Deep Learning With Neural Networks

IRJET-Breast Cancer Detection using Convolution Neural Network por
IRJET-Breast Cancer Detection using Convolution Neural NetworkIRJET-Breast Cancer Detection using Convolution Neural Network
IRJET-Breast Cancer Detection using Convolution Neural NetworkIRJET Journal
83 visualizações3 slides
Neural networks por
Neural networksNeural networks
Neural networksRizwan Rizzu
2.2K visualizações27 slides
Neural network por
Neural networkNeural network
Neural networkSaddam Hussain
774 visualizações17 slides
Neural Networks por
Neural Networks Neural Networks
Neural Networks Eric Su
2.6K visualizações21 slides
Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep... por
Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep...Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep...
Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep...Vidit Goyal
229 visualizações25 slides
FACE RECOGNITION USING ELM-LRF por
FACE RECOGNITION USING ELM-LRFFACE RECOGNITION USING ELM-LRF
FACE RECOGNITION USING ELM-LRFAras Masood
77 visualizações32 slides

Similar a Deep Learning With Neural Networks(20)

IRJET-Breast Cancer Detection using Convolution Neural Network por IRJET Journal
IRJET-Breast Cancer Detection using Convolution Neural NetworkIRJET-Breast Cancer Detection using Convolution Neural Network
IRJET-Breast Cancer Detection using Convolution Neural Network
IRJET Journal83 visualizações
Neural networks por Rizwan Rizzu
Neural networksNeural networks
Neural networks
Rizwan Rizzu2.2K visualizações
Neural network por Saddam Hussain
Neural networkNeural network
Neural network
Saddam Hussain774 visualizações
Neural Networks por Eric Su
Neural Networks Neural Networks
Neural Networks
Eric Su2.6K visualizações
Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep... por Vidit Goyal
Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep...Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep...
Ghaziabad, India - Early Detection of Various Types of Skin Cancer Using Deep...
Vidit Goyal229 visualizações
FACE RECOGNITION USING ELM-LRF por Aras Masood
FACE RECOGNITION USING ELM-LRFFACE RECOGNITION USING ELM-LRF
FACE RECOGNITION USING ELM-LRF
Aras Masood77 visualizações
Artificial neural network por GauravPandey319
Artificial neural networkArtificial neural network
Artificial neural network
GauravPandey3191.1K visualizações
Artificial Neural Network for hand Gesture recognition por Vigneshwer Dhinakaran
Artificial Neural Network for hand Gesture recognitionArtificial Neural Network for hand Gesture recognition
Artificial Neural Network for hand Gesture recognition
Vigneshwer Dhinakaran3.9K visualizações
Artificial Neural Network and its Applications por shritosh kumar
Artificial Neural Network and its ApplicationsArtificial Neural Network and its Applications
Artificial Neural Network and its Applications
shritosh kumar1.6K visualizações
DETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PI por IRJET Journal
DETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PIDETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PI
DETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PI
IRJET Journal9 visualizações
DETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PI por IRJET Journal
DETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PIDETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PI
DETECTION OF DIFFERENT TYPES OF SKIN DISEASES USING RASPBERRY PI
IRJET Journal7 visualizações
Neural network techniques por Vipul Bhargava
Neural network techniquesNeural network techniques
Neural network techniques
Vipul Bhargava292 visualizações
Forecasting of Sales using Neural network techniques por Hitesh Dua
Forecasting of Sales using Neural network techniquesForecasting of Sales using Neural network techniques
Forecasting of Sales using Neural network techniques
Hitesh Dua3.1K visualizações
Industrial application por Ashish Goel
Industrial application Industrial application
Industrial application
Ashish Goel415 visualizações
DLD meetup 2017, Efficient Deep Learning por Brodmann17
DLD meetup 2017, Efficient Deep LearningDLD meetup 2017, Efficient Deep Learning
DLD meetup 2017, Efficient Deep Learning
Brodmann171.1K visualizações
Understanding Neural Networks Working and Applications.pptx por kcharizmacruz
Understanding Neural Networks Working and Applications.pptxUnderstanding Neural Networks Working and Applications.pptx
Understanding Neural Networks Working and Applications.pptx
kcharizmacruz4 visualizações
Quantum neural network por surat murthy
Quantum neural networkQuantum neural network
Quantum neural network
surat murthy578 visualizações
Evolving Comprehensible Neural Network Trees por Amr Kamel Deklel
Evolving Comprehensible Neural Network TreesEvolving Comprehensible Neural Network Trees
Evolving Comprehensible Neural Network Trees
Amr Kamel Deklel331 visualizações
Neural network in R by Aman Chauhan por Aman Chauhan
Neural network in R by Aman ChauhanNeural network in R by Aman Chauhan
Neural network in R by Aman Chauhan
Aman Chauhan37 visualizações

Último

CRM stick or twist workshop por
CRM stick or twist workshopCRM stick or twist workshop
CRM stick or twist workshopinfo828217
11 visualizações16 slides
PRIVACY AWRE PERSONAL DATA STORAGE por
PRIVACY AWRE PERSONAL DATA STORAGEPRIVACY AWRE PERSONAL DATA STORAGE
PRIVACY AWRE PERSONAL DATA STORAGEantony420421
5 visualizações56 slides
Chapter 3b- Process Communication (1) (1)(1) (1).pptx por
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptxayeshabaig2004
7 visualizações30 slides
[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks por
[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks
[DSC Europe 23] Aleksandar Tomcic - Adversarial AttacksDataScienceConferenc1
5 visualizações20 slides
Data Journeys Hard Talk workshop final.pptx por
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptxinfo828217
10 visualizações18 slides
[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ... por
[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ...[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ...
[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ...DataScienceConferenc1
8 visualizações18 slides

Último(20)

CRM stick or twist workshop por info828217
CRM stick or twist workshopCRM stick or twist workshop
CRM stick or twist workshop
info82821711 visualizações
PRIVACY AWRE PERSONAL DATA STORAGE por antony420421
PRIVACY AWRE PERSONAL DATA STORAGEPRIVACY AWRE PERSONAL DATA STORAGE
PRIVACY AWRE PERSONAL DATA STORAGE
antony4204215 visualizações
Chapter 3b- Process Communication (1) (1)(1) (1).pptx por ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20047 visualizações
[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks por DataScienceConferenc1
[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks
[DSC Europe 23] Aleksandar Tomcic - Adversarial Attacks
DataScienceConferenc15 visualizações
Data Journeys Hard Talk workshop final.pptx por info828217
Data Journeys Hard Talk workshop final.pptxData Journeys Hard Talk workshop final.pptx
Data Journeys Hard Talk workshop final.pptx
info82821710 visualizações
[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ... por DataScienceConferenc1
[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ...[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ...
[DSC Europe 23] Predrag Ilic & Simeon Rilling - From Data Lakes to Data Mesh ...
DataScienceConferenc18 visualizações
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... por DataScienceConferenc1
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
DataScienceConferenc17 visualizações
LIVE OAK MEMORIAL PARK.pptx por ms2332always
LIVE OAK MEMORIAL PARK.pptxLIVE OAK MEMORIAL PARK.pptx
LIVE OAK MEMORIAL PARK.pptx
ms2332always7 visualizações
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an... por StatsCommunications
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...
OECD-Persol Holdings Workshop on Advancing Employee Well-being in Business an...
StatsCommunications5 visualizações
shivam tiwari.pptx por AanyaMishra4
shivam tiwari.pptxshivam tiwari.pptx
shivam tiwari.pptx
AanyaMishra45 visualizações
Data about the sector workshop por info828217
Data about the sector workshopData about the sector workshop
Data about the sector workshop
info82821715 visualizações
[DSC Europe 23] Danijela Horak - The Innovator’s Dilemma: to Build or Not to ... por DataScienceConferenc1
[DSC Europe 23] Danijela Horak - The Innovator’s Dilemma: to Build or Not to ...[DSC Europe 23] Danijela Horak - The Innovator’s Dilemma: to Build or Not to ...
[DSC Europe 23] Danijela Horak - The Innovator’s Dilemma: to Build or Not to ...
DataScienceConferenc15 visualizações
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P... por DataScienceConferenc1
[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...[DSC Europe 23][AI:CSI]  Dragan Pleskonjic - AI Impact on Cybersecurity and P...
[DSC Europe 23][AI:CSI] Dragan Pleskonjic - AI Impact on Cybersecurity and P...
DataScienceConferenc18 visualizações
Short Story Assignment by Kelly Nguyen por kellynguyen01
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyen
kellynguyen0119 visualizações
Ukraine Infographic_22NOV2023_v2.pdf por AnastosiyaGurin
Ukraine Infographic_22NOV2023_v2.pdfUkraine Infographic_22NOV2023_v2.pdf
Ukraine Infographic_22NOV2023_v2.pdf
AnastosiyaGurin1.4K visualizações
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init... por DataScienceConferenc1
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...
[DSC Europe 23][Cryptica] Martin_Summer_Digital_central_bank_money_Ideas_init...
DataScienceConferenc15 visualizações
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx por DataScienceConferenc1
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
DataScienceConferenc15 visualizações

Deep Learning With Neural Networks

  • 1. Deep Learning With Neural Networks Aniket Maurya aniketmaurya@outlook.com
  • 2. Deep Learning The term Deep Learning refers to training Neural Networks and sometime very large Neural Networks. Deep Learning algorithms can learn very complex patterns, if provided enough data. Source: https://www.analyticsvidhya.com/blog/2017/04/comparison-between-deep-learning-machine-learning/
  • 3. What is Neural Networks? ● Neural Networks are computing systems inspired by Biological Neurons. They try to mimic the functionality of the brain. ● It is constructed of neurons which learns different patterns and each neuron together produce the correct output. ● Each Neurons has weight which are initialized with random value and the values are updated to catch a pattern by a series of propagations through the network.
  • 4. ❏ Layers in a neural networks architecture: ● Input layer: The external information or data is fed in the units of input layer. ● Output layer: This layer gives the final output or prediction of the data fed into input layer. ● Hidden layer: Layers except the input and output layers are called Hidden layers. These are intermediate layers. There can be many hidden layers in a Network.
  • 5. ❏ Propagations in Neural Networks Forward Propagation: ● As the name suggest, a series of calculations are performed in neuralNets from first to the last layer. ● The weights are multiplied with input and passed through an activation function, the value obtained is passed to the next layer and the same step is repeated until the output layer. ● The value of output layer is used to calculate the error in the model.
  • 6. ❏ Back-Propagation ● It is the main reason a neural network work. ● The randomly initialized weights are updated to minimize the error in the prediction. The updation starts from last layer to the first hidden layer. ● The weight updation is done using the Gradient Descent algorithm.
  • 7. ❏ Gradient Descent Algorithm The gradient descent algorithm calculates the error derivative w.r.t. weight and it is multiplied by the learning rate and then subtracted from the old weight. The gradient descent algorithm has been optimized and modified in recent years by many researchers. Some of the optimized learning algorithms are RMSProp and Adam.
  • 8. ❏ Bias-Variance Problem During the training of Neural Network algorithms the model can suffer overfitting and underfitting. ● Overfitting: When the model performs well on training set but fails to generalise on the validation set, is called Overfitting. This usually happens when the architecture is large and the data is not sufficient. It is also called High Variance. ● Underfitting: It is the poor performance of model on the training set itself. It is also called High Bias.
  • 9. ❏ Controlling Overfitting and underfitting Two main techniques to avoid overfitting problems are: ● Regularization: It is a technique used to reduce overfitting. Its basic concept is to penalize the weights. It comes in action with Gradient Descent algo. ● Dropout: In this technique the random neurons are disabled. It improves the model accuracy significantly. To control underfitting: ● Number of hidden layers can be increased. Number of units of neurons in a layer can also be increased.
  • 10. Convolutional Neural Networks (CNN) ● CNNs are widely used to solve computer-vision related problems. ● It can be used for variety of tasks like image classification, face recognition, style transfer etc. ● CNNs are formed of mainly 3 components: Convolution layer, Pooling layer and Fully connected layer. ● In 1989, Yann LeCun used back-propagation to train a CNN to classify digits. It became a foundation for modern computer-vision.
  • 11. ❏ Applications of CNN Image Processing: CNN has changed the traditional image processing methods. Using CNN face recognition and pattern matching can be done more accurately and with less computation power. It can be used for lossless compression of images. Autonomous Systems: CNN has powered autonomous systems. Autonomous car, security cameras etc are not just part of sci-fi movies. Style Transfer: CNNs are being used to create art. Medical Image Computing: CNNs are being used in medical image computing to diagnose diseases. Recently, a model has been developed that performed better than Radiologists to diagnose Pneumonia from chest X-ray images.
  • 12. Recurrent Neural Networks (RNN) ● RNNs are used to train sequential data like text and audio. ● RNNs is mostly used in Natural Language processing for recommendation systems and sentiment analysis. ● The main advantage of RNN is that it can remember contexts.
  • 13. ❏ Conclusion ● With availability of huge amount of data Neural Networks can be used to train complex patterns. ● Many groundbreaking algorithms have been implemented in recent years which can be used to solve many problems which the traditional methods could not. ● There are many applications of Neural Network algorithms which can help solve real life problems like in health sector and electric power consumption. ● Sequential models like RNN, LSTM and GRU are used to create chatbots.
  • 14. References ● Geoffrey Hinton, “How Neural Networks Learn from Experience”, Scientific American, pp. 145-151, 1992. ● George Babis and Michael Georgiopoulas, “Feed-forward neural networks: Why network size is so important”, IEEE potentials, pp. 27-31, 1994. ● Paul J. Werbos, “Back-propagation Through Time: What It Does and How to Do It”, Proceedings of the IEEE, Vol. 78, No. 10, pp. 1550-1560, 1990. ● Katsuyuki Hagiwara and Kazuhiro Kuno, “Regularization learning and early stopping in linear networks”, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks, Neural Computing: New Challenges and Perspectives for the New Millennium, pp. 511-516, 2000. ● Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan Salakhutdinov, “Dropout: A Simple Way to Prevent Neural Networks from Over-fitting”, Journal of Machine Learning Research, Vol. 15, pp. 1929-1958, 2014. ● Salman Khan, Hossein Rahmani, Syed Afaq Ali Shah, Mohammed Bennamoun, Gerard Medioni, Sven Dickinson, “A guide to Convolutional Network for computer vision” pp. 36-39, 43-60. ● Xie Chen, Xunying Liu, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland, “Efficient training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition”, IEEE/ACM Transactions on audio, speech and language processing Vol. 24, No. 11, pp. 2146-2157, 2016. ● Takaaki Hori, Chiori Hori, Shinji Watanabe, John R. Hershey, “Minimum word and training of long short - term memory recurrent neural network language models for speech recognition”, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing. ● Saleh Ali Alshehri, “Neural Network Technique for Image Compression” IET Image Processing, pp. 222-226. ● Nadia Jmour, Sehla Zayen, Afef Abdelkrim, “Convolutional Neural Networks for Image Classification”, IC-ASET, 2018. ● Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, “A Neural Algorithm of Artistic Style”. ● Marcin Andrychowicz, Misha Denil, Sergio Gómez Colmenarejo, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, Nando de Freitas, “Learning to Learn by Gradient Descent by gradient descent”, NIPS 2016.