SlideShare uma empresa Scribd logo
1 de 64
NETWORK SCIENCE   The science of the 21st century




Times
cited




                             Years              Network Science: Introduction January 10, 2011
NETWORK SCIENCE   The science of the 21st century




Times
cited




                             Years              Network Science: Introduction January 10, 2011
NETWORK SCIENCE   The science of the 21st century




  Times
  cited




                             Years

                                                Network Science: Introduction January 10, 2011
NETWORK SCIENCE   The science of the 21st century




                   Why now?




                                                Network Science: Introduction January 10, 2011
THE EMERGENCE OF NETWORK SCIENCE


         Data Availability:    Movie Actor Network, 1998;
                               World Wide Web, 1999.
                               C elegans neural wiring diagram 1990
                               Citation Network, 1998
                               Metabolic Network, 2000;
                               PPI network, 2001



              Universality:    The architecture of networks emerging in various
                               domains of science, nature, and technology are
                               more similar to each other than one would have
                               expected.



        The (urgent) need to   Despite the challenges complex systems offer us, we
                               cannot afford to not address their behavior, a view
  understand complexity:       increasingly shared both by scientists and policy
                               makers. Networks are not only essential for this
                               journey, but during the past decade some of the most
                               important advances towards understanding complexity
                               were provided in context of network theory.




                                                       Network Science: Introduction January 10, 2011
EPIDEMIC FORECAST   Predicting the H1N1 pandemic




           Real                       Projected




                                             Network Science: Introduction January 10, 2011
DOCUMENTARY




Thex




              Network Science: Introduction January 10, 2011
Graph theory and basic terminology
      Learning the language




                              Network Science: Graph Theory January 24, 2011
COMPONENTS OF A COMPLEX SYSTEM




               components: nodes, vertices     N

               interactions: links, edges      L


               system:       network, graph   (N,L)

                                                    Network Science: Graph Theory January 24, 2011
NETWORKS OR GRAPHS?


network often refers to real systems
•www,
•social network
•metabolic network.

Language: (Network, node, link)


graph: mathematical representation of a network
•web graph,
•social graph (a Facebook term)

Language: (Graph, vertex, edge)

We will try to make this distinction whenever it is appropriate,
but in most cases we will use the two terms interchangeably.




                                                 Network Science: Graph Theory January 24, 2011
A COMMON LANGUAGE



              friend                                                   Movie 1
                                    co-worker
Peter                       Mary                         Actor 1                   Actor 2

                                                Albert                           Movie 3             Actor 4
   brothers               friend                             Movie 2




        Albert
                                                                   Actor 3



Protein 1              Protein 2
                                   Protein 5




  Protein 9
                                                          N=4
                                                          L=4
                                                                                  Network Science: Graph Theory January 24, 2011
CHOOSING A PROPER REPRESENTATION




 The choice of the proper network representation determines
 our ability to use network theory successfully.

    In some cases there is a unique, unambiguous
 representation.
    In other cases, the representation is by no means unique.

 For example, for a group of individuals, the way you assign
 the links will determine the nature of the question you can
 study.




                                               Network Science: Graph Theory January 24, 2011
CHOOSING A PROPER REPRESENTATION




                            If you connect individuals
                            that work with each other,
                            you will explore
                            the professional network.




                                         Network Science: Graph Theory January 24, 2011
CHOOSING A PROPER REPRESENTATION




                            If you connect those that
                            have a sexual relationship,
                            you will be exploring the
                            sexual networks.




                                         Network Science: Graph Theory January 24, 2011
CHOOSING A PROPER REPRESENTATION




  If you connect individuals based on their first name
  (all Peters connected to each other), you will be
  exploring what?

  It is a network, nevertheless.




                                                Network Science: Graph Theory January 24, 2011
Network Science: Graph Theory January 24, 2011
UNDIRECTED VS. DIRECTED NETWORKS

 Undirected                                Directed

 Links: undirected (symmetrical)           Links: directed (arcs).

 Graph:                                    Digraph = directed graph:

                   L
      A
                                                                     D
                                   M           B                                       An undirected
                        F
                                                           C                           link is the
                                       I                                               superposition of
               D                                                                       two opposite
                                                                                       directed links.
     B                      G
                                                                             E
                                                       G
                                   H       A
           C
                                                                         F

 Undirected links :                            Directed links :
 coauthorship links                            URLs on the www
 Actor network                                 phone calls
 protein interactions                          metabolic reactions


                                                                 Network Science: Graph Theory January 24, 2011
ADJACENCY MATRIX

                                                             4
               4



                                                                                           3
                                            3                             2
                             2
                                                            1
              1



           Aij=1 if there is a link between node i and j
           Aij=0 if nodes i and j are not connected to each other.




           Note that for a directed graph (right) the matrix is not symmetric.



                                                                                 Network Science: Graph Theory January 24, 2011
ADJACENCY MATRIX



 a         e

                        a       bcdefgh
                   a    0       1    0     0       1             0            1             0
                   b    1       0    1     0       0             0            0             1
                   c    0       1    0     1       0             1            1             0
                   d    0       0    1     0       1             0            0             0
      h   b    d   e    1       0    0     1       0             0            0             0
                   f0       0     1    0       0       0             1            0
                   g1       0     1    0       0       0             0            0
                   h0       1     0    0       0       0             0            0
      f


 g        c




                                                       Network Science: Graph Theory January 24, 2011
NODE DEGREES

                                 Node degree: the number of links connected to the node.
Undirected



             j


                         i




                 4




                             3
                     2
                 1




                                                                     Network Science: Graph Theory January 24, 2011
NODE DEGREES

                                           In directed networks we can define an in-degree and out-degree.
                           D               The (total) degree is the sum of in- and out-degree.
               B
                       C
Directed




                                              kC  2
                                               in
                                                          kC  1
                                                           out
                                                                         kC  3
                                       E
                   G
           A

                               F           Source: a node with kin= 0; Sink: a node withkout= 0.




               4




                                   3
                       2
               1
A BIT OF STATISTICS



      We have a sample of values x1, ..., xN


          Average(a.k.a. mean): typical value
             <x> = (x1 + x1 + ... + xN)/N = Σi xi /N




          Standard deviation:fluctuations around typical value
             σx= √Σi (xi - <x>)2/N




                                                         Network Science: Graph Theory January 24, 2011
AVERAGE DEGREE



                                                               N
                                                       1
                                                               k
Undirected



                 j                                 k                 i
                                                       N       i 1

                         i

                                                  N – the number of nodes in the graph




                                                         N                            N
                                                    1                             1
                                                        k , k                        
                             D
                                                                                      k iout , k in  k out
                 B                           in                in         out
                     C
                                         k                     i
                                                    N   i 1                      N   i 1
Directed




                                     E
             A

                                 F



                                                                                      Network Science: Graph Theory January 24, 2011
COMPLETE GRAPH




   The maximum number of links a network
   of N nodes can have is:




   A graph with degree L=Lmaxis called a complete graph,
   and its average degree is <k>=N-1




                                             Network Science: Graph Theory January 24, 2011
SPARSE GRAPH




       Most networks observed in real systems are sparse:

                                       L <<Lmax (or <k><<N-1).

     WWW (ND Sample):         N=325,729;             <k>=4.51
     Protein (S. Cerevisiae): N=1870;                <k>=2.39
 Coauthorship (Math):     N=70 975;              <k>=3.9
     Movie Actors:            N=212 250;             <k>=28.78
 (Source: Albert, Barabasi, RMP2002)


 Consequence: Their adjacency matrix is filled with zeros!




                                                                 Network Science: Graph Theory January 24, 2011
ACTOR NETWORK
          Austin Powers:                   Let’s make
          The spy who                      it legal
          shagged me

                           Robert Wagner



                     Wild Things
                                                        What Price Glory



                                                           Barry Norton

           A Few                                                   Monsieur
           Good Men                                                Verdoux
ACTOR NETWORK

         Nodes: actors
         Links: cast jointly




                         Days of Thunder (1990) Far
                         and Away    (1992) Eyes
                         Wide Shut (1999)




          N = 212,250 actors k=28.78




                                                      Network Science: Graph Theory January 24, 2011
IMBD SCALE FREE




                  Network Science: Graph Theory January 24, 2011
Network Science: Graph Theory January 24, 2011
GRAPHOLOGY 1

 Undirected                                    Directed
                                        4
                                                                                        4
                      1                                          1


                                  2                                              2
                  3                                          3




 Actor network, protein-protein interactions   WWW, citation networks
                                                                        Network Science: Graph Theory January 24, 2011
GRAPHOLOGY 2

 Unweighted                                  Weighted
 (undirected)                            4   (undirected)                              4
                      1                                          1


                                     2                                         2
                  3                                          3




 protein-protein interactions, www           Call Graph, metabolic networks
                                                                     Network Science: Graph Theory January 24, 2011
GRAPHOLOGY 3

 Self-interactions                          Multigraph
                                            (undirected)
                                        4                                          4
                     1                                         1

                                    2                                      2
                 3                                         3




 Protein interaction network, www           Social networks, collaboration networks
                                                                    Network Science: Graph Theory January 24, 2011
GRAPHOLOGY 4

 Complete Graph
 (undirected)
                                        4
                       1


                                   2
                   3




 Actor network, protein-protein interactions
                                               Network Science: Graph Theory January 24, 2011
GRAPHOLOGY X



                        WWW >>directed multigraph with self-interactions

        Protein Interactions >>undirected unweighted with self-interactions

           Protein Complex >>unweighted complete graph with self-interactions

     Collaboration network >>undirected multigraph or weighted.

         Mobile phone calls >>directed, weighted.

 Facebook Friendship links >>undirected, unweighted.




                                                           Network Science: Graph Theory January 24, 2011
BIPARTITE GRAPHS



 bipartite graph(or bigraph) is a graph
     whose nodes can be divided into two
     disjoint setsU and V such that every link
     connects a node in U to one in V; that is,
     U and V are independent sets.




 Examples:

 Hollywood actor network
 Collaboration networks
 Disease network (diseasome)




                                                  Network Science: Graph Theory January 24, 2011
GENE NETWORK – DISEASE NETWORK

                                            DISEASOME       PHENOME


                                                 GENOME




          Gene network
                                                                         Disease network



 Goh, Cusick, Valle, Childs, Vidal &Barabási, PNAS (2007)
                                                                      Network Science: Graph Theory January 24, 2011
HUMAN DISEASE NETWORK
Network Science: Graph Theory January 24, 2011
STATISTICS REMINDER


  We have a sample of values x1, ..., xN
     Distribution of x (a.k.a. PDF): probability that a randomly chosen value is x

         P(x) = (# values x) / N

         ΣiP(xi) = 1 always!




        Histograms >>>




                                                                      Network Science: Graph Theory January 24, 2011
DEGREE DISTRIBUTION


   Degree distributionP(k): probability that
      a randomly chosen vertex has degree k

      Nk = # nodes with degree k
      P(k) = Nk / N  plot
                                               P(k)
                                               0.6
                                               0.5
                                               0.4
                                               0.3
                                               0.2
                                               0.1

                                                      1    2       3      4         k




                                                          Network Science: Graph Theory January 24, 2011
DEGREE DISTRIBUTION


 discrete representation: pkis the probability that a node has degree k.


 continuum description: P(k) is the pdf of the degrees, where




 represents the probability that a node’s degree is between k1 and k2.

 Normalization condition:




  , where Kmin is the minimal degree in the network.




                                                                           Network Science: Graph Theory January 24, 2011
Network Science: Graph Theory January 24, 2011
PATHS

A path is a sequence of nodes in which each node is adjacent to the next one

Pi0,in of length nbetween nodes i0 and in is an ordered collection of n+1 nodes and n links




                                                                                                                    B



                                                                                          A
   •A path can intersect itself and pass through the same
   link repeatedly. Each time a link is crossed, it is counted
                                                                      E
   separately
                                                                                                                C
   •A legitimate path on the graph on the right:
                                                                                         D
   ABCBCADEEBA

   •In a directed network, the path can follow only the
   direction of an arrow.




                                                                          Network Science: Graph Theory January 24, 2011
NUMBER OF PATHS BETWEEN TWO NODES                                   Adjacency Matrix


 Nij,number of paths between any two nodes i and j:
 Length n=1:If there is a link between i and j, then Aij=1 and Aij=0 otherwise.

 Length n=2:If there is a path of length two between i and j, then AikAkj=1, and AikAkj=0 otherwise.
 The number of paths of length 2:




 Length n: In general, if there is a path of length n between i and j, then Aik…Alj=1
 and Aik…Alj=0 otherwise.
 The number of paths of length n between i and j is*




 *holds for both directed and undirected networks.



                                                                          Network Science: Graph Theory January 24, 2011
DISTANCE IN A GRAPH       Shortest Path, Geodesic Path



               B       The distance (shortest path, geodesic path) between two
                       nodes is defined as the number of edges along the shortest
       A
                       path connecting them.


                       *If the two nodes are disconnected, the distance is infinity.
                   C
   D



                       In directed graphs each path needs to follow the direction of
                       the arrows.
              B
                       Thus in a digraph the distance from node A to B (on an AB
       A
                       path) is generally different from the distance from node B to A
                       (on a BCA path).

                  C
  D



                                                              Network Science: Graph Theory January 24, 2011
FINDING DISTANCES: BREADTH FIRST SEATCH


 Distance between node1and node 4:


 1.Start at1.




                                     1




                                          Network Science: Graph Theory January 24, 2011
FINDING DISTANCES: BREADTH FIRST SEATCH


 Distance between node1and node 4:


 1.Start at1.
 2.Find the nodes adjacent to1. Mark them as at distance 1. Put them in a queue.




                                     1       1      1




                                             1




                                                                      Network Science: Graph Theory January 24, 2011
FINDING DISTANCES: BREADTH FIRST SEATCH


 Distance between node1and node 4:


 1.Start at1.
 2.Find the nodes adjacent to1. Mark them as at distance 1. Put them in a queue.
 3.Take the first node out of the queue. Find the unmarked nodes adjacent to it in the
 graph. Mark them with the label of 2. Put them in the queue.


                                                                2
                            2         1        1       1

                                                                2

                                               1




                                           2       2




                                                                        Network Science: Graph Theory January 24, 2011
FINDING DISTANCES: BREADTH FIRST SEARCH


 Distance between node1and node 4:


 1.Start at1.
 2.Find the nodes adjacent to1. Mark them as at distance 1. Put them in a queue.
 3.Take the first node out of the queue. Find the unmarked nodes adjacent to it in the
 graph. Mark them with the label of 2. Put them in the queue.
 4.…
 5.Take the first node, w, out of the queue. Find the unmarked nodes adjacent to it in the
 graph. Mark them with the label of w+1. Put them in the queue.




                                                                2         w
                  w         2         1       1       1                                    w+1
        w+1
                                                                2




                                                                         Network Science: Graph Theory January 24, 2011
FINDING DISTANCES: BREADTH FIRST SEATCH


 Distance between node1and node 4:

 1.Repeat until you find node 4 or there are no more nodes in the queue.
 2.The distance between1and4is the label of4or, if4does not have a label, infinity.



                                                                         3              4
                           3

                                                                 2
           4                                                             3
                       3           2   1        1       1                               4

                                                                 2
                           3                                             3
               4                                1                                   4


                   4                                                 3
                               4                                             4
                                            2       2



                                            3


                                                                         Network Science: Graph Theory January 24, 2011
FINDING DISTANCES: BREADTH FIRST SEATCH ALGORITHM




    For a weighted network, we have Dijkstra’s algorithm.

    http://en.wikipedia.org/wiki/Dijkstra%27s_algorithm




                                                                                       Network Science: Graph Theory January 24, 2011
                            http://www.yaldex.com/games-programming/0672323699_ch12lev1sec7.html
RECORDING DISTANCES

                                                                                               B

                                  A B C D                                A
                                A 0 lABlAClAD
                                B lBA 0 lBClBD
  Fill out the matrix
                                C lCAlCB 0 lCD
                                D lDAlDBlDC 0                                                      C
                                                             D




  Q: How many entries will you need for an N- node graph?                                     B

                                                                         A



  A: N(N-1) in a digraph, N(N-1)/2 in a symmetrical graph.
  Let’s use the notation
                                                                                                  C
                                                             D




                                                             Network Science: Graph Theory January 24, 2011
NETWORK DIAMETER AND AVERAGE DISTANCE



  Diameter: the maximum distance between any pair of nodes in the graph.


  Average path length/distance for a connected graph (component) or a strongly
  connected (component of a) digraph.

       where lij is the distance from node i to node j




  In an undirected (symmetrical) graph lij =lji, we only need to count them once




                                                                    Network Science: Graph Theory January 24, 2011
CONNECTIVITY OF UNDIRECTED GRAPHS


   Connected (undirected) graph: any two vertices can be joined by a path.
   A disconnected graph is made up by two or more connected components.


                        B
                                                        B
              A
                                              A                 Largest Component:
                                                                Giant Component

                            C
     D    F                                                 C
                                    D     F
                         F
                                                         F      The rest: Isolates
          G
                                         G



   Bridge: if we erase it, the graph becomes disconnected.




                                                                   Network Science: Graph Theory January 24, 2011
CONNECTIVITY OF UNDIRECTED GRAPHS                             Adjacency Matrix




  The adjacency matrix of a network with several components can be written in a block-
  diagonal form, so that nonzero elements are confined to squares, with all other elements
  being zero:




  Figure after Newman, 2010
                                                                      Network Science: Graph Theory January 24, 2011
CONNECTIVITY OF DIRECTED GRAPHS

 Strongly connected directed graph: has a path from each node to
 every other node and vice versa (e.g. AB path and BA path).
 Weakly connected directed graph: it is connected if we disregard the
 edge directions.

 Strongly connected components can be identified, but not every node is part
 of a nontrivial strongly connected component.


                         B
                                        E
              A                                                                F
                                                              B

                                                    A

  D       E
                             C
                                        D                     C               G
                     F

          G

 In-component: nodes that can reach the scc,
 Out-component: nodes that can be reached from the scc.
                                                                      Network Science: Graph Theory January 24, 2011
HISTORICAL DETOUR: THE BRIDGES OF KONIGSBERG




 Can one walk across the seven bridges and never cross the same bridge twice?


 Euler circuit: return to the starting point by traveling each link of the graph
 once and only once.
                                                                              http://maps.google.com/maps?oe=utf-
                                                                              8&client=firefox-
                                                                              a&q=kaliningrad&ie=UTF8&hq=&hnear=Kalining
                                                                              rad,+Kaliningrad+Oblast,+Russia&gl=us&ll=54.70

 Euler’s theorem:                                                             7293,20.510788&spn=0.009248,0.025878&t=h&
                                                                              z=15

 (a) If a graph has nodes of odd degree, it cannot have an Euler circuit.
 (b) If a graph is connected and has no odd degree nodes, it has at
     least one Euler circuit.


       How would we need to modify the graph so it has an Euler circuit?
                                                                       Network Science: Graph Theory January 24, 2011
EULERIAN GRAPH




 Every vertex of this graph has an even degree, therefore this is an Eulerian graph.
 Following the edges in alphabetical order gives an Eulerian circuit/cycle.


 http://en.wikipedia.org/wiki/Euler_circuit
                                                                      Network Science: Graph Theory January 24, 2011
EULER CIRCUITS IN DIRECTED GRAPHS
       B
                   If a digraph is strongly connected and the in-
                   degree of each node is equal to its out-degree,
       D
                   then there is an Euler circuit

   A           C
                   Q: Give one possible Euler circuit
           E



   F           G


  Otherwise there is no Euler circuit.
  This is because in a circuit we need to
  enter each node as many times as we
  leave it.
                                                        Network Science: Graph Theory January 24, 2011
CLUSTERING COEFFICIENT



          Clustering coefficient:
       what portion of your neighbors are connected?

              Node i with degree ki

              Ciin [0,1]




                                                       Network Science: Graph Theory January 24, 2011
CLUSTERING COEFFICIENT




          Clustering coefficient: what portion of your
          neighbors are connected?
            Node i with degree ki

                                                     3            6
                                           1
                                                                                8
                                                         5
                                                     4
                                           2                 7
                                     10
                                                 9


       i=8: k8=2, e8=1, TOT=2*1/2=1  C8=1/1=1



                                                         Network Science: Graph Theory January 24, 2011
CLUSTERING COEFFICIENT



          Clustering coefficient: what portion of your
          neighbors are connected?
            Node i with degree ki

                                                     3            6
                                           1
                                                                                8
                                                         5
                                                     4
                                           2                 7
                                     10
                                                9



       i=4: k4=4, e4=2, TOTAL=4*3/2=6  C4=2/6=1/3



                                                         Network Science: Graph Theory January 24, 2011
KEY MEASURES



       Degree distribution:          P(k)

       Path length:              l

       Clustering coefficient:




                                            Network Science: Graph Theory January 24, 2011

Mais conteúdo relacionado

Mais procurados

Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisSujoy Bag
 
Data Analytics with R and SQL Server
Data Analytics with R and SQL ServerData Analytics with R and SQL Server
Data Analytics with R and SQL ServerStéphane Fréchette
 
Lecture 3 Computer Science Research SEM1 22_23 (1).pptx
Lecture 3 Computer Science Research SEM1 22_23 (1).pptxLecture 3 Computer Science Research SEM1 22_23 (1).pptx
Lecture 3 Computer Science Research SEM1 22_23 (1).pptxNabilaHassan13
 
5.5 graph mining
5.5 graph mining5.5 graph mining
5.5 graph miningKrish_ver2
 
Network centrality measures and their effectiveness
Network centrality measures and their effectivenessNetwork centrality measures and their effectiveness
Network centrality measures and their effectivenessemapesce
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data ScienceDataWorks Summit
 
Software Design and Modularity
Software Design and ModularitySoftware Design and Modularity
Software Design and ModularityDanyal Ahmad
 
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...CloudxLab
 
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...Mahantesh Angadi
 
In-Memory Big Data Analytics
In-Memory Big Data AnalyticsIn-Memory Big Data Analytics
In-Memory Big Data AnalyticsSupreeth M P
 
AWS Forcecast: DeepAR Predictor Time-series
AWS Forcecast: DeepAR Predictor Time-series AWS Forcecast: DeepAR Predictor Time-series
AWS Forcecast: DeepAR Predictor Time-series PolarSeven Pty Ltd
 
Recurrent Convolutional Neural Networks for Text Classification
Recurrent Convolutional Neural Networks for Text ClassificationRecurrent Convolutional Neural Networks for Text Classification
Recurrent Convolutional Neural Networks for Text ClassificationShuangshuang Zhou
 
Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewSatyaki Sikdar
 

Mais procurados (20)

Data Analysis in Python
Data Analysis in PythonData Analysis in Python
Data Analysis in Python
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Data Analytics with R and SQL Server
Data Analytics with R and SQL ServerData Analytics with R and SQL Server
Data Analytics with R and SQL Server
 
Apache hadoop hbase
Apache hadoop hbaseApache hadoop hbase
Apache hadoop hbase
 
Lecture 3 Computer Science Research SEM1 22_23 (1).pptx
Lecture 3 Computer Science Research SEM1 22_23 (1).pptxLecture 3 Computer Science Research SEM1 22_23 (1).pptx
Lecture 3 Computer Science Research SEM1 22_23 (1).pptx
 
5.5 graph mining
5.5 graph mining5.5 graph mining
5.5 graph mining
 
Network centrality measures and their effectiveness
Network centrality measures and their effectivenessNetwork centrality measures and their effectiveness
Network centrality measures and their effectiveness
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
Software Design and Modularity
Software Design and ModularitySoftware Design and Modularity
Software Design and Modularity
 
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
Introduction to MapReduce - Hadoop Streaming | Big Data Hadoop Spark Tutorial...
 
Topic Modeling
Topic ModelingTopic Modeling
Topic Modeling
 
Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...
Introduction and Overview of BigData, Hadoop, Distributed Computing - BigData...
 
Ppt
PptPpt
Ppt
 
In-Memory Big Data Analytics
In-Memory Big Data AnalyticsIn-Memory Big Data Analytics
In-Memory Big Data Analytics
 
AWS Forcecast: DeepAR Predictor Time-series
AWS Forcecast: DeepAR Predictor Time-series AWS Forcecast: DeepAR Predictor Time-series
AWS Forcecast: DeepAR Predictor Time-series
 
Chapter1
Chapter1Chapter1
Chapter1
 
Recurrent Convolutional Neural Networks for Text Classification
Recurrent Convolutional Neural Networks for Text ClassificationRecurrent Convolutional Neural Networks for Text Classification
Recurrent Convolutional Neural Networks for Text Classification
 
Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief Overview
 
Software documentation
Software documentationSoftware documentation
Software documentation
 

Semelhante a Network Science: The Science of the 21st Century

Complexity Play&Learn
Complexity Play&LearnComplexity Play&Learn
Complexity Play&LearnMassimo Conte
 
Network analysis in the social sciences
Network analysis in the social sciencesNetwork analysis in the social sciences
Network analysis in the social sciencesJorge Pacheco
 
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...Marc Smith
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Xiaohan Zeng
 
20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...Marc Smith
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network AnalysisMarc Smith
 
Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachElena Sügis
 
Networks and epidemiology - an update
Networks and epidemiology - an updateNetworks and epidemiology - an update
Networks and epidemiology - an updateMarco Pautasso
 
Predicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networksPredicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networksAnvardh Nanduri
 
2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formatted2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formattedMarc Smith
 
20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...Marc Smith
 
Networks Navigability: Theory and Applications
Networks Navigability: Theory and ApplicationsNetworks Navigability: Theory and Applications
Networks Navigability: Theory and ApplicationsChristoph Trattner
 
01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)Duke Network Analysis Center
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measuresdnac
 

Semelhante a Network Science: The Science of the 21st Century (17)

Complexity Play&Learn
Complexity Play&LearnComplexity Play&Learn
Complexity Play&Learn
 
Network analysis in the social sciences
Network analysis in the social sciencesNetwork analysis in the social sciences
Network analysis in the social sciences
 
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
 
Fbk Seminar Michela Ferron
Fbk Seminar Michela FerronFbk Seminar Michela Ferron
Fbk Seminar Michela Ferron
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
 
20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...
 
Book chapter 2
Book chapter 2Book chapter 2
Book chapter 2
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis
 
Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approach
 
Networks and epidemiology - an update
Networks and epidemiology - an updateNetworks and epidemiology - an update
Networks and epidemiology - an update
 
Predicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networksPredicting_new_friendships_in_social_networks
Predicting_new_friendships_in_social_networks
 
2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formatted2013 passbac-marc smith-node xl-sna-social media-formatted
2013 passbac-marc smith-node xl-sna-social media-formatted
 
20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...
 
Networks Navigability: Theory and Applications
Networks Navigability: Theory and ApplicationsNetworks Navigability: Theory and Applications
Networks Navigability: Theory and Applications
 
01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 
SN- Lecture 9
SN- Lecture 9SN- Lecture 9
SN- Lecture 9
 

Último

Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfPrerana Jadhav
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQuiz Club NITW
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxMichelleTuguinay1
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...Nguyen Thanh Tu Collection
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 

Último (20)

Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Narcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdfNarcotic and Non Narcotic Analgesic..pdf
Narcotic and Non Narcotic Analgesic..pdf
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITWQ-Factor General Quiz-7th April 2024, Quiz Club NITW
Q-Factor General Quiz-7th April 2024, Quiz Club NITW
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 

Network Science: The Science of the 21st Century

  • 1.
  • 2. NETWORK SCIENCE The science of the 21st century Times cited Years Network Science: Introduction January 10, 2011
  • 3. NETWORK SCIENCE The science of the 21st century Times cited Years Network Science: Introduction January 10, 2011
  • 4. NETWORK SCIENCE The science of the 21st century Times cited Years Network Science: Introduction January 10, 2011
  • 5. NETWORK SCIENCE The science of the 21st century Why now? Network Science: Introduction January 10, 2011
  • 6. THE EMERGENCE OF NETWORK SCIENCE Data Availability: Movie Actor Network, 1998; World Wide Web, 1999. C elegans neural wiring diagram 1990 Citation Network, 1998 Metabolic Network, 2000; PPI network, 2001 Universality: The architecture of networks emerging in various domains of science, nature, and technology are more similar to each other than one would have expected. The (urgent) need to Despite the challenges complex systems offer us, we cannot afford to not address their behavior, a view understand complexity: increasingly shared both by scientists and policy makers. Networks are not only essential for this journey, but during the past decade some of the most important advances towards understanding complexity were provided in context of network theory. Network Science: Introduction January 10, 2011
  • 7. EPIDEMIC FORECAST Predicting the H1N1 pandemic Real Projected Network Science: Introduction January 10, 2011
  • 8. DOCUMENTARY Thex Network Science: Introduction January 10, 2011
  • 9. Graph theory and basic terminology Learning the language Network Science: Graph Theory January 24, 2011
  • 10. COMPONENTS OF A COMPLEX SYSTEM components: nodes, vertices N interactions: links, edges L system: network, graph (N,L) Network Science: Graph Theory January 24, 2011
  • 11. NETWORKS OR GRAPHS? network often refers to real systems •www, •social network •metabolic network. Language: (Network, node, link) graph: mathematical representation of a network •web graph, •social graph (a Facebook term) Language: (Graph, vertex, edge) We will try to make this distinction whenever it is appropriate, but in most cases we will use the two terms interchangeably. Network Science: Graph Theory January 24, 2011
  • 12. A COMMON LANGUAGE friend Movie 1 co-worker Peter Mary Actor 1 Actor 2 Albert Movie 3 Actor 4 brothers friend Movie 2 Albert Actor 3 Protein 1 Protein 2 Protein 5 Protein 9 N=4 L=4 Network Science: Graph Theory January 24, 2011
  • 13. CHOOSING A PROPER REPRESENTATION The choice of the proper network representation determines our ability to use network theory successfully. In some cases there is a unique, unambiguous representation. In other cases, the representation is by no means unique. For example, for a group of individuals, the way you assign the links will determine the nature of the question you can study. Network Science: Graph Theory January 24, 2011
  • 14. CHOOSING A PROPER REPRESENTATION If you connect individuals that work with each other, you will explore the professional network. Network Science: Graph Theory January 24, 2011
  • 15. CHOOSING A PROPER REPRESENTATION If you connect those that have a sexual relationship, you will be exploring the sexual networks. Network Science: Graph Theory January 24, 2011
  • 16. CHOOSING A PROPER REPRESENTATION If you connect individuals based on their first name (all Peters connected to each other), you will be exploring what? It is a network, nevertheless. Network Science: Graph Theory January 24, 2011
  • 17. Network Science: Graph Theory January 24, 2011
  • 18. UNDIRECTED VS. DIRECTED NETWORKS Undirected Directed Links: undirected (symmetrical) Links: directed (arcs). Graph: Digraph = directed graph: L A D M B An undirected F C link is the I superposition of D two opposite directed links. B G E G H A C F Undirected links : Directed links : coauthorship links URLs on the www Actor network phone calls protein interactions metabolic reactions Network Science: Graph Theory January 24, 2011
  • 19. ADJACENCY MATRIX 4 4 3 3 2 2 1 1 Aij=1 if there is a link between node i and j Aij=0 if nodes i and j are not connected to each other. Note that for a directed graph (right) the matrix is not symmetric. Network Science: Graph Theory January 24, 2011
  • 20. ADJACENCY MATRIX a e a bcdefgh a 0 1 0 0 1 0 1 0 b 1 0 1 0 0 0 0 1 c 0 1 0 1 0 1 1 0 d 0 0 1 0 1 0 0 0 h b d e 1 0 0 1 0 0 0 0 f0 0 1 0 0 0 1 0 g1 0 1 0 0 0 0 0 h0 1 0 0 0 0 0 0 f g c Network Science: Graph Theory January 24, 2011
  • 21. NODE DEGREES Node degree: the number of links connected to the node. Undirected j i 4 3 2 1 Network Science: Graph Theory January 24, 2011
  • 22. NODE DEGREES In directed networks we can define an in-degree and out-degree. D The (total) degree is the sum of in- and out-degree. B C Directed kC  2 in kC  1 out kC  3 E G A F Source: a node with kin= 0; Sink: a node withkout= 0. 4 3 2 1
  • 23. A BIT OF STATISTICS We have a sample of values x1, ..., xN Average(a.k.a. mean): typical value <x> = (x1 + x1 + ... + xN)/N = Σi xi /N Standard deviation:fluctuations around typical value σx= √Σi (xi - <x>)2/N Network Science: Graph Theory January 24, 2011
  • 24. AVERAGE DEGREE N 1 k Undirected j k  i N i 1 i N – the number of nodes in the graph N N 1 1 k , k  D   k iout , k in  k out B in in out C k i N i 1 N i 1 Directed E A F Network Science: Graph Theory January 24, 2011
  • 25. COMPLETE GRAPH The maximum number of links a network of N nodes can have is: A graph with degree L=Lmaxis called a complete graph, and its average degree is <k>=N-1 Network Science: Graph Theory January 24, 2011
  • 26. SPARSE GRAPH Most networks observed in real systems are sparse: L <<Lmax (or <k><<N-1). WWW (ND Sample): N=325,729; <k>=4.51 Protein (S. Cerevisiae): N=1870; <k>=2.39 Coauthorship (Math): N=70 975; <k>=3.9 Movie Actors: N=212 250; <k>=28.78 (Source: Albert, Barabasi, RMP2002) Consequence: Their adjacency matrix is filled with zeros! Network Science: Graph Theory January 24, 2011
  • 27. ACTOR NETWORK Austin Powers: Let’s make The spy who it legal shagged me Robert Wagner Wild Things What Price Glory Barry Norton A Few Monsieur Good Men Verdoux
  • 28. ACTOR NETWORK Nodes: actors Links: cast jointly Days of Thunder (1990) Far and Away (1992) Eyes Wide Shut (1999) N = 212,250 actors k=28.78 Network Science: Graph Theory January 24, 2011
  • 29. IMBD SCALE FREE Network Science: Graph Theory January 24, 2011
  • 30. Network Science: Graph Theory January 24, 2011
  • 31. GRAPHOLOGY 1 Undirected Directed 4 4 1 1 2 2 3 3 Actor network, protein-protein interactions WWW, citation networks Network Science: Graph Theory January 24, 2011
  • 32. GRAPHOLOGY 2 Unweighted Weighted (undirected) 4 (undirected) 4 1 1 2 2 3 3 protein-protein interactions, www Call Graph, metabolic networks Network Science: Graph Theory January 24, 2011
  • 33. GRAPHOLOGY 3 Self-interactions Multigraph (undirected) 4 4 1 1 2 2 3 3 Protein interaction network, www Social networks, collaboration networks Network Science: Graph Theory January 24, 2011
  • 34. GRAPHOLOGY 4 Complete Graph (undirected) 4 1 2 3 Actor network, protein-protein interactions Network Science: Graph Theory January 24, 2011
  • 35. GRAPHOLOGY X WWW >>directed multigraph with self-interactions Protein Interactions >>undirected unweighted with self-interactions Protein Complex >>unweighted complete graph with self-interactions Collaboration network >>undirected multigraph or weighted. Mobile phone calls >>directed, weighted. Facebook Friendship links >>undirected, unweighted. Network Science: Graph Theory January 24, 2011
  • 36. BIPARTITE GRAPHS bipartite graph(or bigraph) is a graph whose nodes can be divided into two disjoint setsU and V such that every link connects a node in U to one in V; that is, U and V are independent sets. Examples: Hollywood actor network Collaboration networks Disease network (diseasome) Network Science: Graph Theory January 24, 2011
  • 37. GENE NETWORK – DISEASE NETWORK DISEASOME PHENOME GENOME Gene network Disease network Goh, Cusick, Valle, Childs, Vidal &Barabási, PNAS (2007) Network Science: Graph Theory January 24, 2011
  • 39. Network Science: Graph Theory January 24, 2011
  • 40. STATISTICS REMINDER We have a sample of values x1, ..., xN Distribution of x (a.k.a. PDF): probability that a randomly chosen value is x P(x) = (# values x) / N ΣiP(xi) = 1 always! Histograms >>> Network Science: Graph Theory January 24, 2011
  • 41. DEGREE DISTRIBUTION Degree distributionP(k): probability that a randomly chosen vertex has degree k Nk = # nodes with degree k P(k) = Nk / N  plot P(k) 0.6 0.5 0.4 0.3 0.2 0.1 1 2 3 4 k Network Science: Graph Theory January 24, 2011
  • 42. DEGREE DISTRIBUTION discrete representation: pkis the probability that a node has degree k. continuum description: P(k) is the pdf of the degrees, where represents the probability that a node’s degree is between k1 and k2. Normalization condition: , where Kmin is the minimal degree in the network. Network Science: Graph Theory January 24, 2011
  • 43. Network Science: Graph Theory January 24, 2011
  • 44. PATHS A path is a sequence of nodes in which each node is adjacent to the next one Pi0,in of length nbetween nodes i0 and in is an ordered collection of n+1 nodes and n links B A •A path can intersect itself and pass through the same link repeatedly. Each time a link is crossed, it is counted E separately C •A legitimate path on the graph on the right: D ABCBCADEEBA •In a directed network, the path can follow only the direction of an arrow. Network Science: Graph Theory January 24, 2011
  • 45. NUMBER OF PATHS BETWEEN TWO NODES Adjacency Matrix Nij,number of paths between any two nodes i and j: Length n=1:If there is a link between i and j, then Aij=1 and Aij=0 otherwise. Length n=2:If there is a path of length two between i and j, then AikAkj=1, and AikAkj=0 otherwise. The number of paths of length 2: Length n: In general, if there is a path of length n between i and j, then Aik…Alj=1 and Aik…Alj=0 otherwise. The number of paths of length n between i and j is* *holds for both directed and undirected networks. Network Science: Graph Theory January 24, 2011
  • 46. DISTANCE IN A GRAPH Shortest Path, Geodesic Path B The distance (shortest path, geodesic path) between two nodes is defined as the number of edges along the shortest A path connecting them. *If the two nodes are disconnected, the distance is infinity. C D In directed graphs each path needs to follow the direction of the arrows. B Thus in a digraph the distance from node A to B (on an AB A path) is generally different from the distance from node B to A (on a BCA path). C D Network Science: Graph Theory January 24, 2011
  • 47. FINDING DISTANCES: BREADTH FIRST SEATCH Distance between node1and node 4: 1.Start at1. 1 Network Science: Graph Theory January 24, 2011
  • 48. FINDING DISTANCES: BREADTH FIRST SEATCH Distance between node1and node 4: 1.Start at1. 2.Find the nodes adjacent to1. Mark them as at distance 1. Put them in a queue. 1 1 1 1 Network Science: Graph Theory January 24, 2011
  • 49. FINDING DISTANCES: BREADTH FIRST SEATCH Distance between node1and node 4: 1.Start at1. 2.Find the nodes adjacent to1. Mark them as at distance 1. Put them in a queue. 3.Take the first node out of the queue. Find the unmarked nodes adjacent to it in the graph. Mark them with the label of 2. Put them in the queue. 2 2 1 1 1 2 1 2 2 Network Science: Graph Theory January 24, 2011
  • 50. FINDING DISTANCES: BREADTH FIRST SEARCH Distance between node1and node 4: 1.Start at1. 2.Find the nodes adjacent to1. Mark them as at distance 1. Put them in a queue. 3.Take the first node out of the queue. Find the unmarked nodes adjacent to it in the graph. Mark them with the label of 2. Put them in the queue. 4.… 5.Take the first node, w, out of the queue. Find the unmarked nodes adjacent to it in the graph. Mark them with the label of w+1. Put them in the queue. 2 w w 2 1 1 1 w+1 w+1 2 Network Science: Graph Theory January 24, 2011
  • 51. FINDING DISTANCES: BREADTH FIRST SEATCH Distance between node1and node 4: 1.Repeat until you find node 4 or there are no more nodes in the queue. 2.The distance between1and4is the label of4or, if4does not have a label, infinity. 3 4 3 2 4 3 3 2 1 1 1 4 2 3 3 4 1 4 4 3 4 4 2 2 3 Network Science: Graph Theory January 24, 2011
  • 52. FINDING DISTANCES: BREADTH FIRST SEATCH ALGORITHM For a weighted network, we have Dijkstra’s algorithm. http://en.wikipedia.org/wiki/Dijkstra%27s_algorithm Network Science: Graph Theory January 24, 2011 http://www.yaldex.com/games-programming/0672323699_ch12lev1sec7.html
  • 53. RECORDING DISTANCES B A B C D A A 0 lABlAClAD B lBA 0 lBClBD Fill out the matrix C lCAlCB 0 lCD D lDAlDBlDC 0 C D Q: How many entries will you need for an N- node graph? B A A: N(N-1) in a digraph, N(N-1)/2 in a symmetrical graph. Let’s use the notation C D Network Science: Graph Theory January 24, 2011
  • 54. NETWORK DIAMETER AND AVERAGE DISTANCE Diameter: the maximum distance between any pair of nodes in the graph. Average path length/distance for a connected graph (component) or a strongly connected (component of a) digraph. where lij is the distance from node i to node j In an undirected (symmetrical) graph lij =lji, we only need to count them once Network Science: Graph Theory January 24, 2011
  • 55. CONNECTIVITY OF UNDIRECTED GRAPHS Connected (undirected) graph: any two vertices can be joined by a path. A disconnected graph is made up by two or more connected components. B B A A Largest Component: Giant Component C D F C D F F F The rest: Isolates G G Bridge: if we erase it, the graph becomes disconnected. Network Science: Graph Theory January 24, 2011
  • 56. CONNECTIVITY OF UNDIRECTED GRAPHS Adjacency Matrix The adjacency matrix of a network with several components can be written in a block- diagonal form, so that nonzero elements are confined to squares, with all other elements being zero: Figure after Newman, 2010 Network Science: Graph Theory January 24, 2011
  • 57. CONNECTIVITY OF DIRECTED GRAPHS Strongly connected directed graph: has a path from each node to every other node and vice versa (e.g. AB path and BA path). Weakly connected directed graph: it is connected if we disregard the edge directions. Strongly connected components can be identified, but not every node is part of a nontrivial strongly connected component. B E A F B A D E C D C G F G In-component: nodes that can reach the scc, Out-component: nodes that can be reached from the scc. Network Science: Graph Theory January 24, 2011
  • 58. HISTORICAL DETOUR: THE BRIDGES OF KONIGSBERG Can one walk across the seven bridges and never cross the same bridge twice? Euler circuit: return to the starting point by traveling each link of the graph once and only once. http://maps.google.com/maps?oe=utf- 8&client=firefox- a&q=kaliningrad&ie=UTF8&hq=&hnear=Kalining rad,+Kaliningrad+Oblast,+Russia&gl=us&ll=54.70 Euler’s theorem: 7293,20.510788&spn=0.009248,0.025878&t=h& z=15 (a) If a graph has nodes of odd degree, it cannot have an Euler circuit. (b) If a graph is connected and has no odd degree nodes, it has at least one Euler circuit. How would we need to modify the graph so it has an Euler circuit? Network Science: Graph Theory January 24, 2011
  • 59. EULERIAN GRAPH Every vertex of this graph has an even degree, therefore this is an Eulerian graph. Following the edges in alphabetical order gives an Eulerian circuit/cycle. http://en.wikipedia.org/wiki/Euler_circuit Network Science: Graph Theory January 24, 2011
  • 60. EULER CIRCUITS IN DIRECTED GRAPHS B If a digraph is strongly connected and the in- degree of each node is equal to its out-degree, D then there is an Euler circuit A C Q: Give one possible Euler circuit E F G Otherwise there is no Euler circuit. This is because in a circuit we need to enter each node as many times as we leave it. Network Science: Graph Theory January 24, 2011
  • 61. CLUSTERING COEFFICIENT Clustering coefficient: what portion of your neighbors are connected? Node i with degree ki Ciin [0,1] Network Science: Graph Theory January 24, 2011
  • 62. CLUSTERING COEFFICIENT Clustering coefficient: what portion of your neighbors are connected? Node i with degree ki 3 6 1 8 5 4 2 7 10 9 i=8: k8=2, e8=1, TOT=2*1/2=1  C8=1/1=1 Network Science: Graph Theory January 24, 2011
  • 63. CLUSTERING COEFFICIENT Clustering coefficient: what portion of your neighbors are connected? Node i with degree ki 3 6 1 8 5 4 2 7 10 9 i=4: k4=4, e4=2, TOTAL=4*3/2=6  C4=2/6=1/3 Network Science: Graph Theory January 24, 2011
  • 64. KEY MEASURES Degree distribution: P(k) Path length: l Clustering coefficient: Network Science: Graph Theory January 24, 2011

Notas do Editor

  1. The adjacency matrix can take far more complicated forms for a larger network….
  2. Erdos can be also connected to Kevin Bacon. Erdos plaid with Gene Paterson, in N is a Number (1993).Who played with Sam Rockwell (Box of Moonlight, 1996).Who palyed with Kevin Bacon in Frost/Nixon (2008)What is my Bacon number, what do you think?recent documentary that was eared on Discovery channel, called Connected (2009).
  3. let us get a feeling of how a sparse networks looks like...
  4. The distance between a node and itself can be taken as zero, and the average distance can be taken over N^2.Leave the two matrices on the blackboard.
  5. The first graph theory paper was published in 1736, written by Leonhard Euler a Swiss born mathematician who spent his career in Berlin and St. Petersburg, and who had an extraordinary influence on all areas of mathematics, physics and engineering. It addressed an amusing problem which originated in Königsberg, a town not too far from Euler’s home in St. Petersburg. Königsberg, a flowering city in Eastern Prussia, was a thriving city on the banks of the Pregel, with a busy fleet of ships and their trade offered a comfortable life to the local merchants and their families. The healthy economy allowed city officials to build not fewer than seven bridges across the river. Most of these connected the elegant island Kneiphof, which was caught between the two branches of the Pregel. Two additional bridges crossed the two branches of the river (Figure 1). The people of Königsberg, amused themselves with mind puzzles, one of which was: “Can one walk across the seven bridges and never cross the same one twice?” Euler offered a rigorous mathematical proof that with the seven bridges such a path does not exist.  Nevertheless, it is not the proof that made history, but rather the intermediate step that he took to solve the problem. Euler’s great insight decided to view Königsberg’s bridges as a graph, the collection of nodes connected by links. For this he used nodes to represent each of the four land areas separated by the river, distinguishing them with letters A, B, C, and D. Next he called the bridges the links, and connected with lines those pieces of land that had a bridge between them. He thus obtained a graph, whose nodes were pieces of land and links were bridges. Euler’s proof that in Königsberg there is no path crossing all seven bridges only once was based on a simple observation. Nodes with odd number of links must be either the starting or the end point of the journey. A continuous path that goes through all bridges can have only one starting and one end point. Thus, such a path cannot exist on a graph that has more than two nodes with an odd number of links. As the Königsberg graph had three such nodes, one could not find the desired path. For our purpose the most important aspect of Euler’s proof is that the existence of the path does not depend on our ingenuity to find it. Rather, it is a property of the graph. Given the layout of the Königsberg bridges, no matter how smart we are, we will never succeed at finding the desired path. The people of Königsberg finally agreed with Euler, gave up their fruitless search and in 1875 they built a new bridge between B and C, increasing the number of links of these two nodes to four. Now only one node (D) with an odd number of links remained. It was then rather straightforward to find the desired path. Perhaps the creation of this path was the hidden rationale behind building the bridge? In retrospect, Euler’s unintended message is very simple: graphs or networks have properties, hidden in their construction, that limit or enhance our ability to do things with or on them.