SlideShare uma empresa Scribd logo
1 de 86
Baixar para ler offline
Masterizing PHP Data Structure 102
          Patrick Allaert




      PHPUK 2012 Conference
About me
●   Patrick Allaert
●   Founder of Libereco
●   Playing with PHP/Linux for +10 years
●   eZ Publish core developer
●   Author of the APM PHP extension
●   @patrick_allaert
●   patrickallaert@php.net
●   http://github.com/patrickallaert/
●   http://patrickallaert.blogspot.com/
APM
APM
Masterizing =
 Mastering +
      Rising
PHP native datatypes
●   NULL (IS_NULL)
●   Booleans (IS_BOOL)
●   Integers (IS_LONG)
●   Floating point numbers
    (IS_DOUBLE)
●   Strings (IS_STRING)
●   Arrays (IS_ARRAY,
    IS_CONSTANT_ARRAY)
●   Objects (IS_OBJECT)
●   Resources (IS_RESOURCE)
●   Callable (IS_CALLABLE)
Wikipedia datatypes
●
    2-3-4 tree             ●   Bit field           ●
                                                       Directed acyclic      ●   Hashed array tree Map/Associative
                                                                                                    ●                       ●
                                                                                                                                Rapidly-exploring      ●
                                                                                                                                                           Suffix tree
                                                       graph                                            array/Dictionary        random tree                Symbol table
●
    2-3 heap               ●
                               Bitmap                                            Hash list                                                             ●
                                                                             ●
                                                       Directed graph                                   Matrix                  Record (also called        Syntax tree
    2-3 tree                   BK-tree             ●                                                                        ●                          ●

                                                                                 Hash table
                                                                                                    ●
●
                           ●
                                                       Disjoint-set                                                             tuple or struct)           Tagged union (variant
                                                                             ●
                                                                                                        Metric tree
    AA tree
                                                                                                                                                       ●

                               Bloom filter                                                                                                                record, discriminated
                                                   ●                                                ●
●
                                                       Distributed hash          Hash tree                                      Red-black tree
                           ●
                                                                             ●                          Minimax tree        ●
                                                                                                                                                           union, disjoint union)
    Abstract syntax tree                           ●                                                ●
●
                           ●   Boolean                 table                     Hash trie              Min/max kd-tree     ●
                                                                                                                                Rope                       Tango tree
    (a,b)-tree
                                                                             ●                      ●                                                  ●
●
                               Bounding interval       Double                                                                   Routing table              Ternary heap
                           ●                       ●
                                                                                 Heap                   M-tree              ●

    Adaptive k-d tree
                                                                                                                                                       ●

                               hierarchy                                     ●                      ●
●                                                      Doubly connected                                                         R-tree                     Ternary search tree
                                                   ●
                                                                                 Heightmap              Multigraph          ●                          ●


●
    Adjacency list             B sharp tree            edge list             ●
                                                                                                    ●
                                                                                                                                R* tree                    Threaded binary tree
                           ●
                                                                                                        Multimap            ●                          ●


●
    Adjacency matrix           BSP tree            ●
                                                       Doubly linked list    ●   Hilbert R-tree     ●
                                                                                                                                R+ tree                    Top tree
                                                                                                        Multiset
                                                                                                                                                       ●
                           ●                                                                                                ●

    AF-heap                                            Dynamic array                                                                                       Treap
                                                                                 Hypergraph
                                                                                                    ●

                               B-tree                                                                                           Scapegoat tree         ●

                                                                                                        Octree
●                                                  ●                         ●                                              ●
                           ●
                                                                                                                                                           Tree
    Alternating decision                               Enfilade                                     ●
                                                                                                                                Scene graph
                                                                                 Iliffe vector
                                                                                                                                                       ●
●

    tree                   ●
                               B*-tree             ●
                                                                             ●                          Pagoda              ●
                                                                                                                                                           Trees
                                                       Enumerated type                              ●
                                                                                                                                Segment tree
                                                                                                                                                       ●


                               B+ tree
                                                   ●
                                                                                 Image                  Pairing heap        ●                              Trie
    And-inverter graph     ●
                                                       Expectiminimax tree
                                                                             ●                      ●                                                  ●
●
                                                                                                                                Self-balancing             T-tree
                                                                                 Implicit kd-tree       Parallel array
                                                   ●

    And–or tree            ●
                               B-trie                  Exponential tree      ●                      ●
                                                                                                                            ●

                                                                                                                                binary search tree
                                                                                                                                                       ●

●
                                                                                                                                                           UB-tree
                                                                                                        Parse tree
                                                   ●


●
    Array                  ●
                               Bx-tree                 Fenwick tree          ●   Interval tree      ●
                                                                                                                            ●
                                                                                                                                Self-organizing list
                                                                                                                                                       ●

                                                                                                                                                           Union
                                                                                                        Plain old data
                                                   ●
                                                                                                                                                       ●

    AVL tree                   Cartesian tree          Fibonacci heap            Int                ●
                                                                                                                                Set                        Unrolled linked list
●                          ●                       ●                         ●
                                                                                                        structure           ●                          ●



●
    Beap                   ●
                               Char                ●
                                                       Finger tree           ●   Judy array             Prefix hash tree    ●
                                                                                                                                Skew heap              ●
                                                                                                                                                           Van Emde Boas tree
                                                                                                    ●
                                                                                                                                Skip list                  Variable-length array
●
    Bidirectional map      ●
                               Circular buffer     ●
                                                       Float                     Kdb tree               Priority queue      ●
                                                                                                                                                       ●

                                                                             ●                      ●
                                                                                                                                                           VList
    Bin                        Compressed suffix FM-index                                                                       Soft heap              ●
●                          ●                       ●
                                                                                 Kd-tree            ●
                                                                                                        Propositional       ●
                                                                                                                                                           VP-tree
                                                                             ●
    Binary decision            array             Fusion tree                                            directed acyclic        Sorted array           ●

●                                                  ●
                                                                                 Koorde
                                                                                                                            ●
                                                                                                                                                           Weight-balanced tree
    diagram                                            Gap buffer                                       graph                   Spaghetti stack
                                                                                                                                                       ●

                           ●
                               Container           ●
                                                                             ●
                                                                                                                            ●
                                                                                                                                                           Winged edge
    Binary heap                                        Generalised suffix        Leftist heap           Quad-edge               Sparse array
                                                                                                                                                       ●

●
                           ●   Control table       ●
                                                                             ●                      ●
                                                                                                                            ●                              X-fast trie
                                                                                                        Quadtree
                                                                                                                                                       ●

    Binary search tree                                 tree                      Lightmap                                       Sparse matrix              Xor linked list
                               Cover tree
                                                                                                    ●
●                                                                            ●                                              ●                          ●


    Binary tree
                           ●
                                                       Graph                                            Queap                   Splay tree                 X-tree
●
                               Ctrie
                                                   ●
                                                                             ●   Linear octree      ●
                                                                                                                            ●                          ●


    Binomial heap
                           ●
                                                       Graph-structured                                 Queue                   SPQR-tree                  Y-fast trie
●
                               Dancing tree
                                                   ●

                                                       stack                     Link/cut tree      ●
                                                                                                                            ●
                                                                                                                                                       ●
                                                                             ●
                                                                                                        Radix tree                                         Zero suppressed
                                                                                                                                Stack
                           ●
    Bit array                                                                                       ●                                                  ●
●
                               D-ary heap              Hash                  ●   Linked list            Randomized binary
                                                                                                                            ●
                                                                                                                                                           decision diagram
    Bitboard
                           ●                       ●
                                                                                                    ●                           String                     Zipper
                                                       Hash array mapped         Lookup table           search tree
●                                                                                                                           ●
                               Decision tree       ●                         ●
                                                                                                                                Suffix array
                                                                                                                                                       ●
                           ●
                                                       trie                                                                                                Z-order
                               Deque                                                                ●
                                                                                                        Range tree          ●                          ●

                           ●
Game:
Can you recognize some structures?
Array: PHP's untruthfulness
PHP “Arrays” are not true Arrays!
Array: PHP's untruthfulness
PHP “Arrays” are not true Arrays!
An array typically looks like this:



                 0      1      2      3      4      5


                Data   Data   Data   Data   Data   Data
Array: PHP's untruthfulness
PHP “Arrays” can dynamically grow and be iterated
both directions (reset(), next(), prev(), end()),
exclusively with O(1) operations.
Array: PHP's untruthfulness
PHP “Arrays” can dynamically grow and be iterated
both directions (reset(), next(), prev(), end()),
exclusively with O(1) operations.
Let's have a Doubly Linked List (DLL):
               Head                      Tail

       Data      Data      Data     Data        Data



Enables List, Deque, Queue and Stack
implementations
Array: PHP's untruthfulness
PHP “Arrays” elements are always accessible using a
key (index).
Array: PHP's untruthfulness
PHP “Arrays” elements are always accessible using a
key (index).
Let's have an Hash Table:
      Head                              Bucket pointers array                                       Tail
                  0        1        2        3            4    5            ...   nTableSize -1
               Bucket * Bucket * Bucket * Bucket * Bucket * Bucket *                Bucket *




              Bucket           Bucket            Bucket            Bucket              Bucket




       Data               Data                   Data                  Data                       Data
Array: PHP's untruthfulness
http://php.net/manual/en/language.types.array.php:


        “This type is optimized for several
        different uses; it can be treated as an
        array, list (vector), hash table (an
        implementation of a map),
        dictionary, collection, stack, queue,
        and probably more.”
Optimized for anything ≈ Optimized for nothing!
Array: PHP's untruthfulness
●   In C: 100 000 integers (using long on 64bits => 8
    bytes) can be stored in 0.76 Mb.
●   In PHP: it will take ≅ 13.97 Mb!
●   A PHP variable (containing an integer) takes 48
    bytes.
●   The overhead of buckets for every “array” entries is
    about 96 bytes.
●   More details:
    http://nikic.github.com/2011/12/12/How-big-are-PHP-arrays-really-Hint-BIG.html
Data Structure
Structs (or records, tuples,...)
Structs (or records, tuples,...)
●   A struct is a value containing other values which
    are typically accessed using a name.
●   Example:
    Person => firstName / lastName
    ComplexNumber => realPart / imaginaryPart
Structs – Using array
$person = array(
    "firstName" => "Patrick",
    "lastName" => "Allaert"
);
Structs – Using a class
$person = new PersonStruct(
    "Patrick", "Allaert"
);
Structs – Using a class
              (Implementation)
class PersonStruct
{
    public $firstName;
    public $lastName;

    public function __construct($firstName, $lastName)
    {
        $this->firstName = $firstName;
        $this->lastName = $lastName;
    }
}
Structs – Using a class
                (Implementation)
class PersonStruct
{
    public $firstName;
    public $lastName;

    public function __construct($firstName, $lastName)
    {
        $this->firstName = $firstName;
        $this->lastName = $lastName;
    }

    public   function __set($key, $value)
    {
        //   a. Do nothing
        //   b. trigger_error()
        //   c. Throws an exception
    }
}
Structs – Pros and Cons
              Array                            Class
+ Uses less memory (PHP < 5.4)   - Uses more memory (PHP < 5.4)
- Uses more memory (PHP = 5.4) + Uses less memory (PHP = 5.4)
- No type hinting                + Type hinting possible
- Flexible structure             + Rigid structure
+|- Less OO                      +|- More OO
+ Slightly faster                - Slightly slower
(true) Arrays
(true) Arrays
●   An array is a fixed size collection where elements
    are each identified by a numeric index.
(true) Arrays
●   An array is a fixed size collection where elements
    are each identified by a numeric index.

                 0      1      2      3      4      5


                Data   Data   Data   Data   Data   Data
(true) Arrays – Using SplFixedArray
$array = new SplFixedArray(3);
$array[0] = 1; // or $array->offsetSet()
$array[1] = 2; // or $array->offsetSet()
$array[2] = 3; // or $array->offsetSet()
$array[0]; // gives 1
$array[1]; // gives 2
$array[2]; // gives 3
(true) Arrays – Pros and Cons
              Array         SplFixedArray
- Uses more memory    + Uses less memory
+|- Less OO           +|- More OO
Queues
Queues
●   A queue is an ordered collection respecting First
    In, First Out (FIFO) order.
●   Elements are inserted at one end and removed at
    the other.
Queues
●   A queue is an ordered collection respecting First
    In, First Out (FIFO) order.
●   Elements are inserted at one end and removed at
    the other.

          Data

                                                           Dequeue
                 Data   Data   Data   Data   Data   Data
       Enqueue

                                                            Data
Queues – Using array
$queue = array();
$queue[] = 1; // or array_push()
$queue[] = 2; // or array_push()
$queue[] = 3; // or array_push()
array_shift($queue); // gives 1
array_shift($queue); // gives 2
array_shift($queue); // gives 3
Queues – Using SplQueue
$queue = new SplQueue();
$queue[] = 1; // or $queue->enqueue()
$queue[] = 2; // or $queue->enqueue()
$queue[] = 3; // or $queue->enqueue()
$queue->dequeue(); // gives 1
$queue->dequeue(); // gives 2
$queue->dequeue(); // gives 3
Stacks
Stacks
●   A stack is an ordered collection respecting Last In,
    First Out (LIFO) order.
●   Elements are inserted and removed on the same
    end.
Stacks
●   A stack is an ordered collection respecting Last In,
    First Out (LIFO) order.
●   Elements are inserted and removed on the same
    end.

          Data

        Push
                 Data   Data   Data   Data   Data   Data
        Pop


          Data
Stacks – Using array
$stack = array();
$stack[] = 1; // or array_push()
$stack[] = 2; // or array_push()
$stack[] = 3; // or array_push()
array_pop($stack); // gives 3
array_pop($stack); // gives 2
array_pop($stack); // gives 1
Stacks – Using SplStack
$stack = new SplStack();
$stack[] = 1; // or $stack->push()
$stack[] = 2; // or $stack->push()
$stack[] = 3; // or $stack->push()
$stack->pop(); // gives 3
$stack->pop(); // gives 2
$stack->pop(); // gives 1
Queues/Stacks – Pros and Cons
              Array              SplQueue / SplStack
- Uses more memory             + Uses less memory
(overhead / entry: 96 bytes)   (overhead / entry: 48 bytes)
- No type hinting              + Type hinting possible
+|- Less OO                    +|- More OO
Sets

Geeks                      Nerds
          People with
        strong views on
         the distinction
        between geeks
           and nerds
Sets
●   A set is a collection with no particular ordering
    especially suited for testing the membership of a
    value against a collection or to perform
    union/intersection/complement operations
    between them.
Sets
●   A set is a collection with no particular ordering
    especially suited for testing the membership of a
    value against a collection or to perform
    union/intersection/complement operations
    between them.
                                     Data

                     Data
                                   Data

                                            Data
                            Data
Sets – Using array
$set = array();

// Adding elements to a set
$set[] = 1;
$set[] = 2;
$set[] = 3;

// Checking presence in a set
in_array(2, $set); // true
in_array(5, $set); // false

array_merge($set1, $set2); // union
array_intersect($set1, $set2); // intersection
array_diff($set1, $set2); // complement
Sets – Using array
$set = array();

// Adding elements to a set
$set[] = 1;
$set[] = 2;
$set[] = 3;                     True
// Checking presence in a set   performance
in_array(2, $set); // true
in_array(5, $set); // false
                                killers!
array_merge($set1, $set2); // union
array_intersect($set1, $set2); // intersection
array_diff($set1, $set2); // complement
Sets – Mis-usage
if ($value === "val1" || $value === "val2" || $value ===
"val3")))
{
    // ...
}
Sets – Mis-usage
if (in_array($value, array("val1", "val2", "val3")))
{
    // ...
}
Sets – Mis-usage
switch ($value)
{
    case "val1":
    case "val2":
    case "val3":
        // ...
}
Sets – Using array (simple types)
$set = array();

// Adding   elements to a set
$set[1] =   true; // Any dummy value
$set[2] =   true; // is good but NULL!
$set[3] =   true;

// Checking presence in a set
isset($set[2]); // true
isset($set[5]); // false

$set1 + $set2; // union
array_intersect_key($set1, $set2); // intersection
array_diff_key($set1, $set2); // complement
Sets – Using array (simple types)
$set = array();

// Adding   elements to a set
$set[1] =   true; // Any dummy value
$set[2] =   true; // is good but NULL!
$set[3] =   true;

// Checking presence in a set
isset($set[2]); // true
isset($set[5]); // false

$set1 + $set2; // union
array_intersect_key($set1, $set2); // intersection
array_diff_key($set1, $set2); // complement
●   Remember that PHP Array keys can be integers or
    strings only!
Sets – Using array (objects)
$set = array();

// Adding elements to a set
$set[spl_object_hash($object1)] = $object1;
$set[spl_object_hash($object2)] = $object2;
$set[spl_object_hash($object3)] = $object3;

// Checking presence in a set
isset($set[spl_object_hash($object2)]); // true
isset($set[spl_object_hash($object5)]); // false

$set1 + $set2; // union
array_intersect_key($set1, $set2); // intersection
array_diff_key($set1, $set2); // complement
Sets – Using array (objects)
$set = array();

// Adding elements to a set
$set[spl_object_hash($object1)] = $object1;   Store a
$set[spl_object_hash($object2)] = $object2;   reference of
$set[spl_object_hash($object3)] = $object3;   the object!
// Checking presence in a set
isset($set[spl_object_hash($object2)]); // true
isset($set[spl_object_hash($object5)]); // false

$set1 + $set2; // union
array_intersect_key($set1, $set2); // intersection
array_diff_key($set1, $set2); // complement
Sets – Using SplObjectStorage
                 (objects)
$set = new SplObjectStorage();

// Adding elements to a   set
$set->attach($object1);   // or $set[$object1] = null;
$set->attach($object2);   // or $set[$object2] = null;
$set->attach($object3);   // or $set[$object3] = null;

// Checking presence in a set
isset($set[$object2]); // true
isset($set[$object2]); // false

$set1->addAll($set2); // union
$set1->removeAllExcept($set2); // intersection
$set1->removeAll($set2); // complement
Sets – Using QuickHash (int)
$set = new QuickHashIntSet(64,
QuickHashIntSet::CHECK_FOR_DUPES);

// Adding elements to a set
$set->add(1);
$set->add(2);
$set->add(3);

// Checking presence in a set
$set->exists(2); // true
$set->exists(5); // false

// Soonish: isset($set[2]);

●   No union/intersection/complement operations
    (yet?)
●   Yummy features like (loadFrom|saveTo)(String|File)
Sets – Using bitsets
define("E_ERROR", 1); // or 1<<0
define("E_WARNING", 2); // or 1<<1
define("E_PARSE", 4); // or 1<<2
define("E_NOTICE", 8); // or 1<<3

// Adding elements to a set
$set = 0;
$set |= E_ERROR;
$set |= E_WARNING;
$set |= E_PARSE;

// Checking presence in a set
$set & E_ERROR; // true
$set & E_NOTICE; // false

$set1 | $set2; // union
$set1 & $set2; // intersection
$set1 ^ $set2; // complement
Sets – Using bitsets (example)
Instead of:
function remove($path, $files = true, $directories = true, $links = true,
$executable = true)
{
    if (!$files && is_file($path))
        return false;

    if (!$directories && is_dir($path))
        return false;

    if (!$links && is_link($path))
        return false;

    if (!$executable && is_executable($path))
        return false;

    // ...
}

remove("/tmp/removeMe", true, false, true, false); // WTF ?!
Sets – Using bitsets (example)
Instead of:
define("REMOVE_FILES", 1 << 0);
define("REMOVE_DIRS", 1 << 1);
define("REMOVE_LINKS", 1 << 2);
define("REMOVE_EXEC", 1 << 3);
define("REMOVE_ALL", ~0); // Setting all bits

function remove($path, $options = REMOVE_ALL)
{
    if (~$options & REMOVE_FILES && is_file($path))
        return false;

    if (~$options & REMOVE_DIRS && is_dir($path))
        return false;

    if (~$options & REMOVE_LINKS && is_link($path))
        return false;

    if (~$options & REMOVE_EXEC && is_executable($path))
        return false;

    // ...
}

remove("/tmp/removeMe", REMOVE_FILES | REMOVE_LINKS); // Much better :)
Sets: Conclusions
●   Use the key and not the value when using PHP
    Arrays.
●   Use QuickHash for set of integers if possible.
●   Use SplObjectStorage as soon as you are playing
    with objects.
●   Don't use array_unique() when you need a set!
Maps
●   A map is a collection of key/value pairs where all
    keys are unique.
Maps – Using array
$map = array();
$map["ONE"] = 1;
$map["TWO"] = 2;
$map["THREE"] = 3;

// Merging maps:
array_merge($map1, $map2); // SLOW!
$map2 + $map1; // Fast :)
●   Don't use array_merge() on maps.
Multikey Maps – Using array
$map = array();
$map["ONE"] = 1;
$map["UN"] =& $map["ONE"];
$map["UNO"] =& $map["ONE"];
$map["TWO"] = 2;
$map["DEUX"] =& $map["TWO"];
$map["DUE"] =& $map["TWO"];

$map["UNO"] = "once";
$map["DEUX"] = "twice";

var_dump($map);
/*
array(6) {
["ONE"] => &string(4) "once"
● Don't use array_merge() on maps.
["UN"] => &string(4) "once"
["UNO"] => &string(4) "once"
["TWO"] => &string(5) "twice"
["DEUX"] => &string(5) "twice"
["DUE"] => &string(5) "twice"
}
*/
Heap
●   A heap is a tree-based structure in which all
    elements are ordered with largest key at the top,
    and the smallest one as leafs.
Heap
●   A heap is a tree-based structure in which all
    elements are ordered with largest key at the top,
    and the smallest one as leafs.
Heap – Using array
$heap = array();
$heap[] = 3;
sort($heap);
$heap[] = 1;
sort($heap);
$heap[] = 2;
sort($heap);
Heap – Using Spl(Min|Max)Heap
$heap = new SplMinHeap;
$heap->insert(3);
$heap->insert(1);
$heap->insert(2);
Heaps: Conclusions
●   MUCH faster than having to re-sort() an array at
    every insertion.
●   If you don't require a collection to be sorted at
    every single step and can insert all data at once
    and then sort(). Array is a much better/faster
    approach.
●   SplPriorityQueue is very similar, consider it is the
    same as SplHeap but where the sorting is made on
    the key rather than the value.
Bloom filters
●   A bloom filter is a space-efficient probabilistic data
    structure used to test whether an element is
    member of a set.
●   False positives are possible, but false negatives are
    not!
Bloom filters – Using bloomy
// BloomFilter::__construct(int capacity [, double
error_rate [, int random_seed ] ])
$bloomFilter = new BloomFilter(10000, 0.001);

$bloomFilter->add("An element");

$bloomFilter->has("An element"); // true for sure
$bloomFilter->has("Foo"); // false, most probably
Other related projects
●   SPL Types: Various types implemented as object:
    SplInt, SplFloat, SplEnum, SplBool and SplString
    http://pecl.php.net/package/SPL_Types
Other related projects
●   SPL Types: Various types implemented as object:
    SplInt, SplFloat, SplEnum, SplBool and SplString
    http://pecl.php.net/package/SPL_Types
●   Judy: Sparse dynamic arrays implementation
    http://pecl.php.net/package/Judy
Other related projects
●   SPL Types: Various types implemented as object:
    SplInt, SplFloat, SplEnum, SplBool and SplString
    http://pecl.php.net/package/SPL_Types
●   Judy: Sparse dynamic arrays implementation
    http://pecl.php.net/package/Judy
●   Weakref: Weak references implementation.
    Provides a gateway to an object without
    preventing that object from being collected by the
    garbage collector.
Conclusions
●   Use appropriate data structure. It will keep your
    code clean and fast.
Conclusions
●   Use appropriate data structure. It will keep your
    code clean and fast.
●   Think about the time and space complexity
    involved by your algorithms.
Conclusions
●   Use appropriate data structure. It will keep your
    code clean and fast.
●   Think about the time and space complexity
    involved by your algorithms.
●   Name your variables accordingly: use “Map”, “Set”,
    “List”, “Queue”,... to describe them instead of using
    something like: $ordersArray.
Questions?
Thanks
●   Don't forget to rate this talk on https://joind.in/4968
Photo Credits
●   Northstar Ski Jump: http://www.flickr.com/photos/renotahoe/5593248965
●   Tuned car:
    http://www.flickr.com/photos/gioxxswall/5783867752
●   London Eye Structure: http://www.flickr.com/photos/photographygal123/4883546484
●   Cigarette:
    http://www.flickr.com/photos/superfantastic/166215927
●   Sets: http://www.flickr.com/photos/yat86/4889135495
●   Heap structure:
    http://en.wikipedia.org/wiki/File:Max-Heap.svg
●   Drawers:
    http://www.flickr.com/photos/jamesclay/2312912612
●   Stones stack:
    http://www.flickr.com/photos/silent_e/2282729987
●   Tree:
    http://www.flickr.com/photos/drewbandy/6002204996
●   Matrix:
    http://www.flickr.com/photos/gamikun/2564208746

Mais conteúdo relacionado

Destaque

Digital Testing Strategy: Replace Manual Testing with Intelligent Automation
Digital Testing Strategy: Replace Manual Testing with Intelligent AutomationDigital Testing Strategy: Replace Manual Testing with Intelligent Automation
Digital Testing Strategy: Replace Manual Testing with Intelligent AutomationWorksoft
 
Manual Testing
Manual TestingManual Testing
Manual TestingG.C Reddy
 
Manual testing interview question by INFOTECH
Manual testing interview question by INFOTECHManual testing interview question by INFOTECH
Manual testing interview question by INFOTECHPravinsinh
 

Destaque (6)

David ausubel
David ausubelDavid ausubel
David ausubel
 
Digital Testing Strategy: Replace Manual Testing with Intelligent Automation
Digital Testing Strategy: Replace Manual Testing with Intelligent AutomationDigital Testing Strategy: Replace Manual Testing with Intelligent Automation
Digital Testing Strategy: Replace Manual Testing with Intelligent Automation
 
Manual Testing
Manual TestingManual Testing
Manual Testing
 
Manual testing ppt
Manual testing pptManual testing ppt
Manual testing ppt
 
Manual testing interview question by INFOTECH
Manual testing interview question by INFOTECHManual testing interview question by INFOTECH
Manual testing interview question by INFOTECH
 
reveal.js 3.0.0
reveal.js 3.0.0reveal.js 3.0.0
reveal.js 3.0.0
 

Último

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 

Último (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Masterizing PHP Data Structure 102 - PHPUK 2012

  • 1. Masterizing PHP Data Structure 102 Patrick Allaert PHPUK 2012 Conference
  • 2. About me ● Patrick Allaert ● Founder of Libereco ● Playing with PHP/Linux for +10 years ● eZ Publish core developer ● Author of the APM PHP extension ● @patrick_allaert ● patrickallaert@php.net ● http://github.com/patrickallaert/ ● http://patrickallaert.blogspot.com/
  • 3. APM
  • 4. APM
  • 6. PHP native datatypes ● NULL (IS_NULL) ● Booleans (IS_BOOL) ● Integers (IS_LONG) ● Floating point numbers (IS_DOUBLE) ● Strings (IS_STRING) ● Arrays (IS_ARRAY, IS_CONSTANT_ARRAY) ● Objects (IS_OBJECT) ● Resources (IS_RESOURCE) ● Callable (IS_CALLABLE)
  • 7. Wikipedia datatypes ● 2-3-4 tree ● Bit field ● Directed acyclic ● Hashed array tree Map/Associative ● ● Rapidly-exploring ● Suffix tree graph array/Dictionary random tree Symbol table ● 2-3 heap ● Bitmap Hash list ● ● Directed graph Matrix Record (also called Syntax tree 2-3 tree BK-tree ● ● ● Hash table ● ● ● Disjoint-set tuple or struct) Tagged union (variant ● Metric tree AA tree ● Bloom filter record, discriminated ● ● ● Distributed hash Hash tree Red-black tree ● ● Minimax tree ● union, disjoint union) Abstract syntax tree ● ● ● ● Boolean table Hash trie Min/max kd-tree ● Rope Tango tree (a,b)-tree ● ● ● ● Bounding interval Double Routing table Ternary heap ● ● Heap M-tree ● Adaptive k-d tree ● hierarchy ● ● ● Doubly connected R-tree Ternary search tree ● Heightmap Multigraph ● ● ● Adjacency list B sharp tree edge list ● ● R* tree Threaded binary tree ● Multimap ● ● ● Adjacency matrix BSP tree ● Doubly linked list ● Hilbert R-tree ● R+ tree Top tree Multiset ● ● ● AF-heap Dynamic array Treap Hypergraph ● B-tree Scapegoat tree ● Octree ● ● ● ● ● Tree Alternating decision Enfilade ● Scene graph Iliffe vector ● ● tree ● B*-tree ● ● Pagoda ● Trees Enumerated type ● Segment tree ● B+ tree ● Image Pairing heap ● Trie And-inverter graph ● Expectiminimax tree ● ● ● ● Self-balancing T-tree Implicit kd-tree Parallel array ● And–or tree ● B-trie Exponential tree ● ● ● binary search tree ● ● UB-tree Parse tree ● ● Array ● Bx-tree Fenwick tree ● Interval tree ● ● Self-organizing list ● Union Plain old data ● ● AVL tree Cartesian tree Fibonacci heap Int ● Set Unrolled linked list ● ● ● ● structure ● ● ● Beap ● Char ● Finger tree ● Judy array Prefix hash tree ● Skew heap ● Van Emde Boas tree ● Skip list Variable-length array ● Bidirectional map ● Circular buffer ● Float Kdb tree Priority queue ● ● ● ● VList Bin Compressed suffix FM-index Soft heap ● ● ● ● Kd-tree ● Propositional ● VP-tree ● Binary decision array Fusion tree directed acyclic Sorted array ● ● ● Koorde ● Weight-balanced tree diagram Gap buffer graph Spaghetti stack ● ● Container ● ● ● Winged edge Binary heap Generalised suffix Leftist heap Quad-edge Sparse array ● ● ● Control table ● ● ● ● X-fast trie Quadtree ● Binary search tree tree Lightmap Sparse matrix Xor linked list Cover tree ● ● ● ● ● Binary tree ● Graph Queap Splay tree X-tree ● Ctrie ● ● Linear octree ● ● ● Binomial heap ● Graph-structured Queue SPQR-tree Y-fast trie ● Dancing tree ● stack Link/cut tree ● ● ● ● Radix tree Zero suppressed Stack ● Bit array ● ● ● D-ary heap Hash ● Linked list Randomized binary ● decision diagram Bitboard ● ● ● String Zipper Hash array mapped Lookup table search tree ● ● Decision tree ● ● Suffix array ● ● trie Z-order Deque ● Range tree ● ● ●
  • 8. Game: Can you recognize some structures?
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17. Array: PHP's untruthfulness PHP “Arrays” are not true Arrays!
  • 18. Array: PHP's untruthfulness PHP “Arrays” are not true Arrays! An array typically looks like this: 0 1 2 3 4 5 Data Data Data Data Data Data
  • 19. Array: PHP's untruthfulness PHP “Arrays” can dynamically grow and be iterated both directions (reset(), next(), prev(), end()), exclusively with O(1) operations.
  • 20. Array: PHP's untruthfulness PHP “Arrays” can dynamically grow and be iterated both directions (reset(), next(), prev(), end()), exclusively with O(1) operations. Let's have a Doubly Linked List (DLL): Head Tail Data Data Data Data Data Enables List, Deque, Queue and Stack implementations
  • 21. Array: PHP's untruthfulness PHP “Arrays” elements are always accessible using a key (index).
  • 22. Array: PHP's untruthfulness PHP “Arrays” elements are always accessible using a key (index). Let's have an Hash Table: Head Bucket pointers array Tail 0 1 2 3 4 5 ... nTableSize -1 Bucket * Bucket * Bucket * Bucket * Bucket * Bucket * Bucket * Bucket Bucket Bucket Bucket Bucket Data Data Data Data Data
  • 23. Array: PHP's untruthfulness http://php.net/manual/en/language.types.array.php: “This type is optimized for several different uses; it can be treated as an array, list (vector), hash table (an implementation of a map), dictionary, collection, stack, queue, and probably more.”
  • 24. Optimized for anything ≈ Optimized for nothing!
  • 25. Array: PHP's untruthfulness ● In C: 100 000 integers (using long on 64bits => 8 bytes) can be stored in 0.76 Mb. ● In PHP: it will take ≅ 13.97 Mb! ● A PHP variable (containing an integer) takes 48 bytes. ● The overhead of buckets for every “array” entries is about 96 bytes. ● More details: http://nikic.github.com/2011/12/12/How-big-are-PHP-arrays-really-Hint-BIG.html
  • 27. Structs (or records, tuples,...)
  • 28. Structs (or records, tuples,...) ● A struct is a value containing other values which are typically accessed using a name. ● Example: Person => firstName / lastName ComplexNumber => realPart / imaginaryPart
  • 29. Structs – Using array $person = array( "firstName" => "Patrick", "lastName" => "Allaert" );
  • 30. Structs – Using a class $person = new PersonStruct( "Patrick", "Allaert" );
  • 31. Structs – Using a class (Implementation) class PersonStruct { public $firstName; public $lastName; public function __construct($firstName, $lastName) { $this->firstName = $firstName; $this->lastName = $lastName; } }
  • 32. Structs – Using a class (Implementation) class PersonStruct { public $firstName; public $lastName; public function __construct($firstName, $lastName) { $this->firstName = $firstName; $this->lastName = $lastName; } public function __set($key, $value) { // a. Do nothing // b. trigger_error() // c. Throws an exception } }
  • 33. Structs – Pros and Cons Array Class + Uses less memory (PHP < 5.4) - Uses more memory (PHP < 5.4) - Uses more memory (PHP = 5.4) + Uses less memory (PHP = 5.4) - No type hinting + Type hinting possible - Flexible structure + Rigid structure +|- Less OO +|- More OO + Slightly faster - Slightly slower
  • 35. (true) Arrays ● An array is a fixed size collection where elements are each identified by a numeric index.
  • 36. (true) Arrays ● An array is a fixed size collection where elements are each identified by a numeric index. 0 1 2 3 4 5 Data Data Data Data Data Data
  • 37. (true) Arrays – Using SplFixedArray $array = new SplFixedArray(3); $array[0] = 1; // or $array->offsetSet() $array[1] = 2; // or $array->offsetSet() $array[2] = 3; // or $array->offsetSet() $array[0]; // gives 1 $array[1]; // gives 2 $array[2]; // gives 3
  • 38. (true) Arrays – Pros and Cons Array SplFixedArray - Uses more memory + Uses less memory +|- Less OO +|- More OO
  • 40. Queues ● A queue is an ordered collection respecting First In, First Out (FIFO) order. ● Elements are inserted at one end and removed at the other.
  • 41. Queues ● A queue is an ordered collection respecting First In, First Out (FIFO) order. ● Elements are inserted at one end and removed at the other. Data Dequeue Data Data Data Data Data Data Enqueue Data
  • 42. Queues – Using array $queue = array(); $queue[] = 1; // or array_push() $queue[] = 2; // or array_push() $queue[] = 3; // or array_push() array_shift($queue); // gives 1 array_shift($queue); // gives 2 array_shift($queue); // gives 3
  • 43. Queues – Using SplQueue $queue = new SplQueue(); $queue[] = 1; // or $queue->enqueue() $queue[] = 2; // or $queue->enqueue() $queue[] = 3; // or $queue->enqueue() $queue->dequeue(); // gives 1 $queue->dequeue(); // gives 2 $queue->dequeue(); // gives 3
  • 45. Stacks ● A stack is an ordered collection respecting Last In, First Out (LIFO) order. ● Elements are inserted and removed on the same end.
  • 46. Stacks ● A stack is an ordered collection respecting Last In, First Out (LIFO) order. ● Elements are inserted and removed on the same end. Data Push Data Data Data Data Data Data Pop Data
  • 47. Stacks – Using array $stack = array(); $stack[] = 1; // or array_push() $stack[] = 2; // or array_push() $stack[] = 3; // or array_push() array_pop($stack); // gives 3 array_pop($stack); // gives 2 array_pop($stack); // gives 1
  • 48. Stacks – Using SplStack $stack = new SplStack(); $stack[] = 1; // or $stack->push() $stack[] = 2; // or $stack->push() $stack[] = 3; // or $stack->push() $stack->pop(); // gives 3 $stack->pop(); // gives 2 $stack->pop(); // gives 1
  • 49. Queues/Stacks – Pros and Cons Array SplQueue / SplStack - Uses more memory + Uses less memory (overhead / entry: 96 bytes) (overhead / entry: 48 bytes) - No type hinting + Type hinting possible +|- Less OO +|- More OO
  • 50. Sets Geeks Nerds People with strong views on the distinction between geeks and nerds
  • 51. Sets ● A set is a collection with no particular ordering especially suited for testing the membership of a value against a collection or to perform union/intersection/complement operations between them.
  • 52. Sets ● A set is a collection with no particular ordering especially suited for testing the membership of a value against a collection or to perform union/intersection/complement operations between them. Data Data Data Data Data
  • 53. Sets – Using array $set = array(); // Adding elements to a set $set[] = 1; $set[] = 2; $set[] = 3; // Checking presence in a set in_array(2, $set); // true in_array(5, $set); // false array_merge($set1, $set2); // union array_intersect($set1, $set2); // intersection array_diff($set1, $set2); // complement
  • 54. Sets – Using array $set = array(); // Adding elements to a set $set[] = 1; $set[] = 2; $set[] = 3; True // Checking presence in a set performance in_array(2, $set); // true in_array(5, $set); // false killers! array_merge($set1, $set2); // union array_intersect($set1, $set2); // intersection array_diff($set1, $set2); // complement
  • 55. Sets – Mis-usage if ($value === "val1" || $value === "val2" || $value === "val3"))) { // ... }
  • 56. Sets – Mis-usage if (in_array($value, array("val1", "val2", "val3"))) { // ... }
  • 57. Sets – Mis-usage switch ($value) { case "val1": case "val2": case "val3": // ... }
  • 58. Sets – Using array (simple types) $set = array(); // Adding elements to a set $set[1] = true; // Any dummy value $set[2] = true; // is good but NULL! $set[3] = true; // Checking presence in a set isset($set[2]); // true isset($set[5]); // false $set1 + $set2; // union array_intersect_key($set1, $set2); // intersection array_diff_key($set1, $set2); // complement
  • 59. Sets – Using array (simple types) $set = array(); // Adding elements to a set $set[1] = true; // Any dummy value $set[2] = true; // is good but NULL! $set[3] = true; // Checking presence in a set isset($set[2]); // true isset($set[5]); // false $set1 + $set2; // union array_intersect_key($set1, $set2); // intersection array_diff_key($set1, $set2); // complement ● Remember that PHP Array keys can be integers or strings only!
  • 60. Sets – Using array (objects) $set = array(); // Adding elements to a set $set[spl_object_hash($object1)] = $object1; $set[spl_object_hash($object2)] = $object2; $set[spl_object_hash($object3)] = $object3; // Checking presence in a set isset($set[spl_object_hash($object2)]); // true isset($set[spl_object_hash($object5)]); // false $set1 + $set2; // union array_intersect_key($set1, $set2); // intersection array_diff_key($set1, $set2); // complement
  • 61. Sets – Using array (objects) $set = array(); // Adding elements to a set $set[spl_object_hash($object1)] = $object1; Store a $set[spl_object_hash($object2)] = $object2; reference of $set[spl_object_hash($object3)] = $object3; the object! // Checking presence in a set isset($set[spl_object_hash($object2)]); // true isset($set[spl_object_hash($object5)]); // false $set1 + $set2; // union array_intersect_key($set1, $set2); // intersection array_diff_key($set1, $set2); // complement
  • 62. Sets – Using SplObjectStorage (objects) $set = new SplObjectStorage(); // Adding elements to a set $set->attach($object1); // or $set[$object1] = null; $set->attach($object2); // or $set[$object2] = null; $set->attach($object3); // or $set[$object3] = null; // Checking presence in a set isset($set[$object2]); // true isset($set[$object2]); // false $set1->addAll($set2); // union $set1->removeAllExcept($set2); // intersection $set1->removeAll($set2); // complement
  • 63. Sets – Using QuickHash (int) $set = new QuickHashIntSet(64, QuickHashIntSet::CHECK_FOR_DUPES); // Adding elements to a set $set->add(1); $set->add(2); $set->add(3); // Checking presence in a set $set->exists(2); // true $set->exists(5); // false // Soonish: isset($set[2]); ● No union/intersection/complement operations (yet?) ● Yummy features like (loadFrom|saveTo)(String|File)
  • 64. Sets – Using bitsets define("E_ERROR", 1); // or 1<<0 define("E_WARNING", 2); // or 1<<1 define("E_PARSE", 4); // or 1<<2 define("E_NOTICE", 8); // or 1<<3 // Adding elements to a set $set = 0; $set |= E_ERROR; $set |= E_WARNING; $set |= E_PARSE; // Checking presence in a set $set & E_ERROR; // true $set & E_NOTICE; // false $set1 | $set2; // union $set1 & $set2; // intersection $set1 ^ $set2; // complement
  • 65. Sets – Using bitsets (example) Instead of: function remove($path, $files = true, $directories = true, $links = true, $executable = true) { if (!$files && is_file($path)) return false; if (!$directories && is_dir($path)) return false; if (!$links && is_link($path)) return false; if (!$executable && is_executable($path)) return false; // ... } remove("/tmp/removeMe", true, false, true, false); // WTF ?!
  • 66. Sets – Using bitsets (example) Instead of: define("REMOVE_FILES", 1 << 0); define("REMOVE_DIRS", 1 << 1); define("REMOVE_LINKS", 1 << 2); define("REMOVE_EXEC", 1 << 3); define("REMOVE_ALL", ~0); // Setting all bits function remove($path, $options = REMOVE_ALL) { if (~$options & REMOVE_FILES && is_file($path)) return false; if (~$options & REMOVE_DIRS && is_dir($path)) return false; if (~$options & REMOVE_LINKS && is_link($path)) return false; if (~$options & REMOVE_EXEC && is_executable($path)) return false; // ... } remove("/tmp/removeMe", REMOVE_FILES | REMOVE_LINKS); // Much better :)
  • 67. Sets: Conclusions ● Use the key and not the value when using PHP Arrays. ● Use QuickHash for set of integers if possible. ● Use SplObjectStorage as soon as you are playing with objects. ● Don't use array_unique() when you need a set!
  • 68. Maps ● A map is a collection of key/value pairs where all keys are unique.
  • 69. Maps – Using array $map = array(); $map["ONE"] = 1; $map["TWO"] = 2; $map["THREE"] = 3; // Merging maps: array_merge($map1, $map2); // SLOW! $map2 + $map1; // Fast :) ● Don't use array_merge() on maps.
  • 70. Multikey Maps – Using array $map = array(); $map["ONE"] = 1; $map["UN"] =& $map["ONE"]; $map["UNO"] =& $map["ONE"]; $map["TWO"] = 2; $map["DEUX"] =& $map["TWO"]; $map["DUE"] =& $map["TWO"]; $map["UNO"] = "once"; $map["DEUX"] = "twice"; var_dump($map); /* array(6) { ["ONE"] => &string(4) "once" ● Don't use array_merge() on maps. ["UN"] => &string(4) "once" ["UNO"] => &string(4) "once" ["TWO"] => &string(5) "twice" ["DEUX"] => &string(5) "twice" ["DUE"] => &string(5) "twice" } */
  • 71. Heap ● A heap is a tree-based structure in which all elements are ordered with largest key at the top, and the smallest one as leafs.
  • 72. Heap ● A heap is a tree-based structure in which all elements are ordered with largest key at the top, and the smallest one as leafs.
  • 73. Heap – Using array $heap = array(); $heap[] = 3; sort($heap); $heap[] = 1; sort($heap); $heap[] = 2; sort($heap);
  • 74. Heap – Using Spl(Min|Max)Heap $heap = new SplMinHeap; $heap->insert(3); $heap->insert(1); $heap->insert(2);
  • 75. Heaps: Conclusions ● MUCH faster than having to re-sort() an array at every insertion. ● If you don't require a collection to be sorted at every single step and can insert all data at once and then sort(). Array is a much better/faster approach. ● SplPriorityQueue is very similar, consider it is the same as SplHeap but where the sorting is made on the key rather than the value.
  • 76. Bloom filters ● A bloom filter is a space-efficient probabilistic data structure used to test whether an element is member of a set. ● False positives are possible, but false negatives are not!
  • 77. Bloom filters – Using bloomy // BloomFilter::__construct(int capacity [, double error_rate [, int random_seed ] ]) $bloomFilter = new BloomFilter(10000, 0.001); $bloomFilter->add("An element"); $bloomFilter->has("An element"); // true for sure $bloomFilter->has("Foo"); // false, most probably
  • 78. Other related projects ● SPL Types: Various types implemented as object: SplInt, SplFloat, SplEnum, SplBool and SplString http://pecl.php.net/package/SPL_Types
  • 79. Other related projects ● SPL Types: Various types implemented as object: SplInt, SplFloat, SplEnum, SplBool and SplString http://pecl.php.net/package/SPL_Types ● Judy: Sparse dynamic arrays implementation http://pecl.php.net/package/Judy
  • 80. Other related projects ● SPL Types: Various types implemented as object: SplInt, SplFloat, SplEnum, SplBool and SplString http://pecl.php.net/package/SPL_Types ● Judy: Sparse dynamic arrays implementation http://pecl.php.net/package/Judy ● Weakref: Weak references implementation. Provides a gateway to an object without preventing that object from being collected by the garbage collector.
  • 81. Conclusions ● Use appropriate data structure. It will keep your code clean and fast.
  • 82. Conclusions ● Use appropriate data structure. It will keep your code clean and fast. ● Think about the time and space complexity involved by your algorithms.
  • 83. Conclusions ● Use appropriate data structure. It will keep your code clean and fast. ● Think about the time and space complexity involved by your algorithms. ● Name your variables accordingly: use “Map”, “Set”, “List”, “Queue”,... to describe them instead of using something like: $ordersArray.
  • 85. Thanks ● Don't forget to rate this talk on https://joind.in/4968
  • 86. Photo Credits ● Northstar Ski Jump: http://www.flickr.com/photos/renotahoe/5593248965 ● Tuned car: http://www.flickr.com/photos/gioxxswall/5783867752 ● London Eye Structure: http://www.flickr.com/photos/photographygal123/4883546484 ● Cigarette: http://www.flickr.com/photos/superfantastic/166215927 ● Sets: http://www.flickr.com/photos/yat86/4889135495 ● Heap structure: http://en.wikipedia.org/wiki/File:Max-Heap.svg ● Drawers: http://www.flickr.com/photos/jamesclay/2312912612 ● Stones stack: http://www.flickr.com/photos/silent_e/2282729987 ● Tree: http://www.flickr.com/photos/drewbandy/6002204996 ● Matrix: http://www.flickr.com/photos/gamikun/2564208746