Using Parallel Computing Platform - NHDNUG

Visual Studio 2010 Using the Parallel Computing Platform Phil Pennington philpenn@microsoft.com

Agenda 2 What’s new with Windows? Parallel Computing Tools in Visual Studio Using .NET Parallel Extensions

First, An ExampleMonte Carlo Approximation of Pi S = 4*r*r C = Pi*r*r Pi = 4*(C/S) For each Point (P), d(P) = SQRT((x * x) + (y * y)) if (d < r) thenP(x,y) is in C

Windows and Maximum Processors Before Win7/R2, the maximum number of Logical Processors (LPs) was dictated by processor integral word size LP state (e.g. idle, affinity) represented in word-sized bitmask 32-bit Windows: 32 LPs 64-bit Windows: 64 LPs 32-bit Idle Processor Mask 31 0 16 Busy Idle

Processor GroupsNew with Windows7 and Windows Server R2 5 GROUP NUMA NODE Socket Socket Core Core LP LP LP LP Core Core NUMA NODE

Processor GroupsExample: 2 Groups, 4 nodes, 8 sockets, 32 cores, 128 LP’s 6 Group Group NUMA Node NUMA Node Socket Socket Socket Socket NUMA Node NUMA Node Socket Socket Socket Socket Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core Core LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP LP

Many-Core Topology APIs Discovery 7

Many-Core Topology APIs Resource Localization 8

Many-Core Topology APIs Memory Management 9

User Mode SchedulingArchitectural Perspective UMS Scheduler’s Ready List Your Scheduler Logic Wait Reason: Yield Reason: Yield Reason: Blocked Reason: Created CPU 1 CPU 2 UMS Completion List W1 W2 W3 W4 S1 S2 Application Kernel Blocked Worker Threads Scheduler Threads

Task Scheduling with a UMS SchedulerMaximize Quantum, Minimize Blocking Affects Tasks are run by worker threads, which the scheduler controls Dead Zone WT0 WT1 WT2 WT3 Without UMS (signal-and-wait) WT0 WT1 WT2 WT3 With UMS (UMS yield)

Load-Balancing, Work Stealing Scheduler DynamicScheduling Static Scheduling CPU0 CPU1 CPU2 CPU3 CPU0 CPU1 CPU2 CPU3 Dynamic scheduling improves performance by distributing work efficiently at runtime.

Demos The Platform - Topology - Schedulers

Visual Studio 2010, .NET Developer Tools, Programming Models, Runtimes Tools Programming Models – Structured Parallelism Parallel LINQ (PLINQ) Task ParallelLibrary (TPL) Debugger Data Structures .NET Parallel Extensions Profiler Task Scheduler Resource Manager .NET Runtime Threads Pools Managed Library Tools

Thread-Pool Scheduler in .NET 4.0 Thread 1 Dispatch Loop Thread 2 Dispatch Loop Thread N Dispatch Loop Enqueue Dequeue T1 T2 T3 T4 Global Queue (FIFO) Dequeue Enqueue T5 Global Q is shared by legacy ThreadPool API and TPL Local work queues and work stealing scheduler (TPL only) T6 T7 T8 Steal Steal Steal Thread 1 Local Queue (LIFO) Thread 2 Local Queue (LIFO) Thread N Local Queue (LIFO)

Task Parallel Library (TPL)Tasks Concepts Common Functionality: waiting, cancellation, continuations, parent/child relationships

Primitives and Structures Thread-safe, scalable collections IProducerConsumerCollection<T> ConcurrentQueue<T> ConcurrentStack<T> ConcurrentBag<T> ConcurrentDictionary<TKey,TValue> Phases and work exchange Barrier BlockingCollection<T> CountdownEvent Partitioning {Orderable}Partitioner<T> Partitioner.Create Exception handling AggregateException Initialization Lazy<T> LazyInitializer.EnsureInitialized<T> ThreadLocal<T> Locks ManualResetEventSlim SemaphoreSlim SpinLock SpinWait Cancellation CancellationToken{Source}

Parallel Debugging Two new debugger toolwindows Support both native and managed “Parallel Tasks” “Parallel Stacks”

Parallel Tasks ,[object Object]

Where are my tasks running (location, call stack)?

How many tasks are waiting to run?,[object Object]

Task-specific view (Task status)

Easy navigation to any executing method

Rich UI (zooming, panning, bird’s eye view, flagging, tooltips)Bird’s eye view

CPU Utilization Other processes Number of cores Idle time Your Process

Threads Measure time for interesting segments Hide uninteresting threads Zoom in and out Detailed thread analysis (one channel per thread) Active Legend Usage Hints Call Stacks

Cores Each logical core in a swim lane One color per thread Migration visualization Cross-core migration details

Demo Libraries Languages Debuggers Profilers

Thinking Parallel - “Task” vs. “Data” Parallelism Task Parallelism Parallel.Invoke( () => { Console.WriteLine("Begin first task..."); }, () => { Console.WriteLine("Begin second task..."); }, () => { Console.WriteLine("Begin third task..."); } ); Data Parallelism IEnumerable<int> numbers = Enumerable.Range(2, 100-3); varmyQuery = from n in numbers.AsParallel() where Enumerable.Range(2, (int)Math.Sqrt(n)).All(i => n % i > 0) select n; int[] primes = myQuery.ToArray();

Using Parallel Computing Platform - NHDNUG

Recomendados

Recomendados

Mais conteúdo relacionado

Mais procurados

Mais procurados (19)

Semelhante a Using Parallel Computing Platform - NHDNUG

Semelhante a Using Parallel Computing Platform - NHDNUG (20)

Último

Último (20)

Using Parallel Computing Platform - NHDNUG

Notas do Editor