Playing Go with Clojure

PLAYING GO
WITH CLOJURE

Zach Tellman
@ztellman

MY QUALIFICATIONS

• I’m a mediocre Go player

•I think this stuff is pretty cool

• I’ve written performance-oriented Clojure before

THE GAME TREE

X X X
…

X O X O X
O …

THE GAME TREE

a cooperative or single-player game, we only need to
• in
search for victory

• in a competitive game, we need to optimize for victory

MINIMAX

• assign a value to each leaf node

the parent the maximal child value if it’s our turn,
• assign
or the minimal child value if it’s our opponent’s turn

• ascend one level, and repeat

MINIMAX

• plays a perfect game

• requires traversing the entire tree

• there are ~9! possible tic-tac-toe games

EVALUATION FUNCTIONS

• construct a shallow game tree

each leaf node a value based on our best guess at
• assign
the outcome

• apply minimax to these values

THE TROUBLE
WITH HEURISTICS

• difﬁcult to quantify

• depth vs. quality

• understanding a function vs. understanding its repeated,
recursive application

THE TROUBLE
WITH GO

• lots of breadth

• lots of depth

HISTORICAL APPROACHES

• programmers who are professional-level players

• shallow and narrow searches, heavy evaluation functions

MONTE CARLO SIMULATION

(defn inside-circle? [x y]
(>= 1 (+ (* x x) (* y y))))

(defn sample []
(inside-circle? (rand) (rand)))
1

(0,0)

MONTE CARLO SIMULATION

(defn inside-circle? [x y]
(>= 1 (+ (* x x) (* y y))))

(defn sample []
(inside-circle? (rand) (rand)))
1

(repeatedly 1e6 sample)
(0,0)

MONTE CARLO EVALUATION

• thestrength of a position is how often one side wins,
given repeated, naive playouts by both sides

• possible playouts vs plausible playouts

• eachnew playout gives us additional information about
what is plausible

MULTI-ARMED BANDIT

• multiple levers to pull, each with an unknown expected
reward

• exploitation vs exploration

• if
the search space is stable over time, we expect to
converge on a solution

SIMULATING GO
WITH CLOJURE

• select a move

• check for suicide

• make move

• check for capture

• repeat

INCREMENTAL STATE

4

INCREMENTAL STATE

6

INCREMENTAL STATE

6

2

INCREMENTAL STATE

• we keep track of the pseudo-liberties, but also the
neighbor sum and sum-of-squares

• ifsum × sum = psudeo-liberties × sum-of-squares,
there’s only one real liberty

• proof is left as an exercise for the reader

A LITTLE ARITHMETIC

• select a move 1 second
/ 10,000 games per second
• check for suicide

• make move


• repeat

A LITTLE ARITHMETIC

• check for suicide / 100 moves per game

• make move


• repeat

A LITTLE ARITHMETIC

• check for suicide / 100 moves per game

• make move


• repeat = 1μs per move

A MILLION TIMES
A SECOND

(assoc {} :a 1, :b 2, :c 3, :d 4, :e 5)

(reduce + (range 10))

(set (range 5))

execute typical instruction 1 ns

fetch from L1 cache memory 0.5 ns

branch misprediction 5 ns

fetch from L2 cache memory 7 ns

mutex lock/unlock 25 ns

fetch from main memory 100 ns

“Teach Yourself Programming in Ten Years”
Peter Norvig, 2001

A QUICK DESCENT
INTO
INSANITY
PERFORMANCE

NOT YOUR DAY JOB

• not latency bound

• faster is smarter

• diminishing returns not within reach

MUTABLE STATE

“They are for experts only - if the semantics and
implications of :volatile-mutable or :unsynchronized-
mutable are not immediately apparent to you, you
should not be using them.”

Rich Hickey,
the docstring for deftype

MUTABLE STATE

• if
a value is only used on a single thread, it can be
unsynchronized

• for everything else, use volatile

• unsynchronized is roughly 2x faster

MUTABLE STATE
(deftype Mutable
[^:unsynchronized-mutable ^long counter]

IMutable
(get-counter [_]
counter)
(set-counter [_ val]
(set! counter val)))

MUTABLE STATE
(deftype Mutable

IMutable
(get-counter [_]
counter)
(set! counter val)))

(defn add [^Mutable m ^long val]
(set-counter m (+ (get-counter m) val)))

MUTABLE STATE
(deftype Mutable

IMutable
(get-counter [_]
counter)
(set! counter val))
(add [_ val]
(set! counter (+ counter val))))

MEASURE TWICE,
CODE ONCE
user> (bench (assoc {} :a 1, :b 2, :c 3, :d 4, :e 5))

Evaluation count : 46619100 in 60 samples of 776985 calls.

Execution time mean : 1.286463 us
Execution time std-deviation : 4.220920 ns
Execution time lower quantile : 1.280215 us ( 2.5%)
Execution time upper quantile : 1.295557 us (97.5%)

Found 4 outliers in 60 samples (6.6667 %)
low-severe 1 (1.6667 %)
low-mild 2 (3.3333 %)
high-mild 1 (1.6667 %)
Variance from outliers : 1.6389 %

CRITERIUM

• one of the many great libraries from Hugo Duncan

• use it at the REPL, and in your tests

• tends to exaggerate cache coherency

• minimum resolution appears to be ~15ns

BOTTOM-UP PERFORMANCE

• measure each piece of your code

• measure them when used together

• buildan intuition for how long something should take,
and investigate when you’re wrong

INDIRECTION

“Any performance problem can be solved
by removing a layer of indirection.”

Unattributed

INDIRECTION

= count

== identical? Array/getLength

PUT IT ALL TOGETHER
> lein test pushkin.test.simulator :benchmark

-----
random 9x9 playout
-----

Evaluation count : 501120 in 60 samples of 8352 calls.
Execution time mean : 121.293385 us
Execution time std-deviation : 2.534460 us
Execution time lower quantile : 119.685345 us ( 2.5%)
Execution time upper quantile : 128.150772 us (97.5%)

Found 8 outliers in 60 samples (13.3333 %)
low-severe 3 (5.0000 %)
low-mild 5 (8.3333 %)
Variance from outliers : 9.4036 %

PLUMBING THE DEPTHS
OF PERFORMANCE
INSANITY
• simulating C-style structs with ByteBuffers

• mimicking C compiler offset calculation with local analysis

• creating a bridge to heterogeneous computing

Exploitation Strong AIs

Pushkin
Exploration
Possible Plausible

http://github.com/ztellman/pushkin

Playing Go with Clojure

Recomendados

Recomendados

Mais conteúdo relacionado

Semelhante a Playing Go with Clojure

Semelhante a Playing Go with Clojure (20)

Último

Último (20)

Playing Go with Clojure

Notas do Editor