Semelhante a Legal Analytics Course - Class #2 - Introduction to Machine Learning for Lawyers - Professors Daniel Martin Katz + Michael J Bommarito (20)
Legal Analytics Course - Class #2 - Introduction to Machine Learning for Lawyers - Professors Daniel Martin Katz + Michael J Bommarito
1. Legal Analytics
Professor Daniel Martin Katz
Professor Michael J Bommarito II
legalanalyticscourse.com
Class 2
Machine Learning for Lawyers
2. < What is Machine ‘Learning’?>
access more at legalanalyticscourse.com
3. we are trying to learn from existing
data to infer future data
rough and ready goal for ML:
access more at legalanalyticscourse.com
4. “A computer program is said to learn from
experience E with respect to some task T
and some performance measure P, if its
performance on T, as measured by P,
improves with experience E.”
Tom Mitchell
Carnegie Mellon University
access more at legalanalyticscourse.com
5.
6. inference is tricky for
a variety of reasons
access more at legalanalyticscourse.com
7. data might be noisy
thus hard to infer true signal
(1)
access more at legalanalyticscourse.com
8. the dynamics change such that
past data is a bad guide for
future data
(2)
access more at legalanalyticscourse.com
17. historically decision making
in law is heavily skewed
toward the use of
Human Experts
access more at legalanalyticscourse.com
18. rarely is the historic
performance of
Human Experts
extensively benchmarked
or validated using
statistical methods
access more at legalanalyticscourse.com
19.
20. in related fields
this is becoming
increasingly common
access more at legalanalyticscourse.com
21.
22. it is even becoming
common among
those who use
statistical and other
methods to forecast
access more at legalanalyticscourse.com
23. Poll Aggregation is one form of ensemble where
the learning question is to determine how much
weight (if any) to assign to each individual poll
24. selecting the
ulitmate ensemble
of polls requires
grading the
historical
performance
of each pollster
access more at legalanalyticscourse.com
44. LearningTask = Determine Whether a Given
Document is Relevant?
Relevant
Not Relevant
f( )
relevance?
Binary Classification (Supervised Learning)
and/or
010
101
001
access more at legalanalyticscourse.com
45. take the sample set as
a training set and
use human experts
access more at legalanalyticscourse.com
46. the use of the human
experts is called
“supervised learning”
access more at legalanalyticscourse.com
47. in the simple binary case,
ask humans to assign
objects to two piles
access more at legalanalyticscourse.com
59. machine learning task is
trying to recover (learn)
what separates the
relevant from the
non-relevant documents
access more at legalanalyticscourse.com
60. once we learn the
rule / boundary
we can apply it to separate
the remain documents into
the two classes
access more at legalanalyticscourse.com
61. we want to take what we learn here
access more at legalanalyticscourse.com
62. we want to take what we learn here
access more at legalanalyticscourse.com
63. we want to take what we learn here
and apply it here
access more at legalanalyticscourse.com
70. < Bias vsVariance Tradeoff >
access more at legalanalyticscourse.com
71. The Bias vsVariance Tradeoff
“The problem of simultaneously minimizing the
bias (how accurate a model is across different
training sets) and variance of the model error
(how sensitive the model is to small changes in
training set).
Intuitively, it means that a model must be
chosen that at the same time captures the
regularities in its training data, but also
generalizes well to unseen data.”
access more at legalanalyticscourse.com
72. The Bias vsVariance Tradeoff
http://scott.fortmann-roe.com/docs/BiasVariance.html
Worst Case Scenario
Best Case Scenario
access more at legalanalyticscourse.com
73. Michael Clark citing Hastie, et al (2009)
access more at legalanalyticscourse.com
74.
75. < Precision vs Recall >
access more at legalanalyticscourse.com
78. Legal Analytics
Class 2 - Machine Learning for Lawyers
daniel martin katz
blog | ComputationalLegalStudies
corp | LexPredict
michael j bommarito
twitter | @computational
blog | ComputationalLegalStudies
corp | LexPredict
twitter | @mjbommar
more content available at legalanalyticscourse.com
site | danielmartinkatz.com site | bommaritollc.com