SlideShare uma empresa Scribd logo
1 de 34
Baixar para ler offline
Where	
  next?	
  	
  
Why	
  we	
  don’t	
  care	
  about	
  other	
  
fields	
  that	
  care	
  about	
  language	
  	
  
Eduard	
  Hovy	
  	
  
USC	
  Informa2on	
  Sciences	
  Ins2tute	
  
Marina	
  del	
  Rey,	
  CA	
  
hovy@isi.edu	
  
The	
  Sta2s2cs	
  Revolu2on	
  is	
  over…	
  
•  …when	
  EMNLP	
  2004	
  called	
  for	
  	
  
•  …when	
  even	
  Ken	
  Church	
  says	
  “enough	
  is	
  enough”	
  	
  
•  CL	
  has	
  become	
  a	
  field	
  of	
  code	
  monkeys:	
  	
  
–  Excellent	
  problem	
  analyzers	
  	
  
–  Excellent	
  large-­‐scale	
  data	
  managers	
  	
  
–  Excellent	
  machine	
  learning	
  appliers	
  	
  
–  Conference	
  papers	
  focus	
  on	
  how	
  to	
  do	
  something,	
  NOT	
  
why	
  it	
  is	
  important	
  or	
  why	
  the	
  method	
  works	
  
While	
  repor2ng	
  aggregate	
  percentages	
  remains	
  essen2al,	
  instruc2ve	
  research	
  
should	
  also	
  explain	
  a	
  model's	
  limita2ons	
  in	
  more	
  meaningful	
  ways.	
  	
  This	
  can	
  
be	
  as	
  simple	
  as	
  categorizing	
  error	
  sta2s2cs	
  by	
  finer-­‐grained	
  types	
  of	
  errors,	
  to	
  
reveal	
  specific	
  areas	
  of	
  model	
  limita2on.	
  	
  BeYer	
  yet,	
  we	
  seek	
  deeper	
  insight	
  
into	
  the	
  models’	
  inherent	
  representa2onal	
  biases,	
  in	
  the	
  form	
  of	
  qualita2ve	
  
theore2cal	
  analyses	
  	
  
	
  
So	
  what	
  are	
  we	
  going	
  to	
  do	
  now?	
  
•  Large	
  challenge	
  problems	
  remain:	
  
–  MT:	
  since	
  1950s,	
  s2ll	
  not	
  solved;	
  sta2s2cal	
  techniques	
  are	
  slowly	
  
reimplemen2ng	
  rule-­‐based	
  techniques,	
  up	
  the	
  Vauqois	
  Triangle	
  	
  
–  ASR:	
  since	
  1960s,	
  s2ll	
  not	
  solved:	
  terrible	
  performance	
  for	
  open-­‐domain	
  
vocab	
  when	
  untuned	
  
–  IR:	
  since	
  1970s,	
  s2ll	
  not	
  solved:	
  stuck	
  for	
  a	
  decade	
  at	
  0.45	
  F-­‐score.	
  	
  IR	
  
community	
  limps	
  along	
  year	
  by	
  year	
  searching	
  for	
  new	
  problem	
  	
  
–  IE:	
  since	
  1980s,	
  sta2c;	
  now	
  focusing	
  on	
  extrac2on	
  paYern	
  learning	
  and	
  
web	
  mining;	
  terrible	
  F-­‐score	
  on	
  complex	
  open-­‐ended	
  material	
  	
  
–  SummarizaCon:	
  since	
  1990s,	
  a	
  long	
  way	
  from	
  solved	
  for	
  non-­‐news	
  
genres;	
  few	
  new	
  ideas	
  since	
  early	
  2000s	
  	
  	
  
–  QA:	
  since	
  2000s,	
  only	
  factoids;	
  poor	
  performance	
  for	
  Why?	
  and	
  How?	
  	
  
–  Machine	
  Reading:	
  the	
  oldest	
  new	
  problem;	
  no	
  results	
  yet	
  	
  
•  Smaller	
  challenge	
  problems	
  invented	
  every	
  few	
  years:	
  	
  
–  WSD:	
  we	
  can	
  do	
  it,	
  but	
  we	
  don’t	
  know	
  how	
  to	
  define	
  good	
  word	
  senses	
  
–  SubjecCvity/senCment/opinion:	
  no-­‐one	
  knows	
  what	
  this	
  is	
  	
  	
  
–  Textual	
  Entailment:	
  diYo	
  	
  
To	
  become	
  famous,	
  you	
  used	
  to	
  
have	
  to	
  start	
  a	
  new	
  conference	
  	
  
(TINAP,	
  EMNLP,	
  RANLP,	
  PACLING…)	
  	
  
or	
  build	
  a	
  useful	
  piece	
  of	
  soiware	
  	
  
	
  (Brill	
  tagger,	
  Charnak	
  parser)	
  	
  
Now	
  you	
  create	
  a	
  new	
  challenge	
  
problem	
  and	
  corpus	
  	
  
What	
  our	
  field	
  lacks	
  	
  
•  Most	
  of	
  our	
  energy	
  today	
  is	
  spent	
  	
  	
  
–  hacking	
  away	
  at	
  applica2ons	
  or	
  challenge	
  problems	
  	
  
–  [and]	
  exploring	
  some	
  new	
  technique	
  (SVM,	
  Kernel	
  
methods,	
  LSA,	
  LDA…)	
  	
  
–  running	
  some	
  kind	
  of	
  evaluaCon	
  	
  
–  building	
  a	
  new	
  corpus	
  	
  	
  
	
  
•  Our	
  field	
  lacks	
  a	
  theory	
  	
  
	
  (or	
  even	
  mul2ple	
  theories)	
  	
  
–  We	
  really	
  are	
  code	
  monkeys,	
  typing	
   	
   	
   	
   	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  
away	
  without	
  a	
  Grand	
  Vision	
  	
  
So	
  where	
  should	
  we	
  turn?	
  	
  
•  Linguis2cs?	
  	
  
•  Psycholinguis2cs?	
  	
  
•  Cogni2ve	
  Science?	
  
•  Social	
  Sciences?	
  	
  
•  Informa2on	
  Theory?	
  	
  	
  	
  
Why	
  we	
  don’t	
  need	
  Linguis2cs	
  
	
  When	
  last	
  did	
  	
  you	
  	
  find	
  a	
  linguisCc	
  theory	
  helpful?	
  	
  
Problem	
  1:	
  Linguists	
  study	
  different	
  things	
  than	
  we	
  do:	
  	
  
•  We	
  work	
  with	
  the	
  average	
  case,	
  the	
  typical	
  phenomena.	
  	
  We	
  leave	
  
aberra2ons/excep2ons	
  either	
  for	
  special-­‐purpose	
  treatment	
  or	
  
ignore	
  them	
  	
  
•  Linguists	
  study	
  extreme/unusual/odd	
  phenomena,	
  to	
  gain	
  an	
  insight	
  
about	
  some	
  specific	
  phenomenon	
  	
  
–  They	
  produce	
  elegant	
  theore2cal	
  accounts,	
  representa2ons,	
  and	
  rules	
  	
  
–  They	
  do	
  not	
  produce	
  data,	
  numbers,	
  or	
  average	
  cases	
  	
  
•  So	
  linguists’	
  theories	
  are	
  [almost	
  always]	
  hard	
  to	
  implement,	
  
impossible	
  to	
  address	
  with	
  machine	
  learning,	
  and	
  mostly	
  simply	
  not	
  
relevant	
  to	
  our	
  problems	
  	
  
Problem	
  2:	
  Linguists’	
  methodology	
  is	
  suspect:	
  	
  
•  Linguists	
  annotate	
  during	
  theory	
  discovery/forma2on	
  	
  
–  Do	
  not	
  use	
  mul2ple	
  annotators	
  aierward	
  and	
  record	
  agreement	
  values	
  	
  
•  Result:	
  no	
  verifica2on	
  of	
  theore2cal	
  claims	
  via	
  annota2on!	
  	
  
–  (Seldom	
  give	
  any	
  verifica2on	
  at	
  all,	
  except	
  for	
  example	
  cases)	
  	
  
So:	
  we	
  can	
  help	
  the	
  linguists	
  by	
  
tes2ng	
  their	
  theories,	
  if	
  we	
  want	
  	
  
—	
  but	
  it’s	
  oien	
  a	
  waste	
  of	
  2me	
  	
  
since	
  they	
  don’t	
  like	
  us	
  	
  	
  
Why	
  we	
  don’t	
  need	
  Neurolinguis2cs	
  
	
  Learning	
  about	
  language	
  by	
  looking	
  at	
  firing	
  paRerns	
  in	
  the	
  brain	
  
is	
  like	
  learning	
  about	
  an	
  algorithm	
  by	
  looking	
  at	
  the	
  flickering	
  light	
  
paRerns	
  on	
  old-­‐style	
  computer	
  consoles	
  	
  
•  Problem:	
  The	
  architectures	
  of	
  the	
  brain	
  and	
  computer	
  differ	
  	
  
–  There’s	
  no	
  useful	
  info	
  for	
  us	
  in	
  purely	
  structural	
  info	
  about	
  the	
  brain	
  	
  
–  Can	
  we	
  infer	
  func0onal	
  lessons	
  from	
  the	
  brain?	
  	
  
•  Subtasks	
  X	
  and	
  Y	
  are	
  performed	
  in	
  different	
  parts	
  of	
  the	
  brain	
  —	
  perhaps	
  
they	
  should	
  be	
  dis2nct	
  code	
  modules	
  	
  	
  (e.g.,	
  syntax	
  independent	
  of	
  
seman2cs?)	
  	
  
•  Subtask	
  X	
  is	
  performed	
  before/aWer	
  subtask	
  Y	
  —	
  perhaps	
  that’s	
  important	
  
for	
  our	
  processing	
  	
  	
  (e.g.,	
  syntax	
  before	
  seman2cs?)	
  	
  
•  InformaCon	
  of	
  type	
  X	
  is	
  stored	
  elsewhere	
  from	
  info	
  of	
  type	
  Y	
  —	
  perhaps	
  we	
  
should	
  differen2ate	
  them	
  as	
  well	
  	
  	
  (e.g.,	
  word	
  sense	
  granularity?)	
  	
  
•  “We	
  need	
  you…You	
  make	
  things	
  really	
  work	
  that	
  the	
  rest	
  of	
  
us	
  simply	
  say	
  in	
  a	
  single	
  word”	
  —	
  our	
  invited	
  speaker	
  John	
  
Gabrieli	
  yesterday	
  	
  
Why	
  we	
  don’t	
  need	
  CogSci/
Psycholinguis2cs	
  
	
  Building	
  a	
  plane	
  that	
  flies	
  like	
  a	
  duck…	
  	
  
•  Problem:	
  Results	
  are	
  limited	
  by	
  the	
  crudeness	
  of	
  the	
  
experimental	
  methodology.	
  	
  They	
  have	
  essen2ally	
  only	
  
three	
  experimental	
  procedures	
  for	
  people:	
  	
  
–  ReacCon	
  Cme:	
  what	
  can	
  we	
  infer	
  from	
  how	
  long	
  it	
  takes	
  people	
  
to	
  respond.	
  	
  Gives	
  hints	
  about	
  complexity	
  of	
  task.	
  	
  How	
  does	
  
that	
  help	
  us?	
  	
  
–  Confusion:	
  what	
  we	
  can	
  infer	
  about	
  ‘cogni2ve	
  similarity’	
  of	
  
objects.	
  	
  Perhaps	
  useful	
  for	
  ontologies	
  and/or	
  word	
  senses?	
  	
  	
  
–  Forgeng:	
  what	
  can	
  can	
  infer	
  about	
  organiza2on	
  of	
  memory.	
  	
  
And	
  how	
  does	
  this	
  help	
  us?	
  	
  	
  
•  There’s	
  no	
  reason	
  to	
  believe	
  that	
  a	
  psychologically	
  /
cogni2vely	
  accurate	
  computa2onal	
  model	
  will	
  tell	
  us	
  much	
  
about	
  how	
  we	
  should	
  do	
  MT,	
  IR,	
  etc.	
  	
  Perhaps	
  the	
  opposite,	
  
even…	
  	
  	
  
Why	
  we	
  don’t	
  need	
  Info	
  Theory	
  
	
  …And	
  if	
  you	
  though	
  the	
  empirical	
  approach	
  has	
  gone	
  too	
  far…	
  	
  
•  Info	
  Theory	
  studies	
  accurate	
  transmission	
  of	
  info-­‐bearing	
  signals,	
  
and	
  measures	
  the	
  quan22es	
  or	
  loss	
  of	
  info	
  conveyed	
  	
  	
  
•  This	
  is	
  relevant	
  to	
  us,	
  but	
  not	
  directly:	
  
–  The	
  Sta2s2cs	
  Revolu2on	
  brought	
  Machine	
  Learning	
  (i.e.,	
  
measurements	
  of	
  effec2veness	
  of	
  nota2on	
  transforma2ons,	
  large-­‐
scale	
  models	
  of	
  language	
  (ngram	
  models),	
  measures	
  of	
  informa2onal	
  
similarity	
  (pmi,	
  etc.)	
  into	
  CL	
  	
  
–  These	
  no2ons	
  are	
  connected	
  to	
  Info	
  Theory	
  	
  
•  But	
  Info	
  Theory	
  is	
  too	
  incomplete/crude/’weak’	
  for	
  our	
  needs:	
  	
  
–  The	
  info	
  content	
  measure	
  (p	
  log	
  p)	
  is	
  a	
  generalized	
  score	
  of	
  
undifferen2ated	
  ‘informa2on’	
  	
  
–  We	
  need	
  more	
  specialized	
  measurements	
  of	
  specific	
  phenomena	
  
(parse	
  trees	
  built,	
  documents	
  returned,	
  labels	
  inserted,	
  etc.)	
  	
  
–  (what’s	
  more,	
  p	
  log	
  p	
  is	
  based	
  on	
  large-­‐scale	
  (averaged)	
  entropy;	
  no	
  
recogni2on	
  of	
  individual	
  user’s	
  knowledge	
  and/or	
  inferen2al	
  abili2es)	
  
Why	
  we	
  don’t	
  need	
  Social	
  Sciences	
  
	
  Here,	
  finally,	
  we’re	
  geng	
  some	
  connecCon!	
  	
  
•  Various	
  Social	
  Sciences	
  have	
  produced	
  results	
  and	
  
procedures	
  useful	
  for	
  us:	
  	
  	
  	
  
–  PoliCcal	
  Science:	
  Annota2on	
  of	
  goals,	
  autudes,	
  etc.	
  	
  
–  Psychology	
  and	
  related:	
  Lists	
  of	
  emo2ons,	
  autudes,	
  etc.	
  	
  
–  Sociology:	
  Annota2ons	
  of	
  interpersonal	
  behavior	
  types	
  
(dialogue,	
  flir2ng,	
  etc.)	
  	
  
–  HCI:	
  Studies	
  of	
  turn-­‐taking	
  and	
  ini2a2ve,	
  etc.	
  	
  
•  However,	
  the	
  methodology	
  is	
  also	
  oien	
  suspect:	
  	
  
–  Lists	
  of	
  terms	
  and	
  concept	
  simply	
  made	
  up,	
  annota2ons	
  
not	
  carefully	
  checked,	
  etc.	
  	
  
–  We	
  can	
  easily	
  read	
  their	
  papers	
  and	
  make	
  our	
  own	
  lists	
  	
  
–  But	
  we	
  could	
  enter	
  into	
  fruiwul	
  dialogue:	
  we	
  verify	
  their	
  
theories	
  and	
  they	
  give	
  us	
  theore2cal	
  bases	
  for	
  our	
  work	
  	
  	
  
What	
  we	
  need	
  	
  
1.	
  An	
  understanding	
  that	
  NLP	
  is	
  	
  
–  transforma2on	
  of	
  one	
  nota2on	
  into	
  another	
  	
  
–  with	
  some	
  inser2on	
  of	
  addi2onal	
  informa2on	
  	
  
–  with	
  the	
  purpose	
  of	
  modeling	
  human	
  communica2on	
  	
  
2.	
  Some	
  careful	
  thinking	
  about	
  an	
  overall	
  
computa2onal	
  theory	
  of	
  seman2cs,	
  anchored	
  on	
  
language	
  communica2on	
  	
  	
  
–  What	
  it	
  might	
  be	
  	
  
–  How	
  you	
  might	
  create	
  large	
  corpora	
  containing	
  it	
  	
  
–  How	
  the	
  different	
  NLP	
  applica2ons	
  might	
  use	
  it	
  to	
  
improve	
  	
  	
   (adverCsement:	
  code	
  monkeys	
  needed!)	
  
What	
  makes	
  us	
  unique	
  
We	
  are	
  a	
  hybrid	
  of	
  	
  
–  Computer	
  science:	
  methods	
  to	
  perform	
  nota2on	
  
transforma2ons	
  (Finite	
  State	
  Automata	
  etc.)	
  	
  
–  LinguisCcs:	
  phenomena	
  of	
  language	
  	
  	
  
–  StaCsCcs/machine	
  learning:	
  obtaining	
  necessary	
  
parameters	
  and	
  data	
  	
  	
  
–  Philosophy	
  /	
  logic:	
  understanding	
  the	
  nature	
  of	
  
meaning	
  and	
  communica2on	
  	
  
–  CogniCve	
  psychology:	
  seeing	
  what	
  humans	
  do	
  	
  	
  
–  InformaCon	
  theory	
  (eventually):	
  measuring	
  efficiency	
  
of	
  transforma2ons	
  and	
  communica2on	
  	
  	
  
	
  Our	
  theories	
  will	
  not	
  look	
  like	
  anyone	
  else’s.	
  	
  
 
	
  
	
  
…so,	
  it’s	
  up	
  to	
  us	
  	
  
	
  
	
  
THANK	
  YOU	
  
Where next? 
Why we should take into account
other fields that seriously care about
language and communication 	

Michael Zock	

C.N.R.S. – L.I.F.	

Marseille, France	

zock@free.fr
From theory to practice	

After two decades of suffering from theory-driven,
formal arm-chair-linguistics (symbolic approach), the
computational linguistic community has moved towards
data-driven corpus processing (statistical approach). 	

	

While this change has led in some areas to spectacular
progress, conforting the conclusion that we had moved
from toy worlds and made-up examples to the real
world, it has also produced a number of negative side-
effects	
  	
  
At least 3 consequences	

	

➟ change of perspective, sometimes at odds with reality 	

➟ a certain kind of inbreeding	

➟ overemphasis of evaluation
Change of perspectives	

A subtle change of research goals  has taken place
which is somehow at odds with reality, i.e. practical
needs. Many of the traditional tasks are hardly
addressed anymore: 	

	

➟ knowledge representation, 	

➟ multimedia processing (integration of linguistic + visual information) 	

➟ user modeling, 	

➟ computational cognitive modeling, 	

➟ NLP and education, 	

➟  etc.
Inbreeding	

Many of the traditional disciplines don’t seem to be relevant
anymore. For example, a study done by Reiter (2007) revealed
that the works done by psychologists and linguists cited in major
CL conferences and journals has dropped respectively from 13%
and 15% to 5% and 1% in the period of 1995 and 2005. 	

This decrease of citing psychological research is all the more
surprising, as the field has made substantial progress, partially
due to a more widespread availability of new technology (eye
trackers, brain scanners). 	

While this fact seems to be ignored by our community, it is
clearly acknowledged in the area of AI. Indeed, 25% of the work
cited in the highly respected AI journal, refers to psychologists’
work. In other words, the AI community shows considerably
more sensitivity and awereness of the work produced by the
broader language research community than the CL community
(25% vs. 5%).
July	
  25,	
  2004	
   EMNLP-­‐2004	
  	
  Senseval-­‐2004	
   19	
  
When will we see the last non-statistical
paper, 2010? (slide from Ken Church)	

0%
20%
40%
60%
80%
100%
1985
1990
1995
2000
2005
ACL Meeting
%Statistical
Papers
Bob Moore Fred Jelinek
Overemphasis on evaluation
schemata	
  	
  	

Evaluation schemata have reached their limits and are not fully
adequate: In many areas we seem to have come too close to the limits
of an approach to expect substantial gains from it. Hence we shouldn’t
use it alone or as the main approch anymore. 	

Having noted that our community tends to forget to draw attention to
the work done by those on whose shoulders they stand, Y. Wilks (2008)
ends his CL lifetime achievement award by writing 	

« … we need real ideas and innovations, and now may be a time for
fresh ones. We have stood on the shoulders of Fred Jelinek, Ken
Church, and others for nearly two decades now, and the strain is
beginning to tell as papers still strive to gain that extra 1% in their
scores on some small task. We know that some change is in the air and
I have tried to hint … to some of the places where that might be, even if
that will mean a partial return to older, unfashionable ideas; for there is
nothing new under the sun. »
And Yorick is not alone	
  	
  	

« it is now essential to play down the present narrowly-defined
performance measures in order to address the task context, and
specifically the role of the human participant in the task, so that
new measures, of larger value, can be developed and applied.’	

Spärck Jones K. Automatic language and information processing : rethinking
evaluation. Natural Language Engineering, chapter 7, p. 1-18, 2001	

	

So we need to be alert. It's not just that we may find ourselves
putting the cart before the horse. We can get obsessed with the
wheels, and finish up with uncritically reinvented, or square, or
over-refined or otherwise unsatisfactory wheels, or even just
unicycles. 	

Spärck Jones, K. Computational linguistics: What about the linguistics? 	

Comput. Linguist., 33(3):437–441, 2007.
So where should we turn to? 	

➟ Linguistics? 	

➟ Psycholinguistics? 	

➟ Cognitive Science?	

➟ Social Sciences? 	

➟ Information Theory? 	

	

Ed’s conclusions: 	

➟ We don’t need any of these. 	

➟ We lack vision and theory
Some gut reactions	

condescending attitude?	

Chinese room?
Linguistics	

Ed is right to some extent, but possibly for the wrong reasons.	

	

He is right	

➟ Linguists have indeed often dealt with borderline cases.	

➟ Their theories are hardly ever based on corpora, i.e. real world
data	

➟ Dealing only with competency, linguists have never cared to
take human constraints into account (short-term memory, etc.). Hence
they fail to provide a psychologically adequate model.	

➟ Linguists don’t care for the needs of other disciplines (problem
of metalanguage) and they have often been syntacto-centristic.
Linguistics	

I’d say, he is wrong	

➟ We do need linguistics, as at least half of the language is rule-
based, which makes it learnable within a reasonable time frame.
Of course, this is also relevant for CL.	

➟ They have not addressed but extreme cases, as otherwise they
wouldn’t have been able to write grammars, which account for
regularities of a substantial subset of language	

➟  There are different kinds of linguistics. Hence the question,
what kind of linguistic description do we need (procedural)?	

➟ Also, there a some linguists who do take psycholinguistic and
neurolinguistic factors into account (Jackendoff, Lamb, Hudson), and
of course, there are cognitive linguists (Fillmore) and
lexicographers, who, working on corpora, do cognitive linguistics
(Pustejovsky and Hanks: Concept Pattern Analysis).
Psycholinguistics	

Indeed, psychology does not have to say much about MT and IR, since
these topics have been outside of their scope, but they do have to say
quite a bit on sentence processing (alas, much less though, about discourse).	

Psycholinguistics is definitely useful, especially if you are interested in
man-machine communication (support people to process language). In this
case it helps a lot to know how people process information, how they
represent, store (organisation) and access information (dictionaries)	

WN is the brainchild of a psychologist (G.A. Miller). It is clearly based
on information concerning the human mind and its functioning
(associations), but so is the web (Berner Lee) which is inspired by notions
such as association and navigation, the latter corresponding to
spreading-activation.	

Notions like patterns, omnipresent in CL have long been used in
psychology (Gestalt psychology). And what we know about memory is
crucial for information processing (why do we chunk graphs into sentence
frames?).	

	
  
Neurosciences and neurolinguistics	

If we didn’t have a theory, we wouldn’t be able to interprete
properly patterns and correlations between them. 	

This information often comes from psychology and linguistics, be
it for the different levels or units   (meaning-form-sound) or the
general architecture and the information flow.	

Ed’s question: can we infer functional lessons from the brain?	

Yes, concerning modularity, flexibility of processing, possible
recovery despite incomplete information.
Sociolinguistics	

Contrary to what one might think, Ed provides more reasons in
favor of sociolinguistics than against it, and I couldn’t but agree
more with him.	

	

It has always struck me that so little attention has been paid by
CL to this kind of work. Yet, if we want to deal not only with
language, but also with communication (language being only a
means), than we have to take a closer look at the context in which
communication takes place, in order to determine the different
social parameters (place, context, people) affecting the choice of a
specific linguistic resource. For example, when shall we highlight
or downplay some information (active voice vs. passive voice;
independant vs. subordinate clause, etc.), and if so, how is this done in a
given language?
What we may get from them and 
what they may need take into account?	
  	
  	

What we can get from them	

➟ inspiration, no doubt;	

➟ ways to interprete our data;	

➟ information allowing us to build our own theory;	

➟  functionality: it would be absurd to simulate on a computer
what has been devised for a completely different hardware, with
different constraints (paper vs. brain, or, birds vs. airplanes). Never-
theless, at a functional level, there are clearly equivalences (brain,
i.e. neuronal networks and semantic networks, spreading-activation and search/
navigation), since we pursue similar goals.	

	

What they may need take into account	

➟ agree on annotations	

➟ work on real world data (corpus linguistics)
Mutual benefits, i.e. added value	

Build truly semiotic extensions, i.e. tools supporting people
to	

➟ think (problem solving), 	

➟ process language and 	

➟ communicate (both in L1 and L2). 	

	

How come that so few applications have been built to
support NLP (authoring tools) and education?
You've always got a choice, but it may be in your
own interest to step back and think first about its
adequacy, price, and consequences.

	

1° goals, i.e. needs
2° methods	

3° results	

4° adequacy/evaluation
5° price/consequences	
  
Conclusion + possible recommendations	

We do need the knowledge accumulated in other domains	

Some possible recommendations of how to take advantage of it	

•  broaden the range of papers in ACL conferences and Computational
Linguistics ;	

•  put more emphasis on the poster track ; 	

•  include a broader range of people in journal editorial boards and
conference PCs ;	

•  include a broader range of workshops, tutorials, and invited talks at
ACL conferences; 	

•  have survey papers in Computational Linguistics on other areas of
language-related research ; 	

•  offer a hands-on approach in smaller events like summerschools ;	

•  organize real WORKshops, possibly led by invited speakers ;	

•  support brainstorming by allowing the presentation of ideas (possibly
even if they are not implemented or evaluated) and work describing real
world applications.
July	
  25,	
  2004	
   EMNLP-­‐2004	
  	
  Senseval-­‐2004	
   33	
  
Grand	
  Challenges	
  
ip://ip.cordis.lu/pub/ist/docs/istag040319-­‐drainotesoihemee2ng.pdf	
  	
  
Thanks for their inspiration	

•  Reiter, E. (2007) The Shrinking Horizons of Computational
Linguistics. Computational Linguistics Vol. 33, No. 2, Pages
283—287	

•  Spärck Jones, K. Computational linguistics: What about the
linguistics? Comput. Linguist., 33(3):437–441, 2007.	

•  Spärck Jones K. Automatic language and information
processing : rethinking evaluation. Natural Language
Engineering, chapter 7, p. 1-18, 2001	

•  Wilks, Y. (2008) On Whose Shoulders? Computational
Linguistics 34 (4) 471-486.	

•  Wintner, S. (2009), What Science Underlies Natural Language
Engineering? Computational Linguistics December 2009, Vol.
35, No. 4: 641–644

Mais conteúdo relacionado

Destaque

EL CARMEN SCHOOL. FOOD
EL CARMEN SCHOOL. FOODEL CARMEN SCHOOL. FOOD
EL CARMEN SCHOOL. FOODsoniapr30
 
The need for Water Sanitaion
The need for Water SanitaionThe need for Water Sanitaion
The need for Water SanitaionEwan Cameron
 
Oregon weight loss
Oregon weight loss Oregon weight loss
Oregon weight loss mrmikesd
 
Patent Basics Presentation Mesa Thinkspot 2016
Patent Basics Presentation Mesa Thinkspot 2016Patent Basics Presentation Mesa Thinkspot 2016
Patent Basics Presentation Mesa Thinkspot 2016statelibaz
 
圣严法师《佛教入门》
圣严法师《佛教入门》圣严法师《佛教入门》
圣严法师《佛教入门》walkmankim
 
Skriva för webben - E-business 2.0
Skriva för webben - E-business 2.0Skriva för webben - E-business 2.0
Skriva för webben - E-business 2.0Kajsa Snickars
 
Conventions of my magazine question 1
Conventions of my magazine question 1Conventions of my magazine question 1
Conventions of my magazine question 1Charlotte Bowerman
 
pojarnaya bezopasnost
pojarnaya bezopasnostpojarnaya bezopasnost
pojarnaya bezopasnostmdou_142
 
bezopasnost v internete
bezopasnost v internetebezopasnost v internete
bezopasnost v internetemdou_142
 
αντιγονη
αντιγονηαντιγονη
αντιγονηekidrou
 
圣严法师《学佛群疑》
圣严法师《学佛群疑》圣严法师《学佛群疑》
圣严法师《学佛群疑》walkmankim
 
《枯木开花:圣严法师传》施叔青着
《枯木开花:圣严法师传》施叔青着《枯木开花:圣严法师传》施叔青着
《枯木开花:圣严法师传》施叔青着walkmankim
 
Dhammananda <the>
Dhammananda <the>Dhammananda <the>
Dhammananda <the>walkmankim
 
虚云大师《禅修入门》
虚云大师《禅修入门》虚云大师《禅修入门》
虚云大师《禅修入门》walkmankim
 
Impressionisme informàtica
Impressionisme informàticaImpressionisme informàtica
Impressionisme informàticatorragrau
 
Ras2000 acueductos
Ras2000 acueductosRas2000 acueductos
Ras2000 acueductosmariale06
 

Destaque (20)

EL CARMEN SCHOOL. FOOD
EL CARMEN SCHOOL. FOODEL CARMEN SCHOOL. FOOD
EL CARMEN SCHOOL. FOOD
 
The need for Water Sanitaion
The need for Water SanitaionThe need for Water Sanitaion
The need for Water Sanitaion
 
Oregon weight loss
Oregon weight loss Oregon weight loss
Oregon weight loss
 
Patent Basics Presentation Mesa Thinkspot 2016
Patent Basics Presentation Mesa Thinkspot 2016Patent Basics Presentation Mesa Thinkspot 2016
Patent Basics Presentation Mesa Thinkspot 2016
 
圣严法师《佛教入门》
圣严法师《佛教入门》圣严法师《佛教入门》
圣严法师《佛教入门》
 
Skriva för webben - E-business 2.0
Skriva för webben - E-business 2.0Skriva för webben - E-business 2.0
Skriva för webben - E-business 2.0
 
Conventions of my magazine question 1
Conventions of my magazine question 1Conventions of my magazine question 1
Conventions of my magazine question 1
 
Christmas
ChristmasChristmas
Christmas
 
pojarnaya bezopasnost
pojarnaya bezopasnostpojarnaya bezopasnost
pojarnaya bezopasnost
 
Comparing conventions
Comparing conventionsComparing conventions
Comparing conventions
 
bezopasnost v internete
bezopasnost v internetebezopasnost v internete
bezopasnost v internete
 
αντιγονη
αντιγονηαντιγονη
αντιγονη
 
圣严法师《学佛群疑》
圣严法师《学佛群疑》圣严法师《学佛群疑》
圣严法师《学佛群疑》
 
《枯木开花:圣严法师传》施叔青着
《枯木开花:圣严法师传》施叔青着《枯木开花:圣严法师传》施叔青着
《枯木开花:圣严法师传》施叔青着
 
Dhammananda <the>
Dhammananda <the>Dhammananda <the>
Dhammananda <the>
 
虚云大师《禅修入门》
虚云大师《禅修入门》虚云大师《禅修入门》
虚云大师《禅修入门》
 
Benefit Bulletin February 2014
Benefit Bulletin February 2014Benefit Bulletin February 2014
Benefit Bulletin February 2014
 
Impressionisme informàtica
Impressionisme informàticaImpressionisme informàtica
Impressionisme informàtica
 
TASTE
TASTETASTE
TASTE
 
Ras2000 acueductos
Ras2000 acueductosRas2000 acueductos
Ras2000 acueductos
 

Semelhante a Why Computational Linguistics Needs a Grand Theory

CS Education for All. A new wave of opportunity
CS Education for All. A new wave of opportunityCS Education for All. A new wave of opportunity
CS Education for All. A new wave of opportunityPeter Donaldson
 
Bad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' Symposium
Bad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' SymposiumBad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' Symposium
Bad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' Symposiumrichpaige
 
Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Miningebelani
 
UbD and Technology
UbD and TechnologyUbD and Technology
UbD and Technologyehelfant
 
Starting a career in data science
Starting a career in data scienceStarting a career in data science
Starting a career in data scienceBrian Spiering
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning AnalyticsXavier Ochoa
 
Apf reflections scientific temper
Apf reflections   scientific temperApf reflections   scientific temper
Apf reflections scientific temperET&D
 
Mauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshopMauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshopCosmoAIMS Bassett
 
Computational Thinking - a 4 step approach and a new pedagogy
Computational Thinking - a 4 step approach and a new pedagogyComputational Thinking - a 4 step approach and a new pedagogy
Computational Thinking - a 4 step approach and a new pedagogyPaul Herring
 
Knowledge base system appl. p 1,2-ver1
Knowledge base system appl.  p 1,2-ver1Knowledge base system appl.  p 1,2-ver1
Knowledge base system appl. p 1,2-ver1Taymoor Nazmy
 
Artificial Intelligence
Artificial Intelligence Artificial Intelligence
Artificial Intelligence Prasad Kulkarni
 
E-learning technologies - FREE TUTORIAL
E-learning technologies  - FREE TUTORIALE-learning technologies  - FREE TUTORIAL
E-learning technologies - FREE TUTORIALBIMS e.V.
 
Mnemonics and self-pedagogics for law students - Passing the bar with two wee...
Mnemonics and self-pedagogics for law students - Passing the bar with two wee...Mnemonics and self-pedagogics for law students - Passing the bar with two wee...
Mnemonics and self-pedagogics for law students - Passing the bar with two wee...T. Alexander Puutio
 
Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...Christian Bokhove
 

Semelhante a Why Computational Linguistics Needs a Grand Theory (20)

CS Education for All. A new wave of opportunity
CS Education for All. A new wave of opportunityCS Education for All. A new wave of opportunity
CS Education for All. A new wave of opportunity
 
Bad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' Symposium
Bad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' SymposiumBad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' Symposium
Bad Modelling Teaching Practices: Invited talk at MoDELS'14 Educators' Symposium
 
Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Mining
 
UbD and Technology
UbD and TechnologyUbD and Technology
UbD and Technology
 
Jackson - 22nd Century Librarians and the death of information skills
Jackson - 22nd Century Librarians and the death of information skillsJackson - 22nd Century Librarians and the death of information skills
Jackson - 22nd Century Librarians and the death of information skills
 
Introduction to ai
Introduction to aiIntroduction to ai
Introduction to ai
 
Eurodidaweb2012 03-13
Eurodidaweb2012 03-13Eurodidaweb2012 03-13
Eurodidaweb2012 03-13
 
Starting a career in data science
Starting a career in data scienceStarting a career in data science
Starting a career in data science
 
Multimodal Learning Analytics
Multimodal Learning AnalyticsMultimodal Learning Analytics
Multimodal Learning Analytics
 
Apf reflections scientific temper
Apf reflections   scientific temperApf reflections   scientific temper
Apf reflections scientific temper
 
Eurodidaweb2012 03-13
Eurodidaweb2012 03-13Eurodidaweb2012 03-13
Eurodidaweb2012 03-13
 
NUS PhD e-open day 2020
NUS PhD e-open day 2020NUS PhD e-open day 2020
NUS PhD e-open day 2020
 
Mauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshopMauritius Big Data and Machine Learning JEDI workshop
Mauritius Big Data and Machine Learning JEDI workshop
 
Computational Thinking - a 4 step approach and a new pedagogy
Computational Thinking - a 4 step approach and a new pedagogyComputational Thinking - a 4 step approach and a new pedagogy
Computational Thinking - a 4 step approach and a new pedagogy
 
Knowledge base system appl. p 1,2-ver1
Knowledge base system appl.  p 1,2-ver1Knowledge base system appl.  p 1,2-ver1
Knowledge base system appl. p 1,2-ver1
 
Artificial Intelligence
Artificial Intelligence Artificial Intelligence
Artificial Intelligence
 
E-learning technologies - FREE TUTORIAL
E-learning technologies  - FREE TUTORIALE-learning technologies  - FREE TUTORIAL
E-learning technologies - FREE TUTORIAL
 
Data-X-v3.1
Data-X-v3.1Data-X-v3.1
Data-X-v3.1
 
Mnemonics and self-pedagogics for law students - Passing the bar with two wee...
Mnemonics and self-pedagogics for law students - Passing the bar with two wee...Mnemonics and self-pedagogics for law students - Passing the bar with two wee...
Mnemonics and self-pedagogics for law students - Passing the bar with two wee...
 
Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...
 

Último

Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸mathanramanathan2005
 
Anne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptxAnne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptxnoorehahmad
 
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...marjmae69
 
Dutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular PlasticsDutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular PlasticsDutch Power
 
Early Modern Spain. All about this period
Early Modern Spain. All about this periodEarly Modern Spain. All about this period
Early Modern Spain. All about this periodSaraIsabelJimenez
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Krijn Poppe
 
Chizaram's Women Tech Makers Deck. .pptx
Chizaram's Women Tech Makers Deck.  .pptxChizaram's Women Tech Makers Deck.  .pptx
Chizaram's Women Tech Makers Deck. .pptxogubuikealex
 
Quality by design.. ppt for RA (1ST SEM
Quality by design.. ppt for  RA (1ST SEMQuality by design.. ppt for  RA (1ST SEM
Quality by design.. ppt for RA (1ST SEMCharmi13
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxmavinoikein
 
Event 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxEvent 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxaryanv1753
 
miladyskindiseases-200705210221 2.!!pptx
miladyskindiseases-200705210221 2.!!pptxmiladyskindiseases-200705210221 2.!!pptx
miladyskindiseases-200705210221 2.!!pptxCarrieButtitta
 
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...Henrik Hanke
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringSebastiano Panichella
 
Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170Escort Service
 
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRachelAnnTenibroAmaz
 
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRRINDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRRsarwankumar4524
 
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comSaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comsaastr
 
SBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSebastiano Panichella
 
Genshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxGenshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxJohnree4
 
The Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationThe Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationNathan Young
 

Último (20)

Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸Mathan flower ppt.pptx slide orchids ✨🌸
Mathan flower ppt.pptx slide orchids ✨🌸
 
Anne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptxAnne Frank A Beacon of Hope amidst darkness ppt.pptx
Anne Frank A Beacon of Hope amidst darkness ppt.pptx
 
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
Gaps, Issues and Challenges in the Implementation of Mother Tongue Based-Mult...
 
Dutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular PlasticsDutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
Dutch Power - 26 maart 2024 - Henk Kras - Circular Plastics
 
Early Modern Spain. All about this period
Early Modern Spain. All about this periodEarly Modern Spain. All about this period
Early Modern Spain. All about this period
 
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
Presentation for the Strategic Dialogue on the Future of Agriculture, Brussel...
 
Chizaram's Women Tech Makers Deck. .pptx
Chizaram's Women Tech Makers Deck.  .pptxChizaram's Women Tech Makers Deck.  .pptx
Chizaram's Women Tech Makers Deck. .pptx
 
Quality by design.. ppt for RA (1ST SEM
Quality by design.. ppt for  RA (1ST SEMQuality by design.. ppt for  RA (1ST SEM
Quality by design.. ppt for RA (1ST SEM
 
Work Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptxWork Remotely with Confluence ACE 2.pptx
Work Remotely with Confluence ACE 2.pptx
 
Event 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptxEvent 4 Introduction to Open Source.pptx
Event 4 Introduction to Open Source.pptx
 
miladyskindiseases-200705210221 2.!!pptx
miladyskindiseases-200705210221 2.!!pptxmiladyskindiseases-200705210221 2.!!pptx
miladyskindiseases-200705210221 2.!!pptx
 
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
DGT @ CTAC 2024 Valencia: Most crucial invest to digitalisation_Sven Zoelle_v...
 
The 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software EngineeringThe 3rd Intl. Workshop on NL-based Software Engineering
The 3rd Intl. Workshop on NL-based Software Engineering
 
Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170Call Girls In Aerocity 🤳 Call Us +919599264170
Call Girls In Aerocity 🤳 Call Us +919599264170
 
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATIONRACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
RACHEL-ANN M. TENIBRO PRODUCT RESEARCH PRESENTATION
 
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRRINDIAN GCP GUIDELINE. for Regulatory  affair 1st sem CRR
INDIAN GCP GUIDELINE. for Regulatory affair 1st sem CRR
 
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.comSaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
SaaStr Workshop Wednesday w/ Kyle Norton, Owner.com
 
SBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation TrackSBFT Tool Competition 2024 -- Python Test Case Generation Track
SBFT Tool Competition 2024 -- Python Test Case Generation Track
 
Genshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptxGenshin Impact PPT Template by EaTemp.pptx
Genshin Impact PPT Template by EaTemp.pptx
 
The Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism PresentationThe Ten Facts About People With Autism Presentation
The Ten Facts About People With Autism Presentation
 

Why Computational Linguistics Needs a Grand Theory

  • 1. Where  next?     Why  we  don’t  care  about  other   fields  that  care  about  language     Eduard  Hovy     USC  Informa2on  Sciences  Ins2tute   Marina  del  Rey,  CA   hovy@isi.edu  
  • 2. The  Sta2s2cs  Revolu2on  is  over…   •  …when  EMNLP  2004  called  for     •  …when  even  Ken  Church  says  “enough  is  enough”     •  CL  has  become  a  field  of  code  monkeys:     –  Excellent  problem  analyzers     –  Excellent  large-­‐scale  data  managers     –  Excellent  machine  learning  appliers     –  Conference  papers  focus  on  how  to  do  something,  NOT   why  it  is  important  or  why  the  method  works   While  repor2ng  aggregate  percentages  remains  essen2al,  instruc2ve  research   should  also  explain  a  model's  limita2ons  in  more  meaningful  ways.    This  can   be  as  simple  as  categorizing  error  sta2s2cs  by  finer-­‐grained  types  of  errors,  to   reveal  specific  areas  of  model  limita2on.    BeYer  yet,  we  seek  deeper  insight   into  the  models’  inherent  representa2onal  biases,  in  the  form  of  qualita2ve   theore2cal  analyses      
  • 3. So  what  are  we  going  to  do  now?   •  Large  challenge  problems  remain:   –  MT:  since  1950s,  s2ll  not  solved;  sta2s2cal  techniques  are  slowly   reimplemen2ng  rule-­‐based  techniques,  up  the  Vauqois  Triangle     –  ASR:  since  1960s,  s2ll  not  solved:  terrible  performance  for  open-­‐domain   vocab  when  untuned   –  IR:  since  1970s,  s2ll  not  solved:  stuck  for  a  decade  at  0.45  F-­‐score.    IR   community  limps  along  year  by  year  searching  for  new  problem     –  IE:  since  1980s,  sta2c;  now  focusing  on  extrac2on  paYern  learning  and   web  mining;  terrible  F-­‐score  on  complex  open-­‐ended  material     –  SummarizaCon:  since  1990s,  a  long  way  from  solved  for  non-­‐news   genres;  few  new  ideas  since  early  2000s       –  QA:  since  2000s,  only  factoids;  poor  performance  for  Why?  and  How?     –  Machine  Reading:  the  oldest  new  problem;  no  results  yet     •  Smaller  challenge  problems  invented  every  few  years:     –  WSD:  we  can  do  it,  but  we  don’t  know  how  to  define  good  word  senses   –  SubjecCvity/senCment/opinion:  no-­‐one  knows  what  this  is       –  Textual  Entailment:  diYo     To  become  famous,  you  used  to   have  to  start  a  new  conference     (TINAP,  EMNLP,  RANLP,  PACLING…)     or  build  a  useful  piece  of  soiware      (Brill  tagger,  Charnak  parser)     Now  you  create  a  new  challenge   problem  and  corpus    
  • 4. What  our  field  lacks     •  Most  of  our  energy  today  is  spent       –  hacking  away  at  applica2ons  or  challenge  problems     –  [and]  exploring  some  new  technique  (SVM,  Kernel   methods,  LSA,  LDA…)     –  running  some  kind  of  evaluaCon     –  building  a  new  corpus         •  Our  field  lacks  a  theory      (or  even  mul2ple  theories)     –  We  really  are  code  monkeys,  typing                             away  without  a  Grand  Vision    
  • 5. So  where  should  we  turn?     •  Linguis2cs?     •  Psycholinguis2cs?     •  Cogni2ve  Science?   •  Social  Sciences?     •  Informa2on  Theory?        
  • 6. Why  we  don’t  need  Linguis2cs    When  last  did    you    find  a  linguisCc  theory  helpful?     Problem  1:  Linguists  study  different  things  than  we  do:     •  We  work  with  the  average  case,  the  typical  phenomena.    We  leave   aberra2ons/excep2ons  either  for  special-­‐purpose  treatment  or   ignore  them     •  Linguists  study  extreme/unusual/odd  phenomena,  to  gain  an  insight   about  some  specific  phenomenon     –  They  produce  elegant  theore2cal  accounts,  representa2ons,  and  rules     –  They  do  not  produce  data,  numbers,  or  average  cases     •  So  linguists’  theories  are  [almost  always]  hard  to  implement,   impossible  to  address  with  machine  learning,  and  mostly  simply  not   relevant  to  our  problems     Problem  2:  Linguists’  methodology  is  suspect:     •  Linguists  annotate  during  theory  discovery/forma2on     –  Do  not  use  mul2ple  annotators  aierward  and  record  agreement  values     •  Result:  no  verifica2on  of  theore2cal  claims  via  annota2on!     –  (Seldom  give  any  verifica2on  at  all,  except  for  example  cases)     So:  we  can  help  the  linguists  by   tes2ng  their  theories,  if  we  want     —  but  it’s  oien  a  waste  of  2me     since  they  don’t  like  us      
  • 7. Why  we  don’t  need  Neurolinguis2cs    Learning  about  language  by  looking  at  firing  paRerns  in  the  brain   is  like  learning  about  an  algorithm  by  looking  at  the  flickering  light   paRerns  on  old-­‐style  computer  consoles     •  Problem:  The  architectures  of  the  brain  and  computer  differ     –  There’s  no  useful  info  for  us  in  purely  structural  info  about  the  brain     –  Can  we  infer  func0onal  lessons  from  the  brain?     •  Subtasks  X  and  Y  are  performed  in  different  parts  of  the  brain  —  perhaps   they  should  be  dis2nct  code  modules      (e.g.,  syntax  independent  of   seman2cs?)     •  Subtask  X  is  performed  before/aWer  subtask  Y  —  perhaps  that’s  important   for  our  processing      (e.g.,  syntax  before  seman2cs?)     •  InformaCon  of  type  X  is  stored  elsewhere  from  info  of  type  Y  —  perhaps  we   should  differen2ate  them  as  well      (e.g.,  word  sense  granularity?)     •  “We  need  you…You  make  things  really  work  that  the  rest  of   us  simply  say  in  a  single  word”  —  our  invited  speaker  John   Gabrieli  yesterday    
  • 8. Why  we  don’t  need  CogSci/ Psycholinguis2cs    Building  a  plane  that  flies  like  a  duck…     •  Problem:  Results  are  limited  by  the  crudeness  of  the   experimental  methodology.    They  have  essen2ally  only   three  experimental  procedures  for  people:     –  ReacCon  Cme:  what  can  we  infer  from  how  long  it  takes  people   to  respond.    Gives  hints  about  complexity  of  task.    How  does   that  help  us?     –  Confusion:  what  we  can  infer  about  ‘cogni2ve  similarity’  of   objects.    Perhaps  useful  for  ontologies  and/or  word  senses?       –  Forgeng:  what  can  can  infer  about  organiza2on  of  memory.     And  how  does  this  help  us?       •  There’s  no  reason  to  believe  that  a  psychologically  / cogni2vely  accurate  computa2onal  model  will  tell  us  much   about  how  we  should  do  MT,  IR,  etc.    Perhaps  the  opposite,   even…      
  • 9. Why  we  don’t  need  Info  Theory    …And  if  you  though  the  empirical  approach  has  gone  too  far…     •  Info  Theory  studies  accurate  transmission  of  info-­‐bearing  signals,   and  measures  the  quan22es  or  loss  of  info  conveyed       •  This  is  relevant  to  us,  but  not  directly:   –  The  Sta2s2cs  Revolu2on  brought  Machine  Learning  (i.e.,   measurements  of  effec2veness  of  nota2on  transforma2ons,  large-­‐ scale  models  of  language  (ngram  models),  measures  of  informa2onal   similarity  (pmi,  etc.)  into  CL     –  These  no2ons  are  connected  to  Info  Theory     •  But  Info  Theory  is  too  incomplete/crude/’weak’  for  our  needs:     –  The  info  content  measure  (p  log  p)  is  a  generalized  score  of   undifferen2ated  ‘informa2on’     –  We  need  more  specialized  measurements  of  specific  phenomena   (parse  trees  built,  documents  returned,  labels  inserted,  etc.)     –  (what’s  more,  p  log  p  is  based  on  large-­‐scale  (averaged)  entropy;  no   recogni2on  of  individual  user’s  knowledge  and/or  inferen2al  abili2es)  
  • 10. Why  we  don’t  need  Social  Sciences    Here,  finally,  we’re  geng  some  connecCon!     •  Various  Social  Sciences  have  produced  results  and   procedures  useful  for  us:         –  PoliCcal  Science:  Annota2on  of  goals,  autudes,  etc.     –  Psychology  and  related:  Lists  of  emo2ons,  autudes,  etc.     –  Sociology:  Annota2ons  of  interpersonal  behavior  types   (dialogue,  flir2ng,  etc.)     –  HCI:  Studies  of  turn-­‐taking  and  ini2a2ve,  etc.     •  However,  the  methodology  is  also  oien  suspect:     –  Lists  of  terms  and  concept  simply  made  up,  annota2ons   not  carefully  checked,  etc.     –  We  can  easily  read  their  papers  and  make  our  own  lists     –  But  we  could  enter  into  fruiwul  dialogue:  we  verify  their   theories  and  they  give  us  theore2cal  bases  for  our  work      
  • 11. What  we  need     1.  An  understanding  that  NLP  is     –  transforma2on  of  one  nota2on  into  another     –  with  some  inser2on  of  addi2onal  informa2on     –  with  the  purpose  of  modeling  human  communica2on     2.  Some  careful  thinking  about  an  overall   computa2onal  theory  of  seman2cs,  anchored  on   language  communica2on       –  What  it  might  be     –  How  you  might  create  large  corpora  containing  it     –  How  the  different  NLP  applica2ons  might  use  it  to   improve       (adverCsement:  code  monkeys  needed!)  
  • 12. What  makes  us  unique   We  are  a  hybrid  of     –  Computer  science:  methods  to  perform  nota2on   transforma2ons  (Finite  State  Automata  etc.)     –  LinguisCcs:  phenomena  of  language       –  StaCsCcs/machine  learning:  obtaining  necessary   parameters  and  data       –  Philosophy  /  logic:  understanding  the  nature  of   meaning  and  communica2on     –  CogniCve  psychology:  seeing  what  humans  do       –  InformaCon  theory  (eventually):  measuring  efficiency   of  transforma2ons  and  communica2on        Our  theories  will  not  look  like  anyone  else’s.    
  • 13.       …so,  it’s  up  to  us         THANK  YOU  
  • 14. Where next? Why we should take into account other fields that seriously care about language and communication Michael Zock C.N.R.S. – L.I.F. Marseille, France zock@free.fr
  • 15. From theory to practice After two decades of suffering from theory-driven, formal arm-chair-linguistics (symbolic approach), the computational linguistic community has moved towards data-driven corpus processing (statistical approach). While this change has led in some areas to spectacular progress, conforting the conclusion that we had moved from toy worlds and made-up examples to the real world, it has also produced a number of negative side- effects    
  • 16. At least 3 consequences ➟ change of perspective, sometimes at odds with reality ➟ a certain kind of inbreeding ➟ overemphasis of evaluation
  • 17. Change of perspectives A subtle change of research goals  has taken place which is somehow at odds with reality, i.e. practical needs. Many of the traditional tasks are hardly addressed anymore: ➟ knowledge representation, ➟ multimedia processing (integration of linguistic + visual information) ➟ user modeling, ➟ computational cognitive modeling, ➟ NLP and education, ➟  etc.
  • 18. Inbreeding Many of the traditional disciplines don’t seem to be relevant anymore. For example, a study done by Reiter (2007) revealed that the works done by psychologists and linguists cited in major CL conferences and journals has dropped respectively from 13% and 15% to 5% and 1% in the period of 1995 and 2005. This decrease of citing psychological research is all the more surprising, as the field has made substantial progress, partially due to a more widespread availability of new technology (eye trackers, brain scanners). While this fact seems to be ignored by our community, it is clearly acknowledged in the area of AI. Indeed, 25% of the work cited in the highly respected AI journal, refers to psychologists’ work. In other words, the AI community shows considerably more sensitivity and awereness of the work produced by the broader language research community than the CL community (25% vs. 5%).
  • 19. July  25,  2004   EMNLP-­‐2004    Senseval-­‐2004   19   When will we see the last non-statistical paper, 2010? (slide from Ken Church) 0% 20% 40% 60% 80% 100% 1985 1990 1995 2000 2005 ACL Meeting %Statistical Papers Bob Moore Fred Jelinek
  • 20. Overemphasis on evaluation schemata     Evaluation schemata have reached their limits and are not fully adequate: In many areas we seem to have come too close to the limits of an approach to expect substantial gains from it. Hence we shouldn’t use it alone or as the main approch anymore. Having noted that our community tends to forget to draw attention to the work done by those on whose shoulders they stand, Y. Wilks (2008) ends his CL lifetime achievement award by writing « … we need real ideas and innovations, and now may be a time for fresh ones. We have stood on the shoulders of Fred Jelinek, Ken Church, and others for nearly two decades now, and the strain is beginning to tell as papers still strive to gain that extra 1% in their scores on some small task. We know that some change is in the air and I have tried to hint … to some of the places where that might be, even if that will mean a partial return to older, unfashionable ideas; for there is nothing new under the sun. »
  • 21. And Yorick is not alone     « it is now essential to play down the present narrowly-defined performance measures in order to address the task context, and specifically the role of the human participant in the task, so that new measures, of larger value, can be developed and applied.’ Spärck Jones K. Automatic language and information processing : rethinking evaluation. Natural Language Engineering, chapter 7, p. 1-18, 2001 So we need to be alert. It's not just that we may find ourselves putting the cart before the horse. We can get obsessed with the wheels, and finish up with uncritically reinvented, or square, or over-refined or otherwise unsatisfactory wheels, or even just unicycles. Spärck Jones, K. Computational linguistics: What about the linguistics? Comput. Linguist., 33(3):437–441, 2007.
  • 22. So where should we turn to? ➟ Linguistics? ➟ Psycholinguistics? ➟ Cognitive Science? ➟ Social Sciences? ➟ Information Theory? Ed’s conclusions: ➟ We don’t need any of these. ➟ We lack vision and theory
  • 23. Some gut reactions condescending attitude? Chinese room?
  • 24. Linguistics Ed is right to some extent, but possibly for the wrong reasons. He is right ➟ Linguists have indeed often dealt with borderline cases. ➟ Their theories are hardly ever based on corpora, i.e. real world data ➟ Dealing only with competency, linguists have never cared to take human constraints into account (short-term memory, etc.). Hence they fail to provide a psychologically adequate model. ➟ Linguists don’t care for the needs of other disciplines (problem of metalanguage) and they have often been syntacto-centristic.
  • 25. Linguistics I’d say, he is wrong ➟ We do need linguistics, as at least half of the language is rule- based, which makes it learnable within a reasonable time frame. Of course, this is also relevant for CL. ➟ They have not addressed but extreme cases, as otherwise they wouldn’t have been able to write grammars, which account for regularities of a substantial subset of language ➟  There are different kinds of linguistics. Hence the question, what kind of linguistic description do we need (procedural)? ➟ Also, there a some linguists who do take psycholinguistic and neurolinguistic factors into account (Jackendoff, Lamb, Hudson), and of course, there are cognitive linguists (Fillmore) and lexicographers, who, working on corpora, do cognitive linguistics (Pustejovsky and Hanks: Concept Pattern Analysis).
  • 26. Psycholinguistics Indeed, psychology does not have to say much about MT and IR, since these topics have been outside of their scope, but they do have to say quite a bit on sentence processing (alas, much less though, about discourse). Psycholinguistics is definitely useful, especially if you are interested in man-machine communication (support people to process language). In this case it helps a lot to know how people process information, how they represent, store (organisation) and access information (dictionaries) WN is the brainchild of a psychologist (G.A. Miller). It is clearly based on information concerning the human mind and its functioning (associations), but so is the web (Berner Lee) which is inspired by notions such as association and navigation, the latter corresponding to spreading-activation. Notions like patterns, omnipresent in CL have long been used in psychology (Gestalt psychology). And what we know about memory is crucial for information processing (why do we chunk graphs into sentence frames?).  
  • 27. Neurosciences and neurolinguistics If we didn’t have a theory, we wouldn’t be able to interprete properly patterns and correlations between them. This information often comes from psychology and linguistics, be it for the different levels or units   (meaning-form-sound) or the general architecture and the information flow. Ed’s question: can we infer functional lessons from the brain? Yes, concerning modularity, flexibility of processing, possible recovery despite incomplete information.
  • 28. Sociolinguistics Contrary to what one might think, Ed provides more reasons in favor of sociolinguistics than against it, and I couldn’t but agree more with him. It has always struck me that so little attention has been paid by CL to this kind of work. Yet, if we want to deal not only with language, but also with communication (language being only a means), than we have to take a closer look at the context in which communication takes place, in order to determine the different social parameters (place, context, people) affecting the choice of a specific linguistic resource. For example, when shall we highlight or downplay some information (active voice vs. passive voice; independant vs. subordinate clause, etc.), and if so, how is this done in a given language?
  • 29. What we may get from them and what they may need take into account?     What we can get from them ➟ inspiration, no doubt; ➟ ways to interprete our data; ➟ information allowing us to build our own theory; ➟  functionality: it would be absurd to simulate on a computer what has been devised for a completely different hardware, with different constraints (paper vs. brain, or, birds vs. airplanes). Never- theless, at a functional level, there are clearly equivalences (brain, i.e. neuronal networks and semantic networks, spreading-activation and search/ navigation), since we pursue similar goals. What they may need take into account ➟ agree on annotations ➟ work on real world data (corpus linguistics)
  • 30. Mutual benefits, i.e. added value Build truly semiotic extensions, i.e. tools supporting people to ➟ think (problem solving), ➟ process language and ➟ communicate (both in L1 and L2). How come that so few applications have been built to support NLP (authoring tools) and education?
  • 31. You've always got a choice, but it may be in your own interest to step back and think first about its adequacy, price, and consequences. 1° goals, i.e. needs 2° methods 3° results 4° adequacy/evaluation 5° price/consequences  
  • 32. Conclusion + possible recommendations We do need the knowledge accumulated in other domains Some possible recommendations of how to take advantage of it •  broaden the range of papers in ACL conferences and Computational Linguistics ; •  put more emphasis on the poster track ; •  include a broader range of people in journal editorial boards and conference PCs ; •  include a broader range of workshops, tutorials, and invited talks at ACL conferences; •  have survey papers in Computational Linguistics on other areas of language-related research ; •  offer a hands-on approach in smaller events like summerschools ; •  organize real WORKshops, possibly led by invited speakers ; •  support brainstorming by allowing the presentation of ideas (possibly even if they are not implemented or evaluated) and work describing real world applications.
  • 33. July  25,  2004   EMNLP-­‐2004    Senseval-­‐2004   33   Grand  Challenges   ip://ip.cordis.lu/pub/ist/docs/istag040319-­‐drainotesoihemee2ng.pdf    
  • 34. Thanks for their inspiration •  Reiter, E. (2007) The Shrinking Horizons of Computational Linguistics. Computational Linguistics Vol. 33, No. 2, Pages 283—287 •  Spärck Jones, K. Computational linguistics: What about the linguistics? Comput. Linguist., 33(3):437–441, 2007. •  Spärck Jones K. Automatic language and information processing : rethinking evaluation. Natural Language Engineering, chapter 7, p. 1-18, 2001 •  Wilks, Y. (2008) On Whose Shoulders? Computational Linguistics 34 (4) 471-486. •  Wintner, S. (2009), What Science Underlies Natural Language Engineering? Computational Linguistics December 2009, Vol. 35, No. 4: 641–644