SlideShare uma empresa Scribd logo
1 de 15
Language Processing
at the core of the Media &
  Publishing Industries

                             Berlin, April 12th 2013
Language Processing at the core of the Media & Publishing Industries




Multifacet crisis in the Media & Publishing Industries
                                                     User
                                                   generated
                                                    content




                  Decline in                                               Shift in ways
                 advertising                                                audiences
                  revenues                                                  consume
                                                                             contents



                                                   Business
                                                    models
Language Processing at the core of the Media & Publishing Industries




An old concern: language quality




                                                                    Proofreading with Stilus:
                                                                    Spell, grammar and
                                                                    style checking
Language Processing at the core of the Media & Publishing Industries


         Daedalus: extracting meaning from
         multilingual & multimedia contents
         Semantic Processing: automatic extraction of knowledge items
                        from non-structured content

                                                                                          Facts
                                                                       Topics
                                                                                                     Sentiment

                                Semantic
                               Processing
                                                                          People

Annotation, Enrichment & Linking

Named entities and concepts extraction,
classification, clustering                                                     Organizations      Concepts
Areas: documentation and advanced
content search, SEO positioning
Language Processing at the core of the Media & Publishing Industries


Advanced Semantic Analysis at Daedalus

                                                                          People:
                                                                           Ben Bernanke, Mariano Rajoy…
                                                                          Companies, organizations:
                                                                           BBVA, Bankia, Goldman Sachs, Coca-Cola,
                                                                           Reserva Federal…

                                                                          Financial named entities:
                                                                           Ibex35, Dax Xetra…
                                                                          Places:
                                                                           Londres, EE.UU., París…
                                                                          Concepts:
                                                                           prima de riesgo, presidente del
                                                                           Gobierno, intervención parlamentaria,
                                                                           índice bursátil, situación económica…
                                                                          Time references:
                                                                           hoy, ayer, sobre las 11 de la mañana…
                                                                          Money amounts:
                                                                           104 dólares, 1 euro…
                                                                          Polarity positive/neutral/negative
Language Processing at the core of the Media & Publishing Industries




Content aggregation
Language Processing at the core of the Media & Publishing Industries




User-generated content: automatic translation
Language Processing at the core of the Media & Publishing Industries




User-generated content: automatic moderation

 Tool for automatic moderation of
  social media, blogs, fora, etc.
 Offensive, illegal, inappropriate or
  objectionable content filtering
Language Processing at the core of the Media & Publishing Industries




Social Media Analytics: Sentimentalytics
Language Processing at the core of the Media & Publishing Industries




Video/audio indexing & search



                                                 Transcription                     Indexing




                                                     Contents
                                                                                    Index



                                                                          Search
Language Processing at the core of the Media & Publishing Industries




Automatic Subtitling: transcription, segmentation &
synchronization



                            Transcription


TEXT



                      Processing
                      (checking,
                     proofreading,                              Storag
                         etc.)
                                                                   e
Language Processing at the core of the Media & Publishing Industries




Data Journalism: exploration & analysis of info sources
   Look4leaks.net: Wikileaks case
       • Automatic translation of 251.000 cables (5 languages)
       • Semantic enrichment: entities, classification
       • Multifacet search: by embassy, person, country…
   Trial files: Gürtel case (corruption, >100 Kpages)
       • OCR, fuzzy recognition and multifacet search
   Spanish state of the nation address
       • Semantic analysis and search
Language Processing at the core of the Media & Publishing Industries




Transmedia
   Content production for simultaneous
    and coordinated delivery through
    different channels
   Personalized delivery
       • Content of interest according
         to user profile
       • Contextual advertising


   E.g.: second screen apps
Language Processing at the core of the Media & Publishing Industries




New ways for monetizing content
 Selling content chunks:
    • chapters, sections of reference books, etc.
 Selling content aggregates:
    • full story through news published along the time about one topic
Language Processing at the core of the Media & Publishing Industries




  DAEDALUS, S.A.

Jose C. Gonzalez
jgonzalez@daedalus.es
http://www.daedalus.es

Mais conteúdo relacionado

Destaque

Empirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense FrameworkEmpirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense FrameworkLeon Derczynski
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Leon Derczynski
 
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy DataLeon Derczynski
 
From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014
From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014
From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014Marko Grobelnik
 
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesCorpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesLeon Derczynski
 
KiwiPyCon 2014 talk - Understanding human language with Python
KiwiPyCon 2014 talk - Understanding human language with PythonKiwiPyCon 2014 talk - Understanding human language with Python
KiwiPyCon 2014 talk - Understanding human language with PythonAlyona Medelyan
 
A Primer on Text Mining for Business
A Primer on Text Mining for BusinessA Primer on Text Mining for Business
A Primer on Text Mining for BusinessClement Levallois
 
NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...
NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...
NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...Anna Divoli
 
Handling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGCHandling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGCLeon Derczynski
 
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...Afshin Rahimi
 
Fact Extraction from Wikipedia
Fact Extraction from WikipediaFact Extraction from Wikipedia
Fact Extraction from WikipediaMarco Fossati
 
Dependency Parsing
Dependency ParsingDependency Parsing
Dependency ParsingJinho Choi
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Isabelle Augenstein
 
Regularised Cross-Modal Hashing (SIGIR'15 Poster)
Regularised Cross-Modal Hashing (SIGIR'15 Poster)Regularised Cross-Modal Hashing (SIGIR'15 Poster)
Regularised Cross-Modal Hashing (SIGIR'15 Poster)Sean Moran
 
Turning a Thousand or so Words into a Map
Turning a Thousand or so Words into a MapTurning a Thousand or so Words into a Map
Turning a Thousand or so Words into a MapCharlie Greenbacker
 
Detecting Gender by Full Name: Experiments with the Russian Language
Detecting Gender by Full Name:  Experiments with the Russian LanguageDetecting Gender by Full Name:  Experiments with the Russian Language
Detecting Gender by Full Name: Experiments with the Russian LanguageAlexander Panchenko
 

Destaque (18)

Empirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense FrameworkEmpirical Validation of Reichenbach’s Tense Framework
Empirical Validation of Reichenbach’s Tense Framework
 
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
Mining Social Media with Linked Open Data, Entity Recognition, and Event Extr...
 
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data Twitter Part-of-Speech Tagging for All:  Overcoming Sparse and Noisy Data
Twitter Part-of-Speech Tagging for All: Overcoming Sparse and Noisy Data
 
From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014
From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014
From Text To Reasoning - Marko Grobelnik - SWANK Workshop Stanford - 16 Apr 2014
 
Julia text mining_inmobi
Julia text mining_inmobiJulia text mining_inmobi
Julia text mining_inmobi
 
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice GuidelinesCorpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
Corpus Annotation through Crowdsourcing: Towards Best Practice Guidelines
 
KiwiPyCon 2014 talk - Understanding human language with Python
KiwiPyCon 2014 talk - Understanding human language with PythonKiwiPyCon 2014 talk - Understanding human language with Python
KiwiPyCon 2014 talk - Understanding human language with Python
 
A Primer on Text Mining for Business
A Primer on Text Mining for BusinessA Primer on Text Mining for Business
A Primer on Text Mining for Business
 
NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...
NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...
NLP Tales in Biomedicine (introductory presentation for the Auckland NLP Meet...
 
Data2Content Press Release
Data2Content Press ReleaseData2Content Press Release
Data2Content Press Release
 
Handling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGCHandling and Mining Linguistic Variation in UGC
Handling and Mining Linguistic Variation in UGC
 
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
ACL2015 Poster: Twitter User Geolocation Using a Unified Text and Network Pre...
 
Fact Extraction from Wikipedia
Fact Extraction from WikipediaFact Extraction from Wikipedia
Fact Extraction from Wikipedia
 
Dependency Parsing
Dependency ParsingDependency Parsing
Dependency Parsing
 
Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...Extracting Relations between Non-Standard Entities using Distant Supervision ...
Extracting Relations between Non-Standard Entities using Distant Supervision ...
 
Regularised Cross-Modal Hashing (SIGIR'15 Poster)
Regularised Cross-Modal Hashing (SIGIR'15 Poster)Regularised Cross-Modal Hashing (SIGIR'15 Poster)
Regularised Cross-Modal Hashing (SIGIR'15 Poster)
 
Turning a Thousand or so Words into a Map
Turning a Thousand or so Words into a MapTurning a Thousand or so Words into a Map
Turning a Thousand or so Words into a Map
 
Detecting Gender by Full Name: Experiments with the Russian Language
Detecting Gender by Full Name:  Experiments with the Russian LanguageDetecting Gender by Full Name:  Experiments with the Russian Language
Detecting Gender by Full Name: Experiments with the Russian Language
 

Semelhante a Language Processing at the Core of the Media & Publishing Industries - Daedalus Perspective

Measuring PR in the Digital Age - Evaluating Communications Effectiveness
Measuring PR in the Digital Age - Evaluating Communications EffectivenessMeasuring PR in the Digital Age - Evaluating Communications Effectiveness
Measuring PR in the Digital Age - Evaluating Communications EffectivenessLars Voedisch
 
USEEDS° :: Content Strategy and IA
USEEDS° :: Content Strategy and IAUSEEDS° :: Content Strategy and IA
USEEDS° :: Content Strategy and IAUSEEDS GmbH
 
Content Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: Essen
Content Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: EssenContent Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: Essen
Content Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: Essennikki tiedtke
 
Xalok - The Newsroom Integrated System (Editorial CMS)
Xalok - The Newsroom Integrated System (Editorial CMS)Xalok - The Newsroom Integrated System (Editorial CMS)
Xalok - The Newsroom Integrated System (Editorial CMS)Amelia Roversi-Mónaco
 
Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?
Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?
Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?Lars Voedisch
 
Goodbye Measurement, Hello Analytics: The Move to "Alw
Goodbye Measurement, Hello Analytics: The Move to "AlwGoodbye Measurement, Hello Analytics: The Move to "Alw
Goodbye Measurement, Hello Analytics: The Move to "AlwTim Marklein
 
Social analytics apr24'12_marklein-1
Social analytics apr24'12_marklein-1Social analytics apr24'12_marklein-1
Social analytics apr24'12_marklein-1ronpiovesan
 
Rebranding Logio
Rebranding LogioRebranding Logio
Rebranding LogioIdealisti
 
State Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products ProjectsState Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products ProjectsPascal Cottereau
 
Rebranding Logio
Rebranding LogioRebranding Logio
Rebranding LogioHrivnak
 
Manzama lma - 9-26-12
Manzama   lma - 9-26-12Manzama   lma - 9-26-12
Manzama lma - 9-26-12Peter Ozolin
 
Managing the Uncontrollable - Integrated Communications in the Age of Social ...
Managing the Uncontrollable - Integrated Communications in the Age of Social ...Managing the Uncontrollable - Integrated Communications in the Age of Social ...
Managing the Uncontrollable - Integrated Communications in the Age of Social ...Lars Voedisch
 
Research Issues in Knowledge Management and Social Media
Research Issues in Knowledge Management and Social MediaResearch Issues in Knowledge Management and Social Media
Research Issues in Knowledge Management and Social MediaJan Pawlowski
 
Intelligent Content Strategies
Intelligent Content StrategiesIntelligent Content Strategies
Intelligent Content StrategiesJoe Gollner
 
Ir online and websites best practice b
Ir online and websites best practice bIr online and websites best practice b
Ir online and websites best practice bHallvarsson Halvarsson
 
Content marketing sb 011812
Content marketing sb 011812Content marketing sb 011812
Content marketing sb 011812Capital Group
 

Semelhante a Language Processing at the Core of the Media & Publishing Industries - Daedalus Perspective (20)

Trends in Localisation
Trends in LocalisationTrends in Localisation
Trends in Localisation
 
Measuring PR in the Digital Age - Evaluating Communications Effectiveness
Measuring PR in the Digital Age - Evaluating Communications EffectivenessMeasuring PR in the Digital Age - Evaluating Communications Effectiveness
Measuring PR in the Digital Age - Evaluating Communications Effectiveness
 
USEEDS° :: Content Strategy and IA
USEEDS° :: Content Strategy and IAUSEEDS° :: Content Strategy and IA
USEEDS° :: Content Strategy and IA
 
Content Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: Essen
Content Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: EssenContent Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: Essen
Content Strategy and IA :: IA-Konferenz :: 11. - 12. Mai 2012 :: Essen
 
Xalok - The Newsroom Integrated System (Editorial CMS)
Xalok - The Newsroom Integrated System (Editorial CMS)Xalok - The Newsroom Integrated System (Editorial CMS)
Xalok - The Newsroom Integrated System (Editorial CMS)
 
Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?
Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?
Measuring the Effectiveness of PR Efforts:Are You Busy or Indispensable?
 
Goodbye Measurement, Hello Analytics: The Move to "Alw
Goodbye Measurement, Hello Analytics: The Move to "AlwGoodbye Measurement, Hello Analytics: The Move to "Alw
Goodbye Measurement, Hello Analytics: The Move to "Alw
 
Social analytics apr24'12_marklein-1
Social analytics apr24'12_marklein-1Social analytics apr24'12_marklein-1
Social analytics apr24'12_marklein-1
 
Rebranding Logio
Rebranding LogioRebranding Logio
Rebranding Logio
 
State Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products ProjectsState Of The Art - Part 2 Products Projects
State Of The Art - Part 2 Products Projects
 
Ryerson
RyersonRyerson
Ryerson
 
Rebranding Logio
Rebranding LogioRebranding Logio
Rebranding Logio
 
Sentiment analysis taxonomy_apr-12-2011
Sentiment analysis taxonomy_apr-12-2011Sentiment analysis taxonomy_apr-12-2011
Sentiment analysis taxonomy_apr-12-2011
 
Getting Started with Content Strategy
Getting Started with Content StrategyGetting Started with Content Strategy
Getting Started with Content Strategy
 
Manzama lma - 9-26-12
Manzama   lma - 9-26-12Manzama   lma - 9-26-12
Manzama lma - 9-26-12
 
Managing the Uncontrollable - Integrated Communications in the Age of Social ...
Managing the Uncontrollable - Integrated Communications in the Age of Social ...Managing the Uncontrollable - Integrated Communications in the Age of Social ...
Managing the Uncontrollable - Integrated Communications in the Age of Social ...
 
Research Issues in Knowledge Management and Social Media
Research Issues in Knowledge Management and Social MediaResearch Issues in Knowledge Management and Social Media
Research Issues in Knowledge Management and Social Media
 
Intelligent Content Strategies
Intelligent Content StrategiesIntelligent Content Strategies
Intelligent Content Strategies
 
Ir online and websites best practice b
Ir online and websites best practice bIr online and websites best practice b
Ir online and websites best practice b
 
Content marketing sb 011812
Content marketing sb 011812Content marketing sb 011812
Content marketing sb 011812
 

Mais de Sngular Meaning

Customer Analytics; qué se necesita y cómo conseguirlo by Josep Curto
Customer Analytics; qué se necesita y cómo conseguirlo by Josep CurtoCustomer Analytics; qué se necesita y cómo conseguirlo by Josep Curto
Customer Analytics; qué se necesita y cómo conseguirlo by Josep CurtoSngular Meaning
 
Customer Analytics: de text analytics a Voice of Customer
Customer Analytics: de text analytics a Voice of CustomerCustomer Analytics: de text analytics a Voice of Customer
Customer Analytics: de text analytics a Voice of CustomerSngular Meaning
 
s|ngular Data and Analytics Intro
s|ngular Data and Analytics Intros|ngular Data and Analytics Intro
s|ngular Data and Analytics IntroSngular Meaning
 
Stilus corrector ortografico gramatical de estilo en espanol
Stilus   corrector ortografico gramatical de estilo en espanolStilus   corrector ortografico gramatical de estilo en espanol
Stilus corrector ortografico gramatical de estilo en espanolSngular Meaning
 
Social Media Analytics for Emergency Management - Telefonica Daedalus 2014
Social Media Analytics for Emergency Management -  Telefonica Daedalus 2014Social Media Analytics for Emergency Management -  Telefonica Daedalus 2014
Social Media Analytics for Emergency Management - Telefonica Daedalus 2014Sngular Meaning
 
Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014
Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014
Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014Sngular Meaning
 
Tweet alert - semantic analysis in social networks for citizen opinion mining
Tweet alert - semantic analysis in social networks for citizen opinion miningTweet alert - semantic analysis in social networks for citizen opinion mining
Tweet alert - semantic analysis in social networks for citizen opinion miningSngular Meaning
 
Tecnologías semánticas en sanidad
Tecnologías semánticas en sanidadTecnologías semánticas en sanidad
Tecnologías semánticas en sanidadSngular Meaning
 
Semantic Technologies for Healthcare
Semantic Technologies for HealthcareSemantic Technologies for Healthcare
Semantic Technologies for HealthcareSngular Meaning
 
Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014
Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014
Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014Sngular Meaning
 
Stilus en IX Seminario Internacional de Lengua y Periodismo 2014
Stilus en IX Seminario Internacional de Lengua y Periodismo 2014Stilus en IX Seminario Internacional de Lengua y Periodismo 2014
Stilus en IX Seminario Internacional de Lengua y Periodismo 2014Sngular Meaning
 
Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...
Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...
Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...Sngular Meaning
 
Stilus lenguando-lc aplicada a la correccion
Stilus lenguando-lc aplicada a la correccionStilus lenguando-lc aplicada a la correccion
Stilus lenguando-lc aplicada a la correccionSngular Meaning
 
Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014
Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014
Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014Sngular Meaning
 
An Introduction to Textalytics API - Redradix Weekend
An Introduction to Textalytics API - Redradix WeekendAn Introduction to Textalytics API - Redradix Weekend
An Introduction to Textalytics API - Redradix WeekendSngular Meaning
 
Real time semantic search engine for social tv streams
Real time semantic search engine for social tv streamsReal time semantic search engine for social tv streams
Real time semantic search engine for social tv streamsSngular Meaning
 
Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013
Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013
Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013Sngular Meaning
 
Textalytics, Meaning as a Service
Textalytics, Meaning as a ServiceTextalytics, Meaning as a Service
Textalytics, Meaning as a ServiceSngular Meaning
 
Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013
Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013
Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013Sngular Meaning
 
Webinar Publicacion Semantica - Daedalus 26 feb 2013
Webinar Publicacion Semantica - Daedalus 26 feb 2013Webinar Publicacion Semantica - Daedalus 26 feb 2013
Webinar Publicacion Semantica - Daedalus 26 feb 2013Sngular Meaning
 

Mais de Sngular Meaning (20)

Customer Analytics; qué se necesita y cómo conseguirlo by Josep Curto
Customer Analytics; qué se necesita y cómo conseguirlo by Josep CurtoCustomer Analytics; qué se necesita y cómo conseguirlo by Josep Curto
Customer Analytics; qué se necesita y cómo conseguirlo by Josep Curto
 
Customer Analytics: de text analytics a Voice of Customer
Customer Analytics: de text analytics a Voice of CustomerCustomer Analytics: de text analytics a Voice of Customer
Customer Analytics: de text analytics a Voice of Customer
 
s|ngular Data and Analytics Intro
s|ngular Data and Analytics Intros|ngular Data and Analytics Intro
s|ngular Data and Analytics Intro
 
Stilus corrector ortografico gramatical de estilo en espanol
Stilus   corrector ortografico gramatical de estilo en espanolStilus   corrector ortografico gramatical de estilo en espanol
Stilus corrector ortografico gramatical de estilo en espanol
 
Social Media Analytics for Emergency Management - Telefonica Daedalus 2014
Social Media Analytics for Emergency Management -  Telefonica Daedalus 2014Social Media Analytics for Emergency Management -  Telefonica Daedalus 2014
Social Media Analytics for Emergency Management - Telefonica Daedalus 2014
 
Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014
Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014
Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014
 
Tweet alert - semantic analysis in social networks for citizen opinion mining
Tweet alert - semantic analysis in social networks for citizen opinion miningTweet alert - semantic analysis in social networks for citizen opinion mining
Tweet alert - semantic analysis in social networks for citizen opinion mining
 
Tecnologías semánticas en sanidad
Tecnologías semánticas en sanidadTecnologías semánticas en sanidad
Tecnologías semánticas en sanidad
 
Semantic Technologies for Healthcare
Semantic Technologies for HealthcareSemantic Technologies for Healthcare
Semantic Technologies for Healthcare
 
Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014
Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014
Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014
 
Stilus en IX Seminario Internacional de Lengua y Periodismo 2014
Stilus en IX Seminario Internacional de Lengua y Periodismo 2014Stilus en IX Seminario Internacional de Lengua y Periodismo 2014
Stilus en IX Seminario Internacional de Lengua y Periodismo 2014
 
Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...
Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...
Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...
 
Stilus lenguando-lc aplicada a la correccion
Stilus lenguando-lc aplicada a la correccionStilus lenguando-lc aplicada a la correccion
Stilus lenguando-lc aplicada a la correccion
 
Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014
Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014
Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014
 
An Introduction to Textalytics API - Redradix Weekend
An Introduction to Textalytics API - Redradix WeekendAn Introduction to Textalytics API - Redradix Weekend
An Introduction to Textalytics API - Redradix Weekend
 
Real time semantic search engine for social tv streams
Real time semantic search engine for social tv streamsReal time semantic search engine for social tv streams
Real time semantic search engine for social tv streams
 
Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013
Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013
Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013
 
Textalytics, Meaning as a Service
Textalytics, Meaning as a ServiceTextalytics, Meaning as a Service
Textalytics, Meaning as a Service
 
Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013
Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013
Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013
 
Webinar Publicacion Semantica - Daedalus 26 feb 2013
Webinar Publicacion Semantica - Daedalus 26 feb 2013Webinar Publicacion Semantica - Daedalus 26 feb 2013
Webinar Publicacion Semantica - Daedalus 26 feb 2013
 

Último

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 

Último (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Language Processing at the Core of the Media & Publishing Industries - Daedalus Perspective

  • 1. Language Processing at the core of the Media & Publishing Industries Berlin, April 12th 2013
  • 2. Language Processing at the core of the Media & Publishing Industries Multifacet crisis in the Media & Publishing Industries User generated content Decline in Shift in ways advertising audiences revenues consume contents Business models
  • 3. Language Processing at the core of the Media & Publishing Industries An old concern: language quality Proofreading with Stilus: Spell, grammar and style checking
  • 4. Language Processing at the core of the Media & Publishing Industries Daedalus: extracting meaning from multilingual & multimedia contents Semantic Processing: automatic extraction of knowledge items from non-structured content Facts Topics Sentiment Semantic Processing People Annotation, Enrichment & Linking Named entities and concepts extraction, classification, clustering Organizations Concepts Areas: documentation and advanced content search, SEO positioning
  • 5. Language Processing at the core of the Media & Publishing Industries Advanced Semantic Analysis at Daedalus  People: Ben Bernanke, Mariano Rajoy…  Companies, organizations: BBVA, Bankia, Goldman Sachs, Coca-Cola, Reserva Federal…  Financial named entities: Ibex35, Dax Xetra…  Places: Londres, EE.UU., París…  Concepts: prima de riesgo, presidente del Gobierno, intervención parlamentaria, índice bursátil, situación económica…  Time references: hoy, ayer, sobre las 11 de la mañana…  Money amounts: 104 dólares, 1 euro…  Polarity positive/neutral/negative
  • 6. Language Processing at the core of the Media & Publishing Industries Content aggregation
  • 7. Language Processing at the core of the Media & Publishing Industries User-generated content: automatic translation
  • 8. Language Processing at the core of the Media & Publishing Industries User-generated content: automatic moderation  Tool for automatic moderation of social media, blogs, fora, etc.  Offensive, illegal, inappropriate or objectionable content filtering
  • 9. Language Processing at the core of the Media & Publishing Industries Social Media Analytics: Sentimentalytics
  • 10. Language Processing at the core of the Media & Publishing Industries Video/audio indexing & search Transcription Indexing Contents Index Search
  • 11. Language Processing at the core of the Media & Publishing Industries Automatic Subtitling: transcription, segmentation & synchronization Transcription TEXT Processing (checking, proofreading, Storag etc.) e
  • 12. Language Processing at the core of the Media & Publishing Industries Data Journalism: exploration & analysis of info sources  Look4leaks.net: Wikileaks case • Automatic translation of 251.000 cables (5 languages) • Semantic enrichment: entities, classification • Multifacet search: by embassy, person, country…  Trial files: Gürtel case (corruption, >100 Kpages) • OCR, fuzzy recognition and multifacet search  Spanish state of the nation address • Semantic analysis and search
  • 13. Language Processing at the core of the Media & Publishing Industries Transmedia  Content production for simultaneous and coordinated delivery through different channels  Personalized delivery • Content of interest according to user profile • Contextual advertising  E.g.: second screen apps
  • 14. Language Processing at the core of the Media & Publishing Industries New ways for monetizing content  Selling content chunks: • chapters, sections of reference books, etc.  Selling content aggregates: • full story through news published along the time about one topic
  • 15. Language Processing at the core of the Media & Publishing Industries DAEDALUS, S.A. Jose C. Gonzalez jgonzalez@daedalus.es http://www.daedalus.es