SlideShare uma empresa Scribd logo
1 de 13
Baixar para ler offline
Intento
1
BEST vs. FIT TO PURPOSE
© Intento, Inc. / November 2019
linguistically best
for my
language pair
MT with a proper data
protection and retention
policy, proper level of custom
terminology support, which
trains on my linguistic assets
with good ROI per my business
goals, and works well with my
source data quality, format
and content type according to
my subject matter experts
vs.
Intento
2
BEST vs. FIT TO PURPOSE
© Intento, Inc. / November 2019
linguistically best
for my
language pair
MT with a proper data
protection and retention
policy, proper level of custom
terminology support, which
trains on my linguistic assets
with good ROI per my business
goals, and works well with my
source data quality, format
and content type according to
my subject matter experts
vs.
on-the-ïŹ‚y
MT routing
based on the
historical data
automated
procurement
and vendor
management
Intento
3
PRACTICAL
APPROACH
© Intento, Inc. / November 2019
1
2
3
4
Select candidate MT systems
Improve with your data assets
Automated scoring
Human-assisted scoring
Intento
1. SELECT CANDIDATE ENGINES
4© Intento, Inc. / November 2019
GENERIC STOCK MODELS
Alibaba Amazon Baidu DeepL eBay Google
GTCom IBM Kakao Microsoft Mirai ModernMT
Niutrans Naver Omniscien PROMT Rozetta SAP
SDL Sogou Systran Tencent Tilde Yandex
VERTICAL STOCK MODELS
CUSTOM TERMINOLOGY SUPPORT
AUTO DOMAIN ADAPTATION MANUAL DOMAIN ADAPTATION
Youdao
Alibaba Baidu
Cloud
Translate
Iconic Microsoft Omniscien
PROMT SAP Systran
Amazon Baidu Google IBM Iconic Microsoft Rozetta SDL Systran
Globalese Google IBM
Kantan Microsoft ModernMT
Omniscien SDL Systran
Alibaba Baidu
Cloud
Translate
Iconic
Omniscien PangeaMT Prompsit PROMT
SDL Systran Tilde Yandex
Yandex
Standalone commercial MT products with an API. All product names, trademarks and registered trademarks are property of their respective owners. All company,
product and service names used in this website are for identiïŹcation purposes only. Use of these names, trademarks and brands does not imply endorsement.
Intento
5
business requirements
—

language pair
—

domain
—

available language assets
—

tag and format support
—

cost of ownership
© Intento, Inc. / November 2019
July 2018 January 2019
WHAT MAKES THE DIFFERENCE?
Intento
6
2. IMPROVING
ENGINES
© Intento, Inc. / November 2019
data cleaning
TM training
glossaries
sentence scores
40-60% of “live”
TM is not suitable
for MT
—
linguistic
glossaries need to
be “compiled” for
MT
Intento
7
2. IMPROVING
ENGINES
© Intento, Inc. / November 2019
data cleaning
TM training
glossaries
sentence scores Different data volume and data
quality requirements
Different performance of baseline
models
July 2018 January 2019
Intento
8
2. IMPROVING
ENGINES
© Intento, Inc. / November 2019
data cleaning
TM training
glossaries
sentence scores
corpus scores are not
actionable
—
sentence scores help
linguists to focus
Intento
9
PRACTICAL
APPROACH
© Intento, Inc. / November 2019
1
2
3
4
Select candidate MT systems
Improve with your data assets
Automated scoring
Human-assisted scoring
Intento
CORPUS SCORES TO FIND TOP-RUNNERS
10© Intento, Inc. / November 2019
lack of correlation
indicates certain
types of errors
—

statistically
signiïŹcant rapid
drop-off identiïŹes
top-runners
Intento
SENTENCE SCORES TO HELP REVIEWERS
11© Intento, Inc. / November 2019
hard show NMT
training ïŹ‚aws
—

controversial expose
NMT quirks
—

easy to check how
high scores are
correlated with quality
—

typical to measure
PE effort
typical
Intento
4. HUMAN-ASSISTED SCORING
12© Intento, Inc. / November 2019
Depends
on the
purpose!
Linguistic Quality Assessment
—

Post-Editing Tracking
—

A/B testing
—

WTF-score
Intento
DIFFERENT SCENARIOS - DIFFERENT CHOICES
(even for the same language pair!)
13© Intento, Inc. / November 2019
PEMT / LSP
—

PEMT / Individual
—

Cross-Language Analysis and Retrieval (think eDiscovery)
—

Large-Scale Raw MT (think eCommerce)
—

Customer Support (think Global B2C)
—

Gisting and Inbound Content (think translation portals)
—

Large Enterprise
—

Government and Regulated Industries

Mais conteĂșdo relacionado

Mais de Konstantin Savenkov

Đ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚Đž
Đ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚ĐžĐ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚Đž
Đ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚Đž
Konstantin Savenkov
 
Lean production ĐŽĐ»Ń SAAS
Lean production ĐŽĐ»Ń SAASLean production ĐŽĐ»Ń SAAS
Lean production ĐŽĐ»Ń SAAS
Konstantin Savenkov
 

Mais de Konstantin Savenkov (20)

State of the Machine Translation by Intento (stock engines, Jun 2019)
State of the Machine Translation by Intento (stock engines, Jun 2019)State of the Machine Translation by Intento (stock engines, Jun 2019)
State of the Machine Translation by Intento (stock engines, Jun 2019)
 
State of the Machine Translation by Intento (stock engines, Jan 2019)
State of the Machine Translation by Intento (stock engines, Jan 2019)State of the Machine Translation by Intento (stock engines, Jan 2019)
State of the Machine Translation by Intento (stock engines, Jan 2019)
 
State of the Domain-Adaptive Machine Translation by Intento (November 2018)
State of the Domain-Adaptive Machine Translation by Intento (November 2018)State of the Domain-Adaptive Machine Translation by Intento (November 2018)
State of the Domain-Adaptive Machine Translation by Intento (November 2018)
 
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
 
Improving the Demand Side of the AI Economy (API World 2018)
Improving the Demand Side of the AI Economy (API World 2018)Improving the Demand Side of the AI Economy (API World 2018)
Improving the Demand Side of the AI Economy (API World 2018)
 
СраĐČĐœĐžŃ‚Đ”Đ»ŃŒĐœŃ‹Đč Đ°ĐœĐ°Đ»ĐžĐ· ŃĐžŃŃ‚Đ”ĐŒ ĐŒĐ°ŃˆĐžĐœĐœĐŸĐłĐŸ пДрДĐČĐŸĐŽĐ°
СраĐČĐœĐžŃ‚Đ”Đ»ŃŒĐœŃ‹Đč Đ°ĐœĐ°Đ»ĐžĐ· ŃĐžŃŃ‚Đ”ĐŒ ĐŒĐ°ŃˆĐžĐœĐœĐŸĐłĐŸ пДрДĐČĐŸĐŽĐ°ĐĄŃ€Đ°ĐČĐœĐžŃ‚Đ”Đ»ŃŒĐœŃ‹Đč Đ°ĐœĐ°Đ»ĐžĐ· ŃĐžŃŃ‚Đ”ĐŒ ĐŒĐ°ŃˆĐžĐœĐœĐŸĐłĐŸ пДрДĐČĐŸĐŽĐ°
СраĐČĐœĐžŃ‚Đ”Đ»ŃŒĐœŃ‹Đč Đ°ĐœĐ°Đ»ĐžĐ· ŃĐžŃŃ‚Đ”ĐŒ ĐŒĐ°ŃˆĐžĐœĐœĐŸĐłĐŸ пДрДĐČĐŸĐŽĐ°
 
State of the Machine Translation by Intento (July 2018)
State of the Machine Translation by Intento (July 2018)State of the Machine Translation by Intento (July 2018)
State of the Machine Translation by Intento (July 2018)
 
Cloud Sentiment Analysis - Vendor Overview (April 2018)
Cloud Sentiment Analysis - Vendor Overview (April 2018)Cloud Sentiment Analysis - Vendor Overview (April 2018)
Cloud Sentiment Analysis - Vendor Overview (April 2018)
 
State of the Machine Translation by Intento (March 2018)
State of the Machine Translation by Intento (March 2018)State of the Machine Translation by Intento (March 2018)
State of the Machine Translation by Intento (March 2018)
 
State of the Machine Translation by Intento (November 2017)
State of the Machine Translation by Intento (November 2017)State of the Machine Translation by Intento (November 2017)
State of the Machine Translation by Intento (November 2017)
 
NLU / Intent Detection Benchmark by Intento, August 2017
NLU / Intent Detection Benchmark by Intento, August 2017NLU / Intent Detection Benchmark by Intento, August 2017
NLU / Intent Detection Benchmark by Intento, August 2017
 
Intento Machine Translation Benchmark, July 2017
Intento Machine Translation Benchmark, July 2017Intento Machine Translation Benchmark, July 2017
Intento Machine Translation Benchmark, July 2017
 
Building a Data Driven Business
Building a Data Driven BusinessBuilding a Data Driven Business
Building a Data Driven Business
 
УпраĐČĐ»Đ”ĐœĐžĐ” Đ±ĐžĐ·ĐœĐ”ŃĐŸĐŒ ĐœĐ° ĐŸŃĐœĐŸĐČĐ” ĐŽĐ°ĐœĐœŃ‹Ń…
УпраĐČĐ»Đ”ĐœĐžĐ” Đ±ĐžĐ·ĐœĐ”ŃĐŸĐŒ ĐœĐ° ĐŸŃĐœĐŸĐČĐ” ĐŽĐ°ĐœĐœŃ‹Ń…ĐŁĐżŃ€Đ°ĐČĐ»Đ”ĐœĐžĐ” Đ±ĐžĐ·ĐœĐ”ŃĐŸĐŒ ĐœĐ° ĐŸŃĐœĐŸĐČĐ” ĐŽĐ°ĐœĐœŃ‹Ń…
УпраĐČĐ»Đ”ĐœĐžĐ” Đ±ĐžĐ·ĐœĐ”ŃĐŸĐŒ ĐœĐ° ĐŸŃĐœĐŸĐČĐ” ĐŽĐ°ĐœĐœŃ‹Ń…
 
Messengers, Bots and Personal Assistants
Messengers, Bots and Personal AssistantsMessengers, Bots and Personal Assistants
Messengers, Bots and Personal Assistants
 
Đ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚Đž
Đ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚ĐžĐ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚Đž
Đ Đ”ĐșĐŸĐŒĐ”ĐœĐŽĐ°Ń‚Đ”Đ»ŃŒĐœŃ‹Đ” ŃĐžŃŃ‚Đ”ĐŒŃ‹: Ń€ĐŸĐ»ŃŒ Đž ĐŸŃ†Đ”ĐœĐșĐ° ŃŃ„Ń„Đ”ĐșтоĐČĐœĐŸŃŃ‚Đž
 
Measuring the agile process improvement
Measuring the agile process improvementMeasuring the agile process improvement
Measuring the agile process improvement
 
Lean production ĐŽĐ»Ń SAAS
Lean production ĐŽĐ»Ń SAASLean production ĐŽĐ»Ń SAAS
Lean production ĐŽĐ»Ń SAAS
 
Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015
 
The Economics of Recommender Systems
The Economics of Recommender SystemsThe Economics of Recommender Systems
The Economics of Recommender Systems
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Mcleodganj Call Girls đŸ„° 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls đŸ„° 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls đŸ„° 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls đŸ„° 8617370543 Service Offer VIP Hot Model
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

Fantastic MT Engines and Where to Find Them

  • 1. Intento 1 BEST vs. FIT TO PURPOSE © Intento, Inc. / November 2019 linguistically best for my language pair MT with a proper data protection and retention policy, proper level of custom terminology support, which trains on my linguistic assets with good ROI per my business goals, and works well with my source data quality, format and content type according to my subject matter experts vs.
  • 2. Intento 2 BEST vs. FIT TO PURPOSE © Intento, Inc. / November 2019 linguistically best for my language pair MT with a proper data protection and retention policy, proper level of custom terminology support, which trains on my linguistic assets with good ROI per my business goals, and works well with my source data quality, format and content type according to my subject matter experts vs. on-the-ïŹ‚y MT routing based on the historical data automated procurement and vendor management
  • 3. Intento 3 PRACTICAL APPROACH © Intento, Inc. / November 2019 1 2 3 4 Select candidate MT systems Improve with your data assets Automated scoring Human-assisted scoring
  • 4. Intento 1. SELECT CANDIDATE ENGINES 4© Intento, Inc. / November 2019 GENERIC STOCK MODELS Alibaba Amazon Baidu DeepL eBay Google GTCom IBM Kakao Microsoft Mirai ModernMT Niutrans Naver Omniscien PROMT Rozetta SAP SDL Sogou Systran Tencent Tilde Yandex VERTICAL STOCK MODELS CUSTOM TERMINOLOGY SUPPORT AUTO DOMAIN ADAPTATION MANUAL DOMAIN ADAPTATION Youdao Alibaba Baidu Cloud Translate Iconic Microsoft Omniscien PROMT SAP Systran Amazon Baidu Google IBM Iconic Microsoft Rozetta SDL Systran Globalese Google IBM Kantan Microsoft ModernMT Omniscien SDL Systran Alibaba Baidu Cloud Translate Iconic Omniscien PangeaMT Prompsit PROMT SDL Systran Tilde Yandex Yandex Standalone commercial MT products with an API. All product names, trademarks and registered trademarks are property of their respective owners. All company, product and service names used in this website are for identiïŹcation purposes only. Use of these names, trademarks and brands does not imply endorsement.
  • 5. Intento 5 business requirements — language pair — domain — available language assets — tag and format support — cost of ownership © Intento, Inc. / November 2019 July 2018 January 2019 WHAT MAKES THE DIFFERENCE?
  • 6. Intento 6 2. IMPROVING ENGINES © Intento, Inc. / November 2019 data cleaning TM training glossaries sentence scores 40-60% of “live” TM is not suitable for MT — linguistic glossaries need to be “compiled” for MT
  • 7. Intento 7 2. IMPROVING ENGINES © Intento, Inc. / November 2019 data cleaning TM training glossaries sentence scores Different data volume and data quality requirements Different performance of baseline models July 2018 January 2019
  • 8. Intento 8 2. IMPROVING ENGINES © Intento, Inc. / November 2019 data cleaning TM training glossaries sentence scores corpus scores are not actionable — sentence scores help linguists to focus
  • 9. Intento 9 PRACTICAL APPROACH © Intento, Inc. / November 2019 1 2 3 4 Select candidate MT systems Improve with your data assets Automated scoring Human-assisted scoring
  • 10. Intento CORPUS SCORES TO FIND TOP-RUNNERS 10© Intento, Inc. / November 2019 lack of correlation indicates certain types of errors — statistically signiïŹcant rapid drop-off identiïŹes top-runners
  • 11. Intento SENTENCE SCORES TO HELP REVIEWERS 11© Intento, Inc. / November 2019 hard show NMT training ïŹ‚aws — controversial expose NMT quirks — easy to check how high scores are correlated with quality — typical to measure PE effort typical
  • 12. Intento 4. HUMAN-ASSISTED SCORING 12© Intento, Inc. / November 2019 Depends on the purpose! Linguistic Quality Assessment — Post-Editing Tracking — A/B testing — WTF-score
  • 13. Intento DIFFERENT SCENARIOS - DIFFERENT CHOICES (even for the same language pair!) 13© Intento, Inc. / November 2019 PEMT / LSP — PEMT / Individual — Cross-Language Analysis and Retrieval (think eDiscovery) — Large-Scale Raw MT (think eCommerce) — Customer Support (think Global B2C) — Gisting and Inbound Content (think translation portals) — Large Enterprise — Government and Regulated Industries