TAUS recently performed an extensive study of the machine translation market and estimates its size at $250 million. Despite the relatively small size of the market, it has outsize impact and innovation power. TAUS believes that MT technology is a key enabler and a force multiplier for new services and growth. In our report we cover different uses of MT technology, survey players and offerings in the market, and analyze the impact of large online MT providers and open source technology. We identified 65 MT operators and broke down the market by offering type and region. We analyzed market trends, opportunities and challenges as well as market drivers and inhibitors. Based on our data and observations we venture a few predictions for the MT market. This presentation is part of the MosesCore project, which encourages the development and usage of open source machine translation tools, notably the Moses statistical MT toolkit. MosesCore is supported by the European Commission Grant Number 288487 under the 7th Framework Programme. For the latest updates go to http://www.statmt.org/mosescore/ or follow us on Twitter: #MosesCore
TAUS Moses Industry Roundtable
Jaap van der Meer, TAUS
Achim Ruopp, TAUS
AMTA Conference, 26-Oct-2014
This slide may not be used or copied without permission from TAUS
Agenda
14:00 - 14:15 / Introduction and Moses Market Overview, Jaap van der Meer (TAUS) and Achim Ruopp (TAUS)
14:15 - 14:35 / Moses Latest Developments and Future Plans, Hieu Hoang & Ulrich Germann, University of Edinburgh
14:35 - 14:45 / Introducing Strategic Questions
14:45 - 15:30 / Breakout Discussions
15:30 - 16:00 / Coffee Break
16:00 - 16:15 / Reporting back to entire round table
16:15 - 17:30 / Consolidating different strategic questions - charting a common path forward
Participants
John Paul Barraza (Systran Software, Inc.)
Kevin Blahut (KB Translations)
Ann Clifton (Simon Fraser University)
Chenchen Ding (National Institute of Information and Communications)
Kevin Flanagan (Swansea University)
Ray Flournoy (Etsy)
Scott Gaskill (Sovee)
Ulrich Germann (University of Edinburgh)
João Graça (Unbabel, Inc.)
Junfei Guo (University of Stuttgart)
Saša Hasan (eBay)
Eric Hedekar (Simon Fraser University)
Hieu Hoang (University of Edinburgh)
Reginald Hobbs (Army Research Laboratory)
Macduff Hughes (Google)
Ioannis Iakovidis (Interverbum)
Rihards Kalnins (Tilde)
Christopher Klapp (Sovee)
Roland Kuhn (National Research Council of Canada)
Alon Lavie (Safaba Translation Solutions)
Saab Mansour (RWTH Aachen University)
Marian Martin-Moran (Target)
Jonathan May (USC Information Sciences Institute)
Prakash Menon (John Deere)
Keith Miller (The MITRE Corporation)
Wafaa Mohiy (Saudisoft)
Edamwen Anthony Oni (Seprotec Multilingual SL)
Anik Pelletier (Target)
Jesper Sandberg (Sandberg Translation Partners Ltd)
Jeannette Stewart (TAUS/GLO)
Michelle Vanni (U.S. Army Research Laboratory)
Pedro Vasco (Unbabel, Inc.)
Cedric Vezinet (LanguageLine Solutions)
Varden Wang (Google)
Jian Zhang (CNGL, Dublin City University)
Andrzej Zydron (XTM-INTL)
MosesCore EU Project – what is it?
o Reducing barriers to using MT technology for all users
  § Organizations
  § Individuals
o Means
  § Moses Coordination and Integration
    o Support, documentation, code maintenance, releases, packages
  § Machine Translation Marathon Events
  § Workshop for Machine Translation Events
    o Evaluation with defined set of data
  § Industry Outreach
    o Online tutorial, events
o Project ending in January 2015
TAUS Machine Translation Market Report
Executive Summary
o Market size: $250 Million, growing 16.9% per year
o “Perfect storm conditions” for MT
o Key trends:
  § Globalization, Integration, Convergence, Verticalization, Immediacy of communication, Privacy – security, High quality translation
o Opportunities:
  § Business expansion, Integration of MT, Productivity gains, MT as enabler for new services, Narrow domain applications, Customer support self-service
o Challenges:
  § False expectations – false starts, Quality of MT, Language coverage, Available training data, Specialist skills, Vendor lock-in, Competition from free MT, Quality measurement & estimation
o Predictions:
  § Post-editing MT will grow very quickly and become the primary production process in translation within five years.
  § MT technology itself is on its way to become a commodity, shifting the Holy Grail to the data
Open Source Machine Translation
o Open source toolkits available for all approaches to machine translation
o Development of a complex MT system core including a decoder (component that performs the translations)
  § Shared effort of a loosely organized group of expert developers
  § Often developer group is mostly academic
  § Licensed under open source license allowing any use including commercial use
    o Justified by public funding of academic efforts
o Most popular/established: the Moses Statistical MT toolkit
o Has significantly changed the market dynamics in the MT supplier market
Moses As Technology
[Diagram with two areas, Technology and Data. Labels shown: Moses Solutions, MT as a Service, Integrators; eDiscovery, News Analysis, Big Data Intelligence; Europarl, OPUS, UN; Mostly anonymous; DoMT + M4Loc; Examples.]
Open Source Machine Translation
Moses Business Adoption Models
o Model 1: Moses out-of-the-box
  § Download and build Moses and associated components yourself
o Model 2: Moses with open source/free add-ons
  § Pre-built installation to make getting started easier
  § Additional scripts to support language industry needs
o Model 3: Commercialized Moses
  § Web-hosted or self-hosted
  § UI to build MT engines and translate documents
  § Various add-ons for data cleaning, post-editing, language support etc.
o Model 4: Hybrid MT
  § Moses combined with other MT technologies (e.g. RBMT engines) to provide high-quality MT output
Open Source Machine Translation
Observed Moses Adoption Patterns
o Direct use of Moses mostly by
  § Suppliers of Model 3 and Model 4 solutions
  § Large internal users
    o Government organizations and
    o Large multinational companies
  § Only these organizations
    o Have the resources to employ an expert team to
      ¨ Build well-performing MT engines
      ¨ Adapt Moses to their needs
  § See the full control of their MT destiny as a strategic goal
Open Source Machine Translation
Observed Moses Adoption Patterns
o Little or no direct Moses use among small and mid-size companies
  § Even diminishing use by early adopters due to increased availability of affordable and customizable commercial MT solutions
  § Complete Model 2 solution currently not available
    o Unlike comparable OSS projects like MySQL, OpenOffice and Linux Distributions
o Moses More Similar to Linux Kernel and Android projects
  § No large institutional backer (yet?)
  § Foundation model a way forward for industry use?