How we train post-editors - Yongpeng Wei (Lingosail)

How We Train MT Post-Editors

Yongpeng WEI, Lingosail

2017.02.28

TAUS Translation Technology Showcase@LocWorld Shenzhen 2017

About Lingosail

Products & Service
• Translation Technology
◦ Corpus / Term-base Construction
◦ Vertical MT Training
◦ On-line CAT Customization
• Translation (MTPE) Service
◦ MTPE production
◦ MTPE training

Markets
• Intellectual Property
• Cross-border E-Commerce
• …

Why MT+PE?

Easier Entry
• Lower requirement for domain knowledge
• Lower cost of learning and self-improvement for translators

Higher Efficiency
• Speed up the search and verification process
• Reduce repetitions
• Reduce typing time
• Reduce elementary mistakes

More Scalable Management
• Unify workflow
• Increase standardization

What is important for MT?

Better Usability
◦ Vertical domain adaptation and optimization
◦ Terminology/Knowledge embedding
◦ Quality confidence grading

More Interactive
◦ Real-time interactions (e.g. Input Method)
◦ Feedback from end-users
◦ Dynamic user behaviors

Lower Deployment Threshold
◦ Affordable computing capability (esp. for Neural MT)
◦ Smaller corpus requirement
◦ Acknowledged quality metrics

What is important for Post-Editors?

Confidence is Gold
◦ Understanding MT
◦ Active attitude
◦ Controlling the process

Quality perception is essential
◦ Understanding user scenario and requirements
◦ Identifying error levels
◦ Searching and verifying

Knowledge management is valuable
◦ Terminology acquisition, management and sharing
◦ Domain knowledge accumulation and reuse

Curriculum for Post-Editor Training

Part 1 Live Lectures
◦ Machine Translation principles and applications
◦ Post-Editing Guidelines

Part 2 On-line Videos
◦ Guidelines for patent abstract translation (CE)
◦ Quality Standards for patent abstract translation
◦ TMS operation guides
◦ Terminology Review operation guides
◦ Terminology searching and verifying skills

Part 3 Exercises
◦ Translation (Post-editing) Tasks
◦ Error Correction Tasks
◦ Translation Comparison (PK) Tasks
◦ Terminology Review Tasks

Part 4 Simulated Project
◦ Group Project & Discussion

Confidence

Quality

Knowledge

On-line Courses

Exercise Type I : Translation

Auto-Grading

Auto-Commenting
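
The slides do not say how auto-grading and auto-commenting are implemented. As a minimal sketch of one possible approach, a trainee's post-edit could be scored by its token-level similarity to a reference translation, with a canned comment chosen from the score. The function names, the 0-100 scale, and the thresholds below are all hypothetical, not Lingosail's actual method.

```python
from difflib import SequenceMatcher


def grade(post_edit: str, reference: str) -> int:
    """Score a post-edit from 0 to 100 by token similarity to a reference.

    Assumption: whitespace tokenization and difflib's similarity ratio stand
    in for whatever metric a real grading system would use.
    """
    matcher = SequenceMatcher(None, post_edit.split(), reference.split())
    return round(matcher.ratio() * 100)


def comment(score: int, good: int = 80, fair: int = 50) -> str:
    """Pick a canned comment for a score (thresholds are illustrative)."""
    if score >= good:
        return "Close to the reference; check terminology only."
    if score >= fair:
        return "Partly matches the reference; review the differing segments."
    return "Differs substantially from the reference; revisit the source."


if __name__ == "__main__":
    reference = "The device comprises a housing and a sensor"
    post_edit = "The device includes a housing and a sensor"
    score = grade(post_edit, reference)
    print(score, comment(score))
```

A similarity score like this only measures closeness to one reference, so a valid alternative translation would be under-scored; that is one reason the curriculum also includes human-judged comparison tasks.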

Exercise Type II : Error Correction

Source

Translation with error

Correction

Reference

Exercise Type III : Comparison (PK)

Source

Translation 1

Reasons

Translation 2

Your Choice

Teacher’s Choice

Exercise Type IV : Term Review

Sentence Pair

Examples

Extracted Term Pair

Review decision

Patent PE course as an example
◦ Hundreds of participants last year
◦ Many of them come from MTI schools
◦ Most of them have no technical background
◦ Training lasts 6-8 weeks

Training Results

Trainees and training results

Workflow      Pre-training   Trained
MT+PE         1-2 pcs/h      5-6 pcs/h
Traditional   1-2 pcs/h      3-4 pcs/h
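
As a rough reading of the figures above, the reported ranges can be turned into approximate speed-up factors. The sketch below assumes the midpoint of each range is representative, which the slides do not state; the names `RANGES`, `midpoint`, and `speedup` are illustrative.

```python
# Throughput ranges in pieces per hour, as reported on the slide.
RANGES = {
    "MT+PE": {"pre": (1, 2), "trained": (5, 6)},
    "Traditional": {"pre": (1, 2), "trained": (3, 4)},
}


def midpoint(rng):
    """Midpoint of a (low, high) range; an assumption, not a reported value."""
    low, high = rng
    return (low + high) / 2


def speedup(workflow):
    """Ratio of trained to pre-training throughput, using range midpoints."""
    r = RANGES[workflow]
    return midpoint(r["trained"]) / midpoint(r["pre"])


if __name__ == "__main__":
    for name in RANGES:
        print(f"{name}: about {speedup(name):.1f}x after training")
```

Under the midpoint assumption, trained MT+PE throughput is roughly 3.7 times the pre-training rate, against roughly 2.3 times for the traditional workflow.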

Speed

Quality

Future Work

• PE courses for other domains
• PE courses for professionals & technical
• Automatic/Semi-automatic reviews
• More interactive MT
• Knowledge Graph construction and sharing

Thank you!

Contact me: weiyongpeng@lingosail.com

Q&A
