
Statistical Machine Translation

Prepared by
Yana Zaiets
Group 2.1
The first ideas of Statistical Machine Translation were introduced by
Warren Weaver as far back as 1947.
Statistical machine translation was re-introduced in the late 1980s and
early 1990s by researchers at IBM's Thomas J. Watson Research Center.

SMT is a machine translation paradigm where translations are generated
based on statistical models whose parameters are derived from the
analysis of bilingual text corpora (text bodies): collections of texts in the
source language paired with their existing translations in the target
language.
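The standard formulation behind this paradigm is the noisy-channel model: among candidate target sentences e for a source sentence f, pick the one maximizing P(f|e)·P(e), i.e. translation-model adequacy times language-model fluency. A minimal sketch with invented candidates and scores (not trained values):

```python
# Noisy-channel decoding over a tiny hand-made candidate set.
# All probabilities below are invented for illustration.
candidates = {
    # candidate translation: (translation model P(f|e), language model P(e))
    "the house is small": (0.7, 0.005),
    "small the house is": (0.7, 0.00001),  # adequate but disfluent
    "the building is tiny": (0.2, 0.004),  # fluent but less adequate
}

def decode(candidates):
    """Return the candidate maximizing P(f|e) * P(e)."""
    return max(candidates, key=lambda e: candidates[e][0] * candidates[e][1])

print(decode(candidates))  # → 'the house is small'
```

A real decoder searches over an enormous hypothesis space rather than a fixed list, but the scoring principle is the same.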
Language model
A language model is an essential component of any statistical machine
translation system; it helps make the translation as fluent as possible. It
is a function that takes a translated sentence and returns the probability of it
being produced by a native speaker.
Besides word order, language models also help with word choice: if a
foreign word has multiple possible translations, these functions may assign
higher probabilities to certain translations in specific contexts in the target
language.
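A common way to build such a function is an n-gram model estimated from a monolingual corpus. The following sketch trains a bigram model with add-one smoothing on a tiny invented corpus; a real system would use millions of sentences and stronger smoothing:

```python
from collections import defaultdict

# Hypothetical toy corpus; real language models train on millions of sentences.
corpus = [
    "the house is small",
    "the house is big",
    "the small house is old",
]

# Count bigrams and their history words over the corpus.
bigram_counts = defaultdict(int)
history_counts = defaultdict(int)
for sentence in corpus:
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    for prev, curr in zip(tokens, tokens[1:]):
        bigram_counts[(prev, curr)] += 1
        history_counts[prev] += 1

def sentence_probability(sentence):
    """Bigram probability of a sentence, with add-one smoothing."""
    vocab_size = len(history_counts)
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    prob = 1.0
    for prev, curr in zip(tokens, tokens[1:]):
        prob *= (bigram_counts[(prev, curr)] + 1) / (history_counts[prev] + vocab_size)
    return prob

# A fluent word order scores higher than a scrambled one.
print(sentence_probability("the house is small") > sentence_probability("house the small is"))
```

This is exactly the fluency signal the decoder multiplies into its score: word orders a native speaker would produce get higher probability.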
Word-based translation
In word-based translation, the fundamental unit of translation is a word in some
natural language.
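At its simplest, this means looking each source word up in a lexical translation table, in the spirit of the IBM word-based models. The table and probabilities below are invented for illustration, not trained:

```python
# Hypothetical lexical translation table (German → English);
# probabilities are invented for illustration, not estimated from data.
lex_table = {
    "das": {"the": 0.9, "that": 0.1},
    "Haus": {"house": 0.8, "home": 0.2},
    "klein": {"small": 0.7, "little": 0.3},
}

def translate_word_by_word(source_tokens):
    """Pick the most probable target word for each source word independently.
    This shows the core limitation: one word in, one word out, with no
    reordering and no handling of one-to-many translations."""
    return [max(lex_table[w], key=lex_table[w].get) for w in source_tokens]

print(translate_word_by_word(["das", "Haus"]))  # → ['the', 'house']
```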

Phrase-based translation
In phrase-based translation, the aim is to overcome the restrictions of
word-based translation by translating whole sequences of words, whose
lengths may differ between source and target.
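The core data structure is a phrase table mapping multi-word source segments to scored target phrases of possibly different length. The table, scores, and greedy matching below are a simplified illustration (a real decoder like Moses searches over all segmentations and reorderings):

```python
# Hypothetical phrase table: source phrases map to scored target phrases
# whose lengths may differ; all entries and scores are invented.
phrase_table = {
    ("ich", "habe"): [("I have", 0.6), ("I've", 0.4)],
    ("keine", "ahnung"): [("no idea", 0.9), ("no clue", 0.1)],
    ("ich",): [("I", 0.8), ("me", 0.2)],
}

def greedy_phrase_translate(tokens):
    """Greedily cover the source with the longest matching phrase,
    emitting the best-scoring target phrase for each segment."""
    output, i = [], 0
    while i < len(tokens):
        # Try the longest span first so multi-word phrases win over single words.
        for span in range(len(tokens) - i, 0, -1):
            key = tuple(tokens[i:i + span])
            if key in phrase_table:
                best = max(phrase_table[key], key=lambda pair: pair[1])
                output.append(best[0])
                i += span
                break
        else:
            output.append(tokens[i])  # pass unknown words through unchanged
            i += 1
    return " ".join(output)

print(greedy_phrase_translate(["ich", "habe", "keine", "ahnung"]))  # → 'I have no idea'
```

Note how "keine ahnung" becomes the two-word idiom "no idea" as a unit, which word-by-word lookup cannot capture.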

Syntax-based translation
Syntax-based translation is based on the idea of translating syntactic units, rather
than single words or strings of words (as in phrase-based MT).
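A minimal sketch of that idea: translation rules apply to syntactic constituents and can reorder them wholesale. The clause structure, rule, and lexicon below are invented for illustration (English SVO reordered to Japanese-style SOV):

```python
# Sketch of syntax-based translation: a rule operates on constituents
# (subject, verb, object), not on a flat word string. Lexicon is illustrative.
def translate_clause(subject, verb, obj, lexicon):
    """Translate each constituent, then reorder per a syntactic rule."""
    s, v, o = lexicon[subject], lexicon[verb], lexicon[obj]
    return [s, o, v]  # rule: SVO → SOV

lexicon = {"I": "watashi-wa", "eat": "tabemasu", "fish": "sakana-o"}
print(translate_clause("I", "eat", "fish", lexicon))
# → ['watashi-wa', 'sakana-o', 'tabemasu']
```

One syntactic rule handles the long-distance verb movement that phrase-based systems struggle with, which is why syntax-based approaches help for language pairs with very different word order.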
Benefits of SMT:
• More efficient use of human and data resources.
• There are many parallel corpora in machine-readable format and even more
monolingual data.
• Generally, SMT systems are not tailored to any specific pair of languages.
• Rule-based translation systems require the manual development of linguistic
rules, which can be costly and which often do not generalize to other
languages.
• More fluent translations owing to the use of a language model.
Shortcomings of SMT:
• Corpus creation can be costly.
• Specific errors are hard to predict and fix.
• Results may have superficial fluency that masks translation problems.
• Statistical machine translation usually works less well for language pairs with
significantly different word order.
• The benefits obtained for translation between Western European languages
are not representative of results for other language pairs, owing to smaller
training corpora and greater grammatical differences.
Conclusion

Statistical machine translation utilizes statistical translation models
whose parameters stem from the analysis of monolingual and
bilingual corpora. Building statistical translation models is a quick
process, but the technology relies heavily on existing multilingual
corpora: at least 2 million words are required for a specific domain,
and even more for general language. Theoretically it is possible to
reach the quality threshold, but most companies do not have such
large amounts of existing multilingual corpora to build the necessary
translation models. Additionally, statistical machine translation is
CPU-intensive and requires an extensive hardware configuration to
run translation models at average performance levels.
Systems implementing statistical machine translation:
• Google Translate (started transition to neural machine translation in 2016)
• Microsoft Translator (started transition to NMT in 2016)
• Omniscien Technologies
• SYSTRAN (started transition to NMT in 2016)
• Yandex.Translate (switched to a hybrid approach incorporating neural
machine translation in 2017)
