Paper Digest: Recent Papers on Machine Translation
The Paper Digest Team extracted all recent machine translation papers on our radar and generated highlight sentences for them. The results are sorted by relevance and date. In addition to this ‘static’ page, we also provide a real-time version of this article, which has broader coverage and is continuously updated with the most recent work on this topic.
Based in New York, Paper Digest is dedicated to producing high-quality text analysis results that people can actually use on a daily basis. Since 2018, we have been serving users across the world with a number of exclusive services to track, search, review and rewrite scientific literature.
You are welcome to follow us on Twitter and LinkedIn to stay updated on new conference digests.
Paper Digest Team
New York City, New York, 10017
team@paperdigest.org
TABLE 1: Paper Digest: Recent Papers on Machine Translation
# | Paper | Author(s) | Source | Date |
---|---|---|---|---|
1 | M3P: Towards Multimodal Multilingual Translation with Multimodal Prompt Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a framework to leverage the multimodal prompt to guide the Multimodal Multilingual neural Machine Translation (m3P), which aligns the representations of different languages with the same meaning and generates the conditional vision-language memory for translation. |
JIAN YANG et al. | arxiv-cs.CL | 2024-03-26 |
2 | Prediction of Translation Techniques for The Translation Process Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In contrast, the process of human-generated translation relies on a wide range of translation techniques, which are crucial for ensuring linguistic adequacy and fluency. This study suggests that these translation techniques could further optimize machine translation if they are automatically identified before being applied to guide the translation process effectively. |
Fan Zhou; Vincent Vandeghinste; | arxiv-cs.CL | 2024-03-21 |
3 | Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Previous studies have demonstrated the feasibility of MQM annotation but there are, to our knowledge, no computational models that predict MQM scores for novel texts, due to a lack of resources. In this paper, we address these shortcomings by (a) providing a 1200-sentence MQM evaluation benchmark for the language pair English-Korean and (b) reframing MT evaluation as the multi-task problem of simultaneously predicting several MQM scores using SOTA language models, both in a reference-based MT evaluation setup and a reference-free quality estimation (QE) setup. |
Dojun Park; Sebastian Padó; | arxiv-cs.CL | 2024-03-19 |
4 | CantonMT: Cantonese to English NMT Platform with Fine-Tuned Models Using Synthetic Back-Translation Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we deploy a standard data augmentation methodology by back-translation to a new language translation direction Cantonese-to-English. |
Kung Yin Hong; Lifeng Han; Riza Batista-Navarro; Goran Nenadic; | arxiv-cs.CL | 2024-03-17 |
5 | A Novel Paradigm Boosting Translation Capabilities of Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a study on strategies to enhance the translation capabilities of large language models (LLMs) in the context of machine translation (MT) tasks. |
JIAXIN GUO et al. | arxiv-cs.CL | 2024-03-17 |
6 | To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Both approaches have limitations – diversity methods may extract varied but trivial examples, while uncertainty sampling can yield repetitive, uninformative instances. To bridge this gap, we propose HUDS, a hybrid AL strategy for domain adaptation in NMT that combines uncertainty and diversity for sentence selection. |
Abdul Hameed Azeemi; Ihsan Ayyub Qazi; Agha Ali Raza; | arxiv-cs.CL | 2024-03-14 |
7 | Scaling Behavior of Machine Translation with Large Language Models Under Prompt Injection Attacks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Their generality, however, opens them up to subversion by end users who may embed into their requests instructions that cause the model to behave in unauthorized and possibly unsafe ways. In this work we study these Prompt Injection Attacks (PIAs) on multiple families of LLMs on a Machine Translation task, focusing on the effects of model size on the attack success rates. |
Zhifan Sun; Antonio Valerio Miceli-Barone; | arxiv-cs.CL | 2024-03-14 |
8 | MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a framework called MT-Patcher, which transfers knowledge from LLMs to existing MT models in a selective, comprehensive and proactive manner. |
Jiahuan Li; Shanbo Cheng; Shujian Huang; Jiajun Chen; | arxiv-cs.CL | 2024-03-14 |
9 | ACT-MNMT Auto-Constriction Turning for Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, due to the mixture of multilingual data during the pre-training of LLM, the LLM-based translation models face the off-target issue in both prompt-based methods, including a series of phenomena, namely instruction misunderstanding, translation with wrong language and over-generation. For this issue, this paper introduces an Auto-Constriction Turning mechanism for Multilingual Neural Machine Translation (ACT-MNMT), which is a novel supervised fine-tuning mechanism and orthogonal to the traditional prompt-based methods. |
Shaojie Dai; Xin Liu; Ping Luo; Yue Yu; | arxiv-cs.CL | 2024-03-11 |
10 | Enhanced Auto Language Prediction with Dictionary Capsule — A Novel Approach Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The paper presents a novel Auto Language Prediction Dictionary Capsule (ALPDC) framework for language prediction and machine translation. |
PINNI VENKATA ABHIRAM et al. | arxiv-cs.CL | 2024-03-09 |
11 | Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we directly compared two data augmentation techniques as potential solutions for monolingual STS: (a) cross-lingual transfer that exploits English resources alone as training data to yield non-English sentence embeddings as zero-shot inference, and (b) machine translation that converts English data into pseudo non-English training data in advance. |
Sho Hoshino; Akihiko Kato; Soichiro Murakami; Peinan Zhang; | arxiv-cs.CL | 2024-03-08 |
12 | Where Does In-context Translation Happen in Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we attempt to characterize the region where large language models transition from in-context learners to translation models. |
Suzanna Sia; David Mueller; Kevin Duh; | arxiv-cs.CL | 2024-03-07 |
13 | Attempt Towards Stress Transfer in Speech-to-Speech Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present an Indian English-to-Hindi SSMT system that can transfer stress and aim to enhance the overall quality and engagement of educational content. |
Sai Akarsh; Vamshi Raghusimha; Anindita Mondal; Anil Vuppala; | arxiv-cs.CL | 2024-03-06 |
14 | BiVert: Bidirectional Vocabulary Evaluation Using Relations for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a bidirectional semantic-based evaluation method designed to assess the sense distance of the translation from the source text. |
Carinne Cherf; Yuval Pinter; | arxiv-cs.CL | 2024-03-06 |
15 | GaHealth: An English-Irish Bilingual Corpus of Health Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our study outlines the process used in developing the corpus and empirically demonstrates the benefits of using an in-domain dataset for the health domain. |
Séamus Lankford; Haithem Afli; Órla Ní Loinsigh; Andy Way; | arxiv-cs.CL | 2024-03-06 |
16 | General2Specialized LLMs Translation for E-commerce Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Taking e-commerce as an example, the texts usually include amounts of domain-related words and have more grammar problems, which leads to inferior performances of current NMT methods. To address these problems, we collect two domain-related resources, including a set of term pairs (aligned Chinese-English bilingual terms) and a parallel corpus annotated for the e-commerce domain. |
KAIDI CHEN et al. | arxiv-cs.CL | 2024-03-06 |
17 | Did Translation Models Get More Robust Without Anyone Even Noticing? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Neural machine translation (MT) models achieve strong results across a variety of settings, but it is widely believed that they are highly sensitive to noisy inputs, such as spelling errors, abbreviations, and other formatting issues. In this paper, we revisit this insight in light of recent multilingual MT models and large language models (LLMs) applied to machine translation. |
Ben Peters; André F. T. Martins; | arxiv-cs.CL | 2024-03-06 |
18 | Detecting Concrete Visual Tokens for Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce new methods for detection of visually and contextually relevant (concrete) tokens from source sentences, including detection with natural language processing (NLP), detection with object detection, and a joint detection-verification technique. |
Braeden Bowen; Vipin Vijayan; Scott Grigsby; Timothy Anderson; Jeremy Gwinnup; | arxiv-cs.CL | 2024-03-05 |
19 | Adding Multimodal Capabilities to A Text-only Translation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In order to perform well on both Multi30k and typical text-only datasets, we use a performant text-only machine translation (MT) model as the starting point of our MMT model. |
Vipin Vijayan; Braeden Bowen; Scott Grigsby; Timothy Anderson; Jeremy Gwinnup; | arxiv-cs.CL | 2024-03-05 |
20 | The Case for Evaluating Multimodal Translation Models on Text Datasets Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Namely, the use of visual information by the MMT model cannot be shown directly from the Multi30k test set results and the sentences in Multi30k are image captions, i.e., short, descriptive sentences, as opposed to complex sentences that typical text-only machine translation models are evaluated against. Therefore, we propose that MMT models be evaluated using 1) the CoMMuTE evaluation framework, which measures the use of visual information by MMT models, 2) the text-only WMT news translation task test sets, which evaluates translation performance against complex sentences, and 3) the Multi30k test sets, for measuring MMT model performance against a real MMT dataset. |
Vipin Vijayan; Braeden Bowen; Scott Grigsby; Timothy Anderson; Jeremy Gwinnup; | arxiv-cs.CL | 2024-03-05 |
21 | Machine Translation in The Covid Domain: An English-Irish Case Study for LoResMT 2021 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Translation models for the specific domain of translating Covid data from English to Irish were developed for the LoResMT 2021 shared task. |
Séamus Lankford; Haithem Afli; Andy Way; | arxiv-cs.CL | 2024-03-02 |
22 | EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In our work, we observe that both direct and pivot translations are noisy and achieve less satisfactory performance. We propose EBBS, an ensemble method with a novel bi-level beam search algorithm, where each ensemble component explores its own prediction step by step at the lower level but they are synchronized by a soft voting mechanism at the upper level. |
Yuqiao Wen; Behzad Shayegh; Chenyang Huang; Yanshuai Cao; Lili Mou; | arxiv-cs.CL | 2024-02-29 |
23 | Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we show that CLQA can be addressed using a single encoder-decoder model. |
Fan Jiang; Tom Drummond; Trevor Cohn; | arxiv-cs.CL | 2024-02-26 |
24 | A Benchmark for Learning to Translate A New Language from One Grammar Book Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We turn to a field that is explicitly motivated and bottlenecked by a scarcity of web data: low-resource languages. In this paper, we introduce MTOB (Machine Translation from One Book), a benchmark for learning to translate between English and Kalamang—a language with less than 200 speakers and therefore virtually no presence on the web—using several hundred pages of field linguistics reference materials. |
Garrett Tanzer; Mirac Suzgun; Eline Visser; Dan Jurafsky; Luke Melas-Kyriazi; | iclr | 2024-02-26 |
25 | MT-Ranker: Reference-free Machine Translation Evaluation By Inter-system Ranking Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we formulate the reference-free MT evaluation into a pairwise ranking problem. |
Anonymous Authors; | iclr | 2024-02-26 |
26 | Improving LLM-based Machine Translation with Systematic Self-Correction Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Importantly, feeding back such error information into the LLMs can lead to self-correction and result in improved translation performance. Motivated by these insights, we introduce a systematic LLM-based self-correcting translation framework, named TER, which stands for Translate, Estimate, and Refine, marking a significant step forward in this direction. |
ZHAOPENG FENG et al. | arxiv-cs.CL | 2024-02-26 |
27 | A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this study, we propose a novel fine-tuning approach for LLMs that is specifically designed for the translation task, eliminating the need for the abundant parallel data that traditional translation models usually depend on. |
Anonymous Authors; | iclr | 2024-02-26 |
28 | An Interpretable Error Correction Method for Enhancing Code-to-code Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Moreover, researchers frequently invest substantial time and computational resources in retraining models, yet the improvement in translation accuracy is quite limited. To address these issues, we introduce a novel approach, $k\text{NN-ECD}$, which combines $k$-nearest-neighbor search with a key-value error correction datastore to overwrite the wrong translations of TransCoder-ST. |
Min Xue; Artur Andrzejak; Marla Leuther; | iclr | 2024-02-26 |
29 | TMT: Tri-Modal Translation Between Speech, Image, and Text By Processing Different Modalities As Different Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a novel Tri-Modal Translation (TMT) model that translates between arbitrary modalities spanning speech, image, and text. |
MINSU KIM et al. | arxiv-cs.CL | 2024-02-25 |
30 | Direct Punjabi to English Speech Translation Using Discrete Units Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: With a motive to contribute towards speech translation research for low-resource languages, our work presents a direct speech-to-speech translation model for one of the Indic languages called Punjabi to English. |
Prabhjot Kaur; L. Andrew M. Bush; Weisong Shi; | arxiv-cs.CL | 2024-02-24 |
31 | GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite numerous studies on gender bias in translations into English from weakly-gendered languages, there are no benchmarks for evaluating this phenomenon or for assessing mitigation strategies. To address this gap, we introduce GATE X-E, an extension to the GATE (Rarrick et al., 2023) corpus, that consists of human translations from Turkish, Hungarian, Finnish, and Persian into English. |
Spencer Rarrick; Ranjita Naik; Sundar Poudel; Vishal Chowdhary; | arxiv-cs.CL | 2024-02-21 |
32 | Bangla AI: A Framework for Machine Translation Utilizing Large Language Models for Ethnic Media Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The paper outlines a theoretical framework elucidating the integration of LLM and MMT into the news searching and translation processes for ethnic media. |
MD Ashraful Goni; Fahad Mostafa; Kerk F. Kee; | arxiv-cs.CL | 2024-02-21 |
33 | What Linguistic Features and Languages Are Important in LLM Translation? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Large Language Models (LLMs) demonstrate strong capability across multiple tasks, including machine translation. |
Ryandito Diandaru; Lucky Susanto; Zilu Tang; Ayu Purwarianti; Derry Wijaya; | arxiv-cs.CL | 2024-02-21 |
34 | Enhanced Hallucination Detection in Neural Machine Translation Through Simple Detector Aggregation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Previous research works have identified that detectors exhibit complementary performance: different detectors excel at detecting different types of hallucinations. In this paper, we propose to address the limitations of individual detectors by combining them and introducing a straightforward method for aggregating multiple detectors. |
Anas Himmi; Guillaume Staerman; Marine Picot; Pierre Colombo; Nuno M. Guerreiro; | arxiv-cs.CL | 2024-02-20 |
35 | UMBCLU at SemEval-2024 Task 1A and 1C: Semantic Textual Relatedness with and Without Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the system we developed for SemEval-2024 Task 1, Semantic Textual Relatedness for African and Asian Languages. |
Shubhashis Roy Dipta; Sai Vallurupalli; | arxiv-cs.CL | 2024-02-20 |
36 | SiLLM: Large Language Models for Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose SiLLM, which delegates the two sub-tasks to separate agents, thereby incorporating LLM into SiMT. |
Shoutao Guo; Shaolei Zhang; Zhengrui Ma; Min Zhang; Yang Feng; | arxiv-cs.CL | 2024-02-20 |
37 | NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Models with later knowledge cutoff dates yield lower perplexities and perform better in downstream tasks. |
Jonathan Zheng; Alan Ritter; Wei Xu; | arxiv-cs.CL | 2024-02-19 |
38 | Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we explore leveraging reinforcement learning with human feedback (RLHF) to improve translation quality. |
NUO XU et al. | arxiv-cs.CL | 2024-02-18 |
39 | Conversational SimulMT: Efficient Simultaneous Translation with Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a conversational SimulMT framework to enhance the inference efficiency of LLM-based SimulMT through multi-turn-dialogue-based decoding. |
Minghan Wang; Thuy-Trang Vu; Ehsan Shareghi; Gholamreza Haffari; | arxiv-cs.CL | 2024-02-16 |
40 | Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, prior work on LLM-based machine translation has mainly focused on better utilizing training data, demonstrations, or pre-defined and universal knowledge to improve performance, with a lack of consideration of decision-making like human translators. In this paper, we incorporate Thinker with the Drift-Diffusion Model (Thinker-DDM) to address this issue. |
HONGBIN NA et al. | arxiv-cs.CL | 2024-02-16 |
41 | Large Language Models Ad Referendum: How Good Are They at Machine Translation in The Legal Domain? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study evaluates the machine translation (MT) quality of two state-of-the-art large language models (LLMs) against a traditional neural machine translation (NMT) system across four language pairs in the legal domain. |
Vicent Briva-Iglesias; Joao Lucas Cavalheiro Camargo; Gokhan Dogru; | arxiv-cs.CL | 2024-02-12 |
42 | Quality Does Matter: A Detailed Look at The Quality and Utility of Web-Mined Parallel Corpora Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We conducted a detailed analysis on the quality of web-mined corpora for two low-resource languages (making three language pairs, English-Sinhala, English-Tamil and Sinhala-Tamil). |
Surangika Ranathunga; Nisansa de Silva; Menan Velayuthan; Aloka Fernando; Charitha Rathnayake; | arxiv-cs.CL | 2024-02-12 |
43 | Unsupervised Sign Language Translation and Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a sliding window method to address the issues of aligning variable-length text with video sequences. |
ZHENGSHENG GUO et al. | arxiv-cs.CL | 2024-02-12 |
44 | GenTranslate: Large Language Models Are Generative Multilingual Speech and Machine Translators Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a new generative paradigm for translation tasks, namely GenTranslate, which builds upon LLMs to generate better results from the diverse translation versions in N-best list. |
YUCHEN HU et al. | arxiv-cs.CL | 2024-02-10 |
45 | A Prompt Response to The Demand for Automatic Gender-Neutral Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: For such a scenario, large language models offer hitherto unforeseen possibilities, as they come with the distinct advantage of being versatile in various (sub)tasks when provided with explicit instructions. In this paper, we explore this potential to automate GNT by comparing MT with the popular GPT-4 model. |
Beatrice Savoldi; Andrea Piergentili; Dennis Fucci; Matteo Negri; Luisa Bentivogli; | arxiv-cs.CL | 2024-02-08 |
46 | TransLLaMa: LLM-based Simultaneous Translation System Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study demonstrates that, after fine-tuning on a small dataset comprising causally aligned source and target sentence pairs, a pre-trained open-source LLM can control input segmentation directly by generating a special wait token. |
Roman Koshkin; Katsuhito Sudoh; Satoshi Nakamura; | arxiv-cs.CL | 2024-02-07 |
47 | Revisiting The Markov Property for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we re-examine the Markov property in the context of neural machine translation. |
Cunxiao Du; Hao Zhou; Zhaopeng Tu; Jing Jiang; | arxiv-cs.CL | 2024-02-03 |
48 | A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose strategies to synthesize parallel data relying on morpho-syntactic information and using bilingual lexicons along with a small amount of seed parallel data. |
Md Mahfuz Ibn Alam; Sina Ahmadi; Antonios Anastasopoulos; | arxiv-cs.CL | 2024-02-02 |
49 | Neural Machine Translation for Malayalam Paraphrase Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study explores four methods of generating paraphrases in Malayalam, utilizing resources available for English paraphrasing and pre-trained Neural Machine Translation (NMT) models. |
Christeena Varghese; Sergey Koshelev; Ivan P. Yamshchikov; | arxiv-cs.CL | 2024-01-31 |
50 | MT-Ranker: Reference-free Machine Translation Evaluation By Inter-system Ranking Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we formulate the reference-free MT evaluation into a pairwise ranking problem. |
Ibraheem Muhammad Moosa; Rui Zhang; Wenpeng Yin; | arxiv-cs.CL | 2024-01-30 |
51 | Non-Fluent Synthetic Target-Language Data Improve Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: These approaches work under the assumption that non-fluent target-side synthetic training samples can be harmful and may deteriorate translation performance. Even so, in this paper we demonstrate that synthetic training samples with non-fluent target sentences can improve translation performance if they are used in a multilingual machine translation framework as if they were sentences in another language. |
Víctor M. Sánchez-Cartagena; Miquel Esplà-Gomis; Juan Antonio Pérez-Ortiz; Felipe Sánchez-Martínez; | arxiv-cs.CL | 2024-01-29 |
52 | Massively Multilingual Text Translation For Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We attempt to leverage translation resources from rich-resource languages to efficiently produce best possible translation quality for well known texts, which are available in multiple languages, in a new, low-resource language. |
Zhong Zhou; | arxiv-cs.CL | 2024-01-29 |
53 | MultiMUC: Multilingual Template Filling on MUC-4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce MultiMUC, the first multilingual parallel corpus for template filling, comprising translations of the classic MUC-4 template filling benchmark into five languages: Arabic, Chinese, Farsi, Korean, and Russian. |
WILLIAM GANTT et al. | arxiv-cs.CL | 2024-01-29 |
54 | Language Modelling Approaches to Adaptive Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Large language models (LLMs) have recently shown interesting capabilities of in-context learning, where they learn to replicate certain input-output text generation patterns, without further fine-tuning. Such capabilities have opened new horizons for domain-specific data augmentation and real-time adaptive MT. This work attempts to address two main relevant questions: 1) in scenarios involving human interaction and continuous feedback, can we employ language models to improve the quality of adaptive MT at inference time? |
Yasmin Moslem; | arxiv-cs.CL | 2024-01-25 |
55 | Misgendering and Assuming Gender in Machine Translation When Working with Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This chapter focuses on gender-related errors in machine translation (MT) in the context of low-resource languages. |
Sourojit Ghosh; Srishti Chatterjee; | arxiv-cs.CL | 2024-01-23 |
56 | How Far Can 100 Samples Go? Unlocking Overall Zero-Shot Multilingual Translation Via Tiny Multi-Parallel Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we show that for an English-centric model, surprisingly large zero-shot improvements can be achieved by simply fine-tuning with a very small amount of multi-parallel data. |
Di Wu; Shaomu Tan; Yan Meng; David Stap; Christof Monz; | arxiv-cs.CL | 2024-01-22 |
57 | An Empirical Study of In-context Learning in LLMs for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Recent interest has surged in employing Large Language Models (LLMs) for machine translation (MT) via in-context learning (ICL) (Vilar et al., 2023). |
Pranjal A. Chitale; Jay Gala; Raj Dabre; | arxiv-cs.CL | 2024-01-22 |
58 | Gender Bias in Machine Translation and The Era of Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This chapter examines the role of Machine Translation in perpetuating gender bias, highlighting the challenges posed by cross-linguistic settings and statistical dependencies. |
Eva Vanmassenhove; | arxiv-cs.CL | 2024-01-18 |
59 | Gradable ChatGPT Translation Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Accordingly, this paper proposes a generic taxonomy, which defines gradable translation prompts in terms of expression type, translation style, POS information and explicit statement, thus facilitating the construction of prompts endowed with distinct attributes tailored for various translation tasks. |
Hui Jiao; Bei Peng; Lu Zong; Xiaojun Zhang; Xinwei Li; | arxiv-cs.CL | 2024-01-18 |
60 | Salute The Classic: Revisiting Challenges of Machine Translation in The Age of Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The evolution of Neural Machine Translation (NMT) has been significantly influenced by six core challenges (Koehn and Knowles, 2017), which have acted as benchmarks for progress in this field. This study revisits these challenges, offering insights into their ongoing relevance in the context of advanced Large Language Models (LLMs): domain mismatch, amount of parallel data, rare word prediction, translation of long sentences, attention model as word alignment, and sub-optimal beam search. |
JIANHUI PANG et al. | arxiv-cs.CL | 2024-01-16 |
61 | A Novel Approach for Automatic Program Repair Using Round-Trip Translation with Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes bypassing the fine-tuning step and using Round-Trip Translation (RTT): translation of code from one programming language to another programming or natural language, and back. |
Fernando Vallecillos Ruiz; Anastasiia Grishina; Max Hort; Leon Moonen; | arxiv-cs.SE | 2024-01-15 |
62 | Enhancing Document-level Translation of Large Language Model Via Translation Mixed-instructions Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address the issue, we propose an approach that combines sentence-level and document-level translation instructions of varying lengths to fine-tune LLMs. |
Yachao Li; Junhui Li; Jing Jiang; Min Zhang; | arxiv-cs.CL | 2024-01-15 |
63 | MiTTenS: A Dataset for Evaluating Misgendering in Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Translation systems, including foundation models capable of translation, can produce errors that result in misgendering harms. To measure the extent of such potential harms when translating into and out of English, we introduce a dataset, MiTTenS, covering 26 languages from a variety of language families and scripts, including several traditionally underrepresented in digital resources. |
Kevin Robinson; Sneha Kudugunta; Romina Stella; Sunipa Dev; Jasmijn Bastings; | arxiv-cs.CL | 2024-01-12 |
64 | Adapting Large Language Models for Document-Level Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we delve into the process of adapting LLMs to specialize in document-level machine translation (DocMT) for a specific language pair. |
Minghao Wu; Thuy-Trang Vu; Lizhen Qu; George Foster; Gholamreza Haffari; | arxiv-cs.CL | 2024-01-12 |
65 | Machine Translation Models Are Zero-Shot Detectors of Translation Direction Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we explore an unsupervised approach to translation direction detection based on the simple hypothesis that $p(\text{translation}|\text{original})>p(\text{original}|\text{translation})$, motivated by the well-known simplification effect in translationese or machine-translationese. |
Michelle Wastl; Jannis Vamvas; Rico Sennrich; | arxiv-cs.CL | 2024-01-12 |
66 | Lost in The Source Language: How Large Language Models Evaluate The Quality of Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This study aims to explore how LLMs leverage source and reference information in evaluating translations, with the ultimate goal of better understanding the working mechanism of LLMs. |
XU HUANG et. al. | arxiv-cs.CL | 2024-01-12 |
67 | An Approach for Mistranslation Removal from Popular Dataset for Indic MT Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Hence, the MT systems built using this dataset cannot perform to their usual potential. In this paper, we propose an algorithm to remove mistranslations from the training corpus and evaluate its performance and efficiency. |
Sudhansu Bala Das; Leo Raphael Rodrigues; Tapas Kumar Mishra; Bidyut Kr. Patra; | arxiv-cs.CL | 2024-01-12 |
68 | Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we focus on boosting many-to-many multilingual translation of LLMs with an emphasis on zero-shot translation directions. |
Pengzhi Gao; Zhongjun He; Hua Wu; Haifeng Wang; | arxiv-cs.CL | 2024-01-11 |
69 | Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This article introduces contrastive alignment instructions (AlignInstruct) to address two challenges in machine translation (MT) on large language models (LLMs). |
Zhuoyuan Mao; Yen Yu; | arxiv-cs.CL | 2024-01-11 |
70 | Can ChatGPT Rival Neural Machine Translation? A Comparative Study Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by the increasing interest in leveraging large language models for translation, this paper evaluates the capabilities of large language models (LLMs) represented by ChatGPT in comparison to the mainstream neural machine translation (NMT) engines in translating Chinese diplomatic texts into English. |
Zhaokun Jiang; Ziyin Zhang; | arxiv-cs.CL | 2024-01-10 |
71 | POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Probability-driven Meta-graph Prompter (POMP), a novel approach employing a dynamic, sampling-based graph of multiple auxiliary languages to enhance LLMs’ translation capabilities for LRLs. |
SHILONG PAN et. al. | arxiv-cs.CL | 2024-01-10 |
72 | Aligning Translation-Specific Understanding to General Understanding in Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To align the translation-specific understanding to the general one, we propose a novel translation process xIoD (Cross-Lingual Interpretation of Difficult words), explicitly incorporating the general understanding on the content incurring inconsistent understanding to guide the translation. |
YICHONG HUANG et. al. | arxiv-cs.CL | 2024-01-10 |
73 | LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To mitigate that problem, we proposed the first unsupervised multilingual paraphrasing model, LAMPAT (Low-rank Adaptation for Multilingual Paraphrasing using Adversarial Training), for which a monolingual dataset is sufficient to generate human-like and diverse sentences. |
Khoi M. Le; Trinh Pham; Tho Quan; Anh Tuan Luu; | arxiv-cs.CL | 2024-01-08 |
74 | Building Efficient and Effective OpenQA Systems for Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we show that effective, low-cost OpenQA systems can be developed for low-resource languages. |
EMRAH BUDUR et. al. | arxiv-cs.CL | 2024-01-07 |
75 | MirrorDiffusion: Stabilizing Diffusion Process in Zero-shot Image Translation By Prompts Redescription and Beyond Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To make reconstruction explicit, we propose a prompt redescription strategy to realize a mirror effect between the source and reconstructed image in the diffusion model (MirrorDiffusion). |
Yupei Lin; Xiaoyu Xian; Yukai Shi; Liang Lin; | arxiv-cs.CV | 2024-01-06 |
76 | Machine Translation Testing Via Syntactic Tree Pruning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, it is challenging to test machine translation systems because of the complexity and intractability of the underlying neural models. To tackle these challenges, we propose a novel metamorphic testing approach by syntactic tree pruning (STP) to validate machine translation systems. |
QUANJUN ZHANG et. al. | arxiv-cs.CL | 2024-01-01 |
77 | Multi-scale Progressive Feature Embedding for Accurate NIR-to-RGB Spectral Domain Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: NIR-to-RGB spectral domain translation is a challenging task due to the mapping ambiguities, and existing methods show limited learning capacities. To address these challenges, we propose to colorize NIR images via a multi-scale progressive feature embedding network (MPFNet), with the guidance of grayscale image colorization. |
Xingxing Yang; Jie Chen; Zaifeng Yang; | arxiv-cs.CV | 2023-12-26 |
78 | CLAD-ST: Contrastive Learning with Adversarial Data for Robust Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We address this robustness problem in downstream MT models by forcing the MT encoder to bring the representations of a noisy input closer to its clean version in the semantic space. This is achieved by introducing a contrastive learning method that leverages adversarial examples in the form of ASR outputs paired with their corresponding human transcripts to optimize the network parameters. |
Sathish Indurthi; Shamil Chollampatt; Ravi Agrawal; Marco Turchi; | emnlp | 2023-12-22 |
79 | Program Translation Via Code Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we propose a novel model called Code Distillation (CoDist) whereby we capture the semantic and structural equivalence of code in a language agnostic intermediate representation. |
YUFAN HUANG et. al. | emnlp | 2023-12-22 |
80 | An Empirical Study of Translation Hypothesis Ensembling with Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. |
António Farinhas; José de Souza; Andre Martins; | emnlp | 2023-12-22 |
81 | Revisiting Machine Translation for Cross-lingual Classification IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show that, by using a stronger MT system and mitigating the mismatch between training on original text and running inference on machine translated text, translate-test can do substantially better than previously assumed. |
Mikel Artetxe; Vedanuj Goswami; Shruti Bhosale; Angela Fan; Luke Zettlemoyer; | emnlp | 2023-12-22 |
82 | Towards A Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Through systematic experimentation, spanning 1,560 language directions across 40 languages, we identify three key factors contributing to high variations in ZS NMT performance: 1) target-side translation quality, 2) vocabulary overlap, and 3) linguistic properties. |
Shaomu Tan; Christof Monz; | emnlp | 2023-12-22 |
83 | MT2: Towards A Multi-Task Machine Translation Model with Translation-Specific In-Context Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Most of the previous work uses separate models or methods to solve these tasks, which is not conducive to knowledge transfer of different tasks and increases the complexity of system construction. In this work, we explore the potential of pre-trained language model in machine translation tasks and propose a Multi-Task Machine Translation (MT2) model to integrate these translation tasks. |
CHUNYOU LI et. al. | emnlp | 2023-12-22 |
84 | A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In MT, this might lead to misgendered translations, resulting, among other harms, in the perpetuation of stereotypes and prejudices. In this work, we address this gap by investigating whether and to what extent such models exhibit gender bias in machine translation and how we can mitigate it. |
Giuseppe Attanasio; Flor Plaza del Arco; Debora Nozza; Anne Lauscher; | emnlp | 2023-12-22 |
85 | HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we release an annotated dataset for the hallucination and omission phenomena covering 18 translation directions with varying resource levels and scripts. |
DAVID DALE et. al. | emnlp | 2023-12-22 |
86 | MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a modularized MNMT framework that is able to flexibly assemble dense and MoE-based sparse modules to achieve the best of both worlds. |
SHANGJIE LI et. al. | emnlp | 2023-12-22 |
87 | Continual Learning for Multilingual Neural Machine Translation Via Dual Importance-based Model Division Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To achieve this, the existing methods primarily focus on preventing catastrophic forgetting by making compromises between the original and new language pairs, leading to sub-optimal performance on both translation tasks. To mitigate this problem, we propose a dual importance-based model division method to divide the model parameters into two parts and separately model the translation of the original and new tasks. |
JUNPENG LIU et. al. | emnlp | 2023-12-22 |
88 | Crossing The Threshold: Idiomatic Machine Translation Through Retrieval Augmentation and Loss Weighting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To improve translation of natural idioms, we introduce two straightforward yet effective techniques: the strategic upweighting of training loss on potentially idiomatic sentences, and using retrieval-augmented models. |
Emmy Liu; Aditi Chaudhary; Graham Neubig; | emnlp | 2023-12-22 |
89 | Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with The GeNTE Corpus Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Based on GeNTE, we then overview existing reference-based evaluation approaches, highlight their limits, and propose a reference-free method more suitable to assess gender-neutral translation. |
Andrea Piergentili; Beatrice Savoldi; Dennis Fucci; Matteo Negri; Luisa Bentivogli; | emnlp | 2023-12-22 |
90 | PromptST: Abstract Prompt Learning for End-to-End Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we take the first step toward understanding the fusion of speech and text features in S2T model. |
TENGFEI YU et. al. | emnlp | 2023-12-22 |
91 | DecoMT: Decomposed Prompting for Machine Translation Between Related Languages Using Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce DecoMT, a novel approach of few-shot prompting that decomposes the translation process into a sequence of word chunk translations. |
Ratish Puduppully; Anoop Kunchukuttan; Raj Dabre; Ai Ti Aw; Nancy Chen; | emnlp | 2023-12-22 |
92 | Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, when it comes to non-English languages, the quantity and quality of textual information are comparatively scarce. To address this issue, we introduce the novel task of automatic Knowledge Graph Enhancement (KGE) and perform a thorough investigation on bridging the gap in both the quantity and quality of textual information between English and non-English languages. |
SIMONE CONIA et. al. | emnlp | 2023-12-22 |
93 | PROSE: A Pronoun Omission Solution for Chinese-English Spoken Language Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To alleviate the negative impact introduced by pro-drop, we propose Mention-Aware Semantic Augmentation, a novel approach that leverages the semantic embedding of dropped pronouns to augment training pairs. |
Ke Wang; Xiutian Zhao; Yanghui Li; Wei Peng; | emnlp | 2023-12-22 |
94 | Document-Level Machine Translation with Large Language Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The study focuses on three aspects: 1) Effects of Context-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of ChatGPT with commercial MT systems and advanced document-level MT methods; 3) Analysis of Discourse Modelling Abilities, where we further probe discourse knowledge encoded in LLMs and shed light on impacts of training techniques on discourse modeling. |
LONGYUE WANG et. al. | emnlp | 2023-12-22 |
95 | Learn and Consolidate: Continual Adaptation for Zero-Shot and Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we propose a two-stage approach that encourages original models to acquire language-agnostic multilingual representations from new data, and preserves the model architecture without introducing parameters. |
Kaiyu Huang; Peng Li; Junpeng Liu; Maosong Sun; Yang Liu; | emnlp | 2023-12-22 |
96 | On The Use of Metaphor Translation in Psychiatry Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Now, metaphor has been shown to be paramount in both identifying individuals struggling with mental problems and helping those individuals understand and communicate their experiences. Therefore, this paper aims to survey the potential of Machine Translation for providing equitable psychiatric healthcare and highlights the need for further research on the transferability of existing machine and metaphor translation research in the domain of psychiatry. |
Lois Wong; | arxiv-cs.CL | 2023-12-22 |
97 | Challenges in Context-Aware Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we investigate and present several core challenges that impede progress within the field, relating to discourse phenomena, context usage, model architectures, and document-level evaluation. |
Linghao Jin; Jacqueline He; Jonathan May; Xuezhe Ma; | emnlp | 2023-12-22 |
98 | Video-Helpful Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce EVA (Extensive training set and Video-helpful evaluation set for Ambiguous subtitles translation), an MMT dataset containing 852k Japanese-English parallel subtitle pairs, 520k Chinese-English parallel subtitle pairs, and corresponding video clips collected from movies and TV episodes. |
Yihang Li; Shuichiro Shimizu; Chenhui Chu; Sadao Kurohashi; Wei Li; | emnlp | 2023-12-22 |
99 | Multilingual K-Nearest-Neighbor Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, these improvements have been limited to high-resource language pairs, with large datastores, and remain a challenge for low-resource languages. In this paper, we address this issue by combining representations from multiple languages into a single datastore. |
David Stap; Christof Monz; | emnlp | 2023-12-22 |
100 | Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we specifically target the gender bias issue of multilingual machine translation models for unambiguous cases where there is a single correct translation, and propose a bias mitigation method based on a novel approach. |
MINWOO LEE et. al. | emnlp | 2023-12-22 |
101 | Simple and Effective Input Reformulations for Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we reformulate inputs during finetuning for challenging translation tasks, leveraging model strengths from pretraining in novel ways to improve downstream performance. |
Brian Yu; Hansen Lillemark; Kurt Keutzer; | emnlp | 2023-12-22 |
102 | Exploring Discourse Structure in Document-level Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a more sound paragraph-to-paragraph translation mode and explore whether discourse structure can improve DocMT. |
Xinyu Hu; Xiaojun Wan; | emnlp | 2023-12-22 |
103 | Contextual Code Switching for Machine Translation Using Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present an extensive study on the code switching task specifically for the machine translation task comparing multiple LLMs. |
Arshad Kaji; Manan Shah; | arxiv-cs.CL | 2023-12-20 |
104 | Is Post-editing Really Faster Than Human Translation? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: It uses an exploratory data analysis approach to investigate data for 90 million words translated by 879 linguists across 11 language pairs, over 2.5 years. |
Silvia Terribile; | arxiv-cs.CL | 2023-12-19 |
105 | An Empirical Study of Unsupervised Neural Machine Translation: Analyzing NMT Output, Model’s Behavior and Sentences’ Contribution Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We focus on three very diverse languages, French, Gujarati, and Kazakh, and train bilingual NMT models, to and from English, with various levels of supervision, in high- and low-resource setups, measure the quality of the NMT output, and compare the generated sequences’ word order and semantic similarity to source and reference sentences. |
Isidora Chara Tourni; Derry Wijaya; | arxiv-cs.CL | 2023-12-19 |
106 | Fine-tuning Large Language Models for Adaptive Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents the outcomes of fine-tuning Mistral 7B, a general-purpose large language model (LLM), for adaptive machine translation (MT). |
Yasmin Moslem; Rejwanul Haque; Andy Way; | arxiv-cs.CL | 2023-12-19 |
107 | Word Closure-Based Metamorphic Testing for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a word closure-based output comparison method to address the limitations of the existing MTS MT methods. |
Xiaoyuan Xie; Shuo Jin; Songqiang Chen; Shing-Chi Cheung; | arxiv-cs.SE | 2023-12-19 |
108 | Predicting Human Translation Difficulty with Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We find that surprisal and attention are complementary predictors of translation difficulty, and that surprisal derived from a NMT model is the single most successful predictor of production duration. |
Zheng Wei Lim; Ekaterina Vylomova; Charles Kemp; Trevor Cohn; | arxiv-cs.CL | 2023-12-18 |
109 | Neural Machine Translation of Clinical Text: An Empirical Investigation Into Multilingual Pre-Trained Language Models and Transfer-Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We conduct investigations on clinical text machine translation by examining multilingual neural network models using deep learning such as Transformer based structures. |
LIFENG HAN et. al. | arxiv-cs.CL | 2023-12-12 |
110 | Converting Epics/Stories Into Pseudocode Using Transformers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: With this research paper, we aim to present a methodology to generate pseudocode from a given agile user story of small functionalities so as to reduce the overall time spent on the industrial project. |
Gaurav Kolhatkar; Akshit Madan; Nidhi Kowtal; Satyajit Roy; Sheetal Sonawane; | arxiv-cs.CL | 2023-12-08 |
111 | Making Translators Privacy-aware on The User’s Side Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose PRISM to enable users of machine translation systems to preserve the privacy of data on their own initiative. |
Ryoma Sato; | arxiv-cs.CR | 2023-12-07 |
112 | Efficient Monotonic Multihead Attention Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce the Efficient Monotonic Multihead Attention (EMMA), a state-of-the-art simultaneous translation model with numerically-stable and unbiased monotonic alignment estimation. |
Xutai Ma; Anna Sun; Siqi Ouyang; Hirofumi Inaguma; Paden Tomasello; | arxiv-cs.CL | 2023-12-07 |
113 | Improving Neural Machine Translation By Multi-Knowledge Integration with Prompting Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we focus on how to integrate multi-knowledge, multiple types of knowledge, into NMT models to enhance the performance with prompting. |
Ke Wang; Jun Xie; Yuqi Zhang; Yu Zhao; | arxiv-cs.CL | 2023-12-07 |
114 | First Attempt at Building Parallel Corpora for Machine Translation of Northeast India’s Very Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents the creation of initial bilingual corpora for thirteen very low-resource languages of India, all from Northeast India. |
Atnafu Lambebo Tonja; Melkamu Mersha; Ananya Kalita; Olga Kolesnikova; Jugal Kalita; | arxiv-cs.CL | 2023-12-07 |
115 | Simul-LLM: A Framework for Exploring High-Quality Simultaneous Translation with Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we address key challenges facing LLMs fine-tuned for SimulMT, validate classical SimulMT concepts and practices in the context of LLMs, explore adapting LLMs that are fine-tuned for NMT to the task of SimulMT, and introduce Simul-LLM, the first open-source fine-tuning and evaluation pipeline development framework for LLMs focused on SimulMT. |
Victor Agostinelli; Max Wild; Matthew Raffel; Kazi Ahmed Asif Fuad; Lizhong Chen; | arxiv-cs.CL | 2023-12-07 |
116 | End-to-End Speech-to-Text Translation: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: As a result, researchers have been exploring end-to-end (E2E) models for ST translation. |
Nivedita Sethiya; Chandresh Kumar Maurya; | arxiv-cs.CL | 2023-12-02 |
117 | Quick Back-Translation for Unsupervised Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a two-for-one improvement to Transformer back-translation: Quick Back-Translation (QBT). |
Benjamin Brimacombe; Jiawei Zhou; | arxiv-cs.CL | 2023-12-01 |
118 | Women Are Beautiful, Men Are Leaders: Gender Stereotypes in Machine Translation and Language Modeling Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present GEST — a new dataset for measuring gender-stereotypical reasoning in masked LMs and English-to-X machine translation systems. |
Matúš Pikuliak; Andrea Hrckova; Stefan Oresko; Marián Šimko; | arxiv-cs.CL | 2023-11-30 |
119 | Relevance-guided Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose an explainability-based training approach for NMT, applied in Unsupervised and Supervised model training, for translation of three languages of varying resources, French, Gujarati, Kazakh, to and from English. |
Isidora Chara Tourni; Derry Wijaya; | arxiv-cs.CL | 2023-11-30 |
120 | INarIG: Iterative Non-autoregressive Instruct Generation Model For Word-Level Auto Completion Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose the INarIG (Iterative Non-autoregressive Instruct Generation) model, which constructs the human typed sequence into Instruction Unit and employs iterative decoding with subwords to fully utilize input information given in the task. |
HENGCHAO SHANG et. al. | arxiv-cs.CL | 2023-11-29 |
121 | Mergen: The First Manchu-Korean Machine Translation Model Trained on Augmented Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In our efforts to safeguard the Manchu language, we introduce Mergen, the first-ever attempt at a Manchu-Korean Machine Translation (MT) model. |
Jean Seo; Sungjoo Byun; Minha Kang; Sangah Lee; | arxiv-cs.CL | 2023-11-29 |
122 | A Benchmark for Evaluating Machine Translation Metrics on Dialects Without Standard Orthography Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we evaluate how robust metrics are to non-standardized dialects, i.e. spelling differences in language varieties that do not have a standard orthography. |
Noëmi Aepli; Chantal Amrhein; Florian Schottmann; Rico Sennrich; | arxiv-cs.CL | 2023-11-28 |
123 | Reducing Gender Bias in Machine Translation Through Counterfactual Data Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We also propose a novel domain-adaptation technique that leverages in-domain data created with the counterfactual data generation techniques proposed by Zmigrod et al. (2019) to further improve accuracy on the WinoMT challenge test set without significant loss in translation quality. We show its effectiveness in NMT systems from English into three morphologically rich languages French, Spanish, and Italian. |
Ranjita Naik; Spencer Rarrick; Vishal Chowdhary; | arxiv-cs.CL | 2023-11-27 |
124 | Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, when it comes to non-English languages, the quantity and quality of textual information are comparatively scarce. To address this issue, we introduce the novel task of automatic Knowledge Graph Enhancement (KGE) and perform a thorough investigation on bridging the gap in both the quantity and quality of textual information between English and non-English languages. |
SIMONE CONIA et. al. | arxiv-cs.AI | 2023-11-27 |
125 | Improving Word Sense Disambiguation in Neural Machine Translation with Salient Document Context Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a simple and scalable approach to resolve translation ambiguity by incorporating a small amount of extra-sentential context in neural MT. Our approach requires no sense annotation and no change to standard model architectures. |
Elijah Rippeth; Marine Carpuat; Kevin Duh; Matt Post; | arxiv-cs.CL | 2023-11-26 |
126 | Machine Translation for Ge’ez Language Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we explore various methods to improve Ge’ez MT, including transfer-learning from related languages, optimizing shared vocabulary and token segmentation approaches, finetuning large pre-trained models, and using large language models (LLMs) for few-shot translation with fuzzy matches. |
Aman Kassahun Wassie; | arxiv-cs.CL | 2023-11-24 |
127 | Machine Translation to Control Formality Features in The Target Language Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: When a language translation technique is used to translate from a source language that does not encode formality (e.g. English) to a target language that does, there is missing information on formality that could be a challenge in producing an accurate outcome. This research explores how this issue should be resolved when machine learning methods are used to translate from English to languages with formality, using Hindi as the example data. |
Harshita Tyagi; Prashasta Jung; Hyowon Lee; | arxiv-cs.CL | 2023-11-22 |
128 | Context-aware Neural Machine Translation for English-Japanese Business Scene Dialogues Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we explore how context-awareness can improve the performance of the current Neural Machine Translation (NMT) models for English-Japanese business dialogues translation, and what kind of context provides meaningful information to improve translation. |
Sumire Honda; Patrick Fernandes; Chrysoula Zerva; | arxiv-cs.CL | 2023-11-20 |
129 | Vashantor: A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Despite extensive study into translating Bangla to English, English to Bangla, and Banglish to Bangla in the past, there has been a noticeable gap in translating Bangla regional dialects into standard Bangla. In this study, we set out to fill this gap by creating a collection of 32,500 sentences, encompassing Bangla, Banglish, and English, representing five regional Bangla dialects. |
FATEMA TUJ JOHORA FARIA et. al. | arxiv-cs.CL | 2023-11-18 |
130 | SentAlign: Accurate and Scalable Sentence Alignment Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present SentAlign, an accurate sentence alignment tool designed to handle very large parallel document pairs. |
Steinþór Steingrímsson; Hrafn Loftsson; Andy Way; | arxiv-cs.CL | 2023-11-15 |
131 | Assessing Translation Capabilities of Large Language Models Involving English and Indian Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, our aim is to explore the multilingual capabilities of large language models by using machine translation as a task involving English and 22 Indian languages. |
VANDAN MUJADIA et. al. | arxiv-cs.CL | 2023-11-15 |
132 | Evaluating Gender Bias in The Translation of Gender-Neutral Languages Into English Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite numerous studies into gender bias in translations from gender-neutral languages such as Turkish into more strongly gendered languages like English, there are no benchmarks for evaluating this phenomenon or for assessing mitigation strategies. To address this gap, we introduce GATE X-E, an extension to the GATE (Rarrick et al., 2023) corpus, that consists of human translations from Turkish, Hungarian, Finnish, and Persian into English. |
Spencer Rarrick; Ranjita Naik; Sundar Poudel; Vishal Chowdhary; | arxiv-cs.CL | 2023-11-15 |
133 | Aligning Neural Machine Translation Models: Human Feedback in Training and Inference Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we comprehensively explore and compare techniques for integrating quality metrics as reward models into the MT pipeline. |
Miguel Moura Ramos; Patrick Fernandes; António Farinhas; André F. T. Martins; | arxiv-cs.CL | 2023-11-15 |
134 | Pinpoint, Not Criticize: Refining Large Language Models Via Fine-Grained Actionable Feedback Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose an inference time optimization method FITO to use fine-grained actionable feedback in the form of error type, error location and severity level that are predicted by a learned error pinpoint model for iterative refinement. |
WENDA XU et. al. | arxiv-cs.CL | 2023-11-15 |
135 | Extending Multilingual Machine Translation Through Imitation Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We aim to extend large-scale MNMT models to a new language, allowing for translation between the newly added and all of the already supported languages in a challenging scenario: using only a parallel corpus between the new language and English. |
Wen Lai; Viktor Hangya; Alexander Fraser; | arxiv-cs.CL | 2023-11-14 |
136 | Non-autoregressive Machine Translation with Probabilistic Context-free Grammar Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, conventional NAT models suffer from limited expression power and performance degradation compared to autoregressive (AT) models due to the assumption of conditional independence among target tokens. To address these limitations, we propose a novel approach called PCFG-NAT, which leverages a specially designed Probabilistic Context-Free Grammar (PCFG) to enhance the ability of NAT models to capture complex dependencies among output tokens. |
SHANGTONG GUI et. al. | arxiv-cs.CL | 2023-11-14 |
137 | How Good Are Large Language Models on African Languages? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present an analysis of three popular large language models (mT0, LLaMa 2, and GPT-4) on five tasks (news topic classification, sentiment classification, machine translation, question answering, and named entity recognition) across 30 African languages, spanning different language families and geographical regions. |
Jessica Ojo; Kelechi Ogueji; Pontus Stenetorp; David I. Adelani; | arxiv-cs.CL | 2023-11-14 |
138 | Anti-LM Decoding for Zero-shot In-context Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work introduces an Anti-Language Model objective with a decay factor designed to address the weaknesses of In-context Machine Translation. |
Suzanna Sia; Alexandra DeLucia; Kevin Duh; | arxiv-cs.CL | 2023-11-14 |
139 | On-the-Fly Fusion of Large Language Models and Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose the on-the-fly ensembling of a machine translation model with an LLM, prompted on the same task and input. |
Hieu Hoang; Huda Khayrallah; Marcin Junczys-Dowmunt; | arxiv-cs.CL | 2023-11-14 |
140 | Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Pivoting via high-resource languages remains a strong strategy for low-resource directions, and in this paper we revisit ways of pivoting through multiple languages. |
Alireza Mohammadshahi; Jannis Vamvas; Rico Sennrich; | arxiv-cs.CL | 2023-11-13 |
141 | Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Added toxicity in the context of translation refers to the fact of producing a translation output with more toxicity than there exists in the input. In this paper, we present MinTox which is a novel pipeline to identify added toxicity and mitigate this issue which works at inference time. |
Marta R. Costa-jussà; David Dale; Maha Elbayad; Bokai Yu; | arxiv-cs.CL | 2023-11-11 |
142 | Gender Inflected or Bias Inflicted: On Using Grammatical Gender Cues for Bias Evaluation in Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To demonstrate our point, in this work, we use Hindi as the source language and construct two sets of gender-specific sentences: OTSC-Hindi and WinoMT-Hindi that we use to evaluate different Hindi-English (HI-EN) NMT systems automatically for gender bias. |
Pushpdeep Singh; | arxiv-cs.CL | 2023-11-07 |
143 | Findings of The WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in The Cosmos of LLMs Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We employ both automatic and human evaluations to measure the performance of the submitted systems. |
LONGYUE WANG et. al. | arxiv-cs.CL | 2023-11-06 |
144 | CBSiMT: Mitigating Hallucination in Simultaneous Machine Translation with Weighted Prefix-to-Prefix Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a Confidence-Based Simultaneous Machine Translation (CBSiMT) framework, which uses model confidence to perceive hallucination tokens and mitigates their negative impact with weighted prefix-to-prefix training. |
MENGGE LIU et. al. | arxiv-cs.CL | 2023-11-06 |
145 | Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture Transcripts Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To create the parallel corpora, we propose a dynamic programming based sentence alignment algorithm which leverages the cosine similarity of machine-translated sentences. |
Haiyue Song; Raj Dabre; Chenhui Chu; Atsushi Fujita; Sadao Kurohashi; | arxiv-cs.CL | 2023-11-06 |
146 | Replicable Benchmarking of Neural Machine Translation (NMT) on Low-Resource Local Languages in Indonesia Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Neural machine translation (NMT) for low-resource local languages in Indonesia faces significant challenges, including the need for a representative benchmark and limited data availability. This work addresses these challenges by comprehensively analyzing training NMT systems for four low-resource local languages in Indonesia: Javanese, Sundanese, Minangkabau, and Balinese. |
Lucky Susanto; Ryandito Diandaru; Adila Krisnadhi; Ayu Purwarianti; Derry Wijaya; | arxiv-cs.CL | 2023-11-02 |
147 | Robustness Tests for Automatic Machine Translation Metrics with Adversarial Attacks Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We investigate MT evaluation metric performance on adversarially-synthesized texts, to shed light on metric robustness. |
Yichen Huang; Timothy Baldwin; | arxiv-cs.CL | 2023-11-01 |
148 | Is Robustness Transferable Across Languages in Multilingual Neural Machine Translation? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we investigate the transferability of robustness across different languages in multilingual neural machine translation. |
Leiyu Pan; Deyi Xiong; | arxiv-cs.AI | 2023-10-31 |
149 | Towards A Deep Understanding of Multilingual End-to-End Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we employ Singular Value Canonical Correlation Analysis (SVCCA) to analyze representations learnt in a multilingual end-to-end speech translation model trained over 22 languages. |
Haoran Sun; Xiaohu Zhao; Yikun Lei; Shaolin Zhu; Deyi Xiong; | arxiv-cs.CL | 2023-10-31 |
150 | Cultural Adaptation of Recipes Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a new task involving the translation and cultural adaptation of recipes between Chinese and English-speaking cuisines. |
YONG CAO et. al. | arxiv-cs.CL | 2023-10-26 |
151 | SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, some of its key hurdles include domain generalisation, which is the ability to adapt to previously unseen databases, and alignment of natural language questions with the corresponding SQL queries. To overcome these challenges, we introduce SQLformer, a novel Transformer architecture specifically crafted to perform text-to-SQL translation tasks. |
Adrián Bazaga; Pietro Liò; Gos Micklem; | arxiv-cs.CL | 2023-10-26 |
152 | Incorporating Probing Signals Into Multimodal Machine Translation Via Visual Question-Answering Pairs Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents an in-depth study of multimodal machine translation (MMT), examining the prevailing understanding that MMT systems exhibit decreased sensitivity to visual information when text inputs are complete. |
YUXIN ZUO et. al. | arxiv-cs.CL | 2023-10-26 |
153 | DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Towards the goal of multilingual disfluency correction, we present a high-quality human-annotated DC corpus covering four important Indo-European languages: English, Hindi, German and French. |
Vineet Bhat; Preethi Jyothi; Pushpak Bhattacharyya; | arxiv-cs.CL | 2023-10-25 |
154 | CUNI Submission to MRL 2023 Shared Task on Multi-lingual Multi-task Information Retrieval Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To keep the inferred tags on the correct positions in the original language, we propose a method based on scoring the candidate positions using a label-sensitive translation model. |
Jindřich Helcl; Jindřich Libovický; | arxiv-cs.CL | 2023-10-25 |
155 | Machine Translation for Nko: Tools, Corpora and Baseline Results Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Currently, there is no usable machine translation system for Nko, a language spoken by tens of millions of people across multiple West African countries, which holds significant cultural and educational value. To address this issue, we present a set of tools, resources, and baseline results aimed towards the development of usable machine translation systems for Nko and other languages that do not currently have sufficiently large parallel text corpora available. |
MOUSSA KOULAKO BALA DOUMBOUYA et. al. | arxiv-cs.CL | 2023-10-24 |
156 | ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present ComSL, a speech-language model built atop a composite architecture of public pre-trained speech-only and language-only models and optimized data-efficiently for spoken language tasks. |
CHENYANG LE et. al. | nips | 2023-10-24 |
157 | Dissecting In-Context Learning of Translations in GPTs Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we try to better understand the role of demonstration attributes for the in-context learning of translations through perturbations of high-quality, in-domain demonstrations. |
Vikas Raunak; Hany Hassan Awadalla; Arul Menezes; | arxiv-cs.CL | 2023-10-24 |
158 | Extremal Domain Translation with Neural Optimal Transport Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose the extremal transport (ET) which is a mathematical formalization of the theoretically best possible unpaired translation between a pair of domains w.r.t. the given similarity function. |
Milena Gazdieva; Alexander Korotin; Daniil Selikhanovych; Evgeny Burnaev; | nips | 2023-10-24 |
159 | Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Surprisingly, our initial experiments find that fine-tuning for translation purposes even led to performance degradation. To overcome this, we propose an alternative approach: adapting LLM’s as Automatic Post-Editors (APE) rather than direct translators. |
Sai Koneru; Miriam Exel; Matthias Huck; Jan Niehues; | arxiv-cs.CL | 2023-10-23 |
160 | Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we compare three popular approaches: lexical replacements, linguistic theories, and back-translation (BT), in the context of Egyptian Arabic-English CSW. |
Injy Hamed; Nizar Habash; Ngoc Thang Vu; | arxiv-cs.CL | 2023-10-23 |
161 | Domain Terminology Integration Into Machine Translation: Leveraging Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper discusses the methods that we used for our submissions to the WMT 2023 Terminology Shared Task for German-to-English (DE-EN), English-to-Czech (EN-CS), and Chinese-to-English (ZH-EN) language pairs. |
D. Kelleher; | arxiv-cs.CL | 2023-10-22 |
162 | Boosting Unsupervised Machine Translation with Pseudo-Parallel Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a training strategy that relies on pseudo-parallel sentence pairs mined from monolingual corpora in addition to synthetic sentence pairs back-translated from monolingual corpora. |
Ivana Kvapilíková; Ondřej Bojar; | arxiv-cs.CL | 2023-10-22 |
163 | Code-Switching with Word Senses for Pretraining in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce Word Sense Pretraining for Neural Machine Translation (WSP-NMT) – an end-to-end approach for pretraining multilingual NMT models leveraging word sense-specific information from Knowledge Bases. |
Vivek Iyer; Edoardo Barba; Alexandra Birch; Jeff Z. Pan; Roberto Navigli; | arxiv-cs.CL | 2023-10-21 |
164 | Towards General Error Diagnosis Via Behavioral Testing in Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In order to diagnose general errors, this paper proposes a new Bilingual Translation Pair Generation based Behavior Testing (BTPGBT) framework for conducting behavioral testing of MT systems. |
Junjie Wu; Lemao Liu; Dit-Yan Yeung; | arxiv-cs.CL | 2023-10-20 |
165 | CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work introduces CAPIVARA, a cost-efficient framework designed to enhance the performance of multilingual CLIP models in low-resource languages. |
GABRIEL OLIVEIRA DOS SANTOS et. al. | arxiv-cs.LG | 2023-10-20 |
166 | A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In MT, this might lead to misgendered translations, resulting, among other harms, in the perpetuation of stereotypes and prejudices. In this work, we address this gap by investigating whether and to what extent such models exhibit gender bias in machine translation and how we can mitigate it. |
Giuseppe Attanasio; Flor Miriam Plaza-del-Arco; Debora Nozza; Anne Lauscher; | arxiv-cs.CL | 2023-10-18 |
167 | Direct Neural Machine Translation with Task-level Mixture of Experts Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we examine Task-level MoE’s applicability in direct NMT and propose a series of high-performing training and evaluation configurations, through which Task-level MoE-based direct NMT systems outperform bilingual and pivot-based models for a large number of low and high-resource direct pairs, and translation directions. |
Isidora Chara Tourni; Subhajit Naskar; | arxiv-cs.CL | 2023-10-18 |
168 | knn-seq: Efficient, Extensible kNN-MT Framework Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present an efficient and extensible kNN-MT framework, knn-seq, for researchers and developers that is carefully designed to run efficiently, even with a billion-scale large datastore. |
HIROYUKI DEGUCHI et. al. | arxiv-cs.CL | 2023-10-18 |
169 | An Empirical Study of Translation Hypothesis Ensembling with Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. |
António Farinhas; José G. C. de Souza; André F. T. Martins; | arxiv-cs.CL | 2023-10-17 |
170 | Long-form Simultaneous Speech Translation: Thesis Proposal Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This thesis proposal addresses end-to-end simultaneous speech translation, particularly in the long-form setting, i.e., without pre-segmentation. We present a survey of the latest advancements in E2E SST, assess the primary obstacles in SST and its relevance to long-form scenarios, and suggest approaches to tackle these challenges. |
Peter Polák; | arxiv-cs.CL | 2023-10-17 |
171 | Exploring Automatic Evaluation Methods Based on A Decoder-based LLM for Text Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper compares various methods, including tuning with encoder-based models and large language models under equal conditions, on two different tasks, machine translation evaluation and semantic textual similarity, in two languages, Japanese and English. |
Tomohito Kasahara; Daisuke Kawahara; | arxiv-cs.CL | 2023-10-17 |
172 | UvA-MT’s Participation in The WMT23 General Translation Shared Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the UvA-MT’s submission to the WMT 2023 shared task on general machine translation. |
Di Wu; Shaomu Tan; David Stap; Ali Araabi; Christof Monz; | arxiv-cs.CL | 2023-10-15 |
173 | Improving Access to Justice for The Indian Population: A Benchmark for Evaluating Translation of Legal Text to Indian Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we construct the first high-quality legal parallel corpus containing aligned text units in English and nine Indian languages, that includes several low-resource languages. |
Sayan Mahapatra; Debtanu Datta; Shubham Soni; Adrijit Goswami; Saptarshi Ghosh; | arxiv-cs.CL | 2023-10-15 |
174 | Human-in-the-loop Machine Translation with Large Language Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this study, we propose a human-in-the-loop pipeline that guides LLMs to produce customized outputs with revision instructions. |
Xinyi Yang; Runzhe Zhan; Derek F. Wong; Junchao Wu; Lidia S. Chao; | arxiv-cs.CL | 2023-10-13 |
175 | Political Claim Identification and Categorization in A Multilingual Setting: First Experiments Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper explores different strategies for the cross-lingual projection of political claims analysis. |
Urs Zaberer; Sebastian Padó; Gabriella Lapesa; | arxiv-cs.CL | 2023-10-13 |
176 | xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To address the issue, we introduce xDial-Eval, built on top of open-source English dialogue evaluation datasets. |
CHEN ZHANG et. al. | arxiv-cs.CL | 2023-10-13 |
177 | Enhancing Expressivity Transfer in Textless Speech-to-speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Expressivity plays a vital role in conveying emotions, nuances, and cultural subtleties, thereby enhancing communication across diverse languages. To address this issue this study presents a novel method that operates at the discrete speech unit level and leverages multilingual emotion embeddings to capture language-agnostic information. |
Jarod Duret; Benjamin O’Brien; Yannick Estève; Titouan Parcollet; | arxiv-cs.SD | 2023-10-11 |
178 | Larth: Dataset and Machine Translation for Etruscan Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To the best of our knowledge, there are no publicly available Etruscan corpora for natural language processing. Therefore, we propose a dataset for machine translation from Etruscan to English, which contains 2891 translated examples from existing academic sources. |
Gianluca Vico; Gerasimos Spanakis; | arxiv-cs.CL | 2023-10-09 |
179 | Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Alternatively, we leverage a large language model to refine a hypothesis by providing it with terminology constraints. |
Nikolay Bogoychev; Pinzhen Chen; | arxiv-cs.CL | 2023-10-09 |
180 | Synslator: An Interactive Machine Translation Tool with Online Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces Synslator, a user-friendly computer-aided translation (CAT) tool that not only supports IMT, but is adept at online learning with real-time translation memories. |
JIAYI WANG et. al. | arxiv-cs.CL | 2023-10-08 |
181 | CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We develop multilingual modeling approaches for code translation and demonstrate their great potential in improving the translation quality of both low-resource and high-resource language pairs and boosting the training efficiency. |
Weixiang Yan; Yuchen Tian; Yunzhe Li; Qian Chen; Wen Wang; | arxiv-cs.AI | 2023-10-07 |
182 | Tuning Large Language Model for End-to-end Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces LST, a Large multimodal model designed to excel at the E2E-ST task. |
HAO ZHANG et. al. | arxiv-cs.CL | 2023-10-03 |
183 | Evaluation of Cross-Lingual Bug Localization: Two Industrial Cases Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study reports the results of applying the cross-lingual bug localization approach proposed by Xia et al. to industrial software projects. |
Shinpei Hayashi; Takashi Kobayashi; Tadahisa Kato; | arxiv-cs.SE | 2023-10-03 |
184 | Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To understand when and why the navigation capabilities of language IDs are weakened, we compare two extreme decoder input cases in the ZST directions: Off-Target (OFF) and On-Target (ON) cases. |
CHANGTONG ZAN et. al. | arxiv-cs.CL | 2023-09-28 |
185 | Cross-Modal Multi-Tasking for Speech-to-Text Translation Via Hard Parameter Sharing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we instead propose a ST/MT multi-tasking framework with hard parameter sharing in which all model parameters are shared cross-modally. |
Brian Yan; Xuankai Chang; Antonios Anastasopoulos; Yuya Fujita; Shinji Watanabe; | arxiv-cs.CL | 2023-09-27 |
186 | Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes the FBK’s participation in the Simultaneous Translation and Automatic Subtitling tracks of the IWSLT 2023 Evaluation Campaign. |
Sara Papi; Marco Gaido; Matteo Negri; | arxiv-cs.CL | 2023-09-27 |
187 | MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Nonetheless, visual speech is not as distinguishable as audio speech, making it difficult to develop a mapping from source speech phonemes to the target language text. To address this issue, we propose MixSpeech, a cross-modality self-learning framework that utilizes audio speech to regularize the training of visual speech tasks. |
XIZE CHENG et. al. | iccv | 2023-09-27 |
188 | CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, these are not directly applicable to MMT since they do not provide aligned multimodal multilingual features for generative tasks. To alleviate this issue, instead of designing complex modules for MMT, we propose CLIPTrans, which simply adapts the independently pre-trained multimodal M-CLIP and the multilingual mBART. |
DEVAANSH GUPTA et. al. | iccv | 2023-09-27 |
189 | Segmentation-Free Streaming Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a Segmentation-Free framework that enables the model to translate an unsegmented source stream by delaying the segmentation decision until the translation has been generated. |
Javier Iranzo-Sánchez; Jorge Iranzo-Sánchez; Adrià Giménez; Jorge Civera; Alfons Juan; | arxiv-cs.CL | 2023-09-26 |
190 | Hindi to English: Transformer-Based Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we have developed a Neural Machine Translation (NMT) system by training the Transformer model to translate texts from Indian Language Hindi to English. |
Kavit Gangar; Hardik Ruparel; Shreyas Lele; | arxiv-cs.CL | 2023-09-22 |
191 | Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we developed carefully a parallel corpus for Arabic-English (AR-EN) translation in the financial domain for benchmarking different domain adaptation methods. |
Emad A. Alghamdi; Jezia Zakraoui; Fares A. Abanmy; | arxiv-cs.CL | 2023-09-22 |
192 | Audience-specific Explanations for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we explore techniques to extract example explanations from a parallel corpus. |
Renhan Lou; Jan Niehues; | arxiv-cs.CL | 2023-09-22 |
193 | OSN-MDAD: Machine Translation Dataset for Arabic Multi-Dialectal Conversations on Online Social Media Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While few attempts have been made to build translation datasets for dialectal Arabic, they are domain dependent and are not OSN cultural-language friendly. In this work, we attempt to alleviate these limitations by proposing an online social network-based multidialect Arabic dataset that is crafted by contextually translating English tweets into four Arabic dialects: Gulf, Yemeni, Iraqi, and Levantine. |
Fatimah Alzamzami; Abdulmotaleb El Saddik; | arxiv-cs.CL | 2023-09-21 |
194 | SignBank+: Preparing A Multilingual Sign Language Dataset for Machine Translation Using Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce SignBank+, a clean version of the SignBank dataset, optimized for machine translation between spoken language text and SignWriting, a phonetic sign language writing system. |
Amit Moryossef; Zifan Jiang; | arxiv-cs.CL | 2023-09-20 |
195 | A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this study, we propose a novel fine-tuning approach for LLMs that is specifically designed for the translation task, eliminating the need for the abundant parallel data that traditional translation models usually depend on. |
Haoran Xu; Young Jin Kim; Amr Sharaf; Hany Hassan Awadalla; | arxiv-cs.CL | 2023-09-20 |
196 | SpeechAlign: A Framework for Speech Translation Alignment Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Speech-to-Speech and Speech-to-Text translation are currently dynamic areas of research. To contribute to these fields, we present SpeechAlign, a framework to evaluate the underexplored field of source-target alignment in speech models. |
Belen Alastruey; Aleix Sant; Gerard I. Gállego; David Dale; Marta R. Costa-jussà; | arxiv-cs.CL | 2023-09-20 |
197 | NSOAMT — New Search Only Approach to Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The idea is to develop a solution that, by indexing an incremental set of words that combine a certain semantic meaning, makes it possible to create a process of correspondence between their native language record and the language of translation. |
João Luís; Diogo Cardoso; José Marques; Luís Campos; | arxiv-cs.CL | 2023-09-19 |
198 | LoGenText-Plus: Improving Neural Machine Translation-based Logging Texts Generation with Syntactic Templates Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Developers insert logging statements in the source code to collect important runtime information about software systems. The textual descriptions in logging statements (i.e., … |
Zishuo Ding; Yiming Tang; Xiaoyu Cheng; Heng Li; Weiyi Shang; | ACM Transactions on Software Engineering and Methodology | 2023-09-18 |
199 | Neural Machine Translation Models Can Learn to Be Few-shot Learners Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we show that a much smaller model can be trained to perform ICL by fine-tuning towards a specialized training objective, exemplified on the task of domain adaptation for neural machine translation. |
Raphael Reinauer; Patrick Simianer; Kaden Uhlig; Johannes E. M. Mosig; Joern Wuebker; | arxiv-cs.CL | 2023-09-15 |
200 | Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Hallucinations and off-target translation remain unsolved problems in MT, especially for low-resource languages and massively multilingual models. In this paper, we introduce two related methods to mitigate these failure cases with a modified decoding objective, without either requiring retraining or external models. |
Rico Sennrich; Jannis Vamvas; Alireza Mohammadshahi; | arxiv-cs.CL | 2023-09-13 |
201 | Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Improperly assuming the pseudo-parallel data are correctly correlated will make the networks overfit to the noisy correspondence. Therefore, we propose Dual-view Curricular Optimal Transport (DCOT) to learn with noisy correspondence in CCR. |
YABING WANG et. al. | arxiv-cs.CV | 2023-09-11 |
202 | The Effect of Alignment Objectives on Code-Switching Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we are proposing a way of training a single machine translation model that is able to translate monolingual sentences from one language to another, along with translating code-switched sentences to either language. |
Mohamed Anwar; | arxiv-cs.CL | 2023-09-10 |
203 | Advancing Text-to-GLOSS Neural Translation Using A Novel Hyper-parameter Optimization Technique Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we investigate the use of transformers for Neural Machine Translation of text-to-GLOSS for Deaf and Hard-of-Hearing communication. |
Younes Ouargani; Noussaima El Khattabi; | arxiv-cs.CL | 2023-09-05 |
204 | Epi-Curriculum: Episodic Curriculum Learning for Low-Resource Domain Adaptation in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a novel approach Epi-Curriculum to address low-resource domain adaptation (DA), which contains a new episodic training framework along with denoised curriculum learning. |
Keyu Chen; Di Zhuang; Mingchen Li; J. Morris Chang; | arxiv-cs.LG | 2023-09-05 |
205 | Impact of Visual Context on Noisy Multimodal NMT: An Empirical Study for English to Indian Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The study investigates the effectiveness of utilizing multimodal information in Neural Machine Translation (NMT). |
Baban Gain; Dibyanayan Bandyopadhyay; Samrat Mukherjee; Chandranath Adak; Asif Ekbal; | arxiv-cs.CL | 2023-08-30 |
206 | A Classification-Guided Approach for Adversarial Attacks Against Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce ACT, a novel adversarial attack framework against NMT systems guided by a classifier. |
Sahar Sadrizadeh; Ljiljana Dolamic; Pascal Frossard; | arxiv-cs.CL | 2023-08-29 |
207 | An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we conduct empirical studies on intra-modal and cross-modal consistency and propose two training strategies, SimRegCR and SimZeroCR, for E2E ST in regular and zero-shot scenarios. |
Pengzhi Gao; Ruiqing Zhang; Zhongjun He; Hua Wu; Haifeng Wang; | arxiv-cs.CL | 2023-08-28 |
208 | Training and Meta-Evaluating Machine Translation Evaluation Metrics at The Paragraph Level Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: As research on machine translation moves to translating text beyond the sentence level, it remains unclear how effective automatic evaluation metrics are at scoring longer … |
Daniel Deutsch; Juraj Juraska; Mara Finkelstein; Markus Freitag; | arxiv-cs.CL | 2023-08-25 |
209 | Improving Translation Faithfulness of Large Language Models Via Augmenting Instructions Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Large Language Models (LLMs) present strong general capabilities, and a current compelling challenge is stimulating their specialized capabilities, such as machine translation, through low-cost instruction tuning. |
YIJIE CHEN et. al. | arxiv-cs.CL | 2023-08-24 |
210 | SONAR: Sentence-Level Multimodal and Language-Agnostic Representations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce SONAR, a new multilingual and multimodal fixed-size sentence embedding space. |
Paul-Ambroise Duquenne; Holger Schwenk; Benoît Sagot; | arxiv-cs.CL | 2023-08-22 |
211 | SeamlessM4T: Massively Multilingual & Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: More specifically, conventional speech-to-speech translation systems rely on cascaded systems that perform translation progressively, putting high-performing unified systems out of reach. To address these gaps, we introduce SeamlessM4T, a single model that supports speech-to-speech translation, speech-to-text translation, text-to-speech translation, text-to-text translation, and automatic speech recognition for up to 100 languages. |
SEAMLESS COMMUNICATION et. al. | arxiv-cs.CL | 2023-08-22 |
212 | An Effective Method Using Phrase Mechanism in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we report an effective method using a phrase mechanism, PhraseTransformer, to improve the strong baseline model Transformer in constructing a Neural Machine Translation (NMT) system for parallel corpora Vietnamese-Chinese. |
Phuong Minh Nguyen; Le Minh Nguyen; | arxiv-cs.CL | 2023-08-21 |
213 | Factuality Detection Using Machine Translation — A Use Case for German Clinical Text Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In the context of factuality detection, this work presents a simple solution using machine translation to translate English data to German to train a transformer-based factuality detection model. |
Mohammed Bin Sumait; Aleksandra Gabryszak; Leonhard Hennig; Roland Roller; | arxiv-cs.CL | 2023-08-17 |
214 | Fast Training of NMT Model with Data Sorting Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: One potential area for improvement is to address the computation of empty tokens that the Transformer computes only to discard them later, leading to an unnecessary computational burden. To tackle this, we propose an algorithm that sorts translation sentence pairs based on their length before batching, minimizing the waste of computing power. |
Daniela N. Rim; Kimera Richard; Heeyoul Choi; | arxiv-cs.CL | 2023-08-16 |
215 | VBD-MT Chinese-Vietnamese Translation Systems for VLSP 2022 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present our systems participated in the VLSP 2022 machine translation shared task. |
Hai Long Trieu; Song Kiet Bui; Tan Minh Tran; Van Khanh Tran; Hai An Nguyen; | arxiv-cs.CL | 2023-08-15 |
216 | Extrapolating Large Language Models to Non-English By Aligning Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we empower pre-trained LLMs on non-English languages by building semantic alignment across languages. |
WENHAO ZHU et. al. | arxiv-cs.CL | 2023-08-09 |
217 | Evaluating and Optimizing The Effectiveness of Neural Machine Translation in Supporting Code Retrieval Models: A Study on The CAT Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we analyze the performance of NMT in natural language-to-code translation in the newly curated CAT benchmark that includes the optimized versions of three Java datasets TLCodeSum, CodeSearchNet, Funcom, and a Python dataset PCSD. |
Hung Phan; Ali Jannesari; | arxiv-cs.SE | 2023-08-09 |
218 | Character-level NMT and Language Similarity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We evaluate the models using automatic MT metrics and show that translation between similar languages benefits from character-level input segmentation, while for less related languages, character-level vanilla Transformer-base often lags behind subword-level segmentation. |
Josef Jon; Ondřej Bojar; | arxiv-cs.CL | 2023-08-08 |
219 | Negative Lexical Constraints in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We compared various methods based on modifying either the decoding process or the training data. |
Josef Jon; Dušan Variš; Michal Novák; João Paulo Aires; Ondřej Bojar; | arxiv-cs.CL | 2023-08-07 |
220 | Do Multilingual Language Models Think Better in English? Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce a new approach called self-translate, which overcomes the need of an external translation system by leveraging the few-shot translation capabilities of multilingual language models. |
Julen Etxaniz; Gorka Azkune; Aitor Soroa; Oier Lopez de Lacalle; Mikel Artetxe; | arxiv-cs.CL | 2023-08-02 |
221 | Optimizing Machine Translation Through Prompt Engineering: An Investigation Into ChatGPT’s Customizability Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper explores the influence of integrating the purpose of the translation and the target audience into prompts on the quality of translations produced by ChatGPT. |
Masaru Yamada; | arxiv-cs.CL | 2023-08-02 |
222 | Predictive Data Analytics with AI: Assessing The Need for Post-editing of MT Output By Fine-tuning OpenAI LLMs Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We take OpenAI models as the best state-of-the-art technology and approach TQE as a binary classification task. |
Serge Gladkoff; Gleb Erofeev; Irina Sorokina; Lifeng Han; Goran Nenadic; | arxiv-cs.CL | 2023-07-31 |
223 | Toward Quantum Machine Translation of Syntactically Distinct Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The present study aims to explore the feasibility of language translation using quantum natural language processing algorithms on noisy intermediate-scale quantum (NISQ) devices. |
Mina Abbaszade; Mariam Zomorodi; Vahid Salari; Philip Kurian; | arxiv-cs.CL | 2023-07-31 |
224 | Structural Transfer Learning in NL-to-Bash Semantic Parsers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a methodology for obtaining a quantitative understanding of structural overlap between machine translation tasks. |
Kyle Duffy; Satwik Bhattamishra; Phil Blunsom; | arxiv-cs.CL | 2023-07-31 |
225 | Multilingual Lexical Simplification Via Paraphrase Generation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel multilingual LS method via paraphrase generation, as paraphrases provide diversity in word selection while preserving the sentence’s meaning. |
KANG LIU et. al. | arxiv-cs.CL | 2023-07-27 |
226 | XDLM: Cross-lingual Diffusion Language Model for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Additionally, while pretraining with diffusion models has been studied within a single language, the potential of cross-lingual pretraining remains understudied. To address these gaps, we propose XDLM, a novel Cross-lingual diffusion model for machine translation, consisting of pretraining and fine-tuning stages. |
Linyao Chen; Aosong Feng; Boming Yang; Zihui Li; | arxiv-cs.CL | 2023-07-25 |
227 | Joint Dropout: Improving Generalizability in Low-Resource Neural Machine Translation Through Phrase Pair Variables Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a method called Joint Dropout, that addresses the challenge of low-resource neural machine translation by substituting phrases with variables, resulting in significant enhancement of compositionality, which is a key aspect of generalization. |
Ali Araabi; Vlad Niculae; Christof Monz; | arxiv-cs.CL | 2023-07-24 |
228 | Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a comprehensive study on the robustness of current text adversarial attacks to round-trip translation. |
Neel Bhandari; Pin-Yu Chen; | arxiv-cs.CL | 2023-07-24 |
229 | Incorporating Human Translator Style Into English-Turkish Literary Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we focus on English-Turkish literary translation and develop machine translation models that take into account the stylistic features of translators. |
ZEYNEP YIRMIBEŞOĞLU et. al. | arxiv-cs.CL | 2023-07-21 |
230 | Improving End-to-End Speech Translation By Imitation-Based Knowledge Distillation with Synthetic Transcripts Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present an imitation learning approach where a teacher NMT system corrects the errors of an AST student without relying on manual transcripts. |
Rebekka Hubert; Artem Sokolov; Stefan Riezler; | arxiv-cs.CL | 2023-07-17 |
231 | A Neural-Symbolic Approach Towards Identifying Grammatically Correct Sentences Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite the importance of having access to well-written sentences, figuring out ways to validate them is still an open area of research. To address this problem, we present a simplified way to validate English sentences through a novel neural-symbolic approach. |
Nicos Isaak; | arxiv-cs.CL | 2023-07-16 |
232 | Data Augmentation for Machine Translation Via Dependency Subtree Swapping Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a generic framework for data augmentation via dependency subtree swapping that is applicable to machine translation. |
Attila Nagy; Dorina Petra Lakatos; Botond Barta; Patrick Nanys; Judit Ács; | arxiv-cs.CL | 2023-07-13 |
233 | Neural Machine Translation Data Generation and Augmentation Using ChatGPT Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We investigate an alternative to manual parallel corpora – hallucinated parallel corpora created by generative language models. |
Wayne Yang; Garrett Nicolai; | arxiv-cs.CL | 2023-07-11 |
234 | The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the NPU-MSXF system for the IWSLT 2023 speech-to-speech translation (S2ST) task which aims to translate from English speech of multi-source to Chinese speech. |
KUN SONG et. al. | arxiv-cs.SD | 2023-07-10 |
235 | Learning Optimal Policy for Simultaneous Machine Translation Via Binary Search Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a new method for constructing the optimal policy online via binary search. |
Shoutao Guo; Shaolei Zhang; Yang Feng; | acl | 2023-07-08 |
236 | Bring More Attention to Syntactic Symmetry for Automatic Postediting of High-Quality Machine Translations Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Thus, we propose a linguistically motivated method of regularization that is expected to enhance APE models’ understanding of the target language: a loss function that encourages symmetric self-attention on the given MT. Our analysis of experimental results demonstrates that the proposed method helps improving the state-of-the-art architecture’s APE quality for high-quality MTs. |
Baikjin Jung; Myungji Lee; Jong-Hyeok Lee; Yunsu Kim; | acl | 2023-07-08 |
237 | Simple and Effective Unsupervised Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The amount of labeled data to train models for speech tasks is limited for most languages, however, the data scarcity is exacerbated for speech translation which requires labeled data covering two different languages. To address this issue, we study a simple and effective approach to build speech translation systems without labeled data by leveraging recent advances in unsupervised speech recognition, machine translation and speech synthesis, either in a pipeline approach, or to generate pseudo-labels for training end-to-end speech translation models. |
CHANGHAN WANG et. al. | acl | 2023-07-08 |
238 | MCLIP: Multilingual CLIP Via Cross-lingual Transfer Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce mCLIP, a retrieval-efficient dual-stream multilingual VLP model, trained by aligning the CLIP model and a Multilingual Text Encoder (MTE) through a novel Triangle Cross-modal Knowledge Distillation (TriKD) method. |
GUANHUA CHEN et. al. | acl | 2023-07-08 |
239 | Exploring Better Text Image Translation with Multimodal Codebook Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we first annotate a Chinese-English TIT dataset named OCRMT30K, providing convenience for subsequent studies. |
ZHIBIN LAN et. al. | acl | 2023-07-08 |
240 | A Holistic Approach to Reference-Free Evaluation of Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a reference-free evaluation approach that characterizes evaluation as two aspects: (1) fluency: how well the translated text conforms to normal human language usage; (2) faithfulness: how well the translated text reflects the source data. |
Hanming Wu; Wenjuan Han; Hui Di; Yufeng Chen; Jinan Xu; | acl | 2023-07-08 |
241 | Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Unpaired cross-lingual image captioning has long suffered from irrelevancy and disfluency issues, due to the inconsistencies of the semantic scene and syntax attributes during transfer. In this work, we propose to address the above problems by incorporating the scene graph (SG) structures and the syntactic constituency (SC) trees. |
Shengqiong Wu; Hao Fei; Wei Ji; Tat-Seng Chua; | acl | 2023-07-08 |
242 | Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Firstly, the current objective of KD spreads its focus to whole distributions to learn the knowledge, yet lacks special treatment on the most crucial top-1 information. Secondly, the knowledge is largely covered by the golden information due to the fact that most top-1 predictions of teachers overlap with ground-truth tokens, which further restricts the potential of KD. To address these issues, we propose a new method named Top-1 Information Enhanced Knowledge Distillation (TIE-KD). |
SONGMING ZHANG et. al. | acl | 2023-07-08 |
243 | Easy Guided Decoding in Providing Suggestions for Interactive Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we utilize the parameterized objective function of neural machine translation (NMT) and propose a novel constrained decoding algorithm, namely Prefix-Suffix Guided Decoding (PSGD), to deal with the TS problem without additional training. |
Ke Wang; Xin Ge; Jiayi Wang; Yuqi Zhang; Yu Zhao; | acl | 2023-07-08 |
244 | Exploiting Biased Models to De-bias Text: A Gender-Fair Rewriting Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To eliminate the rule-based nature of data creation, we instead propose using machine translation models to create gender-biased text from real gender-fair text via round-trip translation. |
Chantal Amrhein; Florian Schottmann; Rico Sennrich; Samuel Läubli; | acl | 2023-07-08 |
245 | Back Translation for Speech-to-text Translation Without Transcripts Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we aim to utilize large amounts of target-side monolingual data to enhance ST without transcripts. |
Qingkai Fang; Yang Feng; | acl | 2023-07-08 |
246 | What About “em”? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Wrong pronoun translations can discriminate against marginalized groups, e.g., non-binary individuals (Dev et al., 2021). In this “reality check”, we study how three commercial MT systems translate 3rd-person pronouns. |
Anne Lauscher; Debora Nozza; Ehm Miltersen; Archie Crowley; Dirk Hovy; | acl | 2023-07-08 |
247 | CMOT: Cross-modal Mixup Via Optimal Transport for Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose Cross-modal Mixup via Optimal Transport (CMOT) to overcome the modality gap. |
Yan Zhou; Qingkai Fang; Yang Feng; | acl | 2023-07-08 |
248 | Rethinking Multimodal Entity and Relation Extraction from A Translation Point of View Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We revisit the multimodal entity and relation extraction from a translation point of view. |
Changmeng Zheng; Junhao Feng; Yi Cai; Xiaoyong Wei; Qing Li; | acl | 2023-07-08 |
249 | Do GPTs Produce Less Literal Translations? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, there has been relatively little investigation on how such translations differ qualitatively from the translations generated by standard Neural Machine Translation (NMT) models. In this work, we investigate these differences in terms of the literalness of translations produced by the two systems. |
Vikas Raunak; Arul Menezes; Matt Post; Hany Hassan; | acl | 2023-07-08 |
250 | Translation-Enhanced Multilingual Text-to-Image Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We provide two key contributions. 1) Relying on a multilingual multi-modal encoder, we provide a systematic empirical study of standard methods used in cross-lingual NLP when applied to mTTI: Translate Train, Translate Test, and Zero-Shot Transfer. 2) We propose Ensemble Adapter (EnsAd), a novel parameter-efficient approach that learns to weigh and consolidate the multilingual text knowledge within the mTTI framework, mitigating the language gap and thus improving mTTI performance. |
Yaoyiran Li; Ching-Yun Chang; Stephen Rawls; Ivan Vulic; Anna Korhonen; | acl | 2023-07-08 |
251 | Searching for Needles in A Haystack: On The Role of Incidental Bilingualism in PaLM’s Translation Capability Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a mixed-method approach to measure and understand incidental bilingualism at scale. |
Eleftheria Briakou; Colin Cherry; George Foster; | acl | 2023-07-08 |
252 | Scene Graph As Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we investigate a more realistic unsupervised multimodal machine translation (UMMT) setup, inference-time image-free UMMT, where the model is trained with source-text image pairs, and tested with only source-text inputs. |
Hao Fei; Qian Liu; Meishan Zhang; Min Zhang; Tat-Seng Chua; | acl | 2023-07-08 |
253 | Understanding and Improving The Robustness of Terminology Constraints in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we study the robustness of two typical terminology translation methods: Placeholder (PH) and Code-Switch (CS), concerning (1) the number of constraints and (2) the target constraint length. |
HUAAO ZHANG et. al. | acl | 2023-07-08 |
254 | A Simple Concatenation Can Effectively Improve Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by the works of video Transformer, we propose a simple unified cross-modal ST method, which concatenates speech and text as the input, and builds a teacher that can utilize both cross-modal information simultaneously. |
Linlin Zhang; Kai Fan; Boxing Chen; Luo Si; | acl | 2023-07-08 |
255 | XPQA: Cross-Lingual Product Question Answering in 12 Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While existing work on PQA focuses mainly on English, in practice there is need to support multiple customer languages while leveraging product information available in English. To study this practical industrial task, we present xPQA, a large-scale annotated cross-lingual PQA dataset in 12 languages, and report results in (1) candidate ranking, to select the best English candidate containing the information to answer a non-English question; and (2) answer generation, to generate a natural-sounding non-English answer based on the selected English candidate. |
Xiaoyu Shen; Akari Asai; Bill Byrne; Adria De Gispert; | acl | 2023-07-08 |
256 | Subset Retrieval Nearest Neighbor Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose “Subset kNN-MT”, which improves the decoding speed of kNN-MT by two methods: (1) retrieving neighbor target tokens from a subset that is the set of neighbor sentences of the input sentence, not from all sentences, and (2) efficient distance computation technique that is suitable for subset neighbor search using a look-up table. |
HIROYUKI DEGUCHI et. al. | acl | 2023-07-08 |
257 | Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a new MMT approach based on a strong text-only MT model, which uses neural adapters, a novel guided self-attention mechanism and which is jointly trained on both visually-conditioned masking and MMT. |
Matthieu Futeral; Cordelia Schmid; Ivan Laptev; Benoît Sagot; Rachel Bawden; | acl | 2023-07-08 |
258 | Extrinsic Evaluation of Machine Translation Metrics Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we investigate how useful MT metrics are at detecting the segment-level quality by correlating metrics with how useful the translations are for downstream task. |
Nikita Moghe; Tom Sherborne; Mark Steedman; Alexandra Birch; | acl | 2023-07-08 |
259 | Prompting PaLM for Translation: Assessing Strategies and Performance IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We investigate various strategies for choosing translation examples for few-shot prompting, concluding that example quality is the most important factor. |
DAVID VILAR et. al. | acl | 2023-07-08 |
260 | Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a simple yet efficient approach to adapt VLP to unseen languages using MPLM. |
Yasmine Karoui; Rémi Lebret; Negar Foroutan Eghlidi; Karl Aberer; | acl | 2023-07-08 |
261 | Understanding and Bridging The Modality Gap for Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We find that the modality gap is relatively small during training except for some difficult cases, but keeps increasing during inference due to the cascading effect. To address these problems, we propose the Cross-modal Regularization with Scheduled Sampling (Cress) method. |
Qingkai Fang; Yang Feng; | acl | 2023-07-08 |
262 | On Evaluating Multilingual Compositional Generalization with Translated Datasets Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, we show that this entails critical semantic distortion. To address this limitation, we craft a faithful rule-based translation of the MCWQ dataset from English to Chinese and Japanese. |
Zi Wang; Daniel Hershcovich; | acl | 2023-07-08 |
263 | Multilingual Event Extraction from Historical Newspaper Adverts Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce a new multilingual dataset in English, French, and Dutch composed of newspaper ads from the early modern colonial period reporting on enslaved people who liberated themselves from enslavement. |
Nadav Borenstein; Natália da Silva Perez; Isabelle Augenstein; | acl | 2023-07-08 |
264 | RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While ACT has garnered attention in recent years due to its usefulness in real-world applications, progress in the task is currently limited by dataset availability, since most prior approaches rely on supervised methods. To address this limitation, we propose Retrieval and Attribute-Marking enhanced Prompting (RAMP), which leverages large multilingual language models to perform ACT in few-shot and zero-shot settings. |
GABRIELE SARTI et. al. | acl | 2023-07-08 |
265 | Ethical Considerations for Machine Translation of Indigenous Languages: Giving A Voice to The Speakers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The data collection, modeling and deploying machine translation systems thus result in new ethical questions that must be addressed. Motivated by this, we first survey the existing literature on ethical considerations for the documentation, translation, and general natural language processing for Indigenous languages. Afterward, we conduct and analyze an interview study to shed light on the positions of community leaders, teachers, and language activists regarding ethical concerns for the automatic translation of their languages. |
Manuel Mager; Elisabeth Mager; Katharina Kann; Ngoc Thang Vu; | acl | 2023-07-08 |
266 | Neural Machine Translation Methods for Translating Text to Sign Language Glosses Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In our experiments, we improve the performance of the transformer-based models via (1) data augmentation, (2) semi-supervised Neural Machine Translation (NMT), (3) transfer learning and (4) multilingual NMT. |
Dele Zhu; Vera Czehmann; Eleftherios Avramidis; | acl | 2023-07-08 |
267 | Learning Language-Specific Layers for Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce Language-Specific Transformer Layers (LSLs), which allow us to increase model capacity, while keeping the amount of computation and the number of parameters used in the forward pass constant. |
Telmo Pires; Robin Schmidt; Yi-Hsiu Liao; Stephan Peitz; | acl | 2023-07-08 |
268 | Binary and Ternary Natural Language Generation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We approach the problem with a mix of statistics-based quantization for the weights and elastic quantization of the activations and demonstrate the first ternary and binary transformer models on the downstream tasks of summarization and machine translation. |
Zechun Liu; Barlas Oguz; Aasish Pappu; Yangyang Shi; Raghuraman Krishnamoorthi; | acl | 2023-07-08 |
269 | Continual Knowledge Distillation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a method called continual knowledge distillation to take advantage of existing translation models to improve one model of interest. |
Yuanchi Zhang; Peng Li; Maosong Sun; Yang Liu; | acl | 2023-07-08 |
270 | Discourse-Centric Evaluation of Document-level Machine Translation with A New Densely Annotated Parallel Corpus of Novels Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Using these annotations, we systematically investigate the similarities and differences between the discourse structures of source and target languages, and the challenges they pose to MT. We discover that MT outputs differ fundamentally from human translations in terms of their latent discourse structures. This gives us a new perspective on the challenges and opportunities in document-level MT. We make our resource publicly available to spur future research in document-level MT and its generalization to other language translation tasks. |
YUCHEN ELEANOR JIANG et. al. | acl | 2023-07-08 |
271 | Considerations for Meaningful Sign Language Machine Translation Based on Glosses IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we review recent works on neural gloss translation. |
Mathias Müller; Zifan Jiang; Amit Moryossef; Annette Rios; Sarah Ebling; | acl | 2023-07-08 |
272 | PEIT: Bridging The Modality Gap with Pre-trained Models for End-to-End Image Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose PEIT, an end-to-end image translation framework that bridges the modality gap with pre-trained models. |
Shaolin Zhu; Shangjie Li; Yikun Lei; Deyi Xiong; | acl | 2023-07-08 |
273 | Multi-VALUE: A Framework for Cross-Dialectal English NLP Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a suite of resources for evaluating and achieving English dialect invariance. |
CALEB ZIEMS et. al. | acl | 2023-07-08 |
274 | Neural Machine Translation for Mathematical Formulae Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we perform the tasks of translating from LaTeX to Mathematica as well as from LaTeX to semantic LaTeX. |
Felix Petersen; Moritz Schubotz; Andre Greiner-Petter; Bela Gipp; | acl | 2023-07-08 |
275 | BIG-C: A Multimodal Multi-Purpose Dataset for Bemba Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present BIG-C (Bemba Image Grounded Conversations), a large multimodal dataset for Bemba. |
Claytone Sikasote; Eunice Mukonde; Md Mahfuz Ibn Alam; Antonios Anastasopoulos; | acl | 2023-07-08 |
276 | A Survey on Zero Pronoun Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This phenomenon has been studied extensively in machine translation (MT), as it poses a significant challenge for MT systems due to the difficulty in determining the correct antecedent for the pronoun. This survey paper highlights the major works that have been undertaken in zero pronoun translation (ZPT) after the neural revolution so that researchers can recognize the current state and future directions of this field. |
LONGYUE WANG et. al. | acl | 2023-07-08 |
277 | Songs Across Borders: Singable and Controllable Neural Lyric Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper bridges the singability quality gap by formalizing lyric translation into a constrained translation problem, converting theoretical guidance and practical techniques from translatology literature to prompt-driven NMT approaches, exploring better adaptation methods, and instantiating them to an English-Chinese lyric translation system. |
Longshen Ou; Xichu Ma; Min-Yen Kan; Ye Wang; | acl | 2023-07-08 |
278 | TeCS: A Dataset and Benchmark for Tense Consistency of Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a parallel tense test set containing 552 French-English utterances. |
Yiming Ai; Zhiwei He; Kai Yu; Rui Wang; | acl | 2023-07-08 |
279 | Text Style Transfer Back-Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For natural inputs, BT brings only slight improvements and sometimes even adverse effects. To address this issue, we propose Text Style Transfer Back Translation (TST BT), which uses a style transfer to modify the source side of BT data. |
DAIMENG WEI et. al. | acl | 2023-07-08 |
280 | INK: Injecting KNN Knowledge in Nearest Neighbor Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose an effective training framework INK to directly smooth the representation space via adjusting representations of kNN neighbors with a small number of new parameters. |
Wenhao Zhu; Jingjing Xu; Shujian Huang; Lingpeng Kong; Jiajun Chen; | acl | 2023-07-08 |
281 | Using Neural Machine Translation for Generating Diverse Challenging Exercises for Language Learner Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a novel approach to automatically generate distractors for cloze exercises for English language learners, using round-trip neural machine translation. |
Frank Palma Gomez; Subhadarshi Panda; Michael Flor; Alla Rozovskaya; | acl | 2023-07-08 |
282 | MultiTACRED: A Multilingual Version of The TAC Relation Extraction Dataset Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Relation extraction (RE) is a fundamental task in information extraction, whose extension to multilingual settings has been hindered by the lack of supervised resources comparable in size to large English datasets such as TACRED (Zhang et al. , 2017). To address this gap, we introduce the MultiTACRED dataset, covering 12 typologically diverse languages from 9 language families, which is created by machine-translating TACRED instances and automatically projecting their entity annotations. |
Leonhard Hennig; Philippe Thomas; Sebastian Möller; | acl | 2023-07-08 |
283 | To Be or Not to Be: A Translation Reception Study of A Literary Text Translated Into Dutch and Catalan Using Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This article presents the results of a study involving the reception of a fictional story by Kurt Vonnegut translated from English into Catalan and Dutch in three conditions: machine-translated (MT), post-edited (PE) and translated from scratch (HT). |
Ana Guerberof Arenas; Antonio Toral; | arxiv-cs.CL | 2023-07-05 |
284 | X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Task-oriented dialogue research has mainly focused on a few popular languages like English and Chinese, due to the high dataset creation cost for a new language. |
MEHRAD MORADSHAHI et. al. | arxiv-cs.CL | 2023-06-30 |
285 | Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a simple yet efficient approach to adapt VLP to unseen languages using MPLM. |
Yasmine Karoui; Rémi Lebret; Negar Foroutan; Karl Aberer; | arxiv-cs.CL | 2023-06-29 |
286 | The Unreasonable Effectiveness of Few-shot Learning for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show that with only 5 examples of high-quality translation data shown at inference, a transformer decoder-only model trained solely with self-supervised learning, is able to match specialized supervised state-of-the-art models as well as more general commercial translation systems. |
XAVIER GARCIA et. al. | icml | 2023-06-27 |
287 | Constructing Multilingual Code Search Dataset Using Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this research, we create a multilingual code search dataset in four natural and four programming languages using a neural machine translation model. |
Ryo Sekizawa; Nan Duan; Shuai Lu; Hitomi Yanaka; | arxiv-cs.CL | 2023-06-27 |
288 | Scaling Laws for Multilingual Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we provide a large-scale empirical study of the scaling properties of multilingual neural machine translation models. |
Patrick Fernandes; Behrooz Ghorbani; Xavier Garcia; Markus Freitag; Orhan Firat; | icml | 2023-06-27 |
289 | Quality Estimation of Machine Translated Texts Based on Direct Evidence from Training Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we show that the parallel corpus used as training data for training the MT system holds direct clues for estimating the quality of translations produced by the MT system. |
Vibhuti Kumari; Narayana Murthy Kavi; | arxiv-cs.CL | 2023-06-27 |
290 | Data-Driven Approach for Formality-Sensitive Machine Translation: Language-Specific Handling and Synthetic Data Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a data-driven approach for Formality-Sensitive Machine Translation (FSMT) that caters to the unique linguistic properties of four target languages. |
Seugnjun Lee; Hyeonseok Moon; Chanjun Park; Heuiseok Lim; | arxiv-cs.CL | 2023-06-26 |
291 | A Graph Fusion Approach for Cross-Lingual Machine Reading Comprehension Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel approach, which jointly models the cross-lingual alignment information and the mono-lingual syntax information using a graph. |
ZENAN XU et. al. | aaai | 2023-06-26 |
292 | Prompting Neural Machine Translation with Translation Memories Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a simple but effective method to introduce TMs into neural machine translation (NMT) systems. |
ABUDUREXITI REHEMAN et. al. | aaai | 2023-06-26 |
293 | EvolveMT: An Ensemble MT Engine Improving Itself with Usage Only Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents EvolveMT for efficiently combining multiple machine translation (MT) engines. |
Kamer Ali Yuksel; Ahmet Gunduz; Mohamed Al-Badrashiny; Shreyas Sharma; Hassan Sawaf; | arxiv-cs.CL | 2023-06-20 |
294 | Evaluation of Chinese-English Machine Translation of Emotion-Loaded Microblog Texts: A Human Annotated Dataset for The Quality Assessment of Emotion Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we focus on how current Machine Translation (MT) tools perform on the translation of emotion-loaded texts by evaluating outputs from Google Translate according to a framework proposed in this paper. |
Shenbin Qian; Constantin Orasan; Felix do Carmo; Qiuliang Li; Diptesh Kanojia; | arxiv-cs.CL | 2023-06-20 |
295 | BayLing: Bridging Cross-lingual Alignment and Instruction Following Through Interactive Translation for Large Language Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To minimize human workload, we propose to transfer the capabilities of language generation and instruction following from English to other languages through an interactive translation task. |
SHAOLEI ZHANG et. al. | arxiv-cs.CL | 2023-06-19 |
296 | Discourse Representation Structure Parsing for Chinese Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We describe the pipeline of automatically collecting the linearized Chinese meaning representation data for sequential-to sequential neural networks. |
Chunliu Wang; Xiao Zhang; Johan Bos; | arxiv-cs.CL | 2023-06-16 |
297 | Sheffield’s Submission to The AmericasNLP Shared Task on Machine Translation Into Indigenous Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper we describe the University of Sheffield’s submission to the AmericasNLP 2023 Shared Task on Machine Translation into Indigenous Languages which comprises the translation from Spanish to eleven indigenous languages. |
Edward Gow-Smith; Danae Sánchez Villegas; | arxiv-cs.CL | 2023-06-16 |
298 | Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce Babel-ImageNet, a massively multilingual benchmark that offers (partial) translations of 1000 ImageNet labels to 92 languages, built without resorting to machine translation (MT) or requiring manual annotation. |
Gregor Geigle; Radu Timofte; Goran Glavaš; | arxiv-cs.CL | 2023-06-14 |
299 | A Survey of Vision-Language Pre-training from The Lens of Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We summarize the common architectures, pre-training objectives, and datasets from literature and conjecture what further is needed to make progress on multimodal machine translation. |
Jeremy Gwinnup; Kevin Duh; | arxiv-cs.CL | 2023-06-12 |
300 | Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we investigate the impact of applying textual data augmentation tasks to low resource machine translation. |
Catherine Gitau; Vukosi Marivate; | arxiv-cs.CL | 2023-06-12 |
301 | Measuring Sentiment Bias in Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we explore how machine translation might introduce a bias in sentiments as classified by sentiment analysis models. |
KAI HARTUNG et. al. | arxiv-cs.CL | 2023-06-12 |
302 | Rethinking Translation Memory Augmented Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper rethinks translation memory augmented neural machine translation (TM-augmented NMT) from two perspectives, i.e., a probabilistic view of retrieval and the variance-bias … |
HONGKUN HAO et. al. | arxiv-cs.CL | 2023-06-12 |
303 | Assisting Language Learners: Automated Trans-Lingual Definition Generation Via Contrastive Prompt Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel task of Trans-Lingual Definition Generation (TLDG), which aims to generate definitions in another language, i.e., the native speaker’s language. |
HENGYUAN ZHANG et. al. | arxiv-cs.CL | 2023-06-09 |
304 | Good, But Not Always Fair: An Evaluation of Gender Bias for Three Commercial Machine Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Consequently, analyses have been redirected to more nuanced aspects, intricate phenomena, as well as potential risks that may arise from the widespread use of MT tools. Along this line, this paper offers a meticulous assessment of three commercial MT systems – Google Translate, DeepL, and Modern MT – with a specific focus on gender translation and bias. |
Silvia Alma Piazzolla; Beatrice Savoldi; Luisa Bentivogli; | arxiv-cs.CL | 2023-06-09 |
305 | Improving Language Model Integration for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Recently, some works on automatic speech recognition have demonstrated that, if the implicit language model is neutralized in decoding, further improvements can be gained when integrating an external language model. In this work, we transfer this concept to the task of machine translation and compare with the most prominent way of including additional monolingual data – namely back-translation. |
Christian Herold; Yingbo Gao; Mohammad Zeineldeen; Hermann Ney; | arxiv-cs.CL | 2023-06-08 |
306 | A Little Is Enough: Few-Shot Quality Estimation Based Corpus Filtering Improves Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: All the scripts and datasets utilized in this study will be publicly available. |
Akshay Batheja; Pushpak Bhattacharyya; | arxiv-cs.CL | 2023-06-06 |
307 | Extract and Attend: Improving Entity Translation in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: When we humans encounter an unknown entity during translation, we usually first look up in a dictionary and then organize the entity translation together with the translations of other parts to form a smooth target sentence. Inspired by this translation process, we propose an Extract-and-Attend approach to enhance entity translation in NMT, where the translation candidates of source entities are first extracted from a dictionary and then attended to by the NMT model to generate the target sentence. |
ZIXIN ZENG et. al. | arxiv-cs.CL | 2023-06-03 |
308 | Transformer: A General Framework from Machine Translation to Others Related Papers Related Patents Related Grants Related Venues Related Experts View |
Yang Zhao; Jiajun Zhang; Chengqing Zong; | Machine Intelligence Research | 2023-06-02 |
309 | Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the submission of the UPC Machine Translation group to the IWSLT 2023 Offline Speech Translation task. |
Ioannis Tsiamas; Gerard I. Gállego; José A. R. Fonollosa; Marta R. Costa-jussà; | arxiv-cs.CL | 2023-06-02 |
310 | Improving Polish to English Neural Machine Translation with Transfer Learning: Effects of Data Volume and Language Similarity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper investigates the impact of data volume and the use of similar languages on transfer learning in a machine translation task. |
Juuso Eronen; Michal Ptaszynski; Karol Nowakowski; Zheng Lin Chia; Fumito Masui; | arxiv-cs.CL | 2023-06-01 |
311 | Improved Cross-Lingual Transfer Learning For Automatic Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The goal of this work is to improve cross-lingual transfer learning in multilingual speech-to-text translation via semantic knowledge distillation. |
SAMEER KHURANA et. al. | arxiv-cs.CL | 2023-06-01 |
312 | How Does Pretraining Improve Discourse-Aware Translation? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, the underlying reasons for their strong performance have not been well explained. To bridge this gap, we introduce a probing task to interpret the ability of PLMs to capture discourse relation knowledge. |
Zhihong Huang; Longyue Wang; Siyou Liu; Derek F. Wong; | arxiv-cs.CL | 2023-05-31 |
313 | Automatic Discrimination of Human and Neural Machine Translation in Multilingual Scenarios Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We tackle the task of automatically discriminating between human and machine translations. |
Malina Chichirau; Rik van Noord; Antonio Toral; | arxiv-cs.CL | 2023-05-31 |
314 | Translation-Enhanced Multilingual Text-to-Image Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: 2) We propose Ensemble Adapter (EnsAd), a novel parameter-efficient approach that learns to weigh and consolidate the multilingual text knowledge within the mTTI framework, mitigating the language gap and thus improving mTTI performance. |
Yaoyiran Li; Ching-Yun Chang; Stephen Rawls; Ivan Vulić; Anna Korhonen; | arxiv-cs.CL | 2023-05-30 |
315 | A Corpus for Sentence-level Subjectivity Detection on English News Articles Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a novel corpus for subjectivity detection at the sentence level. |
FRANCESCO ANTICI et. al. | arxiv-cs.CL | 2023-05-29 |
316 | An Open-Source Gloss-Based Baseline for Spoken to Signed Language Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present an open-source implementation of a text-to-gloss-to-pose-to-video pipeline approach, demonstrating conversion from German to Swiss German Sign Language, French to French Sign Language of Switzerland, and Italian to Italian Sign Language of Switzerland. |
AMIT MORYOSSEF et. al. | arxiv-cs.CL | 2023-05-28 |
317 | Neural Machine Translation with Dynamic Graph Convolutional Decoder Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, most previous works merely focus on leveraging the source syntax in the well-known encoder-decoder framework. In sharp contrast, this paper proposes an end-to-end translation architecture from the (graph & sequence) structural inputs to the (graph & sequence) outputs, where the target translation and its corresponding syntactic graph are jointly modeled and generated. |
Lei Li; Kai Fan; Lingyu Yang; Hongjia Li; Chun Yuan; | arxiv-cs.CL | 2023-05-28 |
318 | HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents HaVQA, the first multimodal dataset for visual question-answering (VQA) tasks in the Hausa language. |
SHANTIPRIYA PARIDA et. al. | arxiv-cs.CL | 2023-05-28 |
319 | Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes CIC NLP’s submission to the AmericasNLP 2023 Shared Task on machine translation systems for indigenous languages of the Americas. |
ATNAFU LAMBEBO TONJA et. al. | arxiv-cs.CL | 2023-05-27 |
320 | Do GPTs Produce Less Literal Translations? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, there has been relatively little investigation on how such translations differ qualitatively from the translations generated by standard Neural Machine Translation (NMT) models. In this work, we investigate these differences in terms of the literalness of translations produced by the two systems. |
Vikas Raunak; Arul Menezes; Matt Post; Hany Hassan Awadalla; | arxiv-cs.CL | 2023-05-26 |
321 | Robustness of Multi-Source MT to Transcription Errors Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Automatic speech translation is sensitive to speech recognition errors, but in a multilingual scenario, the same content may be available in various languages via simultaneous interpreting, dubbing or subtitling. In this paper, we hypothesize that leveraging multiple sources will improve translation quality if the sources complement one another in terms of correct information they contain. |
Dominik Macháček; Peter Polák; Ondřej Bojar; Raj Dabre; | arxiv-cs.CL | 2023-05-26 |
322 | Disambiguated Lexically Constrained Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose disambiguated LCNMT (D-LCNMT) to solve the problem. |
JINPENG ZHANG et. al. | arxiv-cs.CL | 2023-05-26 |
323 | On The Copying Problem of Unsupervised NMT: A Training Schedule with A Language Discriminator Loss Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a simple but effective training schedule that incorporates a language discriminator loss. |
Yihong Liu; Alexandra Chronopoulou; Hinrich Schütze; Alexander Fraser; | arxiv-cs.CL | 2023-05-26 |
324 | CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation (NMT) systems exhibit limited robustness in handling source-side linguistic variations. Their performance tends to degrade when faced with even slight … |
Md Mahfuz Ibn Alam; Sina Ahmadi; Antonios Anastasopoulos; | arxiv-cs.CL | 2023-05-26 |
325 | Gender Lost In Translation: How Bridging The Gap Between Languages Affects Gender Bias in Zero-Shot Multilingual Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we explore how bridging the gap between languages for which parallel data is not available affects gender bias in multilingual NMT, specifically for zero-shot directions. |
Lena Cabrera; Jan Niehues; | arxiv-cs.CL | 2023-05-26 |
326 | What About Em? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Wrong pronoun translations can discriminate against marginalized groups, e.g., non-binary individuals (Dev et al., 2021). In this “reality check”, we study how three commercial MT systems translate 3rd-person pronouns. |
Anne Lauscher; Debora Nozza; Archie Crowley; Ehm Miltersen; Dirk Hovy; | arxiv-cs.CL | 2023-05-25 |
327 | MTCue: Learning Zero-Shot Control of Extra-Textual Attributes By Leveraging Unstructured Context in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work introduces MTCue, a novel neural machine translation (NMT) framework that interprets all context (including discrete variables) as text. |
Sebastian Vincent; Robert Flynn; Carolina Scarton; | arxiv-cs.CL | 2023-05-25 |
328 | Cross-Lingual Knowledge Distillation for Answer Sentence Selection in Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose Cross-Lingual Knowledge Distillation (CLKD) from a strong English AS2 teacher as a method to train AS2 models for low-resource languages in the tasks without the need of labeled data for the target language. |
Shivanshu Gupta; Yoshitomo Matsubara; Ankit Chadha; Alessandro Moschitti; | arxiv-cs.CL | 2023-05-25 |
329 | What About “em”? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: As 3rd-person pronoun usage shifts to include novel forms, e.g., neopronouns, we need more research on identity-inclusive NLP. Exclusion is particularly harmful in one of the most … |
Anne Lauscher; Debora Nozza; Archie Crowley; E. Miltersen; Dirk Hovy; | Annual Meeting of the Association for Computational … | 2023-05-25 |
330 | Eliciting The Translation Ability of Large Language Models Via Multilingual Finetuning with Translation Instructions IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a detailed analysis by finetuning a multilingual pretrained language model, XGLM-7B, to perform multilingual translation following given instructions. |
Jiahuan Li; Hao Zhou; Shujian Huang; Shanbo Cheng; Jiajun Chen; | arxiv-cs.CL | 2023-05-24 |
331 | Textless Low-Resource Speech-to-Speech Translation With Unit Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a new framework for training textless low-resource speech-to-speech translation (S2ST) systems that only need dozens of hours of parallel speech data. |
Anuj Diwan; Anirudh Srinivasan; David Harwath; Eunsol Choi; | arxiv-cs.CL | 2023-05-24 |
332 | Leveraging GPT-4 for Automatic Translation Post-Editing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we formalize the task of direct translation post-editing with Large Language Models (LLMs) and explore the use of GPT-4 to automatically post-edit NMT outputs across several language pairs. |
Vikas Raunak; Amr Sharaf; Yiren Wang; Hany Hassan Awadallah; Arul Menezes; | arxiv-cs.CL | 2023-05-24 |
333 | Sāmayik: A Benchmark and Dataset for English-Sanskrit Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We include training splits from our contemporary dataset and the Sanskrit-English parallel sentences from the training split of Itihāsa, a previously released classical era machine translation dataset containing Sanskrit. |
AYUSH MAHESHWARI et. al. | arxiv-cs.CL | 2023-05-23 |
334 | BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To better model the common semantics shared across texts and videos, we introduce a contrastive learning method in the cross-modal encoder. |
LIYAN KANG et. al. | arxiv-cs.CV | 2023-05-23 |
335 | WYWEB: A NLP Evaluation Benchmark For Classical Chinese Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For the prosperity of the NLP community, in this paper, we introduce the WYWEB evaluation benchmark, which consists of nine NLP tasks in classical Chinese, implementing sentence classification, sequence labeling, reading comprehension, and machine translation. |
Bo Zhou; Qianglong Chen; Tianyu Wang; Xiaomi Zhong; Yin Zhang; | arxiv-cs.CL | 2023-05-23 |
336 | CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a general framework for combining different features influencing example selection. |
Aswanth Kumar; Ratish Puduppully; Raj Dabre; Anoop Kunchukuttan; | arxiv-cs.CL | 2023-05-23 |
337 | Improving Speech Translation By Fusing Speech and Text Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we harness the complementary strengths of speech and text, which are disparate modalities. |
WENBIAO YIN et. al. | arxiv-cs.CL | 2023-05-23 |
338 | Improving Isochronous Machine Translation with Target Factors and Auxiliary Counters Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce target factors in a transformer model to predict durations jointly with target language phoneme sequences. |
PROYAG PAL et. al. | arxiv-cs.CL | 2023-05-22 |
339 | Non-parametric, Nearest-neighbor-assisted Fine-tuning for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Through qualitative analysis, we found particular improvements when it comes to translating grammatical relations or function words, which results in increased fluency of our model. |
Jiayi Wang; Ke Wang; Yuqi Zhang; Yu Zhao; Pontus Stenetorp; | arxiv-cs.CL | 2023-05-22 |
340 | Neural Machine Translation for Code Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we survey the NMT for code generation literature, cataloging the variety of methods that have been explored according to input and output representations, model architectures, optimization techniques used, data sets, and evaluation methods. |
Dharma KC; Clayton T. Morrison; | arxiv-cs.CL | 2023-05-22 |
341 | Decomposed Prompting for Machine Translation Between Related Languages Using Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce DecoMT, a novel approach of few-shot prompting that decomposes the translation process into a sequence of word chunk translations. |
Ratish Puduppully; Anoop Kunchukuttan; Raj Dabre; Ai Ti Aw; Nancy F. Chen; | arxiv-cs.CL | 2023-05-22 |
342 | Is Translation Helpful? An Empirical Analysis of Cross-Lingual Transfer in Low-Resource Dialog Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A typical approach is to leverage off-the-shelf machine translation (MT) systems to utilize either the training corpus or developed models from high-resource languages. In this work, we investigate whether it is helpful to utilize MT at all in this task. |
Lei Shen; Shuai Yu; Xiaoyu Shen; | arxiv-cs.CL | 2023-05-21 |
343 | VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present our deployment-ready Speech-to-Speech Machine Translation (SSMT) system for English-Hindi, English-Marathi, and Hindi-Marathi language pairs. |
SHIVAM MHASKAR et. al. | arxiv-cs.CL | 2023-05-21 |
344 | ReSeTOX: Re-learning Attention Weights for Toxicity Mitigation in Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Our proposed method, ReSeTOX (REdo SEarch if TOXic), addresses the issue of Neural Machine Translation (NMT) generating translation outputs that contain toxic words not present in the input. |
Javier García Gilabert; Carlos Escolano; Marta R. Costa-Jussà; | arxiv-cs.CL | 2023-05-19 |
345 | HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we release an annotated dataset for the hallucination and omission phenomena covering 18 translation directions with varying resource levels and scripts. |
DAVID DALE et. al. | arxiv-cs.CL | 2023-05-19 |
346 | Viewing Knowledge Transfer in Multilingual Machine Translation Through A Representational Lens Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We argue that translation quality alone is not a sufficient metric for measuring knowledge transfer in multilingual neural machine translation. To support this claim, we introduce Representational Transfer Potential (RTP), which measures representational similarities between languages. |
David Stap; Vlad Niculae; Christof Monz; | arxiv-cs.CL | 2023-05-19 |
347 | Discourse Centric Evaluation of Machine Translation with A Densely Annotated Parallel Corpus Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents a new dataset with rich discourse annotations, built upon the large-scale parallel corpus BWB introduced in Jiang et al. (2022). |
YUCHEN ELEANOR JIANG et. al. | arxiv-cs.CL | 2023-05-18 |
348 | NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we focus on the task of sentiment classification for cross domain adaptation. |
Iyanuoluwa Shode; David Ifeoluwa Adelani; Jing Peng; Anna Feldman; | arxiv-cs.CL | 2023-05-18 |
349 | DUB: Discrete Unit Back-translation for Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: With DUB, the back-translation technique can successfully be applied on direct ST and obtains an average boost of 5.5 BLEU on MuST-C En-De/Fr/Es. |
Dong Zhang; Rong Ye; Tom Ko; Mingxuan Wang; Yaqian Zhou; | arxiv-cs.CL | 2023-05-18 |
350 | AlignAtt: Using Attention-based Audio-Translation Alignments As A Guide for Simultaneous Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose AlignAtt, a novel policy for simultaneous ST (SimulST) that exploits the attention information to generate source-target alignments that guide the model during inference. |
Sara Papi; Marco Turchi; Matteo Negri; | arxiv-cs.CL | 2023-05-18 |
351 | On The Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we find that failing in encoding discriminative target language signal will lead to off-target and a closer lexical distance (i.e., KL-divergence) between two languages’ vocabularies is related with a higher off-target rate. |
Liang Chen; Shuming Ma; Dongdong Zhang; Furu Wei; Baobao Chang; | arxiv-cs.CL | 2023-05-18 |
352 | Multilingual Event Extraction from Historical Newspaper Adverts Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce a new multilingual dataset in English, French, and Dutch composed of newspaper ads from the early modern colonial period reporting on enslaved people who liberated themselves from enslavement. |
Nadav Borenstein; Natalia da Silva Perez; Isabelle Augenstein; | arxiv-cs.CL | 2023-05-18 |
353 | Exploiting Biased Models to De-bias Text: A Gender-Fair Rewriting Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To eliminate the rule-based nature of data creation, we instead propose using machine translation models to create gender-biased text from real gender-fair text via round-trip translation. |
Chantal Amrhein; Florian Schottmann; Rico Sennrich; Samuel Läubli; | arxiv-cs.CL | 2023-05-18 |
354 | Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we introduce a novel method to enhance neural interlingua representations by making their length variable, thereby overcoming the constraint of fixed-length neural interlingua representations. |
Zhuoyuan Mao; Haiyue Song; Raj Dabre; Chenhui Chu; Sadao Kurohashi; | arxiv-cs.CL | 2023-05-17 |
355 | ChatGPT Perpetuates Gender Bias in Machine Translation and Ignores Non-Gendered Pronouns: Findings Across Bengali and Five Other Low-Resource Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this multicultural age, language translation is one of the most performed tasks, and it is becoming increasingly AI-moderated and automated. As a novel AI system, ChatGPT claims to be proficient in such translation tasks and in this paper, we put that claim to the test. |
Sourojit Ghosh; Aylin Caliskan; | arxiv-cs.CY | 2023-05-17 |
356 | Searching for Needles in A Haystack: On The Role of Incidental Bilingualism in PaLM’s Translation Capability Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a mixed-method approach to measure and understand incidental bilingualism at scale. |
Eleftheria Briakou; Colin Cherry; George Foster; | arxiv-cs.CL | 2023-05-17 |
357 | Searching for Needles in A Haystack: On The Role of Incidental Bilingualism in PaLM’s Translation Capability IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Large, multilingual language models exhibit surprisingly good zero- or few-shot machine translation capabilities, despite having never seen the intentionally-included translation … |
Eleftheria Briakou; Colin Cherry; George F. Foster; | Annual Meeting of the Association for Computational … | 2023-05-17 |
358 | Progressive Translation: Improving Domain Robustness of Neural Machine Translation with Intermediate Sequences Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Borrowing techniques from Statistical Machine Translation, we propose intermediate signals which are intermediate sequences from the source-like structure to the target-like structure. |
Chaojun Wang; Yang Liu; Wai Lam; | arxiv-cs.CL | 2023-05-16 |
359 | The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided By Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Motivated particularly by the task of cross-lingual SLU, we demonstrate that the task of speech translation (ST) is a good means of pretraining speech models for end-to-end SLU on both intra- and cross-lingual scenarios. |
Mutian He; Philip N. Garner; | arxiv-cs.CL | 2023-05-16 |
360 | XPQA: Cross-Lingual Product Question Answering Across 12 Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While existing work on PQA focuses mainly on English, in practice there is need to support multiple customer languages while leveraging product information available in English. To study this practical industrial task, we present xPQA, a large-scale annotated cross-lingual PQA dataset in 12 languages across 9 branches, and report results in (1) candidate ranking, to select the best English candidate containing the information to answer a non-English question; and (2) answer generation, to generate a natural-sounding non-English answer based on the selected English candidate. |
Xiaoyu Shen; Akari Asai; Bill Byrne; Adrià de Gispert; | arxiv-cs.CL | 2023-05-16 |
361 | Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Secondly, the knowledge is largely covered by the golden information due to the fact that most top-1 predictions of teachers overlap with ground-truth tokens, which further restricts the potential of KD. To address these issues, we propose a novel method named Top-1 Information Enhanced Knowledge Distillation (TIE-KD). |
SONGMING ZHANG et. al. | arxiv-cs.CL | 2023-05-14 |
362 | PESTS: Persian_English Cross Lingual Corpus for Semantic Textual Similarity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this article, the corpus of semantic textual similarity between sentences in Persian and English languages has been produced for the first time by using linguistic experts. |
Mohammad Abdous; Poorya Piroozfar; Behrouz Minaei Bidgoli; | arxiv-cs.CL | 2023-05-13 |
363 | Improving The Quality of Neural Machine Translation Through Proper Translation of Name Entities Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we have shown a method of improving the quality of neural machine translation by translating/transliterating name entities as a preprocessing step. |
Radhika Sharma; Pragya Katyayan; Nisheeth Joshi; | arxiv-cs.CL | 2023-05-12 |
364 | Improving Zero-shot Multilingual Neural Machine Translation By Leveraging Cross-lingual Consistency Regularization Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces a cross-lingual consistency regularization, CrossConST, to bridge the representation gap among different languages and boost zero-shot translation performance. |
Pengzhi Gao; Liwen Zhang; Zhongjun He; Hua Wu; Haifeng Wang; | arxiv-cs.CL | 2023-05-12 |
365 | Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present Perturbation-based QE – a word-level Quality Estimation approach that works simply by analyzing MT system output on perturbed input source sentences. |
Tu Anh Dinh; Jan Niehues; | arxiv-cs.CL | 2023-05-12 |
366 | Chain-of-Dictionary Prompting Elicits Translation in Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we present a novel method, CoD, which augments LLMs with prior knowledge with the chains of multilingual dictionaries for a subset of input words to elicit translation abilities for LLMs. |
HONGYUAN LU et. al. | arxiv-cs.CL | 2023-05-11 |
367 | Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To use SSMT during inference we propose dynamic decoding, a text generation algorithm that adapts segmentations as it generates translations. |
Francois Meyer; Jan Buys; | arxiv-cs.CL | 2023-05-11 |
368 | How Good Are Commercial Large Language Models on African Languages? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a preliminary analysis of commercial large language models on two tasks (machine translation and text classification) across eight African languages, spanning different language families and geographical areas. |
Jessica Ojo; Kelechi Ogueji; | arxiv-cs.CL | 2023-05-10 |
369 | PriGen: Towards Automated Translation of Android Applications’ Code to Privacy Captions Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Previous work has attempted to help developers create privacy notices through a questionnaire or predefined templates. In this paper, we propose a novel approach and a framework, called PriGen, that extends these prior work. |
Vijayanta Jain; Sanonda Datta Gupta; Sepideh Ghanavati; Sai Teja Peddinti; | arxiv-cs.SE | 2023-05-10 |
370 | Multi-Teacher Knowledge Distillation For Text Image Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel Multi-Teacher Knowledge Distillation (MTKD) method to effectively distillate knowledge into the end-to-end TIMT model from the pipeline model. |
CONG MA et. al. | arxiv-cs.CL | 2023-05-09 |
371 | CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing subword-based neural MT models do not explicitly harness this lexical similarity, as they only implicitly align HRL and ELRL latent embedding space. To overcome this limitation, we propose a novel, CharSpan, approach based on ‘character-span noise augmentation’ into the training data of HRL. |
Kaushal Kumar Maurya; Rahul Kejriwal; Maunendra Sankar Desarkar; Anoop Kunchukuttan; | arxiv-cs.CL | 2023-05-09 |
372 | MultiTACRED: A Multilingual Version of The TAC Relation Extraction Dataset Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Relation extraction (RE) is a fundamental task in information extraction, whose extension to multilingual settings has been hindered by the lack of supervised resources comparable in size to large English datasets such as TACRED (Zhang et al., 2017). To address this gap, we introduce the MultiTACRED dataset, covering 12 typologically diverse languages from 9 language families, which is created by machine-translating TACRED instances and automatically projecting their entity annotations. |
Leonhard Hennig; Philippe Thomas; Sebastian Möller; | arxiv-cs.CL | 2023-05-08 |
373 | Label-Free Multi-Domain Machine Translation with Stage-wise Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a label-free multi-domain machine translation model which requires only a few or no domain-annotated data in training and no domain labels in inference. |
Fan Zhang; Mei Tu; Sangha Kim; Song Liu; Jinyao Yan; | arxiv-cs.CL | 2023-05-06 |
374 | Exploring Human-Like Translation Strategy with Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Compared to typical machine translation that focuses solely on source-to-target mapping, LLM-based translation can potentially mimic the human translation process which might take preparatory steps to ensure high-quality translation. This work explores this possibility by proposing the MAPS framework, which stands for Multi-Aspect Prompting and Selection. |
ZHIWEI HE et. al. | arxiv-cs.CL | 2023-05-06 |
375 | In-context Learning As Maintaining Coherency: A Study of On-the-fly Machine Translation Using Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The phenomena of in-context learning has typically been thought of as learning from examples. In this work which focuses on Machine Translation, we present a perspective of in-context learning as the desired generation task maintaining coherency with its context, i.e., the prompt examples. |
Suzanna Sia; Kevin Duh; | arxiv-cs.CL | 2023-05-05 |
376 | Unified Model Learning for Various Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Although the dataset-specific models have achieved impressive performance, it is cumbersome as each dataset demands a model to be designed, trained, and stored. In this work, we aim to unify these translation tasks into a more general setting. |
YUNLONG LIANG et. al. | arxiv-cs.CL | 2023-05-04 |
377 | Investigating Lexical Sharing in Multilingual Machine Translation for Indian Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we investigate lexical sharing in multilingual machine translation (MT) from Hindi, Gujarati, Nepali into English. |
Sonal Sannigrahi; Rachel Bawden; | arxiv-cs.CL | 2023-05-04 |
378 | Learning Language-Specific Layers for Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce Language-Specific Transformer Layers (LSLs), which allow us to increase model capacity, while keeping the amount of computation and the number of parameters used in the forward pass constant. |
Telmo Pessoa Pires; Robin M. Schmidt; Yi-Hsiu Liao; Stephan Peitz; | arxiv-cs.CL | 2023-05-04 |
379 | Evaluating The Efficacy of Length-Controllable Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We find that BLEURT and COMET have the highest correlation with human evaluation and are most suitable as evaluation metrics for length-controllable machine translation. |
HAO CHENG et. al. | arxiv-cs.CL | 2023-05-03 |
380 | SLTUNET: A Simple Unified Model for Sign Language Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose SLTUNET, a simple unified neural model designed to support multiple SLT-related tasks jointly, such as sign-to-gloss, gloss-to-text and sign-to-text translation. |
Biao Zhang; Mathias Müller; Rico Sennrich; | arxiv-cs.CL | 2023-05-02 |
381 | English-Assamese Neural Machine Translation Using Prior Alignment and Pre-trained Language Model Related Papers Related Patents Related Grants Related Venues Related Experts View |
SAHINUR RAHMAN LASKAR et. al. | Comput. Speech Lang. | 2023-05-01 |
382 | Metamorphic Testing of Machine Translation Models Using Back Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation software has been widely adopted in recent years. The recent advance in deep learning research has massively improved the accuracy and fluency of the … |
Wentao Gao; Jiayuan He; Van-Thuan Pham; | 2023 IEEE/ACM International Workshop on Deep Learning for … | 2023-05-01 |
383 | Low-Resourced Machine Translation for Senegalese Wolof Language Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present a parallel Wolof/French corpus of 123,000 sentences on which we conducted experiments on machine translation models based on Recurrent Neural Networks (RNN) in different data configurations. |
Derguene Mbaye; Moussa Diallo; Thierno Ibrahima Diop; | arxiv-cs.CL | 2023-04-30 |
384 | Synthetic Cross-language Information Retrieval Training Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While MS MARCO is a large resource, it is of fixed size; its genre and domain of discourse are fixed; and the translated documents are not written in the language of a native speaker of the language, but rather in translationese. To address these problems, we introduce the JH-POLO CLIR training set creation methodology. |
JAMES MAYFIELD et. al. | arxiv-cs.IR | 2023-04-29 |
385 | Lost In Translation: Generating Adversarial Examples Robust to Round-Trip Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a comprehensive study on the robustness of current text adversarial attacks to round-trip translation. |
N. Bhandari; P. -Y. Chen; | icassp | 2023-04-27 |
386 | Rethinking The Reasonability of The Test Set for Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we manually annotate a monotonic test set based on the MuST-C English-Chinese test set, denoted as SiMuST-C. |
M. LIU et. al. | icassp | 2023-04-27 |
387 | Improving Speech-to-Speech Translation Through Unlabeled Text Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose an effective way to utilize the massive existing unlabeled text from different languages to create a large amount of S2ST data to improve S2ST performance by applying various acoustic effects to the generated synthetic data. |
X. -P. NGUYEN et. al. | icassp | 2023-04-27 |
388 | LEAPT: Learning Adaptive Prefix-to-Prefix Translation For Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by strategies utilized by human interpreters and wait policies, we propose a novel adaptive prefix-to-prefix training policy called LEAPT, which allows our machine translation model to learn how to translate source sentence prefixes and make use of the future context. |
L. Lin; S. Li; X. Shi; | icassp | 2023-04-27 |
389 | M3ST: Mix at Three Levels for Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Mix at three levels for Speech Translation (M3ST) method to increase the diversity of the augmented training corpus. |
X. CHENG et. al. | icassp | 2023-04-27 |
390 | Targeted Adversarial Attacks Against Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a new targeted adversarial attack against NMT models. |
S. Sadrizadeh; A. D. Aghdam; L. Dolamic; P. Frossard; | icassp | 2023-04-27 |
391 | Escaping The Sentence-level Paradigm in Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Much work in document-context machine translation exists, but for various reasons has been unable to catch hold. This paper suggests a path out of this rut by addressing three impediments at once: what architectures should we use? |
Matt Post; Marcin Junczys-Dowmunt; | arxiv-cs.CL | 2023-04-25 |
392 | Lost in Translationese? Reducing Translation Effect Using Abstract Meaning Representation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We compare our AMR-based approach against three other techniques based on machine translation or paraphrase generation. |
Shira Wein; Nathan Schneider; | arxiv-cs.CL | 2023-04-22 |
393 | Improving Speech Translation By Cross-Modal Multi-Grained Contrastive Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, the final model often performs worse on the MT task than the MT model trained alone, which means that the knowledge transfer ability of this method is also limited. To deal with these problems, we propose the FCCL (Fine- and Coarse- Granularity Contrastive Learning) approach for E2E-ST, which makes explicit knowledge transfer through cross-modal multi-grained contrastive learning. |
HAO ZHANG et. al. | arxiv-cs.CL | 2023-04-20 |
394 | The EBible Corpus: Data and Model Benchmarks for Bible Translation for Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce the eBible corpus: a dataset containing 1009 translations of portions of the Bible with data in 833 different languages across 75 language families. |
VESA AKERMAN et. al. | arxiv-cs.CL | 2023-04-19 |
395 | An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, works focusing on distilling knowledge from large multilingual neural machine translation (MNMT) models into smaller ones are practically nonexistent, despite the popularity and superiority of MNMT. This paper bridges this gap by presenting an empirical investigation of knowledge distillation for compressing MNMT models. |
Varun Gumma; Raj Dabre; Pratyush Kumar; | arxiv-cs.CL | 2023-04-18 |
396 | Improving Autoregressive NLP Tasks Via Modular Linearized Attention Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes modular linearized attention (MLA), which combines multiple efficient attention mechanisms, including cosFormer, to maximize inference quality while achieving notable speedups. |
Victor Agostinelli; Lizhong Chen; | arxiv-cs.CL | 2023-04-17 |
397 | Neural Machine Translation For Low Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The goal of this paper is to investigate the realm of low resource languages and build a Neural Machine Translation model to achieve state-of-the-art results. |
Vakul Goyle; Parvathy Krishnaswamy; Kannan Girija Ravikumar; Utsa Chattopadhyay; Kartikay Goyle; | arxiv-cs.CL | 2023-04-16 |
398 | TransDocs: Optical Character Recognition with Word to Word Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, I have shown comparative study for pre-trained OCR while using deep learning model using LSTM-based seq2seq architecture with attention for machine translation. |
Abhishek Bamotra; Phani Krishna Uppala; | arxiv-cs.CV | 2023-04-15 |
399 | Learning Homographic Disambiguation Representation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel approach to tackle homographic issues of NMT in the latent space. |
Weixuan Wang; Wei Peng; Qun Liu; | arxiv-cs.CL | 2023-04-12 |
400 | Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we systematically investigate the advantages and challenges of LLMs for MMT by answering two questions: 1) How well do LLMs perform in translating massive languages? |
WENHAO ZHU et. al. | arxiv-cs.CL | 2023-04-10 |
401 | RISC: Generating Realistic Synthetic Bilingual Insurance Contract Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents RISC, an open-source Python package data generator (https://github.com/GRAAL-Research/risc). |
David Beauchemin; Richard Khoury; | arxiv-cs.CL | 2023-04-09 |
402 | MUFIN: Improving Neural Repair Models with Back-Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The main problem is to generate interesting and diverse pairs that maximize the effectiveness of training. As a contribution to this problem, we propose to use back-translation, a technique coming from neural machine translation. |
André Silva; João F. Ferreira; He Ye; Martin Monperrus; | arxiv-cs.SE | 2023-04-05 |
403 | How to Design Translation Prompts for ChatGPT: An Empirical Study IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Thus, in this paper, we explore how to assist machine translation with ChatGPT. |
Yuan Gao; Ruili Wang; Feng Hou; | arxiv-cs.CL | 2023-04-04 |
404 | LAHM : Large Annotated Dataset for Multi-Domain and Multilingual Hate Speech Identification Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a new multilingual hate speech analysis dataset for English, Hindi, Arabic, French, German and Spanish languages for multiple domains across hate speech – Abuse, Racism, Sexism, Religious Hate and Extremism. |
Ankit Yadav; Shubham Chandel; Sushant Chatufale; Anil Bandhakavi; | arxiv-cs.CL | 2023-04-03 |
405 | ε KÚ Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper investigates the performance of massively multilingual neural machine translation (NMT) systems in translating Yorùbá greetings (ε kú [MASK]), which are a big part of Yorùbá language and culture, into English. To evaluate these models, we present IkiniYorùbá, a Yorùbá-English translation dataset containing some Yorùbá greetings, and sample use cases. |
Idris Akinade; Jesujoba Alabi; David Adelani; Clement Odoje; Dietrich Klakow; | arxiv-cs.CL | 2023-03-31 |
406 | Hallucinations in Large Multilingual Translation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing research on hallucinations has primarily focused on small bilingual models trained on high-resource languages, leaving a gap in our understanding of hallucinations in massively multilingual models across diverse translation scenarios. In this work, we fill this gap by conducting a comprehensive analysis on both the M2M family of conventional neural machine translation models and ChatGPT, a general-purpose large language model (LLM) that can be prompted for translation. |
NUNO M. GUERREIRO et. al. | arxiv-cs.CL | 2023-03-28 |
407 | Translate The Beauty in Songs: Jointly Learning to Align Melody and Translate Lyrics Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Lyrics-Melody Translation with Adaptive Grouping (LTAG), a holistic solution to automatic song translation by jointly modeling lyrics translation and lyrics-melody alignment. |
CHENGXI LI et. al. | arxiv-cs.CL | 2023-03-27 |
408 | Bilex Rx: Lexical Data Augmentation for Massively Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We test the efficacy of bilingual lexica in a real-world set-up, on 200-language translation models trained on web-crawled text. We present several findings: (1) using lexical data augmentation, we demonstrate sizable performance gains for unsupervised translation; (2) we compare several families of data augmentation, demonstrating that they yield similar improvements, and can be combined for even greater improvements; (3) we demonstrate the importance of carefully curated lexica over larger, noisier ones, especially with larger models; and (4) we compare the efficacy of multilingual lexicon data versus human-translated parallel data. |
Alex Jones; Isaac Caswell; Ishank Saxena; Orhan Firat; | arxiv-cs.CL | 2023-03-27 |
409 | Linguistically Informed ChatGPT Prompts to Enhance Japanese-Chinese Machine Translation: A Case Study on Attributive Clauses Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Present-day machine translation tools often fail to accurately translate attributive clauses from Japanese to Chinese. In light of this, this paper investigates the linguistic problem underlying such difficulties, namely how does the semantic role of the modified noun affect the selection of translation patterns for attributive clauses, from a linguistic perspective. |
Wenshi Gu; | arxiv-cs.CL | 2023-03-27 |
410 | Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To further improve the performance of LLMs on MT quality assessment, we investigate several prompting designs, and propose a new prompting method called Error Analysis Prompting (EAPrompt) by combining Chain-of-Thoughts (Wei et al., 2022) and Error Analysis (Lu et al., 2023). |
QINGYU LU et. al. | arxiv-cs.CL | 2023-03-24 |
411 | Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models: A Case Study on ChatGPT IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Generative large language models (LLMs), e.g., ChatGPT, have demonstrated remarkable proficiency across several NLP tasks, such as machine translation, text summarization. Recent … |
Qingyu Lu; Baopu Qiu; Liang Ding; Liping Xie; Dacheng Tao; | ArXiv | 2023-03-24 |
412 | Towards Making The Most of ChatGPT for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we aim to further mine ChatGPT’s translation ability by revisiting several aspects: temperature, task information, and domain information, and correspondingly propose an optimal temperature setting and two (simple but effective) prompts: Task-Specific Prompts (TSP) and Domain-Specific Prompts (DSP). |
KEQIN PENG et. al. | arxiv-cs.CL | 2023-03-23 |
413 | Selective Data Augmentation for Robust Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose to use an e2e architecture for English-Hindi (en-hi) ST. We use two imperfect machine translation (MT) services to translate Libri-trans en text into hi text. |
Rajul Acharya; Ashish Panda; Sunil Kumar Kopparapu; | arxiv-cs.CL | 2023-03-22 |
414 | LEAPT: Learning Adaptive Prefix-to-prefix Translation For Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by strategies utilized by human interpreters and wait policies, we propose a novel adaptive prefix-to-prefix training policy called LEAPT, which allows our machine translation model to learn how to translate source sentence prefixes and make use of the future context. |
Lei Lin; Shuangtao Li; Xiaodong Shi; | arxiv-cs.CL | 2023-03-21 |
415 | Translate Your Gibberish: Black-box Adversarial Attack on Machine Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we present a simple approach to fool state-of-the-art machine translation tools in the task of translation from Russian to English and vice versa. |
Andrei Chertkov; Olga Tsymboi; Mikhail Pautov; Ivan Oseledets; | arxiv-cs.CL | 2023-03-20 |
416 | Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A contributing factor to this problem is that NMT models trained with the one-to-one paradigm struggle to handle the source diversity phenomenon, where inputs with the same meaning can be expressed differently. In this work, we treat this problem as a bilevel optimization problem and present a consistency-aware meta-learning (CAML) framework derived from the model-agnostic meta-learning (MAML) algorithm to address it. |
Rongxiang Weng; Qiang Wang; Wensen Cheng; Changfeng Zhu; Min Zhang; | arxiv-cs.CL | 2023-03-20 |
417 | ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To relax the dependency on labeled data of downstream tasks, we propose an intuitive and effective zero-shot learning framework, ZeroNLG, which can deal with multiple NLG tasks, including image-to-text (image captioning), video-to-text (video captioning), and text-to-text (neural machine translation), across English, Chinese, German, and French within a unified framework. |
BANG YANG et. al. | arxiv-cs.CL | 2023-03-11 |
418 | A Multi-stack RNN-based Neural Machine Translation Model for English to Pakistan Sign Language Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
U. Farooq; Mohd Shafry Mohd Rahim; Adnan Abid; | Neural Computing and Applications | 2023-03-11 |
419 | MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Nonetheless, visual speech is not as distinguishable as audio speech, making it difficult to develop a mapping from source speech phonemes to the target language text. To address this issue, we propose MixSpeech, a cross-modality self-learning framework that utilizes audio speech to regularize the training of visual speech tasks. |
XIZE CHENG et. al. | arxiv-cs.CV | 2023-03-09 |
420 | GATE: A Challenge Set for Gender-Ambiguous Translation Examples Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Recent work has led to the development of gender rewriters that generate alternative gender translations on such ambiguous inputs, but such systems are plagued by poor linguistic coverage. To encourage better performance on this task we present and release GATE, a linguistically diverse corpus of gender-ambiguous source sentences along with multiple alternative target language translations. |
Spencer Rarrick; Ranjita Naik; Varun Mathur; Sundar Poudel; Vishal Chowdhary; | arxiv-cs.CL | 2023-03-07 |
421 | Exploiting Language Relatedness in Machine Translation Through Domain Adaptation Techniques Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In order to tackle the challenges faced by MT, we present a novel approach of using a scaled similarity score of sentences, especially for related languages based on a 5-gram KenLM language model with Kneser-Ney smoothing technique for filtering in-domain data from out-of-domain corpora that boost the translation quality of MT. Furthermore, we employ other domain adaptation techniques such as multi-domain, fine-tuning and iterative back-translation approach to compare our novel approach on the Hindi-Nepali language pair for NMT and SMT. |
Amit Kumar; Rupjyoti Baruah; Ajay Pratap; Mayank Swarnkar; Anil Kumar Singh; | arxiv-cs.CL | 2023-03-03 |
422 | Rethinking The Reasonability of The Test Set for Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we manually annotate a monotonic test set based on the MuST-C English-Chinese test set, denoted as SiMuST-C. |
MENGGE LIU et. al. | arxiv-cs.CL | 2023-03-02 |
423 | Targeted Adversarial Attacks Against Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a new targeted adversarial attack against NMT models. |
Sahar Sadrizadeh; AmirHossein Dabiri Aghdam; Ljiljana Dolamic; Pascal Frossard; | arxiv-cs.CL | 2023-03-02 |
424 | Federated Nearest Neighbor Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel federated nearest neighbor (FedNN) machine translation framework that, instead of multi-round model-based interactions, leverages one-round memorization-based interaction to share knowledge across different clients to build low-overhead privacy-preserving systems. |
YICHAO DU et. al. | arxiv-cs.CL | 2023-02-23 |
425 | Machine Translation and Its Evaluation: A Study Related Papers Related Patents Related Grants Related Venues Related Experts View |
S. Mondal; Haoxi Zhang; H. Kabir; Kan Ni; Hongning Dai; | Artificial Intelligence Review | 2023-02-19 |
426 | Exploring The Potential of Machine Translation for Generating Named Entity Datasets: A Case Study Between Persian and English Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This study focuses on the generation of Persian named entity datasets through the application of machine translation on English datasets. |
Amir Sartipi; Afsaneh Fatemi; | arxiv-cs.CL | 2023-02-19 |
427 | Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with A Distilled Representation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose automatic methods that use ToD training data in a source language to build a high-quality functioning dialogue agent in another target language that has no training data (i.e. zero-shot) or a small training set (i.e. few-shot). |
Mehrad Moradshahi; Sina J. Semnani; Monica S. Lam; | arxiv-cs.CL | 2023-02-18 |
428 | How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a comprehensive evaluation of GPT models for machine translation, covering various aspects such as quality of different GPT models in comparison with state-of-the-art research and commercial systems, effect of prompting strategies, robustness towards domain shifts and document-level translation. |
AMR HENDY et. al. | arxiv-cs.CL | 2023-02-17 |
429 | Evaluating and Improving The Coreference Capabilities of Machine Translation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we ask: How well do MT models learn coreference resolution from implicit signal? |
Asaf Yehudai; Arie Cattan; Omri Abend; Gabriel Stanovsky; | arxiv-cs.CL | 2023-02-16 |
430 | Document Flattening: Beyond Concatenating Context for Document-Level Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We conduct comprehensive experiments and analyses on three benchmark datasets for English-German translation, and validate the effectiveness of two variants of DocFlat. |
Minghao Wu; George Foster; Lizhen Qu; Gholamreza Haffari; | arxiv-cs.CL | 2023-02-15 |
431 | Encoding Sentence Position in Context-Aware Neural Machine Translation with Concatenation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We compare various methods to encode sentence positions into token representations, including novel methods. |
Lorenzo Lupo; Marco Dinarelli; Laurent Besacier; | arxiv-cs.CL | 2023-02-13 |
432 | Language-Aware Multilingual Machine Translation with Self-Supervised Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Finally, we apply intra-distillation to this co-training approach. Combining these two approaches significantly improves MMT performance, outperforming three state-of-the-art SSL methods by a large margin, e.g., 11.3% and 3.7% improvement on an 8-language and a 15-language benchmark compared with MASS, respectively. |
Haoran Xu; Jean Maillard; Vedanuj Goswami; | arxiv-cs.CL | 2023-02-09 |
433 | Learning Translation Quality Evaluation on Low Resource Languages from Large Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Training such metrics requires data which can be expensive and difficult to acquire, particularly for lower-resource languages. We show how knowledge can be distilled from Large Language Models (LLMs) to improve upon such learned metrics without requiring human annotators, by creating synthetic datasets which can be mixed into existing datasets, requiring only a corpus of text in the target language. |
Amirkeivan Mohtashami; Mauro Verzetti; Paul K. Rubenstein; | arxiv-cs.CL | 2023-02-07 |
434 | The Unreasonable Effectiveness of Few-shot Learning for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show that with only 5 examples of high-quality translation data shown at inference, a transformer decoder-only model trained solely with self-supervised learning, is able to match specialized supervised state-of-the-art models as well as more general commercial translation systems. |
XAVIER GARCIA et. al. | arxiv-cs.CL | 2023-02-02 |
435 | Code Translation with Compiler Representations IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we leverage low-level compiler intermediate representations (IR) to improve code translation. |
MARC SZAFRANIEC et. al. | iclr | 2023-02-01 |
436 | An Evaluation of Persian-English Machine Translation Datasets with Transformers Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Nowadays, many researchers are focusing their attention on the subject of machine translation (MT). However, Persian machine translation has remained unexplored despite a vast … |
Amir Sartipi; Meghdad Dehghan; Afsaneh Fatemi; | arxiv-cs.CL | 2023-02-01 |
437 | Attention Link: An Efficient Attention-Based Low Resource Machine Translation Architecture Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel architecture named as attention link (AL) to help improve transformer models’ performance, especially in low training resources. |
Zeping Min; | arxiv-cs.CL | 2023-02-01 |
438 | Adaptive Machine Translation with Large Language Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work aims to investigate how we can utilize in-context learning to improve real-time adaptive MT. Our extensive experiments show promising results at translation time. |
Yasmin Moslem; Rejwanul Haque; John D. Kelleher; Andy Way; | arxiv-cs.CL | 2023-01-30 |
439 | KG-BERTScore: Incorporating Knowledge Graph Into BERTScore for Reference-Free Machine Translation Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we incorporate multilingual knowledge graph into BERTScore and propose a metric named KG-BERTScore, which linearly combines the results of BERTScore and bilingual named entity matching for reference-free machine translation evaluation. |
ZHANGLIN WU et. al. | arxiv-cs.CL | 2023-01-30 |
440 | Gender Neutralization for An Inclusive Machine Translation: from Theoretical Foundations to Open Challenges Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we explore gender-neutral translation (GNT) as a form of gender inclusivity and a goal to be achieved by machine translation (MT) models, which have been found to perpetuate gender bias and discrimination. |
Andrea Piergentili; Dennis Fucci; Beatrice Savoldi; Luisa Bentivogli; Matteo Negri; | arxiv-cs.CL | 2023-01-24 |
441 | Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Choosing the incorrect option might significantly affect translation usefulness and quality. We propose a novel method interactive-chain prompting — a series of question, answering and generation intermediate steps between a Translator model and a User model — that reduces translations into a list of subproblems addressing ambiguities and then resolving such subproblems before producing the final text to be translated. |
Jonathan Pilault; Xavier Garcia; Arthur Bražinskas; Orhan Firat; | arxiv-cs.LG | 2023-01-24 |
442 | Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This report provides a preliminary evaluation of ChatGPT for machine translation, including translation prompt, multilingual translation, and translation robustness. |
WENXIANG JIAO et. al. | arxiv-cs.CL | 2023-01-20 |
443 | Improving Machine Translation with Phrase Pair Injection and Corpus Filtering Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we show that the combination of Phrase Pair Injection and Corpus Filtering boosts the performance of Neural Machine Translation (NMT) systems. |
Akshay Batheja; Pushpak Bhattacharyya; | arxiv-cs.CL | 2023-01-19 |
444 | Machine Translation for Accessible Multi-Language Text Analysis Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we aim to leverage those very advances to demonstrate that multi-language analysis is currently accessible to all computational scholars. |
Edward W. Chew; William D. Weisman; Jingying Huang; Seth Frey; | arxiv-cs.CL | 2023-01-19 |
445 | Understanding and Detecting Hallucinations in Neural Machine Translation Via Model Introspection IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Neural sequence generation models are known to hallucinate, by producing outputs that are unrelated to the source text. These hallucinations are potentially harmful, yet it … |
Weijia Xu; Sweta Agrawal; Eleftheria Briakou; Marianna J. Martindale; Marine Carpuat; | arxiv-cs.CL | 2023-01-18 |
446 | Unsupervised Mandarin-Cantonese Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The key contributions of our project include: 1. |
Megan Dare; Valentina Fajardo Diaz; Averie Ho Zoen So; Yifan Wang; Shibingfeng Zhang; | arxiv-cs.CL | 2023-01-10 |
447 | Automatic Standardization of Arabic Dialects for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Carrying out this research could then lead to combining “automatic standardization” software and automatic translation software so that we take the output of the first software and introduce it as input into the second one to obtain at the end a quality machine translation. |
Abidrabbo Alnassan; | arxiv-cs.CL | 2023-01-09 |
448 | Applying Automated Machine Translation to Educational Video Courses Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We studied the capability of automated machine translation in the online video education space by automatically translating Khan Academy videos with state-of-the-art translation models and applying text-to-speech synthesis and audio/video synchronization to build engaging videos in target languages. |
Linden Wang; | arxiv-cs.CL | 2023-01-08 |
449 | Building A Parallel Corpus and Training Translation Models Between Luganda and English Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we build a parallel corpus with 41,070 pairwise sentences for Luganda and English which is based on three different open-sourced corpora. |
Richard Kimera; Daniela N. Rim; Heeyoul Choi; | arxiv-cs.CL | 2023-01-06 |
450 | Statistical Machine Translation for Indic Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Different preprocessing approaches are proposed in this paper to handle the noise of the dataset. |
Sudhansu Bala Das; Divyajoti Panda; Tapas Kumar Mishra; Bidyut Kr. Patra; | arxiv-cs.CL | 2023-01-02 |
451 | Bridging The Gap Between Native Text and Translated Text Through Adversarial Learning: A Case Study on Cross-Lingual Event Extraction Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recent research in cross-lingual learning has found that combining large-scale pretrained multilingual language models with machine translation can yield good performance. We … |
Pengfei Yu; Jonathan May; Heng Ji; | Findings | 2023-01-01 |
452 | DialectNLU at NADI 2023 Shared Task: Transformer Based Multitask Approach Jointly Integrating Dialect and Machine Translation Tasks in Arabic Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With approximately 400 million speakers worldwide, Arabic ranks as the fifth most-spoken language globally, necessitating advancements in natural language processing. This paper … |
Hariram Veeramani; Surendrabikram Thapa; Usman Naseem; | ARABICNLP | 2023-01-01 |
453 | From Inclusive Language to Gender-Neutral Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Gender inclusivity in language has become a central topic of debate and research. Its application in the cross-lingual contexts of human and machine translation (MT), however, … |
Andrea Piergentili; Dennis Fucci; Beatrice Savoldi; L. Bentivogli; Matteo Negri; | ArXiv | 2023-01-01 |
454 | Results of WMT23 Metrics Shared Task: Metrics Might Be Guilty But References Are Not Innocent Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents the results of the WMT23 Metrics Shared Task. Participants submitting automatic MT evaluation metrics were asked to score the outputs of the translation … |
MARKUS FREITAG et. al. | Conference on Machine Translation | 2023-01-01 |
455 | Findings of The WMT 2023 Shared Task on Low-Resource Indic Language Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents the results of the low-resource Indic language translation task organized alongside the Eighth Conference on Machine Translation (WMT) 2023. In this task, … |
SANTANU PAL et. al. | Conference on Machine Translation | 2023-01-01 |
456 | Exploring Prompt Engineering with GPT Language Models for Document-Level Machine Translation: Insights and Findings Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes Lan-Bridge Translation systems for the WMT 2023 General Translation shared task. We participate in 2 directions: English to and from Chinese. With the … |
Yangjian Wu; Gang Hu; | Conference on Machine Translation | 2023-01-01 |
457 | PROMT Systems for WMT23 Shared General Translation Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the PROMT submissions for the WMT23 Shared General Translation Task. This year we participated in two directions of the Shared Translation Task: English to … |
Alexander P. Molchanov; Vladislav Kovalenko; | Conference on Machine Translation | 2023-01-01 |
458 | AIST AIRC Submissions to The WMT23 Shared Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the development process of NMT systems that were submitted to the WMT 2023 General Translation task by the team of AIST AIRC. We trained constrained track … |
Matīss Rikters; Makoto Miwa; | Conference on Machine Translation | 2023-01-01 |
459 | DSHacker at SemEval-2023 Task 3: Genres and Persuasion Techniques Detection with Multilingual Data Augmentation Through Machine Translation and Text Generation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In our article, we present the systems developed for SemEval-2023 Task 3, which aimed to evaluate the ability of Natural Language Processing (NLP) systems to detect genres and … |
Arkadiusz Modzelewski; Witold Sosnowski; M. Wilczynska; A. Wierzbicki; | International Workshop on Semantic Evaluation | 2023-01-01 |
460 | Findings of The 2023 Conference on Machine Translation (WMT23): LLMs Are Here But Not Quite There Yet Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents the results of the General Machine Translation Task organised as part of the 2023 Conference on Machine Translation (WMT). In the general MT task, participants … |
TOM KOCMI et. al. | Conference on Machine Translation | 2023-01-01 |
461 | Findings of The Second WMT Shared Task on Sign Language Translation (WMT-SLT23) Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents the results of the Second WMT Shared Task on Sign Language Translation (WMT-SLT23; https://www.wmt-slt.com/). This shared task is concerned with automatic … |
MATHIAS MÜLLER et. al. | Conference on Machine Translation | 2023-01-01 |
462 | Towards Responsible Machine Translation: Ethical and Legal Considerations in Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Towards Responsible Machine Translation | 2023-01-01 | |
463 | Improving Neural Machine Translation Formality Control with Domain Adaptation and Reranking-based Transductive Learning Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents Huawei Translation Service Center (HW-TSC)’s submission on the IWSLT 2023 formality control task, which provides two training scenarios: supervised and … |
ZHANGLIN WU et. al. | International Workshop on Spoken Language Translation | 2023-01-01 |
464 | The HW-TSC’s Simultaneous Speech-to-Text Translation System for IWSLT 2023 Evaluation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we present our submission to the IWSLT 2023 Simultaneous Speech-to-Text Translation competition. Our participation involves three language directions: … |
JIAXIN GUO et. al. | International Workshop on Spoken Language Translation | 2023-01-01 |
465 | Improving Formality-Sensitive Machine Translation Using Data-Centric Approaches and Prompt Engineering Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we present the KU x Upstage team’s submission for the Special Task on Formality Control on Spoken Language Translation, which involves translating English into four … |
Seungjun Lee; Hyeonseok Moon; Chanjun Park; Heu-Jeoung Lim; | International Workshop on Spoken Language Translation | 2023-01-01 |
466 | NAIST Simultaneous Speech-to-speech Translation System for IWSLT 2023 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes NAIST’s submission to the IWSLT 2023 Simultaneous Speech Translation task: English-to-German, Japanese, Chinese speech-to-text translation and … |
RYO FUKUDA et. al. | International Workshop on Spoken Language Translation | 2023-01-01 |
467 | A Data Augmentation Method for English-Vietnamese Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The translation quality of machine translation systems depends on the parallel corpus used for training, particularly on the quantity and quality of the corpus. However, building … |
N. Pham; Van Vinh Nguyen; T. Pham; | IEEE Access | 2023-01-01 |
468 | Is ChatGPT A Good Translator? A Preliminary Study Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This report provides a preliminary evaluation of ChatGPT for machine translation, including translation prompt, multilingual translation, and translation robustness. We adopt the … |
Wenxiang Jiao; Wenxuan Wang; Jen-tse Huang; Xing Wang; Zhaopeng Tu; | ArXiv | 2023-01-01 |
469 | HW-TSC 2023 Submission for The Quality Estimation Shared Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Quality estimation (QE) is an essential technique to assess machine translation quality without reference translations. In this paper, we focus on Huawei Translation Services … |
YUANG LI et. al. | Conference on Machine Translation | 2023-01-01 |
470 | Non-Autoregressive Neural Machine Translation: A Call for Clarity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we take a step back and revisit several techniques that have been proposed for improving non-autoregressive translation models and compare their combined translation quality and speed implications under third-party testing environments. |
Robin Schmidt; Telmo Pires; Stephan Peitz; Jonas Lööf; | emnlp | 2022-12-30 |
471 | Breaking The Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence Modeling Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a novel representation method for Chinese characters to break the bottlenecks, namely StrokeNet, which represents a Chinese character by a Latinized stroke sequence. |
Zhijun Wang; Xuebo Liu; Min Zhang; | emnlp | 2022-12-30 |
472 | Hypoformer: Hybrid Decomposition Transformer for Edge-friendly Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To compress and accelerate Transformer, we propose a Hybrid Tensor-Train (HTT) decomposition, which retains full rank and meanwhile reduces operations and parameters. |
SUNZHU LI et. al. | emnlp | 2022-12-30 |
473 | IndicXNLI: Evaluating Multilingual Inference for Indian Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To this end, we introduce INDICXNLI, an NLI dataset for 11 Indic languages. |
Divyanshu Aggarwal; Vivek Gupta; Anoop Kunchukuttan; | emnlp | 2022-12-30 |
474 | Neural Machine Translation with Contrastive Translation Memories Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Different from previous works that make use of mutually similar but redundant translation memories (TMs), we propose a new retrieval-augmented NMT to model contrastively retrieved translation memories that are holistically similar to the source sentence while individually contrastive to each other providing maximal information gain in three phases. |
Xin Cheng; Shen Gao; Lemao Liu; Dongyan Zhao; Rui Yan; | emnlp | 2022-12-30 |
475 | Modeling Consistency Preference Via Lexical Chains for Document-level Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we aim to relieve the issue of lexical translation inconsistency for document-level neural machine translation (NMT) by modeling consistency preference for lexical chains, which consist of repeated words in a source-side document and provide a representation of the lexical consistency structure of the document. |
XINGLIN LYU et. al. | emnlp | 2022-12-30 |
476 | DEMETR: Diagnosing Evaluation Metrics for Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The operations of newer learned metrics (e.g., BLEURT, COMET), which leverage pretrained language models to achieve higher correlations with human quality judgments than BLEU, are opaque in comparison. In this paper, we shed light on the behavior of these learned metrics by creating DEMETR, a diagnostic dataset with 31K English examples (translated from 10 source languages) for evaluating the sensitivity of MT evaluation metrics to 35 different linguistic perturbations spanning semantic, syntactic, and morphological error categories. |
MARZENA KARPINSKA et. al. | emnlp | 2022-12-30 |
477 | Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we explore the challenging problem of performing a generative task in a target language when labeled data is only available in English, using summarization as a case study. |
TU VU et. al. | emnlp | 2022-12-30 |
478 | LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we first propose the Multilingual MMT task by establishing two new Multilingual MMT benchmark datasets covering seven languages. Then, an effective baseline LVP-M3 using visual prompts is proposed to support translations between different languages, which includes three stages (token encoding, language-aware visual prompt generation, and language translation). |
HONGCHENG GUO et. al. | emnlp | 2022-12-30 |
479 | Information-Transport-based Policy for Simultaneous Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we treat the translation as information transport from source to target and accordingly propose an Information-Transport-based Simultaneous Translation (ITST). |
Shaolei Zhang; Yang Feng; | emnlp | 2022-12-30 |
480 | Exploring Document-Level Literary Machine Translation with Parallel Paragraphs from World Literature IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The experts note that MT outputs contain not only mistranslations, but also discourse-disrupting errors and stylistic inconsistencies. To address these problems, we train a post-editing model whose output is preferred over normal MT output at a rate of 69% by experts. |
KATHERINE THAI et. al. | emnlp | 2022-12-30 |
481 | DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce DivEMT, the first publicly available post-editing study of Neural Machine Translation (NMT) over a typologically diverse set of target languages. |
Gabriele Sarti; Arianna Bisazza; Ana Guerberof-Arenas; Antonio Toral; | emnlp | 2022-12-30 |
482 | A Template-based Method for Constrained Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a template-based method that can yield results with high translation quality and match accuracy and the inference speed of our method is comparable with unconstrained NMT models. |
SHUO WANG et. al. | emnlp | 2022-12-30 |
483 | PreQuEL: Quality Estimation of Machine Translation Outputs in Advance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present the task of PreQuEL, Pre-(Quality-Estimation) Learning. |
Shachar Don-Yehiya; Leshem Choshen; Omri Abend; | emnlp | 2022-12-30 |
484 | WeTS: A Benchmark for Translation Suggestion IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To break these limitations mentioned above and spur the research in TS, we create a benchmark dataset, called WeTS, which is a golden corpus annotated by expert translators on four translation directions. |
Zhen Yang; Fandong Meng; Yingxue Zhang; Ernan Li; Jie Zhou; | emnlp | 2022-12-30 |
485 | Entropy-Based Vocabulary Substitution for Incremental Learning in Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose an entropy-based vocabulary substitution (EVS) method that just needs to walk through new language pairs for incremental learning in a large-scale multilingual data updating while remaining the size of the vocabulary. |
Kaiyu Huang; Peng Li; Jin Ma; Yang Liu; | emnlp | 2022-12-30 |
486 | Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Thus, in this work, we introduce IKD-MMT, a novel MMT framework to support the image-free inference phase via an inversion knowledge distillation scheme. |
Ru Peng; Yawen Zeng; Jake Zhao; | emnlp | 2022-12-30 |
487 | Low-resource Neural Machine Translation with Cross-modal Alignment Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we turn to connect several low-resource languages to a particular high-resource one by additional visual modality. |
Zhe Yang; Qingkai Fang; Yang Feng; | emnlp | 2022-12-30 |
488 | Competency-Aware Neural Machine Translation: Can Machine Translation Know Its Own Translation Quality? Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This is in sharp contrast to human translators who give feedback or conduct further investigations whenever they are in doubt about predictions. To fill this gap, we propose a novel competency-aware NMT by extending conventional NMT with a self-estimator, offering abilities to translate a source sentence and estimate its competency. |
PEI ZHANG et. al. | emnlp | 2022-12-30 |
489 | MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce MT-GenEval, a benchmark for evaluating gender accuracy in translation from English into eight widely-spoken languages. |
ANNA CURREY et. al. | emnlp | 2022-12-30 |
490 | GuoFeng: A Benchmark for Zero Pronoun Recovery and Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To bridge the data and evaluation gaps, we propose a benchmark testset for target evaluation on Chinese-English ZP translation. |
MINGZHOU XU et. al. | emnlp | 2022-12-30 |
491 | T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a new approach to perform zero-shot cross-modal transfer between speech and text for translation tasks. |
Paul-Ambroise Duquenne; Hongyu Gong; Benoît Sagot; Holger Schwenk; | emnlp | 2022-12-30 |
492 | Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In order to enable zero-shot ST, we propose a novel Discrete Cross-Modal Alignment (DCMA) method that employs a shared discrete vocabulary space to accommodate and match both modalities of speech and text. |
CHEN WANG et. al. | emnlp | 2022-12-30 |
493 | ConsistTL: Modeling Consistency in Transfer Learning for Low-Resource Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel transfer learning method for NMT, namely ConsistTL, which can continuously transfer knowledge from the parent model during the training of the child model. |
Zhaocong Li; Xuebo Liu; Derek F. Wong; Lidia S. Chao; Min Zhang; | emnlp | 2022-12-30 |
494 | Bilingual Synchronization: Restoring Translational Relationships with Editing Operations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We consider here a more general setting which assumes an initial target sequence, that must be transformed into a valid translation of the source, thereby restoring parallelism between source and target. |
Jitao Xu; Josep Crego; François Yvon; | emnlp | 2022-12-30 |
495 | Optimizing Deep Transformers for Chinese-Thai Low-Resource Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we study the use of deep Transformer translation model for the CCMT 2022 Chinese-Thai low-resource machine translation task. |
Wenjie Hao; Hongfei Xu; Lingling Mu; Hongying Zan; | arxiv-cs.CL | 2022-12-24 |
496 | T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper we present T-Projection, a novel approach for annotation projection that leverages large pretrained text-to-text language models and state-of-the-art machine translation technology. |
Iker García-Ferrero; Rodrigo Agerri; German Rigau; | arxiv-cs.CL | 2022-12-20 |
497 | IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Indian languages, having over a billion speakers, are linguistically different from English, and to date, there has not been a systematic study of evaluating MT systems from English into Indian languages. In this paper, we fill this gap by creating an MQM dataset consisting of 7000 fine-grained annotations, spanning 5 Indian languages and 7 MT systems, and use it to establish correlations between annotator scores and scores obtained using existing automatic metrics. |
ANANYA B. SAI et. al. | arxiv-cs.CL | 2022-12-20 |
498 | Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a new MMT approach based on a strong text-only MT model, which uses neural adapters, a novel guided self-attention mechanism and which is jointly trained on both visually-conditioned masking and MMT. |
Matthieu Futeral; Cordelia Schmid; Ivan Laptev; Benoît Sagot; Rachel Bawden; | arxiv-cs.CL | 2022-12-20 |
499 | Beyond Triplet: Leveraging The Most Data for Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: First, they can only utilize triple data (bilingual texts with images), which is scarce; second, current benchmarks are relatively restricted and do not correspond to realistic scenarios. Therefore, this paper correspondingly establishes new methods and new datasets for MMT. |
YAOMING ZHU et. al. | arxiv-cs.CL | 2022-12-20 |
500 | Mu2SLAM: Multitask, Multilingual Speech and Language Models Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present Mu$^{2}$SLAM, a multilingual sequence-to-sequence model pre-trained jointly on unlabeled speech, unlabeled text and supervised data spanning Automatic Speech … |
Yong Cheng; Yu Zhang; Melvin Johnson; Wolfgang Macherey; Ankur Bapna; | ArXiv | 2022-12-19 |
501 | Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Mu$^{2}$SLAM, a multilingual sequence-to-sequence model pre-trained jointly on unlabeled speech, unlabeled text and supervised data spanning Automatic Speech Recognition (ASR), Automatic Speech Translation (AST) and Machine Translation (MT), in over 100 languages. |
Yong Cheng; Yu Zhang; Melvin Johnson; Wolfgang Macherey; Ankur Bapna; | arxiv-cs.CL | 2022-12-19 |
502 | AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose AdaTranS for end-to-end ST. It adapts the speech features with a new shrinking mechanism to mitigate the length mismatch between speech and text features by predicting word boundaries. |
Xingshan Zeng; Liangyou Li; Qun Liu; | arxiv-cs.CL | 2022-12-17 |
503 | Controlling Styles in Neural Machine Translation with Activation Prompt Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To address both challenges, this paper presents a new benchmark and approach. |
Yifan Wang; Zewei Sun; Shanbo Cheng; Weiguo Zheng; Mingxuan Wang; | arxiv-cs.CL | 2022-12-17 |
504 | Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose PRED, a framework that leverages Pre-trained models for Datastores in kNN-MT. |
Jiahuan Li; Shanbo Cheng; Zewei Sun; Mingxuan Wang; Shujian Huang; | arxiv-cs.CL | 2022-12-17 |
505 | Attention As A Guide for Simultaneous Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Although its patterns have been exploited to perform different tasks, from neural network understanding to textual alignment, no previous work has analysed the encoder-decoder attention behavior in speech translation (ST) nor used it to improve ST on a specific task. In this paper, we fill this gap by proposing an attention-based policy (EDAtt) for simultaneous ST (SimulST) that is motivated by an analysis of the existing attention relations between audio input and textual output. |
Sara Papi; Matteo Negri; Marco Turchi; | arxiv-cs.CL | 2022-12-15 |
506 | Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show effective regularization strategies, namely dropout techniques for MoE layers in EOM and FOM, Conditional MoE Routing and Curriculum Learning methods that prevent over-fitting and improve the performance of MoE models on low-resource tasks without adversely affecting high-resource tasks. |
Maha Elbayad; Anna Sun; Shruti Bhosale; | arxiv-cs.CL | 2022-12-14 |
507 | ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we step towards bridging the gap between multilingual NLs and multilingual PLs for large language models (LLMs). |
YEKUN CHAI et. al. | arxiv-cs.CL | 2022-12-13 |
508 | Towards A General Purpose Machine Translation System for Sranantongo Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study we create a general purpose machine translation system for srn. |
Just Zwennicker; David Stap; | arxiv-cs.CL | 2022-12-13 |
509 | End-to-End Speech Translation of Arabic to English Broadcast News Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents our efforts towards the development of the first Broadcast News end-to-end Arabic to English speech translation system. |
Fethi Bougares; Salim Jouili; | arxiv-cs.CL | 2022-12-11 |
510 | M3ST: Mix at Three Levels for Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Mix at three levels for Speech Translation (M^3ST) method to increase the diversity of the augmented training corpus. |
XUXIN CHENG et. al. | arxiv-cs.CL | 2022-12-07 |
511 | Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Specifically, in one-to-many scenario, we propose a multilingual distillation method to make the new model (student) jointly learn multilingual output from old model (teacher) and new task. |
YANG ZHAO et. al. | arxiv-cs.CL | 2022-12-06 |
512 | Impact of Domain-Adapted Multilingual Neural Machine Translation in The Medical Domain Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We compare the out-of-domain MNMT with the in-domain adapted MNMT. |
Miguel Rios; Raluca-Maria Chereji; Alina Secara; Dragos Ciobanu; | arxiv-cs.CL | 2022-12-05 |
513 | In-context Examples Selection for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we aim to understand the properties of good in-context examples for MT in both in-domain and out-of-domain settings. |
Sweta Agrawal; Chunting Zhou; Mike Lewis; Luke Zettlemoyer; Marjan Ghazvininejad; | arxiv-cs.CL | 2022-12-05 |
514 | Democratizing Neural Machine Translation with OPUS-MT Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents the OPUS ecosystem with a focus on the development of open machine translation models and tools, and their integration into end-user applications, development platforms and professional workflows. |
JÖRG TIEDEMANN et. al. | arxiv-cs.CL | 2022-12-04 |
515 | The RoyalFlush System for The WMT 2022 Efficiency Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the submission of the RoyalFlush neural machine translation system for the WMT 2022 translation efficiency task. |
BO QIN et. al. | arxiv-cs.CL | 2022-12-03 |
516 | Tackling Low-Resourced Sign Language Translation: UPC at WMT-SLT 22 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes the system developed at the Universitat Politècnica de Catalunya for the Workshop on Machine Translation 2022 Sign Language Translation Task, in particular, for the sign-to-text direction. |
Laia Tarrés; Gerard I. Gàllego; Xavier Giró-i-Nieto; Jordi Torres; | arxiv-cs.CL | 2022-12-02 |
517 | CUNI Systems for The WMT22 Czech-Ukrainian Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Charles University submissions to the WMT22 General Translation Shared Task on Czech-Ukrainian and Ukrainian-Czech machine translation. |
Martin Popel; Jindřich Libovický; Jindřich Helcl; | arxiv-cs.CL | 2022-12-01 |
518 | CUNI Systems for The WMT 22 Czech-Ukrainian Translation Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present Charles University submissions to the WMT 22 General Translation Shared Task on Czech-Ukrainian and Ukrainian-Czech machine translation. We present two constrained … |
M. Popel; Jindřich Libovický; Jindřich Helcl; | Conference on Machine Translation | 2022-12-01 |
519 | Sevi: Speech-to-Visualization Through Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Arguably, the most natural way to specify what to visualize is through natural language or speech, similar to our daily search on Google or Apple Siri, leaving to the system the task of reasoning about what to visualize and how. In this demo, we present Sevi, an end-to-end data visualization system that acts as a virtual assistant to allow novices to create visualizations through either natural language or speech. |
Jiawei Tang; Yuyu Luo; Mourad Ouzzani; Guoliang Li; Hongyang Chen; | sigmod | 2022-11-30 |
520 | Word Alignment in The Era of Deep Learning: A Tutorial Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The word alignment task, despite its prominence in the era of statistical machine translation (SMT), is niche and under-explored today. In this two-part tutorial, we argue for the continued relevance for word alignment. |
Bryan Li; | arxiv-cs.CL | 2022-11-30 |
521 | VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a machine translation system tailored for the task of video dubbing, which directly considers the speech duration of each token in translation, to match the length of source and target speech. |
YIHAN WU et. al. | arxiv-cs.CL | 2022-11-30 |
522 | CUNI Submission in WMT22 General Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present the CUNI-Bergamot submission for the WMT22 General translation task. |
Josef Jon; Martin Popel; Ondřej Bojar; | arxiv-cs.CL | 2022-11-29 |
523 | Findings of The WMT 2022 Shared Task on Translation Suggestion Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We report the result of the first edition of the WMT shared task on Translation Suggestion (TS). |
Zhen Yang; Fandong Meng; Yingxue Zhang; Ernan Li; Jie Zhou; | arxiv-cs.CL | 2022-11-29 |
524 | CUNI-Bergamot Submission at WMT22 General Translation Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present the CUNI-Bergamot submission for the WMT22 General translation task. We compete in English-Czech direction. Our submission further explores block backtranslation … |
Josef Jon; M. Popel; Ondrej Bojar; | ArXiv | 2022-11-29 |
525 | Domain Mismatch Doesn’t Always Prevent Cross-Lingual Transfer Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we show that a simple initialization regimen can overcome much of the effect of domain mismatch in cross-lingual transfer. |
Daniel Edmiston; Phillip Keung; Noah A. Smith; | arxiv-cs.CL | 2022-11-29 |
526 | Extending The Subwording Model of Multilingual Pretrained Models for New Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we add new subwords to the SentencePiece tokenizer to apply a multilingual pretrained model to new languages (Inuktitut in this paper). |
Kenji Imamura; Eiichiro Sumita; | arxiv-cs.CL | 2022-11-29 |
527 | Considerations for Meaningful Sign Language Machine Translation Based on Glosses IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we review recent works on neural gloss translation. |
Mathias Müller; Zifan Jiang; Amit Moryossef; Annette Rios; Sarah Ebling; | arxiv-cs.CL | 2022-11-28 |
528 | Summer: WeChat Neural Machine Translation Systems for The WMT22 Biomedical Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces WeChat’s participation in WMT 2022 shared biomedical translation task on Chinese to English. |
Ernan Li; Fandong Meng; Jie Zhou; | arxiv-cs.CL | 2022-11-27 |
529 | BJTU-WeChat’s Systems for The WMT22 Chat Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces the joint submission of the Beijing Jiaotong University and WeChat AI to the WMT’22 chat translation task for English-German. |
Yunlong Liang; Fandong Meng; Jinan Xu; Yufeng Chen; Jie Zhou; | arxiv-cs.CL | 2022-11-27 |
530 | Competency-Aware Neural Machine Translation: Can Machine Translation Know Its Own Translation Quality? Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This is in sharp contrast to human translators who give feedback or conduct further investigations whenever they are in doubt about predictions. To fill this gap, we propose a novel competency-aware NMT by extending conventional NMT with a self-estimator, offering abilities to translate a source sentence and estimate its competency. |
PEI ZHANG et. al. | arxiv-cs.CL | 2022-11-24 |
531 | ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic-English Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present our work on collecting ArzEn-ST, a code-switched Egyptian Arabic-English Speech Translation Corpus. This corpus is an extension of the ArzEn speech corpus, which was … |
Injy Hamed; Nizar Habash; S. Abdennadher; Ngoc Thang Vu; | Workshop on Arabic Natural Language Processing | 2022-11-22 |
532 | Average Token Delay: A Latency Metric for Simultaneous Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel latency evaluation metric called Average Token Delay (ATD) that focuses on the end timings of partial translations in simultaneous translation. |
Yasumasa Kano; Katsuhito Sudoh; Satoshi Nakamura; | arxiv-cs.CL | 2022-11-22 |
533 | ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic – English Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we collect translations in both directions, monolingual Egyptian Arabic and monolingual English, forming a three-way speech translation corpus. |
Injy Hamed; Nizar Habash; Slim Abdennadher; Ngoc Thang Vu; | arxiv-cs.CL | 2022-11-21 |
534 | Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we propose a simple back-translation-style data augmentation method for Mandarin Chinese polyphone disambiguation, utilizing a large amount of unlabeled text data. |
CHUNYU QIANG et. al. | arxiv-cs.SD | 2022-11-17 |
535 | TSMind: Alibaba and Soochow University’s Submission to The WMT22 Translation Suggestion Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the joint submission of Alibaba and Soochow University, TSMind, to the WMT 2022 Shared Task on Translation Suggestion (TS). |
XIN GE et. al. | arxiv-cs.CL | 2022-11-16 |
536 | MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we leverage the evaluations of candidate systems submitted to the English-German SST task at IWSLT 2022 and conduct an extensive correlation analysis of CR and the aforementioned metrics. |
Dominik Macháček; Ondřej Bojar; Raj Dabre; | arxiv-cs.CL | 2022-11-15 |
537 | Findings of The Covid-19 MLIA Machine Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work presents the results of the machine translation (MT) task from the Covid-19 MLIA @ Eval initiative, a community effort to improve the generation of MT systems focused on the current Covid-19 crisis. |
FRANCISCO CASACUBERTA et. al. | arxiv-cs.CL | 2022-11-14 |
538 | Easy Guided Decoding in Providing Suggestions for Interactive Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we utilize the parameterized objective function of neural machine translation (NMT) and propose a novel constrained decoding algorithm, namely Prefix Suffix Guided Decoding (PSGD), to deal with the TS problem without additional training. |
Ke Wang; Xin Ge; Jiayi Wang; Yu Zhao; Yuqi Zhang; | arxiv-cs.CL | 2022-11-13 |
539 | Improving The Machine Translation Model in Specific Domains for The Ukrainian Language Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: One of the main tasks of natural language generation is to improve the quality of translation. For a morphologically rich language like Ukrainian, there are few ordered datasets … |
DANIIL MAKSYMENKO et. al. | 2022 IEEE 17th International Conference on Computer … | 2022-11-10 |
540 | Grammatical Error Correction: A Survey of The State of The Art IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this survey paper, we condense the field into a single article and first outline some of the linguistic challenges of the task, introduce the most popular datasets that are available to researchers (for both English and other languages), and summarise the various methods and techniques that have been developed with a particular focus on artificial error generation. |
CHRISTOPHER BRYANT et. al. | arxiv-cs.CL | 2022-11-09 |
541 | ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose ERNIE-UniX2, a unified cross-lingual cross-modal pre-training framework for both generation and understanding tasks. |
BIN SHAN et. al. | arxiv-cs.CV | 2022-11-09 |
542 | Review of Coreference Resolution in English and Persian Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Consequently, it has a significant effect on the quality of these systems. This article reviews the existing corpora and evaluation metrics in this field. |
Hassan Haji Mohammadi; Alireza Talebpour; Ahmad Mahmoudi Aznaveh; Samaneh Yazdani; | arxiv-cs.CL | 2022-11-08 |
543 | Building User-oriented Personalized Machine Translator Based on User-Generated Textual Content Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine Translation (MT) has been a very useful tool to assist multilingual communication and collaboration. In recent years, by taking advantage of the exciting developments of … |
P. ZHANG et. al. | Proceedings of the ACM on Human-Computer Interaction | 2022-11-07 |
544 | Refining Low-Resource Unsupervised Translation By Language Disentanglement of Multilingual Translation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a simple refinement procedure to separate languages from a pre-trained multilingual UMT model for it to focus on only the target low-resource task. |
Xuan-Phi Nguyen; Shafiq Joty; Kui Wu; Ai Ti Aw; | nips | 2022-11-06 |
545 | InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose InsNet, an expressive insertion-based text generator with efficient training and flexible decoding (parallel or sequential). |
Sidi Lu; Tao Meng; Nanyun Peng; | nips | 2022-11-06 |
546 | MT-GenEval: A Counterfactual and Contextual Dataset for Evaluating Gender Accuracy in Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce MT-GenEval, a benchmark for evaluating gender accuracy in translation from English into eight widely-spoken languages. |
ANNA CURREY et. al. | arxiv-cs.CL | 2022-11-02 |
547 | Domain Curricula for Code-Switched MT at MixMT 2022 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present our approach and results for the Code-mixed Machine Translation (MixMT) shared task at WMT 2022: the task consists of two subtasks, monolingual to code-mixed machine translation (Subtask-1) and code-mixed to monolingual machine translation (Subtask-2). |
Lekan Raheem; Maab Elrashid; | arxiv-cs.CL | 2022-10-31 |
548 | Domain Adaptation of Machine Translation with Crowdworkers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a framework that efficiently and effectively collects parallel sentences in a target domain from the web with the help of crowdworkers. |
Makoto Morishita; Jun Suzuki; Masaaki Nagata; | arxiv-cs.CL | 2022-10-27 |
549 | COMET-QE and Active Learning for Low-Resource Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We use COMET-QE, a reference-free evaluation metric, to select sentences for low-resource neural machine translation. |
Everlyn Asiko Chimoto; Bruce A. Bassett; | arxiv-cs.CL | 2022-10-27 |
550 | The Effect of Normalization for Bi-directional Amharic-English Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents the first relatively large-scale Amharic-English parallel sentence dataset. |
TADESSE DESTAW BELAY et. al. | arxiv-cs.CL | 2022-10-27 |
551 | Improving Speech-to-Speech Translation Through Unlabeled Text Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose an effective way to utilize the massive existing unlabeled text from different languages to create a large amount of S2ST data to improve S2ST performance by applying various acoustic effects to the generated synthetic data. |
XUAN-PHI NGUYEN et. al. | arxiv-cs.CL | 2022-10-26 |
552 | A Bilingual Parallel Corpus with Discourse Annotations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes BWB, a large parallel corpus first introduced in Jiang et al. (2022), along with an annotated test set. |
YUCHEN ELEANOR JIANG et. al. | arxiv-cs.CL | 2022-10-26 |
553 | Smart Speech Segmentation Using Acousto-Linguistic Features with Look-ahead Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a hybrid approach that leverages both acoustic and language information to improve segmentation. |
PIYUSH BEHRE et. al. | arxiv-cs.CL | 2022-10-25 |
554 | Bilingual Synchronization: Restoring Translational Relationships with Editing Operations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We consider here a more general setting which assumes an initial target sequence, that must be transformed into a valid translation of the source, thereby restoring parallelism between source and target. |
Jitao Xu; Josep Crego; François Yvon; | arxiv-cs.CL | 2022-10-24 |
555 | Analyzing The Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we examine the use of influence functions for Neural Machine Translation (NMT). |
Tsz Kin Lam; Eva Hasler; Felix Hieber; | arxiv-cs.CL | 2022-10-24 |
556 | Focused Concatenation for Context-Aware Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose an improved concatenation approach that encourages the model to focus on the translation of the current sentence, discounting the loss generated by target context. |
Lorenzo Lupo; Marco Dinarelli; Laurent Besacier; | arxiv-cs.CL | 2022-10-24 |
557 | Translation Word-Level Auto-Completion: What Can We Achieve Out of The Box? Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work describes our submissions to WMT’s shared task on word-level auto-completion, for the Chinese-to-English, English-to-Chinese, German-to-English, and English-to-German language directions. |
Yasmin Moslem; Rejwanul Haque; Andy Way; | arxiv-cs.CL | 2022-10-23 |
558 | University of Cape Town’s WMT22 System: Multilingual Machine Translation for Southern African Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The paper describes the University of Cape Town’s submission to the constrained track of the WMT22 Shared Task: Large-Scale Machine Translation Evaluation for African Languages. |
Khalid N. Elmadani; Francois Meyer; Jan Buys; | arxiv-cs.CL | 2022-10-21 |
559 | Turning Fixed to Adaptive: Integrating Post-Evaluation Into Simultaneous Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a method of performing the adaptive policy via integrating post-evaluation into the fixed policy. |
Shoutao Guo; Shaolei Zhang; Yang Feng; | arxiv-cs.CL | 2022-10-21 |
560 | A Semi-supervised Approach for A Better Translation of Sentiment in Dialectical Arabic UGT Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this research, we aim to improve the translation of sentiment in UGT written in the dialectical versions of the Arabic language to English. |
Hadeel Saadany; Constantin Orasan; Emad Mohamed; Ashraf Tantawy; | arxiv-cs.CL | 2022-10-21 |
561 | Gui at MixMT 2022 : English-Hinglish: An MT Approach for Translation of Code Mixed Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we discuss the use of mBART with some special pre-processing and post-processing (transliteration from Devanagari to Roman) for the first task in detail and the experiments that we performed for the second task of translating code-mixed Hinglish to monolingual English. |
AKSHAT GAHOI et. al. | arxiv-cs.CL | 2022-10-21 |
562 | Is Encoder-Decoder Redundant for Neural Machine Translation? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we investigate the aforementioned concept for machine translation. |
Yingbo Gao; Christian Herold; Zijian Yang; Hermann Ney; | arxiv-cs.CL | 2022-10-21 |
563 | Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a Wait-info Policy to balance source and target at the information level. |
Shaolei Zhang; Shoutao Guo; Yang Feng; | arxiv-cs.CL | 2022-10-20 |
564 | Can Domains Be Transferred Across Languages in Multi-Domain Multilingual Neural Machine Translation? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Previous works mostly focus on either multilingual or multi-domain aspects of neural machine translation (NMT). |
Thuy-Trang Vu; Shahram Khadivi; Xuanli He; Dinh Phung; Gholamreza Haffari; | arxiv-cs.CL | 2022-10-20 |
565 | The University of Edinburgh’s Submission to The WMT22 Code-Mixing Shared Task (MixMT) Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For subtask 2, we investigated different pretraining techniques, namely comparing simple initialisation from existing machine translation models and aligned augmentation. |
Faheem Kirefu; Vivek Iyer; Pinzhen Chen; Laurie Burchell; | arxiv-cs.CL | 2022-10-20 |
566 | SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the Stevens Institute of Technology’s submission for the WMT 2022 Shared Task: Code-mixed Machine Translation (MixMT). |
Abdul Rafae Khan; Hrishikesh Kanade; Girish Amar Budhrani; Preet Jhanglani; Jia Xu; | arxiv-cs.CL | 2022-10-20 |
567 | The VolcTrans System for WMT22 Multilingual Machine Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report describes our VolcTrans system for the WMT22 shared task on large-scale multilingual machine translation. |
XIAN QIAN et. al. | arxiv-cs.CL | 2022-10-20 |
568 | Hybrid-Regressive Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we empirically confirm that non-autoregressive translation with an iterative refinement mechanism (IR-NAT) suffers from poor acceleration robustness because it is more sensitive to decoding batch size and computing device setting than autoregressive translation (AT). |
Qiang Wang; Xinhui Hu; Ming Chen; | arxiv-cs.CL | 2022-10-19 |
569 | LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Recent advances still struggle to train a separate model for each language pair, which is costly and unaffordable when the number of languages increases in the real world. |
HONGCHENG GUO et. al. | arxiv-cs.CL | 2022-10-19 |
570 | Separating Grains from The Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work describes our approach, which is based on filtering the given noisy data using a sentence-pair classifier that was built by fine-tuning a pre-trained language model. |
IDRIS ABDULMUMIN et. al. | arxiv-cs.CL | 2022-10-19 |
571 | Domain Specific Sub-network for Multi-Domain Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents Domain-Specific Sub-network (DoSS). |
Amr Hendy; Mohamed Abdelghaffar; Mohamed Afify; Ahmed Y. Tawfik; | arxiv-cs.CL | 2022-10-18 |
572 | Tencent’s Multilingual Machine Translation System for WMT22 Large-Scale African Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes Tencent’s multilingual machine translation systems for the WMT22 shared task on Large-Scale Machine Translation Evaluation for African Languages. |
WENXIANG JIAO et. al. | arxiv-cs.CL | 2022-10-18 |
573 | Tencent’s Multilingual Machine Translation System for WMT22 Large-Scale African Languages IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes Tencent’s multilingual machine translation systems for the WMT22 shared task on Large-Scale Machine Translation Evaluation for African Languages. We … |
WENXIANG JIAO et. al. | Conference on Machine Translation | 2022-10-18 |
574 | Alibaba-Translate China’s Submission for WMT 2022 Quality Estimation Shared Task Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present our submission to the sentence-level MQM benchmark at Quality Estimation Shared Task, named UniTE (Unified Translation Evaluation). |
KEQIN BAO et. al. | arxiv-cs.CL | 2022-10-18 |
575 | Alibaba-Translate China’s Submission for WMT 2022 Quality Estimation Shared Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we present our submission to the sentence-level MQM benchmark at Quality Estimation Shared Task, named UniTE (Unified Translation Evaluation). Specifically, our … |
KEQIN BAO et. al. | ArXiv | 2022-10-18 |
576 | Tencent AI Lab – Shanghai Jiao Tong University Low-Resource Translation System for The WMT22 Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes Tencent AI Lab – Shanghai Jiao Tong University (TAL-SJTU) Low-Resource Translation systems for the WMT22 shared task. |
Zhiwei He; Xing Wang; Zhaopeng Tu; Shuming Shi; Rui Wang; | arxiv-cs.CL | 2022-10-17 |
577 | DICTDIS: Dictionary Constrained Disambiguation for Improved NMT Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we present DICTDIS, a lexically constrained NMT system that disambiguates between multiple candidate translations derived from dictionaries. |
Ayush Maheshwari; Piyush Sharma; Preethi Jyothi; Ganesh Ramakrishnan; | arxiv-cs.CL | 2022-10-13 |
578 | Improved Data Augmentation for Translation Suggestion Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces the system used in our submission to the WMT’22 Translation Suggestion shared task. |
HONGXIAO ZHANG et. al. | arxiv-cs.CL | 2022-10-12 |
579 | Integrating Translation Memories Into Non-Autoregressive Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: By modifying the data presentation and introducing an extra deletion operation, we obtain performance that is on par with an autoregressive approach, while reducing the decoding load. |
Jitao Xu; Josep Crego; François Yvon; | arxiv-cs.CL | 2022-10-12 |
580 | Investigating Massive Multilingual Pre-Trained Machine Translation Models for Clinical Domain Via Transfer Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Massively multilingual pre-trained language models (MMPLMs) are developed in recent years demonstrating superpowers and the pre-knowledge they acquire for downstream tasks. |
Lifeng Han; Gleb Erofeev; Irina Sorokina; Serge Gladkoff; Goran Nenadic; | arxiv-cs.CL | 2022-10-12 |
581 | Streaming Punctuation for Long-form Dictation with Transformers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Automatic Speech Recognition (ASR) production systems, however, are constrained by real-time requirements, making it hard to incorporate the right context when making punctuation decisions. In this paper, we propose a streaming approach for punctuation or re-punctuation of ASR output using dynamic decoding windows and measure its impact on punctuation and segmentation accuracy across scenarios. |
Piyush Behre; Sharman Tan; Padma Varadharajan; Shuangyu Chang; | arxiv-cs.CL | 2022-10-11 |
582 | CTC Alignments Improve Autoregressive Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we argue that CTC does in fact make sense for translation if applied in a joint CTC/attention framework wherein CTC’s core properties can counteract several key weaknesses of pure-attention models during training and decoding. |
BRIAN YAN et. al. | arxiv-cs.CL | 2022-10-11 |
583 | Machine Translation Between Spoken Languages and Signed Languages Represented in SignWriting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents work on novel machine translation (MT) systems between spoken and signed languages, where signed languages are represented in SignWriting, a sign language writing system. |
Zifan Jiang; Amit Moryossef; Mathias Müller; Sarah Ebling; | arxiv-cs.CL | 2022-10-11 |
584 | Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we study the effectiveness of different segmentation approaches on MT performance, covering morphology-based and frequency-based segmentation techniques. |
MARWA GASER et. al. | arxiv-cs.CL | 2022-10-11 |
585 | Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Thus, in this work, we introduce IKD-MMT, a novel MMT framework to support the image-free inference phase via an inversion knowledge distillation scheme. |
Ru Peng; Yawen Zeng; Junbo Zhao; | arxiv-cs.CL | 2022-10-10 |
586 | Improving End-to-End Text Image Translation From The Auxiliary Text Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel text translation enhanced text image translation, which trains the end-to-end model with text translation as an auxiliary task. |
CONG MA et. al. | arxiv-cs.CL | 2022-10-07 |
587 | Toxicity in Multilingual Machine Translation at Scale Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we focus on one type of critical error: added toxicity. |
MARTA R. COSTA-JUSSÀ et. al. | arxiv-cs.CL | 2022-10-06 |
588 | The Boundaries of Meaning: A Case Study in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: But do they have any linguistic or philosophical plausibility? I attempt to cast light on this question by reviewing the relevant details of the subword segmentation algorithms and by relating them to important philosophical and linguistic debates, in the spirit of making artificial intelligence more transparent and explainable. |
Yuri Balashov; | arxiv-cs.CL | 2022-10-02 |
589 | Developing A Rule-Based Machine-Translation System, Ewondo-French-Ewondo Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation (MT) significantly contributes to democratizing access to textual information across multiple languages and is established as a dynamic language service in the … |
Emmanuel Ngué Um; Émilie Eliette; Caroline Ngo Tjomb Assembe; Francis M. Tyers; | Int. J. Humanit. Arts Comput. | 2022-10-01 |
590 | MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We empirically demonstrate the effectiveness of self-supervised pre-training and data augmentation for zero-shot multi-lingual machine translation. |
Kshitij Gupta; | arxiv-cs.CL | 2022-10-01 |
591 | Exploring Explicitation and Implicitation in Parallel Interpreting and Translation Corpora Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present a study of discourse connectives in English-German and German-English translation and interpreting where we focus on the phenomena of explicitation and implicitation. … |
Ekaterina Lapshinova-Koltunski; Christina Pollkläsener; Heike Przybyl; | Prague Bull. Math. Linguistics | 2022-10-01 |
592 | Multimodality Information Fusion for Automated Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Lin Li; Turghun Tayir; Yifeng Han; Xiaohui Tao; Juan D. Velasquez; | Inf. Fusion | 2022-10-01 |
593 | FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present FRMT, a new dataset and evaluation benchmark for Few-shot Region-aware Machine Translation, a type of style-targeted translation. |
PARKER RILEY et. al. | arxiv-cs.CL | 2022-10-01 |
594 | Language-Family Adapters for Low-Resource Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose training language-family adapters on top of mBART-50 to facilitate cross-lingual transfer. |
Alexandra Chronopoulou; Dario Stojanovski; Alexander Fraser; | arxiv-cs.CL | 2022-09-30 |
595 | QUAK: A Synthetic Quality Estimation Dataset for Korean-English Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite its high utility in the real world, there remain several limitations concerning manual QE data creation: inevitably incurred non-trivial costs due to the need for translation experts, and issues with data scaling and language expansion. To tackle these limitations, we present QUAK, a Korean-English synthetic QE dataset generated in a fully automatic manner. |
SUGYEONG EO et. al. | arxiv-cs.CL | 2022-09-30 |
596 | Blur The Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English Via Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we proposed a novel approach to building a practical NMT model for Buddhist scriptures. |
DENGHAO LI et. al. | arxiv-cs.CL | 2022-09-29 |
597 | Revamping Multilingual Agreement Bidirectionally Via Switched Back-translation for Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Bidirectional Multilingual Agreement via Switched Back-translation (BMA-SBT), a novel and universal multilingual agreement framework for fine-tuning pre-trained MNMT models, which (i) exempts the need for aforementioned parallel data by using a novel method called switched BT that creates synthetic text written in another source language using the translation target and (ii) optimizes the agreement bidirectionally with the Kullback-Leibler Divergence loss. |
HONGYUAN LU et. al. | arxiv-cs.CL | 2022-09-28 |
598 | Effective General-Domain Data Inclusion for The Machine Translation Task By Vanilla Transformers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Not only is it revolutionary for various translation tasks, but also for a majority of other NLP tasks. In this paper, we aim at a Transformer-based system that is able to translate a source sentence in German to its counterpart target sentence in English. |
Hassan Soliman; | arxiv-cs.CL | 2022-09-28 |
599 | An Automatic Evaluation of The WMT22 General Machine Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report presents an automatic evaluation of the general machine translation task of the Seventh Conference on Machine Translation (WMT22). |
Benjamin Marie; | arxiv-cs.CL | 2022-09-28 |
600 | Improving Multilingual Neural Machine Translation System for Indic Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a MNMT system to address the issues related to low-resource language translation. |
Sudhansu Bala Das; Atharv Biradar; Tapas Kumar Mishra; Bidyut Kumar Patra; | arxiv-cs.CL | 2022-09-27 |
601 | Zero-shot Domain Adaptation for Neural Machine Translation with Retrieved Phrase-level Prompts Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a non-tuning paradigm, resolving domain adaptation with a prompt-based method. |
ZEWEI SUN et. al. | arxiv-cs.CL | 2022-09-23 |
602 | Approaching English-Polish Machine Translation Quality Assessment with Neural-based Methods Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents our contribution to the PolEval 2021 Task 2: Evaluation of translation quality assessment metrics. |
Artur Nowakowski; | arxiv-cs.CL | 2022-09-22 |
603 | PePe: Personalized Post-editing Model Utilizing User-generated Post-edits Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite the recent advancement of machine translation, it remains a demanding task to properly reflect personal style. In this paper, we introduce a personalized automatic post-editing framework to address this challenge, which effectively generates sentences considering distinct personal behaviors. |
Jihyeon Lee; Taehee Kim; Yunwon Tae; Cheonbok Park; Jaegul Choo; | arxiv-cs.CL | 2022-09-21 |
604 | Vega-MT: The JD Explore Academy Machine Translation System for WMT22 IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We describe the JD Explore Academy’s submission of the WMT 2022 shared general translation task. We participated in all high-resource tracks and one medium-resource track, … |
CHANGTONG ZAN et. al. | ArXiv | 2022-09-20 |
605 | Vega-MT: The JD Explore Academy Translation System for WMT22 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We describe the JD Explore Academy’s submission of the WMT 2022 shared general translation task. |
CHANGTONG ZAN et. al. | arxiv-cs.CL | 2022-09-19 |
606 | The First Neural Machine Translation System for The Erzya Language Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present the first neural machine translation system for translation between the endangered Erzya language and Russian and the dataset collected by us to train and evaluate it. |
David Dale; | arxiv-cs.CL | 2022-09-19 |
607 | A Snapshot Into The Possibility of Video Game Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present in this article what we believe to be one of the first attempts at video game machine translation. |
Damien Hansen; Pierre-Yves Houlmont; | arxiv-cs.CL | 2022-09-19 |
608 | Normalization of Code-switched Text for Speech Synthesis Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In multilingual communities, code-switching is a common phenomenon. Due to the increase in usage of social media, high level of code-switching is present in social media text as … |
Sreeram Manghat; Sreeja Manghat; Tanja Schultz; | Interspeech | 2022-09-18 |
609 | Cross-Modal Decision Regularization for Simultaneous Speech Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Simultaneous translation systems start producing the output while processing the partial source sentence in the incoming input stream. These systems need to decide when to read … |
Mohd Abbas Zaidi; Beomseok Lee; Sangha Kim; Chanwoo Kim; | Interspeech | 2022-09-18 |
610 | Learning Decoupled Retrieval Representation for Nearest Neighbour Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Generally, kNN-MT borrows the off-the-shelf context representation in the translation task, e.g., the output of the last decoder layer, as the query vector of the retrieval task. In this work, we highlight that coupling the representations of these two tasks is sub-optimal for fine-grained retrieval. |
Qiang Wang; Rongxiang Weng; Ming Chen; | arxiv-cs.CL | 2022-09-18 |
611 | Rethinking Round-Trip Translation for Machine Translation Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we report the surprising finding that round-trip translation can be used for automatic evaluation without the references. |
Terry Yue Zhuo; Qiongkai Xu; Xuanli He; Trevor Cohn; | arxiv-cs.CL | 2022-09-15 |
612 | Data-adaptive Transfer Learning for Translation: A Case Study in Haitian and Jamaican Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Multilingual transfer techniques often improve low-resource machine translation (MT). |
Nathaniel R. Robinson; Cameron J. Hogan; Nancy Fulda; David R. Mortensen; | arxiv-cs.CL | 2022-09-13 |
613 | Rethink About The Word-level Quality Estimation for Machine Translation from Human Judgement Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Typically, conventional works on word-level QE are designed to predict the translation quality in terms of the post-editing effort, where the word labels (OK and BAD) are automatically generated by comparing words between MT sentences and the post-edited sentences through a Translation Error Rate (TER) toolkit. |
Zhen Yang; Fandong Meng; Yuanmeng Yan; Jie Zhou; | arxiv-cs.CL | 2022-09-12 |
614 | Adapting to Non-Centered Languages for Zero-shot Multilingual Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a simple, lightweight yet effective language-specific modeling method by adapting to non-centered languages and combining the shared information and the language-specific information to counteract the instability of zero-shot translation. |
Zhi Qu; Taro Watanabe; | arxiv-cs.CL | 2022-09-09 |
615 | On The Complementarity Between Pre-Training and Random-Initialization for Resource-Rich Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We take the first step to investigate the complementarity between PT and RI in resource-rich scenarios via two probing analyses, and find that: 1) PT improves NOT the accuracy, but the generalization by achieving flatter loss landscapes than that of RI; 2) PT improves NOT the confidence of lexical choice, but the negative diversity by assigning smoother lexical probability distributions than that of RI. Based on these insights, we propose to combine their complementarities with a model fusion algorithm that utilizes optimal transport to align neurons between PT and RI. |
CHANGTONG ZAN et. al. | arxiv-cs.CL | 2022-09-07 |
616 | Facilitating Global Team Meetings Between Language-Based Subgroups: When and How Can Machine Translation Help? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In the current study, we investigate the idea of leveraging machine translation (MT) to facilitate global team meetings. |
Yongle Zhang; Dennis Asamoah Owusu; Marine Carpuat; Ge Gao; | arxiv-cs.CL | 2022-09-06 |
617 | Rare But Severe Neural Machine Translation Errors Induced By Minimal Deletion: An Empirical Study on Chinese and English Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We examine the inducement of rare but severe errors in English-Chinese and Chinese-English in-domain neural machine translation by minimal deletion of the source text with character-based models. |
Ruikang Shi; Alvin Grissom II; Duc Minh Trinh; | arxiv-cs.CL | 2022-09-05 |
618 | Informative Language Representation Learning for Massively Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, recent studies show that prepending language tokens sometimes fails to navigate the multilingual neural machine translation models into right translation directions, especially on zero-shot translation. To mitigate this issue, we propose two methods, language embedding embodiment and language-aware multi-head attention, to learn informative language representations to channel translation into right directions. |
Renren Jin; Deyi Xiong; | arxiv-cs.CL | 2022-09-04 |
619 | Semantic Connections in The Complex Sentences for Post-Editing Machine Translation in The Kazakh Language Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The problems of machine translation are constantly arising. While the most advanced translation platforms, such as Google and Yandex, allow for high-quality translations of … |
A. Turganbayeva; D. Rakhimova; V. Karyukin; A. Karibayeva; Asem Turarbek; | Inf. | 2022-08-30 |
620 | Nearest Neighbor Non-autoregressive Text Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Previous studies addressed this issue through iterative decoding. This study proposes using nearest neighbors as the initial state of an NAR decoder and editing them iteratively. |
Ayana Niwa; Sho Takase; Naoaki Okazaki; | arxiv-cs.CL | 2022-08-26 |
621 | Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Kencorpus is a Kenyan Language corpus that intends to bridge the gap on how to collect, and store text and speech data that is good enough to enable data-driven solutions in applications such as machine translation, question answering and transcription in multilingual communities. |
BARACK WANJAWA et. al. | arxiv-cs.CL | 2022-08-25 |
622 | MuMUR : Multilingual Multimodal Universal Retrieval Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a framework MuMUR, that utilizes knowledge transfer from a multilingual model to boost the performance of multi-modal (image and video) retrieval. |
AVINASH MADASU et. al. | arxiv-cs.CV | 2022-08-24 |
623 | Improving Video Retrieval Using Multilingual Knowledge Transfer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Video retrieval has seen tremendous progress with the development of vision-language models. However, further improving these models require additional labelled data which is a … |
AVINASH MADASU et. al. | European Conference on Information Retrieval | 2022-08-24 |
624 | Domain-Specific Text Generation for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a novel approach to domain adaptation leveraging state-of-the-art pretrained language models (LMs) for domain-specific data augmentation for MT, simulating the domain characteristics of either (a) a small bilingual dataset, or (b) the monolingual source text to be translated. |
Yasmin Moslem; Rejwanul Haque; John D. Kelleher; Andy Way; | arxiv-cs.CL | 2022-08-11 |
625 | Looking for A Needle in A Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we set foundations for the study of NMT hallucinations. |
Nuno M. Guerreiro; Elena Voita; André F. T. Martins; | arxiv-cs.CL | 2022-08-10 |
626 | An Efficient Method for Generating Synthetic Data for Low-Resource Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Data sparsity is one of the challenges for low-resource language pairs in Neural Machine Translation (NMT). Previous works have presented different approaches for data … |
Thi-Vinh Ngo; Phuong-Thai Nguyen; V. Nguyen; Thanh-Le Ha; Le-Minh Nguyen; | Applied Artificial Intelligence | 2022-08-02 |
627 | Mismatching-Aware Unsupervised Translation Quality Estimation For Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We evaluate the proposed method on four low-resource language pairs of WMT21 QE shared task, as well as a new English-Farsi test dataset introduced in this paper. |
Fatemeh Azadi; Heshaam Faili; Mohammad Javad Dousti; | arxiv-cs.CL | 2022-07-31 |
628 | GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, vanilla Transformer mainly exploits the top-layer representation, assuming the lower layers provide trivial or redundant information and thus ignoring the bottom-layer feature that is potentially valuable. In this work, we propose the Group-Transformer model (GTrans) that flexibly divides multi-layer representations of both encoder and decoder into different groups and then fuses these group features to generate target words. |
JIAN YANG et. al. | arxiv-cs.CL | 2022-07-29 |
629 | Benchmarking Azerbaijani Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we benchmark the performance of Azerbaijani-English NMT systems on a range of techniques and datasets. |
Chih-Chen Chen; William Chen; | arxiv-cs.CL | 2022-07-29 |
630 | Thutmose Tagger: Single-pass Neural Model for Inverse Text Normalization Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Unfortunately, such neural models are prone to hallucinations that could lead to unacceptable errors. To mitigate this issue, we propose a single-pass token classifier model that regards ITN as a tagging task. |
Alexandra Antonova; Evelina Bakhturina; Boris Ginsburg; | arxiv-cs.CL | 2022-07-29 |
631 | Multimodal Neural Machine Translation with Search Engine Based Image Retrieval Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose an open-vocabulary image retrieval method to collect descriptive images for a bilingual parallel corpus using an image search engine. |
ZhenHao Tang; XiaoBing Zhang; Zi Long; XiangHua Fu; | arxiv-cs.CV | 2022-07-26 |
632 | Unifying Cross-lingual Summarization and Machine Translation with Compression Rate Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel task, Cross-lingual Summarization with Compression rate (CSC), to benefit Cross-Lingual Summarization by large-scale Machine Translation corpus. |
YU BAI et. al. | sigir | 2022-07-12 |
633 | No Language Left Behind: Scaling Human-Centered Machine Translation IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose multiple architectural and training improvements to counteract overfitting while training on thousands of tasks. |
NLLB TEAM et. al. | arxiv-cs.CL | 2022-07-11 |
634 | UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel method, named as Unified Multilingual Multiple teacher-student Model for NMT (UM4). |
JIAN YANG et. al. | arxiv-cs.CL | 2022-07-11 |
635 | Original or Translated? A Causal Analysis of The Impact of Translationese on Machine Translation Performance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we collect CausalMT, a dataset where the MT training data are also labeled with the human translation directions. |
Jingwei Ni; Zhijing Jin; Markus Freitag; Mrinmaya Sachan; Bernhard Schölkopf; | naacl | 2022-07-09 |
636 | Tricks for Training Sparse Translation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We find that sparse architectures for multilingual machine translation can perform poorly out of the box and propose two straightforward techniques to mitigate this – a temperature heating mechanism and dense pre-training. |
DHEERU DUA et. al. | naacl | 2022-07-09 |
637 | Non-Autoregressive Machine Translation: It’s Not As Fast As It Seems Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we point out flaws in the evaluation methodology present in the literature on NAR models and we provide a fair comparison between a state-of-the-art NAR model and the autoregressive submissions to the shared task. |
Jindrich Helcl; Barry Haddow; Alexandra Birch; | naacl | 2022-07-09 |
638 | Language Model Augmented Monotonic Attention for Simultaneous Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a framework to aid monotonic attention with an external language model to improve its decisions. |
Sathish Reddy Indurthi; Mohd Abbas Zaidi; Beomseok Lee; Nikhil Kumar Lakumarapu; Sangha Kim; | naacl | 2022-07-09 |
639 | Does Summary Evaluation Survive Translation to Other Languages? Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To investigate how much we can trust machine translation of summarization datasets, we translate the English SummEval dataset to seven languages and compare performances across automatic evaluation measures. |
Spencer Braun; Oleg Vasilyev; Neslihan Iskender; John Bohannon; | naacl | 2022-07-09 |
640 | Semantically Informed Slang Interpretation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a semantically informed slang interpretation (SSI) framework that considers jointly the contextual and semantic appropriateness of a candidate interpretation for a query slang. |
Zhewei Sun; Richard Zemel; Yang Xu; | naacl | 2022-07-09 |
641 | Quantifying Synthesis and Fusion and Their Impact on Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, literature in Natural Language Processing (NLP) typically labels a whole language with a strict type of morphology, e.g. fusional or agglutinative. In this work, we propose to reduce the rigidity of such claims, by quantifying morphological typology at the word and segment level. |
ARTURO ONCEVAY et. al. | naacl | 2022-07-09 |
642 | Quality-Aware Decoding for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT, by leveraging recent breakthroughs in reference-free and reference-based MT evaluation through various inference methods like N-best reranking and minimum Bayes risk decoding. |
PATRICK FERNANDES et. al. | naacl | 2022-07-09 |
643 | Building Multilingual Machine Translation Systems That Serve Arbitrary XY Translations Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The model suffers from poor performance in one-to-many and many-to-many with zero-shot setup. To address this issue, this paper discusses how to practically build MNMT systems that serve arbitrary X-Y translation directions while leveraging multilinguality with a two-stage training strategy of pretraining and finetuning. |
Akiko Eriguchi; Shufang Xie; Tao Qin; Hany Hassan; | naacl | 2022-07-09 |
644 | AdMix: A Mixed Sample Data Augmentation Method for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel data augmentation approach for NMT, which is independent of any additional training data. |
Chang Jin; Shigui Qiu; Nini Xiao; Hao Jia; | ijcai | 2022-07-01 |
645 | Low Resource Machine Translation of English-Manipuri: A Semi-supervised Approach Related Papers Related Patents Related Grants Related Venues Related Experts View |
Salam Michael Singh; Thoudam Doren Singh; | Expert Syst. Appl. | 2022-07-01 |
646 | Reduce Indonesian Vocabularies with An Indonesian Sub-word Separator Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a strategy to address the unique word problem of the neural machine translation (NMT) system, which uses Indonesian as a pair language. |
Mukhlis Amien; Feng Chong; Huang Heyan; | arxiv-cs.CL | 2022-07-01 |
647 | Towards Discourse-Aware Document-Level Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we aim at incorporating the coherence information hidden within the RST-style discourse structure into machine translation. |
Xin Tan; Longyin Zhang; Fang Kong; Guodong Zhou; | ijcai | 2022-07-01 |
648 | Explicit Alignment Learning for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Thus, we propose two approaches an explicit alignment learning approach, in which we further remove the need for the additional alignment model, and perform embedding mixup with the alignment based on encoder–decoder attention weights in the NMT model. |
Zuchao Li; Hai Zhao; Fengshun Xiao; Masao Utiyama; Eiichiro Sumita; | ijcai | 2022-07-01 |
649 | Code Translation with Compiler Representations IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we leverage low-level compiler intermediate representations (IR) to improve code translation. |
MARC SZAFRANIEC et. al. | arxiv-cs.PL | 2022-06-30 |
650 | GERNERMED++: Transfer Learning in German Medical NLP Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a statistical model for German medical natural language processing trained for named entity recognition (NER) as an open, publicly available model. |
Johann Frei; Ludwig Frei-Stuber; Frank Kramer; | arxiv-cs.CL | 2022-06-29 |
651 | Building Multilingual Machine Translation Systems That Serve Arbitrary X-Y Translations Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The model suffers from poor performance in one-to-many and many-to-many with zero-shot setup. To address this issue, this paper discusses how to practically build MNMT systems that serve arbitrary X-Y translation directions while leveraging multilinguality with a two-stage training strategy of pretraining and finetuning. |
Akiko Eriguchi; Shufang Xie; Tao Qin; Hany Hassan Awadalla; | arxiv-cs.CL | 2022-06-29 |
652 | On The Impact of Noises in Crowd-Sourced Data for Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: What are the impacts of these data quality issues for model development and evaluation? In this paper, we propose an automatic method to fix or filter the above quality issues, using English-German (En-De) translation as an example. |
Siqi Ouyang; Rong Ye; Lei Li; | arxiv-cs.CL | 2022-06-28 |
653 | Human Evaluation of English-Irish Transformer-Based NMT Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this study, a human evaluation is carried out on how hyperparameter settings impact the quality of Transformer-based Neural Machine Translation (NMT) for the low-resourced … |
Séamus Lankford; Haithem Afli; Andy Way; | Inf. | 2022-06-25 |
654 | Comparing Formulaic Language in Human and Machine Translation: Insight from A Parliamentary Corpus Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A recent study has shown that, compared to human translations, neural machine translations contain more strongly-associated formulaic sequences made of relatively high-frequency words, but far less strongly-associated formulaic sequences made of relatively rare words. |
Yves Bestgen; | arxiv-cs.CL | 2022-06-22 |
655 | Scaling Autoregressive Models for Content-Rich Text-to-Image Generation IF:7 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge. |
JIAHUI YU et. al. | arxiv-cs.CV | 2022-06-21 |
656 | Understanding and Being Understood: User Strategies for Identifying and Recovering From Mistranslations in Machine Translation-Mediated Chat Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation (MT) is now widely and freely available, and has the potential to greatly improve cross-lingual communication. In order to use MT reliably and safely, end … |
Samantha Robertson; Mark Díaz; | Proceedings of the 2022 ACM Conference on Fairness, … | 2022-06-20 |
657 | Reliable and Safe Use of Machine Translation in Medical Settings IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Language barriers between patients and clinicians contribute to disparities in quality of care. Machine Translation (MT) tools are widely used in healthcare settings, but even … |
Nikita Mehandru; Samantha Robertson; Niloufar Salehi; | 2022 ACM Conference on Fairness, Accountability, and … | 2022-06-20 |
658 | The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes the submission of our end-to-end YiTrans speech translation system for the IWSLT 2022 offline task, which translates from English audio to German, Chinese, and Japanese. |
ZIQIANG ZHANG et. al. | arxiv-cs.CL | 2022-06-12 |
659 | The YiTrans Speech Translation System for IWSLT 2022 Offline Shared Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the submission of our end-to-end YiTrans speech translation system for the IWSLT 2022 offline task, which translates from English audio to German, Chinese, … |
Ziqiang Zhang; Junyi Ao; | ArXiv | 2022-06-12 |
660 | A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel Chinese dialect TTS frontend with a translation module, which converts Mandarin text into dialectic expressions to improve the intelligibility and naturalness of synthesized speech. |
Junhui Zhang; Wudi Bao; Junjie Pan; Xiang Yin; Zejun Ma; | arxiv-cs.CL | 2022-06-10 |
661 | VALHALLA: Visual Hallucination for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a visual hallucination framework, called VALHALLA, which requires only source sentences at inference time and instead uses hallucinated visual representations for multimodal machine translation. |
YI LI et. al. | cvpr | 2022-06-07 |
662 | Bilingual Attention Based Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Liyan Kang; Shaojie He; Mingxuan Wang; Fei Long; Jinsong Su; | Applied Intelligence | 2022-06-07 |
663 | LegoNN: Building Modular Encoder-Decoder Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To achieve this reusability, the interface between encoder and decoder modules is grounded to a sequence of marginal distributions over a pre-defined discrete vocabulary. We present two approaches for ingesting these marginals; one is differentiable, allowing the flow of gradients across the entire network, and the other is gradient-isolating. |
SIDDHARTH DALMIA et. al. | arxiv-cs.CL | 2022-06-07 |
664 | MorisienMT: A Dataset for Mauritian Creole Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we describe MorisienMT, a dataset for benchmarking machine translation quality of Mauritian Creole. |
Raj Dabre; Aneerav Sukhoo; | arxiv-cs.CL | 2022-06-06 |
665 | Finetuning A Kalaallisut-English Machine Translation System Using Web-crawled Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Here, we attempt to finetune a pretrained Kalaallisut-to-English neural machine translation (NMT) system using web-crawled pseudoparallel sentences from around 30 multilingual websites. |
Alex Jones; | arxiv-cs.CL | 2022-06-05 |
666 | Findings of The The RuATD Shared Task 2022 on Artificial Text Detection in Russian IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present the shared task on artificial text detection in Russian, which is organized as a part of the Dialogue Evaluation initiative, held in 2022. |
TATIANA SHAMARDINA et. al. | arxiv-cs.CL | 2022-06-03 |
667 | Exploring Diversity in Back Translation for Low-Resource Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work puts forward a more nuanced framework for understanding diversity in training data, splitting it into lexical diversity and syntactic diversity. We present novel metrics for measuring these different aspects of diversity and carry out empirical analysis into the effect of these types of diversity on final neural machine translation model performance for low-resource English↔Turkish and mid-resource English↔Icelandic. |
Laurie Burchell; Alexandra Birch; Kenneth Heafield; | arxiv-cs.CL | 2022-06-01 |
668 | NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we focus on developing resources for languages in Indonesia. |
GENTA INDRA WINATA et. al. | arxiv-cs.CL | 2022-05-31 |
669 | Refining Low-Resource Unsupervised Translation By Language Disentanglement of Multilingual Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a simple refinement procedure to separate languages from a pre-trained multilingual UMT model for it to focus on only the target low-resource task. |
Xuan-Phi Nguyen; Shafiq Joty; Wu Kui; Ai Ti Aw; | arxiv-cs.CL | 2022-05-31 |
670 | VALHALLA: Visual Hallucination for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a visual hallucination framework, called VALHALLA, which requires only source sentences at inference time and instead uses hallucinated visual representations for multimodal machine translation. |
YI LI et. al. | arxiv-cs.CV | 2022-05-31 |
671 | Preparing An Endangered Language for The Digital Age: The Case of Judeo-Spanish Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For text-to-speech synthesis, we present a 3.5 hour single speaker speech corpus for building a neural speech synthesis engine. |
Alp Öktem; Rodolfo Zevallos; Yasmin Moslem; Güneş Öztürk; Karen Şarhon; | arxiv-cs.CL | 2022-05-31 |
672 | X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we fill this research gap and present an abstractive cross-lingual summarization dataset for four different languages in the scholarly domain, which enables us to train and evaluate models that process English papers and generate summaries in German, Italian, Chinese and Japanese. |
Sotaro Takeshita; Tommaso Green; Niklas Friedrich; Kai Eckert; Simone Paolo Ponzetto; | arxiv-cs.CL | 2022-05-30 |
673 | Can Transformer Be Too Compositional? Analysing Idiom Processing in Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we investigate whether the non-compositionality of idioms is reflected in the mechanics of the dominant NMT model, Transformer, by analysing the hidden states and attention patterns for models with English as source language and one of seven European languages as target language. |
Verna Dankers; Christopher G. Lucas; Ivan Titov; | arxiv-cs.CL | 2022-05-30 |
674 | BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While most of the research attention is given to the English language in a monolingual setting, resource-constrained languages like Bangla remain out of focus, predominantly due to a lack of standard datasets. Addressing this issue, we present a new dataset BAN-Cap following the widely used Flickr8k dataset, where we collect Bangla captions of the images provided by qualified annotators. |
Mohammad Faiyaz Khan; S. M. Sadiq-Ur-Rahman Shifath; Md Saiful Islam; | arxiv-cs.CL | 2022-05-28 |
675 | TURJUMAN: A Public Toolkit for Neural Arabic Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: We present TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA). TURJUMAN exploits the recently-introduced text-to-text Transformer AraT5 … |
El Moatez Billah Nagoudi; AbdelRahim Elmadany; M. Abdul-Mageed; | ArXiv | 2022-05-27 |
676 | Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we investigate data augmentation techniques for synthesizing dialectal Arabic-English CS text. |
Injy Hamed; Nizar Habash; Slim Abdennadher; Ngoc Thang Vu; | arxiv-cs.CL | 2022-05-25 |
677 | FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we provide baselines for the tasks based on multilingual pre-trained models like mSLAM. |
ALEXIS CONNEAU et. al. | arxiv-cs.CL | 2022-05-24 |
678 | DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce DivEMT, the first publicly available post-editing study of Neural Machine Translation (NMT) over a typologically diverse set of target languages. |
Gabriele Sarti; Arianna Bisazza; Ana Guerberof Arenas; Antonio Toral; | arxiv-cs.CL | 2022-05-24 |
679 | T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a new approach to perform zero-shot cross-modal transfer between speech and text for translation tasks. |
Paul-Ambroise Duquenne; Hongyu Gong; Benoît Sagot; Holger Schwenk; | arxiv-cs.CL | 2022-05-24 |
680 | Tackling Data Scarcity in Speech Translation Using Zero-Shot Multilingual Machine Translation Techniques Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In the related field of multilingual text translation, several techniques have been proposed for zero-shot translation. |
T. A. Dinh; D. Liu; J. Niehues; | icassp | 2022-05-22 |
681 | Context-Adaptive Document-Level Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work introduces a data-adaptive method that enables the model to adopt the necessary and helpful context. |
L. Zhang; Z. Zhang; B. Chen; W. Luo; L. Si; | icassp | 2022-05-22 |
682 | ISOMETRIC MT: Neural Machine Translation for Automatic Dubbing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work introduces a self-learning approach that allows a transformer model to directly learn to generate outputs that closely match the source length, in short Isometric MT. In particular, our approach does not require to generate multiple hypotheses nor any auxiliary ranking function. |
S. M. Lakew; Y. Virkar; P. Mathur; M. Federico; | icassp | 2022-05-22 |
683 | Non-Autoregressive Neural Machine Translation: A Call for Clarity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Non-autoregressive approaches aim to improve the inference speed of translation models by only requiring a single forward pass to generate the output sequence instead of iteratively producing each predicted token. |
Robin M. Schmidt; Telmo Pires; Stephan Peitz; Jonas Lööf; | arxiv-cs.CL | 2022-05-21 |
684 | Understanding and Mitigating The Uncertainty in Zero-Shot Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we aim to understand and alleviate the off-target issues from the perspective of uncertainty in zero-shot translation. |
Wenxuan Wang; Wenxiang Jiao; Shuo Wang; Zhaopeng Tu; Michael R. Lyu; | arxiv-cs.CL | 2022-05-20 |
685 | Translating Hanja Historical Documents to Contemporary Korean and English Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Thus, we propose H2KE, a neural machine translation model, that translates historical documents in Hanja to more easily understandable Korean and to English. |
JUHEE SON et. al. | arxiv-cs.CL | 2022-05-20 |
686 | Data Augmentation to Address Out-of-Vocabulary Problem in Low-Resource Sinhala-English Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a word and phrase replacement-based DA technique that considers both types of OOV, by augmenting (1) rare words in the existing parallel corpus, and (2) new words from a bilingual dictionary. |
Aloka Fernando; Surangika Ranathunga; | arxiv-cs.CL | 2022-05-18 |
687 | Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we report our recent achievements in S2ST. |
QIANQIAN DONG et. al. | arxiv-cs.CL | 2022-05-18 |
688 | Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While one possible solution is to directly take target contexts into these statistical metrics, the target-context-aware statistical computing is extremely expensive, and the corresponding storage overhead is unrealistic. To solve the above issues, we propose a target-context-aware metric, named conditional bilingual mutual information (CBMI), which makes it feasible to supplement target context information for statistical metrics. |
SONGMING ZHANG et. al. | acl | 2022-05-17 |
689 | DEEP: DEnoising Entity Pre-training for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Earlier named entity translation methods mainly focus on phonetic transliteration, which ignores the sentence context for translation and is limited in domain and language coverage. To address this limitation, we propose DEEP, a DEnoising Entity Pre-training method that leverages large amounts of monolingual data and a knowledge base to improve named entity translation accuracy within sentences. |
Junjie Hu; Hiroaki Hayashi; Kyunghyun Cho; Graham Neubig; | acl | 2022-05-17 |
690 | As Little As Possible, As Much As Necessary: Detecting Over- and Undertranslations with Contrastive Conditioning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Omission and addition of content is a typical issue in neural machine translation. We propose a method for detecting such phenomena with off-the-shelf translation models. |
Jannis Vamvas; Rico Sennrich; | acl | 2022-05-17 |
691 | Zero-Shot Cross-lingual Semantic Parsing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a multi-task encoder-decoder model to transfer parsing knowledge to additional languages using only English-logical form paired data and in-domain natural language corpora in each new language. |
Tom Sherborne; Mirella Lapata; | acl | 2022-05-17 |
692 | Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a Confidence Based Bidirectional Global Context Aware (CBBGCA) training framework for NMT, where the NMT model is jointly trained with an auxiliary conditional masked language model (CMLM). |
CHULUN ZHOU et. al. | acl | 2022-05-17 |
693 | UniTE: Unified Translation Evaluation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose UniTE, which is the first unified framework engaged with abilities to handle all three evaluation tasks. |
YU WAN et. al. | acl | 2022-05-17 |
694 | BiTIIMT: A Bilingual Text-infilling Method for Interactive Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel BiTIIMT system, Bilingual Text-Infilling for Interactive Neural Machine Translation. |
YANLING XIAO et. al. | acl | 2022-05-17 |
695 | Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce multilingual crossover encoder-decoder (mXEncDec) to fuse language pairs at an instance level. |
YONG CHENG et. al. | acl | 2022-05-17 |
696 | Towards Making The Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper demonstrates that multilingual pretraining and multilingual fine-tuning are both critical for facilitating cross-lingual transfer in zero-shot translation, where the neural machine translation (NMT) model is tested on source languages unseen during supervised training. Following this idea, we present SixT+, a strong many-to-English NMT model that supports 100 source languages but is trained with a parallel dataset in only six source languages. |
GUANHUA CHEN et. al. | acl | 2022-05-17 |
697 | Sub-Word Alignment Is Still Useful: A Vest-Pocket Method for Enhancing Low-Resource Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We leverage embedding duplication between aligned sub-words to extend the Parent-Child transfer learning method, so as to improve low-resource machine translation. |
Minhan Xu; Yu Hong; | acl | 2022-05-17 |
698 | STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing techniques often attempt to transfer powerful machine translation (MT) capabilities to ST, but neglect the representation discrepancy across modalities. In this paper, we propose the Speech-TExt Manifold Mixup (STEMM) method to calibrate such discrepancy. |
Qingkai Fang; Rong Ye; Lei Li; Yang Feng; Mingxuan Wang; | acl | 2022-05-17 |
699 | Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes an adaptive segmentation policy for end-to-end ST. Inspired by human interpreters, the policy learns to segment the source streaming speech into meaningful units by considering both acoustic features and translation history, maintaining consistency between the segmentation and translation. |
Ruiqing Zhang; Zhongjun He; Hua Wu; Haifeng Wang; | acl | 2022-05-17 |
700 | Measuring and Mitigating Name Biases in Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we describe a new source of bias prevalent in NMT systems, relating to translations of sentences containing person names. |
Jun Wang; Benjamin Rubinstein; Trevor Cohn; | acl | 2022-05-17 |
701 | Triangular Transfer: Freezing The Pivot for Triangular Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a transfer-learning-based approach that utilizes all types of auxiliary data. |
Meng Zhang; Liangyou Li; Qun Liu; | acl | 2022-05-17 |
702 | MSCTD: A Multimodal Sentiment Chat Translation Dataset IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce a new task named Multimodal Chat Translation (MCT), aiming to generate more accurate translations with the help of the associated dialogue history and visual context. |
Yunlong Liang; Fandong Meng; Jinan Xu; Yufeng Chen; Jie Zhou; | acl | 2022-05-17 |
703 | ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper explores a deeper relationship between Transformer and numerical ODE methods. |
BEI LI et. al. | acl | 2022-05-17 |
704 | Machine Translation for Livonian: Catering to 20 Speakers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we tackle the task of developing neural machine translation (NMT) between Livonian and English, with a two-fold aim: on one hand, preserving the language and on the other – enabling access to Livonian folklore, lifestories and other textual intangible heritage as well as making it easier to create further parallel corpora. |
Matiss Rikters; Marili Tomingas; Tuuli Tuisk; Valts Ernštreits; Mark Fishel; | acl | 2022-05-17 |
705 | Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a novel data augmentation paradigm termed Continuous Semantic Augmentation (CsaNMT), which augments each training instance with an adjacency semantic region that could cover adequate variants of literal expression under the same meaning. |
XIANGPENG WEI et. al. | acl | 2022-05-17 |
706 | Can Transformer Be Too Compositional? Analysing Idiom Processing in Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we investigate whether the non-compositionality of idioms is reflected in the mechanics of the dominant NMT model, Transformer, by analysing the hidden states and attention patterns for models with English as source language and one of seven European languages as target language. |
Verna Dankers; Christopher Lucas; Ivan Titov; | acl | 2022-05-17 |
707 | Redistributing Low-Frequency Words: Making The Most of Monolingual Data in Non-Autoregressive Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we provide an appealing alternative for NAT – monolingual KD, which trains NAT student on external monolingual data with AT teacher trained on the original bilingual data. |
Liang Ding; Longyue Wang; Shuming Shi; Dacheng Tao; Zhaopeng Tu; | acl | 2022-05-17 |
708 | DiBiMT: A Novel Benchmark for Measuring Word Sense Disambiguation Biases in Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present DiBiMT, the first entirely manually-curated evaluation benchmark which enables an extensive study of semantic biases in Machine Translation of nominal and verbal words in five different language combinations, namely, English and one or other of the following languages: Chinese, German, Italian, Russian and Spanish. |
Niccolò Campolungo; Federico Martelli; Francesco Saina; Roberto Navigli; | acl | 2022-05-17 |
709 | A Variational Hierarchical Model for Neural Cross-Lingual Summarization IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, it is very challenging for the model to directly conduct CLS as it requires both the abilities to translate and summarize. To address this issue, we propose a hierarchical model for the CLS task, based on the conditional variational auto-encoder. |
YUNLONG LIANG et. al. | acl | 2022-05-17 |
710 | Focus on The Target’s Vocabulary: Masked Label Smoothing for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: When allocating smoothed probability, original label smoothing treats the source-side words that would never appear in the target language equally to the real target-side words, which could bias the translation model. To address this issue, we propose Masked Label Smoothing (MLS), a new mechanism that masks the soft label probability of source-side words to zero. |
Liang Chen; Runxin Xu; Baobao Chang; | acl | 2022-05-17 |
711 | Geographical Distance Is The New Hyperparameter: A Case Study Of Finding The Optimal Pre-trained Language For English-isiZulu Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This study explores the potential benefits of transfer learning in an English-isiZulu translation framework. |
Muhammad Umair Nasir; Innocent Amos Mchechesi; | arxiv-cs.CL | 2022-05-17 |
712 | Scheduled Multi-task Learning for Neural Chat Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Although the NCT models have achieved impressive success, it is still far from satisfactory due to insufficient chat translation data and simple joint training manners. To address the above issues, we propose a scheduled multi-task learning framework for NCT. |
Yunlong Liang; Fandong Meng; Jinan Xu; Yufeng Chen; Jie Zhou; | acl | 2022-05-17 |
713 | An Imitation Learning Curriculum for Text Editing with Non-Autoregressive Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a framework for training non-autoregressive sequence-to-sequence models for editing tasks, where the original input sequence is iteratively edited to produce the output. |
Sweta Agrawal; Marine Carpuat; | acl | 2022-05-17 |
714 | Bridging The Data Gap Between Training and Inference for Unsupervised Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To narrow the data gap, we propose an online self-training approach, which simultaneously uses the pseudo parallel data {natural source, translated target} to mimic the inference scenario. |
Zhiwei He; Xing Wang; Rui Wang; Shuming Shi; Zhaopeng Tu; | acl | 2022-05-17 |
715 | Consistent Human Evaluation of Machine Translation Across Language Pairs IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more consistent assessment. |
DANIEL LICHT et. al. | arxiv-cs.CL | 2022-05-17 |
716 | Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a novel posterior alignment technique that is truly online in its execution and superior in terms of alignment error rates compared to existing methods. |
Soumya Chatterjee; Sunita Sarawagi; Preethi Jyothi; | acl | 2022-05-17 |
717 | From Simultaneous to Streaming Machine Translation By Leveraging Streaming History Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work proposes a stream-level adaptation of the current latency measures based on a re-segmentation approach applied to the output translation, that is successfully evaluated on streaming conditions for a reference IWSLT task. |
Javier Iranzo Sanchez; Jorge Civera; Alfons Juan-Císcar; | acl | 2022-05-17 |
718 | AppTek’s Submission to The IWSLT 2022 Isometric Spoken Language Translation Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: To participate in the Isometric Spoken Language Translation Task of the IWSLT 2022 evaluation, constrained condition, AppTek developed neural Transformer-based systems for … |
P. Wilken; E. Matusov; | International Workshop on Spoken Language Translation | 2022-05-12 |
719 | Investigating Contextual Influence in Document-Level Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Current state-of-the-art neural machine translation (NMT) architectures usually do not take document-level context into account. However, the document-level context of a source … |
Prashant Nayak; Rejwanul Haque; John D. Kelleher; Andy Way; | Inf. | 2022-05-12 |
720 | AppTek’s Submission to The IWSLT 2022 Isometric Spoken Language Translation Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: To participate in the Isometric Spoken Language Translation Task of the IWSLT 2022 evaluation, constrained condition, AppTek developed neural Transformer-based systems for … |
Patrick Wilken; Evgeny Matusov; | arxiv-cs.CL | 2022-05-11 |
721 | Improving English-to-Indian Language Neural Machine Translation Systems Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most Indian languages lack sufficient parallel data for Machine Translation (MT) training. In this study, we build English-to-Indian language Neural Machine Translation (NMT) … |
Akshara Kandimalla; P. Lohar; S. Maji; Andy Way; | Inf. | 2022-05-11 |
722 | Controlling Extra-Textual Attributes About Dialogue Participants: A Case Study of English-to-Polish Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Unlike English, morphologically rich languages can reveal characteristics of speakers or their conversational partners, such as gender and number, via pronouns, morphological … |
S. Vincent; Loïc Barrault; Carolina Scarton; | ArXiv | 2022-05-10 |
723 | Controlling Extra-Textual Attributes About Dialogue Participants — A Case Study of English-to-Polish Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We focus on the underresearched problem of utilising external metadata in automatic translation of TV dialogue, proposing a case study where a wide range of approaches for controlling attributes in translation is employed in a multi-attribute scenario. |
Sebastian T. Vincent; Loïc Barrault; Carolina Scarton; | arxiv-cs.CL | 2022-05-10 |
724 | ParaCotta: Synthetic Multilingual Paraphrase Corpora from The Most Diverse Translation Sample Pair Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We generate multiple translation samples using beam search and choose the most lexically diverse pair according to their sentence BLEU. |
ALHAM FIKRI AJI et. al. | arxiv-cs.CL | 2022-05-09 |
725 | CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce an annotated dataset (CoCoA-MT) and an associated evaluation metric for training and evaluating formality-controlled MT models for six diverse target languages. |
MARIA NĂDEJDE et. al. | arxiv-cs.CL | 2022-05-09 |
726 | Example-Based Machine Translation from Text to A Hierarchical Representation of Sign Language Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This article presents an original method for Text-to-Sign Translation. |
Élise Bertin-Lemée; Annelies Braffort; Camille Challant; Claire Danet; Michael Filhol; | arxiv-cs.CL | 2022-05-06 |
727 | Bridging The Domain Gap for Stance Detection for The Zulu Language Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a black-box non-intrusive method that utilizes techniques from Domain Adaptation to reduce the domain gap, without requiring any human expertise in the target language, by leveraging low-quality data in both a supervised and unsupervised manner. |
Gcinizwe Dlamini; Imad Eddine Ibrahim Bekkouch; Adil Khan; Leon Derczynski; | arxiv-cs.CL | 2022-05-06 |
728 | Example-Based Machine Translation from Text to A Hierarchical Representation of Sign Language Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article presents an original method for Text-to-Sign Translation. It compensates data scarcity using a domain-specific parallel corpus of alignments between text and … |
Élise Bertin-Lemée; Annelies Braffort; Camille Challant; Claire Danet; Michael Filhol; | ArXiv | 2022-05-06 |
729 | Non-Autoregressive Machine Translation: It’s Not As Fast As It Seems Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we point out flaws in the evaluation methodology present in the literature on NAR models and we provide a fair comparison between a state-of-the-art NAR model and the autoregressive submissions to the shared task. |
Jindřich Helcl; Barry Haddow; Alexandra Birch; | arxiv-cs.CL | 2022-05-04 |
730 | ON-TRAC Consortium Systems for The IWSLT 2022 Dialect and Low-resource Speech Translation Tasks IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2022: low-resource and dialect speech translation. |
MARCELY ZANON BOITO et. al. | arxiv-cs.CL | 2022-05-04 |
731 | Original or Translated? A Causal Analysis of The Impact of Translationese on Machine Translation Performance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we collect CausalMT, a dataset where the MT training data are also labeled with the human translation directions. |
Jingwei Ni; Zhijing Jin; Markus Freitag; Mrinmaya Sachan; Bernhard Schölkopf; | arxiv-cs.CL | 2022-05-04 |
732 | Non-Autoregressive Machine Translation: It’s Not As Fast As It Seems IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Efficient machine translation models are commercially important as they can increase inference speeds, and reduce costs and carbon emissions. Recently, there has been much … |
Jindřich Helcl; B. Haddow; Alexandra Birch; | ArXiv | 2022-05-04 |
733 | The Implicit Length Bias of Label Smoothing on Beam Search Decoding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We verify our theory by applying a simple rectification function at inference time to restore the unbiased distributions from the label-smoothed model predictions. |
Bowen Liang; Pidong Wang; Yuan Cao; | arxiv-cs.CL | 2022-05-02 |
734 | Quality-Aware Decoding for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT, by leveraging recent breakthroughs in reference-free and reference-based MT evaluation through various inference methods like N-best reranking and minimum Bayes risk decoding. |
PATRICK FERNANDES et. al. | arxiv-cs.CL | 2022-05-02 |
735 | Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, there is a need to create training and evaluation data for implementing machine learning tasks and bridging the research gap in the language. This work presents the Hausa Visual Genome (HaVG), a dataset that contains the description of an image or a section within the image in Hausa and its equivalent in English. |
IDRIS ABDULMUMIN et. al. | arxiv-cs.CL | 2022-05-02 |
736 | The Cross-lingual Conversation Summarization Challenge Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose the shared task of cross-lingual conversation summarization, ConvSumX Challenge, opening new avenues for researchers to investigate solutions that integrate conversation summarization and machine translation. |
YULONG CHEN et. al. | arxiv-cs.CL | 2022-04-30 |
737 | Can Machine Translation Be A Reasonable Alternative for Multilingual Question Answering Systems Over Knowledge Graphs? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we discuss Knowledge Graph Question Answering (KGQA) systems that aim at providing natural language access to data stored in Knowledge Graphs (KG). |
Aleksandr Perevalov; Andreas Both; Dennis Diefenbach; Axel-Cyrille Ngonga Ngomo; | www | 2022-04-29 |
738 | How Robust Is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we analyze how translation performance changes as the data ratios among languages vary in the tokenizer training corpus. |
SHIYUE ZHANG et. al. | arxiv-cs.CL | 2022-04-29 |
739 | NMTScore: A Multilingual Analysis of Translation-based Text Similarity Measures Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Translation-based similarity measures include direct and pivot translation probability, as well as translation cross-likelihood, which has not been studied so far. We analyze these measures in the common framework of multilingual NMT, releasing the NMTScore library (available at https://github.com/ZurichNLP/nmtscore). |
Jannis Vamvas; Rico Sennrich; | arxiv-cs.CL | 2022-04-28 |
740 | UniTE: Unified Translation Evaluation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose UniTE, which is the first unified framework engaged with abilities to handle all three evaluation tasks. |
YU WAN et. al. | arxiv-cs.CL | 2022-04-28 |
741 | Data-Driven Adaptive Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, wait-k suffers from two major limitations: (a) it is a fixed policy that can not adaptively adjust latency given context, and (b) its training is much slower than full-sentence translation. To alleviate these issues, we propose a novel and efficient training scheme for adaptive SimulMT by augmenting the training corpus with adaptive prefix-to-prefix pairs, while the training complexity remains the same as that of training full-sentence translation models. |
GUANGXU XUN et. al. | arxiv-cs.CL | 2022-04-26 |
742 | Efficient Machine Translation Domain Adaptation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we explore several approaches to speed up nearest neighbor machine translation. |
Pedro Henrique Martins; Zita Marinho; André F. T. Martins; | arxiv-cs.CL | 2022-04-26 |
743 | Leveraging Frozen Pretrained Written Language Models for Neural Sign Language Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We consider neural sign language translation: machine translation from signed to written languages using encoder–decoder neural networks. Translating sign language videos to … |
Mathieu De Coster; J. Dambre; | Inf. | 2022-04-23 |
744 | A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we conduct a systematic survey with comparisons and discussions of various non-autoregressive translation (NAT) models from different aspects. |
YISHENG XIAO et. al. | arxiv-cs.CL | 2022-04-20 |
745 | PICT@DravidianLangTech-ACL2022: Neural Machine Translation On Dravidian Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a summary of the findings that we obtained based on the shared task on machine translation of Dravidian languages. |
Aditya Vyawahare; Rahul Tangsali; Aditya Mandke; Onkar Litake; Dipali Kadam; | arxiv-cs.CL | 2022-04-19 |
746 | Dynamic Position Encoding for Transformers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, such embeddings are fixed after training regardless of the task and the word ordering system of the source or target language. In this paper, we propose a novel architecture with new position embeddings depending on the input text to address this shortcoming by taking the order of target words into consideration. |
Joyce Zheng; Mehdi Rezagholizadeh; Peyman Passban; | arxiv-cs.CL | 2022-04-17 |
747 | Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a novel data augmentation paradigm termed Continuous Semantic Augmentation (CsaNMT), which augments each training instance with an adjacency semantic region that could cover adequate variants of literal expression under the same meaning. |
XIANGPENG WEI et. al. | arxiv-cs.CL | 2022-04-14 |
748 | Creativity in Translation: Machine Translation As A Constraint for Literary Texts IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This article presents the results of a study involving the translation of a short story by Kurt Vonnegut from English to Catalan and Dutch using three modalities: machine-translation (MT), post-editing (PE) and translation without aid (HT). |
Ana Guerberof Arenas; Antonio Toral; | arxiv-cs.CL | 2022-04-12 |
749 | Large-Scale Streaming End-to-End Speech Translation with Neural Transducers IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce it to streaming end-to-end speech translation (ST), which aims to convert audio signals to texts in other languages directly. |
Jian Xue; Peidong Wang; Jinyu Li; Matt Post; Yashesh Gaur; | arxiv-cs.CL | 2022-04-11 |
750 | Toward More Effective Human Evaluation for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We investigate a simple way to reduce cost by reducing the number of text segments that must be annotated in order to accurately predict a score for a complete test set. |
Belén Saldías; George Foster; Markus Freitag; Qijun Tan; | arxiv-cs.CL | 2022-04-11 |
751 | Towards Better Chinese-centric Neural Machine Translation for Low-resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present the winner competition system that leverages monolingual word embeddings data enhancement, bilingual curriculum learning, and contrastive re-ranking. |
Bin Li; Yixuan Weng; Fei Xia; Hanjun Deng; | arxiv-cs.CL | 2022-04-08 |
752 | MMTAfrica: Multilingual Machine Translation for African Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we focus on the task of multilingual machine translation for African languages and describe our contribution in the 2021 WMT Shared Task: Large-Scale Multilingual Machine Translation. |
Chris C. Emezue; Bonaventure F. P. Dossou; | arxiv-cs.CL | 2022-04-08 |
753 | GigaST: A 10,000-hour Pseudo Speech Translation Corpus IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper introduces GigaST, a large-scale pseudo speech translation (ST) corpus. We create the corpus by translating the text in GigaSpeech, an English ASR corpus, into German … |
RONG YE et. al. | ArXiv | 2022-04-08 |
754 | GigaST: A 10,000-hour Pseudo Speech Translation Corpus Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces GigaST, a large-scale pseudo speech translation (ST) corpus. |
RONG YE et. al. | arxiv-cs.CL | 2022-04-08 |
755 | Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Direct speech-to-speech translation (S2ST) models suffer from data scarcity issues as there exists little parallel S2ST data, compared to the amount of data available for conventional cascaded systems that consist of automatic speech recognition (ASR), machine translation (MT), and text-to-speech (TTS) synthesis. In this work, we explore self-supervised pre-training with unlabeled speech data and data augmentation to tackle this issue. |
SRAVYA POPURI et. al. | arxiv-cs.CL | 2022-04-06 |
756 | Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual Retrieval Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we show that two parameter-efficient approaches to cross-lingual transfer, namely Sparse Fine-Tuning Masks (SFTMs) and Adapters, allow for a more lightweight and more effective zero-shot transfer to multilingual and cross-lingual retrieval tasks. |
Robert Litschko; Ivan Vulić; Goran Glavaš; | arxiv-cs.CL | 2022-04-05 |
757 | Creativity in Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article presents the results of a study involving the translation of a short story by Kurt Vonnegut from English to Catalan and Dutch using three modalities: … |
Ana Guerberof-Arenas; Antonio Toral; | Translation Spaces | 2022-03-28 |
758 | Modeling Target-Side Morphology in Neural Machine Translation: A Comparison of Strategies Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we re-investigate two target-side linguistic processing techniques: a lemma-tag strategy and a linguistically informed word segmentation strategy. |
Marion Weller-Di Marco; Matthias Huck; Alexander Fraser; | arxiv-cs.CL | 2022-03-25 |
759 | Mitigating Gender Bias in Machine Translation Through Adversarial Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present an adversarial learning framework that addresses these challenges to mitigate gender bias in seq2seq machine translation. |
Eve Fleisig; Christiane Fellbaum; | arxiv-cs.CL | 2022-03-20 |
760 | Gaussian Multi-head Attention for Simultaneous Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose Gaussian Multi-head Attention (GMA) to develop a new SiMT policy by modeling alignment and translation in a unified manner. |
Shaolei Zhang; Yang Feng; | arxiv-cs.CL | 2022-03-17 |
761 | English-Chinese Machine Translation Model Based on Bidirectional Neural Network with Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, with the development of deep learning, machine translation using neural network has gradually become the mainstream method in industry and academia. The existing … |
Yonglan Li; Wenjia He; | J. Sensors | 2022-03-17 |
762 | Low-resource Neural Machine Translation: Methods and Trends Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural Machine Translation (NMT) brings promising improvements in translation quality, but until recently, these models rely on large-scale parallel corpora. As such corpora only … |
Shumin Shi; Xing Wu; Rihai Su; Heyan Huang; | ACM Transactions on Asian and Low-Resource Language … | 2022-03-15 |
763 | A New Approach to Calculating BERTScore for Automatic Assessment of Translation Quality Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The study of the applicability of the BERTScore metric was conducted to translation quality assessment at the sentence level for English -> Russian direction. |
A. A. Vetrov; E. A. Gorn; | arxiv-cs.CL | 2022-03-10 |
764 | Look Backward and Forward: Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To alleviate this problem, we propose a novel method, Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation (SBD-NMT). |
Xuanwei Zhang; Libin Shen; Disheng Pan; Liang Wang; Yanjun Miao; | arxiv-cs.CL | 2022-03-10 |
765 | Onception: Active Learning with Expert Advice for Real World Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this article, we assume a real world human-in-the-loop scenario in which: (i) the source sentences may not be readily available, but instead arrive in a stream; (ii) the automatic translations receive feedback in the form of a rating, instead of a correct/edited translation, since the human-in-the-loop might be a user looking for a translation, but not be able to provide one. |
Vânia Mendonça; Ricardo Rei; Luisa Coheur; Alberto Sardinha; | arxiv-cs.CL | 2022-03-08 |
766 | Challenges of Neural Machine Translation for Short Texts IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Short texts (STs) present in a variety of scenarios, including query, dialog, and entity names. Most of the exciting studies in neural machine translation (NMT) are focused on … |
YU WAN et. al. | Computational Linguistics | 2022-03-07 |
767 | Focus on The Target’s Vocabulary: Masked Label Smoothing for Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Label smoothing and vocabulary sharing are two widely used techniques in neural machine translation models. However, we argue that simply applying both techniques can be … |
Liang Chen; Runxin Xu; Baobao Chang; | ArXiv | 2022-03-06 |
768 | From Simultaneous to Streaming Machine Translation By Leveraging Streaming History Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, a state-of-the-art simultaneous sentence-level MT system is extended to the streaming setup by leveraging the streaming history. |
Javier Iranzo-Sánchez; Jorge Civera; Alfons Juan; | arxiv-cs.CL | 2022-03-04 |
769 | UDAAN – Machine Learning Based Post-Editing Tool for Document Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We introduce UDAAN, an open-source post-editing tool that can reduce manual editing efforts to quickly produce publishable-standard documents in several Indic languages. UDAAN has … |
Ayush Maheshwari; A. Ravindran; Venkatapathy Subramanian; Akshay Jalan; Ganesh Ramakrishnan; | Proceedings of the 6th Joint International Conference on … | 2022-03-03 |
770 | UDAAN: Machine Learning Based Post-Editing Tool for Document Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce UDAAN, an open-source post-editing tool that can reduce manual editing efforts to quickly produce publishable-standard documents in several Indic languages. |
Ayush Maheshwari; Ajay Ravindran; Venkatapathy Subramanian; Ganesh Ramakrishnan; | arxiv-cs.CL | 2022-03-03 |
771 | End-to-end Entity-aware Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View |
SHUFANG XIE et. al. | Machine Learning | 2022-03-01 |
772 | Offline Corpus Augmentation for English-Amharic Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The present paper investigates the effect of corpus augmentation on the quality of English-Amharic Machine Translation (MT) with the goal of improving translation quality of … |
Yohannes Biadgligne; K. Smaïli; | 2022 5th International Conference on Information and … | 2022-03-01 |
773 | OCR Improves Machine Translation for Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We aim to investigate the performance of current OCR systems on low resource languages and low resource scripts. |
Oana Ignat; Jean Maillard; Vishrav Chaudhary; Francisco Guzmán; | arxiv-cs.CL | 2022-02-26 |
774 | The Reality of Multi-Lingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our book The Reality of Multi-Lingual Machine Translation discusses the benefits and perils of using more than two languages in machine translation systems. |
Tom Kocmi; Dominik Macháček; Ondřej Bojar; | arxiv-cs.CL | 2022-02-25 |
775 | JParaCrawl V3.0: A Large-scale English-Japanese Parallel Corpus Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Most current machine translation models are mainly trained with parallel corpora, and their translation accuracy largely depends on the quality and quantity of the corpora. |
Makoto Morishita; Katsuki Chousa; Jun Suzuki; Masaaki Nagata; | arxiv-cs.CL | 2022-02-25 |
776 | Screening Gender Transfer in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper aims at identifying the information flow in state-of-the-art machine translation systems, taking as example the transfer of gender when translating from French into English. |
Guillaume Wisniewski; Lichao Zhu; Nicolas Ballier; François Yvon; | arxiv-cs.CL | 2022-02-25 |
777 | Using Natural Language Prompts for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We explore the use of natural language prompts for controlling various aspects of the outputs generated by machine translation models. |
Xavier Garcia; Orhan Firat; | arxiv-cs.CL | 2022-02-23 |
778 | Refining The State-of-the-art in Machine Translation, Optimizing NMT for The JA <-> EN Language Pair By Leveraging Personal Domain Expertise Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Documenting the construction of an NMT (Neural Machine Translation) system for En/Ja based on the Transformer architecture leveraging the OpenNMT framework. |
Matthew Bieda; | arxiv-cs.CL | 2022-02-23 |
779 | An Overview on Machine Translation Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report mainly includes the following contents: a brief history of machine translation evaluation (MTE), the classification of research methods on MTE, and the cutting-edge progress, including human evaluation, automatic evaluation, and evaluation of evaluation methods (meta-evaluation). |
Lifeng Han; | arxiv-cs.CL | 2022-02-22 |
780 | CALCS 2021 Shared Task: Machine Translation for Code-Switched Data IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we address machine translation for code-switched social media data. |
Shuguang Chen; Gustavo Aguilar; Anirudh Srinivasan; Mona Diab; Thamar Solorio; | arxiv-cs.CL | 2022-02-19 |
781 | PETCI: A Parallel English Translation Dataset of Chinese Idioms Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present PETCI, a parallel English translation dataset of Chinese idioms, aiming to improve idiom translation by both human and machine. |
Kenan Tang; | arxiv-cs.CL | 2022-02-18 |
782 | Improving English to Sinhala Neural Machine Translation Using Part-of-Speech Tag Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Thus, in this research, we explore effective methods of incorporating Part of Speech (POS) tags to the Transformer input embedding and positional encoding to further enhance the performance of the baseline English to Sinhala neural machine translation model. |
Ravinga Perera; Thilakshi Fonseka; Rashmini Naranpanawa; Uthayasanker Thayasivam; | arxiv-cs.CL | 2022-02-17 |
783 | End-to-End Training for Back-Translation with Categorical Reparameterization Trick Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a categorical reparameterization trick that makes NMT models generate differentiable sentences so that the VAE’s training framework can work in the end-to-end fashion. |
DongNyeong Heo; Heeyoul Choi; | arxiv-cs.CL | 2022-02-17 |
784 | Simplification of English and Bengali Sentences for Improving Quality of Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
S. Mahata; Avishek Garain; Dipankar Das; Sivaji Bandyopadhyay; | Neural Processing Letters | 2022-02-15 |
785 | Lexical Diversity in Statistical and Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation systems have revolutionized translation processes in terms of quantity and speed in recent years, and they have even been claimed to achieve human … |
Mojca Brglez; Špela Vintar; | Inf. | 2022-02-15 |
786 | Sequence-to-Sequence Resources for Catalan Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce sequence-to-sequence language resources for Catalan, a moderately under-resourced language, towards two tasks, namely: Summarization and Machine Translation (MT). |
Ona de Gibert; Ksenia Kharitonova; Blanca Calvo Figueras; Jordi Armengol-Estapé; Maite Melero; | arxiv-cs.CL | 2022-02-14 |
787 | Knowledge Distillation: A Method for Making Neural Machine Translation More Efficient IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation (NMT) systems have greatly improved the quality available from machine translation (MT) compared to statistical machine translation (SMT) systems. … |
Wandri Jooste; Rejwanul Haque; Andy Way; | Inf. | 2022-02-14 |
788 | Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a fine-tuning loss that enables pre-trained model’s ability to mine pseudo-parallel data for fully unsupervised machine translation. |
XUAN-PHI NGUYEN et. al. | iclr | 2022-02-08 |
789 | Non-Autoregressive Models Are Better Multilingual Translators Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose switch-GLAT, a non-autoregressive multilingual machine translation model with a code-switch decoder. |
ZHENQIAO SONG et. al. | iclr | 2022-02-08 |
790 | From Fully Trained to Fully Random Embeddings: Improving Neural Machine Translation with Compact Word Embedding Tables Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Embedding matrices are key components in neural natural language processing (NLP) models that are responsible to provide numerical representations of input tokens (i.e. words or subwords). In this paper, we analyze the impact and utility of such matrices in the context of neural machine translation (NMT). |
Krtin Kumar; Peyman Passban; Mehdi Rezagholizadeh; Yiusing Lau; Qun Liu; | aaai | 2022-02-07 |
791 | Machine Translation from Signed to Spoken Languages: State of The Art and Challenges IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a systematic literature review to illustrate the state of the art in the domain and then, harking back to the requirements, lay out several challenges for future research. |
Mathieu De Coster; Dimitar Shterionov; Mieke Van Herreweghe; Joni Dambre; | arxiv-cs.CL | 2022-02-07 |
792 | Multilingual Code Snippets Training for Program Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce CoST, a new multilingual Code Snippet Translation dataset that contains parallel data from 7 commonly used programming languages. |
Ming Zhu; Karthik Suresh; Chandan K Reddy; | aaai | 2022-02-07 |
793 | Frequency-Aware Contrastive Learning for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Specifically, we propose a frequency-aware token-level contrastive learning method, in which the hidden state of each decoding step is pushed away from the counterparts of other target words, in a soft contrastive way based on the corresponding word frequencies. |
TONG ZHANG et. al. | aaai | 2022-02-07 |
794 | Non-autoregressive Translation with Layer-Wise Prediction and Deep Supervision IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose DSLP, a highly efficient and high-performance model for machine translation. |
Chenyang Huang; Hao Zhou; Osmar R. Zaïane; Lili Mou; Lei Li; | aaai | 2022-02-07 |
795 | Fully Unsupervised Machine Translation Using Context-Aware Word Translation and Denoising Autoencoder Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Learning machine translation by using only monolingual data sets is a complex task as there are many possible ways to connect or associate target sentences with source sentences. … |
S. Chauhan; Philemon Daniel; Shefali Saxena; Ayush Sharma; | Applied Artificial Intelligence | 2022-02-04 |
796 | Pirá: A Bilingual Portuguese-English Dataset for Question-Answering About The Ocean Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents the Pirá dataset, a large set of questions and answers about the ocean and the Brazilian coast both in Portuguese and English. |
ANDRÉ F. A. PASCHOAL et. al. | arxiv-cs.CL | 2022-02-04 |
797 | Anticipation-Free Training for Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This leads to aggressive anticipation during inference, resulting in the hallucination phenomenon. To mitigate this problem, we propose a new framework that decomposes the translation process into the monotonic translation step and the reordering step, and we model the latter by the auxiliary sorting network (ASN). |
Chih-Chiang Chang; Shun-Po Chuang; Hung-yi Lee; | arxiv-cs.CL | 2022-01-30 |
798 | Translation Alignment with Ugarit Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Ugarit is a public web-based tool for manual annotation of parallel texts for generating word-level translation alignment. We aimed to develop a user-friendly interactive … |
Tariq Yousef; Chiara Palladino; Farnoosh Shamsian; Maryam Foradi; | Inf. | 2022-01-27 |
799 | Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Thus, we introduce Prabhupadavani, which is a multilingual code-mixed ST dataset for 25 languages. |
Jivnesh Sandhan; Ayush Daksh; Om Adideva Paranjay; Laxmidhar Behera; Pawan Goyal; | arxiv-cs.CL | 2022-01-27 |
800 | Tackling Data Scarcity in Speech Translation Using Zero-shot Multilingual Machine Translation Techniques Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In the related field of multilingual text translation, several techniques have been proposed for zero-shot translation. |
Tu Anh Dinh; Danni Liu; Jan Niehues; | arxiv-cs.CL | 2022-01-26 |
801 | Supervised Visual Attention for Simultaneous Multimodal Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose the first Transformer-based simultaneous MMT architecture, which has not been previously explored in the field. |
Veneta Haralampieva; Ozan Caglayan; Lucia Specia; | arxiv-cs.CL | 2022-01-23 |
802 | VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing multimodal machine translation (MMT) datasets consist of images and video captions or general subtitles, which rarely contain linguistic ambiguity, making visual information not so effective to generate appropriate translations. We introduce VISA, a new dataset that consists of 40k Japanese-English parallel sentence pairs and corresponding video clips with the following key features: (1) the parallel sentences are subtitles from movies and TV episodes; (2) the source subtitles are ambiguous, which means they have multiple possible translations with different meanings; (3) we divide the dataset into Polysemy and Omission according to the cause of ambiguity. |
Yihang Li; Shuichiro Shimizu; Weiqi Gu; Chenhui Chu; Sadao Kurohashi; | arxiv-cs.CL | 2022-01-20 |
803 | Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English. |
Zhuoyuan Mao; Chenhui Chu; Sadao Kurohashi; | arxiv-cs.CL | 2022-01-20 |
804 | Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource machine translation (NMT): Japanese-specific sequence to sequence (JASS) for … |
Zhuoyuan Mao; Chenhui Chu; S. Kurohashi; | Transactions on Asian and Low-Resource Language Information … | 2022-01-19 |
805 | Syntax-based Data Augmentation for Hungarian-English Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We train Transformer-based neural machine translation models for Hungarian-English and English-Hungarian using the Hunglish2 corpus. |
Attila Nagy; Patrick Nanys; Balázs Frey Konrád; Bence Bial; Judit Ács; | arxiv-cs.CL | 2022-01-18 |
806 | Improving Neural Machine Translation By Denoising Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a simple and effective pretraining strategy, Denoising Training (DoT), for neural machine translation. |
Liang Ding; Keqin Peng; Dacheng Tao; | arxiv-cs.CL | 2022-01-18 |
807 | Klexikon: A German Dataset for Joint Summarization and Simplification IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Simultaneously, resources for non-English languages are scarce in general and prohibitive for training new solutions. To tackle this problem, we pose core requirements for a system that can jointly summarize and simplify long source documents. |
Dennis Aumiller; Michael Gertz; | arxiv-cs.CL | 2022-01-18 |
808 | Improved Unsupervised Neural Machine Translation with Semantically Weighted Back Translation for Morphologically Rich and Low Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View |
S. Chauhan; Shefali Saxena; Philemon Daniel; | Neural Processing Letters | 2022-01-10 |
809 | Towards The Next 1000 Languages in Multilingual Machine Translation: Exploring The Synergy Between Supervised and Self-Supervised Learning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we present a pragmatic approach towards building a multilingual MT model that covers hundreds of languages, using a mixture of supervised and self-supervised objectives, depending on the data availability for different language pairs. |
ADITYA SIDDHANT et. al. | arxiv-cs.CL | 2022-01-09 |
810 | An Ensemble Approach to Acronym Extraction Using Transformers Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper discusses an ensemble approach for the task of Acronym Extraction, which utilises two different methods to extract acronyms and their corresponding long forms. |
Prashant Sharma; Hadeel Saadany; Leonardo Zilio; Diptesh Kanojia; Constantin Orăsan; | arxiv-cs.CL | 2022-01-09 |
811 | PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a Phrase-level Adversarial Example Generation (PAEG) framework to enhance the robustness of the translation model. |
JUNCHENG WAN et. al. | arxiv-cs.CL | 2022-01-06 |
812 | Pre-Trained Word Embedding and Language Model Improve Multimodal Machine Translation: A Case Study in Multi30K Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Multimodal machine translation (MMT) is an attractive application of neural machine translation (NMT) that is commonly incorporated with image information. However, the MMT models … |
Tosho Hirasawa; Masahiro Kaneko; Aizhan Imankulova; Mamoru Komachi; | IEEE Access | 2022-01-01 |
813 | The AISP-SJTU Simultaneous Translation System for IWSLT 2022 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes AISP-SJTU’s submissions for the IWSLT 2022 Simultaneous Translation task. We participate in the text-to-text and speech-to-text simultaneous translation from … |
QINPEI ZHU et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
814 | The Xiaomi Text-to-Text Simultaneous Speech Translation System for IWSLT 2022 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This system paper describes the Xiaomi Translation System for the IWSLT 2022 Simultaneous Speech Translation (noted as SST) shared task. We participate in the English-to-Mandarin … |
BAO GUO et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
815 | The USTC-NELSLIP Offline Speech Translation Systems for IWSLT 2022 IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes USTC-NELSLIP’s submissions to the IWSLT 2022 Offline Speech Translation task, including speech translation of talks from English to German, English to Chinese … |
WEITAI ZHANG et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
816 | NVIDIA NeMo Offline Speech Translation Systems for IWSLT 2022 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper provides an overview of NVIDIA NeMo’s speech translation systems for the IWSLT 2022 Offline Speech Translation Task. Our cascade system consists of 1) Conformer RNN-T … |
OLEKSII HRINCHUK et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
817 | Amazon Alexa AI’s System for IWSLT 2022 Offline Speech Translation Shared Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes Amazon Alexa AI’s submission to the IWSLT 2022 Offline Speech Translation Task. Our system is an end-to-end speech translation model that leverages pretrained … |
A. Shanbhogue; Ran Xue; Ching-Yun Chang; Sarah Campbell; | International Workshop on Spoken Language Translation | 2022-01-01 |
818 | The NiuTrans’s Submission to The IWSLT22 English-to-Chinese Offline Speech Translation Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes NiuTrans’s submission to the IWSLT22 English-to-Chinese (En-Zh) offline speech translation task. The end-to-end and bilingual system is built by constrained … |
YUHAO ZHANG et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
819 | SEScore2: Retrieval Augmented Pretraining for Text Generation Evaluation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Is it possible to leverage large scale raw and raw parallel corpora to build a general learned metric? Existing learned metrics have gaps to human judgements, are model-dependent … |
Wenda Xu; Xian Qian; Mingxuan Wang; Lei Li; William Yang Wang; | ArXiv | 2022-01-01 |
820 | The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents our work in the participation of IWSLT 2022 simultaneous speech translation evaluation. For the track of text-to-text (T2T), we participate in three language … |
MINGHAN WANG et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
821 | Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022 IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: This paper describes the submissions of the UPC Machine Translation group to the IWSLT 2022 Offline Speech Translation and Speech-to-Speech Translation tracks. The offline task … |
Ioannis Tsiamas; Gerard I. Gállego; Carlos Escolano; José A. R. Fonollosa; M. Costa-jussà; | International Workshop on Spoken Language Translation | 2022-01-01 |
822 | Word-Region Alignment-Guided Multimodal Neural Machine Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We propose word-region alignment-guided multimodal neural machine translation (MNMT), a novel model for MNMT that links the semantic correlation between textual and visual … |
Yuting Zhao; Mamoru Komachi; Tomoyuki Kajiwara; Chenhui Chu; | IEEE/ACM Transactions on Audio, Speech, and Language … | 2022-01-01 |
823 | HW-TSC’s Participation in The IWSLT 2022 Isometric Spoken Language Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents our submissions to the IWSLT 2022 Isometric Spoken Language Translation task. We participate in all three language pairs (English-German, English-French, … |
ZONGYAO LI et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
824 | CMU’s IWSLT 2022 Dialect Speech Translation System IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes CMU’s submissions to the IWSLT 2022 dialect speech translation (ST) shared task for translating Tunisian-Arabic speech to English text. We use additional … |
BRIAN YAN et. al. | International Workshop on Spoken Language Translation | 2022-01-01 |
825 | JHU IWSLT 2022 Dialect Speech Translation System Description Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper details the Johns Hopkins speech translation (ST) system used in the IWSLT2022 dialect speech translation task. Our system uses a cascade of automatic speech … |
Jinyi Yang; A. Hussein; Matthew Wiesner; S. Khudanpur; | International Workshop on Spoken Language Translation | 2022-01-01 |
826 | On The Robustness of Self-Supervised Representations for Spoken Language Modeling Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Self-supervised representations have been extensively studied for discriminative and generative tasks. However, their robustness capabilities have not been extensively … |
ITAI GAT et. al. | ArXiv | 2022-01-01 |
827 | Structural Supervision for Word Alignment and Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Syntactic structure has long been argued to be potentially useful for enforcing accurate word alignment and improving generalization performance of machine translation. … |
Lei Li; Kai Fan; Hongjian Li; Chun Yuan; | Findings | 2022-01-01 |
828 | A Simple Yet Robust Algorithm for Automatic Extraction of Parallel Sentences: A Case Study on Arabic-English Wikipedia Articles Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Parallel corpora are vital components in several applications of Natural Language Processing (NLP), particularly in machine translation. In this paper, we present a novel method … |
M. Althobaiti; | IEEE Access | 2022-01-01 |
829 | Democratizing Machine Translation with OPUS-MT Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents the OPUS ecosystem with a focus on the development of open machine translation models and tools, and their integration into end-user applications, development … |
J. TIEDEMANN et. al. | ArXiv | 2022-01-01 |
830 | Prompt-Driven Neural Machine Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Neural machine translation (NMT) has obtained significant performance improvement over the recent years. However, NMT models still face various challenges including fragility and … |
Yafu Li; Yongjing Yin; Jing Li; Yue Zhang; | Findings | 2022-01-01 |
831 | Locality-Sensitive Hashing for Long Context Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: After its introduction the Transformer architecture quickly became the gold standard for the task of neural machine translation. A major advantage of the Transformer compared to … |
Frithjof Petrick; Jan Rosendahl; Christian Herold; H. Ney; | International Workshop on Spoken Language Translation | 2022-01-01 |
832 | Simultaneous Neural Machine Translation with Prefix Alignment Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Simultaneous translation is a task that requires starting translation before the speaker has finished speaking, so we face a trade-off between latency and accuracy. In this work, … |
Yasumasa Kano; Katsuhito Sudoh; Satoshi Nakamura; | International Workshop on Spoken Language Translation | 2022-01-01 |
833 | Using Massive Multilingual Pre-Trained Language Models Towards Real Zero-Shot Neural Machine Translation in Clinical Domain Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Massively multilingual pre-trained language models (MMPLMs) are developed in recent years demonstrating super powers and the pre-knowledge they acquire for downstream tasks. In … |
Lifeng Han; G. Erofeev; I. Sorokina; Serge Gladkoff; G. Nenadic; | ArXiv | 2022-01-01 |
834 | An Automatic Post Editing With Efficient and Simple Data Generation Method Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Automatic post-editing (APE) research considers methods for correcting translation results inferred by machine translation systems. The training of APE models, generally require … |
Hyeonseok Moon; Chanjun Park; Jaehyung Seo; Sugyeong Eo; Heuiseok Lim; | IEEE Access | 2022-01-01 |
835 | CHIA: CHoosing Instances to Annotate for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
R. Bhatnagar; Ananya Ganesh; Katharina Kann; | Conference on Empirical Methods in Natural Language … | 2022-01-01 |
836 | Integrating Prior Translation Knowledge Into Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation (NMT), which is an encoder-decoder joint neural language model with an attention mechanism, has achieved impressive results on various machine … |
Kehai Chen; Rui Wang; M. Utiyama; E. Sumita; | IEEE/ACM Transactions on Audio, Speech, and Language … | 2022-01-01 |
837 | Using Neural Machine Translation Methods for Sign Language Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: We examine methods and techniques, proven to be helpful for the text-to-text translation of spoken languages in the context of gloss-to-text translation systems, where the glosses … |
G. Angelova; Eleftherios Avramidis; Sebastian Möller; | Annual Meeting of the Association for Computational … | 2022-01-01 |
838 | A Natural Diet: Towards Improving Naturalness of Machine Translation Output IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation (MT) evaluation often focuses on accuracy and fluency, without paying much attention to translation style. This means that, even when considered accurate and … |
Markus Freitag; David Vilar; David Grangier; Colin Cherry; George F. Foster; | Findings | 2022-01-01 |
839 | Lossless Speedup of Autoregressive Translation with Generalized Aggressive Decoding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we propose Generalized Aggressive Decoding (GAD) – a novel decoding paradigm for speeding up autoregressive translation without quality loss, through the … |
Heming Xia; Tao Ge; Furu Wei; Zhifang Sui; | ArXiv | 2022-01-01 |
840 | Machine Translation of English Speech: Comparison of Multiple Algorithms Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In order to improve the efficiency of the English translation, machine translation is gradually and widely used. This study briefly introduces the neural network algorithm for … |
Yijun Wu; Yonghong Qin; | Journal of Intelligent Systems | 2022-01-01 |
841 | Hierarchical Multi-task Learning Framework for Isometric-Speech Language Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: This paper presents our submission for the shared task on isometric neural machine translation in International Conference on Spoken Language Translation (IWSLT). There are … |
Aakash Bhatnagar; Nidhir Bhavsar; Muskaan Singh; P. Motlícek; | International Workshop on Spoken Language Translation | 2022-01-01 |
842 | Automatic Generation of Distractors for Fill-in-the-Blank Exercises with Round-Trip Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In a fill-in-the-blank exercise, a student is presented with a carrier sentence with one word hidden, and a multiple-choice list that includes the correct answer and several … |
Subhadarshi Panda; Frank Palma Gomez; Michael Flor; Alla Rozovskaya; | Annual Meeting of the Association for Computational … | 2022-01-01 |
843 | Bridging The Gap Between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Diverse NMT aims at generating multiple diverse yet faithful translations given a source sentence. In this paper, we investigate a common shortcoming in existing diverse NMT … |
HUAN LIN et. al. | NAACL-HLT | 2022-01-01 |
844 | How Do Lexical Semantics Affect Translation? An Empirical Study Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Here, we investigate these relationships on a variety of low-resource language pairs from the OpenSubtitles2016 database, where the source language is English, and find that the more similar the target language is to English, the greater the translation performance. |
Vivek Subramanian; Dhanasekar Sundararaman; | arxiv-cs.CL | 2021-12-31 |
845 | ViNMT: Neural Machine Translation Toolkit Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present an open-source toolkit for neural machine translation (NMT). |
NGUYEN HOANG QUAN et. al. | arxiv-cs.CL | 2021-12-30 |
846 | Pirá: A Bilingual Portuguese-English Dataset for Question-Answering About The Ocean Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents the Pirá dataset, a large set of questions and answers about the ocean and the Brazilian coast both in Portuguese and English. |
ANDRÉ F. A. PASCHOAL et. al. | cikm | 2021-12-30 |
847 | QEMind: Alibaba’s Submission to The WMT21 Quality Estimation Shared Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present our submissions to the WMT 2021 QE shared task and an extensive set of experimental results have shown us that our multilingual systems outperform the best system in the Direct Assessment QE task of WMT 2020. |
JIAYI WANG et. al. | arxiv-cs.CL | 2021-12-29 |
848 | Model and Verification of Medical English Machine Translation Based on Optimized Generalized Likelihood Ratio Algorithm Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Phrase identification plays an important role in medical English machine translation. However, the phrases in medical English are complicated in internal structure and semantic … |
YU Peng; Youyu Zhu; | J. Sensors | 2021-12-28 |
849 | A Preordered RNN Layer Boosts Neural Machine Translation in Low Resource Settings Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose to augment attention based neural network with reordering information to alleviate the lack of data. |
Mohaddeseh Bastan; Shahram Khadivi; | arxiv-cs.CL | 2021-12-27 |
850 | HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professional Post-Editing Towards More Effective MT Evaluation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce HOPE, a task-oriented and human-centric evaluation framework for machine translation output based on professional post-editing annotations. |
Serge Gladkoff; Lifeng Han; | arxiv-cs.CL | 2021-12-27 |
851 | Improved Neural Machine Translation for Low-resource English-Assamese Pair Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Language translation is essential to bring the world closer and plays a significant part in building a community among people of different linguistic backgrounds. Machine … |
Sahinur Rahman Laskar; Abdullah Faiz Ur Rahman Khilji; Partha Pakray; Sivaji Bandyopadhyay; | J. Intell. Fuzzy Syst. | 2021-12-22 |
852 | Learning and Analyzing Generation Order for Undirected Sequence Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we train a policy that learns the generation order for a pre-trained, undirected translation model via reinforcement learning. |
Yichen Jiang; Mohit Bansal; | arxiv-cs.CL | 2021-12-16 |
853 | Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we demonstrate the use of cross-lingual word embeddings for detecting cognates among fourteen Indian Languages. |
DIPTESH KANOJIA et. al. | arxiv-cs.CL | 2021-12-16 |
854 | Isometric MT: Neural Machine Translation for Automatic Dubbing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work introduces a self-learning approach that allows a transformer model to directly learn to generate outputs that closely match the source length, in short Isometric MT. In particular, our approach does not require to generate multiple hypotheses nor any auxiliary ranking function. |
Surafel M. Lakew; Yogesh Virkar; Prashant Mathur; Marcello Federico; | arxiv-cs.CL | 2021-12-16 |
855 | Lesan — Machine Translation for Low Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Lesan, an MT system for low resource languages. |
Asmelash Teka Hadgu; Abel Aregawi; Adam Beaudoin; | arxiv-cs.CL | 2021-12-15 |
856 | Isochrony-Aware Neural Machine Translation for Automatic Dubbing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose implicit and explicit modeling approaches to integrate isochrony information into neural machine translation. |
Derek Tam; Surafel M. Lakew; Yogesh Virkar; Prashant Mathur; Marcello Federico; | arxiv-cs.CL | 2021-12-15 |
857 | Improving Both Domain Robustness and Domain Adaptability in Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel approach, RMLNMT (Robust Meta-Learning Framework for Neural Machine Translation Domain Adaptation), which improves the robustness of existing meta-learning models. |
Wen Lai; Jindřich Libovický; Alexander Fraser; | arxiv-cs.CL | 2021-12-15 |
858 | Enhancing Lexical Translation Consistency for Document-Level Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Document-level neural machine translation (DocNMT) has yielded attractive improvements. In this article, we systematically analyze the discourse phenomena in Chinese-to-English … |
Xiaomian Kang; Yang Zhao; Jiajun Zhang; Chengqing Zong; | Transactions on Asian and Low-Resource Language Information … | 2021-12-14 |
859 | Step-unrolled Denoising Autoencoders for Text Generation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper we propose a new generative model of text, Step-unrolled Denoising Autoencoder (SUNDAE), that does not rely on autoregressive models. |
Nikolay Savinov; Junyoung Chung; Mikolaj Binkowski; Erich Elsen; Aaron van den Oord; | arxiv-cs.CL | 2021-12-13 |
860 | Communication-Efficient Federated Learning for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we explore how to efficiently build NMT models in an FL setup by proposing a novel solution. |
Tanya Roosta; Peyman Passban; Ankit Chadha; | arxiv-cs.CL | 2021-12-11 |
861 | Selecting Parallel In-domain Sentences for Neural Machine Translation Using Monolingual Texts Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Specific use-cases are usually left out, since generic models tend to perform poorly in domain-specific cases. |
Javad Pourmostafa Roshan Sharami; Dimitar Shterionov; Pieter Spronck; | arxiv-cs.CL | 2021-12-11 |
862 | Post-Editese in Literary Translations Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In the present study, we investigate the post-editese phenomenon, i.e., the unique features that set machine translated post-edited texts apart from human-translated texts. We use … |
Sheila Castilho; Natália Resende; | Inf. | 2021-12-08 |
863 | Multitask Finetuning for Improving Neural Machine Translation in Indian Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a Multitask Finetuning methodology which combines the Bilingual Machine Translation task with an auxiliary Causal Language Modeling task to improve performance on the former task on Indian Languages. |
Shaily Desai; Atharva Kshirsagar; Manisha Marathe; | arxiv-cs.CL | 2021-12-03 |
864 | Translating Politeness Across Cultures: Case of Hindi and English Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a corpus based study of politeness across two languages-English and Hindi. |
Ritesh Kumar; Girish Nath Jha; | arxiv-cs.CL | 2021-12-03 |
865 | A Transformer-based Approach for Translating Natural Language to Bash Commands Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: This paper explores the translation of natural language into Bash Commands, which developers commonly use to accomplish command-line tasks in a terminal. In our approach a … |
Quchen Fu; Zhongwei Teng; Jules White; Douglas C. Schmidt; | 2021 20th IEEE International Conference on Machine Learning … | 2021-12-01 |
866 | Improvement in Machine Translation with Generative Adversarial Networks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we explore machine translation improvement via Generative Adversarial Network (GAN) architecture. |
Jay Ahn; Hari Madhu; Viet Nguyen; | arxiv-cs.CL | 2021-11-30 |
867 | Ensembling of Distilled Models from Multi-task Teachers for Constrained Resource Language Pairs Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes our submission to the constrained track of WMT21 shared news translation task. |
AMR HENDY et. al. | arxiv-cs.CL | 2021-11-25 |
868 | Deps-SAN: Neural Machine Translation with Dependency-Scaled Self-Attention Network Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we propose a parameter-free, Dependency-scaled Self-Attention Network (Deps-SAN) for syntax-aware Transformer-based NMT. |
RU PENG et. al. | arxiv-cs.CL | 2021-11-23 |
869 | A Parallel Corpora for Bi-directional Neural Machine Translation for Low Resourced Ethiopian Languages Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we described an effort towards the development of parallel corpora for English and Ethiopian Languages, such as Wolaita, Gamo, Gofa, and Dawuro neural machine … |
Atnafu Lambebo Tonja; Michael Melese Woldeyohannis; Mesay Gemeda Yigezu; | 2021 International Conference on Information and … | 2021-11-22 |
870 | Recent Advances in Dialogue Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recent years have seen a surge of interest in dialogue translation, which is a significant application task for machine translation (MT) technology. However, this has so far not … |
Siyou Liu; Yuqi Sun; Longyue Wang; | Inf. | 2021-11-22 |
871 | Multilingual Neural Machine Translation for Low Resourced Languages: Ometo-English IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Unlike technologically favored languages, under-resourced languages highly suffer from the lack of language resources for machine translation. In this paper, we present a new … |
Mesay Gemeda Yigezu; Michael Melese Woldeyohannis; A. Tonja; | 2021 International Conference on Information and … | 2021-11-22 |
872 | R-Drop: Regularized Dropout for Neural Networks IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a simple consistency training strategy to regularize dropout, namely R-Drop, which forces the output distributions of different sub models generated by dropout to be consistent with each other. |
XIAOBO LIANG et. al. | nips | 2021-11-20 |
873 | BARTScore: Evaluating Generated Text As Text Generation IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we conceptualize the evaluation of generated text as a text generation problem, modeled using pre-trained sequence-to-sequence models. |
Weizhe Yuan; Graham Neubig; Pengfei Liu; | nips | 2021-11-20 |
874 | Khmer Speech Translation Corpus of The Extraordinary Chambers in The Courts of Cambodia (ECCC) Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Speech translation (ST) is a subject of rapidly increasing interest in the area of speech processing research. This interest is apparent from the increasing tools and corpora for … |
KAK SOKY et. al. | 2021 24th Conference of the Oriental COCOSDA International … | 2021-11-18 |
875 | XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents XLS-R, a large-scale model for cross-lingual speech representation learning based on wav2vec 2.0. |
ARUN BABU et. al. | arxiv-cs.CL | 2021-11-17 |
876 | NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper provides an overview of NVIDIA NeMo’s neural machine translation systems for the constrained data track of the WMT21 News and Biomedical Shared Translation Tasks. |
Sandeep Subramanian; Oleksii Hrinchuk; Virginia Adams; Oleksii Kuchaiev; | arxiv-cs.CL | 2021-11-16 |
877 | Measuring Uncertainty in Translation Quality Evaluation (TQE) IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The methodology we applied for this work is from Bernoulli Statistical Distribution Modelling (BSDM) and Monte Carlo Sampling Analysis (MCSA). |
Serge Gladkoff; Irina Sorokina; Lifeng Han; Alexandra Alekseeva; | arxiv-cs.CL | 2021-11-15 |
878 | Sign Language Translation with Hierarchical Spatio-Temporal Graph Neural Network Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, in this paper, these unique characteristics of sign languages are formulated as hierarchical spatio-temporal graph representations, including high-level and fine-level graphs of which a vertex characterizes a specified body part and an edge represents their interactions. |
JICHAO KAN et. al. | arxiv-cs.CV | 2021-11-14 |
879 | BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In our work, we propose instead, to refine the mined bitexts via automatic editing: given a sentence in a language xf, and a possibly imperfect translation of it xe, our model generates a revised version xf’ or xe’ that yields a more equivalent translation pair (i.e., |
Eleftheria Briakou; Sida I. Wang; Luke Zettlemoyer; Marjan Ghazvininejad; | arxiv-cs.CL | 2021-11-12 |
880 | Developing Neural Machine Translation Models for Hungarian-English Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: I propose 5 different augmentation methods that are structure-aware, meaning that instead of randomly selecting words for blanking or replacement, the dependency tree of sentences is used as a basis for augmentation. |
Attila Nagy; | arxiv-cs.CL | 2021-11-07 |
881 | Don’t Go Far Off: An Empirical Study on Neural Poetry Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present an empirical investigation for poetry translation along several dimensions: 1) size and style of training data (poetic vs. non-poetic), including a zero-shot setup; 2) bilingual vs. multilingual learning; and 3) language-family-specific models vs. mixed-language-family models. |
Tuhin Chakrabarty; Arkadiy Saakyan; Smaranda Muresan; | emnlp | 2021-11-05 |
882 | Enlivening Redundant Heads in Multi-head Self-attention for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a redundant head enlivening (RHE) method to precisely identify redundant heads, and then vitalize their potential by learning syntactic relations and prior knowledge in the text without sacrificing the roles of important heads. |
Tianfu Zhang; Heyan Huang; Chong Feng; Longbing Cao; | emnlp | 2021-11-05 |
883 | Improving The Quality Trade-Off for Neural Machine Translation Multi-Domain Adaptation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We study this problem in an adaptation setting where the goal is to preserve the existing system quality while incorporating data for domains that were not the focus of the original translation system. |
EVA HASLER et. al. | emnlp | 2021-11-05 |
884 | Cross Attention Augmented Transducer Networks for Simultaneous Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper proposes a novel architecture, Cross Attention Augmented Transducer (CAAT), for simultaneous translation. |
Dan Liu; Mengge Du; Xiaoxi Li; Ya Li; Enhong Chen; | emnlp | 2021-11-05 |
885 | Rule-based Morphological Inflection Improves Neural Terminology Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a modular framework for incorporating lemma constraints in neural MT (NMT) in which linguistic knowledge and diverse types of NMT models can be flexibly applied. |
Weijia Xu; Marine Carpuat; | emnlp | 2021-11-05 |
886 | Learning to Rewrite for Non-Autoregressive Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose an architecture named RewriteNAT to explicitly learn to rewrite the erroneous translation pieces. |
Xinwei Geng; Xiaocheng Feng; Bing Qin; | emnlp | 2021-11-05 |
887 | Cross-Attention Is All You Need: Adapting Pretrained Transformers for Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We study the power of cross-attention in the Transformer architecture within the context of transfer learning for machine translation, and extend the findings of studies into cross-attention when training from scratch. |
Mozhdeh Gheini; Xiang Ren; Jonathan May; | emnlp | 2021-11-05 |
888 | A Generative Framework for Simultaneous Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a generative framework for simultaneous machine translation. |
Yishu Miao; Phil Blunsom; Lucia Specia; | emnlp | 2021-11-05 |
889 | Zero-Shot Information Extraction As A Unified Text-to-Triple Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We cast a suite of information extraction tasks into a text-to-triple translation framework. |
CHENGUANG WANG et. al. | emnlp | 2021-11-05 |
890 | Vision Matters When It Should: Sanity Checking Multimodal Machine Translation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a qualitative study that examines the role of datasets in stimulating the leverage of visual modality and we propose methods to highlight the importance of visual signals in the datasets which demonstrate improvements in reliance of models on the source images. |
Jiaoda Li; Duygu Ataman; Rico Sennrich; | emnlp | 2021-11-05 |
891 | Wikily Supervised Neural Translation Tailored to Cross-Lingual Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a simple but effective approach for leveraging Wikipedia for neural machine translation as well as cross-lingual tasks of image captioning and dependency parsing without using any direct supervision from external parallel data or supervised models in the target language. |
Mohammad Sadegh Rasooli; Chris Callison-Burch; Derry Tanti Wijaya; | emnlp | 2021-11-05 |
892 | AligNART: Non-autoregressive Neural Machine Translation By Jointly Learning to Estimate Alignment and Translate IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce AligNART, which leverages full alignment information to explicitly reduce the modality of the target distribution. |
Jongyoon Song; Sungwon Kim; Sungroh Yoon; | emnlp | 2021-11-05 |
893 | AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To tackle these challenges, we propose AfroMT, a standardized, clean, and reproducible machine translation benchmark for eight widely spoken African languages. |
Machel Reid; Junjie Hu; Graham Neubig; Yutaka Matsuo; | emnlp | 2021-11-05 |
894 | Neural Machine Translation Quality and Post-Editing Performance IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Across all models, we found that better MT systems indeed lead to fewer changes in the sentences in this industry setting. |
Vilém Zouhar; Martin Popel; Ondřej Bojar; Aleš Tamchyna; | emnlp | 2021-11-05 |
895 | I Wish I Would Have Loved This One, But I Didn’t – A Multilingual Dataset for Counterfactual Detection in Product Review Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We consider the problem of counterfactual detection (CFD) in product reviews. |
James O’Neill; Polina Rozenshtein; Ryuichi Kiryo; Motoko Kubota; Danushka Bollegala; | emnlp | 2021-11-05 |
896 | Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose heterogeneous ways of embedding topic information at the sentence level into an NMT model to improve translation performance. |
Weixuan Wang; Wei Peng; Meng Zhang; Qun Liu; | emnlp | 2021-11-05 |
897 | HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To improve effectiveness of the available BT data, we introduce HintedBT, a family of techniques which provides hints (through tags) to the encoder and decoder. |
Sahana Ramnath; Melvin Johnson; Abhirut Gupta; Aravindan Raghuveer; | emnlp | 2021-11-05 |
898 | GFST: Gender-Filtered Self-Training for More Accurate Gender in Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose gender-filtered self-training (GFST) to improve gender translation accuracy on unambiguously gendered inputs. |
Prafulla Kumar Choubey; Anna Currey; Prashant Mathur; Georgiana Dinu; | emnlp | 2021-11-05 |
899 | Towards Making The Most of Dialogue Characteristics for Neural Chat Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose to promote the chat translation by introducing the modeling of dialogue characteristics into the NCT model. |
YUNLONG LIANG et. al. | emnlp | 2021-11-05 |
900 | Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a novel supervised learning approach for training an agent that can detect the minimum number of reads required for generating each target token by comparing simultaneous translations against full-sentence translations during training to generate oracle action sequences. |
Ashkan Alinejad; Hassan S. Shavarani; Anoop Sarkar; | emnlp | 2021-11-05 |
901 | One Source, Two Targets: Challenges and Rewards of Dual Decoding Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we consider a stronger requirement: to jointly generate two texts so that each output side effectively depends on the other. |
Jitao Xu; François Yvon; | emnlp | 2021-11-05 |
902 | Encouraging Lexical Translation Consistency for Document-Level Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we apply one translation per discourse in NMT, and aim to encourage lexical translation consistency for document-level NMT. |
Xinglin Lyu; Junhui Li; Zhengxian Gong; Min Zhang; | emnlp | 2021-11-05 |
903 | Improving Neural Machine Translation By Bidirectional Training IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a simple and effective pretraining strategy – bidirectional training (BiT) for neural machine translation. |
Liang Ding; Di Wu; Dacheng Tao; | emnlp | 2021-11-05 |
904 | Generalised Unsupervised Domain Adaptation of Neural Machine Translation with Cross-Lingual Data Selection Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a cross-lingual data selection method to extract in-domain sentences in the missing language side from a large generic monolingual corpus. |
Thuy-Trang Vu; Xuanli He; Dinh Phung; Gholamreza Haffari; | emnlp | 2021-11-05 |
905 | An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we investigate the benefits of an explicit alignment to language labels in Transformer-based MNMT models in the zero-shot context, by jointly training one cross attention head with word alignment supervision to stress the focus on the target language label. |
Alessandro Raganato; Raúl Vázquez; Mathias Creutz; Jörg Tiedemann; | emnlp | 2021-11-05 |
906 | Wino-X: Multilingual Winograd Schemas for Commonsense Reasoning and Coreference Resolution IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work presents Wino-X, a parallel dataset of German, French, and Russian schemas, aligned with their English counterparts. |
Denis Emelin; Rico Sennrich; | emnlp | 2021-11-05 |
907 | Cross-lingual Intermediate Fine-tuning Improves Dialogue State Tracking IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we enhance the transfer learning process by intermediate fine-tuning of pretrained multilingual models, where the multilingual models are fine-tuned with different but related data and/or tasks. |
Nikita Moghe; Mark Steedman; Alexandra Birch; | emnlp | 2021-11-05 |
908 | It Is Not As Good As You Think! Evaluating Simultaneous Machine Translation on Interpretation Data Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In the absence of interpretation training data, we propose a translation-to-interpretation (T2I) style transfer method which allows converting existing offline translations into interpretation-style data, leading to up to 2.8 BLEU improvement. |
Jinming Zhao; Philip Arthur; Gholamreza Haffari; Trevor Cohn; Ehsan Shareghi; | emnlp | 2021-11-05 |
909 | Recurrent Attention for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we push further in this research line and propose a novel substitute mechanism for self-attention: Recurrent AtteNtion (RAN). |
Jiali Zeng; Shuangzhi Wu; Yongjing Yin; Yufan Jiang; Mu Li; | emnlp | 2021-11-05 |
910 | Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose an approach, MultiUAT, that dynamically adjusts the training data usage based on the model’s uncertainty on a small set of trusted clean data for multi-corpus machine translation. |
MINGHAO WU et. al. | emnlp | 2021-11-05 |
911 | XLEnt: Mining A Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this, we propose Lexical-Semantic-Phonetic Align (LSP-Align), a technique to automatically mine cross-lingual entity lexica from mined web data. |
Ahmed El-Kishky; Adithya Renduchintala; James Cross; Francisco Guzmán; Philipp Koehn; | emnlp | 2021-11-05 |
912 | Translating Headers of Tabular Data: A Pilot Study of Schema Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To facilitate the research study, we construct the first parallel dataset for schema translation, which consists of 3,158 tables with 11,979 headers written in 6 different languages, including English, Chinese, French, German, Spanish, and Japanese. Also, we propose the first schema translation model called CAST, which is a header-to-header neural machine translation model augmented with schema context. |
Kunrui Zhu; Yan Gao; Jiaqi Guo; Jian-Guang Lou; | emnlp | 2021-11-05 |
913 | Language Modeling, Lexical Translation, Reordering: The Training Process of NMT Through The Lens of Classical SMT IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we look at the competences related to three core SMT components and find that during training, NMT first focuses on learning target-side language modeling, then improves translation quality approaching word-by-word translation, and finally learns more complicated reordering patterns. |
Elena Voita; Rico Sennrich; Ivan Titov; | emnlp | 2021-11-05 |
914 | Controlling Machine Translation for Multiple Attributes with Additive Interventions IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We address these problems by introducing vector-valued interventions which allow for fine-grained control over multiple attributes simultaneously via a weighted linear combination of the corresponding vectors. |
Andrea Schioppa; David Vilar; Artem Sokolov; Katja Filippova; | emnlp | 2021-11-05 |
915 | Document Graph for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this issue, we hypothesize that a document can be represented as a graph that connects relevant contexts regardless of their distances. |
Mingzhou Xu; Liangyou Li; Derek F. Wong; Qun Liu; Lidia S. Chao; | emnlp | 2021-11-05 |
916 | PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce a high-quality and large-scale Vietnamese-English parallel dataset of 3.02M sentence pairs, which is 2.9M pairs larger than the benchmark Vietnamese-English machine translation corpus IWSLT15. |
Long Doan; Linh The Nguyen; Nguyen Luong Tran; Thai Hoang; Dat Quoc Nguyen; | emnlp | 2021-11-05 |
917 | Mutual-Learning Improves End-to-End Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose an alternative: a trainable mutual-learning scenario, where the MT and the ST models are collaboratively trained and are considered as peers, rather than teacher/student. |
Jiawei Zhao; Wei Luo; Boxing Chen; Andrew Gilman; | emnlp | 2021-11-05 |
918 | MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we improve multilingual text-to-text transfer Transformer with translation pairs (mT6). |
ZEWEN CHI et. al. | emnlp | 2021-11-05 |
919 | Unsupervised Neural Machine Translation with Universal Grammar Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, in this paper, we seek to leverage such shared grammar clues to provide more explicit language parallel signals to enhance the training of unsupervised machine translation models. |
Zuchao Li; Masao Utiyama; Eiichiro Sumita; Hai Zhao; | emnlp | 2021-11-05 |
920 | Reducing The Impact of Out of Vocabulary Words in The Translation of Natural Language Questions Into SPARQL Queries Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we combine Named Entity Linking, Named Entity Recognition, and Neural Machine Translation to perform automatic translation of natural language questions into SPARQL queries. |
Manuel A. Borroto Santana; Francesco Ricca; Bernardo Cuteri; | arxiv-cs.CL | 2021-11-04 |
921 | Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report describes Microsoft’s machine translation systems for the WMT21 shared task on large-scale multilingual machine translation. |
JIAN YANG et. al. | arxiv-cs.CL | 2021-11-03 |
922 | Lingua Custodia’s Participation at The WMT 2021 Machine Translation Using Terminologies Shared Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes Lingua Custodia’s submission to the WMT21 shared task on machine translation using terminologies. |
Melissa Ailem; Jinghsu Liu; Raheel Qader; | arxiv-cs.CL | 2021-11-03 |
923 | Contextual Semantic Parsing for Multilingual Task-Oriented Dialogues Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose automatic translation of dialogue datasets with alignment to ensure faithful translation of slot values and eliminate costly human supervision used in previous benchmarks. |
Mehrad Moradshahi; Victoria Tsai; Giovanni Campagna; Monica S. Lam; | arxiv-cs.CL | 2021-11-03 |
924 | Automated Testing for Machine Translation Via Constituency Invariance Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the development of deep neural networks, machine translation has achieved significant progress and integrated with people’s daily lives to assist in various tasks. However, … |
Pin Ji; Yang Feng; Jia Liu; Zhihong Zhao; Baowen Xu; | 2021 36th IEEE/ACM International Conference on Automated … | 2021-11-01 |
925 | How Should Human Translation Coexist with NMT? Efficient Tool for Building High Quality Parallel Corpus Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a tool for efficiently constructing high-quality parallel corpora while minimizing human labor, and makes this tool publicly available. |
CHANJUN PARK et. al. | arxiv-cs.CL | 2021-10-30 |
926 | Simultaneous Neural Machine Translation with Constituent Label Prediction Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Motivated by the concept of pre-reordering, we propose a couple of simple decision rules using the label of the next constituent predicted by incremental constituent label prediction. |
Yasumasa Kano; Katsuhito Sudoh; Satoshi Nakamura; | arxiv-cs.CL | 2021-10-26 |
927 | Assessing Evaluation Metrics for Speech-to-Speech Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Speech-to-speech translation combines machine translation with speech synthesis, introducing evaluation challenges not present in either task alone. How to automatically evaluate … |
Elizabeth Salesky; Julian Mäder; Severin Klinger; | arxiv-cs.CL | 2021-10-26 |
928 | Noisy UGC Translation at The Character Level: Revisiting Open-Vocabulary Capabilities and Robustness of Char-Based Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work explores the capacities of character-based Neural Machine Translation to translate noisy User-Generated Content (UGC) with a strong focus on exploring the limits of such approaches to handle productive UGC phenomena, which almost by definition, cannot be seen at training time. |
José Carlos Rosales Núñez; Guillaume Wisniewski; Djamé Seddah; | arxiv-cs.CL | 2021-10-24 |
929 | Discontinuous Grammar As A Foreign Language Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To close the gap, we here extend the framework of sequence-to-sequence models for constituent parsing, not only by providing a more powerful neural architecture for improving their performance, but also by enlarging their coverage to handle the most complex syntactic phenomena: discontinuous structures. |
Daniel Fernández-González; Carlos Gómez-Rodríguez; | arxiv-cs.CL | 2021-10-20 |
930 | Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we thus propose an algorithm to reorder and refine the target side of a full sentence translation corpus, so that the words/phrases between the source and target sentences are aligned largely monotonically, using word alignment and non-autoregressive neural machine translation. |
HYOJUNG HAN et. al. | arxiv-cs.CL | 2021-10-18 |
931 | The Arabic Parallel Gender Corpus 2.0: Extensions and Analyses Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a new corpus for gender identification and rewriting in contexts involving one or two target users (I and/or You) — first and second grammatical persons with independent grammatical gender preferences. |
Bashar Alhafni; Nizar Habash; Houda Bouamor; | arxiv-cs.CL | 2021-10-18 |
932 | Towards Making The Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper demonstrates that multilingual pretraining and multilingual fine-tuning are both critical for facilitating cross-lingual transfer in zero-shot translation, where the neural machine translation (NMT) model is tested on source languages unseen during supervised training. Following this idea, we present SixT+, a strong many-to-English NMT model that supports 100 source languages but is trained with a parallel dataset in only six source languages. |
GUANHUA CHEN et. al. | arxiv-cs.CL | 2021-10-16 |
933 | Non-Autoregressive Translation with Layer-Wise Prediction and Deep Supervision IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose DSLP, a highly efficient and high-performance model for machine translation. |
Chenyang Huang; Hao Zhou; Osmar R. Zaïane; Lili Mou; Lei Li; | arxiv-cs.CL | 2021-10-14 |
934 | Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose Decision Attentive Regularization (DAR) to improve the decision policy of SimulST systems by using the simultaneous text-to-text translation (SimulMT) task. |
Mohd Abbas Zaidi; Beomseok Lee; Sangha Kim; Chanwoo Kim; | arxiv-cs.SD | 2021-10-13 |
935 | Evaluation of Abstractive Summarisation Models with Machine Translation in Deliberative Processes Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present work on summarising deliberative processes for non-English languages. |
M. Arana-Catania; Rob Procter; Yulan He; Maria Liakata; | arxiv-cs.CL | 2021-10-12 |
936 | Unsupervised Neural Machine Translation with Generative Language Models Only IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show how to derive state-of-the-art unsupervised neural machine translation systems from generatively pre-trained language models. |
JESSE MICHAEL HAN et. al. | arxiv-cs.CL | 2021-10-11 |
937 | Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To generate such samples, we propose a doubly-trained architecture that pairs two NMT models of opposite translation directions with a joint loss function, which combines the target-side attack and the source-side semantic similarity constraint. |
Weiting Tan; Shuoyang Ding; Huda Khayrallah; Philipp Koehn; | arxiv-cs.CL | 2021-10-11 |
938 | Using Document Similarity Methods to Create Parallel Datasets for Code Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose to use document similarity methods to create noisy parallel datasets of code, thus enabling supervised techniques to be applied for automated code translation without having to rely on the availability or expensive curation of parallel code datasets. |
MAYANK AGARWAL et. al. | arxiv-cs.CL | 2021-10-11 |
939 | Machine Translation Verbosity Control for Automatic Dubbing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we focus on the problem of controlling the verbosity of machine translation output, so that subsequent steps of our automatic dubbing pipeline can generate dubs of better quality. |
SURAFEL M. LAKEW et. al. | arxiv-cs.CL | 2021-10-07 |
940 | Sequence-to-Sequence Lexical Normalization with Multilingual Transformers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a sentence-level sequence-to-sequence model based on mBART, which frames the problem as a machine translation problem. |
Ana-Maria Bucur; Adrian Cosma; Liviu P. Dinu; | arxiv-cs.CL | 2021-10-06 |
941 | The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we instead consider the impact of compression in a data-limited regime. |
Orevaoghene Ahia; Julia Kreutzer; Sara Hooker; | arxiv-cs.CL | 2021-10-06 |
942 | On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a methodology that explores how sentence structure is reflected in neural representations of machine translation systems. |
Gal Patel; Leshem Choshen; Omri Abend; | arxiv-cs.CL | 2021-10-06 |
943 | On The Complementarity Between Pre-Training and Back-Translation for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce two probing tasks for PT and BT respectively and find that PT mainly contributes to the encoder module while BT brings more benefits to the decoder. |
XUEBO LIU et. al. | arxiv-cs.CL | 2021-10-05 |
944 | Sentiment-Aware Measure (SAM) for Evaluating Sentiment Transfer By Machine Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a numerical ‘sentiment-closeness’ measure appropriate for assessing the accuracy of a translated affect message in UGC text by an MT system. |
Hadeel Saadany; Constantin Orasan; Emad Mohamed; Ashraf Tantawy; | arxiv-cs.CL | 2021-09-30 |
945 | BLEU, METEOR, BERTScore: Evaluation of Metrics Performance in Assessing Critical Translation Errors in Sentiment-oriented Text Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we assess the ability of automatic quality metrics to detect critical machine translation errors which can cause serious misunderstanding of the affect message. |
Hadeel Saadany; Constantin Orasan; | arxiv-cs.CL | 2021-09-29 |
946 | EdinSaar@WMT21: North-Germanic Low-Resource Multilingual NMT Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe the EdinSaar submission to the shared task of Multilingual Low-Resource Translation for North Germanic Languages at the Sixth Conference on Machine Translation (WMT2021). |
Svetlana Tchistiakova; Jesujoba Alabi; Koel Dutta Chowdhury; Sourav Dutta; Dana Ruiter; | arxiv-cs.CL | 2021-09-29 |
947 | Towards Reinforcement Learning for Pivot-based Neural Machine Translation with Non-autoregressive Transformer Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We utilize a non-autoregressive transformer and present an end-to-end pivot-based integrated model, enabling training on source-target data. |
EVGENIIA TOKARCHUK et. al. | arxiv-cs.CL | 2021-09-27 |
948 | Integrated Training for Sequence-to-Sequence Models Using Non-Autoregressive Transformer Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a cascaded model based on the non-autoregressive Transformer that enables end-to-end training without the need for an explicit intermediate representation. |
EVGENIIA TOKARCHUK et. al. | arxiv-cs.CL | 2021-09-27 |
949 | International Sentiment Analysis for News and Blogs IF:5 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: There is a growing interest in mining opinions using sentiment analysis methods from sources such as news, blogs and product reviews. Most of these methods have been developed for … |
Mikhail Bautin; Lohit Vijayarenu; S. Skiena; | Proceedings of the International AAAI Conference on Web and … | 2021-09-25 |
950 | Unsupervised Translation of German–Lower Sorbian: Exploring Training and Novel Transfer Methods on A Low-Resource Language Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes the methods behind the systems submitted by the University of Groningen for the WMT 2021 Unsupervised Machine Translation task for German–Lower Sorbian (DE–DSB): a high-resource language to a low-resource one. |
Lukas Edman; Ahmet Üstün; Antonio Toral; Gertjan van Noord; | arxiv-cs.CL | 2021-09-24 |
951 | Faithful Target Attribute Prediction in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We argue that predicting the target word and attributes simultaneously is an effective way to ensure that translations are more faithful to the training data distribution with respect to these attributes. |
Xing Niu; Georgiana Dinu; Prashant Mathur; Anna Currey; | arxiv-cs.CL | 2021-09-24 |
952 | Exploiting Curriculum Learning in Unsupervised Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To address this problem, we propose a curriculum learning method to gradually utilize pseudo bi-texts based on their quality from multiple granularities. |
Jinliang Lu; Jiajun Zhang; | arxiv-cs.CL | 2021-09-23 |
953 | The Volctrans GLAT System: Non-autoregressive Translation Meets WMT21 IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the Volctrans’ submission to the WMT21 news translation shared task for German→English translation. |
LIHUA QIAN et. al. | arxiv-cs.CL | 2021-09-23 |
954 | The NiuTrans Machine Translation Systems for WMT21 IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. |
SHUHAN ZHOU et. al. | arxiv-cs.CL | 2021-09-21 |
955 | Learning Kernel-Smoothed Machine Translation with Retrieved Examples IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose to learn Kernel-Smoothed Translation with Example Retrieval (KSTER), an effective approach to adapt neural machine translation models online. |
QINGNAN JIANG et. al. | arxiv-cs.CL | 2021-09-21 |
956 | TranslateLocally: Blazing-fast Translation Running on The Local CPU Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To bring control back to the end user and demonstrate speed, we developed translateLocally. |
Nikolay Bogoychev; Jelmer Van der Linde; Kenneth Heafield; | arxiv-cs.CL | 2021-09-21 |
957 | One Source, Two Targets: Challenges and Rewards of Dual Decoding Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we consider a stronger requirement: to jointly generate two texts so that each output side effectively depends on the other. |
Jitao Xu; François Yvon; | arxiv-cs.CL | 2021-09-21 |
958 | CUNI Systems for WMT21: Terminology Translation Shared Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes Charles University submission for Terminology translation Shared Task at WMT21. |
Josef Jon; Michal Novák; João Paulo Aires; Dušan Variš; Ondřej Bojar; | arxiv-cs.CL | 2021-09-20 |
959 | The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents the JHU-Microsoft joint submission for WMT 2021 quality estimation shared task. |
Shuoyang Ding; Marcin Junczys-Dowmunt; Matt Post; Christian Federmann; Philipp Koehn; | arxiv-cs.CL | 2021-09-17 |
960 | Back-translation for Large-Scale Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work aims to build a single multilingual translation system with a hypothesis that a universal cross-language representation leads to better multilingual translation performance. |
Baohao Liao; Shahram Khadivi; Sanjika Hewavitharana; | arxiv-cs.CL | 2021-09-17 |
961 | Miðeind’s WMT 2021 Submission Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Miðeind’s submission for the English→Icelandic and Icelandic→English subsets of the 2021 WMT news translation task. |
Haukur Barri Símonarson; Vésteinn Snæbjarnarson; Pétur Orri Ragnarsson; Haukur Páll Jónsson; Vilhjálmur Þorsteinsson; | arxiv-cs.CL | 2021-09-15 |
962 | Beyond Glass-Box Features: Uncertainty Quantification Enhanced Quality Estimation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we extend the definition of glass-box QE generally to uncertainty quantification with both black-box and glass-box approaches and design several features deduced from them to blaze a new trail in improving QE’s performance. |
KE WANG et. al. | arxiv-cs.CL | 2021-09-15 |
963 | Netmarble AI Center’s WMT21 Automatic Post-Editing Shared Task Submission Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes Netmarble’s submission to WMT21 Automatic Post-Editing (APE) Shared Task for the English-German language pair. |
Shinhyeok Oh; Sion Jang; Hu Xu; Shounan An; Insoo Oh; | arxiv-cs.CL | 2021-09-14 |
964 | Non-Parametric Unsupervised Domain Adaptation for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel framework that directly uses in-domain monolingual sentences in the target language to construct an effective datastore for k-nearest-neighbor retrieval. |
XIN ZHENG et. al. | arxiv-cs.CL | 2021-09-14 |
965 | Contrastive Learning for Context-aware Neural Machine Translation Using Coreference Information Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose CorefCL, a novel data augmentation and contrastive learning scheme based on coreference between the source and contextual sentences. |
Yongkeun Hwang; Hyungu Yun; Kyomin Jung; | arxiv-cs.CL | 2021-09-13 |
966 | Uncertainty-Aware Machine Translation Evaluation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce uncertainty-aware MT evaluation and analyze the trustworthiness of the predicted quality. |
Taisiya Glushkova; Chrysoula Zerva; Ricardo Rei; André F. T. Martins; | arxiv-cs.CL | 2021-09-13 |
967 | Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a universal SiMT model with Mixture-of-Experts Wait-k Policy to achieve the best translation quality under arbitrary latency with only one trained model. |
Shaolei Zhang; Yang Feng; | arxiv-cs.CL | 2021-09-11 |
968 | Improving Multilingual Translation By Representation and Gradient Regularization IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we observe that off-target translation is dominant even in strong multilingual systems, trained on massive multilingual corpora. |
YILIN YANG et. al. | arxiv-cs.CL | 2021-09-10 |
969 | Neural Machine Translation Quality and Post-Editing Performance IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Across all models, we found that better MT systems indeed lead to fewer changes in the sentences in this industry setting. |
Vilém Zouhar; Aleš Tamchyna; Martin Popel; Ondřej Bojar; | arxiv-cs.CL | 2021-09-10 |
970 | Rethinking Zero-shot Neural Machine Translation: From A Perspective of Latent Variables IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a denoising autoencoder objective based on pivot language into traditional training objective to improve the translation accuracy on zero-shot directions. |
WEIZHI WANG et. al. | arxiv-cs.CL | 2021-09-10 |
971 | Collecting A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gender-role assignments (e.g., female nurses versus male dancers) in corpora from three domains, resulting in a first large-scale gender bias dataset of 108K diverse real-world English sentences. |
Shahar Levy; Koren Lazar; Gabriel Stanovsky; | arxiv-cs.CL | 2021-09-08 |
972 | Competence-based Curriculum Learning for Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, we focus on balancing the learning competencies of different languages and propose Competence-based Curriculum Learning for Multilingual Machine Translation, named CCL-M. |
Mingliang Zhang; Fandong Meng; Yunhai Tong; Jie Zhou; | arxiv-cs.CL | 2021-09-08 |
973 | IndicBART: A Pre-trained Model for Indic Natural Language Generation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we study pre-trained sequence-to-sequence models for a group of related languages, with a focus on Indic languages. |
RAJ DABRE et. al. | arxiv-cs.CL | 2021-09-07 |
974 | Don’t Go Far Off: An Empirical Study on Neural Poetry Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Despite constant improvements in machine translation quality, automatic poetry translation remains a challenging problem due to the lack of open-sourced parallel poetic corpora, … |
Tuhin Chakrabarty; Arkadiy Saakyan; S. Muresan; | Conference on Empirical Methods in Natural Language … | 2021-09-07 |
975 | Infusing Future Information Into Monotonic Attention Through Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Simultaneous neural machine translation (SNMT) models start emitting the target sequence before they have processed the source sequence. |
Mohd Abbas Zaidi; Sathish Indurthi; Beomseok Lee; Nikhil Kumar Lakumarapu; Sangha Kim; | arxiv-cs.CL | 2021-09-07 |
976 | Transformer Models for Text Coherence Assessment Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Accordingly, in this paper, we propose four different Transformer-based architectures for the task: vanilla Transformer, hierarchical Transformer, multi-task learning-based model, and a model with fact-based input representation. |
Tushar Abhishek; Daksh Rawat; Manish Gupta; Vasudeva Varma; | arxiv-cs.CL | 2021-09-05 |
977 | Masked Adversarial Generation for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose the Masked Adversarial Generation (MAG) model, that learns to perturb the translation model throughout the training process. |
Badr Youbi Idrissi; Stéphane Clinchant; | arxiv-cs.CL | 2021-09-01 |
978 | Survey of Low-Resource Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a survey covering the state of the art in low-resource machine translation research. |
Barry Haddow; Rachel Bawden; Antonio Valerio Miceli Barone; Jindřich Helcl; Alexandra Birch; | arxiv-cs.CL | 2021-09-01 |
979 | mMARCO: A Multilingual Version of The MS MARCO Passage Ranking Dataset IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we present mMARCO, a multilingual version of the MS MARCO passage ranking dataset comprising 13 languages that was created using machine translation. |
LUIZ BONIFACIO et. al. | arxiv-cs.CL | 2021-08-31 |
980 | An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose an unsupervised method to build SS corpora from large-scale bilingual translation corpora, alleviating the need for SS supervised corpora. By taking the pair of the source sentences of translation corpus and the translations of their references in a bridge language, we can construct large-scale pseudo parallel SS data. |
Xinyu Lu; Jipeng Qiang; Yun Li; Yunhao Yuan; Yi Zhu; | arxiv-cs.CL | 2021-08-31 |
981 | Secoco: Self-Correcting Encoding for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents Self-correcting Encoding (Secoco), a framework that effectively deals with input noise for robust neural machine translation by introducing self-correcting predictors. |
TAO WANG et. al. | arxiv-cs.CL | 2021-08-27 |
982 | Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To effectively learn semantic alignments among product images and bilingual texts in translation, we design a unified product-oriented cross-modal cross-lingual model (UPOC$^2$) for pre-training and fine-tuning. In this paper, we first construct a large-scale bilingual product description dataset called Fashion-MMT, which contains over 114k noisy and 40k manually cleaned description translations with multiple product images. We will release the dataset and codes at https://github.com/syuqings/Fashion-MMT. |
YUQING SONG et. al. | arxiv-cs.CV | 2021-08-25 |
983 | Recurrent Multiple Shared Layers in Depth for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose to train a deeper model with recurrent mechanism, which loops the encoder and decoder blocks of Transformer in the depth direction. |
GuoLiang Li; Yiyang Li; | arxiv-cs.CL | 2021-08-23 |
984 | Examining Covert Gender Bias: A Case Study in Turkish and English Machine Translation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Specifically, we introduce a method to investigate asymmetrical gender markings. |
Chloe Ciora; Nur Iren; Malihe Alikhani; | arxiv-cs.CL | 2021-08-23 |
985 | Attentive Fine-tuning of Transformers for Translation of Low-resourced Languages @LoResMT 2021 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper reports the Machine Translation (MT) systems submitted by the IIITT team for the English->Marathi and English->Irish language pairs of the LoResMT 2021 shared task. |
KARTHIK PURANIK et. al. | arxiv-cs.CL | 2021-08-19 |
986 | Active Learning for Massively Parallel Translation of Constrained Text Into Low Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose an algorithm for human and machine to work together seamlessly to translate a closed text into a severely low resource language. |
Zhong Zhou; Alex Waibel; | arxiv-cs.CL | 2021-08-16 |
987 | Findings of The LoResMT 2021 Shared Task on COVID and Sign Language for Low-resource Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present the findings of the LoResMT 2021 shared task which focuses on machine translation (MT) of COVID-19 data for both low-resource spoken and sign languages. |
ATUL KR. OJHA et. al. | arxiv-cs.CL | 2021-08-14 |
988 | Improving Context-Aware Neural Machine Translation with Source-side Monolingual Documents Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To make full use of source-side monolingual documents for context-aware NMT, we propose a Pre-training approach with Global Context (PGC). |
LINQING CHEN et. al. | ijcai | 2021-08-13 |
989 | Improving Stylized Neural Machine Translation with Iterative Dual Knowledge Transfer Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this problem, we propose an iterative dual knowledge transfer framework that utilizes informal training data of machine translation and formality style transfer data to create large-scale stylized paired data, for the training of stylized machine translation model. |
XUANXUAN WU et. al. | ijcai | 2021-08-13 |
990 | The HW-TSC’s Offline Speech Translation Systems for IWSLT 2021 Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes our work in participation of the IWSLT-2021 offline speech translation task. |
MINGHAN WANG et. al. | arxiv-cs.CL | 2021-08-09 |
991 | Improving Similar Language Translation With Transfer Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work is part of our contribution to the WMT 2021 Similar Languages Translation Shared Task where we submitted models for different language pairs, including French-Bambara, Spanish-Catalan, and Spanish-Portuguese in both directions. |
Ife Adebara; Muhammad Abdul-Mageed; | arxiv-cs.AI | 2021-08-07 |
992 | Facebook AI WMT21 News Translation Task Submission Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe Facebook’s multilingual model submission to the WMT2021 shared task on news translation. |
CHAU TRAN et. al. | arxiv-cs.CL | 2021-08-06 |
993 | WeChat Neural Machine Translation Systems for WMT21 IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces WeChat AI’s participation in WMT 2021 shared news translation task on English->Chinese, English->Japanese, Japanese->English and English->German. |
XIANFENG ZENG et. al. | arxiv-cs.CL | 2021-08-05 |
994 | ChrEnTranslate: Cherokee-English Machine Translation Demo with Quality Estimation and Corrective Feedback Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce ChrEnTranslate, an online machine translation demonstration system for translation between English and an endangered language Cherokee. |
Shiyue Zhang; Benjamin Frey; Mohit Bansal; | arxiv-cs.CL | 2021-07-30 |
995 | The Cross-Lingual Arabic Information REtrieval (CLAIRE) System Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we build our end-to-end Cross-Lingual Arabic Information REtrieval (CLAIRE) system based on the cross-lingual word embedding where searchers are assumed to have a passable passive understanding of Arabic and various supporting information in English is provided to aid retrieval experience. |
Zhizhong Chen; Carsten Eickhoff; | arxiv-cs.IR | 2021-07-29 |
996 | Difficulty-Aware Machine Translation Evaluation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel difficulty-aware MT evaluation metric, expanding the evaluation dimension by taking translation difficulty into consideration. |
Runzhe Zhan; Xuebo Liu; Derek F. Wong; Lidia S. Chao; | arxiv-cs.CL | 2021-07-29 |
997 | Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose sequence-level knowledge distillation (SKD) using perturbed length-aware positional encoding and apply it to a student model, the Levenshtein Transformer. |
Yui Oka; Katsuhito Sudoh; Satoshi Nakamura; | arxiv-cs.CL | 2021-07-28 |
998 | Cross-lingual Transferring of Pre-trained Contextualized Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, building upon the recent works connecting cross-lingual model transferring and neural machine translation, we thus propose a novel cross-lingual model transferring framework for PrLMs: TreLM. |
ZUCHAO LI et. al. | arxiv-cs.CL | 2021-07-27 |
999 | XLPT-AMR: Cross-Lingual Pre-Training Via Multi-Task Learning for Zero-Shot AMR Parsing and Text Generation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Upon the availability of English AMR dataset and English-to-X parallel datasets, in this paper we propose a novel cross-lingual pre-training approach via multi-task learning (MTL) for both zero-shot AMR parsing and AMR-to-text generation. |
Dongqin Xu; Junhui Li; Muhua Zhu; Min Zhang; Guodong Zhou; | acl | 2021-07-26 |
1000 | Improving Speech Translation By Understanding and Learning from The Auxiliary Text Translation Task IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we are interested in training a speech translation model along with an auxiliary text translation task. |
Yun Tang; Juan Pino; Xian Li; Changhan Wang; Dmitriy Genzel; | acl | 2021-07-26 |
1001 | Beyond Noise: Mitigating The Impact of Fine-grained Semantic Divergences on Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Based on these findings, we introduce a divergent-aware NMT framework that uses factors to help NMT recover from the degradation caused by naturally occurring divergences, improving both translation quality and model calibration on EN-FR tasks. |
Eleftheria Briakou; Marine Carpuat; | acl | 2021-07-26 |
1002 | Modeling Bilingual Conversational Characteristics for Neural Chat Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we aim to promote the translation quality of conversational text by modeling the above properties. |
Yunlong Liang; Fandong Meng; Yufeng Chen; Jinan Xu; Jie Zhou; | acl | 2021-07-26 |
1003 | On Compositional Generalization of Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we study NMT models from the perspective of compositional generalization by building a benchmark dataset, CoGnition, consisting of 216k clean and consistent sentence pairs. |
Yafu Li; Yongjing Yin; Yulong Chen; Yue Zhang; | acl | 2021-07-26 |
1004 | Revisiting Negation in Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we evaluate the translation of negation both automatically and manually, in English–German (EN–DE) and English–Chinese (EN–ZH). |
Gongbo Tang; Philipp Rönchen; Rico Sennrich; Joakim Nivre; | arxiv-cs.CL | 2021-07-26 |
1005 | Multilingual Agreement for Multilingual Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel agreement-based method to encourage multilingual agreement among different translation directions, which minimizes the differences among them. |
JIAN YANG et. al. | acl | 2021-07-26 |
1006 | Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a sophisticated neural architecture to incorporate bilingual dictionaries into Neural Machine Translation (NMT) models. |
TONG ZHANG et. al. | acl | 2021-07-26 |
1007 | Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel bilingual mutual information (BMI) based adaptive objective, which measures the learning difficulty for each target token from the perspective of bilingualism, and assigns an adaptive weight accordingly to improve token-level adaptive training. |
YANGYIFAN XU et. al. | acl | 2021-07-26 |
1008 | Learning Language Specific Sub-network for Multilingual Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose LaSS to jointly train a single unified multilingual MT model. |
Zehui Lin; Liwei Wu; Mingxuan Wang; Lei Li; | acl | 2021-07-26 |
1009 | Towards User-Driven Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To fill this gap, we introduce a novel framework called user-driven NMT. |
HUAN LIN et. al. | acl | 2021-07-26 |
1010 | Beyond Sentence-Level End-to-End Speech Translation: Context Helps IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We investigate several decoding approaches, and introduce in-model ensemble decoding which jointly performs document- and sentence-level translation using the same model. |
Biao Zhang; Ivan Titov; Barry Haddow; Rico Sennrich; | acl | 2021-07-26 |
1011 | Selective Knowledge Distillation for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we design a novel protocol that can effectively analyze the different impacts of samples by comparing various samples’ partitions. |
Fusheng Wang; Jianhao Yan; Fandong Meng; Jie Zhou; | acl | 2021-07-26 |
1012 | CCMatrix: Mining Billions of High-Quality Parallel Sentences on The Web IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We show that margin-based bitext mining in a multilingual sentence space can be successfully scaled to operate on monolingual corpora of billions of sentences. |
HOLGER SCHWENK et. al. | acl | 2021-07-26 |
1013 | Diverse Pretrained Context Encodings Improve Document Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a new architecture for adapting a sentence-level sequence-to-sequence transformer by incorporating multiple pre-trained document context signals and assess the impact on translation performance of (1) different pretraining approaches for generating these signals, (2) the quantity of parallel data for which document context is available, and (3) conditioning on source, target, or source and target contexts. |
Domenic Donato; Lei Yu; Chris Dyer; | acl | 2021-07-26 |
1014 | Crafting Adversarial Examples for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we investigate veritable evaluations of NMT adversarial attacks, and propose a novel method to craft NMT adversarial examples. |
Xinze Zhang; Junzhe Zhang; Zhenhua Chen; Kun He; | acl | 2021-07-26 |
1015 | Contrastive Learning for Many-to-many Multilingual Neural Machine Translation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we aim to build a many-to-many translation system with an emphasis on the quality of non-English language directions. |
Xiao Pan; Mingxuan Wang; Liwei Wu; Lei Li; | acl | 2021-07-26 |
1016 | Consistency Regularization for Cross-Lingual Fine-Tuning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose to improve cross-lingual fine-tuning with consistency regularization. |
BO ZHENG et. al. | acl | 2021-07-26 |
1017 | G-Transformer for Document-Level Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: As a solution, we propose G-Transformer, introducing locality assumption as an inductive bias into Transformer, reducing the hypothesis space of the attention from target to source. |
Guangsheng Bao; Yue Zhang; Zhiyang Teng; Boxing Chen; Weihua Luo; | acl | 2021-07-26 |
1018 | Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this problem, we introduce another decoder, called seer decoder, into the encoder-decoder framework during training, which involves future information in target predictions. |
Yang Feng; Shuhao Gu; Dengji Guo; Zhengxin Yang; Chenze Shao; | acl | 2021-07-26 |
1019 | Cross-language Sentence Selection Via Data Augmentation and Rationale Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes an approach to cross-language sentence selection in a low-resource setting. |
YANDA CHEN et. al. | acl | 2021-07-26 |
1020 | Rewriter-Evaluator Architecture for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this issue, we introduce a novel architecture of Rewriter-Evaluator. |
Yangming Li; Kaisheng Yao; | acl | 2021-07-26 |
1021 | From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we adapt a state-of-the-art neural machine translation model to generate Hindi-English code-switched sentences starting from monolingual Hindi sentences. |
Ishan Tarunesh; Syamantak Kumar; Preethi Jyothi; | acl | 2021-07-26 |
1022 | Lightweight Adapter Tuning for Multilingual Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While adapter tuning was investigated for multilingual neural machine translation, this paper proposes a comprehensive analysis of adapters for multilingual speech translation (ST). |
HANG LE et. al. | acl | 2021-07-26 |
1023 | Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose to improve the sampling procedure by selecting the most informative monolingual sentences to complement the parallel data. |
WENXIANG JIAO et. al. | acl | 2021-07-26 |
1024 | End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In particular, we focus on methods based on training the model with constraints provided as part of the input sequence. |
Josef Jon; João Paulo Aires; Dusan Varis; Ondrej Bojar; | acl | 2021-07-26 |
1025 | Fast and Accurate Neural Machine Translation with Translation Memory IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a fast and accurate approach to TM-based NMT within the Transformer framework: the model architecture is simple and employs a single bilingual sentence as its TM, leading to efficient training and inference; and its parameters are effectively optimized through a novel training criterion. |
Qiuxiang He; Guoping Huang; Qu Cui; Li Li; Lemao Liu; | acl | 2021-07-26 |
1026 | Good for Misconceived Reasons: An Empirical Revisiting on The Need for Visual Context in Multimodal Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Upon further investigation, we discover that the improvements achieved by the multimodal models over text-only counterparts are in fact results of the regularization effect. |
Zhiyong Wu; Lingpeng Kong; Wei Bi; Xiang Li; Ben Kao; | acl | 2021-07-26 |
1027 | BERTGen: Multi-task Generation Through BERT Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present BERTGen, a novel, generative, decoder-only model which extends BERT by fusing multimodal and multilingual pre-trained models VL-BERT and M-BERT, respectively. |
Faidon Mitzalis; Ozan Caglayan; Pranava Madhyastha; Lucia Specia; | acl | 2021-07-26 |
1028 | Mid-Air Hand Gestures for Post-Editing of Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Here, we present the first study that investigates the usefulness of mid-air hand gestures in combination with the keyboard (GK) for text editing in PE of MT. Guided by a gesture elicitation study with 14 freelance translators, we develop a prototype supporting mid-air hand gestures for cursor placement, text selection, deletion, and reordering. |
Rashad Albo Jamara; Nico Herbig; Antonio Krüger; Josef van Genabith; | acl | 2021-07-26 |
1029 | Adaptive Nearest Neighbor Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose Adaptive kNN-MT to dynamically determine the number of k for each target token. |
XIN ZHENG et. al. | acl | 2021-07-26 |
1030 | Unsupervised Neural Machine Translation for Low-Resource Domains Via Meta-Learning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this issue, this paper presents a novel meta-learning algorithm for unsupervised neural machine translation (UNMT) that trains the model to adapt to another domain by utilizing only a small amount of training data. |
CHEONBOK PARK et. al. | acl | 2021-07-26 |
1031 | Glancing Transformer for Non-Autoregressive Neural Machine Translation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose the Glancing Language Model (GLM) for single-pass parallel generation models. |
LIHUA QIAN et. al. | acl | 2021-07-26 |
1032 | Neural Machine Translation with Monolingual Translation Memory IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In contrast to existing work that uses bilingual corpus as TM and employs source-side similarity search for memory retrieval, we propose a new framework that uses monolingual memory and performs learnable memory retrieval in a cross-lingual manner. |
Deng Cai; Yan Wang; Huayang Li; Wai Lam; Lemao Liu; | acl | 2021-07-26 |
1033 | Prevent The Language Model from Being Overconfident in Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Based on the property, we propose a Margin-based Token-level Objective (MTO) and a Margin-based Sentence-level Objective (MSO) to maximize the Margin for preventing the LM from being overconfident. |
Mengqi Miao; Fandong Meng; Yijin Liu; Xiao-Hua Zhou; Jie Zhou; | acl | 2021-07-26 |
1034 | The USYD-JD Speech Translation System for IWSLT 2021 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the University of Sydney & JD’s joint submission to the IWSLT 2021 low resource speech translation task. |
Liang Ding; Di Wu; Dacheng Tao; | arxiv-cs.CL | 2021-07-24 |
1035 | MDQE: A More Accurate Direct Pretraining for Machine Translation Quality Estimation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Based on previous related work that has alleviated gaps to some extent, we propose a novel framework that provides a more accurate direct pretraining for QE tasks. |
Lei Lin; | arxiv-cs.CL | 2021-07-24 |
1036 | Extending Challenge Sets to Uncover Gender Bias in Machine Translation: Impact of Stereotypical Verbs and Adjectives IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we present an extension of this challenge set, called WiBeMT, which includes gender-biased adjectives and adds sentences with gender-biased verbs. |
Jonas-Dario Troles; Ute Schmid; | arxiv-cs.CL | 2021-07-24 |
1037 | Modelling Latent Translations for Cross-Lingual Transfer IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To remedy this, we propose a new technique that integrates both steps of the traditional pipeline (translation and classification) into a single model, by treating the intermediate translations as a latent random variable. |
Edoardo Maria Ponti; Julia Kreutzer; Ivan Vulić; Siva Reddy; | arxiv-cs.CL | 2021-07-23 |
1038 | Confidence-Aware Scheduled Sampling for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To address this issue, we propose confidence-aware scheduled sampling. |
Yijin Liu; Fandong Meng; Yufeng Chen; Jinan Xu; Jie Zhou; | arxiv-cs.CL | 2021-07-21 |
1039 | Integrating Unsupervised Data Generation Into Self-Supervised Neural Machine Translation for Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this, unsupervised machine translation (UMT) exploits large amounts of monolingual data by using synthetic data generation techniques such as back-translation and noising, while self-supervised NMT (SSNMT) identifies parallel sentences in smaller comparable data and trains on them. |
Dana Ruiter; Dietrich Klakow; Josef van Genabith; Cristina España-Bonet; | arxiv-cs.CL | 2021-07-19 |
1040 | As Easy As 1, 2, 3: Behavioural Testing of NMT Systems for Numerical Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work we develop comprehensive assessments of the robustness of neural machine translation systems to numerical text via behavioural testing. |
JUN WANG et. al. | arxiv-cs.CL | 2021-07-18 |
1041 | Picard Understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in A Constructed Language Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Tamarian, a fictional language introduced in the Star Trek episode Darmok, communicates meaning through utterances of metaphorical references, such as “Darmok and Jalad at Tanagra” instead of “We should work together.” This work assembles a Tamarian-English dictionary of utterances from the original episode and several follow-on novels, and uses this to construct a parallel corpus of 456 English-Tamarian utterances. |
Peter Jansen; Jordan Boyd-Graber; | arxiv-cs.CL | 2021-07-16 |
1042 | FST: The FAIR Speech Translation System for The IWSLT21 Multilingual Shared Task IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we describe our end-to-end multilingual speech translation system submitted to the IWSLT 2021 evaluation campaign on the Multilingual Speech Translation shared task. |
YUN TANG et. al. | arxiv-cs.CL | 2021-07-14 |
1043 | The IWSLT 2021 BUT Speech Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we study their efficiency from the perspective of having a large amount of separate ASR training data and MT training data, and a smaller amount of speech-translation training data. |
Hari Krishna Vydana; Martin Karafiát; Lukáš Burget; Honza Černocký; | arxiv-cs.CL | 2021-07-13 |
1044 | Zero-shot Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: These models tend to output the wrong language when performing zero-shot ST. We tackle the issues by including additional training data and an auxiliary loss function that minimizes the text-audio difference. |
Tu Anh Dinh; | arxiv-cs.CL | 2021-07-13 |
1045 | Putting Words Into The System’s Mouth: A Targeted Attack on Neural Machine Translation Using Monolingual Data Poisoning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present two methods for crafting poisoned examples, and show that only a tiny handful of instances, amounting to only 0.02% of the training set, is sufficient to enact a successful attack. |
JUN WANG et. al. | arxiv-cs.CL | 2021-07-12 |
1046 | Putting Words Into The System’s Mouth: A Targeted Attack on Neural Machine Translation Using Monolingual Data Poisoning IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation systems are known to be vulnerable to adversarial test inputs, however, as we show in this paper, these systems are also vulnerable to training attacks. … |
JUN WANG et. al. | ArXiv | 2021-07-12 |
1047 | Using Machine Translation to Localize Task Oriented NLG Output Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper explores doing this by applying machine translation to the English output. |
SCOTT ROY et. al. | arxiv-cs.CL | 2021-07-09 |
1048 | Temporally Correlated Task Scheduling for Sequence Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce a learnable scheduler to sequence learning, which can adaptively select auxiliary tasks for training depending on the model status and the current training data. |
XUEQING WU et. al. | icml | 2021-07-08 |
1049 | Using CollGram to Compare Formulaic Language in Human and Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: A comparison of formulaic sequences in human and neural machine translation of quality newspaper articles shows that neural machine translations contain less lower-frequency, but … |
Yves Bestgen; | arxiv-cs.CL | 2021-07-08 |
1050 | Cross-model Back-translated Distillation for Unsupervised Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce a novel component to the standard UMT framework called Cross-model Back-translated Distillation (CBD), that is aimed to induce another level of data diversification that existing principles lack. |
Xuan-Phi Nguyen; Shafiq Joty; Thanh-Tung Nguyen; Kui Wu; Ai Ti Aw; | icml | 2021-07-08 |
1051 | Self-supervised and Supervised Joint Training for Resource-rich Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a joint training approach, F2-XEnDec, to combine self-supervised and supervised learning to optimize NMT models. |
Yong Cheng; Wei Wang; Lu Jiang; Wolfgang Macherey; | icml | 2021-07-08 |
1052 | A Topic Guided Pointer-Generator Model for Generating Natural Language Code Summaries Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present a neural network model named ToPNN for code summarization, which uses the topics in a broader context (e.g., class) to guide the neural networks that combine the generation of new words and the copy of existing words in code. |
XIN WANG et. al. | arxiv-cs.SE | 2021-07-04 |
1053 | IITP at WAT 2021: System Description for English-Hindi Multimodal Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We participate in the 8th Workshop on Asian Translation (WAT – 2021) for English-Hindi multimodal translation task and achieve 42.47 and 37.50 BLEU points for Evaluation and Challenge subset, respectively. |
Baban Gain; Dibyanayan Bandyopadhyay; Asif Ekbal; | arxiv-cs.CL | 2021-07-04 |
1054 | Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we study the benefit that such compositionality brings about to several machine translation tasks. |
Rahma Chaabouni; Roberto Dessì; Eugene Kharitonov; | arxiv-cs.CL | 2021-07-03 |
1055 | Zero-pronoun Data Augmentation for Japanese-to-English Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we propose a data augmentation method that provides additional training signals for the translation model to learn correlations between local context and zero pronouns. |
Ryokan Ri; Toshiaki Nakazawa; Yoshimasa Tsuruoka; | arxiv-cs.CL | 2021-07-01 |
1056 | Modeling Target-side Inflection in Placeholder Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To address this problem, we propose a novel method of placeholder translation that can inflect specified terms according to the grammatical construction of the output sentence. |
Ryokan Ri; Toshiaki Nakazawa; Yoshimasa Tsuruoka; | arxiv-cs.CL | 2021-07-01 |
1057 | The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes USTC-NELSLIP’s submissions to the IWSLT2021 Simultaneous Speech Translation task. |
Dan Liu; Mengge Du; Xiaoxi Li; Yuchen Hu; Lirong Dai; | arxiv-cs.CL | 2021-07-01 |
1058 | IMS’ Systems for The IWSLT 2021 Low-Resource Speech Translation Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the submission to the IWSLT 2021 Low-Resource Speech Translation Shared Task by IMS team. We utilize state-of-the-art models combined with several data … |
Pavel Denisov; Manuel Mager; Ngoc Thang Vu; | International Workshop on Spoken Language Translation | 2021-06-30 |
1059 | IMS’ Systems for The IWSLT 2021 Low-Resource Speech Translation Task Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the submission to the IWSLT 2021 Low-Resource Speech Translation Shared Task by IMS team. |
Pavel Denisov; Manuel Mager; Ngoc Thang Vu; | arxiv-cs.CL | 2021-06-30 |
1060 | DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation By Augmenting Pretrained Multilingual Encoders IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To reduce this gap, we introduce DeltaLM, a pretrained multilingual encoder-decoder model that regards the decoder as the task layer of off-the-shelf pretrained encoders. |
SHUMING MA et. al. | arxiv-cs.CL | 2021-06-25 |
1061 | Language Models Are Good Translators IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recent years have witnessed the rapid advance in neural machine translation (NMT), the core of which lies in the encoder-decoder architecture. Inspired by the recent progress of … |
SHUO WANG et. al. | arxiv-cs.CL | 2021-06-25 |
1062 | On The Influence of Machine Translation on Language Origin Obfuscation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we analyze the ability to detect the source language from the translated output of two widely used commercial machine translation systems by utilizing machine-learning algorithms with basic textual features like n-grams. |
Benjamin Murauer; Michael Tschuggnall; Günther Specht; | arxiv-cs.CL | 2021-06-24 |
1063 | End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In particular, we focus on methods based on training the model with constraints provided as part of the input sequence. |
Josef Jon; João Paulo Aires; Dušan Variš; Ondřej Bojar; | arxiv-cs.CL | 2021-06-23 |
1064 | Phrase-level Active Learning for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we address this problem in an active learning setting where we can spend a given budget on translating in-domain data, and gradually fine-tune a pre-trained out-of-domain NMT model on the newly translated data. |
Junjie Hu; Graham Neubig; | arxiv-cs.CL | 2021-06-21 |
1065 | Recurrent Stacking of Layers in Neural Networks: An Application to Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose to share parameters across all layers thereby leading to a recurrently stacked neural network model. |
Raj Dabre; Atsushi Fujita; | arxiv-cs.CL | 2021-06-18 |
1066 | Lost in Interpreting: Speech Translation from Source or Interpreter? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We investigate if such an automatic system should rather follow the original speaker, or an interpreter to achieve better translation quality at the cost of increased delay. |
Dominik Macháček; Matúš Žilinec; Ondřej Bojar; | arxiv-cs.CL | 2021-06-17 |
1067 | Central Kurdish Machine Translation: First Large Scale Parallel Corpus and Experiments Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present the first large scale parallel corpus of Central Kurdish-English, Awta, containing 229,222 pairs of manually aligned translations. |
Zhila Amini; Mohammad Mohammadamini; Hawre Hosseini; Mehran Mansouri; Daban Jaff; | arxiv-cs.AI | 2021-06-17 |
1068 | Alternated Training with Synthetic and Authentic Data for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose alternated training with synthetic and authentic data for NMT. Compared with previous work, we introduce authentic data as guidance to prevent the training of NMT models from being disturbed by noisy synthetic data. |
Rui Jiao; Zonghan Yang; Maosong Sun; Yang Liu; | arxiv-cs.CL | 2021-06-16 |
1069 | Evaluating Gender Bias in Hindi-English Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We implement a modified version of the existing TGBI metric based on the grammatical considerations for Hindi. |
Gauri Gupta; Krithika Ramesh; Sanjay Singh; | arxiv-cs.CL | 2021-06-16 |
1070 | Language Tags Matter for Zero-Shot Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we demonstrate that the LTs are not only indicators for translation directions but also crucial to zero-shot translation qualities. |
Liwei Wu; Shanbo Cheng; Mingxuan Wang; Lei Li; | arxiv-cs.CL | 2021-06-15 |
1071 | English to Bangla Machine Translation Using Recurrent Neural Network IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes an architecture of English to Bangla machine translation system. |
Shaykh Siddique; Tahmid Ahmed; Md. Rifayet Azam Talukder; Md. Mohsin Uddin; | arxiv-cs.CL | 2021-06-14 |
1072 | Machine Translation Into Low-resource Language Varieties IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a general framework to rapidly adapt MT systems to generate language varieties that are close to, but different from, the standard target language, using no parallel (source–variety) data. |
Sachin Kumar; Antonios Anastasopoulos; Shuly Wintner; Yulia Tsvetkov; | arxiv-cs.CL | 2021-06-12 |
1073 | UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To generalize this success to non-English languages, we introduce UC^2, the first machine translation-augmented framework for cross-lingual cross-modal representation learning. |
MINGYANG ZHOU et. al. | cvpr | 2021-06-11 |
1074 | Exploring Unsupervised Pretraining Objectives for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we systematically compare masking with alternative objectives that produce inputs resembling real (full) sentences, by reordering and replacing words based on their context. |
Christos Baziotis; Ivan Titov; Alexandra Birch; Barry Haddow; | arxiv-cs.CL | 2021-06-10 |
1075 | Input Augmentation Improves Constrained Beam Search for Neural Machine Translation: NTT at WAT 2021 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes our systems that were submitted to the restricted translation task at WAT 2021. |
Katsuki Chousa; Makoto Morishita; | arxiv-cs.CL | 2021-06-09 |
1076 | Crosslingual Embeddings Are Essential in UNMT for Distant Languages: An English to IndoAryan Case Study Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we show that initializing the embedding layer of UNMT models with cross-lingual embeddings shows significant improvements in BLEU score over existing approaches with embeddings randomly initialized. |
Tamali Banerjee; Rudra Murthy V; Pushpak Bhattacharyya; | arxiv-cs.CL | 2021-06-09 |
1077 | Multilingual Neural Semantic Parsing for Low-Resourced Languages Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To tackle the data limitation problem, we propose using machine translation to bootstrap multilingual training data from the more abundant English data. To evaluate our multilingual models on human-written sentences as opposed to machine translated ones, we introduce a new multilingual semantic parsing dataset in English, Italian and Japanese based on the Facebook Task Oriented Parsing (TOP) dataset. |
Menglin Xia; Emilio Monti; | arxiv-cs.CL | 2021-06-07 |
1078 | The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation IF:5 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce the FLORES-101 evaluation benchmark, consisting of 3001 sentences extracted from English Wikipedia and covering a variety of different topics and domains. |
NAMAN GOYAL et. al. | arxiv-cs.CL | 2021-06-06 |
1079 | Cross-language Sentence Selection Via Data Augmentation and Rationale Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes an approach to cross-language sentence selection in a low-resource setting. |
YANDA CHEN et. al. | arxiv-cs.CL | 2021-06-04 |
1080 | Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose to improve the sampling procedure by selecting the most informative monolingual sentences to complement the parallel data. |
WENXIANG JIAO et. al. | arxiv-cs.CL | 2021-06-02 |
1081 | Part of Speech and Universal Dependency Effects on English Arabic Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this research paper, I will elaborate on a method to evaluate machine translation models based on their performance on underlying syntactical phenomena between English and Arabic languages. |
Ofek Rafaeli; Omri Abend; Leshem Choshen; Dmitry Nikolaev; | arxiv-cs.CL | 2021-06-01 |
1082 | The REPU CS’ Spanish–Quechua Submission to The AmericasNLP 2021 Shared Task on Open Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present the submission of REPUcs to the AmericasNLP machine translation shared task for the low resource language pair Spanish–Quechua. Our neural machine translation system … |
Oscar Moreno; | AMERICASNLP | 2021-06-01 |
1083 | ICT’s System for AutoSimTrans 2021: Robust Char-Level Simultaneous Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Simultaneous translation (ST) outputs the translation simultaneously while reading the input sentence, which is an important component of simultaneous interpretation. In this … |
Shaolei Zhang; Yang Feng; | Proceedings of the Second Workshop on Automatic … | 2021-06-01 |
1084 | ViTA: Visual-Linguistic Translation By Aligning Object Tags IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose our system under the team name Volta for the Multimodal Translation Task of WAT 2021 from English to Hindi. |
Kshitij Gupta; Devansh Gautam; Radhika Mamidi; | arxiv-cs.CL | 2021-06-01 |
1085 | Rejuvenating Low-Frequency Words: Making The Most of Parallel Data in Non-Autoregressive Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Accordingly, we propose reverse KD to rejuvenate more alignments for low-frequency target words. |
LIANG DING et. al. | arxiv-cs.CL | 2021-06-01 |
1086 | Verdi: Quality Estimation and Error Detection for Bilingual Corpora Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose Verdi, a novel framework for word-level and sentence-level post-editing effort estimation for bilingual corpora. |
Mingjun Zhao; Haijiang Wu; Di Niu; Zixuan Wang; Xiaoli Wang; | arxiv-cs.CL | 2021-05-31 |
1087 | Multilingual Speech Translation with Unified Transformer: Huawei Noah’s Ark Lab at IWSLT 2021 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the system submitted to the IWSLT 2021 Multilingual Speech Translation (MultiST) task from Huawei Noah’s Ark Lab. |
Xingshan Zeng; Liangyou Li; Qun Liu; | arxiv-cs.CL | 2021-05-31 |
1088 | Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning-Based Methods IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The Bangla language is the seventh most spoken language, with 265 million native and non-native speakers worldwide. However, English is the predominant language for online … |
OVISHAKE SEN et. al. | IEEE Access | 2021-05-31 |
1089 | Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, in this paper, we present a thorough analysis of 75 BNLP research papers and categorize them into 11 categories, namely Information Extraction, Machine Translation, Named Entity Recognition, Parsing, Parts of Speech Tagging, Question Answering System, Sentiment Analysis, Spam and Fake Detection, Text Summarization, Word Sense Disambiguation, and Speech Processing and Recognition. |
OVISHAKE SEN et. al. | arxiv-cs.CL | 2021-05-31 |
1090 | Korean-English Machine Translation with Multiple Tokenization Strategy Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, alphabet tokenization, morpheme tokenization, and BPE tokenization were applied to Korean as the source language and English as the target language respectively, and the comparison experiment was conducted by repeating 50,000 epochs of each 9 models using the Transformer neural network. |
Dojun Park; Youngjin Jang; Harksoo Kim; | arxiv-cs.CL | 2021-05-29 |
1091 | TranSmart: A Practical Interactive Machine Translation System IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report presents major functions of TranSmart, algorithms for achieving these functions, how to use the TranSmart APIs, and evaluation results of some key functions. |
GUOPING HUANG et. al. | arxiv-cs.CL | 2021-05-27 |
1092 | Extremely Low-resource Machine Translation for Closely Related Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: An effective method to improve extremely low-resource neural machine translation is multilingual training, which can be improved by leveraging monolingual data to create synthetic bilingual corpora using the back-translation method. We collected new parallel data for Võro, North and South Saami and present first results of neural machine translation for these languages. |
Maali Tars; Andre Tättar; Mark Fišel; | arxiv-cs.CL | 2021-05-27 |
1093 | Investigating Code-Mixed Modern Standard Arabic-Egyptian to English Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We develop models under different conditions, employing both (i) standard end-to-end sequence-to-sequence (S2S) Transformers trained from scratch and (ii) pre-trained S2S language models (LMs). |
El Moatez Billah Nagoudi; AbdelRahim Elmadany; Muhammad Abdul-Mageed; | arxiv-cs.LG | 2021-05-27 |
1094 | How Does Distilled Data Complexity Impact The Quality and Confidence of Non-Autoregressive Machine Translation? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this issue, we seek to understand why distillation is so effective. |
Weijia Xu; Shuming Ma; Dongdong Zhang; Marine Carpuat; | arxiv-cs.CL | 2021-05-26 |
1095 | IntelliCAT: Intelligent Machine Translation Post-Editing with Quality Estimation and Translation Suggestion IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present IntelliCAT, an interactive translation interface with neural models that streamline the post-editing process on machine translation output. |
Dongjun Lee; Junhyeong Ahn; Heesoo Park; Jaemin Jo; | arxiv-cs.CL | 2021-05-25 |
1096 | Context-Interactive Pre-Training for Document Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To remedy this, here we propose a simple yet effective context-interactive pre-training approach, which targets benefiting from external large-scale corpora. |
Pengcheng Yang; Pei Zhang; Boxing Chen; Jun Xie; Weihua Luo; | naacl | 2021-05-23 |
1097 | SGL: Speaking The Graph Languages of Semantic Parsing Via Multilingual Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, instead, we reframe semantic parsing towards multiple formalisms as Multilingual Neural Machine Translation (MNMT), and propose SGL, a many-to-many seq2seq architecture trained with an MNMT objective. |
Luigi Procopio; Rocco Tripodi; Roberto Navigli; | naacl | 2021-05-23 |
1098 | Towards Continual Learning for Multilingual Machine Translation Via Vocabulary Substitution IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a straightforward vocabulary adaptation scheme to extend the language capacity of multilingual machine translation models, paving the way towards efficient continual learning for multilingual machine translation. |
Xavier Garcia; Noah Constant; Ankur Parikh; Orhan Firat; | naacl | 2021-05-23 |
1099 | Generative Imagination Elevates Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose ImagiT, a novel machine translation method via visual imagination. |
Quanyu Long; Mingxuan Wang; Lei Li; | naacl | 2021-05-23 |
1100 | Towards Modeling The Style of Translators in Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we investigate methods to augment the state of the art Transformer model with translator information that is available in part of the training data. |
Yue Wang; Cuong Hoang; Marcello Federico; | naacl | 2021-05-23 |
1101 | Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we focus on sequence-level knowledge distillation (SeqKD) from external text-based NMT models. |
Hirofumi Inaguma; Tatsuya Kawahara; Shinji Watanabe; | naacl | 2021-05-23 |
1102 | mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer IF:8 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce mT5, a multilingual variant of T5 that was pre-trained on a new Common Crawl-based dataset covering 101 languages. |
LINTING XUE et. al. | naacl | 2021-05-23 |
1103 | UniDrop: A Simple Yet Effective Technique to Improve Transformer Without Extra Cost IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, in this paper, we integrate different dropout techniques into the training of Transformer models. |
ZHEN WU et. al. | naacl | 2021-05-23 |
1104 | Backtranslation Feedback Improves User Confidence in MT, Not Quality Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Translating text into a language unknown to the text’s author, dubbed outbound translation, is a modern need for which the user experience has significant room for improvement, beyond the basic machine translation facility. We demonstrate this by showing three ways in which user confidence in the outbound translation, as well as its overall final quality, can be affected: backward translation, quality estimation (with alignment) and source paraphrasing. |
VILÉM ZOUHAR et. al. | naacl | 2021-05-23 |
1105 | Non-Autoregressive Translation By Learning Target Categorical Codes IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose CNAT, which learns implicitly categorical codes as latent variables into the non-autoregressive decoding. |
YU BAO et. al. | naacl | 2021-05-23 |
1106 | Neural Machine Translation Without Embeddings Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Surprisingly, replacing the ubiquitous embedding layer with one-hot representations of each byte does not hurt performance; experiments on byte-to-byte machine translation from English to 10 different languages show a consistent improvement in BLEU, rivaling character-level and even standard subword-level models. |
Uri Shaham; Omer Levy; | naacl | 2021-05-23 |
1107 | Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present an end-to-end framework that exploits compositionality to learn searchable hidden representations at intermediate stages of a sequence model using decomposed sub-tasks. |
Siddharth Dalmia; Brian Yan; Vikas Raunak; Florian Metze; Shinji Watanabe; | naacl | 2021-05-23 |
1108 | Cross-lingual Cross-modal Pretraining for Multimodal Retrieval IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a new approach to learn cross-lingual cross-modal representations for matching images and their relevant captions in multiple languages. |
Hongliang Fei; Tan Yu; Ping Li; | naacl | 2021-05-23 |
1109 | From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To tackle the challenge, we propose a joint learning approach, with English SLU training data and non-English auxiliary tasks from raw text, syntax and translation for transfer. |
ROB VAN DER GOOT et. al. | naacl | 2021-05-23 |
1110 | Harnessing Multilinguality in Unsupervised Machine Translation for Rare Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we show that multilinguality is critical to making unsupervised systems practical for low-resource settings. |
Xavier Garcia; Aditya Siddhant; Orhan Firat; Ankur Parikh; | naacl | 2021-05-23 |
1111 | Counterfactual Data Augmentation for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a data augmentation method for neural machine translation. |
Qi Liu; Matt Kusner; Phil Blunsom; | naacl | 2021-05-23 |
1112 | Can Latent Alignments Improve Autoregressive Machine Translation? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We explore the possibility of training autoregressive machine translation models with latent alignment objectives, and observe that, in practice, this approach results in degenerate models. |
Adi Haviv; Lior Vassertail; Omer Levy; | naacl | 2021-05-23 |
1113 | Training Data Augmentation for Code-Mixed Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present an m-BERT based procedure whose core learnable component is a ternary sequence labeling model, that can be trained with a limited code-mixed corpus alone. |
Abhirut Gupta; Aditya Vavre; Sunita Sarawagi; | naacl | 2021-05-23 |
1114 | Multi-Task Learning with Shared Encoder for Non-Autoregressive Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we hypothesize and empirically verify that AT and NAT encoders capture different linguistic properties of source sentences. |
YONGCHANG HAO et. al. | naacl | 2021-05-23 |
1115 | Investigating Math Word Problems Using Pretrained Multilingual Language Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we revisit math word problems (MWPs) from the cross-lingual and multilingual perspective. |
Minghuan Tan; Lei Wang; Lingxiao Jiang; Jing Jiang; | arxiv-cs.CL | 2021-05-19 |
1116 | VEGA: A Virtual Environment for Exploring Gender Bias Vs. Accuracy Trade-offs in AI Translation Services Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation services are a very popular class of Artificial Intelligence (AI) services nowadays but public’s trust in these services is not guaranteed since they have been … |
Mariana Bernagozzi; Biplav Srivastava; F. Rossi; Sheema Usmani; | AAAI Conference on Artificial Intelligence | 2021-05-18 |
1117 | Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We describe models focused on the understudied problem of translating between monolingual and code-mixed language pairs. |
Ganesh Jawahar; El Moatez Billah Nagoudi; Muhammad Abdul-Mageed; Laks V. S. Lakshmanan; | arxiv-cs.CL | 2021-05-18 |
1118 | Ensemble-based Transfer Learning for Low-resource Machine Translation Quality Estimation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we focus on the Sentence-Level QE Shared Task of the Fifth Conference on Machine Translation (WMT20), but in a more challenging setting. |
Ting-Wei Wu; Yung-An Hsieh; Yi-Chieh Liu; | arxiv-cs.CL | 2021-05-17 |
1119 | A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we propose a general multi-task learning framework to leverage text data for ASR and ST tasks. |
Y. Tang; J. Pino; C. Wang; X. Ma; D. Genzel; | icassp | 2021-05-16 |
1120 | Jointly Trained Transformers Models for Spoken Language Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, degradation in performance is reduced by creating an End-to-End differentiable pipeline between the ASR and MT systems. |
H. K. Vydana; M. Karafiát; K. Zmolikova; L. Burget; H. Cernocký; | icassp | 2021-05-16 |
1121 | An Empirical Study on Task-Oriented Dialogue Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we systematically investigate advanced models on the task-oriented dialogue translation task, including sentence-level, document-level and non-autoregressive NMT models. |
S. Liu; | icassp | 2021-05-16 |
1122 | Modeling Homophone Noise for Robust Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a robust neural machine translation (NMT) framework to deal with homophone errors. |
W. QIN et. al. | icassp | 2021-05-16 |
1123 | Machine Translation Verbosity Control for Automatic Dubbing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we focus on the problem of controlling the verbosity of machine translation output, so that subsequent steps of our automatic dubbing pipeline can generate dubs of better quality. |
S. M. Lakew; et al. | icassp | 2021-05-16 |
1124 | Task Aware Multi-Task Learning for Speech to Text Tasks IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a task modulation network which allows the model to learn task specific features, while learning the shared features simultaneously. |
S. Indurthi; et al. | icassp | 2021-05-16 |
1125 | Data Augmentation for Sign Language Gloss Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We focus here on gloss-to-text translation, which we treat as a low-resource neural machine translation (NMT) problem. |
Amit Moryossef; Kayo Yin; Graham Neubig; Yoav Goldberg; | arxiv-cs.CL | 2021-05-16 |
1126 | Cascaded Models with Cyclic Feedback for Direct Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a technique that allows cascades of automatic speech recognition (ASR) and machine translation (MT) to exploit in-domain direct speech translation data in addition to out-of-domain MT and ASR data. |
T. K. Lam; S. Schamoni; S. Riezler; | icassp | 2021-05-16 |
1127 | Image-Assisted Transformer in Zero-Resource Multi-Modal Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we investigate how to use visual information as an auxiliary hint for a Transformer-based system in a zero-resource translation scenario. |
P. Huang; S. Sun; H. Yang; | icassp | 2021-05-16 |
1128 | The Volctrans Neural Speech Translation System for IWSLT 2021 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes the systems submitted to IWSLT 2021 by the Volctrans team. |
CHENGQI ZHAO et. al. | arxiv-cs.CL | 2021-05-15 |
1129 | From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To tackle the challenge, we propose a joint learning approach, with English SLU training data and non-English auxiliary tasks from raw text, syntax and translation for transfer. |
ROB VAN DER GOOT et. al. | arxiv-cs.CL | 2021-05-15 |
1130 | Dynamic Multi-Branch Layers for On-Device Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Inspired by conditional computation, we propose to improve the performance of on-device NMT systems with dynamic multi-branch layers. |
ZHIXING TAN et. al. | arxiv-cs.CL | 2021-05-14 |
1131 | Do Context-Aware Translation Models Pay The Right Attention? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we ask several questions: What contexts do human translators use to resolve ambiguous words? |
KAYO YIN et. al. | arxiv-cs.CL | 2021-05-14 |
1132 | Improving Lexically Constrained Neural Machine Translation with Source-Conditioned Masked Span Prediction Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we instead tackle a more challenging setup consisting of domain-specific corpora with much longer n-gram and highly specialized terms. |
Gyubok Lee; Seongjun Yang; Edward Choi; | arxiv-cs.CL | 2021-05-12 |
1133 | Can You Traducir This? Machine Translation for Code-Switched Input IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We focus here on Machine Translation (MT) of CSW texts, where we aim to simultaneously disentangle and translate the two mixed languages. Due to the lack of actual translated CSW data, we generate artificial training data from regular parallel texts. |
Jitao Xu; François Yvon; | arxiv-cs.CL | 2021-05-11 |
1134 | Automatic Classification of Human Translation and Machine Translation: A Study from The Perspective of Lexical Diversity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: By using a trigram model and fine-tuning a pretrained BERT model for sequence classification, we show that machine translation and human translation can be classified with an accuracy above chance level, which suggests that machine translation and human translation are different in a systematic way. |
Yingxue Fu; Mark-Jan Nederhof; | arxiv-cs.CL | 2021-05-10 |
1135 | Self-Guided Curriculum Learning for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by this, we propose a self-guided curriculum strategy to encourage the learning of neural machine translation (NMT) models to follow the above recovery criterion, where we cast the recovery degree of each training example as its learning difficulty. |
LEI ZHOU et. al. | arxiv-cs.CL | 2021-05-10 |
1136 | End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021 IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper describes the submission to the IWSLT 2021 offline speech translation task by the UPC Machine Translation group. |
Gerard I. Gállego; Ioannis Tsiamas; Carlos Escolano; José A. R. Fonollosa; Marta R. Costa-jussà; | arxiv-cs.CL | 2021-05-10 |
1137 | Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a continual pre-training (CPT) framework on mBART to effectively adapt it to unseen languages. |
Zihan Liu; Genta Indra Winata; Pascale Fung; | arxiv-cs.CL | 2021-05-09 |
1138 | Learning Shared Semantic Space for Speech-to-Text Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In observation of this obstacle, we propose to bridge this representation gap with Chimera. |
Chi Han; Mingxuan Wang; Heng Ji; Lei Li; | arxiv-cs.CL | 2021-05-07 |
1139 | Duplex Sequence-to-Sequence Learning for Reversible Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose REDER (Reversible Duplex Transformer), a parameter-efficient model and apply it to machine translation. |
ZAIXIANG ZHENG et. al. | arxiv-cs.CL | 2021-05-07 |
1140 | Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper investigates two key aspects of end-to-end simultaneous speech translation: (a) how to encode efficiently the continuous speech flow, and (b) how to segment the speech flow in order to alternate optimally between reading (R: encoding input) and writing (W: decoding output) operations. |
Ha Nguyen; Yannick Estève; Laurent Besacier; | arxiv-cs.CL | 2021-04-29 |
1141 | Automatic Post-Editing for Vietnamese Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a systematic approach to tackle the APE task for Vietnamese. |
Thanh Vu; Dai Quoc Nguyen; | arxiv-cs.CL | 2021-04-25 |
1142 | Applying Automatic Translation for Optical Music Recognition’s Encoding Step Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Optical music recognition is a research field whose efforts have been mainly focused, due to the difficulties involved in its processes, on document and image recognition. … |
Antonio Ríos-Vila; M. Esplà-Gomis; D. Rizo; P. D. León; J. Iñesta; | Applied Sciences | 2021-04-25 |
1143 | Modeling Coverage for Non-Autoregressive Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we argue that these issues of NAT can be addressed through coverage modeling, which has been proved to be useful in autoregressive decoding. |
Yong Shan; Yang Feng; Chenze Shao; | arxiv-cs.CL | 2021-04-24 |
1144 | Should We Stop Training More Monolingual Models, and Simply Use Machine Translation Instead? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Most work in NLP makes the assumption that it is desirable to develop solutions in the native language in question. |
Tim Isbister; Fredrik Carlsson; Magnus Sahlgren; | arxiv-cs.CL | 2021-04-21 |
1145 | End-to-end Speech Translation Via Cross-modal Progressive Training IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose Cross Speech-Text Network (XSTNet), an end-to-end model for speech-to-text translation. |
Rong Ye; Mingxuan Wang; Lei Li; | arxiv-cs.CL | 2021-04-21 |
1146 | Addressing The Vulnerability of NMT in Input Perturbations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we improve the robustness of NMT models by reducing the effect of noisy words through a Context-Enhanced Reconstruction (CER) approach. |
Weiwen Xu; Ai Ti Aw; Yang Ding; Kui Wu; Shafiq Joty; | arxiv-cs.CL | 2021-04-20 |
1147 | Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we focus on a zero-shot cross-lingual transfer task in NMT. |
GUANHUA CHEN et. al. | arxiv-cs.CL | 2021-04-18 |
1148 | DCH-2: A Parallel Customer-Helpdesk Dialogue Corpus with Distributions of Annotators’ Labels Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a data set called DCH-2, which contains 4,390 real customer-helpdesk dialogues in Chinese and their English translations. |
Zhaohao Zeng; Tetsuya Sakai; | arxiv-cs.CL | 2021-04-18 |
1149 | Stream-level Latency Evaluation for Simultaneous Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work proposes a stream-level adaptation of the current latency measures based on a re-segmentation approach applied to the output translation, that is successfully evaluated on streaming conditions for a reference IWSLT task. |
Javier Iranzo-Sánchez; Jorge Civera; Alfons Juan; | arxiv-cs.CL | 2021-04-18 |
1150 | Sentence Concatenation Approach to Data Augmentation for Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, this study proposes a simple data augmentation method to handle long sentences. |
Seiichiro Kondo; Kengo Hotate; Masahiro Kaneko; Mamoru Komachi; | arxiv-cs.CL | 2021-04-17 |
1151 | XLEnt: Mining A Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this, we propose Lexical-Semantic-Phonetic Align (LSP-Align), a technique to automatically mine cross-lingual entity lexica from mined web data. |
Ahmed El-Kishky; Adithya Renduchintala; James Cross; Francisco Guzmán; Philipp Koehn; | arxiv-cs.CL | 2021-04-17 |
1152 | Sentence Alignment with Parallel Documents Facilitates Biomedical Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work presents an unsupervised algorithm for deriving parallel corpora from document-level translations by using sentence alignment and explores how training materials affect the performance of biomedical NMT systems. |
Shengxuan Luo; Huaiyuan Ying; Jiao Li; Sheng Yu; | arxiv-cs.CL | 2021-04-17 |
1153 | From Fully Trained to Fully Random Embeddings: Improving Neural Machine Translation with Compact Word Embedding Tables Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we analyze the impact and utility of such matrices in the context of neural machine translation (NMT). (In this paper, words and subwords are referred to as tokens, and the term embedding refers only to embeddings of inputs.) |
Krtin Kumar; Peyman Passban; Mehdi Rezagholizadeh; Yiu Sing Lau; Qun Liu; | arxiv-cs.CL | 2021-04-17 |
1154 | MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we improve multilingual text-to-text transfer Transformer with translation pairs (mT6). |
ZEWEN CHI et. al. | arxiv-cs.CL | 2021-04-17 |
1155 | “Wikily” Supervised Neural Translation Tailored to Cross-Lingual Tasks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present a simple but effective approach for leveraging Wikipedia for neural machine translation as well as cross-lingual tasks of image captioning and dependency parsing … |
Mohammad Sadegh Rasooli; Chris Callison-Burch; D. Wijaya; | Conference on Empirical Methods in Natural Language … | 2021-04-16 |
1156 | Robust Open-Vocabulary Translation from Visual Text Representations IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Motivated by the robustness of human language processing, we propose the use of visual text representations, which dispense with a finite set of text embeddings in favor of continuous vocabularies created by processing visually rendered text with sliding windows. |
Elizabeth Salesky; David Etter; Matt Post; | arxiv-cs.CL | 2021-04-16 |
1157 | Counter-Interference Adapter for Multilingual Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To accommodate the issue, we propose CIAT, an adapted Transformer model with a small parameter overhead for multilingual machine translation. |
Yaoming Zhu; Jiangtao Feng; Chengqi Zhao; Mingxuan Wang; Lei Li; | arxiv-cs.CL | 2021-04-16 |
1158 | Towards Variable-Length Textual Adversarial Attacks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose variable-length textual adversarial attacks (VL-Attack) and integrate three atomic operations, namely insertion, deletion and replacement, into a unified framework, by introducing and manipulating a special blank token while attacking. |
JUNLIANG GUO et. al. | arxiv-cs.CL | 2021-04-16 |
1159 | Hierarchical Learning for Generation with Long Source Sequences IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We design and study a new Hierarchical Attention Transformer-based architecture (HAT) that outperforms standard Transformers on several sequence to sequence tasks. |
Tobias Rohde; Xiaoxia Wu; Yinhan Liu; | arxiv-cs.CL | 2021-04-15 |
1160 | Improving Gender Translation Accuracy with Filtered Self-Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a gender-filtered self-training technique to improve gender translation accuracy on unambiguously gendered inputs. |
Prafulla Kumar Choubey; Anna Currey; Prashant Mathur; Georgiana Dinu; | arxiv-cs.CL | 2021-04-15 |
1161 | Simultaneous Multi-Pivot Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To solve this issue, we propose multi-pivot translation and apply it to a simultaneous translation setting involving pivot languages. |
Raj Dabre; Aizhan Imankulova; Masahiro Kaneko; Abhisek Chakrabarty; | arxiv-cs.CL | 2021-04-15 |
1162 | I Wish I Would Have Loved This One, But I Didn’t — A Multilingual Dataset for Counterfactual Detection in Product Reviews Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We consider the problem of counterfactual detection (CFD) in product reviews. For this purpose, we annotate a multilingual CFD dataset from Amazon product reviews covering counterfactual statements written in English, German, and Japanese languages. |
James O’Neill; Polina Rozenshtein; Ryuichi Kiryo; Motoko Kubota; Danushka Bollegala; | arxiv-cs.CL | 2021-04-14 |
1163 | I Wish I Would Have Loved This One, But I Didn’t – A Multilingual Dataset for Counterfactual Detection in Product Review IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Counterfactual statements describe events that did not or cannot take place. We consider the problem of counterfactual detection (CFD) in product reviews. For this purpose, we … |
James O’Neill; Polina Rozenshtein; Ryuichi Kiryo; Motoko Kubota; D. Bollegala; | Conference on Empirical Methods in Natural Language … | 2021-04-14 |
1164 | Family of Origin and Family of Choice: Massively Parallel Lexiconized Iterative Pretraining for Severely Low Resource Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To translate named entities correctly, we build a massive lexicon table for 2,939 Bible named entities in 124 source languages, and include many that occur once and cover more than 66 severely low resource languages. |
Zhong Zhou; Alex Waibel; | arxiv-cs.CL | 2021-04-12 |
1165 | Backtranslation Feedback Improves User Confidence in MT, Not Quality Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we describe an experiment on outbound translation from English to Czech and Estonian. |
VILÉM ZOUHAR et. al. | arxiv-cs.CL | 2021-04-12 |
1166 | Sentiment-based Candidate Selection for NMT Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Grounded in the observation that UGC features highly idiomatic, sentiment-charged language, we propose a decoder-side approach that incorporates automatic sentiment scoring into the MT candidate selection process. |
Alex Jones; Derry Tanti Wijaya; | arxiv-cs.CL | 2021-04-10 |
1167 | Extended Parallel Corpus for Amharic-English Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper describes the acquisition, preprocessing, segmentation, and alignment of an Amharic-English parallel corpus. |
Andargachew Mekonnen Gezmu; Andreas Nürnberger; Tesfaye Bayu Bati; | arxiv-cs.CL | 2021-04-08 |
1168 | Design and Implementation of English To Yorùbá Verb Phrase Machine Translation System Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We aim to develop an English-to-Yoruba machine translation system which can translate English verb phrase text to its Yoruba equivalent. Words from both the source language and the target language were collected for the verb phrase group in the home domain. |
Benjamin Ajibade; Safiriyu Eludiora; | arxiv-cs.CL | 2021-04-08 |
1169 | Interpreting Verbal Metaphors By Paraphrasing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we interpret metaphors with BERT and WordNet hypernyms and synonyms in an unsupervised manner, showing that our method significantly outperforms the state-of-the-art baseline. |
Rui Mao; Chenghua Lin; Frank Guerin; | arxiv-cs.CL | 2021-04-07 |
1170 | AI4D — African Language Program Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work details the AI4D – African Language Program, a 3-part project that 1) incentivised the crowd-sourcing, collection and curation of language datasets through an online quantitative and qualitative challenge, 2) supported research fellows for a period of 3-4 months to create datasets annotated for NLP tasks, and 3) hosted competitive Machine Learning challenges on the basis of these datasets. |
KATHLEEN SIMINYU et. al. | arxiv-cs.CL | 2021-04-06 |
1171 | IndT5: A Text-to-Text Transformer for 10 Indigenous Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce IndT5, the first Transformer language model for Indigenous languages. |
El Moatez Billah Nagoudi; Wei-Rui Chen; Muhammad Abdul-Mageed; Hasan Cavusogl; | arxiv-cs.CL | 2021-04-04 |
1172 | Sampling and Filtering of Neural Machine Translation Distillation Data Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The highest-scoring hypothesis of the teacher model is commonly used to train a new model (student). |
Vilém Zouhar; | arxiv-cs.CL | 2021-04-01 |
1173 | Many-to-English Machine Translation Tools, Data, and Pretrained Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we present useful tools for machine translation research: MTData, NLCodec, and RTG. |
Thamme Gowda; Zhao Zhang; Chris A Mattmann; Jonathan May; | arxiv-cs.CL | 2021-04-01 |
1174 | Context Based Machine Translation with Recurrent Neural Network for English–Amharic Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View |
Yeabsira Asefa Ashengo; Rosa Tsegaye Aga; S. Abebe; | Machine Translation | 2021-04-01 |
1175 | Low-Resource Neural Machine Translation for Southern African Languages IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Motivated by this challenge we compare zero-shot learning, transfer learning and multilingual learning on three Bantu languages (Shona, isiXhosa and isiZulu) and English. |
Evander Nyoni; Bruce A. Bassett; | arxiv-cs.CL | 2021-04-01 |
1176 | Zero-Shot Language Transfer Vs Iterative Back Translation for Unsupervised Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work focuses on comparing different solutions for machine translation on low resource language pairs, namely, with zero-shot transfer learning and unsupervised machine translation. |
Aviral Joshi; Chengzhi Huang; Har Simrat Singh; | arxiv-cs.CL | 2021-03-31 |
1177 | An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present a case study of Tigrinya where we investigate several back-translation methods to generate synthetic source sentences. |
Lidia Kidane; Sachin Kumar; Yulia Tsvetkov; | arxiv-cs.CL | 2021-03-30 |
1178 | Autocorrect in The Process of Translation — Multi-task Learning Improves Dialogue Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we conduct a deep analysis of a dialogue corpus and summarize three major issues on dialogue translation, including pronoun dropping, punctuation dropping, and typos. To properly evaluate the performance, we propose a manually annotated dataset with 1,931 Chinese-English parallel utterances from 300 dialogues as a benchmark testbed for dialogue translation. |
Tao Wang; Chengqi Zhao; Mingxuan Wang; Lei Li; Deyi Xiong; | arxiv-cs.CL | 2021-03-30 |
1179 | Evaluating The Morphosyntactic Well-formedness of Generated Texts Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose L’AMBRE — a metric to evaluate the morphosyntactic well-formedness of text using its dependency parse and morphosyntactic rules of the language. |
ADITHYA PRATAPA et. al. | arxiv-cs.CL | 2021-03-30 |
1180 | Autocorrect in The Process of Translation — Multi-task Learning Improves Dialogue Machine Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Automatic translation of dialogue texts is a much needed demand in many real life scenarios. However, the currently existing neural machine translation delivers unsatisfying … |
Tao Wang; Chengqi Zhao; Mingxuan Wang; Lei Li; Deyi Xiong; | ArXiv | 2021-03-30 |
1181 | Unsupervised Machine Translation On Dravidian Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we focus on unsupervised translation between English and Kannada, a low resource Dravidian language. |
Sai Koneru; Danni Liu; Jan Niehues; | arxiv-cs.CL | 2021-03-29 |
1182 | English-Twi Parallel Corpus for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a parallel machine translation training corpus for English and Akuapem Twi of 25,421 sentence pairs. |
PAUL AZUNRE et. al. | arxiv-cs.CL | 2021-03-29 |
1183 | PENELOPIE: Enabling Open Information Extraction for The Greek Language Through Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper we present our submission for the EACL 2021 SRW; a methodology that aims at bridging the gap between high and low-resource languages in the context of Open Information Extraction, showcasing it on the Greek language. |
Dimitris Papadopoulos; Nikolaos Papadakis; Nikolaos Matsatsinis; | arxiv-cs.CL | 2021-03-28 |
1184 | Evaluation of English–Slovak Neural and Statistical Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This study is focused on the comparison of phrase-based statistical machine translation (SMT) systems and neural machine translation (NMT) systems using automatic metrics for … |
Lucia Benkova; Dasa Munková; Ľubomír Benko; M. Munk; | Applied Sciences | 2021-03-25 |
1185 | Low-Resource Machine Translation Training Curriculum Fit for Low-Resource Languages Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We conduct an empirical study of neural machine translation (NMT) for truly low-resource languages, and propose a training curriculum fit for cases when both parallel training data and compute resource are lacking, reflecting the reality of most of the world’s languages and the researchers working on these languages. |
GARRY KUWANTO et. al. | arxiv-cs.CL | 2021-03-24 |
1186 | Repairing Pronouns in Translation with BERT-Based Post-Editing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We investigate the severity of this pronoun issue, showing that (1) in some domains, pronoun choice can account for more than half of an NMT system’s errors, and (2) pronouns have a disproportionately large impact on perceived translation quality. |
Reid Pryzant; | arxiv-cs.CL | 2021-03-23 |
1187 | Monolingual and Parallel Corpora for Kangri Low Resource Language IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we present a dataset for Kangri (ISO 639-3: xnr), a low-resource endangered Himachali language listed by the United Nations Educational, Scientific and Cultural Organization (UNESCO). |
Shweta Chauhan; Shefali Saxena; Philemon Daniel; | arxiv-cs.CL | 2021-03-22 |
1188 | Dependency Graph-to-String Statistical Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present graph-based translation models which translate source graphs into target strings. |
Liangyou Li; Andy Way; Qun Liu; | arxiv-cs.CL | 2021-03-20 |
1189 | The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper evaluates the performance of several modern subword segmentation methods in a low-resource neural machine translation setting. |
Jonne Sälevä; Constantine Lignos; | arxiv-cs.CL | 2021-03-20 |
1190 | Congolese Swahili Machine Translation for Humanitarian Response Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper we describe our efforts to make a bidirectional Congolese Swahili (SWC) to French (FRA) neural machine translation system with the motivation of improving humanitarian translation workflows. For training, we created a 25,302-sentence general domain parallel corpus and combined it with publicly available data. |
Alp Öktem; Eric DeLuca; Rodrigue Bashizi; Eric Paquin; Grace Tang; | arxiv-cs.CL | 2021-03-19 |
1191 | Gumbel-Attention for Multi-modal Machine Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a novel Gumbel-Attention for multi-modal machine translation, which selects the text-related parts of the image features. |
Pengbo Liu; Hailong Cao; Tiejun Zhao; | arxiv-cs.CL | 2021-03-16 |
1192 | Towards The Evaluation of Automatic Simultaneous Speech Translation from A Communicative Perspective Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present the results of an experiment aimed at evaluating the quality of a real-time speech translation engine by comparing it to the performance of professional simultaneous interpreters. |
Claudio Fantinuoli; Bianca Prandi; | arxiv-cs.CL | 2021-03-15 |
1193 | The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present MENYO-20k, the first multi-domain parallel corpus with a special focus on clean orthography for Yorùbá–English with standardized train-test splits for benchmarking. We provide several neural MT benchmarks and compare them to the performance of popular pre-trained (massively multilingual) MT models both for the heterogeneous test set and its subdomains. |
DAVID I. ADELANI et. al. | arxiv-cs.CL | 2021-03-15 |
1194 | Crowdsourced Phrase-Based Tokenization for Low-Resourced Neural Machine Translation: The Case of Fon Language Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, using Fon language as a case study, we revisit standard tokenization methods and introduce Word-Expressions-Based (WEB) tokenization, a human-involved super-words tokenization strategy to create a better representative vocabulary for training. |
Bonaventure F. P. Dossou; Chris C. Emezue; | arxiv-cs.CL | 2021-03-14 |
1195 | Visual Cues and Error Correction for Translation Robustness Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we focus on three types of realistic noise that are commonly generated by humans and introduce the idea of visual context to improve translation robustness for noisy texts. |
Zhenhao Li; Marek Rei; Lucia Specia; | arxiv-cs.CL | 2021-03-12 |
1196 | Learning Policies for Multilingual Training of Neural Machine Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose two simple search based curricula — orderings of the multilingual training data — which help improve translation performance in conjunction with existing techniques such as fine-tuning. |
Gaurav Kumar; Philipp Koehn; Sanjeev Khudanpur; | arxiv-cs.CL | 2021-03-11 |
1197 | Learning Feature Weights Using Reward Modeling for Denoising Parallel Corpora Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work presents an alternative approach which learns weights for multiple sentence-level features. |
Gaurav Kumar; Philipp Koehn; Sanjeev Khudanpur; | arxiv-cs.CL | 2021-03-11 |
1198 | Bilingual Dictionary-based Language Model Pretraining for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To alleviate the need for expensive parallel corpora by TLM, in this work, we incorporate the translation information from dictionaries into the pretraining process and propose a novel Bilingual Dictionary-based Language Model (BDLM). |
Yusen Lin; Jiayong Lin; Shuaicheng Zhang; Haoying Dai; | arxiv-cs.CL | 2021-03-11 |
1199 | Towards Continual Learning for Multilingual Machine Translation Via Vocabulary Substitution IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a straightforward vocabulary adaptation scheme to extend the language capacity of multilingual machine translation models, paving the way towards efficient continual learning for multilingual machine translation. |
Xavier Garcia; Noah Constant; Ankur P. Parikh; Orhan Firat; | arxiv-cs.CL | 2021-03-11 |
1200 | Weather GAN: Multi-Domain Weather Translation Using Generative Adversarial Networks IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, a new task is proposed, namely, weather translation, which refers to transferring weather conditions of the image from one category to another. |
Xuelong Li; Kai Kou; Bin Zhao; | arxiv-cs.CV | 2021-03-09 |
1201 | Translating The Unseen? Yoruba-English MT in Low-Resource, Morphologically-Unmarked Settings Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we perform fine-grained analysis on how an SMT system compares with two NMT systems (BiLSTM and Transformer) when translating bare nouns in Yorùbá into English. |
Ife Adebara; Muhammad Abdul-Mageed; Miikka Silfverberg; | arxiv-cs.CL | 2021-03-06 |
1202 | Hierarchical Transformer for Multilingual Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we test this idea using the Transformer architecture and show that despite the success in previous work there are problems inherent to training such hierarchical models. |
Albina Khusainova; Adil Khan; Adín Ramírez Rivera; Vitaly Romanov; | arxiv-cs.CL | 2021-03-05 |
1203 | Addressing Limited Vocabulary and Long Sentences Constraints in English–Arabic Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Safae Berrichi; A. Mazroui; | Arabian Journal for Science and Engineering | 2021-03-02 |
1204 | Multichannel LSTM-CNN for Telugu Technical Domain Identification Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we proposed the Multichannel LSTM-CNN methodology for Technical Domain Identification for Telugu. |
Sunil Gundapu; Radhika Mamidi; | arxiv-cs.CL | 2021-02-24 |
1205 | Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For that, we propose a multimodal approach to simultaneous machine translation using reinforcement learning, with strategies to integrate visual and textual information in both the agent and the environment. |
JULIA IVE et. al. | arxiv-cs.CL | 2021-02-22 |
1206 | Machine Translation Customization Via Automatic Training Data Selection from The Web Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We describe an approach for customizing MT systems on specific domains by selecting data similar to the target customer data to train neural translation models. |
Thuy Vu; Alessandro Moschitti; | arxiv-cs.CL | 2021-02-19 |
1207 | Sparsely Factored Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a method suited for such a case, showing large improvements in out-of-domain data, and comparable quality for the in-domain data. |
Noe Casas; Jose A. R. Fonollosa; Marta R. Costa-jussà; | arxiv-cs.CL | 2021-02-17 |
1208 | Meta Back-translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel method to generate pseudo-parallel data from a pre-trained back-translation model. |
Hieu Pham; Xinyi Wang; Yiming Yang; Graham Neubig; | arxiv-cs.CL | 2021-02-15 |
1209 | Crowdsourcing Parallel Corpus for English-Oromo Neural Machine Translation Using Community Engagement Platform Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The paper deals with implementing a translation of English to Afaan Oromo and vice versa using Neural Machine Translation. |
SISAY CHALA et. al. | arxiv-cs.AI | 2021-02-15 |
1210 | Improving Zero-shot Neural Machine Translation on Language-specific Encoders-Decoders Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we study zero-shot translation using language-specific encoders-decoders. |
JUNWEI LIAO et. al. | arxiv-cs.CL | 2021-02-12 |
1211 | Continuous Learning in Neural Machine Translation Using Bilingual Dictionaries IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we proposed an evaluation framework to assess the ability of neural machine translation to continuously learn new phrases. |
Jan Niehues; | arxiv-cs.CL | 2021-02-12 |
1212 | Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Experiments on three translation directions show that by fine-tuning from FAT-MLM, our proposed speech translation models substantially improve translation quality by up to +5.9 BLEU. |
Renjie Zheng; Junkun Chen; Mingbo Ma; Liang Huang; | arxiv-cs.CL | 2021-02-10 |
1213 | Guiding Non-Autoregressive Neural Machine Translation Decoding with Reordering Information IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this problem, we propose a novel NAT framework ReorderNAT which explicitly models the reordering information to guide the decoding of NAT. |
Qiu Ran; Yankai Lin; Peng Li; Jie Zhou; | aaai | 2021-02-09 |
1214 | Multilingual Transfer Learning for QA Using Translation As Data Augmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we explore strategies that improve cross-lingual transfer by bringing the multilingual embeddings closer in the semantic space. |
Mihaela Bornea; Lin Pan; Sara Rosenthal; Radu Florian; Avirup Sil; | aaai | 2021-02-09 |
1215 | Accelerating Neural Machine Translation with Partial Word Embedding Compression Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Partial Vector Quantization (P-VQ) for NMT models, which can both compress the word embedding matrix and accelerate word probability prediction in the softmax layer. |
Fan Zhang; Mei Tu; Jinyao Yan; | aaai | 2021-02-09 |
1216 | Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work shows the first attempt of a source-target bilingual syntactic alignment approach SyntAligner by mutual information maximization-based self-supervised neural deep modeling. |
Tianfu Zhang; Heyan Huang; Chong Feng; Longbing Cao; | aaai | 2021-02-09 |
1217 | Empirical Regularization for Synthetic Sentence Pairs in Unsupervised Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we empirically study the core training procedure of UNMT to analyze the synthetic sentence pairs obtained from back-translation. |
Xi Ai; Bin Fang; | aaai | 2021-02-09 |
1218 | Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Listen-Understand-Translate, (LUT), a unified framework with triple supervision signals to decouple the end-to-end speech-to-text translation task. |
QIANQIAN DONG et. al. | aaai | 2021-02-09 |
1219 | Towards Fully Automated Manga Translation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we make the following four contributions that establish the foundation of manga translation research. |
Ryota Hinami; Shonosuke Ishiwatari; Kazuhiko Yasuda; Yusuke Matsui; | aaai | 2021-02-09 |
1220 | Bridging The Domain Gap: Improve Informal Language Translation Via Counterfactual Domain Adaptation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address this problem, we propose a counterfactual domain adaptation method to better leverage both large-scale source-domain data (formal texts) and small-scale target-domain data (informal texts). |
Ke Wang; Guandan Chen; Zhongqiang Huang; Xiaojun Wan; Fei Huang; | aaai | 2021-02-09 |
1221 | Learning Light-Weight Translation Models from Deep Transformer IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we take a natural step towards learning strong but light-weight NMT systems. |
BEI LI et. al. | aaai | 2021-02-09 |
1222 | Facilitating Terminology Translation with Target Lemma Annotations IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose to train machine translation systems using a source-side data augmentation method that annotates randomly selected source language words with their target language lemmas. |
Toms Bergmanis; Mārcis Pinnis; | arxiv-cs.CL | 2021-01-25 |
1223 | Uncertainty Estimation in Autoregressive Structured Prediction IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A Deep Investigation of Ensemble-based Uncertainty Estimation for Autoregressive ASR and NMT models. |
Andrey Malinin; Mark Gales; | iclr | 2021-01-21 |
1224 | Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We show that the speed disadvantage of the autoregressive baselines used to evaluate non-autoregressive machine translation is overestimated in three aspects: suboptimal layer allocation, insufficient speed measurement, and lack of knowledge distillation. |
Jungo Kasai; Nikolaos Pappas; Hao Peng; James Cross; Noah Smith; | iclr | 2021-01-21 |
1225 | Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we claim that the syntactic and semantic structures among natural language are critical for non-autoregressive machine translation and can further improve the performance. |
Ye Liu; Yao Wan; Jian-Guo Zhang; Wenting Zhao; Philip S. Yu; | arxiv-cs.CL | 2021-01-21 |
1226 | The Impact of Post-editing and Machine Translation on Creativity and Reading Experience IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This article presents the results of a study involving the translation of a fictional story from English into Catalan in three modalities: machine-translated (MT), post-edited (MTPE) and translated without aid (HT). |
Ana Guerberof Arenas; Antonio Toral; | arxiv-cs.CL | 2021-01-15 |
1227 | Context- and Sequence-Aware Convolutional Recurrent Encoder for Neural Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing models use recurrent neural networks to construct both the encoder and decoder modules. |
Ritam Mallick; Seba Susan; Vaibhaw Agrawal; Rizul Garg; Prateek Rawal; | arxiv-cs.CL | 2021-01-11 |
1228 | Topology-Sensitive Neural Architecture Search for Language Modeling Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recently Neural Architecture Search has drawn interest from researchers because of its ability to learn neural network architectures from data automatically. The differentiable … |
Quan Du; Nuo Xu; Yinqiao Li; Tong Xiao; Jingbo Zhu; | IEEE Access | 2021-01-01 |
1229 | Neural Machine Translation for Turkish to English Using Deep Learning Related Papers Related Patents Related Grants Related Venues Related Experts View |
Fatih Balki; Hilmi Demirhan; Salih Sarp; | Digital Interaction and Machine Intelligence | 2021-01-01 |
1230 | The Solution of The Problem of Unknown Words Under Neural Machine Translation of The Kazakh Language Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The paper proposes a solution to the problem of unknown words for neural machine translation (NMT). The proposed solution is shown by the example of NMT of the Kazakh-English … |
Aliya Turganbayeva; Ualsher Tukeyev; | Journal of Information and Telecommunication | 2021-01-01 |
1231 | Machine Translation Quality Assessment of Selected Works of Xiaoping Deng Supported By Digital Humanistic Method Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the development of modern technology, the new technological methods need to be applied timely into translation research and translation quality assessment. As a new research … |
Qing Wang; Xiao Ma; | International Journal of Applied Linguistics and Translation | 2021-01-01 |
1232 | Design and Testing of Automatic Machine Translation System Based on Chinese-English Phrase Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the development of linguistics and the improvement of computer performance, the effect of machine translation is getting better and better, and it is widely used. The … |
Jing Ning; Haidong Ban; | Mobile Information Systems | 2021-01-01 |
1233 | Inflection Rules for Marathi to English in Rule Based Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation is an important application in natural language processing. Machine translation means translation from source language to target language to save the meaning of … |
Namrata G Kharate; Varsha H Patil; | IAES International Journal of Artificial Intelligence | 2021-01-01 |
1234 | Current Status and Directions of Russian-Korean Machine Translation: Focusing on The Quality Study of Translation Software Results According to Formal and Dynamic Equivalence Related Papers Related Patents Related Grants Related Venues Related Experts View |
Na-young Kim; | Journal of Systems and Software | 2021-01-01 |
1235 | Human Evaluations of Machine Translation in An Ethically Charged Situation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Despite the immense influence of machine translation (MT) on cross-cultural communication worldwide, little is known about end users’ predispositions toward MT. Our online … |
Omri Asscher; Ella Glikson; | New Media & Society | 2021-01-01 |
1236 | Pre-Training on Mixed Data for Low-Resource Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The pre-training fine-tuning mode has been shown to be effective for low resource neural machine translation. In this mode, pre-training models trained on monolingual data are … |
Wenbo Zhang; Xiao Li; Yating Yang; Rui Dong; | Inf. | 2021-01-01 |
1237 | Augmenting Training Data for Low-Resource Neural Machine Translation Via Bilingual Word Embeddings and BERT Language Modelling Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large amounts of parallel data in order to build a good-quality machine translation … |
Akshai Ramesh; Haque Usuf Uhana; Venkatesh Balavadhani Parthasarathy; Rejwanul Haque; Andy Way; | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
1238 | Context-Aware Neural Machine Translation for Korean Honorific Expressions Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation (NMT) is one of the text generation tasks which has achieved significant improvement with the rise of deep neural networks. However, language-specific … |
Yongkeun Hwang; Yanghoon Kim; Kyomin Jung; | Electronics | 2021-01-01 |
1239 | Design of English Automatic Translation System Based on Machine Intelligent Translation and Secure Internet of Things Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the rapid development of Internet technology and the development of economic globalization, international exchanges in various fields have become increasingly active, and the … |
Haidong Ban; Jing Ning; | Mob. Inf. Syst. | 2021-01-01 |
1240 | Attention Based Sequence to Sequence Learning for Machine Translation of Low Resourced Indic Languages – A Case of Sanskrit to Hindi Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Deep Learning techniques are powerful in mimicking humans in a particular set of problems. They have achieved a remarkable performance in complex learning tasks. Deep learning … |
Vishvajit Bakarola; Jitendra Nasriwala; | ArXiv | 2021-01-01 |
1241 | Translation Shifts on Reference By Machine Translation in Descriptive Text Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Translation shifts are one strategy for obtaining a high-quality translation. They are also used to resolve absent meaning in the target text. The objectives of this research are to … |
Kammer Tuahman Sipayung; | 2021-01-01 | |
1242 | DIDACTIC ASPECT OF MEASURING TRANSLATION TASK DIFFICULTY Related Papers Related Patents Related Grants Related Venues Related Experts View |
T. Korol; | International Humanitarian University Herald. Philology | 2021-01-01 |
1243 | Translational Equivalence in Statistical Machine Translation or Meaning As Co-occurrence Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we will describe the current state-of-the-art of Statistical Machine Translation (SMT), and reflect on how SMT handles meaning. Statistical Machine Translation is a … |
Lieve Macken; Els Lefever; | Linguistica Antverpiensia, New Series – Themes in … | 2021-01-01 |
1244 | Using Sub-character Level Information for Neural Machine Translation of Logographic Languages Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Logographic and alphabetic languages (e.g., Chinese vs. English) have different writing systems linguistically. Languages belonging to the same writing system usually exhibit more … |
Longtu Zhang; Mamoru Komachi; | Transactions on Asian and Low-Resource Language Information … | 2021-01-01 |
1245 | ON-TRAC’ Systems for The IWSLT 2021 Low-resource Speech Translation and Multilingual Speech Translation Shared Tasks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2021, low-resource speech … |
HANG LE et. al. | International Workshop on Spoken Language Translation | 2021-01-01 |
1246 | KIT’s IWSLT 2021 Offline Speech Translation System Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes KIT’s submission to the IWSLT 2021 Offline Speech Translation Task. We describe a system in both cascaded condition and end-to-end condition. In the cascaded … |
TUAN-NAM NGUYEN et. al. | International Workshop on Spoken Language Translation | 2021-01-01 |
1247 | Free/Open-Source Machine Translation for The Low-Resource Languages of Spain (Invited Talk) Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: While machine translation has historically been rule-based, that is, based on dictionaries and rules written by experts, most present-day machine translation is corpus-based. In … |
Mikel L. Forcada; | 2021-01-01 | |
1248 | Using Dependency-Based Contextualization for Transferring Passive Constructions from English to Spanish Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We hypothesize that parallel corpora as well as machine translation outputs contain many literal translations that are the result of transferring the constructions of the source … |
Pablo Otero; Gorka Labaka Intxauspe; | Procesamiento Del Lenguaje Natural | 2021-01-01 |
1249 | UoB at ProfNER 2021: Data Augmentation for Classification Using Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the participation of the UoB-NLP team in the ProfNER-ST shared subtask 7a. The task was aimed at detecting the mention of professions in social media text. … |
Frances Adriana Laureano De Leon; Harish Tayyar Madabushi; Mark Lee; | 2021-01-01 | |
1250 | Development of A Model and Software Solution for The Problem of Determining Unknown Words in Post-editing Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation is the technology of consecutive translation of texts from one language to another by a computer program. As a result of machine translation, there are always … |
D. R. Rakhimova; N. M. Pazylkhan; A. A. Kulzhanova; Zh.G. Alen; | 2021-01-01 | |
1251 | Applying Machine Translation Methods in The Problem of Automatic Text Correction Related Papers Related Patents Related Grants Related Venues Related Experts View |
Wojciech Jarmosz; | 2021-01-01 | |
1252 | A Comprehensive Survey on Machine Translation for English, Hindi and Sanskrit Languages IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Transforming text from one language to another by using computer systems automatically or with little human intervention is known as a Machine Translation System (MTS). Divergence … |
Seema Bawa; Munish Kumar; | Journal of Ambient Intelligence and Humanized Computing | 2021-01-01 |
1253 | A Study of English-Indonesian Neural Machine Translation with Attention (Seq2Seq, ConvSeq2Seq, RNN, and MHA): A Comparative Study of NMT on English-Indonesian Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, Neural Machine Translation (NMT) with attention mechanisms has emerged in research and industry. This study discusses the essentials of NMT (Seq2Seq, … |
Diyah Puspitaningrum; | 6th International Conference on Sustainable Information … | 2021-01-01 |
1254 | Robust Cross-lingual Task-oriented Dialogue Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Cross-lingual dialogue systems are increasingly important in e-commerce and customer service due to the rapid progress of globalization. In real-world system deployment, machine … |
2021-01-01 | ||
1255 | Human Evaluation of Three Machine Translation Systems: from Quality to Attitudes By Professional Translators Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article aims to compare three machine translation systems with a focus on human evaluation. The systems under analysis are a domain-adapted statistical machine translation … |
Anna Fernández i Torné; Anna Matamala; | 2021-01-01 | |
1256 | MiSS: An Assistant for Multi-Style Simultaneous Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we present MiSS, an assistant for multi-style simultaneous translation. Our proposed translation system has five key features: highly accurate translation, … |
Zuchao Li; Kevin Parnow; M. Utiyama; E. Sumita; Hai Zhao; | Conference on Empirical Methods in Natural Language … | 2021-01-01 |
1257 | NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes NAIST’s system for the English-to-Japanese Simultaneous Text-to-text Translation Task in IWSLT 2021 Evaluation Campaign. Our primary submission is based on … |
RYO FUKUDA et. al. | 2021-01-01 | |
1258 | Towards Achieving A Delicate Blending Between Rule-based Translator and Neural Machine Translator IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Popular translators such as Google, Bing, etc., perform quite well when translating among the popular languages such as English, French, etc.; however, they make elementary … |
Md. Adnanul Islam; | Neural Comput. Appl. | 2021-01-01 |
1259 | Unsupervised Statistical Text Simplification IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most recent approaches for Text Simplification (TS) have drawn on insights from machine translation to learn simplification rewrites from the monolingual parallel corpus of … |
Jipeng Qiang; Xindong Wu; | IEEE Transactions on Knowledge and Data Engineering | 2021-01-01 |
1260 | A Novel Hybrid Approach to Improve Neural Machine Translation Decoding Using Phrase-Based Statistical Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Phrase-based models are among the best performing statistical machine translation (SMT) systems. These systems make translations phrase-by-phrase at a time. The decoding process … |
Emre Satir; Hasan Bulut; | 2021 International Conference on INnovations in Intelligent … | 2021-01-01 |
1261 | Neural Machine Translation 2020, By Philipp Koehn, Cambridge, Cambridge University Press, ISBN 978-1-108-49732-9, Pages 393 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural Machine Translation delivers a thorough and well-structured walk through the core concepts of the field. The book is primarily aimed at students who will want to go on to … |
Alexandra Birch; | Natural Language Engineering | 2021-01-01 |
1262 | A Machine Translation Mechanism of Brazilian Portuguese to Libras with Syntactic-semantic Adequacy Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Deaf people communicate naturally using visual-spatial languages, called sign languages (SL). Although SLs are recognized as a language in many countries, the problems faced by … |
Manuella Aschoff C. B. Lima; Tiago Maritan U. de Araújo; Rostand E. O. Costa; Erickson S. Oliveira; | Natural Language Engineering | 2021-01-01 |
1263 | Towards A Better Integration of Fuzzy Matches in Neural Machine Translation Through Data Augmentation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: We identify a number of aspects that can boost the performance of Neural Fuzzy Repair (NFR), an easy-to-implement method to integrate translation memory matches and neural machine … |
Arda Tezcan; Bram Bulté; Bram Vanroy; | Informatics | 2021-01-01 |
1264 | Bilingual Subword Segmentation for Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper proposed a new subword segmentation method for neural machine translation, “Bilingual Subword Segmentation,” which tokenizes sentences to minimize the difference … |
Hiroyuki Deguchi; Masao Utiyama; Akihiro Tamura; Takashi Ninomiya; Eiichiro Sumita; | | 2021-01-01 |
1265 | Evolving Transformer Architecture for Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The transformer models have achieved great success on neural machine translation tasks in recent years. However, the hyper-parameters of the transformer are often manually … |
Ben Feng; Dayiheng Liu; Yanan Sun; | Proceedings of the Genetic and Evolutionary Computation … | 2021-01-01 |
1266 | Neural Machine Translation for Amharic-English Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Andargachew Mekonnen Gezmu; Andreas Nürnberger; Tesfaye Bayu Bati; | | 2021-01-01 |
1267 | GX at SemEval-2021 Task 2: BERT with Lemma Information for MCL-WiC Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: This paper presents the GX system for the Multilingual and Cross-lingual Word-in-Context Disambiguation (MCL-WiC) task. The purpose of the MCL-WiC task is to tackle the challenge … |
Wanying Xie; | | 2021-01-01 |
1268 | Machine Translation: Its Typology, Advantages and Disadvantages Related Papers Related Patents Related Grants Related Venues Related Experts View |
Halyna Veselovska; Svitlana Radetska; | | 2021-01-01 |
1269 | NLPHut’s Participation at WAT2021 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper provides the description of shared tasks to the WAT 2021 by our team “NLPHut”. We have participated in the English→Hindi Multimodal translation task, English→Malayalam … |
SHANTIPRIYA PARIDA et. al. | Workshop on Asian Translation | 2021-01-01 |
1270 | Online Learning Over Time in Adaptive Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Adaptive Machine Translation purports to dynamically include user feedback to improve translation quality. In a post-editing scenario, user corrections of machine translation … |
Thierry Etchegoyhen; David Ponce; Harritxu Gete Ugarte; Victor Ruiz Gómez; | | 2021-01-01 |
1271 | Translation of Chinese Traditional Literature Classics Under The Background of Informationization Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the continuous development of modern information technology, all aspects of people’s lives have been greatly affected, and the field of education is also facing unprecedented … |
Xin Wang; | 2021 2nd International Conference on Computers, Information … | 2021-01-01 |
1272 | Comparative Analysis of Language Translation and Detection System Using Machine Learning Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Words are the meaty component which can be expressed through speech, writing or signals. It is important that the actual message or meaning of the words sent must … |
Aishwarya R. Verma; | International Journal for Research in Applied Science and … | 2021-01-01 |
1273 | PhraseAttn: Dynamic Slot Capsule Networks for Phrase Representation in Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Word representation plays a vital role in most Natural Language Processing systems, especially for Neural Machine Translation. It tends to capture semantic and similarity between … |
Binh Nguyen; Binh Van Le; Long H.B. Nguyen; Dien Dinh; | Journal of Intelligent & Fuzzy Systems | 2021-01-01 |
1274 | Portuguese Neural Text Simplification Using Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
T. LIMA et. al. | Brazilian Conference on Intelligent Systems | 2021-01-01 |
1275 | An Overview of The Basic NLP Resources Towards Building The Assamese-English Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine Translation (MT) is the process of automatically converting one natural language into another, preserving the exact meaning of the input text to the output text. It is one … |
Nibedita Roy; Apurbalal Senapati; | Proceedings of Intelligent Computing and Technologies … | 2021-01-01 |
1276 | Various Approaches of Machine Translation for Marathi to English Language Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine Translation (MT) is a generic term for computerised systems that generate translations from one natural language to another, with or without human intervention. Text may … |
Nilesh Shirsath; Aniruddha Velankar; Ranjeet Patil; Shilpa Shinde; | ITM Web of Conferences | 2021-01-01 |
1277 | The USYD-JD Speech Translation System for IWSLT2021 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the University of Sydney & JD’s joint submission of the IWSLT 2021 low resource speech translation task. We participated in the Swahili->English direction and … |
Liang Ding; Di Wu; Dacheng Tao; | | 2021-01-01 |
1278 | Translating Idioms Using Paraphrasing, Machine Translation and Rescoring Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Idioms are rich multi-word expressions that can be found in many works of literature. The meaning of most idioms cannot be deduced literally. This makes translating idioms … |
TIEN-PING TAN et. al. | | 2021-01-01 |
1279 | Different Processes for Translating Expressive Versus Informative Texts? A Computer-assisted Study of Professionals’ English-Chinese Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Jianwei Zheng; Wenjun Fan; | Digit. Scholarsh. Humanit. | 2021-01-01 |
1280 | Assessing Human Post-Editing Efforts to Compare The Performance of Three Machine Translation Engines for English to Russian Translation of Cochrane Plain Language Health Information: Results of A Randomised Comparison Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Cochrane produces independent research to improve healthcare decisions. It translates its research summaries into different languages to enable wider access, relying largely on … |
Liliya Eugenevna Ziganshina; Ekaterina V. Yudina; Azat I. Gabdrakhmanov; Juliane Ried; | Informatics | 2021-01-01 |
1281 | Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The non-autoregressive models have boosted the efficiency of neural machine translation through parallelized decoding at the cost of effectiveness, when comparing with the … |
Ye Liu; Yao Wan; Jian-Guo Zhang; Wenting Zhao; Philip S. Yu; | | 2021-01-01 |
1282 | Post-editing Guidelines for Korean-English Machine Translation of Informative Texts Related Papers Related Patents Related Grants Related Venues Related Experts View |
Kunyoung Park; | | 2021-01-01 |
1283 | On Knowledge Distillation for Translating Erroneous Speech Transcriptions Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recent studies argue that knowledge distillation is promising for speech translation (ST) using end-to-end models. In this work, we investigate the effect of knowledge … |
Ryo Fukuda; Katsuhito Sudoh; Satoshi Nakamura; | | 2021-01-01 |
1284 | An Investigation of Machine Translation Output Quality and The Influencing Factors of Source Texts IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The use of machine translation (MT) in the academic context has increased in recent years. Hence, language teachers have found it difficult to ignore MT, which has led to some … |
Sangmin-Michelle Lee; | ReCALL | 2021-01-01 |
1285 | Tag Assisted Neural Machine Translation of Film Subtitles Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: We implemented a neural machine translation system that uses automatic sequence tagging to improve the quality of translation. Instead of operating on unannotated sentence pairs, … |
Aren Siekmeier; WonKee Lee; Hongseok Kwon; Jong-Hyeok Lee; | | 2021-01-01 |
1286 | Improving Lexical-Constraint-Aware Machine Translation By Factoring Encoders Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Existing lexically constrained machine translation employs data augmentation and incorporates lexical constraints during decoding period, which requires a bilingual dictionary or … |
Weiyuan Zeng; Cong Liu; | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
1287 | Hindi Chhattisgarhi Machine Translation System Using Statistical Approach Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine Translation is a subfield of Natural language Processing (NLP) which uses to translate source language to target language. In this paper an attempt has been made to make a … |
Vikas Pandey; Dr. M.V. Padmavati; Dr. Ramesh Kumar; | | 2021-01-01 |
1288 | More Data Is Better Only to Some Level, After Which It Is Harmful: Profiling Neural Machine Translation Self-learning with Back-Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation needs a very large volume of data to unfold its potential. Self-learning with back-translation became widely adopted to address this data scarceness … |
Rodrigo Santos; João Silva; António Branco; | | 2021-01-01 |
1289 | Semantically Constrained Document-Level Chinese-Mongolian Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: By using document-level contextual information, document-level neural machine translation can achieve better results than ordinary machine translation, but traditional … |
Haoran Li; Hongxu Hou; Nier Wu; Xiaoning Jia; Xin Chang; | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
1290 | Improving German Image Captions Using Machine Translation and Transfer Learning Related Papers Related Patents Related Grants Related Venues Related Experts View |
Rajarshi Biswas; Michael Barz; Mareike Hartmann; Daniel Sonntag; | | 2021-01-01 |
1291 | Utilizing Machine Translation Systems to Generate Word Lists for Learning Vocabulary in English Related Papers Related Patents Related Grants Related Venues Related Experts View |
Jin-Ha Woo; Heeyoul Choi; | | 2021-01-01 |
1292 | Machine Translation Systems and Quality Assessment: A Systematic Review IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Nowadays, in the globalised context in which we find ourselves, language barriers can still be an obstacle to accessing information. On occasions, it is impossible to satisfy the … |
Irene Rivera-Trigueros; | | 2021-01-01 |
1293 | Progress in Machine Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: After more than 70 years of evolution, great achievements have been made in machine translation. Especially in recent years, translation quality has been greatly improved with the … |
Haifeng Wang; Hua Wu; Zhongjun He; Liang Huang; Kenneth Ward Church; | Engineering | 2021-01-01 |
1294 | ATLASLang NMT: Arabic Text Language Into Arabic Sign Language Neural Machine Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: ATLASLang is a machine translation system from Arabic text language into Arabic sign language (ArSL). The first version of the system (Brour and Benabbou, 2019) is based on two … |
Mourad Brour; Abderrahim Benabbou; | J. King Saud Univ. Comput. Inf. Sci. | 2021-01-01 |
1295 | Construct-Extract: An Effective Model for Building Bilingual Corpus to Improve English-Myanmar Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: When dealing with low resource languages such as Myanmar, using additional pseudo parallel data for training machine translation systems is often an effective approach. As a … |
May Myo Zin; Teeradaj Racharak; Nguyen Minh Le; | | 2021-01-01 |
1296 | Deep Learning Methods for Sign Language Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Many sign languages are bona fide natural languages with grammatical rules and lexicons hence can benefit from machine translation methods. Similarly, since sign language is a … |
TEJASWINI ANANTHANARAYANA et. al. | ACM Transactions on Accessible Computing (TACCESS) | 2021-01-01 |
1297 | Efficiency of Machine Translation in Urban Discourse Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article aims to analyze the use of Yandex.Translate, an online machine translation system, in translating urban discourse texts on the web. The authors use integrative … |
Svetlana Korolkova; Anna Novozhilova; | Vestnik Volgogradskogo gosudarstvennogo universiteta. … | 2021-01-01 |
1298 | English to Bengali Neural Machine Translation Using Global Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural Machine Translation (NMT), increased the accuracy of machine translation and recently become more popular in machine translation research community. An attention mechanism … |
Sheikh Abujar; Abu Kaisar Mohammad Masum; Abhishek Bhattacharya; Soumi Dutta; Syed Akhter Hossain; | | 2021-01-01 |
1299 | Japanese–English Conversation Parallel Corpus for Promoting Context-aware Machine Translation Research Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most machine translation (MT) research has focused on sentences as translation units (sentence-level MT), and has achieved acceptable translation quality for sentences where … |
Matīss Rikters; Ryokan Ri; Tong Li; Toshiaki Nakazawa; | Journal of Natural Language Processing | 2021-01-01 |
1300 | English-Marathi Neural Machine Translation Using Local Attention Related Papers Related Patents Related Grants Related Venues Related Experts View |
K. Adi Narayana Reddy; G. Shyam Chandra Prasad; A. Rajashekar Reddy; L. Naveen Kumar; | | 2021-01-01 |
1301 | Translating Sentimental Statements Using Deep Learning Techniques Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Natural Language Processing (NLP) allows machines to know nature languages and helps us do tasks, such as retrieving information, answering questions, text summarization, … |
Yin-Fu Huang; Yi-Hao Li; | Electronics | 2021-01-01 |
1302 | Mintzai-ST: Corpus and Baselines for Basque-Spanish Speech Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The lack of resources to train end-to-end Speech Translation models hinders research and development in the field. Although recent efforts have been made to prepare additional … |
THIERRY ETCHEGOYHEN et. al. | IberSPEECH 2021 | 2021-01-01 |
1303 | Confidence Measures for Interactive Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Confidence Measures (CMs) can be used to estimate the reliability of the words of a hypothesis generated by a machine translation system. In the Interactive-Predictive Machine … |
Ángel Navarro; Francisco Casacuberta; | | 2021-01-01 |
1304 | Tools and Techniques for Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Archana Sachindeo Maurya; Srishti Garg; Promila Bahadur; | Proceedings of Second Doctoral Symposium on Computational … | 2021-01-01 |
1305 | Design of English Translation System Based on Deep Learning Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Intelligent translation software is one of the important tools for people to learn, but the existing intelligent translation system usually needs to be further improved in terms … |
Yang Ting; | Journal of Physics: Conference Series | 2021-01-01 |
1306 | A Comparative Study on The Quality of Translation in Korean-English Translation Using Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Gilja Byun; | The Journal of Mirae English Language and Literature | 2021-01-01 |
1307 | Revealing Translation Techniques Applied in The Translation of Batik Motif Names in See Instagram Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article discusses one of the forms of machine translation, the Instagram translation feature called “see translation”. The research is focused on the translation techniques … |
Dyah Raina Purwaningsih; Ika Maratus Sholikhah; Erna Wardani; | Celt: A Journal of Culture, English Language Teaching & … | 2021-01-01 |
1308 | Empirical Analysis of Performance of MT Systems and Its Metrics for English to Bengali: A Black Box-Based Approach Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: There are numerous use cases of machine translation (MT) systems. Therefore, it has become very important to evaluate the performance of MT which can help researchers design a … |
Goutam Datta; Nisheeth Joshi; Kusum Gupta; | | 2021-01-01 |
1309 | Semantic and Syntactic Information for Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Introducing factors such as linguistic features has long been proposed in machine translation to improve the quality of translations. More recently, factored machine translation … |
Jordi Armengol-Estapé; Marta R. Costa-jussà; | Mach. Transl. | 2021-01-01 |
1310 | Multilingual Translation from Denoising Pre-Training Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recent work demonstrates the potential of training one model for multilingual machine translation. In parallel, denoising pretraining using unlabeled monolingual data as a … |
YUQING TANG et. al. | | 2021-01-01 |
1311 | The BLEU Score for Automatic Evaluation of English to Bangla NMT Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The BLEU (Bilingual Evaluation Understudy) is a very popular technique to evaluate machine translation (MT) systems since long. It mainly exploits the precision to evaluate the … |
Goutam Datta; Nisheeth Joshi; Kusum Gupta; | | 2021-01-01 |
1312 | Findings of The Second Workshop on Automatic Simultaneous Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents the results of the shared task of the 2nd Workshop on Automatic Simultaneous Translation (AutoSimTrans). The task includes two tracks, one for text-to-text … |
Ruiqing Zhang; Chuanqiang Zhang; Zhongjun He; Hua Wu; Haifeng Wang; | | 2021-01-01 |
1313 | Feature-level Incongruence Reduction for Multimodal Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Caption translation aims to translate image annotations (captions for short). Recently, Multimodal Neural Machine Translation (MNMT) has been explored as the essential solution. … |
ZHIFENG LI et. al. | | 2021-01-01 |
1314 | Syntax-Based Attention Masking for Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We present a simple method for extending transformers to source-side trees. We define a number of masks that limit self-attention based on relationships among tree nodes, and we … |
Colin McDonald; David Chiang; | | 2021-01-01 |
1315 | Negation Typology and General Representation Models for Cross-lingual Zero-shot Negation Scope Resolution in Russian, French, and Spanish Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Negation is a linguistic universal that poses difficulties for cognitive and computational processing. Despite many advances in text analytics, negation resolution remains an … |
Anastassia Shaitarova; Fabio Rinaldi; | | 2021-01-01 |
1316 | Intelligent English Translation System Based on Evolutionary Multi-objective Optimization Algorithm IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The difficulty of obtaining the characteristics of the corpus database of neural machine translation is a factor hindering its development. In order to improve the effect of … |
Xin Song; | J. Intell. Fuzzy Syst. | 2021-01-01 |
1317 | Source-side Reordering to Improve Machine Translation Between Languages with Distinct Word Orders Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: English and Hindi have significantly different word orders. English follows the subject-verb-object (SVO) order, while Hindi primarily follows the subject-object-verb (SOV) order. … |
Karunesh Kumar Arora; Shyam Sunder Agrawal; | Transactions on Asian and Low-Resource Language Information … | 2021-01-01 |
1318 | The Effect of Using Machine Translation on Linguistic Features in L2 Writing Across Proficiency Levels and Text Genres IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Many studies that have investigated the educational value of online machine translation (MT) in second language (L2) writing generally report significant improvements after MT … |
Eun Seon Chung; Soojin Ahn; | Computer Assisted Language Learning | 2021-01-01 |
1319 | MuST-C: A Multilingual Corpus for End-to-end Speech Translation IF:4 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: End-to-end spoken language translation (SLT) has recently gained popularity thanks to the advancement of sequence to sequence learning in its two parent tasks: automatic speech … |
ROLDANO CATTONI et. al. | Comput. Speech Lang. | 2021-01-01 |
1320 | Neural Machine Translation in Academic Contexts Related Papers Related Patents Related Grants Related Venues Related Experts View |
Alice Delorme Benites; Fernando Benites; | | 2021-01-01 |
1321 | Statistical and Neural Machine Translation for Manipuri-English On Intelligence Domain Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the development and results of Manipuri-English machine translation system built on an intelligence domain. Manipuri is an under-resourced Tibeto-Burman … |
Laishram Rahul; Loitongbam Sanayai Meetei; H. S. Jayanna; | Lecture Notes in Electrical Engineering | 2021-01-01 |
1322 | Japanese Translation Teaching Corpus Based on Bilingual Non Parallel Data Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, with the development of Internet and intelligent technology, Japanese translation teaching has gradually explored a new teaching mode. Under the guidance of … |
Zheng Guo; Jifeng Zhu; | J. Intell. Fuzzy Syst. | 2021-01-01 |
1323 | Monolingual and Cross-Lingual Intent Detection Without Training Data in Target Languages Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Due to recent DNN advancements, many NLP problems can be effectively solved using transformer-based models and supervised data. Unfortunately, such data is not available in some … |
Jurgita Kapočiūtė-Dzikienė; Askars Salimbajevs; Raivis Skadiņš; | Electronics | 2021-01-01 |
1324 | Application of Quantum Natural Language Processing for Language Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we develop compositional vector-based semantics of positive transitive sentences using quantum natural language processing (Q-NLP) to compare the parametrized … |
Mina Abbaszade; Vahid Salari; Seyed Shahin Mousavi; Mariam Zomorodi; Xujuan Zhou; | IEEE Access | 2021-01-01 |
1325 | Overview of Machine Translation Development Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Access to information is increasingly global, which brings with it the growth in a non-English speaking public and, as such, a demand for tools that allow users to access this … |
Irene Rivera-Trigueros; María-Dolores Olvera-Lobo; Juncal Gutiérrez-Artacho; | | 2021-01-01 |
1326 | Transforming Term Extraction: Transformer-Based Approaches to Multilingual Term Extraction Across Domains Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Automated Term Extraction (ATE), even though well-investigated, continues to be a challenging task. Approaches conventionally extract terms on corpus or document level and the … |
Christian Lang; Lennart Wachowiak; Barbara Heinisch; Dagmar Gromann; | | 2021-01-01 |
1327 | Tutorial Proposal: End-to-End Speech Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Speech translation is the translation of speech in one language typically to text in another, traditionally accomplished through a combination of automatic speech recognition and … |
Jan Niehues; Elizabeth Salesky; Marco Turchi; Matteo Negri; | | 2021-01-01 |
1328 | Japanese Zero Anaphora Resolution Can Benefit from Parallel Texts Through Neural Transfer Learning Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Parallel texts of Japanese and a non-pro-drop language have the potential of improving the performance of Japanese zero anaphora resolution (ZAR) because pronouns dropped in the … |
Masato Umakoshi; Yugo Murawaki; S. Kurohashi; | Conference on Empirical Methods in Natural Language … | 2021-01-01 |
1329 | Advances and Challenges in Unsupervised Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Unsupervised cross-lingual language representation initialization methods, together with mechanisms such as denoising and back-translation, have advanced unsupervised neural … |
Rui Wang; Hai Zhao; | | 2021-01-01 |
1330 | The Inspiration of Effort Model to The Practice Teaching of Simultaneous Translation with Shorthand Typing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the rapid development of science and technology and global integration today, modern information technology plays an irreplaceable role in education. The course of … |
Beilei Chen; | | 2021-01-01 |
1331 | Networked Artificial Intelligence English Translation System Based on An Intelligent Knowledge Base and Translation Method Thereof Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Language translation is often conducted in work and study. Traditional language translation is based on lexical structure analysis. However, natural language is not so … |
Shuping Ren; | Mob. Inf. Syst. | 2021-01-01 |
1332 | Revisiting Dropout: Escaping Pressure for Training Neural Networks with Multiple Costs Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: A common approach to jointly learn multiple tasks with a shared structure is to optimize the model with a combined landscape of multiple sub-costs. However, gradients derived from … |
Sangmin Woo; Kangil Kim; Junhyug Noh; Shin Jong Hun; Seung-Hoon Na; | Electronics | 2021-01-01 |
1333 | Automatic Bilingual Markup Transfer Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: We describe the task of bilingual markup transfer, which involves placing markup tags from a source sentence into a fixed target translation. This task arises in practice when a … |
Thomas Zenkel; Joern Wuebker; John DeNero; | Conference on Empirical Methods in Natural Language … | 2021-01-01 |
1334 | A Study on Automatic Machine Translation Tools: A Comparative Error Analysis Between DeepL and Yandex for Russian-Italian Medical Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The present research is aimed at conducting a study with regard to Russian-Italian medical translation on the current development of two Machine Translation tools that feature … |
Giulia Cambedda; Giorgio Maria Di Nunzio; Viviana Nosilia; | | 2021-01-01 |
1335 | Exploring Multi-stage Information Interactions for Multi-source Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Existing studies for multi-source neural machine translation (NMT) either separately model different source sentences or resort to the conventional single-source NMT by simply … |
ZIYAO LU et. al. | IEEE/ACM Transactions on Audio, Speech, and Language … | 2021-01-01 |
1336 | Experience of Neural Machine Translation Between Indian Languages IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper we explore neural machine translation (NMT) for Indian languages. Reported work on Indian language Statistical Machine Translation (SMT) demonstrated good … |
Shubham Dewangan; Shreya Alva; Nitish Joshi; Pushpak Bhattacharyya; | Machine Translation | 2021-01-01 |
1337 | Research on Literary Intelligent Translation Based on Improved Optimization Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation is widely used in people’s daily life and production, occupying an important position. In order to improve the accuracy of literary intelligent translation, … |
Hongjian Liu; | Journal of Intelligent and Fuzzy Systems | 2021-01-01 |
1338 | Text Identification System for Translation of English Language Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: India is a multilinguistic country. People of different states speak different languages but all Indians are not polyglots. English is called as universal language and Kannada is … |
Sushma Kumari; Snitha Shetty; Aparna Shetty; Saranya Babu; Sharon D’souza; | SSRN Electronic Journal | 2021-01-01 |
1339 | Editing Actions: A Missing Link Between Translation Process Research and Machine Translation Research Related Papers Related Patents Related Grants Related Venues Related Experts View |
Félix do Carmo; | Explorations in Empirical Translation Process Research | 2021-01-01 |
1340 | Reinforced Transformer with Cross-Lingual Distillation for Cross-Lingual Aspect Sentiment Classification Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Though great progress has been made in the Aspect-Based Sentiment Analysis(ABSA) task through research, most of the previous work focuses on English-based ABSA problems, and there … |
Hanqian Wu; Zhike Wang; Feng Qing; Shoushan Li; | Electronics | 2021-01-01 |
1341 | Prolexbase: Une Ontologie Pour Le Traitement Multilingue Des Noms Propres Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Proper names often constitute a problem in translation. This contribution deals with an ontology which represents the basis for a multilingual database of proper names, … |
Thierry Grass; Denis Maurel; Mickaël Tran; | Linguistica Antverpiensia, New Series – Themes in … | 2021-01-01 |
1342 | Improving Zero-shot Neural Machine Translation on Language-specific Encoders-Decoders Abstract: Recently, universal neural machine translation (NMT) with shared encoder-decoder gained good performance on zero-shot translation. Unlike universal NMT, jointly trained … |
JUNWEI LIAO et. al. | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
1343 | Machine Translation in Healthcare |
Barry Haddow; Alexandra Birch; Kenneth Heafield; | | 2021-01-01 |
1344 | Study on Post-editing for Machine Translation of Railway Engineering Texts Abstract: With rapid development of China’s railways, there are more overseas construction projects and technical exchanges in the field of railway engineering, which have generated … |
Yuting Li; Xiuying Lu; | | 2021-01-01 |
1345 | Transformer with Syntactic Position Encoding for Machine Translation Abstract: It has been widely recognized that syntax information can help end-to-end neural machine translation (NMT) systems to achieve better translation. In order to integrate dependency … |
Yikuan Xie; Wenyong Wang; Mingqian Du; Qing He; | | 2021-01-01 |
1346 | Named Entity Translation Method Based on Machine Translation Lexicon IF:3 Abstract: In the context of the rapid development of computer technology, communication between various languages has become increasingly important. Among the research methods of named … |
Panpan Li; Mengxiang Wang; Jian Wang; | Neural Comput. Appl. | 2021-01-01 |
1347 | Post-editing Machine Translation in MateCat: A Classroom Experiment Abstract: Advances in machine translation resulted in an increase of both volume and quality of machine-translated texts. However, machine translation still requires humans to post-edit the … |
Katrin Herget; | 7th International Conference on Higher Education Advances … | 2021-01-01 |
1348 | The Quality of Machine Translation Assessment On Gender Markers Lingual Units Abstract: Machine Translation (MT) is one of the most advanced and elaborate research fields within Translation Technology, the quality of MT output has always been a great concern, and MT … |
Hapni Nurliana H.D Hasibuan; | Lensa: Kajian Kebahasaan, Kesusastraan, dan Budaya | 2021-01-01 |
1349 | Adapting Entities Across Languages and Cultures IF:3 Abstract: How would you explain Bill Gates to a German? He is associated with founding a company in the United States, so perhaps the German founder Carl Benz could stand in for Gates in … |
Denis Peskov; Viktor Hangya; Jordan L. Boyd-Graber; Alexander Fraser; | Conference on Empirical Methods in Natural Language … | 2021-01-01 |
1350 | CoMeT: Towards Code-Mixed Translation Using Parallel Monolingual Sentences Abstract: Code-mixed languages are very popular in multilingual societies around the world, yet the resources lag behind to enable robust systems on such languages. A major contributing … |
DEVANSH GAUTAM et. al. | | 2021-01-01 |
1351 | Unsupervised Text Style Transfer with Content Embeddings Abstract: The style transfer task (here style is used in a broad “authorial” sense with many aspects including register, sentence structure, and vocabulary choice) takes text input and … |
Keith Carlson; Allen Riddell; Daniel Rockmore; | | 2021-01-01 |
1352 | Artificial Intelligence in Machine Translation Technologies Abstract: The capabilities of machine translation are closely related to the improvement of modeling the processes of understanding and generating texts in natural language, which … |
Konstantin Kolin; | Social novelties and Social sciences | 2021-01-01 |
1353 | Variational Multimodal Machine Translation with Underlying Semantic Alignment IF:3 Abstract: Capturing the underlying semantic relationships of sentences is helpful for machine translation. Variational neural machine translation approaches provide an effective way to … |
Xiao Liu; Jing Zhao; Shiliang Sun; Huawen Liu; Hao Yang; | Inf. Fusion | 2021-01-01 |
1354 | Improving Neural Machine Translation Model with Deep Encoding Information IF:3 Abstract: Availability of very high computational power along with the development of deep neural network (DNN) technology has enabled rapid progress of machine translation technology. The … |
Guiduo Duan; Haobo Yang; Ke Qin; Tianxi Huang; | Cogn. Comput. | 2021-01-01 |
1355 | Supervised Visual Attention for Multimodal Neural Machine Translation Abstract: This paper proposed a supervised visual attention mechanism for multimodal neural machine translation (MNMT), trained with constraints based on manual alignments between words in … |
Tetsuro Nishihara; Akihiro Tamura; Takashi Ninomiya; Yutaro Omote; Hideki Nakayama; | | 2021-01-01 |
1356 | Contrastive Learning for Machine Translation Quality Estimation |
HUI HUANG et. al. | | 2021-01-01 |
1357 | Improving The Transformer Translation Model with Back-Translation Abstract: Back-translation has been proved to help improve the translation quality of statistical machine translation (SMT) systems and neural machine translation (NMT) systems. But in the … |
Hailiang Wang; Peng Jin; Jinrong Hu; Lujin Li; Xingyuan Chen; | Advances in Artificial Intelligence and Security | 2021-01-01 |
1358 | Discourse Phenomena in Machine Translation |
Miculicich Werlen; Lesly Sadiht; | | 2021-01-01 |
1359 | English-Vietnamese Machine Translation Using Deep Learning Abstract: Recently, artificial intelligence-based machine translation has been much improved over the traditional methods. A machine translator is very useful for translating text or speech … |
Tuan Nguyen Minh; Phayung Meesad; Huy Cuong Nguyen Ha; | Lecture Notes in Networks and Systems | 2021-01-01 |
1360 | Adaptation of Back-translation to Automatic Post-Editing for Synthetic Data Generation Abstract: Automatic Post-Editing (APE) aims to correct errors in the output of a given machine translation (MT) system. Although data-driven approaches have become prevalent also in the APE … |
WonKee Lee; Baikjin Jung; Jaehun Shin; Jong-Hyeok Lee; | | 2021-01-01 |
1361 | Parallel Corpora Preparation for English-Amharic Machine Translation |
Yohannes Biadgligne; Kamel Smaïli; | | 2021-01-01 |
1362 | Improving Neural Machine Translation Using Gated State Network and Focal Adaptive Attention Network Abstract: The currently predominant token-to-token attention mechanism has demonstrated its ability to capture word dependencies in neural machine translation. This mechanism treats a … |
Li Huang; Wenyu Chen; Yuguo Liu; He Zhang; Hong Qu; | Neural Comput. Appl. | 2021-01-01 |
1363 | Learning to Select Relevant Knowledge for Neural Machine Translation |
JIAN YANG et. al. | | 2021-01-01 |
1364 | Automatic Translation of Spoken English Based on Improved Machine Learning Algorithm IF:3 Abstract: Due to the complexity of English machine translation technology and its broad application prospects, many experts and scholars have invested more energy to analyze it. In view of … |
Lin Lin; Jie Liu; Xuebing Zhang; Xiufang Liang; | J. Intell. Fuzzy Syst. | 2021-01-01 |
1365 | Incorporating Translation Quality Estimation Into Chinese-Korean Neural Machine Translation Abstract: Exposure bias and poor translation diversity are two common problems in neural machine translation (NMT), which are caused by the general use of the teacher forcing strategy for … |
Feiyu Li; Yahui Zhao; Feiyang Yang; Rongyi Cui; | | 2021-01-01 |
1366 | Twin-GAN for Neural Machine Translation Abstract: In recent years, Neural Machine Translation (NMT) has achieved great success, but we can not ignore two important problems. One is the exposure bias caused by the different … |
Jiaxu Zhao; Li Huang; Ruixuan Sun; Liao Bing; Hong Qu; | | 2021-01-01 |
1367 | Low-Resource Machine Translation Based on Asynchronous Dynamic Programming Abstract: Reinforcement learning has been proved to be effective in handling low resource machine translation tasks and different sampling methods of reinforcement learning affect the … |
Xiaoning Jia; Hongxu Hou; Nier Wu; Haoran Li; Xin Chang; | | 2021-01-01 |
1368 | Deep Neural Networks for Multilingual Machine Translation |
Meryem Boukrissa; Fadoua Ataa Allah; | | 2021-01-01 |
1369 | Machine Translation with Pre-specified Target-side Words Using A Semi-autoregressive Model Abstract: We introduce our TMU Japanese-to-English system, which employs a semi-autoregressive model, to tackle the WAT 2021 restricted translation task. In this task, we translate an input … |
Seiichiro Kondo; Aomi Koyama; Tomoshige Kiyuna; Tosho Hirasawa; Mamoru Komachi; | | 2021-01-01 |
1370 | Hybrid Statistical Machine Translation for English-Myanmar: UTYCC Submission to WAT-2021 Abstract: In this paper we describe our submissions to WAT-2021 (Nakazawa et al., 2021) for English-to-Myanmar language (Burmese) task. Our team, ID: “YCC-MT1”, focused on bringing … |
YE KYAW THU et. al. | | 2021-01-01 |
1371 | HUMAN-COMPUTER INTERACTION IN TRANSLATION ACTIVITY: FLUENCY OF MACHINE TRANSLATION Abstract: Digitalization is one of the key distinctive features of modern environment and social life. Nowadays more and more functions are transferred to the artificial mind. How effective … |
KATARINA WELNITZOVA et. al. | | 2021-01-01 |
1372 | Improving Multilingual Neural Machine Translation with Auxiliary Source Languages Abstract: Multilingual neural machine translation models typically handle one source language at a time. However, prior work has shown that translating from multiple source languages … |
Weijia Xu; Yuwei Yin; Shuming Ma; Dongdong Zhang; Haoyang Huang; | Conference on Empirical Methods in Natural Language … | 2021-01-01 |
1373 | Beyond MT Metrics in Specialised Translation: Automated and Manual Evaluation of Machine Translation Output for Freelance Translators and Small LSPs in The Context of EU Documents Abstract: This paper discusses simplified methods of translation evaluation in two seemingly disparate areas: machine translation (MT) technology and translation for EU institutions. It … |
Krzysztof Łoboda; | Beyond Philology An International Journal of Linguistics, … | 2021-01-01 |
1374 | Comparative Analysis of Machine Translation and Human Translation Under The Background of Internet |
Hongxia Dai; | Lecture Notes on Data Engineering and Communications … | 2021-01-01 |
1375 | OCR Error Correction for Vietnamese Handwritten Text Using Neural Machine Translation Abstract: OCR post-processing is an important step for improving the quality of OCR output texts. Long short-term memory (LSTM) is a deep learning model, which has wide-range applications … |
D. Q. Nguyen; A. D. Le; M. N. Phan; P. Kromer; I. Zelinka; | 1ST VAN LANG INTERNATIONAL CONFERENCE ON HERITAGE AND … | 2021-01-01 |
1376 | Machine Translation and Global Research: Towards Improved Machine Translation Literacy in The Scholarly Community. Lynne Bowker and Jairo Buitrago Ciro |
Wei Zhao; | Digit. Scholarsh. Humanit. | 2021-01-01 |
1377 | Comparing Statistical and Neural Machine Translation Performance on Hindi-To-Tamil and English-To-Tamil Abstract: Phrase-based statistical machine translation (PB-SMT) has been the dominant paradigm in machine translation (MT) research for more than two decades. Deep neural MT models have … |
Akshai Ramesh; Venkatesh Balavadhani Parthasarathy; Rejwanul Haque; Andy Way; | | 2021-01-01 |
1378 | Complex Question Answering on Knowledge Graphs Using Machine Translation and Multi-task Learning Abstract: Question answering (QA) over a knowledge graph (KG) is a task of answering a natural language (NL) query using the information stored in KG. In a real-world industrial setting, … |
SAURABH SRIVASTAVA et. al. | | 2021-01-01 |
1379 | Exploring The Effectiveness of Employing Limited Resources for Deep Neural Pairwise Evaluation of Machine Translation Abstract: In this paper, a light resource learning schema, i.e. a schema that depends on limited resources, is introduced, which aims to choose the better translation between two machine … |
Despoina Mouratidis; Katia Lida Kermanidis; | 2021 12th International Conference on Information, … | 2021-01-01 |
1380 | Deep Residual and Deep Dense Attentions in English Chinese Translation Abstract: Neural Machine Translation (NMT) with attention mechanism has achieved impressive improvement for automated translation. However, such models may lose information during … |
Yi-Xing Lin; Kai-Wen Liang; Chih-Hsuan Yang; Jia-Ching Wang; | 2021 IEEE International Conference on Consumer … | 2021-01-01 |
1381 | PECULIARITIES OF TRANSLATION OF ENGLISH LANGUAGE INSTRUCTIONS WITH THE USE OF ADDITIONAL TOOLS IN SDL TRADOS STUDIO AND MEMOQ TRANSLATOR PRO ENVIRONMENTS Abstract: The presented research focuses upon the analysis of additional specific tools (namely translation memory (TM) technologies) of SDL Trados Studio 2017 and MemoQ Translator Pro 2017 … |
Iryna Karamysheva; Roksolyana Nazarchuk; Kateryna Lishnievska; | | 2021-01-01 |
1382 | Document-level Neural Machine Translation with Document Embeddings Abstract: Standard neural machine translation (NMT) is on the assumption of document-level context independent. Most existing document-level NMT methods are satisfied with a smattering … |
Shu Jiang; Hai Zhao; Zuchao Li; Bao-Liang Lu; | ArXiv | 2021-01-01 |
1383 | Accuracy Analysis of Japanese Machine Translation Based on Machine Learning and Image Feature Retrieval Abstract: At present, there are still many deficiencies in Chinese-Japanese machine translation methods, the processing of corpus information is not deep enough, and the translation process … |
Gang Song; | J. Intell. Fuzzy Syst. | 2021-01-01 |
1384 | Optimized Chinese Pronunciation Prediction By Component-Based Statistical Machine Translation Abstract: To eliminate ambiguities in the existing methods to simplify Chinese pronunciation learning, we propose a model that can predict the pronunciation of Chinese characters … |
Shunle Zhu; | J. Inf. Process. Syst. | 2021-01-01 |
1385 | Exploiting Translation Model for Parallel Corpus Mining Abstract: Parallel corpus mining (PCM) is beneficial for many corpus-based natural language processing tasks, e.g., machine translation and bilingual dictionary induction, especially in … |
Chongman Leong; Xuebo Liu; Derek F. Wong; Lidia S. Chao; | IEEE/ACM Transactions on Audio, Speech, and Language … | 2021-01-01 |
1386 | Machine Translation Problems at Discourse Level |
Xiaojun Zhang; | | 2021-01-01 |
1387 | Named Entity Correction in Neural Machine Translation Using The Attention Alignment Map Abstract: Neural machine translation (NMT) methods based on various artificial neural network models have shown remarkable performance in diverse tasks and have become mainstream for … |
Jangwon Lee; Jungi Lee; Minho Lee; Gil-Jin Jang; | Applied Sciences | 2021-01-01 |
1388 | Multilingual Simultaneous Neural Machine Translation Abstract: Simultaneous machine translation (SIMT) involves translating source utterances to the target language in real-time before the speaker utterance completes. This paper proposes the … |
Philip Arthur; Dongwon Ryu; Gholamreza Haffari; | | 2021-01-01 |
1389 | Factors Behind The Effectiveness of An Unsupervised Neural Machine Translation System Between Korean and Japanese Abstract: Korean and Japanese have different writing scripts but share the same Subject-Object-Verb (SOV) word order. In this study, we pre-train a language-generation model using a Masked … |
Yong-Seok Choi; Yo-Han Park; Seung Yun; Sang-Hun Kim; Kong-Joo Lee; | Applied Sciences | 2021-01-01 |
1390 | Can A Corpus-driven Lexical Analysis of Human and Machine Translation Unveil Discourse Features That Set Them Apart? Abstract: There is still much to learn about the ways in which human and machine translation differ with regard to the contexts that regulate the production and interpretation of discourse. … |
Ana Frankenberg-Garcia; | Target. International Journal of Translation Studies | 2021-01-01 |
1391 | Text Generation and Enhanced Evaluation of Metric for Machine Translation |
Sujit S. Amin; Lata Ragha; | | 2021-01-01 |
1392 | The University of Edinburgh’s Submission to The IWSLT21 Simultaneous Translation Task Abstract: We describe our submission to the IWSLT 2021 shared task on simultaneous text-to-text English-German translation. Our system is based on the re-translation approach where the … |
Sukanta Sen; Ulrich Germann; B. Haddow; | International Workshop on Spoken Language Translation | 2021-01-01 |
1393 | Investigating Usability in Postediting Neural Machine Translation: Evidence from Translation Trainees’ Self-perception and Performance IF:3 Abstract: This is a report on an empirical study on the usability for translation trainees of neural machine translation systems when post-editing (mtpe). Sixty Chinese translation trainees … |
Xiangling Wang; Tingting Wang; Ricardo Muñoz Martín; Yanfang Jia; | Across Languages and Cultures | 2021-01-01 |
1394 | Search Query of English Translation Text Based on Embedded System and Big Data Abstract: Cross-Language Information Retrieval (CLIR) the purpose of another language (target language), a collection of documents written question from one language (source language). CLIR … |
Zhihong Li; | Microprocess. Microsystems | 2021-01-01 |
1395 | Considering Machine Translation (MT) As An Aid or A Threat to The Human Translator: Abstract: The present study aims to evaluate the output quality of an online MT; namely, Google Translate, from English into Persian and compare its output with the translations made by the … |
Hamidreza Abdi; | | 2021-01-01 |
1396 | Syntax-aware Transformers for Neural Machine Translation: The Case of Text to Sign Gloss Translation IF:3 Abstract: It is well-established that the preferred mode of communication of the deaf and hard of hearing (DHH) community are Sign Languages (SLs), but they are considered low resource … |
Santiago Egea Gómez; Euan McGill; Horacio Saggion; | Workshop on Building and Using Comparable Corpora | 2021-01-01 |
1397 | Research on Uyghur-Chinese Neural Machine Translation Based on The Transformer at Multistrategy Segmentation Granularity Abstract: In recent years, machine translation based on neural networks has become the mainstream method in the field of machine translation, but there are still challenges of insufficient … |
Zhiwang Xu; Huibin Qin; Yongzhu Hua; | Mob. Inf. Syst. | 2021-01-01 |
1398 | Speech Decoding As Machine Translation |
Joseph G. Makin; David A. Moses; Edward F. Chang; | SpringerBriefs in Electrical and Computer Engineering | 2021-01-01 |
1399 | Effective Bitext Extraction From Comparable Corpora Using A Combination of Three Different Approaches Abstract: Parallel sentences extracted from comparable corpora can be useful to supplement parallel corpora when training machine translation (MT) systems. This is even more prominent in … |
Steinþór Steingrímsson; Pintu Lohar; H. Loftsson; Andy Way; | BUCC | 2021-01-01 |
1400 | Guiding Neural Machine Translation with Retrieved Translation Template Abstract: While various neural machine translation (NMT) methods have integrated multiple prior knowledge to guide the translation, no research is available on combining with source-target … |
Wei Shang; Chong Feng; Tianfu Zhang; Da Xu; | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
1401 | Leveraging Machine Translation to Support Distributed Teamwork Between Language-Based Subgroups: The Effects of Automated Keyword Tagging Abstract: Modern teamwork often happens between subgroups located in different countries. Members of the same subgroup prefer to communicate in their native language for efficiency, which … |
YONGLE ZHANG et. al. | Extended Abstracts of the 2021 CHI Conference on Human … | 2021-01-01 |
1402 | ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task Abstract: This paper describes ANVITA-1.0 MT system, architected for submission to WAT2021 MultiIndicMT shared task by mcairt team, where the team participated in 20 translation directions: … |
Pavanpankaj Vegi; J. Sivabhavani; Biswajit Paul; Chitra Viswanathan; K. R. Prasanna Kumar; | | 2021-01-01 |
1403 | Optimal Word Segmentation for Neural Machine Translation Into Dravidian Languages Abstract: Dravidian languages, such as Kannada and Tamil, are notoriously difficult to translate by state-of-the-art neural models. This stems from the fact that these languages are … |
Prajit Dhar; Arianna Bisazza; Gertjan van Noord; | | 2021-01-01 |
1404 | IITP-MT at WAT2021: Indic-English Multilingual Neural Machine Translation Using Romanized Vocabulary Abstract: This paper describes the systems submitted to WAT 2021 MultiIndicMT shared task by IITP-MT team. We submit two multilingual Neural Machine Translation (NMT) systems … |
Ramakrishna Appicharla; Kamal Kumar Gupta; Asif Ekbal; Pushpak Bhattacharyya; | | 2021-01-01 |
1405 | Kannada to English Machine Translation Using Deep Neural Network IF:3 Abstract: In this paper, we focus on the unidirectional translation of Kannada text to English text using Neural Machine Translation … |
Pushpalatha Kadavigere Nagaraj; Kshamitha Shobha Ravikumar; Mydugolam Sreenivas Kasyap; Medhini Hullumakki Srinivas Murthy; Jithin Paul; | Ingénierie des Systèmes d Inf. | 2021-01-01 |
1406 | Synchronous Syntactic Attention for Transformer Neural Machine Translation Abstract: This paper proposes a novel attention mechanism for Transformer Neural Machine Translation, “Synchronous Syntactic Attention,” inspired by synchronous dependency grammars. The … |
Hiroyuki Deguchi; Akihiro Tamura; Takashi Ninomiya; | | 2021-01-01 |
1407 | Improving Transformer-Based Neural Machine Translation with Prior Alignments Abstract: Transformer is a neural machine translation model which revolutionizes machine translation. Compared with traditional statistical machine translation models and other neural … |
Thien Nguyen; Lam Nguyen; Phuoc Tran; Huu Nguyen; | Complex. | 2021-01-01 |
1408 | Improving Neural Machine Translation with Sentence Alignment Learning IF:3 Abstract: Neural machine translation (NMT) optimized by maximum likelihood estimation (MLE) usually lacks the guarantee of translation adequacy. To alleviate this problem, we propose an NMT … |
Xuewen Shi; Heyan Huang; Ping Jian; Yi-Kun Tang; | Neurocomputing | 2021-01-01 |
1409 | Automatic Evaluation of The Quality of Machine Translation of A Scientific Text: The Results of A Five-year-long Experiment Abstract: We report on various approaches to automatic evaluation of machine translation quality and describe three widely used methods. These methods, i.e. methods based on string matching … |
Ilya Ulitkin; Irina Filippova; Natalia Ivanova; Alexey Poroykov; | E3S Web of Conferences | 2021-01-01 |
1410 | Probing Multi-modal Machine Translation with Pre-trained Language Model Abstract: Multi-modal machine translation (MMT) aimed at using images to help disambiguate the target during translation and improving robustness, but some recent works showed that the … |
Yawei Kong; Kai Fan; | | 2021-01-01 |
1411 | MAIN DIFFICULTIES IN TRANSLATING CONTRACTUAL DOCUMENTATION (ENGLISH/RUSSIAN) Abstract: The article is devoted to one of the most common translation problems in the sphere of law, namely finding the adequate equivalents in vocabulary, especially it concerns foreign … |
Lilia Timofeeva; Maria Morozova; Tamara Potapova; | Journal of Teaching English for Specific and Academic … | 2021-01-01 |
1412 | Guwen-UNILM: Machine Translation Between Ancient and Modern Chinese Based on Pre-Trained Models |
Zinong Yang; Ke-Jia Chen; Jingqiang Chen; | | 2021-01-01 |
1413 | Adaptive Transformer for Multilingual Neural Machine Translation |
Junpeng Liu; Kaiyu Huang; Jiuyi Li; Huan Liu; Degen Huang; | | 2021-01-01 |
1414 | TMEKU System for The WAT2021 Multimodal Translation Task Abstract: We introduce our TMEKU system submitted to the English-Japanese Multimodal Translation Task for WAT 2021. We participated in the Flickr30kEnt-JP task and Ambiguous MSCOCO … |
Yuting Zhao; Mamoru Komachi; Tomoyuki Kajiwara; Chenhui Chu; | | 2021-01-01 |
1415 | Multilingual Machine Translation Systems at WAT 2021: One-to-Many and Many-to-One Transformer Based NMT Abstract: In this paper, we present the details of the systems that we have submitted for the WAT 2021 MultiIndicMT: An Indic Language Multilingual Task. We have submitted two separate … |
Shivam Mhaskar; Aditya Jain; Aakash Banerjee; Pushpak Bhattacharyya; | | 2021-01-01 |
1416 | TMU NMT System with Japanese BART for The Patent Task of WAT 2021 Abstract: In this paper, we introduce our TMU Neural Machine Translation (NMT) system submitted for the Patent task (Korean Japanese and English Japanese) of 8th Workshop on Asian … |
Hwichan Kim; Mamoru Komachi; | | 2021-01-01 |
1417 | Improved English to Hindi Multimodal Neural Machine Translation Abstract: Machine translation performs automatic translation from one natural language to another. Neural machine translation attains a state-of-the-art approach in machine translation, but … |
Sahinur Rahman Laskar; Abdullah Faiz Ur Rahman Khilji; Darsh Kaushik; Partha Pakray; Sivaji Bandyopadhyay; | | 2021-01-01 |
1418 | An Economic Model of Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the advent of free, online translation services such as Google Translate, many people are now able to obtain information relatively effortlessly from a wide variety of … |
Milam Aiken; Mina Park; | 2021-01-01 | |
1419 | Machine Translation from Text to Sign Language: A Systematic Review IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: An equal opportunity for all is the basic right of every human being. The deaf society of the world needs to have access to all the information just like hearing people do. For … |
Navroz Kaur Kahlon; Williamjeet Singh; | Universal Access in the Information Society | 2021-01-01 |
1420 | Sustainability of Translation As A Profession: Changing Roles of Translators in Light of The Developments in Machine Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View |
Caner ÇETİNER; | RumeliDE Dil ve Edebiyat Araştırmaları Dergisi | 2021-01-01 |
1421 | Strengthening Low-resource Neural Machine Translation Through Joint Learning: The Case of Farsi-Spanish Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes a systematic study of an approach to Farsi-Spanish low-resource Neural Machine Translation (NMT) that leverages monolingual data for joint learning of forward … |
Benyamin Ahmadnia; Raúl Aranovich; Bonnie J. Dorr; | 2021-01-01 | |
1422 | METHOD OF SYSTEM ENGINEERING OF NEURAL MACHINE TRANSLATION SYSTEMS Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Background. There are not many machine translation companies on the market whose products are in demand. These are, for example, free and commercial products such as … |
Pavlo P. Maslianko; Yevhenii P. Sielskyi; | KPI Science News | 2021-01-01 |
1423 | Semantic Morphological Variant Selection and Translation Disambiguation for Cross-lingual Information Retrieval Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Cross-Lingual Information Retrieval (CLIR) enables a user to query in a language which is different from the target documents language. CLIR incorporates a translation technique … |
Vijay Kumar Sharma; Namita Mittal; Ankit Vidyarthi; | Multimedia Tools and Applications | 2021-01-01 |
1424 | Build Italian-Chinese Parallel Sentence Corpus to Implement Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The cooperation in infrastructure, economics between China and Italy is deepen and cultural exchanges are more closely related, thus the demand for Italian-Chinese translation … |
Wuying Liu; Lin Bai; Randie Yi; Han Wu; | Advances in Natural Computation, Fuzzy Systems and … | 2021-01-01 |
1425 | Improving Neural Machine Translation with Latent Features Feedback Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most state-of-the-art neural machine translation (NMT) models progressively encode feature representation in a bottom-up feed-forward fashion. This traditional encoding mechanism … |
Yachao Li; Junhui Li; Min Zhang; | Neurocomputing | 2021-01-01 |
1426 | Pipeline Signed Japanese Translation Focusing on A Post-positional Particle Complement and Conjugation in A Low-resource Setting Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Because sign language is a visual language, the translation of it into spoken language is typically performed through an intermediate representation called gloss notation. In sign … |
Ken Yano; Akira Utsumi; | 2021-01-01 | |
1427 | Machine Translation and Postediting in The Didactics of Translation and Interpreting IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View |
Diana González Pastor; Celia Rico; | Revista Digital de Investigación en Docencia Universitaria | 2021-01-01 |
1428 | Preordering Encoding on Transformer for Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The difference in word orders between source and target languages is a serious hurdle for machine translation. Preordering methods, which reorder the words in a source sentence … |
Yuki Kawara; Chenhui Chu; Yuki Arase; | IEEE/ACM Transactions on Audio, Speech, and Language … | 2021-01-01 |
1429 | Highland Puebla Nahuatl Speech Translation Corpus for Endangered Language Documentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Documentation of endangered languages (ELs) has become increasingly urgent as thousands of languages are on the verge of disappearing by the end of the 21st century. One … |
JIATONG SHI et. al. | 2021-01-01 | |
1430 | Exploring Cross-Lingual Transfer Learning with Unsupervised Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In Natural Language Understanding (NLU), to facilitate Cross-Lingual Transfer Learning (CLTL), especially CLTL between distant languages, we integrate CLTL with Machine … |
Chao Wang; Judith Gaspers; Quynh Ngoc Thi Do; Hui Jiang; | 2021-01-01 | |
1431 | IITP-MT at CALCS2021: English to Hinglish Neural Machine Translation Using Unsupervised Synthetic Code-Mixed Parallel Corpus Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the system submitted by IITP-MT team to Computational Approaches to Linguistic Code-Switching (CALCS 2021) shared task on MT for English→Hinglish. We submit a … |
Ramakrishna Appicharla; Kamal Kumar Gupta; Asif Ekbal; Pushpak Bhattacharyya; | 2021-01-01 | |
1432 | MACHINE TRANSLATION AND ITS PRINCIPLES OF CLASSIFICATION Related Papers Related Patents Related Grants Related Venues Related Experts View |
Natallia Pushyk; | 2021-01-01 | |
1433 | Arabic Machine Translation: A Survey with Challenges and Future Directions Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, computer language area has witnessed important evolvement with applications in different domains. Machine Translation (MT) technology, considered as a subfield, has … |
Jezia Zakraoui; M. Saleh; S. Al-Maadeed; J. Jaam; | IEEE Access | 2021-01-01 |
1434 | Philipp Koehn: Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation (NMT) is an approach to machine translation (MT) that uses deep learning techniques, a broad area of machine learning based on deep artificial neural … |
Wandri Jooste; Rejwanul Haque; Andy Way; | Machine Translation | 2021-01-01 |
1435 | OPUS-CAT: Desktop NMT with CAT Integration and Local Fine-tuning Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: OPUS-CAT is a collection of software which enables translators to use neural machine translation in computer-assisted translation tools without exposing themselves to security and … |
Tommi Nieminen; | 2021-01-01 | |
1436 | Translate and Classify: Improving Sequence Level Classification for English-Hindi Code-Mixed Data Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Code-mixing is a common phenomenon in multilingual societies around the world and is especially common in social media texts. Traditional NLP systems, usually trained on … |
Devansh Gautam; Kshitij Gupta; Manish Shrivastava; | 2021-01-01 | |
1437 | Gated Convolutional Sequence to Sequence Based Learning for English-Hingilsh Code-Switched Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Code-Switching is the embedding of linguistic units or phrases from two or more languages in a single sentence. This phenomenon is practiced in all multilingual communities and is … |
Suman Dowlagar; Radhika Mamidi; | 2021-01-01 | |
1438 | Post-editing Machine Translation Experiments Related Papers Related Patents Related Grants Related Venues Related Experts View |
I. V. Hotsuliak; | 2021-01-01 | |
1439 | Text Recognition Technology for Natural Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Text recognition technology is of great values for scene image translation, machine translation, license plate recognition and other fields. Aiming at the text recognition … |
DANDAN WU et. al. | 2021 4th International Conference on Data Science and … | 2021-01-01 |
1440 | The Usefulness of Bibles in Low-Resource Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Bibles are available in a wide range of languages, which provides valuable parallel text between languages since verses can be aligned accurately between all the different … |
Ling Liu; Zach Ryan; Mans Hulden; | Proceedings of the Workshop on Computational Methods for … | 2021-01-01 |
1441 | Machine Translation System Using Deep Learning for Punjabi to English Related Papers Related Patents Related Grants Related Venues Related Experts View |
Kamal Deep; Ajit Kumar; Vishal Goyal; | 2021-01-01 | |
1442 | Example-Based Hybrid Higher-Order Neural Network Cognition Applied for Archive Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper constructs the basic principles and system structure of example-based hybrid higher-order neural network cognition for machine translation. On this basis, the paper … |
Lilan Chen; Yongsheng Chen; | Advances in Intelligent Automation and Soft Computing | 2021-01-01 |
1443 | MENYO-20k: A Multi-domain English-Yorùbá Corpus for Machine Translation and Domain Adaptation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Massively multilingual machine translation (MT) has shown impressive capabilities, including zero and few-shot translation between low-resource language pairs. However, these … |
DAVID I. ADELANI et. al. | ArXiv | 2021-01-01 |
1444 | Incorporating Relative Position Information in Transformer-Based Sign Language Recognition and Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recent advancements in machine translation tasks, with the advent of attention mechanisms and Transformer networks, have accelerated the research in Sign Language Translation … |
Neena Aloysius; Geetha M; Prema Nedungadi; | IEEE Access | 2021-01-01 |
1445 | Research on The Application of BERT in Mongolian-Chinese Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, the research of neural networks has brought new solutions to machine translation. The application of sequence-to-sequence model has made a qualitative leap in the … |
Xiu Zhi; Siriguleng Wang; | 2021 13th International Conference on Machine Learning and … | 2021-01-01 |
1446 | Research on Military Text Machine Translation Based on Deep Neural Network Related Papers Related Patents Related Grants Related Venues Related Experts View |
Xiangwei Liu; Liang Tang; Xin Ma; Jiang Hu; | Advances in Intelligent Automation and Soft Computing | 2021-01-01 |
1447 | Analysis of Errors in Machine Translation from Roger T. Bell’s Translation Process Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: There are enough literature reviews about machine translation, but the numbers of texts studied are not large enough, and there are very limited varieties of machine translation … |
Jianbin Zhu; Min Zhang; | 2021-01-01 | |
1448 | Corpora Compilation for Prosody-informed Speech Processing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Research on speech technologies necessitates spoken data, which is usually obtained through read recorded speech, and specifically adapted to the research needs. When the aim is … |
Alp Öktem; Mireia Farrús; Antonio Bonafonte; | Lang. Resour. Evaluation | 2021-01-01 |
1449 | Bilingual Machine Translation: Bengali to English Related Papers Related Patents Related Grants Related Venues Related Experts View |
Sauvik Bal; Supriyo Mahanta; Lopa Mandal; | 2021-01-01 | |
1450 | Low Resource Neural Machine Translation from English to Khasi: A Transformer-Based Approach IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View |
N. Donald Jefferson Thabah; Bipul Syam Purkayastha; | 2021-01-01 | |
1451 | Domain Adaptation for Hindi-Telugu Machine Translation Using Domain Specific Back Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we present a novel approach for domain adaptation in Neural Machine Translation which aims to improve the translation quality over a new domain. Adapting new domains is … |
Hema Ala; Vandan Mujadia; Dipti Misra Sharma; | 2021-01-01 | |
1452 | SocialSciTerm: An English-Chinese Parallel Term Resource for Collaborative Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Bilingual term resources are helpful on collaborative translation tasks. Firstly, we build an English-Chinese parallel term resource of social sciences (SocialSciTerm) based on … |
Chenxi Zhu; Lin Wang; Wuying Liu; | Advances in Natural Computation, Fuzzy Systems and … | 2021-01-01 |
1453 | Cross-lingual Text Similarity Exploiting Neural Machine Translation Models IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article studies cross-lingual text similarity using neural machine translation models. A straightforward approach based on machine translation is to use translated text so as … |
Kazuhiro Seki; | Journal of Information Science | 2021-01-01 |
1454 | Why Find The Right One? Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The present paper investigates the impact of the anaphoric one words in English on the Neural Machine Translation (NMT) process using English-Hindi as source and target language … |
Payal Khullar; | 2021-01-01 | |
1455 | Low Resource Machine Translation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Shriphani Palakodety; Ashiqur R. KhudaBukhsh; Guha Jayachandran; | Low Resource Social Media Text Mining | 2021-01-01 |
1456 | Dual Knowledge Distillation for Bidirectional Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Building strong and robust neural machine translation systems needs large amount of high-quality parallel corpora. However, most of language pairs are limited in quantity, … |
Huaao Zhang; Shigui Qiu; Shilong Wu; | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
1457 | Optical Character Recognition and Neural Machine Translation Using Deep Learning Techniques Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Over the years, the applications of text detection and text translation have expanded across various fields. Many researchers have used several deep learning algorithms for text … |
K. Chandra Shekar; Maria Anisha Cross; Vignesh Vasudevan; | Innovations in Computer Science and Engineering | 2021-01-01 |
1458 | Development of English-to-Bengali Neural Machine Translation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View |
Anwesha Das; Thoudam Doren Singh; | 2021-01-01 | |
1459 | Towards Personalised and Document-level Machine Translation of Dialogue Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: State-of-the-art (SOTA) neural machine translation (NMT) systems translate texts at sentence level, ignoring context: intra-textual information, like the previous sentence, and … |
Sebastian T. Vincent; | 2021-01-01 | |
1460 | Beyond Grammatical Error Correction: Improving L1-influenced Research Writing in English Using Pre-trained Encoder-decoder Models Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: In this paper, we present a new method for training a writing improvement model adapted to the writer’s first language (L1) that goes beyond grammatical error correction (GEC). … |
G. Zomer; A. Frankenberg-Garcia; | Conference on Empirical Methods in Natural Language … | 2021-01-01 |
1461 | Transformer-Based Direct Speech-To-Speech Translation with Transcoder IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Traditional speech translation systems use a cascade manner that concatenates speech recognition (ASR), machine translation (MT), and text-to-speech (TTS) synthesis to translate … |
Takatomo Kano; Sakriani Sakti; Satoshi Nakamura; | 2021 IEEE Spoken Language Technology Workshop (SLT) | 2021-01-01 |
1462 | A2R2: Robust Unsupervised Neural Machine Translation With Adversarial Attack and Regularization on Representations Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Unsupervised neural machine translation (UNMT) has recently achieved significant progress without requirement on any parallel data. The models for UNMT are typically the … |
Heng Yu; Haoran Luo; Yuqi Yi; Fan Cheng; | IEEE Access | 2021-01-01 |
1463 | Transformer-IC: The Solution to Information Loss Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the development of information technology, machine translation technologies play a crucial role in cross-language communication. However, there is a problem of information … |
Zhigang Song; Jiazhao Chai; Wenqian Shang; Guo Yuning; | 2021 IEEE/ACIS 19th International Conference on Computer … | 2021-01-01 |
1464 | Equivalence Levels of Literary Corpus Translation Using A Freeware Analysis Toolkit Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation has the potential to make huge contributions to translation industries, but it seems, for now, that machine translation equivalence has led to a crucial point … |
Frans Sayogie; Moh. Supardi; | 2021-01-01 | |
1465 | Phrase Based Statistical Machine Translation Javanese-Indonesian Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This research aims to produce a statistical machine translation that can be implemented to perform Javanese-Indonesian translation and to know the influence of the main data … |
Aufa Eka Putri Lesatari; Arie Ardiyanti; Ibnu Asror; | 2021-01-01 | |
1466 | On Machine Translation of User Reviews Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This work investigates neural machine translation (NMT) systems for translating English user reviews into Croatian and Serbian, two similar morphologically complex languages. Two … |
Maja Popović; Andy Way; Alberto Poncelas; Marija Brkić Bakarić; | 2021-01-01 | |
1467 | Common Lexical Errors Made By Machine Translation On Cultural Text Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation is one tool of Google that presents various languages to translate. As a translator machine, the results of Google Translate are not always perfectly correct. … |
Nanda Fitri Mar’athus Sholikhah; | 2021-01-01 | |
1468 | Reinforced NMT for Sentiment and Content Preservation in Low-resource Scenario Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The preservation of domain knowledge from source to the target is crucial in any translation workflows. Hence, translation service providers that use machine translation (MT) in … |
Divya Kumari; Asif Ekbal; Rejwanul Haque; Pushpak Bhattacharyya; Andy Way; | Transactions on Asian and Low-Resource Language Information … | 2021-01-01 |
1469 | Multilingual Sentiment Analysis: A Systematic Literature Review IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the explosive growth of social media, the online community can freely express their opinions without disclosing their identities. People with hidden agendas can easily post … |
Nur Atiqah Sia Abdullah; Nur Ida Aniza Rusli; | Pertanika Journal of Science and Technology | 2021-01-01 |
1470 | Application of Translation Technologies in The Translation of IMTFE Transcripts Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation has grown very rapidly in recent times due to the developments in big data, artificial intelligence, and cloud computing software and techniques. The first … |
Danhua Huang; | English Language and Literature Studies | 2021-01-01 |
1471 | Towards Precise Lexicon Integration in Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Terminological consistency is an essential requirement for industrial translation. High-quality, hand-crafted terminologies contain entries in their nominal forms. Integrating … |
Ogün Öz; Maria Sukhareva; | 2021-01-01 | |
1472 | Moses and The Character-Based Random Babbling Baseline: CoAStaL at AmericasNLP 2021 Shared Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We evaluated a range of neural machine translation techniques developed specifically for low-resource scenarios. Unsuccessfully. In the end, we submitted two runs: (i) a standard … |
MARCEL BOLLMANN et. al. | 2021-01-01 | |
1473 | A Study on Arabic-Korean Machine Translation: -Focusing on The Non-Literary Text- Related Papers Related Patents Related Grants Related Venues Related Experts View |
Gwag, Soon-Lei; | 2021-01-01 | |
1474 | NRC-CNRC Machine Translation Systems for The 2021 AmericasNLP Shared Task Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We describe the NRC-CNRC systems submitted to the AmericasNLP shared task on machine translation. We submitted systems translating from Spanish into Wixárika, Nahuatl, Rarámuri, … |
Rebecca Knowles; Darlene Stewart; Samuel Larkin; Patrick Littell; | 2021-01-01 | |
1475 | Open Machine Translation for Low Resource South American Languages (AmericasNLP 2021 Shared Task Contribution) Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes the team (“Tamalli”)’s submission to AmericasNLP2021 shared task on Open Machine Translation for low resource South American languages. Our goal was to … |
SHANTIPRIYA PARIDA et. al. | 2021-01-01 | |
1476 | Low-Resource Machine Translation Using Cross-Lingual Language Model Pretraining Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper describes UTokyo’s submission to the AmericasNLP 2021 Shared Task on machine translation systems for indigenous languages of the Americas. We present a low-resource … |
Francis Zheng; Machel Reid; Edison Marrese-Taylor; Yutaka Matsuo; | 2021-01-01 | |
1477 | Findings of The AmericasNLP 2021 Shared Task on Open Machine Translation for Indigenous Languages of The Americas Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents the results of the 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas. The shared task featured two independent tracks, and … |
MANUEL MAGER et. al. | 2021-01-01 | |
1478 | Peru Is Multilingual, Its Machine Translation Should Be Too? Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Peru is a multilingual country with a long history of contact between the indigenous languages and Spanish. Taking advantage of this context for machine translation is possible … |
Arturo Oncevay; | 2021-01-01 | |
1479 | Source Side Pre-ordering Using Recurrent Neural Networks for English-Myanmar Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Word reordering has remained one of the challenging problems for machine translation when translating between language pairs with different word orders e.g. English and Myanmar. … |
May Kyi Nyein; Khin Mar Soe; | International Journal of Electrical and Computer Engineering | 2021-01-01 |
1480 | Should We Find Another Model?: Improving Neural Machine Translation Performance with ONE-Piece Tokenization Method Without Model Modification Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most of the recent Natural Language Processing(NLP) studies are based on the Pretrain-Finetuning Approach (PFA), but in small and medium-sized enterprises or companies with … |
Chanjun Park; Sugyeong Eo; Hyeonseok Moon; Heuiseok Lim; | 2021-01-01 | |
1481 | Impact of Computer-assisted Translation Tools By Novice Translators on The Quality of Written Translations Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The main specificity of the modern translation market is the translation of large volumes of technical texts and business documents in the shortest time possible. The purpose of … |
Zulfiya Akhatovna Usmanova; Ekaterina Nikolayevna Zudilova; Pavel Alekseevich Arkatov; Nataliaya Grigorievna Vitkovskaya; Ekaterina Vladimirovna Kravets; | LAPLAGE EM REVISTA | 2021-01-01 |
1482 | MACHINE TRANSLATION TECHNIQUE Related Papers Related Patents Related Grants Related Venues Related Experts View |
N. Pushyk; V. Horda; | International Humanitarian University Herald. Philology | 2021-01-01 |
1483 | Machine Learning Differences in Machine Translation of Urban Publicity Texts Related Papers Related Patents Related Grants Related Venues Related Experts View |
Zhang Hong; | 2021-01-01 | |
1484 | Not All Contexts Are Important: The Impact of Effective Context in Conversational Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Multilingual chat systems are the need of the hour for organizations who render online conversational services to their customers. This can effectively be facilitated using the … |
Baban Gain; Rejwanul Haque; Asif Ekbal; | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
1485 | ArabiaNer: A System to Extract Named Entities from Arabic Content Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The extraction of named entities from unstructured text is a crucial component in numerous Natural Language Processing (NLP) applications such as information retrieval, question … |
Mohammad Hudhud; Hamed Abdelhaq; Fadi Mohsen; | 2021-01-01 | |
1486 | THE TRANSLATION OF ENTERTAINMENT NEWS FROM ENGLISH TO INDONESIAN WITH MACHINE TRANSLATION Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: News has been spread internationally since it was digitalized. This situation makes machine translation used as a tool to solve the language barrier problem, as it is cheap and … |
Sasqia Asmawari Putri; Haru Deliana Dewi; | 2021-01-01 | |
1487 | Errors of Machine Translation of Terminology in The Patent Text from English Into Chinese IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper summarizes eight types of error of terminology in the patent text in the output of Machine Translation from English into Chinese, including term being mistranslated as … |
Ying Cheng; Shuyu Yue; Jing Li; Lin Deng; Qi Quan; | 2021-01-01 | |
1488 | Kashmiri to English Machine Translation: A Study in Translation Divergence Issues of Personal and Possessive Pronouns Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine translation (MT) as a sub-field of computational linguistics represents one of the most advanced and applied translation dimensions as a research field. Translation … |
Sajad Hussain Wani; | 2021-01-01 | |
1489 | Research on The Application of Artificial Intelligence in Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Under the influence of the development of big data and cloud computing technology, machine translation based on artificial intelligence has gradually entered people’s lives. … |
Wang LingZhi; | 2021-01-01 | |
1490 | OCR and Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This lesson covers how to convert images of text into text files and translate those text files. The lesson will also cover how to organize and edit images to make the conversion … |
Andrew Akhlaghi; | The Programming Historian | 2021-01-01 |
1491 | Artificial Intelligence: Machine Translation Accuracy in Translating French-Indonesian Culinary Texts Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The use of machine translation as artificial intelligence (AI) keeps increasing and the world’s most popular translation tool is Google Translate (GT). This tool is not merely … |
Muhammad Hasyim; Firman Saleh; Rudy Yusuf; Asriani Abbas; | SSRN Electronic Journal | 2021-01-01 |
1492 | Post-editing Neural Machine Translation Versus Human Translation for Chinese Essays: A Pilot Study Related Papers Related Patents Related Grants Related Venues Related Experts View |
Shengfang Zhao; | 2021-01-01 | |
1493 | Gender Bias in Machine Translation: An Analysis of Google Translate in English and Spanish Related Papers Related Patents Related Grants Related Venues Related Experts View |
Maria Lopez Medel; | Academia Letters | 2021-01-01 |
1494 | Enhancing Language Generation with Effective Checkpoints of Pre-trained Language Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This work empirically explores effective exploiting of intermediate output from pretrained language models (PrLMs) for language generation tasks. For this purpose, we propose an … |
Jeonghyeok Park; Hai Zhao; | 2021-01-01 | |
1495 | MACHINE TRANSLATION: TEACHING AND LEARNING ISSUES Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Considering the boost in technological development, and that machine translation has been widely used by both industry and academia, the main goal of this paper is to describe … |
Marileide Dias Esqueda; | 2021-01-01 | |
1496 | Monolingual Corpus Driven Vietnamese-Chinese Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation (NMT) usually requires a massive parallel corpus of high quality as training data, the lack of which limits the performance of the NMT model for some … |
Lin Wang; Zhaoxuan Li; Hongyan Zhang; Wuying Liu; | Advances in Natural Computation, Fuzzy Systems and … | 2021-01-01 |
1497 | Two Parents, One Child: Dual Transfer for Low-Resource Neural Machine Translation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Neural machine translation suffers when parallel data for training is scarce. Previous works have explored transfer learning to assist training in low-resource scenarios. However, … |
Meng Zhang; Liangyou Li; Qun Liu; | 2021-01-01 | |
1498 | Interpretation and Machine Translation Towards Google Translate As A Part of Machine Translation and Teaching Translation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Language comprehension is the capacity of someone to properly understand the language to fully communicate the message and details. When dialects are distinct, the problem arises. … |
Vichard L. Kane; | Applied Translation | 2021-01-01 |
1499 | Characteristics Recognition and Soft Multimedia System for Japanese Machine Translation and Edge-Driven Hardware Implementations Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the development of economic globalization, international exchanges and cooperation are increasingly frequent and in-depth. In this process, there is a serious obstacle, that … |
Gang Song; | 2021-01-01 | |
1500 | Natural Language Processing: Components, Advances, Tools and Industrial Applications Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Natural Language Processing is the study that focuses the interplay between computer and the human languages NLP has spread its applications in various fields such as an … |
Hima Yeldo; | International Journal for Research in Applied Science and … | 2021-01-01 |