Paper Digest: Recent Papers on Machine Translation
The Paper Digest Team extracted all recent Machine Translation related papers on our radar and generated highlight sentences for them. The results are sorted by relevance and date. In addition to this ‘static’ page, we also provide a real-time version of this article, which has broader coverage and is continuously updated with the most recent papers on this topic. Based in New York, Paper Digest is dedicated to producing high-quality text analysis results that people can actually use on a daily basis. Since 2018, we have been serving users across the world with a number of exclusive services on ranking, search, tracking and automatic literature review.
If you do not want to miss interesting academic papers, you are welcome to sign up for our free daily paper digest service to get updates on new papers published in your area every day. You are also welcome to follow us on Twitter and LinkedIn to stay up to date on new conference digests.
Paper Digest Team
New York City, New York, 10017
team@paperdigest.org
TABLE 1: Paper Digest: Recent Papers on Machine Translation
# | Paper | Author(s) | Source | Date |
---|---|---|---|---|
1 | On The Impact of Noises in Crowd-Sourced Data for Speech Translation Highlight: What are the impacts of these data quality issues for model development and evaluation? In this paper, we propose an automatic method to fix or filter the above quality issues, using English-German (En-De) translation as an example. | Siqi Ouyang; Rong Ye; Lei Li; | arxiv-cs.CL | 2022-06-28 |
2 | Towards Unsupervised Content Disentanglement in Sentence Representations Via Syntactic Roles Highlight: The probabilistic model we propose is an Attention-Driven Variational Autoencoder (ADVAE). | Ghazi Felhi; Joseph Le Roux; Djamé Seddah; | arxiv-cs.CL | 2022-06-22 |
3 | Comparing Formulaic Language in Human and Machine Translation: Insight from A Parliamentary Corpus Highlight: A recent study has shown that, compared to human translations, neural machine translations contain more strongly-associated formulaic sequences made of relatively high-frequency words, but far less strongly-associated formulaic sequences made of relatively rare words. | Yves Bestgen; | arxiv-cs.CL | 2022-06-22 |
4 | Scaling Autoregressive Models for Content-Rich Text-to-Image Generation Highlight: We present the Pathways Autoregressive Text-to-Image (Parti) model, which generates high-fidelity photorealistic images and supports content-rich synthesis involving complex compositions and world knowledge. | JIAHUI YU et al. | arxiv-cs.CV | 2022-06-21 |
5 | The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task Highlight: This paper describes the submission of our end-to-end YiTrans speech translation system for the IWSLT 2022 offline task, which translates from English audio to German, Chinese, and Japanese. | ZIQIANG ZHANG et al. | arxiv-cs.CL | 2022-06-12 |
6 | A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation Highlight: To lower the bar of use and make it more practical in commercial, we propose a novel Chinese dialect TTS frontend with a translation module. | Wudi Bao; Junhui Zhang; Junjie Pan; Xiang Yin; Zejun Ma; | arxiv-cs.CL | 2022-06-10 |
7 | LegoNN: Building Modular Encoder-Decoder Models Highlight: To achieve reusability, the interface between each encoder and decoder modules is grounded to a sequence of marginal distributions over a discrete vocabulary pre-defined by the model designer. We present two approaches for ingesting these marginals; one is differentiable, allowing the flow of gradients across the entire network, and the other is gradient-isolating. | SIDDHARTH DALMIA et al. | arxiv-cs.CL | 2022-06-07 |
8 | VALHALLA: Visual Hallucination for Machine Translation Highlight: In this paper, we introduce a visual hallucination framework, called VALHALLA, which requires only source sentences at inference time and instead uses hallucinated visual representations for multimodal machine translation. | YI LI et al. | cvpr | 2022-06-07 |
9 | MorisienMT: A Dataset for Mauritian Creole Machine Translation Highlight: In this paper, we describe MorisienMT, a dataset for benchmarking machine translation quality of Mauritian Creole. | Raj Dabre; Aneerav Sukhoo; | arxiv-cs.CL | 2022-06-06 |
10 | Finetuning A Kalaallisut-English Machine Translation System Using Web-crawled Data Highlight: Here, we attempt to finetune a pretrained Kalaallisut-to-English neural machine translation (NMT) system using web-crawled pseudoparallel sentences from around 30 multilingual websites. | Alex Jones; | arxiv-cs.CL | 2022-06-05 |
11 | Findings of The The RuATD Shared Task 2022 on Artificial Text Detection in Russian Highlight: We present the shared task on artificial text detection in Russian, which is organized as a part of the Dialogue Evaluation initiative, held in 2022. | TATIANA SHAMARDINA et al. | arxiv-cs.CL | 2022-06-03 |
12 | Exploring Diversity in Back Translation for Low-Resource Machine Translation Highlight: This work puts forward a more nuanced framework for understanding diversity in training data, splitting it into lexical diversity and syntactic diversity. We present novel metrics for measuring these different aspects of diversity and carry out empirical analysis into the effect of these types of diversity on final neural machine translation model performance for low-resource English↔Turkish and mid-resource English↔Icelandic. | Laurie Burchell; Alexandra Birch; Kenneth Heafield; | arxiv-cs.CL | 2022-06-01 |
13 | NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages Highlight: In this work, we focus on developing resources for languages in Indonesia. | GENTA INDRA WINATA et al. | arxiv-cs.CL | 2022-05-31 |
14 | Refining Low-Resource Unsupervised Translation By Language Disentanglement of Multilingual Model Highlight: In this work, we propose a simple refinement procedure to disentangle languages from a pre-trained multilingual UMT model for it to focus on only the target low-resource task. | Xuan-Phi Nguyen; Shafiq Joty; Wu Kui; Ai Ti Aw; | arxiv-cs.CL | 2022-05-31 |
15 | Preparing An Endangered Language for The Digital Age: The Case of Judeo-Spanish Highlight: For text-to-speech synthesis, we present a 3.5 hour single speaker speech corpus for building a neural speech synthesis engine. | Alp Öktem; Rodolfo Zevallos; Yasmin Moslem; Güneş Öztürk; Karen Şarhon; | arxiv-cs.CL | 2022-05-31 |
16 | VALHALLA: Visual Hallucination for Machine Translation Highlight: In this paper, we introduce a visual hallucination framework, called VALHALLA, which requires only source sentences at inference time and instead uses hallucinated visual representations for multimodal machine translation. | YI LI et al. | arxiv-cs.CV | 2022-05-31 |
17 | Can Transformer Be Too Compositional? Analysing Idiom Processing in Neural Machine Translation Highlight: In this work, we investigate whether the non-compositionality of idioms is reflected in the mechanics of the dominant NMT model, Transformer, by analysing the hidden states and attention patterns for models with English as source language and one of seven European languages as target language. | Verna Dankers; Christopher G. Lucas; Ivan Titov; | arxiv-cs.CL | 2022-05-30 |
18 | X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents Highlight: In this paper, we fill this research gap and present an abstractive cross-lingual summarization dataset for four different languages in the scholarly domain, which enables us to train and evaluate models that process English papers and generate summaries in German, Italian, Chinese and Japanese. | Sotaro Takeshita; Tommaso Green; Niklas Friedrich; Kai Eckert; Simone Paolo Ponzetto; | arxiv-cs.CL | 2022-05-30 |
19 | CoNT: Contrastive Neural Text Generation Highlight: In this paper, we analyse the underlying reasons and propose a new Contrastive Neural Text generation framework, CoNT. | CHENXIN AN et al. | arxiv-cs.CL | 2022-05-29 |
20 | BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset Highlight: While most of the research attention is given to the English language in a monolingual setting, resource-constrained languages like Bangla remain out of focus, predominantly due to a lack of standard datasets. Addressing this issue, we present a new dataset BAN-Cap following the widely used Flickr8k dataset, where we collect Bangla captions of the images provided by qualified annotators. | Mohammad Faiyaz Khan; S. M. Sadiq-Ur-Rahman Shifath; Md Saiful Islam; | arxiv-cs.CL | 2022-05-28 |
21 | Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation Highlight: In this paper, we explore the challenging problem of performing a generative task (i.e., summarization) in a target language when labeled data is only available in English. | TU VU et al. | arxiv-cs.CL | 2022-05-25 |
22 | Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation Highlight: In this paper, we investigate data augmentation techniques for synthesizing Dialectal Arabic-English CS text. | Injy Hamed; Nizar Habash; Slim Abdennadher; Ngoc Thang Vu; | arxiv-cs.CL | 2022-05-25 |
23 | DivEMT: Neural Machine Translation Post-Editing Effort Across Typologically Diverse Languages Highlight: We introduce DivEMT, the first publicly available post-editing study of Neural Machine Translation (NMT) over a typologically diverse set of target languages. | Gabriele Sarti; Arianna Bisazza; Ana Guerberof Arenas; Antonio Toral; | arxiv-cs.CL | 2022-05-24 |
24 | T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation Highlight: We present a new approach to perform zero-shot cross-modal transfer between speech and text for translation tasks. | Paul-Ambroise Duquenne; Hongyu Gong; Benoît Sagot; Holger Schwenk; | arxiv-cs.CL | 2022-05-24 |
25 | Translating Hanja Historical Documents to Understandable Korean and English Highlight: Thus, we propose H2KE, the neural machine translation model that translates Hanja historical documents to understandable Korean and English. | JUHEE SON et al. | arxiv-cs.CL | 2022-05-20 |
26 | PreQuEL: Quality Estimation of Machine Translation Outputs in Advance Highlight: We present the task of PreQuEL, Pre-(Quality-Estimation) Learning. | Shachar Don-Yehiya; Leshem Choshen; Omri Abend; | arxiv-cs.CL | 2022-05-18 |
27 | Data Augmentation to Address Out-of-Vocabulary Problem in Low-Resource Sinhala-English Neural Machine Translation Highlight: We present a word and phrase replacement-based DA technique that consider both types of OOV, by augmenting (1) rare words in the existing parallel corpus, and (2) new words from a bilingual dictionary. | Aloka Fernando; Surangika Ranathunga; | arxiv-cs.CL | 2022-05-18 |
28 | Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation Highlight: In this paper, we report our recent achievements in S2ST. | QIANQIAN DONG et al. | arxiv-cs.CL | 2022-05-18 |
29 | From Simultaneous to Streaming Machine Translation By Leveraging Streaming History Highlight: This work proposes a stream-level adaptation of the current latency measures based on a re-segmentation approach applied to the output translation, that is successfully evaluated on streaming conditions for a reference IWSLT task. | Javier Iranzo Sanchez; Jorge Civera; Alfons Juan-Císcar; | acl | 2022-05-17 |
30 | Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation Highlight: While one possible solution is to directly take target contexts into these statistical metrics, the target-context-aware statistical computing is extremely expensive, and the corresponding storage overhead is unrealistic. To solve the above issues, we propose a target-context-aware metric, named conditional bilingual mutual information (CBMI), which makes it feasible to supplement target context information for statistical metrics. | SONGMING ZHANG et al. | acl | 2022-05-17 |
31 | Redistributing Low-Frequency Words: Making The Most of Monolingual Data in Non-Autoregressive Translation Highlight: In this work, we provide an appealing alternative for NAT – monolingual KD, which trains NAT student on external monolingual data with AT teacher trained on the original bilingual data. | Liang Ding; Longyue Wang; Shuming Shi; Dacheng Tao; Zhaopeng Tu; | acl | 2022-05-17 |
32 | Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation Highlight: In this paper, we introduce multilingual crossover encoder-decoder (mXEncDec) to fuse language pairs at an instance level. | YONG CHENG et al. | acl | 2022-05-17 |
33 | Geographical Distance Is The New Hyperparameter: A Case Study Of Finding The Optimal Pre-trained Language For English-isiZulu Machine Translation Highlight: This study explores the potential benefits of transfer learning in an English-isiZulu translation framework. | Muhammad Umair Nasir; Innocent Amos Mchechesi; | arxiv-cs.CL | 2022-05-17 |
34 | An Imitation Learning Curriculum for Text Editing with Non-Autoregressive Models Highlight: We propose a framework for training non-autoregressive sequence-to-sequence models for editing tasks, where the original input sequence is iteratively edited to produce the output. | Sweta Agrawal; Marine Carpuat; | acl | 2022-05-17 |
35 | Scheduled Multi-task Learning for Neural Chat Translation Highlight: Although the NCT models have achieved impressive success, it is still far from satisfactory due to insufficient chat translation data and simple joint training manners. To address the above issues, we propose a scheduled multi-task learning framework for NCT. | Yunlong Liang; Fandong Meng; Jinan Xu; Yufeng Chen; Jie Zhou; | acl | 2022-05-17 |
36 | Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation Highlight: This paper proposes an adaptive segmentation policy for end-to-end ST. Inspired by human interpreters, the policy learns to segment the source streaming speech into meaningful units by considering both acoustic features and translation history, maintaining consistency between the segmentation and translation. | Ruiqing Zhang; Zhongjun He; Hua Wu; Haifeng Wang; | acl | 2022-05-17 |
37 | Measuring and Mitigating Name Biases in Neural Machine Translation Highlight: In this paper we describe a new source of bias prevalent in NMT systems, relating to translations of sentences containing person names. | Jun Wang; Benjamin Rubinstein; Trevor Cohn; | acl | 2022-05-17 |
38 | MSCTD: A Multimodal Sentiment Chat Translation Dataset Highlight: In this work, we introduce a new task named Multimodal Chat Translation (MCT), aiming to generate more accurate translations with the help of the associated dialogue history and visual context. | Yunlong Liang; Fandong Meng; Jinan Xu; Yufeng Chen; Jie Zhou; | acl | 2022-05-17 |
39 | Cross-Lingual Contrastive Learning for Fine-Grained Entity Typing for Low-Resource Languages Highlight: In this paper, we propose a cross-lingual contrastive learning framework to learn FGET models for low-resource languages. | XU HAN et al. | acl | 2022-05-17 |
40 | Bridging The Data Gap Between Training and Inference for Unsupervised Neural Machine Translation Highlight: To narrow the data gap, we propose an online self-training approach, which simultaneously uses the pseudo parallel data {natural source, translated target} to mimic the inference scenario. | Zhiwei He; Xing Wang; Rui Wang; Shuming Shi; Zhaopeng Tu; | acl | 2022-05-17 |
41 | Machine Translation for Livonian: Catering to 20 Speakers Highlight: In this paper we tackle the task of developing neural machine translation (NMT) between Livonian and English, with a two-fold aim: on one hand, preserving the language and on the other – enabling access to Livonian folklore, lifestories and other textual intangible heritage as well as making it easier to create further parallel corpora. | Matiss Rikters; Marili Tomingas; Tuuli Tuisk; Valts Ernštreits; Mark Fishel; | acl | 2022-05-17 |
42 | Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation Highlight: In this paper, we present a novel data augmentation paradigm termed Continuous Semantic Augmentation (CsaNMT), which augments each training instance with an adjacency semantic region that could cover adequate variants of literal expression under the same meaning. | XIANGPENG WEI et al. | acl | 2022-05-17 |
43 | Consistent Human Evaluation of Machine Translation Across Language Pairs Highlight: We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more consistent assessment. | DANIEL LICHT et al. | arxiv-cs.CL | 2022-05-17 |
44 | A Variational Hierarchical Model for Neural Cross-Lingual Summarization Highlight: However, it is very challenging for the model to directly conduct CLS as it requires both the abilities to translate and summarize. To address this issue, we propose a hierarchical model for the CLS task, based on the conditional variational auto-encoder. | YUNLONG LIANG et al. | acl | 2022-05-17 |
45 | Can Transformer Be Too Compositional? Analysing Idiom Processing in Neural Machine Translation Highlight: In this work, we investigate whether the non-compositionality of idioms is reflected in the mechanics of the dominant NMT model, Transformer, by analysing the hidden states and attention patterns for models with English as source language and one of seven European languages as target language. | Verna Dankers; Christopher Lucas; Ivan Titov; | acl | 2022-05-17 |
46 | STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation Highlight: Existing techniques often attempt to transfer powerful machine translation (MT) capabilities to ST, but neglect the representation discrepancy across modalities. In this paper, we propose the Speech-TExt Manifold Mixup (STEMM) method to calibrate such discrepancy. | Qingkai Fang; Rong Ye; Lei Li; Yang Feng; Mingxuan Wang; | acl | 2022-05-17 |
47 | BiTIIMT: A Bilingual Text-infilling Method for Interactive Machine Translation Highlight: In this work, we propose a novel BiTIIMT system, Bilingual Text-Infilling for Interactive Neural Machine Translation. | YANLING XIAO et al. | acl | 2022-05-17 |
48 | DiBiMT: A Novel Benchmark for Measuring Word Sense Disambiguation Biases in Machine Translation Highlight: In this paper, we present DiBiMT, the first entirely manually-curated evaluation benchmark which enables an extensive study of semantic biases in Machine Translation of nominal and verbal words in five different language combinations, namely, English and one or other of the following languages: Chinese, German, Italian, Russian and Spanish. | Niccolò Campolungo; Federico Martelli; Francesco Saina; Roberto Navigli; | acl | 2022-05-17 |
49 | ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation Highlight: This paper explores a deeper relationship between Transformer and numerical ODE methods. | BEI LI et al. | acl | 2022-05-17 |
50 | Towards Making The Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation Highlight: This paper demonstrates that multilingual pretraining and multilingual fine-tuning are both critical for facilitating cross-lingual transfer in zero-shot translation, where the neural machine translation (NMT) model is tested on source languages unseen during supervised training. Following this idea, we present SixT+, a strong many-to-English NMT model that supports 100 source languages but is trained with a parallel dataset in only six source languages. | GUANHUA CHEN et al. | acl | 2022-05-17 |
51 | DEEP: DEnoising Entity Pre-training for Neural Machine Translation Highlight: Earlier named entity translation methods mainly focus on phonetic transliteration, which ignores the sentence context for translation and is limited in domain and language coverage. To address this limitation, we propose DEEP, a DEnoising Entity Pre-training method that leverages large amounts of monolingual data and a knowledge base to improve named entity translation accuracy within sentences. | Junjie Hu; Hiroaki Hayashi; Kyunghyun Cho; Graham Neubig; | acl | 2022-05-17 |
52 | Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation Highlight: In this paper, we propose a Confidence Based Bidirectional Global Context Aware (CBBGCA) training framework for NMT, where the NMT model is jointly trained with an auxiliary conditional masked language model (CMLM). | CHULUN ZHOU et al. | acl | 2022-05-17 |
53 | Zero-Shot Cross-lingual Semantic Parsing Highlight: We propose a multi-task encoder-decoder model to transfer parsing knowledge to additional languages using only English-logical form paired data and in-domain natural language corpora in each new language. | Tom Sherborne; Mirella Lapata; | acl | 2022-05-17 |
54 | ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation Highlight: We present ViT5, a pretrained Transformer-based encoder-decoder model for the Vietnamese language. | Long Phan; Hieu Tran; Hieu Nguyen; Trieu H. Trinh; | arxiv-cs.CL | 2022-05-13 |
55 | Mitigating Gender Stereotypes in Hindi and Marathi Highlight: We create a dataset of neutral and gendered occupation words, emotion words and measure bias with the help of Embedding Coherence Test (ECT) and Relative Norm Distance (RND). | Neeraja Kirtane; Tanvi Anand; | arxiv-cs.CL | 2022-05-12 |
56 | AppTek’s Submission to The IWSLT 2022 Isometric Spoken Language Translation Task Abstract: To participate in the Isometric Spoken Language Translation Task of the IWSLT 2022 evaluation, constrained condition, AppTek developed neural Transformer-based systems for … | Patrick Wilken; Evgeny Matusov; | arxiv-cs.CL | 2022-05-11 |
57 | Controlling Extra-Textual Attributes About Dialogue Participants: A Case Study of English-to-Polish Neural Machine Translation Highlight: We focus on the underresearched problem of utilising external metadata in automatic translation of TV dialogue, proposing a case study where a wide range of approaches for controlling attributes in translation is employed in a multi-attribute scenario. | Sebastian T. Vincent; Loïc Barrault; Carolina Scarton; | arxiv-cs.CL | 2022-05-10 |
58 | CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality Highlight: We introduce an annotated dataset (CoCoA-MT) and an associated evaluation metric for training and evaluating formality-controlled MT models for six diverse target languages. | MARIA NĂDEJDE et al. | arxiv-cs.CL | 2022-05-09 |
59 | ParaCotta: Synthetic Multilingual Paraphrase Corpora from The Most Diverse Translation Sample Pair Highlight: We generate multiple translation samples using beam search and choose the most lexically diverse pair according to their sentence BLEU. | ALHAM FIKRI AJI et al. | arxiv-cs.CL | 2022-05-09 |
60 | Bridging The Domain Gap for Stance Detection for The Zulu Language Highlight: We propose a black-box non-intrusive method that utilizes techniques from Domain Adaptation to reduce the domain gap, without requiring any human expertise in the target language, by leveraging low-quality data in both a supervised and unsupervised manner. | Gcinizwe Dlamini; Imad Eddine Ibrahim Bekkouch; Adil Khan; Leon Derczynski; | arxiv-cs.CL | 2022-05-06 |
61 | Example-Based Machine Translation from Text to A Hierarchical Representation of Sign Language Highlight: This article presents an original method for Text-to-Sign Translation. | Élise Bertin-Lemée; Annelies Braffort; Camille Challant; Claire Danet; Michael Filhol; | arxiv-cs.CL | 2022-05-06 |
62 | Quantifying Synthesis and Fusion and Their Impact on Machine Translation Highlight: However, literature in Natural Language Processing (NLP) typically labels a whole language with a strict type of morphology, e.g. fusional or agglutinative. In this work, we propose to reduce the rigidity of such claims, by quantifying morphological typology at the word and segment level. | ARTURO ONCEVAY et al. | arxiv-cs.CL | 2022-05-06 |
63 | ON-TRAC Consortium Systems for The IWSLT 2022 Dialect and Low-resource Speech Translation Tasks Highlight: This paper describes the ON-TRAC Consortium translation systems developed for two challenge tracks featured in the Evaluation Campaign of IWSLT 2022: low-resource and dialect speech translation. | MARCELY ZANON BOITO et al. | arxiv-cs.CL | 2022-05-04 |
64 | Original or Translated? A Causal Analysis of The Impact of Translationese on Machine Translation Performance Highlight: In this work, we collect CausalMT, a dataset where the MT training data are also labeled with the human translation directions. | Jingwei Ni; Zhijing Jin; Markus Freitag; Mrinmaya Sachan; Bernhard Schölkopf; | arxiv-cs.CL | 2022-05-04 |
65 | Training Mixed-Domain Translation Models Via Federated Learning Highlight: In this work, we leverage federated learning (FL) in order to tackle the problem. | Peyman Passban; Tanya Roosta; Rahul Gupta; Ankit Chadha; Clement Chung; | arxiv-cs.CL | 2022-05-03 |
66 | Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation Highlight: Therefore, there is a need to create training and evaluation data for implementing machine learning tasks and bridging the research gap in the language. This work presents the Hausa Visual Genome (HaVG), a dataset that contains the description of an image or a section within the image in Hausa and its equivalent in English. | IDRIS ABDULMUMIN et al. | arxiv-cs.CL | 2022-05-02 |
67 | The Implicit Length Bias of Label Smoothing on Beam Search Decoding Highlight: We verify our theory by applying a simple rectification function at inference time to restore the unbiased distributions from the label-smoothed model predictions. | Bowen Liang; Pidong Wang; Yuan Cao; | arxiv-cs.CL | 2022-05-02 |
68 | Semantically Informed Slang Interpretation Highlight: We propose a semantically informed slang interpretation (SSI) framework that considers jointly the contextual and semantic appropriateness of a candidate interpretation for a query slang. | Zhewei Sun; Richard Zemel; Yang Xu; | arxiv-cs.CL | 2022-05-01 |
69 | The Cross-lingual Conversation Summarization Challenge Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose the shared task of cross-lingual conversation summarization, ConvSumX Challenge, opening new avenues for researchers to investigate solutions that integrate conversation summarization and machine translation. |
YULONG CHEN et al. | arxiv-cs.CL | 2022-04-30 |
70 | Can Machine Translation Be A Reasonable Alternative for Multilingual Question Answering Systems Over Knowledge Graphs? Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we discuss Knowledge Graph Question Answering (KGQA) systems that aim at providing natural language access to data stored in Knowledge Graphs (KG). |
Aleksandr Perevalov; Andreas Both; Dennis Diefenbach; Axel-Cyrille Ngonga Ngomo; | www | 2022-04-29 |
71 | NMTScore: A Multilingual Analysis of Translation-based Text Similarity Measures Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Translation-based similarity measures include direct and pivot translation probability, as well as translation cross-likelihood, which has not been studied so far. We analyze these measures in the common framework of multilingual NMT, releasing the NMTScore library (available at https://github.com/ZurichNLP/nmtscore). |
Jannis Vamvas; Rico Sennrich; | arxiv-cs.CL | 2022-04-28 |
72 | Efficient Machine Translation Domain Adaptation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore several approaches to speed up nearest neighbor machine translation. |
Pedro Henrique Martins; Zita Marinho; André F. T. Martins; | arxiv-cs.CL | 2022-04-26 |
73 | A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we conduct a systematic survey with comparisons and discussions of various non-autoregressive translation (NAT) models from different aspects. |
YISHENG XIAO et al. | arxiv-cs.CL | 2022-04-20 |
74 | IndicXNLI: Evaluating Multilingual Inference for Indian Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: While Indic NLP has made rapid advances recently in terms of the availability of corpora and pre-trained models, benchmark datasets on standard NLU tasks are limited. To this end, we introduce IndicXNLI, an NLI dataset for 11 Indic languages. |
Divyanshu Aggarwal; Vivek Gupta; Anoop Kunchukuttan; | arxiv-cs.CL | 2022-04-19 |
75 | PICT@DravidianLangTech-ACL2022: Neural Machine Translation On Dravidian Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents a summary of the findings that we obtained based on the shared task on machine translation of Dravidian languages. |
Aditya Vyawahare; Rahul Tangsali; Aditya Mandke; Onkar Litake; Dipali Kadam; | arxiv-cs.CL | 2022-04-19 |
76 | Dynamic Position Encoding for Transformers Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: However, such embeddings are fixed after training regardless of the task and the word ordering system of the source or target language. In this paper, we propose a novel architecture with new position embeddings depending on the input text to address this shortcoming by taking the order of target words into consideration. |
Joyce Zheng; Mehdi Rezagholizadeh; Peyman Passban; | arxiv-cs.CL | 2022-04-17 |
77 | Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a novel data augmentation paradigm termed Continuous Semantic Augmentation (CsaNMT), which augments each training instance with an adjacency semantic region that could cover adequate variants of literal expression under the same meaning. |
XIANGPENG WEI et al. | arxiv-cs.CL | 2022-04-14 |
78 | The Impact of Cross-Lingual Adjustment of Contextual Word Representations on Zero-Shot Transfer Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we experiment with zero-shot transfer of English models to four typologically different languages (Spanish, Russian, Vietnamese, and Hindi) and three NLP tasks (QA, NLI, and NER). |
Pavel Efimov; Leonid Boytsov; Elena Arslanova; Pavel Braslavski; | arxiv-cs.CL | 2022-04-13 |
79 | Creativity in Translation: Machine Translation As A Constraint for Literary Texts Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This article presents the results of a study involving the translation of a short story by Kurt Vonnegut from English to Catalan and Dutch using three modalities: machine-translation (MT), post-editing (PE) and translation without aid (HT). |
Ana Guerberof Arenas; Antonio Toral; | arxiv-cs.CL | 2022-04-12 |
80 | Large-Scale Streaming End-to-End Speech Translation with Neural Transducers Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce it to streaming end-to-end speech translation (ST), which aims to convert audio signals to texts in other languages directly. |
Jian Xue; Peidong Wang; Jinyu Li; Matt Post; Yashesh Gaur; | arxiv-cs.CL | 2022-04-11 |
81 | Toward More Effective Human Evaluation for Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate a simple way to reduce cost by reducing the number of text segments that must be annotated in order to accurately predict a score for a complete test set. |
Belén Saldías; George Foster; Markus Freitag; Qijun Tan; | arxiv-cs.CL | 2022-04-11 |
82 | Towards Better Chinese-centric Neural Machine Translation for Low-resource Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present the winning competition system that leverages monolingual word embeddings data enhancement, bilingual curriculum learning, and contrastive re-ranking. |
Bin Li; Yixuan Weng; Fei Xia; Hanjun Deng; | arxiv-cs.CL | 2022-04-08 |
83 | MMTAfrica: Multilingual Machine Translation for African Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the task of multilingual machine translation for African languages and describe our contribution in the 2021 WMT Shared Task: Large-Scale Multilingual Machine Translation. |
Chris C. Emezue; Bonaventure F. P. Dossou; | arxiv-cs.CL | 2022-04-08 |
84 | GigaST: A 10,000-hour Pseudo Speech Translation Corpus Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces GigaST, a large-scale pseudo speech translation (ST) corpus. |
RONG YE et al. | arxiv-cs.CL | 2022-04-08 |
85 | Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Direct speech-to-speech translation (S2ST) models suffer from data scarcity issues as there exists little parallel S2ST data, compared to the amount of data available for conventional cascaded systems that consist of automatic speech recognition (ASR), machine translation (MT), and text-to-speech (TTS) synthesis. In this work, we explore self-supervised pre-training with unlabeled speech data and data augmentation to tackle this issue. |
SRAVYA POPURI et al. | arxiv-cs.CL | 2022-04-06 |
86 | Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual Retrieval Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we show that two parameter-efficient approaches to cross-lingual transfer, namely Sparse Fine-Tuning Masks (SFTMs) and Adapters, allow for a more lightweight and more effective zero-shot transfer to multilingual and cross-lingual retrieval tasks. |
Robert Litschko; Ivan Vulić; Goran Glavaš; | arxiv-cs.CL | 2022-04-05 |
87 | Modeling Target-Side Morphology in Neural Machine Translation: A Comparison of Strategies Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we re-investigate two target-side linguistic processing techniques: a lemma-tag strategy and a linguistically informed word segmentation strategy. |
Marion Weller-Di Marco; Matthias Huck; Alexander Fraser; | arxiv-cs.CL | 2022-03-25 |
88 | Mitigating Gender Bias in Machine Translation Through Adversarial Learning Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an adversarial learning framework that addresses these challenges to mitigate gender bias in seq2seq machine translation. |
Eve Fleisig; Christiane Fellbaum; | arxiv-cs.CL | 2022-03-20 |
89 | A New Approach to Calculating BERTScore for Automatic Assessment of Translation Quality Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We study the applicability of the BERTScore metric to translation quality assessment at the sentence level for the English -> Russian direction. |
A. A. Vetrov; E. A. Gorn; | arxiv-cs.CL | 2022-03-10 |
90 | From Simultaneous to Streaming Machine Translation By Leveraging Streaming History Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, a state-of-the-art simultaneous sentence-level MT system is extended to the streaming setup by leveraging the streaming history. |
Javier Iranzo-Sánchez; Jorge Civera; Alfons Juan; | arxiv-cs.CL | 2022-03-04 |
91 | UDAAN – Machine Learning Based Post-Editing Tool for Document Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce UDAAN, an open-source post-editing tool that can reduce manual editing efforts to quickly produce publishable-standard documents in different languages. |
Ayush Maheshwari; Ajay Ravindran; Venkatapathy Subramanian; Akshay Jalan; Ganesh Ramakrishnan; | arxiv-cs.CL | 2022-03-03 |
92 | OCR Improves Machine Translation for Low-Resource Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We aim to investigate the performance of current OCR systems on low resource languages and low resource scripts. |
Oana Ignat; Jean Maillard; Vishrav Chaudhary; Francisco Guzmán; | arxiv-cs.CL | 2022-02-26 |
93 | Screening Gender Transfer in Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper aims at identifying the information flow in state-of-the-art machine translation systems, taking as example the transfer of gender when translating from French into English. |
Guillaume Wisniewski; Lichao Zhu; Nicolas Ballier; François Yvon; | arxiv-cs.CL | 2022-02-25 |
94 | JParaCrawl V3.0: A Large-scale English-Japanese Parallel Corpus Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Most current machine translation models are mainly trained with parallel corpora, and their translation accuracy largely depends on the quality and quantity of the corpora. |
Makoto Morishita; Katsuki Chousa; Jun Suzuki; Masaaki Nagata; | arxiv-cs.CL | 2022-02-25 |
95 | The Reality of Multi-Lingual Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our book The Reality of Multi-Lingual Machine Translation discusses the benefits and perils of using more than two languages in machine translation systems. |
Tom Kocmi; Dominik Macháček; Ondřej Bojar; | arxiv-cs.CL | 2022-02-25 |
96 | Using Natural Language Prompts for Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We explore the use of natural language prompts for controlling various aspects of the outputs generated by machine translation models. |
Xavier Garcia; Orhan Firat; | arxiv-cs.CL | 2022-02-23 |
97 | An Overview on Machine Translation Evaluation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This report mainly includes the following contents: a brief history of machine translation evaluation (MTE), the classification of research methods on MTE, and the cutting-edge progress, including human evaluation, automatic evaluation, and evaluation of evaluation methods (meta-evaluation). |
Lifeng Han; | arxiv-cs.CL | 2022-02-22 |
98 | Domain Adaptation in Neural Machine Translation Using A Qualia-Enriched FrameNet Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we present Scylla, a methodology for domain adaptation of Neural Machine Translation (NMT) systems that make use of a multilingual FrameNet enriched with qualia relations as an external knowledge base. |
Alexandre Diniz Costa; Mateus Coutinho Marim; Ely Edison da Silva Matos; Tiago Timponi Torrent; | arxiv-cs.CL | 2022-02-21 |
99 | CALCS 2021 Shared Task: Machine Translation for Code-Switched Data Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address machine translation for code-switched social media data. |
Shuguang Chen; Gustavo Aguilar; Anirudh Srinivasan; Mona Diab; Thamar Solorio; | arxiv-cs.CL | 2022-02-19 |
100 | PETCI: A Parallel English Translation Dataset of Chinese Idioms Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present PETCI, a parallel English translation dataset of Chinese idioms, aiming to improve idiom translation by both human and machine. |
Kenan Tang; | arxiv-cs.CL | 2022-02-18 |
101 | Improving English to Sinhala Neural Machine Translation Using Part-of-Speech Tag Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Thus, in this research, we explore effective methods of incorporating Part of Speech (POS) tags to the Transformer input embedding and positional encoding to further enhance the performance of the baseline English to Sinhala neural machine translation model. |
Ravinga Perera; Thilakshi Fonseka; Rashmini Naranpanawa; Uthayasanker Thayasivam; | arxiv-cs.CL | 2022-02-17 |
102 | Sequence-to-Sequence Resources for Catalan Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce sequence-to-sequence language resources for Catalan, a moderately under-resourced language, towards two tasks, namely: Summarization and Machine Translation (MT). |
Ona de Gibert; Ksenia Kharitonova; Blanca Calvo Figueras; Jordi Armengol-Estapé; Maite Melero; | arxiv-cs.CL | 2022-02-14 |
103 | Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a fine-tuning loss that enables a pre-trained model to mine pseudo-parallel data for fully unsupervised machine translation. |
XUAN-PHI NGUYEN et al. | iclr | 2022-02-08 |
104 | Pirá: A Bilingual Portuguese-English Dataset for Question-Answering About The Ocean Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents the Pirá dataset, a large set of questions and answers about the ocean and the Brazilian coast both in Portuguese and English. |
ANDRÉ F. A. PASCHOAL et al. | arxiv-cs.CL | 2022-02-04 |
105 | A Survey on Retrieval-Augmented Text Generation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper aims to conduct a survey about retrieval-augmented text generation. |
Huayang Li; Yixuan Su; Deng Cai; Yan Wang; Lemao Liu; | arxiv-cs.CL | 2022-02-02 |
106 | Prabhupadavani: A Code-mixed Speech Translation Data for 25 Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Thus, we introduce Prabhupadavani, a multilingual code-mixed ST dataset for 25 languages, covering ten language families, containing 94 hours of speech by 130+ speakers, manually aligned with corresponding text in the target language. |
Jivnesh Sandhan; Ayush Daksh; Om Adideva Paranjay; Laxmidhar Behera; Pawan Goyal; | arxiv-cs.CL | 2022-01-27 |
107 | Tackling Data Scarcity in Speech Translation Using Zero-shot Multilingual Machine Translation Techniques Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the related field of multilingual text translation, several techniques have been proposed for zero-shot translation. |
Tu Anh Dinh; Danni Liu; Jan Niehues; | arxiv-cs.CL | 2022-01-26 |
108 | VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Existing multimodal machine translation (MMT) datasets consist of images and video captions or general subtitles, which rarely contain linguistic ambiguity, making visual information not so effective to generate appropriate translations. We introduce VISA, a new dataset that consists of 40k Japanese-English parallel sentence pairs and corresponding video clips with the following key features: (1) the parallel sentences are subtitles from movies and TV episodes; (2) the source subtitles are ambiguous, which means they have multiple possible translations with different meanings; (3) we divide the dataset into Polysemy and Omission according to the cause of ambiguity. |
Yihang Li; Shuichiro Shimizu; Weiqi Gu; Chenhui Chu; Sadao Kurohashi; | arxiv-cs.CL | 2022-01-20 |
109 | Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource neural machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English. |
Zhuoyuan Mao; Chenhui Chu; Sadao Kurohashi; | arxiv-cs.CL | 2022-01-20 |
110 | Syntax-based Data Augmentation for Hungarian-English Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We train Transformer-based neural machine translation models for Hungarian-English and English-Hungarian using the Hunglish2 corpus. |
Attila Nagy; Patrick Nanys; Balázs Frey Konrád; Bence Bial; Judit Ács; | arxiv-cs.CL | 2022-01-18 |
111 | Klexikon: A German Dataset for Joint Summarization and Simplification Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle this problem, we pose core requirements for a system that can jointly summarize and simplify long source documents. |
Dennis Aumiller; Michael Gertz; | arxiv-cs.CL | 2022-01-18 |
112 | Towards The Next 1000 Languages in Multilingual Machine Translation: Exploring The Synergy Between Supervised and Self-Supervised Learning Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To this end, we present a pragmatic approach towards building a multilingual MT model that covers hundreds of languages, using a mixture of supervised and self-supervised objectives, depending on the data availability for different language pairs. |
ADITYA SIDDHANT et al. | arxiv-cs.CL | 2022-01-09 |
113 | An Ensemble Approach to Acronym Extraction Using Transformers Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper discusses an ensemble approach for the task of Acronym Extraction, which utilises two different methods to extract acronyms and their corresponding long forms. |
Prashant Sharma; Hadeel Saadany; Leonardo Zilio; Diptesh Kanojia; Constantin Orăsan; | arxiv-cs.CL | 2022-01-09 |
114 | Phrase-level Adversarial Example Generation for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a phrase-level adversarial example generation (PAEG) method to enhance the robustness of the model. |
JUNCHENG WAN et al. | arxiv-cs.CL | 2022-01-06 |
115 | How Do Lexical Semantics Affect Translation? An Empirical Study Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we investigate these relationships on a variety of low-resource language pairs from the OpenSubtitles2016 database, where the source language is English, and find that the more similar the target language is to English, the greater the translation performance. |
Vivek Subramanian; Dhanasekar Sundararaman; | arxiv-cs.CL | 2021-12-31 |
116 | Pirá: A Bilingual Portuguese-English Dataset for Question-Answering About The Ocean Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents the Pirá dataset, a large set of questions and answers about the ocean and the Brazilian coast both in Portuguese and English. |
ANDRÉ F. A. PASCHOAL et al. | cikm | 2021-12-30 |
117 | Frequency-Aware Contrastive Learning for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Specifically, we propose a frequency-aware token-level contrastive learning method, in which the hidden state of each decoding step is pushed away from the counterparts of other target words, in a soft contrastive way based on the corresponding word frequencies. |
TONG ZHANG et al. | arxiv-cs.CL | 2021-12-29 |
118 | A Preordered RNN Layer Boosts Neural Machine Translation in Low Resource Settings Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose to augment attention based neural network with reordering information to alleviate the lack of data. |
Mohaddeseh Bastan; Shahram Khadivi; | arxiv-cs.CL | 2021-12-27 |
119 | HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professional Post-Editing Towards More Effective MT Evaluation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce HOPE, a task-oriented and human-centric evaluation framework for machine translation output based on professional post-editing annotations. |
Serge Gladkoff; Lifeng Han; | arxiv-cs.CL | 2021-12-27 |
120 | Challenge Dataset of Cognates and False Friend Pairs from Indian Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we describe the creation of two cognate datasets for twelve Indian languages, namely Sanskrit, Hindi, Assamese, Oriya, Kannada, Gujarati, Tamil, Telugu, Punjabi, Bengali, Marathi, and Malayalam. |
Diptesh Kanojia; Pushpak Bhattacharyya; Malhar Kulkarni; Gholamreza Haffari; | arxiv-cs.CL | 2021-12-17 |
121 | Learning and Analyzing Generation Order for Undirected Sequence Models Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we train a policy that learns the generation order for a pre-trained, undirected translation model via reinforcement learning. |
Yichen Jiang; Mohit Bansal; | arxiv-cs.CL | 2021-12-16 |
122 | Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we demonstrate the use of cross-lingual word embeddings for detecting cognates among fourteen Indian Languages. |
DIPTESH KANOJIA et al. | arxiv-cs.CL | 2021-12-16 |
123 | Isometric MT: Neural Machine Translation for Automatic Dubbing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work introduces a self-learning approach that allows a transformer model to directly learn to generate outputs that closely match the source length, in short Isometric MT. In particular, our approach requires neither generating multiple hypotheses nor any auxiliary ranking function. |
Surafel M. Lakew; Yogesh Virkar; Prashant Mathur; Marcello Federico; | arxiv-cs.CL | 2021-12-16 |
124 | Improving Both Domain Robustness and Domain Adaptability in Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel approach, RMLNMT (Robust Meta-Learning Framework for Neural Machine Translation Domain Adaptation), which improves the robustness of existing meta-learning models. |
Wen Lai; Jindřich Libovický; Alexander Fraser; | arxiv-cs.CL | 2021-12-15 |
125 | Lesan — Machine Translation for Low Resource Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present Lesan, an MT system for low resource languages. |
Asmelash Teka Hadgu; Abel Aregawi; Adam Beaudoin; | arxiv-cs.CL | 2021-12-15 |
126 | Prosody-Aware Neural Machine Translation for Dubbing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose implicit and explicit modeling approaches to integrate prosody information into neural machine translation. |
Derek Tam; Surafel M. Lakew; Yogesh Virkar; Prashant Mathur; Marcello Federico; | arxiv-cs.CL | 2021-12-15 |
127 | Step-unrolled Denoising Autoencoders for Text Generation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we propose a new generative model of text, Step-unrolled Denoising Autoencoder (SUNDAE), that does not rely on autoregressive models. |
Nikolay Savinov; Junyoung Chung; Mikolaj Binkowski; Erich Elsen; Aaron van den Oord; | arxiv-cs.CL | 2021-12-13 |
128 | Selecting Parallel In-domain Sentences for Neural Machine Translation Using Monolingual Texts Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Specific use-cases are usually left out, since generic models tend to perform poorly in domain-specific cases. |
Javad Pourmostafa Roshan Sharami; Dimitar Shterionov; Pieter Spronck; | arxiv-cs.CL | 2021-12-11 |
129 | Communication-Efficient Federated Learning for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore how to efficiently build NMT models in an FL setup by proposing a novel solution. |
Tanya Roosta; Peyman Passban; Ankit Chadha; | arxiv-cs.CL | 2021-12-11 |
130 | Multitask Finetuning for Improving Neural Machine Translation in Indian Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a Multitask Finetuning methodology which combines the Bilingual Machine Translation task with an auxiliary Causal Language Modeling task to improve performance on the former task on Indian Languages. |
Shaily Desai; Atharva Kshirsagar; Manisha Marathe; | arxiv-cs.CL | 2021-12-03 |
131 | Translating Politeness Across Cultures: Case of Hindi and English Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a corpus-based study of politeness across two languages, English and Hindi. |
Ritesh Kumar; Girish Nath Jha; | arxiv-cs.CL | 2021-12-03 |
132 | Improvement in Machine Translation with Generative Adversarial Networks Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we explore machine translation improvement via Generative Adversarial Network (GAN) architecture. |
Jay Ahn; Hari Madhu; Viet Nguyen; | arxiv-cs.CL | 2021-11-30 |
133 | Ensembling of Distilled Models from Multi-task Teachers for Constrained Resource Language Pairs Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes our submission to the constrained track of WMT21 shared news translation task. |
AMR HENDY et al. | arxiv-cs.CL | 2021-11-25 |
134 | BARTScore: Evaluating Generated Text As Text Generation IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we conceptualize the evaluation of generated text as a text generation problem, modeled using pre-trained sequence-to-sequence models. |
Weizhe Yuan; Graham Neubig; Pengfei Liu; | nips | 2021-11-20 |
135 | R-Drop: Regularized Dropout for Neural Networks IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a simple consistency training strategy to regularize dropout, namely R-Drop, which forces the output distributions of different sub models generated by dropout to be consistent with each other. |
XIAOBO LIANG et al. | nips | 2021-11-20 |
136 | XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents XLS-R, a large-scale model for cross-lingual speech representation learning based on wav2vec 2.0. |
ARUN BABU et al. | arxiv-cs.CL | 2021-11-17 |
137 | NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper provides an overview of NVIDIA NeMo’s neural machine translation systems for the constrained data track of the WMT21 News and Biomedical Shared Translation Tasks. |
Sandeep Subramanian; Oleksii Hrinchuk; Virginia Adams; Oleksii Kuchaiev; | arxiv-cs.CL | 2021-11-16 |
138 | Measuring Uncertainty in Translation Quality Evaluation (TQE) Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: The methodology we applied for this work is from Bernoulli Statistical Distribution Modelling (BSDM) and Monte Carlo Sampling Analysis (MCSA). |
Serge Gladkoff; Irina Sorokina; Lifeng Han; Alexandra Alekseeva; | arxiv-cs.CL | 2021-11-15 |
139 | Developing Neural Machine Translation Models for Hungarian-English Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: I propose 5 different augmentation methods that are structure-aware, meaning that instead of randomly selecting words for blanking or replacement, the dependency tree of sentences is used as a basis for augmentation. |
Attila Nagy; | arxiv-cs.CL | 2021-11-07 |
140 | PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a high-quality and large-scale Vietnamese-English parallel dataset of 3.02M sentence pairs, which is 2.9M pairs larger than the benchmark Vietnamese-English machine translation corpus IWSLT15. |
Long Doan; Linh The Nguyen; Nguyen Luong Tran; Thai Hoang; Dat Quoc Nguyen; | emnlp | 2021-11-05 |
141 | HintedBT: Augmenting Back-Translation with Quality and Transliteration Hints Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To improve the effectiveness of the available BT data, we introduce HintedBT, a family of techniques that provide hints (through tags) to the encoder and decoder. |
Sahana Ramnath; Melvin Johnson; Abhirut Gupta; Aravindan Raghuveer; | emnlp | 2021-11-05 |
142 | One Source, Two Targets: Challenges and Rewards of Dual Decoding Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider a stronger requirement: to jointly generate two texts so that each output side effectively depends on the other. |
Jitao Xu; François Yvon; | emnlp | 2021-11-05 |
143 | Rule-based Morphological Inflection Improves Neural Terminology Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a modular framework for incorporating lemma constraints in neural MT (NMT) in which linguistic knowledge and diverse types of NMT models can be flexibly applied. |
Weijia Xu; Marine Carpuat; | emnlp | 2021-11-05 |
144 | Recurrent Attention for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we push further in this research line and propose a novel substitute mechanism for self-attention: Recurrent AtteNtion (RAN). |
Jiali Zeng; Shuangzhi Wu; Yongjing Yin; Yufan Jiang; Mu Li; | emnlp | 2021-11-05 |
145 | A Generative Framework for Simultaneous Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a generative framework for simultaneous machine translation. |
Yishu Miao; Phil Blunsom; Lucia Specia; | emnlp | 2021-11-05 |
146 | Vision Matters When It Should: Sanity Checking Multimodal Machine Translation Models Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a qualitative study that examines the role of datasets in stimulating the leverage of visual modality and we propose methods to highlight the importance of visual signals in the datasets which demonstrate improvements in reliance of models on the source images. |
Jiaoda Li; Duygu Ataman; Rico Sennrich; | emnlp | 2021-11-05 |
147 | Cross Attention Augmented Transducer Networks for Simultaneous Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a novel architecture, Cross Attention Augmented Transducer (CAAT), for simultaneous translation. |
Dan Liu; Mengge Du; Xiaoxi Li; Ya Li; Enhong Chen; | emnlp | 2021-11-05 |
148 | An Empirical Investigation of Word Alignment Supervision for Zero-Shot Multilingual Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we investigate the benefits of an explicit alignment to language labels in Transformer-based MNMT models in the zero-shot context, by jointly training one cross attention head with word alignment supervision to stress the focus on the target language label. |
Alessandro Raganato; Raúl Vázquez; Mathias Creutz; Jörg Tiedemann; | emnlp | 2021-11-05 |
149 | GFST: Gender-Filtered Self-Training for More Accurate Gender in Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose gender-filtered self-training (GFST) to improve gender translation accuracy on unambiguously gendered inputs. |
Prafulla Kumar Choubey; Anna Currey; Prashant Mathur; Georgiana Dinu; | emnlp | 2021-11-05 |
150 | Encouraging Lexical Translation Consistency for Document-Level Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we apply one translation per discourse in NMT, and aim to encourage lexical translation consistency for document-level NMT. |
Xinglin Lyu; Junhui Li; Zhengxian Gong; Min Zhang; | emnlp | 2021-11-05 |
151 | Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose heterogeneous ways of embedding topic information at the sentence level into an NMT model to improve translation performance. |
Weixuan Wang; Wei Peng; Meng Zhang; Qun Liu; | emnlp | 2021-11-05 |
152 | Evaluating The Morphosyntactic Well-formedness of Generated Texts Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose L’AMBRE – a metric to evaluate the morphosyntactic well-formedness of text using its dependency parse and morphosyntactic rules of the language. |
Adithya Pratapa et al. | emnlp | 2021-11-05 |
153 | BiSECT: Learning to Split and Rephrase Sentences with Bitexts Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a novel dataset and a new model for this ‘split and rephrase’ task. |
Joongwon Kim; Mounica Maddela; Reno Kriz; Wei Xu; Chris Callison-Burch; | emnlp | 2021-11-05 |
154 | Document Graph for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we hypothesize that a document can be represented as a graph that connects relevant contexts regardless of their distances. |
Mingzhou Xu; Liangyou Li; Derek F. Wong; Qun Liu; Lidia S. Chao; | emnlp | 2021-11-05 |
155 | Towards Making The Most of Dialogue Characteristics for Neural Chat Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to promote the chat translation by introducing the modeling of dialogue characteristics into the NCT model. |
Yunlong Liang et al. | emnlp | 2021-11-05 |
156 | Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose an approach, MultiUAT, that dynamically adjusts the training data usage based on the model’s uncertainty on a small set of trusted clean data for multi-corpus machine translation. |
Minghao Wu et al. | emnlp | 2021-11-05 |
157 | Cross-lingual Intermediate Fine-tuning Improves Dialogue State Tracking Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we enhance the transfer learning process by intermediate fine-tuning of pretrained multilingual models, where the multilingual models are fine-tuned with different but related data and/or tasks. |
Nikita Moghe; Mark Steedman; Alexandra Birch; | emnlp | 2021-11-05 |
158 | Wikily Supervised Neural Translation Tailored to Cross-Lingual Tasks Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a simple but effective approach for leveraging Wikipedia for neural machine translation as well as cross-lingual tasks of image captioning and dependency parsing without using any direct supervision from external parallel data or supervised models in the target language. |
Mohammad Sadegh Rasooli; Chris Callison-Burch; Derry Tanti Wijaya; | emnlp | 2021-11-05 |
159 | Unsupervised Neural Machine Translation with Universal Grammar Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, in this paper, we seek to leverage such shared grammar clues to provide more explicit language parallel signals to enhance the training of unsupervised machine translation models. |
Zuchao Li; Masao Utiyama; Eiichiro Sumita; Hai Zhao; | emnlp | 2021-11-05 |
160 | mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we improve multilingual text-to-text transfer Transformer with translation pairs (mT6). |
Zewen Chi et al. | emnlp | 2021-11-05 |
161 | Neural Machine Translation Quality and Post-Editing Performance Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Across all models, we found that better MT systems indeed lead to fewer changes in the sentences in this industry setting. |
Vilém Zouhar; Martin Popel; Ondřej Bojar; Aleš Tamchyna; | emnlp | 2021-11-05 |
162 | Translating Headers of Tabular Data: A Pilot Study of Schema Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To facilitate the research study, we construct the first parallel dataset for schema translation, which consists of 3,158 tables with 11,979 headers written in 6 different languages, including English, Chinese, French, German, Spanish, and Japanese. Also, we propose the first schema translation model called CAST, which is a header-to-header neural machine translation model augmented with schema context. |
Kunrui Zhu; Yan Gao; Jiaqi Guo; Jian-Guang Lou; | emnlp | 2021-11-05 |
163 | I Wish I Would Have Loved This One, But I Didn’t – A Multilingual Dataset for Counterfactual Detection in Product Review Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider the problem of counterfactual detection (CFD) in product reviews. |
James O’Neill; Polina Rozenshtein; Ryuichi Kiryo; Motoko Kubota; Danushka Bollegala; | emnlp | 2021-11-05 |
164 | Controlling Machine Translation for Multiple Attributes with Additive Interventions Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We address these problems by introducing vector-valued interventions which allow for fine-grained control over multiple attributes simultaneously via a weighted linear combination of the corresponding vectors. |
Andrea Schioppa; David Vilar; Artem Sokolov; Katja Filippova; | emnlp | 2021-11-05 |
165 | Enlivening Redundant Heads in Multi-head Self-attention for Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a redundant head enlivening (RHE) method to precisely identify redundant heads, and then vitalize their potential by learning syntactic relations and prior knowledge in the text without sacrificing the roles of important heads. |
Tianfu Zhang; Heyan Huang; Chong Feng; Longbing Cao; | emnlp | 2021-11-05 |
166 | XLEnt: Mining A Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, we propose Lexical-Semantic-Phonetic Align (LSP-Align), a technique to automatically mine cross-lingual entity lexica from mined web data. |
Ahmed El-Kishky; Adithya Renduchintala; James Cross; Francisco Guzmán; Philipp Koehn; | emnlp | 2021-11-05 |
167 | Wino-X: Multilingual Winograd Schemas for Commonsense Reasoning and Coreference Resolution Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents Wino-X, a parallel dataset of German, French, and Russian schemas, aligned with their English counterparts. |
Denis Emelin; Rico Sennrich; | emnlp | 2021-11-05 |
168 | Lingua Custodia’s Participation at The WMT 2021 Machine Translation Using Terminologies Shared Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes Lingua Custodia’s submission to the WMT21 shared task on machine translation using terminologies. |
Melissa Ailem; Jinghsu Liu; Raheel Qader; | arxiv-cs.CL | 2021-11-03 |
169 | Contextual Semantic Parsing for Multilingual Task-Oriented Dialogues Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose automatic translation of dialogue datasets with alignment to ensure faithful translation of slot values and eliminate costly human supervision used in previous benchmarks. Finally, we present RiSAWOZ English and German datasets, created using our translation methodology. |
Mehrad Moradshahi; Victoria Tsai; Giovanni Campagna; Monica S. Lam; | arxiv-cs.CL | 2021-11-03 |
170 | Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This report describes Microsoft’s machine translation systems for the WMT21 shared task on large-scale multilingual machine translation. |
Jian Yang et al. | arxiv-cs.CL | 2021-11-03 |
171 | Simultaneous Neural Machine Translation with Constituent Label Prediction Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by the concept of pre-reordering, we propose a couple of simple decision rules using the label of the next constituent predicted by incremental constituent label prediction. |
Yasumasa Kano; Katsuhito Sudoh; Satoshi Nakamura; | arxiv-cs.CL | 2021-10-26 |
172 | Assessing Evaluation Metrics for Speech-to-Speech Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Speech-to-speech translation combines machine translation with speech synthesis, introducing evaluation challenges not present in either task alone. How to automatically evaluate … |
Elizabeth Salesky; Julian Mäder; Severin Klinger; | arxiv-cs.CL | 2021-10-26 |
173 | Discontinuous Grammar As A Foreign Language Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To close the gap, we here extend the framework of sequence-to-sequence models for constituent parsing, not only by providing a more powerful neural architecture for improving their performance, but also by enlarging their coverage to handle the most complex syntactic phenomena: discontinuous structures. |
Daniel Fernández-González; Carlos Gómez-Rodríguez; | arxiv-cs.CL | 2021-10-20 |
174 | The Arabic Parallel Gender Corpus 2.0: Extensions and Analyses Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a new corpus for gender identification and rewriting in contexts involving one or two target users (I and/or You) — first and second grammatical persons with independent grammatical gender preferences. |
Bashar Alhafni; Nizar Habash; Houda Bouamor; | arxiv-cs.CL | 2021-10-18 |
175 | Towards Making The Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper demonstrates that multilingual pretraining and multilingual fine-tuning are both critical for facilitating cross-lingual transfer in zero-shot translation, where the neural machine translation (NMT) model is tested on source languages unseen during supervised training. Following this idea, we present SixT+, a strong many-to-English NMT model that supports 100 source languages but is trained with a parallel dataset in only six source languages. |
Guanhua Chen et al. | arxiv-cs.CL | 2021-10-16 |
176 | Unifying Cross-lingual Summarization and Machine Translation with Compression Rate Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel task, Cross-lingual Summarization with Compression rate (CSC), to benefit Cross-Lingual Summarization by large-scale Machine Translation corpus. |
Yu Bai et al. | arxiv-cs.CL | 2021-10-15 |
177 | Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose Decision Attentive Regularization (DAR) to improve the decision policy of SimulST systems by using the simultaneous text-to-text translation (SimulMT) task. |
Mohd Abbas Zaidi; Beomseok Lee; Sangha Kim; Chanwoo Kim; | arxiv-cs.SD | 2021-10-13 |
178 | Evaluation of Abstractive Summarisation Models with Machine Translation in Deliberative Processes Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present work on summarising deliberative processes for non-English languages. |
M. Arana-Catania; Rob Procter; Yulan He; Maria Liakata; | arxiv-cs.CL | 2021-10-12 |
179 | WeTS: A Benchmark for Translation Suggestion Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: With the corpus we construct, we introduce the Transformer-based model for TS, and experimental results show that our model achieves State-Of-The-Art (SOTA) results on all four translation directions, including English-to-German, German-to-English, Chinese-to-English and English-to-Chinese. |
Zhen Yang; Fandong Meng; Yingxue Zhang; Ernan Li; Jie Zhou; | arxiv-cs.CL | 2021-10-11 |
180 | Unsupervised Neural Machine Translation with Generative Language Models Only Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We show how to derive state-of-the-art unsupervised neural machine translation systems from generatively pre-trained language models. |
Jesse Michael Han et al. | arxiv-cs.CL | 2021-10-11 |
181 | LightSeq2: Accelerated Training for Transformer-based Models on GPUs Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present LightSeq2, a system to accelerate training for a general family of Transformer models on GPUs. |
Xiaohui Wang et al. | arxiv-cs.CL | 2021-10-11 |
182 | Machine Translation Verbosity Control for Automatic Dubbing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the problem of controlling the verbosity of machine translation output, so that subsequent steps of our automatic dubbing pipeline can generate dubs of better quality. |
Surafel M. Lakew et al. | arxiv-cs.CL | 2021-10-07 |
183 | Sequence-to-Sequence Lexical Normalization with Multilingual Transformers Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose a sentence-level sequence-to-sequence model based on mBART, which frames the problem as a machine translation problem. |
Ana-Maria Bucur; Adrian Cosma; Liviu P. Dinu; | arxiv-cs.CL | 2021-10-06 |
184 | On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To gain insight into the role neurons play, we study the activation patterns corresponding to meaning preserving paraphrases (e.g., active-passive). |
Gal Patel; Leshem Choshen; Omri Abend; | arxiv-cs.CL | 2021-10-06 |
185 | The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we instead consider the impact of compression in a data-limited regime. |
Orevaoghene Ahia; Julia Kreutzer; Sara Hooker; | arxiv-cs.CL | 2021-10-06 |
186 | On The Complementarity Between Pre-Training and Back-Translation for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce two probing tasks for PT and BT respectively and find that PT mainly contributes to the encoder module while BT brings more benefits to the decoder. |
Xuebo Liu et al. | arxiv-cs.CL | 2021-10-05 |
187 | Sentiment-Aware Measure (SAM) for Evaluating Sentiment Transfer By Machine Translation Systems Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a numerical ‘sentiment-closeness’ measure appropriate for assessing the accuracy of a translated affect message in UGC text by an MT system. |
Hadeel Saadany; Constantin Orasan; Emad Mohamed; Ashraf Tantawy; | arxiv-cs.CL | 2021-09-30 |
188 | EdinSaar@WMT21: North-Germanic Low-Resource Multilingual NMT Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe the EdinSaar submission to the shared task of Multilingual Low-Resource Translation for North Germanic Languages at the Sixth Conference on Machine Translation (WMT2021). |
Svetlana Tchistiakova; Jesujoba Alabi; Koel Dutta Chowdhury; Sourav Dutta; Dana Ruiter; | arxiv-cs.CL | 2021-09-29 |
189 | BLEU, METEOR, BERTScore: Evaluation of Metrics Performance in Assessing Critical Translation Errors in Sentiment-oriented Text Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we assess the ability of automatic quality metrics to detect critical machine translation errors which can cause serious misunderstanding of the affect message. |
Hadeel Saadany; Constantin Orasan; | arxiv-cs.CL | 2021-09-29 |
190 | Text Simplification for Comprehension-based Question-Answering Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we investigate the effect of text simplification in the task of question-answering using a comprehension context. |
Tanvi Dadu; Kartikey Pant; Seema Nagar; Ferdous Ahmed Barbhuiya; Kuntal Dey; | arxiv-cs.CL | 2021-09-28 |
191 | Integrated Training for Sequence-to-Sequence Models Using Non-Autoregressive Transformer Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a cascaded model based on the non-autoregressive Transformer that enables end-to-end training without the need for an explicit intermediate representation. |
Evgeniia Tokarchuk et al. | arxiv-cs.CL | 2021-09-27 |
192 | Towards Reinforcement Learning for Pivot-based Neural Machine Translation with Non-autoregressive Transformer Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We utilize a non-autoregressive transformer and present an end-to-end pivot-based integrated model, enabling training on source-target data. |
Evgeniia Tokarchuk et al. | arxiv-cs.CL | 2021-09-27 |
193 | The Volctrans GLAT System: Non-autoregressive Translation Meets WMT21 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the Volctrans’ submission to the WMT21 news translation shared task for German->English translation. |
Lihua Qian et al. | arxiv-cs.CL | 2021-09-23 |
194 | One Source, Two Targets: Challenges and Rewards of Dual Decoding Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we consider a stronger requirement: to jointly generate two texts so that each output side effectively depends on the other. |
Jitao Xu; François Yvon; | arxiv-cs.CL | 2021-09-21 |
195 | The NiuTrans Machine Translation Systems for WMT21 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. |
Shuhan Zhou et al. | arxiv-cs.CL | 2021-09-21 |
196 | CUNI Systems for WMT21: Terminology Translation Shared Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes Charles University submission for Terminology translation Shared Task at WMT21. |
Josef Jon; Michal Novák; João Paulo Aires; Dušan Variš; Ondřej Bojar; | arxiv-cs.CL | 2021-09-20 |
197 | The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents the JHU-Microsoft joint submission for WMT 2021 quality estimation shared task. |
Shuoyang Ding; Marcin Junczys-Dowmunt; Matt Post; Christian Federmann; Philipp Koehn; | arxiv-cs.CL | 2021-09-17 |
198 | Back-translation for Large-Scale Multilingual Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work aims to build a single multilingual translation system with a hypothesis that a universal cross-language representation leads to better multilingual translation performance. |
Baohao Liao; Shahram Khadivi; Sanjika Hewavitharana; | arxiv-cs.CL | 2021-09-17 |
199 | Does Summary Evaluation Survive Translation to Other Languages? Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To investigate how much we can trust machine translation of such a dataset, we translate the English dataset SummEval to seven languages and compare performance across automatic evaluation measures. |
Spencer Braun; Oleg Vasilyev; Neslihan Iskender; John Bohannon; | arxiv-cs.CL | 2021-09-16 |
200 | Beyond Glass-Box Features: Uncertainty Quantification Enhanced Quality Estimation for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we extend the definition of glass-box QE generally to uncertainty quantification with both black-box and glass-box approaches and design several features deduced from them to blaze a new trail in improving QE’s performance. |
Ke Wang et al. | arxiv-cs.CL | 2021-09-15 |
201 | Miðeind’s WMT 2021 Submission Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present Miðeind’s submission for the English→Icelandic and Icelandic→English subsets of the 2021 WMT news translation task. |
Haukur Barri Símonarson; Vésteinn Snæbjarnarson; Pétur Orri Ragnarsson; Haukur Páll Jónsson; Vilhjálmur Þorsteinsson; | arxiv-cs.CL | 2021-09-15 |
202 | Netmarble AI Center’s WMT21 Automatic Post-Editing Shared Task Submission Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes Netmarble’s submission to WMT21 Automatic Post-Editing (APE) Shared Task for the English-German language pair. |
Shinhyeok Oh; Sion Jang; Hu Xu; Shounan An; Insoo Oh; | arxiv-cs.CL | 2021-09-14 |
203 | Contrastive Learning for Context-aware Neural Machine Translation Using Coreference Information Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose CorefCL, a novel data augmentation and contrastive learning scheme based on coreference between the source and contextual sentences. |
Yongkeun Hwang; Hyungu Yun; Kyomin Jung; | arxiv-cs.CL | 2021-09-13 |
204 | Neural Machine Translation Quality and Post-Editing Performance Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Across all models, we found that better MT systems indeed lead to fewer changes in the sentences in this industry setting. |
Vilém Zouhar; Aleš Tamchyna; Martin Popel; Ondřej Bojar; | arxiv-cs.CL | 2021-09-10 |
205 | Rethinking Zero-shot Neural Machine Translation: From A Perspective of Latent Variables Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce a denoising autoencoder objective based on pivot language into traditional training objective to improve the translation accuracy on zero-shot directions. |
Weizhi Wang et al. | arxiv-cs.CL | 2021-09-10 |
206 | Collecting A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gender-role assignments (e.g., female nurses versus male dancers) in corpora from three domains, resulting in a first large-scale gender bias dataset of 108K diverse real-world English sentences. |
Shahar Levy; Koren Lazar; Gabriel Stanovsky; | arxiv-cs.CL | 2021-09-08 |
207 | Infusing Future Information Into Monotonic Attention Through Language Models Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Simultaneous neural machine translation (SNMT) models start emitting the target sequence before they have processed the source sequence. |
Mohd Abbas Zaidi; Sathish Indurthi; Beomseok Lee; Nikhil Kumar Lakumarapu; Sangha Kim; | arxiv-cs.CL | 2021-09-07 |
208 | IndicBART: A Pre-trained Model for Natural Language Generation of Indic Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we present IndicBART, a multilingual, sequence-to-sequence pre-trained model focusing on 11 Indic languages and English. |
Raj Dabre et al. | arxiv-cs.CL | 2021-09-07 |
209 | Transformer Models for Text Coherence Assessment Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Accordingly, in this paper, we propose four different Transformer-based architectures for the task: vanilla Transformer, hierarchical Transformer, multi-task learning-based model, and a model with fact-based input representation. |
Tushar Abhishek; Daksh Rawat; Manish Gupta; Vasudeva Varma; | arxiv-cs.CL | 2021-09-05 |
210 | Masked Adversarial Generation for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose the Masked Adversarial Generation (MAG) model, that learns to perturb the translation model throughout the training process. |
Badr Youbi Idrissi; Stéphane Clinchant; | arxiv-cs.CL | 2021-09-01 |
211 | An Unsupervised Method for Building Sentence Simplification Corpora in Multiple Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an unsupervised method to build SS corpora from large-scale bilingual translation corpora, alleviating the need for SS supervised corpora. By taking the pair of the source sentences of translation corpus and the translations of their references in a bridge language, we can construct large-scale pseudo parallel SS data. |
Xinyu Lu; Jipeng Qiang; Yun Li; Yunhao Yuan; Yi Zhu; | arxiv-cs.CL | 2021-08-31 |
212 | mMARCO: A Multilingual Version of The MS MARCO Passage Ranking Dataset Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present mMARCO, a multilingual version of the MS MARCO passage ranking dataset comprising 13 languages that was created using machine translation. |
Luiz Henrique Bonifacio et al. | arxiv-cs.CL | 2021-08-31 |
213 | Secoco: Self-Correcting Encoding for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper presents Self-correcting Encoding (Secoco), a framework that effectively deals with input noise for robust neural machine translation by introducing self-correcting predictors. |
Tao Wang et al. | arxiv-cs.CL | 2021-08-27 |
214 | Examining Covert Gender Bias: A Case Study in Turkish and English Machine Translation Models Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Specifically, we introduce a method to investigate asymmetrical gender markings. |
Chloe Ciora; Nur Iren; Malihe Alikhani; | arxiv-cs.CL | 2021-08-23 |
215 | Recurrent Multiple Shared Layers in Depth for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose to train a deeper model with recurrent mechanism, which loops the encoder and decoder blocks of Transformer in the depth direction. |
GuoLiang Li; Yiyang Li; | arxiv-cs.CL | 2021-08-23 |
216 | CushLEPOR: Customising HLEPOR Metric Using Optuna for Higher Agreement with Human Judgments or Pre-trained Language Model LaBSE Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, we propose to customise traditional metrics by taking advantages of the pre-trained language models (PLMs) and the limited available human labelled scores. |
Lifeng Han; Irina Sorokina; Gleb Erofeev; Serge Gladkoff; | arxiv-cs.CL | 2021-08-21 |
217 | Attentive Fine-tuning of Transformers for Translation of Low-resourced Languages @LoResMT 2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper reports the Machine Translation (MT) systems submitted by the IIITT team for the English->Marathi and English->Irish language pairs of the LoResMT 2021 shared task. |
KARTHIK PURANIK et. al. | arxiv-cs.CL | 2021-08-19 |
218 | Active Learning for Massively Parallel Translation of Constrained Text Into Low Resource Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose an algorithm for human and machine to work together seamlessly to translate a closed text into a severely low resource language. |
Zhong Zhou; Alex Waibel; | arxiv-cs.CL | 2021-08-16 |
219 | Findings of The LoResMT 2021 Shared Task on COVID and Sign Language for Low-resource Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present the findings of the LoResMT 2021 shared task which focuses on machine translation (MT) of COVID-19 data for both low-resource spoken and sign languages. |
ATUL KR. OJHA et. al. | arxiv-cs.CL | 2021-08-14 |
220 | Improving Stylized Neural Machine Translation with Iterative Dual Knowledge Transfer Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this problem, we propose an iterative dual knowledge transfer framework that utilizes informal training data of machine translation and formality style transfer data to create large-scale stylized paired data, for the training of stylized machine translation model. |
XUANXUAN WU et. al. | ijcai | 2021-08-13 |
221 | Sampling-Based Minimum Bayes Risk Decoding for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We analyse this approximation and establish that it has no equivalent to the beam search curse, i.e. better search always leads to better translations. |
Bryan Eikema; Wilker Aziz; | arxiv-cs.CL | 2021-08-10 |
222 | Improving Similar Language Translation With Transfer Learning Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work is part of our contribution to the WMT 2021 Similar Languages Translation Shared Task where we submitted models for different language pairs, including French-Bambara, Spanish-Catalan, and Spanish-Portuguese in both directions. |
Ife Adebara; Muhammad Abdul-Mageed; | arxiv-cs.AI | 2021-08-07 |
223 | Facebook AI WMT21 News Translation Task Submission Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe Facebook’s multilingual model submission to the WMT2021 shared task on news translation. |
CHAU TRAN et. al. | arxiv-cs.CL | 2021-08-06 |
224 | WeChat Neural Machine Translation Systems for WMT21 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper introduces WeChat AI’s participation in WMT 2021 shared news translation task on English->Chinese, English->Japanese, Japanese->English and English->German. |
XIANFENG ZENG et. al. | arxiv-cs.CL | 2021-08-05 |
225 | ChrEnTranslate: Cherokee-English Machine Translation Demo with Quality Estimation and Corrective Feedback Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce ChrEnTranslate, an online machine translation demonstration system for translation between English and an endangered language Cherokee. |
Shiyue Zhang; Benjamin Frey; Mohit Bansal; | arxiv-cs.CL | 2021-07-30 |
226 | The Cross-Lingual Arabic Information REtrieval (CLAIRE) System Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we build our end-to-end Cross-Lingual Arabic Information REtrieval (CLAIRE) system based on the cross-lingual word embedding where searchers are assumed to have a passable passive understanding of Arabic and various supporting information in English is provided to aid retrieval experience. |
Zhizhong Chen; Carsten Eickhoff; | arxiv-cs.IR | 2021-07-29 |
227 | Difficulty-Aware Machine Translation Evaluation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel difficulty-aware MT evaluation metric, expanding the evaluation dimension by taking translation difficulty into consideration. |
Runzhe Zhan; Xuebo Liu; Derek F. Wong; Lidia S. Chao; | arxiv-cs.CL | 2021-07-29 |
228 | Using Perturbed Length-aware Positional Encoding for Non-autoregressive Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose sequence-level knowledge distillation (SKD) using perturbed length-aware positional encoding and apply it to a student model, the Levenshtein Transformer. |
Yui Oka; Katsuhito Sudoh; Satoshi Nakamura; | arxiv-cs.CL | 2021-07-28 |
229 | Cross-lingual Transferring of Pre-trained Contextualized Language Models Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, building upon the recent works connecting cross-lingual model transferring and neural machine translation, we thus propose a novel cross-lingual model transferring framework for PrLMs: TreLM. |
ZUCHAO LI et. al. | arxiv-cs.CL | 2021-07-27 |
230 | Cross-language Sentence Selection Via Data Augmentation and Rationale Training Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an approach to cross-language sentence selection in a low-resource setting. |
YANDA CHEN et. al. | acl | 2021-07-26 |
231 | Modeling Bilingual Conversational Characteristics for Neural Chat Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we aim to promote the translation quality of conversational text by modeling the above properties. |
Yunlong Liang; Fandong Meng; Yufeng Chen; Jinan Xu; Jie Zhou; | acl | 2021-07-26 |
232 | Revisiting Negation in Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we evaluate the translation of negation both automatically and manually, in English–German (EN–DE) and English–Chinese (EN–ZH). |
Gongbo Tang; Philipp Rönchen; Rico Sennrich; Joakim Nivre; | arxiv-cs.CL | 2021-07-26 |
233 | From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we adapt a state-of-the-art neural machine translation model to generate Hindi-English code-switched sentences starting from monolingual Hindi sentences. |
Ishan Tarunesh; Syamantak Kumar; Preethi Jyothi; | acl | 2021-07-26 |
234 | Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel bilingual mutual information (BMI) based adaptive objective, which measures the learning difficulty for each target token from the perspective of bilingualism, and assigns an adaptive weight accordingly to improve token-level adaptive training. |
YANGYIFAN XU et. al. | acl | 2021-07-26 |
235 | Do Context-Aware Translation Models Pay The Right Attention? Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To answer these questions, we introduce SCAT (Supporting Context for Ambiguous Translations), a new English-French dataset comprising supporting context words for 14K translations that professional translators found useful for pronoun disambiguation. |
KAYO YIN et. al. | acl | 2021-07-26 |
236 | Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this problem, we introduce another decoder, called seer decoder, into the encoder-decoder framework during training, which involves future information in target predictions. |
Yang Feng; Shuhao Gu; Dengji Guo; Zhengxin Yang; Chenze Shao; | acl | 2021-07-26 |
237 | Don’t Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we present a data collection strategy for MT which, in contrast, is cheap and simple, as it does not require bilingual speakers. |
Rajat Bhatnagar; Ananya Ganesh; Katharina Kann; | acl | 2021-07-26 |
238 | End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In particular, we focus on methods based on training the model with constraints provided as part of the input sequence. |
Josef Jon; João Paulo Aires; Dušan Variš; Ondřej Bojar; | acl | 2021-07-26 |
239 | Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to improve the sampling procedure by selecting the most informative monolingual sentences to complement the parallel data. |
WENXIANG JIAO et. al. | acl | 2021-07-26 |
240 | Beyond Noise: Mitigating The Impact of Fine-grained Semantic Divergences on Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on these findings, we introduce a divergent-aware NMT framework that uses factors to help NMT recover from the degradation caused by naturally occurring divergences, improving both translation quality and model calibration on EN-FR tasks. |
Eleftheria Briakou; Marine Carpuat; | acl | 2021-07-26 |
241 | XLPT-AMR: Cross-Lingual Pre-Training Via Multi-Task Learning for Zero-Shot AMR Parsing and Text Generation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Upon the availability of English AMR dataset and English-to-X parallel datasets, in this paper we propose a novel cross-lingual pre-training approach via multi-task learning (MTL) for both zero-shot AMR parsing and AMR-to-text generation. |
Dongqin Xu; Junhui Li; Muhua Zhu; Min Zhang; Guodong Zhou; | acl | 2021-07-26 |
242 | Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a sophisticated neural architecture to incorporate bilingual dictionaries into Neural Machine Translation (NMT) models. |
TONG ZHANG et. al. | acl | 2021-07-26 |
243 | On Compositional Generalization of Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we study NMT models from the perspective of compositional generalization by building a benchmark dataset, CoGnition, consisting of 216k clean and consistent sentence pairs. |
Yafu Li; Yongjing Yin; Yulong Chen; Yue Zhang; | acl | 2021-07-26 |
244 | Towards User-Driven Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To fill this gap, we introduce a novel framework called user-driven NMT. |
HUAN LIN et. al. | acl | 2021-07-26 |
245 | Selective Knowledge Distillation for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we design a novel protocol that can effectively analyze the different impacts of samples by comparing various samples’ partitions. |
Fusheng Wang; Jianhao Yan; Fandong Meng; Jie Zhou; | acl | 2021-07-26 |
246 | Improving Speech Translation By Understanding and Learning from The Auxiliary Text Translation Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we are interested in training a speech translation model along with an auxiliary text translation task. |
Yun Tang; Juan Pino; Xian Li; Changhan Wang; Dmitriy Genzel; | acl | 2021-07-26 |
247 | Beyond Sentence-Level End-to-End Speech Translation: Context Helps Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate several decoding approaches, and introduce in-model ensemble decoding which jointly performs document- and sentence-level translation using the same model. |
Biao Zhang; Ivan Titov; Barry Haddow; Rico Sennrich; | acl | 2021-07-26 |
248 | Contrastive Learning for Many-to-many Multilingual Neural Machine Translation IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we aim to build a many-to-many translation system with an emphasis on the quality of non-English language directions. |
Xiao Pan; Mingxuan Wang; Liwei Wu; Lei Li; | acl | 2021-07-26 |
249 | Mid-Air Hand Gestures for Post-Editing of Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Here, we present the first study that investigates the usefulness of mid-air hand gestures in combination with the keyboard (GK) for text editing in PE of MT. Guided by a gesture elicitation study with 14 freelance translators, we develop a prototype supporting mid-air hand gestures for cursor placement, text selection, deletion, and reordering. |
Rashad Albo Jamara; Nico Herbig; Antonio Krüger; Josef van Genabith; | acl | 2021-07-26 |
250 | Prevent The Language Model from Being Overconfident in Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on the property, we propose a Margin-based Token-level Objective (MTO) and a Margin-based Sentence-level Objective (MSO) to maximize the Margin for preventing the LM from being overconfident. |
Mengqi Miao; Fandong Meng; Yijin Liu; Xiao-Hua Zhou; Jie Zhou; | acl | 2021-07-26 |
251 | CCMatrix: Mining Billions of High-Quality Parallel Sentences on The Web IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We show that margin-based bitext mining in a multilingual sentence space can be successfully scaled to operate on monolingual corpora of billions of sentences. |
HOLGER SCHWENK et. al. | acl | 2021-07-26 |
252 | Good for Misconceived Reasons: An Empirical Revisiting on The Need for Visual Context in Multimodal Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Upon further investigation, we discover that the improvements achieved by the multimodal models over text-only counterparts are in fact results of the regularization effect. |
Zhiyong Wu; Lingpeng Kong; Wei Bi; Xiang Li; Ben Kao; | acl | 2021-07-26 |
253 | Diverse Pretrained Context Encodings Improve Document Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a new architecture for adapting a sentence-level sequence-to-sequence transformer by incorporating multiple pre-trained document context signals and assess the impact on translation performance of (1) different pretraining approaches for generating these signals, (2) the quantity of parallel data for which document context is available, and (3) conditioning on source, target, or source and target contexts. |
Domenic Donato; Lei Yu; Chris Dyer; | acl | 2021-07-26 |
254 | Extending Challenge Sets to Uncover Gender Bias in Machine Translation: Impact of Stereotypical Verbs and Adjectives Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper we present an extension of this challenge set, called WiBeMT, which adds gender-biased adjectives and sentences with gender-biased verbs. |
Jonas-Dario Troles; Ute Schmid; | arxiv-cs.CL | 2021-07-24 |
255 | The USYD-JD Speech Translation System for IWSLT 2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the University of Sydney & JD’s joint submission to the IWSLT 2021 low-resource speech translation task. |
Liang Ding; Di Wu; Dacheng Tao; | arxiv-cs.CL | 2021-07-24 |
256 | Confidence-Aware Scheduled Sampling for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we propose confidence-aware scheduled sampling. |
Yijin Liu; Fandong Meng; Yufeng Chen; Jinan Xu; Jie Zhou; | arxiv-cs.CL | 2021-07-21 |
257 | Integrating Unsupervised Data Generation Into Self-Supervised Neural Machine Translation for Low-Resource Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, unsupervised machine translation (UMT) exploits large amounts of monolingual data by using synthetic data generation techniques such as back-translation and noising, while self-supervised NMT (SSNMT) identifies parallel sentences in smaller comparable data and trains on them. |
Dana Ruiter; Dietrich Klakow; Josef van Genabith; Cristina España-Bonet; | arxiv-cs.CL | 2021-07-19 |
258 | Darmok and Jalad at Tanagra: A Dataset and Model for English-to-Tamarian Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work assembles a Tamarian-English dictionary of utterances from the original episode and several follow-on novels, and uses this to construct a parallel corpus of 456 English-Tamarian utterances. |
Peter Jansen; | arxiv-cs.CL | 2021-07-16 |
259 | FST: The FAIR Speech Translation System for The IWSLT21 Multilingual Shared Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we describe our end-to-end multilingual speech translation system submitted to the IWSLT 2021 evaluation campaign on the Multilingual Speech Translation shared task. |
YUN TANG et. al. | arxiv-cs.CL | 2021-07-14 |
260 | The IWSLT 2021 BUT Speech Translation Systems Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we study their efficiency from the perspective of having a large amount of separate ASR training data and MT training data, and a smaller amount of speech-translation training data. |
Hari Krishna Vydana; Martin Karafiát; Lukáš Burget; Honza Černocký; | arxiv-cs.CL | 2021-07-13 |
261 | DSGPT: Domain-Specific Generative Pre-Training of Transformers for Text Generation in E-commerce Title and Review Summarization Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a novel domain-specific generative pre-training (DSGPT) method for text generation and apply it to the product title and review summarization problems on E-commerce mobile display. |
XUEYING ZHANG et. al. | sigir | 2021-07-13 |
262 | Zero-shot Speech Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: These models tend to output the wrong language when performing zero-shot ST. We tackle the issues by including additional training data and an auxiliary loss function that minimizes the text-audio difference. |
Tu Anh Dinh; | arxiv-cs.CL | 2021-07-13 |
263 | Putting Words Into The System’s Mouth: A Targeted Attack on Neural Machine Translation Using Monolingual Data Poisoning Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present two methods for crafting poisoned examples, and show that only a tiny handful of instances, amounting to only 0.02% of the training set, is sufficient to enact a successful attack. |
JUN WANG et. al. | arxiv-cs.CL | 2021-07-12 |
264 | Using Machine Translation to Localize Task Oriented NLG Output Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper explores doing this by applying machine translation to the English output. |
SCOTT ROY et. al. | arxiv-cs.CL | 2021-07-09 |
265 | Cross-model Back-translated Distillation for Unsupervised Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a novel component to the standard UMT framework called Cross-model Back-translated Distillation (CBD), that is aimed to induce another level of data diversification that existing principles lack. |
Xuan-Phi Nguyen; Shafiq Joty; Thanh-Tung Nguyen; Kui Wu; Ai Ti Aw; | icml | 2021-07-08 |
266 | Self-supervised and Supervised Joint Training for Resource-rich Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a joint training approach, F2-XEnDec, to combine self-supervised and supervised learning to optimize NMT models. |
Yong Cheng; Wei Wang; Lu Jiang; Wolfgang Macherey; | icml | 2021-07-08 |
267 | A Topic Guided Pointer-Generator Model for Generating Natural Language Code Summaries Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present a neural network model named ToPNN for code summarization, which uses the topics in a broader context (e.g., class) to guide the neural networks that combine the generation of new words and the copy of existing words in code. |
XIN WANG et. al. | arxiv-cs.SE | 2021-07-04 |
268 | IITP at WAT 2021: System Description for English-Hindi Multimodal Translation Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We participate in the 8th Workshop on Asian Translation (WAT – 2021) for English-Hindi multimodal translation task and achieve 42.47 and 37.50 BLEU points for Evaluation and Challenge subset, respectively. |
Baban Gain; Dibyanayan Bandyopadhyay; Asif Ekbal; | arxiv-cs.CL | 2021-07-04 |
269 | Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we study the benefit that such compositionality brings about to several machine translation tasks. |
Rahma Chaabouni; Roberto Dessì; Eugene Kharitonov; | arxiv-cs.CL | 2021-07-03 |
270 | Zero-pronoun Data Augmentation for Japanese-to-English Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we propose a data augmentation method that provides additional training signals for the translation model to learn correlations between local context and zero pronouns. |
Ryokan Ri; Toshiaki Nakazawa; Yoshimasa Tsuruoka; | arxiv-cs.CL | 2021-07-01 |
271 | Modeling Target-side Inflection in Placeholder Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this problem, we propose a novel method of placeholder translation that can inflect specified terms according to the grammatical construction of the output sentence. |
Ryokan Ri; Toshiaki Nakazawa; Yoshimasa Tsuruoka; | arxiv-cs.CL | 2021-07-01 |
272 | The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes USTC-NELSLIP’s submissions to the IWSLT2021 Simultaneous Speech Translation task. |
Dan Liu; Mengge Du; Xiaoxi Li; Yuchen Hu; Lirong Dai; | arxiv-cs.CL | 2021-07-01 |
273 | IMS’ Systems for The IWSLT 2021 Low-Resource Speech Translation Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the submission to the IWSLT 2021 Low-Resource Speech Translation Shared Task by the IMS team. |
Pavel Denisov; Manuel Mager; Ngoc Thang Vu; | arxiv-cs.CL | 2021-06-30 |
274 | Rethinking The Evaluation of Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a novel evaluation protocol, which not only avoids the effect of search errors but provides a system-level evaluation in the perspective of model ranking. |
Jianhao Yan; Chenming Wu; Fandong Meng; Jie Zhou; | arxiv-cs.CL | 2021-06-29 |
275 | DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation By Augmenting Pretrained Multilingual Encoders IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To reduce this gap, we introduce DeltaLM, a pretrained multilingual encoder-decoder model that regards the decoder as the task layer of off-the-shelf pretrained encoders. |
SHUMING MA et. al. | arxiv-cs.CL | 2021-06-25 |
276 | On The Influence of Machine Translation on Language Origin Obfuscation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we analyze the ability to detect the source language from the translated output of two widely used commercial machine translation systems by utilizing machine-learning algorithms with basic textual features like n-grams. |
Benjamin Murauer; Michael Tschuggnall; Günther Specht; | arxiv-cs.CL | 2021-06-24 |
277 | End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In particular, we focus on methods based on training the model with constraints provided as part of the input sequence. |
Josef Jon; João Paulo Aires; Dušan Variš; Ondřej Bojar; | arxiv-cs.CL | 2021-06-23 |
278 | Dealing with Training and Test Segmentation Mismatch: FBK@IWSLT2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes FBK’s system submission to the IWSLT 2021 Offline Speech Translation task. |
Sara Papi; Marco Gaido; Matteo Negri; Marco Turchi; | arxiv-cs.CL | 2021-06-23 |
279 | Phrase-level Active Learning for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we address this problem in an active learning setting where we can spend a given budget on translating in-domain data, and gradually fine-tune a pre-trained out-of-domain NMT model on the newly translated data. |
Junjie Hu; Graham Neubig; | arxiv-cs.CL | 2021-06-21 |
280 | Recurrent Stacking of Layers in Neural Networks: An Application to Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose to share parameters across all layers thereby leading to a recurrently stacked neural network model. |
Raj Dabre; Atsushi Fujita; | arxiv-cs.CL | 2021-06-18 |
281 | Lost in Interpreting: Speech Translation from Source or Interpreter? Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We investigate if such an automatic system should rather follow the original speaker, or an interpreter to achieve better translation quality at the cost of increased delay. |
Dominik Macháček; Matúš Žilinec; Ondřej Bojar; | arxiv-cs.CL | 2021-06-17 |
282 | Central Kurdish Machine Translation: First Large Scale Parallel Corpus and Experiments Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present the first large scale parallel corpus of Central Kurdish-English, Awta, containing 229,222 pairs of manually aligned translations. |
Zhila Amini; Mohammad Mohammadamini; Hawre Hosseini; Mehran Mansouri; Daban Jaff; | arxiv-cs.AI | 2021-06-17 |
283 | Alternated Training with Synthetic and Authentic Data for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose alternated training with synthetic and authentic data for NMT. Compared with previous work, we introduce authentic data as guidance to prevent the training of NMT models from being disturbed by noisy synthetic data. |
Rui Jiao; Zonghan Yang; Maosong Sun; Yang Liu; | arxiv-cs.CL | 2021-06-16 |
284 | Evaluating Gender Bias in Hindi-English Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We implement a modified version of the existing TGBI metric based on the grammatical considerations for Hindi. |
Gauri Gupta; Krithika Ramesh; Sanjay Singh; | arxiv-cs.CL | 2021-06-16 |
285 | English to Bangla Machine Translation Using Recurrent Neural Network Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes an architecture of English to Bangla machine translation system. |
Shaykh Siddique; Tahmid Ahmed; Md. Rifayet Azam Talukder; Md. Mohsin Uddin; | arxiv-cs.CL | 2021-06-14 |
286 | Machine Translation Into Low-resource Language Varieties Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a general framework to rapidly adapt MT systems to generate language varieties that are close to, but different from, the standard target language, using no parallel (source–variety) data. |
Sachin Kumar; Antonios Anastasopoulos; Shuly Wintner; Yulia Tsvetkov; | arxiv-cs.CL | 2021-06-12 |
287 | UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To generalize this success to non-English languages, we introduce UC^2, the first machine translation-augmented framework for cross-lingual cross-modal representation learning. |
MINGYANG ZHOU et. al. | cvpr | 2021-06-11 |
288 | Exploring Unsupervised Pretraining Objectives for Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we systematically compare masking with alternative objectives that produce inputs resembling real (full) sentences, by reordering and replacing words based on their context. |
Christos Baziotis; Ivan Titov; Alexandra Birch; Barry Haddow; | arxiv-cs.CL | 2021-06-10 |
289 | Crosslingual Embeddings Are Essential in UNMT for Distant Languages: An English to IndoAryan Case Study Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we show that initializing the embedding layer of UNMT models with cross-lingual embeddings shows significant improvements in BLEU score over existing approaches with embeddings randomly initialized. |
Tamali Banerjee; Rudra Murthy V; Pushpak Bhattacharyya; | arxiv-cs.CL | 2021-06-09 |
290 | Multilingual Neural Semantic Parsing for Low-Resourced Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle the data limitation problem, we propose using machine translation to bootstrap multilingual training data from the more abundant English data. To evaluate our multilingual models on human-written sentences as opposed to machine translated ones, we introduce a new multilingual semantic parsing dataset in English, Italian and Japanese based on the Facebook Task Oriented Parsing (TOP) dataset. |
Menglin Xia; Emilio Monti; | arxiv-cs.CL | 2021-06-07 |
291 | The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce the FLORES-101 evaluation benchmark, consisting of 3001 sentences extracted from English Wikipedia and covering a variety of different topics and domains. |
NAMAN GOYAL et. al. | arxiv-cs.CL | 2021-06-06 |
292 | Cross-language Sentence Selection Via Data Augmentation and Rationale Training Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes an approach to cross-language sentence selection in a low-resource setting. |
YANDA CHEN et. al. | arxiv-cs.CL | 2021-06-04 |
293 | Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we propose to improve the sampling procedure by selecting the most informative monolingual sentences to complement the parallel data. |
WENXIANG JIAO et. al. | arxiv-cs.CL | 2021-06-02 |
294 | ViTA: Visual-Linguistic Translation By Aligning Object Tags Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose our system under the team name Volta for the Multimodal Translation Task of WAT 2021 from English to Hindi. |
Kshitij Gupta; Devansh Gautam; Radhika Mamidi; | arxiv-cs.CL | 2021-06-01 |
295 | Part of Speech and Universal Dependency Effects on English Arabic Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this research paper, I will elaborate on a method to evaluate machine translation models based on their performance on underlying syntactical phenomena between English and Arabic languages. |
Ofek Rafaeli; Omri Abend; Leshem Choshen; Dmitry Nikolaev; | arxiv-cs.CL | 2021-06-01 |
296 | Rejuvenating Low-Frequency Words: Making The Most of Parallel Data in Non-Autoregressive Translation IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Accordingly, we propose reverse KD to rejuvenate more alignments for low-frequency target words. |
LIANG DING et. al. | arxiv-cs.CL | 2021-06-01 |
297 | Multilingual Speech Translation with Unified Transformer: Huawei Noah’s Ark Lab at IWSLT 2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the system submitted to the IWSLT 2021 Multilingual Speech Translation (MultiST) task from Huawei Noah’s Ark Lab. |
Xingshan Zeng; Liangyou Li; Qun Liu; | arxiv-cs.CL | 2021-05-31 |
298 | Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, in this paper, we present a thorough analysis of 75 BNLP research papers and categorize them into 11 categories, namely Information Extraction, Machine Translation, Named Entity Recognition, Parsing, Parts of Speech Tagging, Question Answering System, Sentiment Analysis, Spam and Fake Detection, Text Summarization, Word Sense Disambiguation, and Speech Processing and Recognition. |
OVISHAKE SEN et. al. | arxiv-cs.CL | 2021-05-31 |
299 | Korean-English Machine Translation with Multiple Tokenization Strategy Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, alphabet tokenization, morpheme tokenization, and BPE tokenization were applied to Korean as the source language and English as the target language respectively, and the comparison experiment was conducted by repeating 50,000 epochs of each 9 models using the Transformer neural network. |
Dojun Park; Youngjin Jang; Harksoo Kim; | arxiv-cs.CL | 2021-05-29 |
300 | Investigating Code-Mixed Modern Standard Arabic-Egyptian to English Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We develop models under different conditions, employing both (i) standard end-to-end sequence-to-sequence (S2S) Transformers trained from scratch and (ii) pre-trained S2S language models (LMs). |
El Moatez Billah Nagoudi; AbdelRahim Elmadany; Muhammad Abdul-Mageed; | arxiv-cs.LG | 2021-05-27 |
301 | How Does Distilled Data Complexity Impact The Quality and Confidence of Non-Autoregressive Machine Translation? Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this issue, we seek to understand why distillation is so effective. |
Weijia Xu; Shuming Ma; Dongdong Zhang; Marine Carpuat; | arxiv-cs.CL | 2021-05-26 |
302 | UniDrop: A Simple Yet Effective Technique to Improve Transformer Without Extra Cost Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Therefore, in this paper, we integrate different dropout techniques into the training of Transformer models. |
ZHEN WU et. al. | naacl | 2021-05-23 |
303 | XOR QA: Cross-lingual Open-Retrieval Question Answering IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Based on this dataset, we introduce a task framework, called Cross-lingual Open-Retrieval Question Answering (XOR QA), that consists of three new tasks involving cross-lingual document retrieval from multilingual and English resources. |
AKARI ASAI et. al. | naacl | 2021-05-23 |
304 | Backtranslation Feedback Improves User Confidence in MT, Not Quality Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Translating text into a language unknown to the text’s author, dubbed outbound translation, is a modern need for which the user experience has significant room for improvement, beyond the basic machine translation facility. We demonstrate this by showing three ways in which user confidence in the outbound translation, as well as its overall final quality, can be affected: backward translation, quality estimation (with alignment) and source paraphrasing. |
VILÉM ZOUHAR et. al. | naacl | 2021-05-23 |
305 | Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In particular, we propose to conduct mask-and-predict pre-training on text-only and image-only corpora and introduce the object tags detected by an object recognition model as anchor points to bridge two modalities. |
LIUNIAN HAROLD LI et. al. | naacl | 2021-05-23 |
306 | Cross-lingual Cross-modal Pretraining for Multimodal Retrieval Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper proposes a new approach to learn cross-lingual cross-modal representations for matching images and their relevant captions in multiple languages. |
Hongliang Fei; Tan Yu; Ping Li; | naacl | 2021-05-23 |
307 | SGL: Speaking The Graph Languages of Semantic Parsing Via Multilingual Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, instead, we reframe semantic parsing towards multiple formalisms as Multilingual Neural Machine Translation (MNMT), and propose SGL, a many-to-many seq2seq architecture trained with an MNMT objective. |
Luigi Procopio; Rocco Tripodi; Roberto Navigli; | naacl | 2021-05-23 |
308 | Generative Imagination Elevates Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose ImagiT, a novel machine translation method via visual imagination. |
Quanyu Long; Mingxuan Wang; Lei Li; | naacl | 2021-05-23 |
309 | Non-Autoregressive Translation By Learning Target Categorical Codes Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose CNAT, which learns implicitly categorical codes as latent variables into the non-autoregressive decoding. |
YU BAO et. al. | naacl | 2021-05-23 |
310 | Neural Machine Translation Without Embeddings Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Surprisingly, replacing the ubiquitous embedding layer with one-hot representations of each byte does not hurt performance; experiments on byte-to-byte machine translation from English to 10 different languages show a consistent improvement in BLEU, rivaling character-level and even standard subword-level models. |
Uri Shaham; Omer Levy; | naacl | 2021-05-23 |
311 | Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we focus on sequence-level knowledge distillation (SeqKD) from external text-based NMT models. |
Hirofumi Inaguma; Tatsuya Kawahara; Shinji Watanabe; | naacl | 2021-05-23 |
312 | mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer IF:6 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we introduce mT5, a multilingual variant of T5 that was pre-trained on a new Common Crawl-based dataset covering 101 languages. |
LINTING XUE et. al. | naacl | 2021-05-23 |
313 | Training Data Augmentation for Code-Mixed Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present an m-BERT based procedure whose core learnable component is a ternary sequence labeling model, that can be trained with a limited code-mixed corpus alone. |
Abhirut Gupta; Aditya Vavre; Sunita Sarawagi; | naacl | 2021-05-23 |
314 | Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we present an end-to-end framework that exploits compositionality to learn searchable hidden representations at intermediate stages of a sequence model using decomposed sub-tasks. |
Siddharth Dalmia; Brian Yan; Vikas Raunak; Florian Metze; Shinji Watanabe; | naacl | 2021-05-23 |
315 | Counterfactual Data Augmentation for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a data augmentation method for neural machine translation. |
Qi Liu; Matt Kusner; Phil Blunsom; | naacl | 2021-05-23 |
316 | Harnessing Multilinguality in Unsupervised Machine Translation for Rare Languages IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we show that multilinguality is critical to making unsupervised systems practical for low-resource settings. |
Xavier Garcia; Aditya Siddhant; Orhan Firat; Ankur Parikh; | naacl | 2021-05-23 |
317 | From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle the challenge, we propose a joint learning approach, with English SLU training data and non-English auxiliary tasks from raw text, syntax and translation for transfer. |
ROB VAN DER GOOT et. al. | naacl | 2021-05-23 |
318 | Investigating Math Word Problems Using Pretrained Multilingual Language Models Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we revisit math word problems (MWPs) from the cross-lingual and multilingual perspective. |
Minghuan Tan; Lei Wang; Lingxiao Jiang; Jing Jiang; | arxiv-cs.CL | 2021-05-19 |
319 | Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We describe models focused on the understudied problem of translating between monolingual and code-mixed language pairs. |
Ganesh Jawahar; El Moatez Billah Nagoudi; Muhammad Abdul-Mageed; Laks V. S. Lakshmanan; | arxiv-cs.CL | 2021-05-18 |
320 | Jointly Trained Transformers Models for Spoken Language Translation IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, degradation in performance is reduced by creating an End-to-End differentiable pipeline between the ASR and MT systems. |
H. K. Vydana; M. Karafiát; K. Zmolikova; L. Burget; H. Cernocký; | icassp | 2021-05-16 |
321 | Modeling Homophone Noise for Robust Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose a robust neural machine translation (NMT) framework to deal with homophone errors. |
W. QIN et. al. | icassp | 2021-05-16 |
322 | A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this study, we propose a general multi-task learning framework to leverage text data for ASR and ST tasks. |
Y. Tang; J. Pino; C. Wang; X. Ma; D. Genzel; | icassp | 2021-05-16 |
323 | Data Augmentation for Sign Language Gloss Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We focus here on gloss-to-text translation, which we treat as a low-resource neural machine translation (NMT) problem. |
Amit Moryossef; Kayo Yin; Graham Neubig; Yoav Goldberg; | arxiv-cs.CL | 2021-05-16 |
324 | Cascaded Models with Cyclic Feedback for Direct Speech Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a technique that allows cascades of automatic speech recognition (ASR) and machine translation (MT) to exploit in-domain direct speech translation data in addition to out-of-domain MT and ASR data. |
T. K. Lam; S. Schamoni; S. Riezler; | icassp | 2021-05-16 |
325 | Machine Translation Verbosity Control for Automatic Dubbing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on the problem of controlling the verbosity of machine translation output, so that subsequent steps of our automatic dubbing pipeline can generate dubs of better quality. |
S. M. Lakew; et al. | icassp | 2021-05-16 |
326 | Task Aware Multi-Task Learning for Speech to Text Tasks Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a task modulation network which allows the model to learn task specific features, while learning the shared features simultaneously. |
S. Indurthi; et al. | icassp | 2021-05-16 |
327 | An Empirical Study on Task-Oriented Dialogue Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we systematically investigate advanced models on the task-oriented dialogue translation task, including sentence-level, document-level and non-autoregressive NMT models. |
S. Liu; | icassp | 2021-05-16 |
328 | From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To tackle the challenge, we propose a joint learning approach, with English SLU training data and non-English auxiliary tasks from raw text, syntax and translation for transfer. |
ROB VAN DER GOOT et. al. | arxiv-cs.CL | 2021-05-15 |
329 | The Volctrans Neural Speech Translation System for IWSLT 2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the systems submitted to IWSLT 2021 by the Volctrans team. |
CHENGQI ZHAO et. al. | arxiv-cs.CL | 2021-05-15 |
330 | Do Context-Aware Translation Models Pay The Right Attention? Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we ask several questions: What contexts do human translators use to resolve ambiguous words? |
KAYO YIN et. al. | arxiv-cs.CL | 2021-05-14 |
331 | Dynamic Multi-Branch Layers for On-Device Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by conditional computation, we propose to improve the performance of on-device NMT systems with dynamic multi-branch layers. |
ZHIXING TAN et. al. | arxiv-cs.CL | 2021-05-14 |
332 | Automatic Classification of Human Translation and Machine Translation: A Study from The Perspective of Lexical Diversity Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: By using a trigram model and fine-tuning a pretrained BERT model for sequence classification, we show that machine translation and human translation can be classified with an accuracy above chance level, which suggests that machine translation and human translation are different in a systematic way. |
Yingxue Fu; Mark-Jan Nederhof; | arxiv-cs.CL | 2021-05-10 |
333 | Self-Guided Curriculum Learning for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Inspired by this, we propose a self-guided curriculum strategy to encourage the learning of neural machine translation (NMT) models to follow the above recovery criterion, where we cast the recovery degree of each training example as its learning difficulty. |
LEI ZHOU et. al. | arxiv-cs.CL | 2021-05-10 |
334 | End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the submission to the IWSLT 2021 offline speech translation task by the UPC Machine Translation group. |
Gerard I. Gállego; Ioannis Tsiamas; Carlos Escolano; José A. R. Fonollosa; Marta R. Costa-jussà; | arxiv-cs.CL | 2021-05-10 |
335 | Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we present a continual pre-training (CPT) framework on mBART to effectively adapt it to unseen languages. |
Zihan Liu; Genta Indra Winata; Pascale Fung; | arxiv-cs.CL | 2021-05-09 |
336 | Learning Shared Semantic Space for Speech-to-Text Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In observation of this obstacle, we propose to bridge this representation gap with Chimera. |
Chi Han; Mingxuan Wang; Heng Ji; Lei Li; | arxiv-cs.CL | 2021-05-07 |
337 | Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper investigates two key aspects of end-to-end simultaneous speech translation: (a) how to encode efficiently the continuous speech flow, and (b) how to segment the speech flow in order to alternate optimally between reading (R: encoding input) and writing (W: decoding output) operations. |
Ha Nguyen; Yannick Estève; Laurent Besacier; | arxiv-cs.CL | 2021-04-29 |
338 | Potential Idiomatic Expression (PIE)-English: Corpus for Classes of Idioms Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We present a fairly large, Potential Idiomatic Expression (PIE) dataset for Natural Language Processing (NLP) in English. |
TOSIN P. ADEWUMI et. al. | arxiv-cs.CL | 2021-04-25 |
339 | End-to-end Speech Translation Via Cross-modal Progressive Training Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose Cross Speech-Text Network (XSTNet), an end-to-end model for speech-to-text translation. |
Rong Ye; Mingxuan Wang; Lei Li; | arxiv-cs.CL | 2021-04-21 |
340 | Should We Stop Training More Monolingual Models, and Simply Use Machine Translation Instead? Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Most work in NLP makes the assumption that it is desirable to develop solutions in the native language in question. |
Tim Isbister; Fredrik Carlsson; Magnus Sahlgren; | arxiv-cs.CL | 2021-04-21 |
341 | Addressing The Vulnerability of NMT in Input Perturbations Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we improve the robustness of NMT models by reducing the effect of noisy words through a Context-Enhanced Reconstruction (CER) approach. |
Weiwen Xu; Ai Ti Aw; Yang Ding; Kui Wu; Shafiq Joty; | arxiv-cs.CL | 2021-04-20 |
342 | Grammatical Error Generation Based on Translated Fragments Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Our method aims at simulating mistakes made by second language learners, and produces a wider range of non-native style language in comparison to state-of-the-art synthetic data creation methods. |
Eetu Sjöblom; Mathias Creutz; Teemu Vahtola; | arxiv-cs.CL | 2021-04-20 |
343 | Stream-level Latency Evaluation for Simultaneous Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work proposes a stream-level adaptation of the current latency measures based on a re-segmentation approach applied to the output translation, that is successfully evaluated on streaming conditions for a reference IWSLT task. |
Javier Iranzo-Sánchez; Jorge Civera; Alfons Juan; | arxiv-cs.CL | 2021-04-18 |
344 | DCH-2: A Parallel Customer-Helpdesk Dialogue Corpus with Distributions of Annotators’ Labels Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We introduce a data set called DCH-2, which contains 4,390 real customer-helpdesk dialogues in Chinese and their English translations. |
Zhaohao Zeng; Tetsuya Sakai; | arxiv-cs.CL | 2021-04-18 |
345 | Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we focus on a zero-shot cross-lingual transfer task in NMT. |
GUANHUA CHEN et. al. | arxiv-cs.CL | 2021-04-18 |
346 | mT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we improve multilingual text-to-text transfer Transformer with translation pairs (mT6). |
ZEWEN CHI et. al. | arxiv-cs.CL | 2021-04-17 |
347 | From Fully Trained to Fully Random Embeddings: Improving Neural Machine Translation with Compact Word Embedding Tables Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: (In this paper words and subwords are referred to as tokens and the term embedding only refers to embeddings of inputs.) In this paper, we analyze the impact and utility of such matrices in the context of neural machine translation (NMT). |
Krtin Kumar; Peyman Passban; Mehdi Rezagholizadeh; Yiu Sing Lau; Qun Liu; | arxiv-cs.CL | 2021-04-17 |
348 | XLEnt: Mining A Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To address this, we propose Lexical-Semantic-Phonetic Align (LSP-Align), a technique to automatically mine cross-lingual entity lexica from mined web data. |
Ahmed El-Kishky; Adithya Renduchintala; James Cross; Francisco Guzmán; Philipp Koehn; | arxiv-cs.CL | 2021-04-17 |
349 | Sentence Alignment with Parallel Documents Facilitates Biomedical Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work presents an unsupervised algorithm for deriving parallel corpora from document-level translations by using sentence alignment and explores how training materials affect the performance of biomedical NMT systems. |
Shengxuan Luo; Huaiyuan Ying; Jiao Li; Sheng Yu; | arxiv-cs.CL | 2021-04-17 |
350 | Crossing The Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Although this technology represents one of the central objectives of AI and has been the focus of ever more intense research and development efforts, it is currently limited to a few narrow domains (e.g., food ordering, ticket booking) and a handful of languages (e.g., English, Chinese). This work provides an extensive overview of existing methods and resources in multilingual ToD as an entry point to this exciting and emerging field. |
EVGENIIA RAZUMOVSKAIA et. al. | arxiv-cs.CL | 2021-04-17 |
351 | Robust Open-Vocabulary Translation from Visual Text Representations Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Motivated by the robustness of human language processing, we propose the use of visual text representations, which dispense with a finite set of text embeddings in favor of continuous vocabularies created by processing visually rendered text with sliding windows. |
Elizabeth Salesky; David Etter; Matt Post; | arxiv-cs.CL | 2021-04-16 |
352 | Towards Variable-Length Textual Adversarial Attacks Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we propose variable-length textual adversarial attacks (VL-Attack) and integrate three atomic operations, namely insertion, deletion and replacement, into a unified framework, by introducing and manipulating a special blank token while attacking. |
JUNLIANG GUO et. al. | arxiv-cs.CL | 2021-04-16 |
353 | Hierarchical Learning for Generation with Long Source Sequences IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We design and study a new Hierarchical Attention Transformer-based architecture (HAT) that outperforms standard Transformers on several sequence to sequence tasks. |
Tobias Rohde; Xiaoxia Wu; Yinhan Liu; | arxiv-cs.CL | 2021-04-15 |
354 | Simultaneous Multi-Pivot Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To solve this issue, we propose multi-pivot translation and apply it to a simultaneous translation setting involving pivot languages. |
Raj Dabre; Aizhan Imankulova; Masahiro Kaneko; Abhisek Chakrabarty; | arxiv-cs.CL | 2021-04-15 |
355 | Improving Gender Translation Accuracy with Filtered Self-Training Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We propose a gender-filtered self-training technique to improve gender translation accuracy on unambiguously gendered inputs. |
Prafulla Kumar Choubey; Anna Currey; Prashant Mathur; Georgiana Dinu; | arxiv-cs.CL | 2021-04-15 |
356 | I Wish I Would Have Loved This One, But I Didn’t — A Multilingual Dataset for Counterfactual Detection in Product Reviews Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We consider the problem of counterfactual detection (CFD) in product reviews. For this purpose, we annotate a multilingual CFD dataset from Amazon product reviews covering counterfactual statements written in English, German, and Japanese languages. |
James O’Neill; Polina Rozenshtein; Ryuichi Kiryo; Motoko Kubota; Danushka Bollegala; | arxiv-cs.CL | 2021-04-14 |
357 | Backtranslation Feedback Improves User Confidence in MT, Not Quality Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we describe an experiment on outbound translation from English to Czech and Estonian. |
VILÉM ZOUHAR et. al. | arxiv-cs.CL | 2021-04-12 |
358 | Family of Origin and Family of Choice: Massively Parallel Lexiconized Iterative Pretraining for Severely Low Resource Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: To translate named entities correctly, we build a massive lexicon table for 2,939 Bible named entities in 124 source languages, and include many that occur once and cover more than 66 severely low resource languages. |
Zhong Zhou; Alex Waibel; | arxiv-cs.CL | 2021-04-12 |
359 | Sentiment-based Candidate Selection for NMT Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: Grounded in the observation that UGC features highly idiomatic, sentiment-charged language, we propose a decoder-side approach that incorporates automatic sentiment scoring into the MT candidate selection process. |
Alex Jones; Derry Tanti Wijaya; | arxiv-cs.CL | 2021-04-10 |
360 | Design and Implementation of English To Yoruba Verb Phrase Machine Translation System Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: We aim to develop an English to Yoruba machine translation system which can translate English verb phrase text to its Yoruba equivalent. Words from both the source language and the target language were collected for the verb phrase group in the home domain. The lexical translation is done by assigning values of the matching word in the dictionary. The syntax of the two languages was realized using Context-Free Grammar; we validated the rewrite rules with finite state automata. The human evaluation method was used and expert fluency scored. The evaluation shows the system performed better than that of sampled Google translation with over 70 percent of the response matching that of the system’s output. |
Safiriyu Eludiora; Benjamin Ajibade; | arxiv-cs.CL | 2021-04-08 |
361 | Extended Parallel Corpus for Amharic-English Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This paper describes the acquisition, preprocessing, segmentation, and alignment of an Amharic-English parallel corpus. |
Andargachew Mekonnen Gezmu; Andreas Nürnberger; Tesfaye Bayu Bati; | arxiv-cs.CL | 2021-04-08 |
362 | Interpreting Verbal Metaphors By Paraphrasing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this paper, we interpret metaphors with BERT and WordNet hypernyms and synonyms in an unsupervised manner, showing that our method significantly outperforms the state-of-the-art baseline. |
Rui Mao; Chenghua Lin; Frank Guerin; | arxiv-cs.CL | 2021-04-07 |
363 | AI4D — African Language Program Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: This work details the AI4D – African Language Program, a 3-part project that 1) incentivised the crowd-sourcing, collection and curation of language datasets through an online quantitative and qualitative challenge, 2) supported research fellows for a period of 3-4 months to create datasets annotated for NLP tasks, and 3) hosted competitive Machine Learning challenges on the basis of these datasets. |
KATHLEEN SIMINYU et. al. | arxiv-cs.CL | 2021-04-06 |
364 | IndT5: A Text-to-Text Transformer for 10 Indigenous Languages Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: In this work, we introduce IndT5, the first Transformer language model for Indigenous languages. |
El Moatez Billah Nagoudi; Wei-Rui Chen; Muhammad Abdul-Mageed; Hasan Cavusogl; | arxiv-cs.CL | 2021-04-04 |
365 | Sampling and Filtering of Neural Machine Translation Distillation Data Literature Review Related Patents Related Grants Related Orgs Related Experts Details Highlight: The highest-scoring hypothesis of the teacher model is commonly used to train a new model (student). |
Vilém Zouhar; | arxiv-cs.CL | 2021-04-01 |
366 | Many-to-English Machine Translation Tools, Data, and Pretrained Models Highlight: In this work, we present useful tools for machine translation research: MTData, NLCodec, and RTG. |
Thamme Gowda; Zhao Zhang; Chris A Mattmann; Jonathan May; | arxiv-cs.CL | 2021-04-01 |
367 | Low-Resource Neural Machine Translation for Southern African Languages Highlight: Motivated by this challenge we compare zero-shot learning, transfer learning and multilingual learning on three Bantu languages (Shona, isiXhosa and isiZulu) and English. |
Evander Nyoni; Bruce A. Bassett; | arxiv-cs.CL | 2021-04-01 |
368 | Autocorrect in The Process of Translation — Multi-task Learning Improves Dialogue Machine Translation Highlight: In this paper, we conduct a deep analysis of a dialogue corpus and summarize three major issues on dialogue translation, including pronoun dropping, punctuation dropping, and typos. To properly evaluate the performance, we propose a manually annotated dataset with 1,931 Chinese-English parallel utterances from 300 dialogues as a benchmark testbed for dialogue translation. |
Tao Wang; Chengqi Zhao; Mingxuan Wang; Lei Li; Deyi Xiong; | arxiv-cs.CL | 2021-03-30 |
369 | An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation Highlight: In this work, we present a case study of Tigrinya where we investigate several back-translation methods to generate synthetic source sentences. |
Lidia Kidane; Sachin Kumar; Yulia Tsvetkov; | arxiv-cs.CL | 2021-03-30 |
370 | English-Twi Parallel Corpus for Machine Translation Highlight: We present a parallel machine translation training corpus for English and Akuapem Twi of 25,421 sentence pairs. |
PAUL AZUNRE et al. | arxiv-cs.CL | 2021-03-29 |
371 | Contextual Text Embeddings for Twi Highlight: The specific contribution of this research work is the development of several pretrained transformer language models for the Akuapem and Asante dialects of Twi, paving the way for advances in application areas such as Named Entity Recognition (NER), Neural Machine Translation (NMT), Sentiment Analysis (SA) and Part-of-Speech (POS) tagging. |
PAUL AZUNRE et al. | arxiv-cs.CL | 2021-03-29 |
372 | Unsupervised Machine Translation On Dravidian Languages Highlight: In this work, we focus on unsupervised translation between English and Kannada, a low resource Dravidian language. |
Sai Koneru; Danni Liu; Jan Niehues; | arxiv-cs.CL | 2021-03-29 |
373 | PENELOPIE: Enabling Open Information Extraction for The Greek Language Through Machine Translation Highlight: In this paper we present our submission for the EACL 2021 SRW; a methodology that aims at bridging the gap between high and low-resource languages in the context of Open Information Extraction, showcasing it on the Greek language. |
Dimitris Papadopoulos; Nikolaos Papadakis; Nikolaos Matsatsinis; | arxiv-cs.CL | 2021-03-28 |
374 | Low-Resource Machine Translation Training Curriculum Fit for Low-Resource Languages Highlight: We conduct an empirical study of neural machine translation (NMT) for truly low-resource languages, and propose a training curriculum fit for cases when both parallel training data and compute resource are lacking, reflecting the reality of most of the world’s languages and the researchers working on these languages. |
GARRY KUWANTO et al. | arxiv-cs.CL | 2021-03-24 |
375 | Repairing Pronouns in Translation with BERT-Based Post-Editing Highlight: We investigate the severity of this pronoun issue, showing that (1) in some domains, pronoun choice can account for more than half of an NMT system’s errors, and (2) pronouns have a disproportionately large impact on perceived translation quality. |
Reid Pryzant; | arxiv-cs.CL | 2021-03-23 |
376 | The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation Highlight: This paper evaluates the performance of several modern subword segmentation methods in a low-resource neural machine translation setting. |
Jonne Sälevä; Constantine Lignos; | arxiv-cs.CL | 2021-03-20 |
377 | Dependency Graph-to-String Statistical Machine Translation Highlight: We present graph-based translation models which translate source graphs into target strings. |
Liangyou Li; Andy Way; Qun Liu; | arxiv-cs.CL | 2021-03-20 |
378 | Congolese Swahili Machine Translation for Humanitarian Response Highlight: In this paper we describe our efforts to make a bidirectional Congolese Swahili (SWC) to French (FRA) neural machine translation system with the motivation of improving humanitarian translation workflows. For training, we created a 25,302-sentence general domain parallel corpus and combined it with publicly available data. |
Alp Öktem; Eric DeLuca; Rodrigue Bashizi; Eric Paquin; Grace Tang; | arxiv-cs.CL | 2021-03-19 |
379 | Gumbel-Attention for Multi-modal Machine Translation Highlight: In this paper, we propose a novel Gumbel-Attention for multi-modal machine translation, which selects the text-related parts of the image features. |
Pengbo Liu; Hailong Cao; Tiejun Zhao; | arxiv-cs.CL | 2021-03-16 |
380 | Towards The Evaluation of Automatic Simultaneous Speech Translation from A Communicative Perspective Highlight: In this paper, we present the results of an experiment aimed at evaluating the quality of a real-time speech translation engine by comparing it to the performance of professional simultaneous interpreters. |
Claudio Fantinuoli; Bianca Prandi; | arxiv-cs.CL | 2021-03-15 |
381 | The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation Highlight: In this paper, we present MENYO-20k, the first multi-domain parallel corpus with a special focus on clean orthography for Yorùbá–English with standardized train-test splits for benchmarking. We provide several neural MT benchmarks and compare them to the performance of popular pre-trained (massively multilingual) MT models both for the heterogeneous test set and its subdomains. |
DAVID I. ADELANI et al. | arxiv-cs.CL | 2021-03-15 |
382 | Visual Cues and Error Correction for Translation Robustness Highlight: In this paper, we focus on three types of realistic noise that are commonly generated by humans and introduce the idea of visual context to improve translation robustness for noisy texts. |
Zhenhao Li; Marek Rei; Lucia Specia; | arxiv-cs.CL | 2021-03-12 |
383 | Unsupervised Transfer Learning in Multilingual Neural Machine Translation with Cross-Lingual Word Embeddings Highlight: In this work we look into adding a new language to a multilingual NMT system in an unsupervised fashion. |
Carlos Mullov; Ngoc-Quan Pham; Alexander Waibel; | arxiv-cs.CL | 2021-03-11 |
384 | Learning Feature Weights Using Reward Modeling for Denoising Parallel Corpora Highlight: This work presents an alternative approach which learns weights for multiple sentence-level features. |
Gaurav Kumar; Philipp Koehn; Sanjeev Khudanpur; | arxiv-cs.CL | 2021-03-11 |
385 | Bilingual Dictionary-based Language Model Pretraining for Neural Machine Translation Highlight: To alleviate the need for expensive parallel corpora by TLM, in this work, we incorporate the translation information from dictionaries into the pretraining process and propose a novel Bilingual Dictionary-based Language Model (BDLM). |
Yusen Lin; Jiayong Lin; Shuaicheng Zhang; Haoying Dai; | arxiv-cs.CL | 2021-03-11 |
386 | Translating The Unseen? Yoruba-English MT in Low-Resource, Morphologically-Unmarked Settings Highlight: In this work, we perform fine-grained analysis on how an SMT system compares with two NMT systems (BiLSTM and Transformer) when translating bare nouns in Yorùbá into English. |
Ife Adebara; Muhammad Abdul-Mageed; Miikka Silfverberg; | arxiv-cs.CL | 2021-03-06 |
387 | Multichannel LSTM-CNN for Telugu Technical Domain Identification Highlight: In this paper, we proposed the Multichannel LSTM-CNN methodology for Technical Domain Identification for Telugu. |
Sunil Gundapu; Radhika Mamidi; | arxiv-cs.CL | 2021-02-24 |
388 | Machine Translation Customization Via Automatic Training Data Selection from The Web Highlight: We describe an approach for customizing MT systems on specific domains by selecting data similar to the target customer data to train neural translation models. |
Thuy Vu; Alessandro Moschitti; | arxiv-cs.CL | 2021-02-19 |
389 | Crowdsourcing Parallel Corpus for English-Oromo Neural Machine Translation Using Community Engagement Platform Highlight: The paper deals with implementing a translation of English to Afaan Oromo and vice versa using Neural Machine Translation. |
SISAY CHALA et al. | arxiv-cs.AI | 2021-02-15 |
390 | InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model Highlight: We propose InsNet, an expressive insertion-based text generator with efficient training and flexible decoding (parallel or sequential). |
Sidi Lu; Tao Meng; Nanyun Peng; | arxiv-cs.CL | 2021-02-12 |
391 | Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation IF:3 Highlight: Experiments on three translation directions show that by fine-tuning from FAT-MLM, our proposed speech translation models substantially improve translation quality by up to +5.9 BLEU. |
Renjie Zheng; Junkun Chen; Mingbo Ma; Liang Huang; | arxiv-cs.CL | 2021-02-10 |
392 | Listen, Understand and Translate: Triple Supervision Decouples End-to-end Speech-to-text Translation IF:3 Highlight: In this paper, we propose Listen-Understand-Translate, (LUT), a unified framework with triple supervision signals to decouple the end-to-end speech-to-text translation task. |
QIANQIAN DONG et al. | aaai | 2021-02-09 |
393 | Multilingual Transfer Learning for QA Using Translation As Data Augmentation Highlight: In this work, we explore strategies that improve cross-lingual transfer by bringing the multilingual embeddings closer in the semantic space. |
Mihaela Bornea; Lin Pan; Sara Rosenthal; Radu Florian; Avirup Sil; | aaai | 2021-02-09 |
394 | Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation Highlight: This work shows the first attempt of a source-target bilingual syntactic alignment approach SyntAligner by mutual information maximization-based self-supervised neural deep modeling. |
Tianfu Zhang; Heyan Huang; Chong Feng; Longbing Cao; | aaai | 2021-02-09 |
395 | Empirical Regularization for Synthetic Sentence Pairs in Unsupervised Neural Machine Translation Highlight: In this work, we empirically study the core training procedure of UNMT to analyze the synthetic sentence pairs obtained from back-translation. |
Xi Ai; Bin Fang; | aaai | 2021-02-09 |
396 | Bridging The Domain Gap: Improve Informal Language Translation Via Counterfactual Domain Adaptation Highlight: To address this problem, we propose a counterfactual domain adaptation method to better leverage both large-scale source-domain data (formal texts) and small-scale target-domain data (informal texts). |
Ke Wang; Guandan Chen; Zhongqiang Huang; Xiaojun Wan; Fei Huang; | aaai | 2021-02-09 |
397 | Towards Fully Automated Manga Translation Highlight: In this paper, we make the following four contributions that establish the foundation of manga translation research. |
Ryota Hinami; Shonosuke Ishiwatari; Kazuhiko Yasuda; Yusuke Matsui; | aaai | 2021-02-09 |
398 | Accelerating Neural Machine Translation with Partial Word Embedding Compression Highlight: In this paper, we propose Partial Vector Quantization (P-VQ) for NMT models, which can both compress the word embedding matrix and accelerate word probability prediction in the softmax layer. |
Fan Zhang; Mei Tu; Jinyao Yan; | aaai | 2021-02-09 |
399 | Learning Light-Weight Translation Models from Deep Transformer IF:3 Highlight: In this paper, we take a natural step towards learning strong but light-weight NMT systems. |
BEI LI et al. | aaai | 2021-02-09 |
400 | Studying The Usage of Text-To-Text Transfer Transformer to Support Code-Related Tasks IF:3 Highlight: In this paper, we empirically investigate how the T5 model performs when pre-trained and fine-tuned to support code-related tasks. |
ANTONIO MASTROPAOLO et al. | arxiv-cs.SE | 2021-02-03 |
401 | Uncertainty Estimation in Autoregressive Structured Prediction Highlight: A Deep Investigation of Ensemble-based Uncertainty Estimation for Autoregressive ASR and NMT models. |
Andrey Malinin; Mark Gales; | iclr | 2021-01-21 |
402 | GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding IF:4 Highlight: In this paper we demonstrate conditional computation as a remedy to the above mentioned impediments, and demonstrate its efficacy and utility. |
DMITRY LEPIKHIN et al. | iclr | 2021-01-21 |
403 | GENIE: A Leaderboard for Human-in-the-Loop Evaluation of Text Generation IF:3 Highlight: This work introduces GENIE, an extensible human evaluation leaderboard, which brings the ease of leaderboards to text generation tasks. We introduce several datasets in English to GENIE, representing four core challenges in text generation: machine translation, summarization, commonsense reasoning, and machine comprehension. |
DANIEL KHASHABI et al. | arxiv-cs.CL | 2021-01-16 |
404 | The Impact of Post-editing and Machine Translation on Creativity and Reading Experience Highlight: This article presents the results of a study involving the translation of a fictional story from English into Catalan in three modalities: machine-translated (MT), post-edited (MTPE) and translated without aid (HT). |
Ana Guerberof Arenas; Antonio Toral; | arxiv-cs.CL | 2021-01-15 |
405 | Context- and Sequence-Aware Convolutional Recurrent Encoder for Neural Machine Translation Highlight: Existing models use recurrent neural networks to construct both the encoder and decoder modules. |
Ritam Mallick; Seba Susan; Vaibhaw Agrawal; Rizul Garg; Prateek Rawal; | arxiv-cs.CL | 2021-01-11 |
406 | The Solution of The Problem of Unknown Words Under Neural Machine Translation of The Kazakh Language Abstract: The paper proposes a solution to the problem of unknown words for neural machine translation (NMT). The proposed solution is shown by the example of NMT of the … |
Aliya Turganbayeva; Ualsher Tukeyev; | Journal of Information and Telecommunication | 2021-01-01 |
407 | PhraseAttn: Dynamic Slot Capsule Networks for Phrase Representation in Neural Machine Translation Abstract: Word representation plays a vital role in most Natural Language Processing systems, especially for Neural Machine Translation. It tends to capture semantic and similarity between … |
Binh Nguyen; Binh Van Le; Long H.B. Nguyen; Dien Dinh; | Journal of Intelligent & Fuzzy Systems | 2021-01-01 |
408 | Topology-Sensitive Neural Architecture Search for Language Modeling Abstract: Recently Neural Architecture Search has drawn interest from researchers because of its ability to learn neural network architectures from data automatically. The differentiable … |
Quan Du; Nuo Xu; Yinqiao Li; Tong Xiao; Jingbo Zhu; | IEEE Access | 2021-01-01 |
409 | Neural Machine Translation for Turkish to English Using Deep Learning |
Fatih Balki; Hilmi Demirhan; Salih Sarp; | Digital Interaction and Machine Intelligence | 2021-01-01 |
410 | Neural Network-Based Tree Translation for Knowledge Base Construction Abstract: Knowledge bases (KB), such as Probase and ConceptNet, play an important role in many natural language processing tasks. Compared with resource-poor languages such as Chinese, the … |
Haijun Zhang; | IEEE Access | 2021-01-01 |
411 | Reinforced Transformer with Cross-Lingual Distillation for Cross-Lingual Aspect Sentiment Classification Abstract: Though great progress has been made in the Aspect-Based Sentiment Analysis (ABSA) task through research, most of the previous work focuses on English-based ABSA problems, and there … |
Hanqian Wu; Zhike Wang; Feng Qing; Shoushan Li; | Electronics | 2021-01-01 |
412 | IITP-MT at WAT2021: Indic-English Multilingual Neural Machine Translation Using Romanized Vocabulary Abstract: This paper describes the systems submitted to WAT 2021 MultiIndicMT shared task by IITP-MT team. We submit two multilingual Neural Machine Translation (NMT) systems … |
Ramakrishna Appicharla; Kamal Kumar Gupta; Asif Ekbal; Pushpak Bhattacharyya; | | 2021-01-01 |
413 | The COVID-19 Fake News Detection in Thai Social Texts Abstract: One important obstruction against Thai COVID-19 recovery is fake news shared on social media that is one of the “Artificial Intelligence Open Issues against COVID-19” reported … |
Pakpoom Mookdarsanit; Lawankorn Mookdarsanit; | Bulletin of Electrical Engineering and Informatics | 2021-01-01 |
414 | Multilingual Machine Translation Systems at WAT 2021: One-to-Many and Many-to-One Transformer Based NMT Abstract: In this paper, we present the details of the systems that we have submitted for the WAT 2021 MultiIndicMT: An Indic Language Multilingual Task. We have submitted two separate … |
Shivam Mhaskar; Aditya Jain; Aakash Banerjee; Pushpak Bhattacharyya; | | 2021-01-01 |
415 | Optimal Word Segmentation for Neural Machine Translation Into Dravidian Languages Abstract: Dravidian languages, such as Kannada and Tamil, are notoriously difficult to translate by state-of-the-art neural models. This stems from the fact that these languages are … |
Prajit Dhar; Arianna Bisazza; Gertjan van Noord; | | 2021-01-01 |
416 | A New Model for Coreference Resolution Based on Knowledge Representation and Multi-criteria Ranking Abstract: Coreference resolution is critical for improving the performance of all text-based systems including information extraction, document summarization, machine translation, and … |
Samira Hourali; Morteza Zahedi; Mansour Fateh; | J. Intell. Fuzzy Syst. | 2021-01-01 |
417 | ANVITA Machine Translation System for WAT 2021 MultiIndicMT Shared Task Abstract: This paper describes ANVITA-1.0 MT system, architected for submission to WAT2021 MultiIndicMT shared task by mcairt team, where the team participated in 20 translation directions: … |
Pavanpankaj Vegi; J. Sivabhavani; Biswajit Paul; Chitra Viswanathan; K. R. Prasanna Kumar; | | 2021-01-01 |
418 | English-Vietnamese Machine Translation Using Deep Learning Abstract: Recently, artificial intelligence-based machine translation has been much improved over the traditional methods. A machine translator is very useful for translating text or speech … |
Tuan Nguyen Minh; Phayung Meesad; Huy Cuong Nguyen Ha; | Lecture Notes in Networks and Systems | 2021-01-01 |
419 | Hybrid System Combination Framework for Uyghur-Chinese Machine Translation Abstract: Both the statistical machine translation (SMT) model and neural machine translation (NMT) model are the representative models in Uyghur–Chinese machine translation tasks with … |
Yajuan Wang; Xiao Li; Yating Yang; Azmat Anwar; Rui Dong; | Inf. | 2021-01-01 |
420 | Common Lexical Errors Made By Machine Translation On Cultural Text Abstract: Machine translation is one tool of Google that presents various languages to translate. As a translator machine, the results of Google Translate are not always perfectly correct. … |
Nanda Fitri Mar’athus Sholikhah; | | 2021-01-01 |
421 | TMEKU System for The WAT2021 Multimodal Translation Task Abstract: We introduce our TMEKU system submitted to the English-Japanese Multimodal Translation Task for WAT 2021. We participated in the Flickr30kEnt-JP task and Ambiguous MSCOCO … |
Yuting Zhao; Mamoru Komachi; Tomoyuki Kajiwara; Chenhui Chu; | | 2021-01-01 |
422 | Development of A Model and Software Solution for The Problem of Determining Unknown Words in Post-editing Machine Translation Abstract: Machine translation is the technology of consecutive translation of texts from one language to another by a computer program. As a result of machine translation, there are always … |
D. R. Rakhimova; N. M. Pazylkhan; A. A. Kulzhanova; Zh.G. Alen; | | 2021-01-01 |
423 | Adaptation of Back-translation to Automatic Post-Editing for Synthetic Data Generation Abstract: Automatic Post-Editing (APE) aims to correct errors in the output of a given machine translation (MT) system. Although data-driven approaches have become prevalent also in the APE … |
WonKee Lee; Baikjin Jung; Jaehun Shin; Jong-Hyeok Lee; | | 2021-01-01 |
424 | Statistical and Neural Machine Translation Systems of English to Manipuri: A Preliminary Study Abstract: The present work reports the findings of the experimental result of English to Manipuri machine translation systems using neural and statistical approaches. The experiment on the … |
Salam Michael Singh; Thoudam Doren Singh; | | 2021-01-01 |
425 | Comparing Statistical and Neural Machine Translation Performance on Hindi-To-Tamil and English-To-Tamil Abstract: Phrase-based statistical machine translation (PB-SMT) has been the dominant paradigm in machine translation (MT) research for more than two decades. Deep neural MT models have … |
Akshai Ramesh; Venkatesh Balavadhani Parthasarathy; Rejwanul Haque; Andy Way; | | 2021-01-01 |
426 | OCR Error Correction for Vietnamese Handwritten Text Using Neural Machine Translation Abstract: OCR post-processing is an important step for improving the quality of OCR output texts. Long short-term memory (LSTM) is a deep learning model, which has wide-range applications … |
D. Q. Nguyen; A. D. Le; M. N. Phan; P. Kromer; I. Zelinka; | 1ST VAN LANG INTERNATIONAL CONFERENCE ON HERITAGE AND … | 2021-01-01 |
427 | Comparative Analysis of Language Translation and Detection System Using Machine Learning Abstract: Words are the meaty component which can be expressed through speech, writing or signals. It is important that the actual message or meaning of the words sent must … |
Aishwarya R. Verma; | International Journal for Research in Applied Science and … | 2021-01-01 |
428 | Machine Translation System Using Deep Learning for Punjabi to English |
Kamal Deep; Ajit Kumar; Vishal Goyal; | | 2021-01-01 |
429 | MENYO-20k: A Multi-domain English-Yorùbá Corpus for Machine Translation and Domain Adaptation Abstract: Massively multilingual machine translation (MT) has shown impressive capabilities, including zero and few-shot translation between low-resource language pairs. However, these … |
DAVID I. ADELANI et al. | ArXiv | 2021-01-01 |
430 | Utilizing Machine Translation Systems to Generate Word Lists for Learning Vocabulary in English |
Jin-Ha Woo; Heeyoul Choi; | | 2021-01-01 |
431 | Networked Artificial Intelligence English Translation System Based on An Intelligent Knowledge Base and Translation Method Thereof Abstract: Language translation is often conducted in work and study. Traditional language translation is based on lexical structure analysis. However, natural language is not so … |
Shuping Ren; | Mob. Inf. Syst. | 2021-01-01 |
432 | TMU NMT System with Japanese BART for The Patent Task of WAT 2021 Abstract: In this paper, we introduce our TMU Neural Machine Translation (NMT) system submitted for the Patent task (Korean Japanese and English Japanese) of 8th Workshop on Asian … |
Hwichan Kim; Mamoru Komachi; | | 2021-01-01 |
433 | Improved English to Hindi Multimodal Neural Machine Translation Abstract: Machine translation performs automatic translation from one natural language to another. Neural machine translation attains a state-of-the-art approach in machine translation, but … |
Sahinur Rahman Laskar; Abdullah Faiz Ur Rahman Khilji; Darsh Kaushik; Partha Pakray; Sivaji Bandyopadhyay; | | 2021-01-01 |
434 | METHOD OF SYSTEM ENGINEERING OF NEURAL MACHINE TRANSLATION SYSTEMS Abstract: Background. There are not many machine translation companies on the market whose products are in demand. These are, for example, free and commercial products such as … |
Pavlo P. Maslianko; Yevhenii P. Sielskyi; | KPI Science News | 2021-01-01 |
435 | Build Italian-Chinese Parallel Sentence Corpus to Implement Neural Machine Translation Abstract: Cooperation in infrastructure and economics between China and Italy is deepening and cultural exchanges are more closely related, thus the demand for Italian-Chinese translation … |
Wuying Liu; Lin Bai; Randie Yi; Han Wu; | Advances in Natural Computation, Fuzzy Systems and … | 2021-01-01 |
436 | Example-Based Hybrid Higher-Order Neural Network Cognition Applied for Archive Translation Abstract: This paper constructs the basic principles and system structure of example-based hybrid higher-order neural network cognition for machine translation. On this basis, the paper … |
Lilan Chen; Yongsheng Chen; | Advances in Intelligent Automation and Soft Computing | 2021-01-01 |
437 | Translation Mechanism of Neural Machine Algorithm for Online English Resources Abstract: At the level of English resource vocabulary, due to the lack of vocabulary alignment structure, the translation of neural machine translation has the problem of unfaithfulness. … |
Yanping Ye; | Complex. | 2021-01-01 |
438 | Research on Military Text Machine Translation Based on Deep Neural Network |
Xiangwei Liu; Liang Tang; Xin Ma; Jiang Hu; | Advances in Intelligent Automation and Soft Computing | 2021-01-01 |
439 | Research on The Application of BERT in Mongolian-Chinese Neural Machine Translation Abstract: In recent years, the research of neural networks has brought new solutions to machine translation. The application of sequence-to-sequence model has made a qualitative leap in the … |
Xiu Zhi; Siriguleng Wang; | 2021 13th International Conference on Machine Learning and … | 2021-01-01 |
440 | Different Processes for Translating Expressive Versus Informative Texts? A Computer-assisted Study of Professionals’ English-Chinese Translation |
Jianwei Zheng; Wenjun Fan; | Digit. Scholarsh. Humanit. | 2021-01-01 |
441 | Monolingual Corpus Driven Vietnamese-Chinese Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Neural machine translation (NMT) usually requires a massive parallel corpus of high quality as training data, the lack of which limits the performance of the NMT model for some … |
Lin Wang; Zhaoxuan Li; Hongyan Zhang; Wuying Liu; | Advances in Natural Computation, Fuzzy Systems and … | 2021-01-01 |
442 | An Investigation of Machine Translation Output Quality and The Influencing Factors of Source Texts Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: The use of machine translation (MT) in the academic context has increased in recent years. Hence, language teachers have found it difficult to ignore MT, which has led to some … |
Sangmin-Michelle Lee; | ReCALL | 2021-01-01 |
443 | Better Chinese Sentence Segmentation with Reinforcement Learning Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: A long-standing challenge in Chinese–English machine translation is that sentence boundaries are ambiguous in Chinese orthography, but inferring good splits is necessary for … |
Srivatsan Srinivasan; Chris Dyer; | | 2021-01-01 |
444 | Product Review Translation: Parallel Corpus Creation and Robustness Towards User-generated Noisy Text Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Reviews written by the users for a particular product or service play an influencing role for the customers to make an informative decision. Although online e-commerce portals … |
Kamal Kumar Gupta; Soumya Chennabasavaraj; Nikesh Garera; Asif Ekbal; | Proceedings of The 4th Workshop on e-Commerce and NLP | 2021-01-01 |
445 | Deep Residual and Deep Dense Attentions in English Chinese Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Neural Machine Translation (NMT) with attention mechanism has achieved impressively improvement for automated translation. However, such models may lose information during … |
Yi-Xing Lin; Kai-Wen Liang; Chih-Hsuan Yang; Jia-Ching Wang; | 2021 IEEE International Conference on Consumer … | 2021-01-01 |
446 | Translating Idioms using Paraphrasing, Machine Translation and Rescoring Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Idioms are rich multi-word expressions that can be found in many works of literature. The meaning of most idioms cannot be deduced literally. This makes translating idioms … |
Tan Tien-Ping et. al. | | 2021-01-01 |
447 | Domain-Aware Self-Attention for Multi-Domain Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: In this paper, we investigate multi-domain neural machine translation (NMT) that translates sentences of different domains in a single model. To this end, we propose a … |
Shiqi Zhang; Yan Liu; Deyi Xiong; Pei Zhang; Boxing Chen; | Interspeech 2021 | 2021-01-01 |
448 | The Advantages and Disadvantages of Machine Translation from The Perspective of Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: With the rapid popularization of machine translation, there are always differences between machine translation and human translation, and there are also many differences between … |
Ying Peng; | OALib | 2021-01-01 |
449 | SocialSciTerm: An English-Chinese Parallel Term Resource for Collaborative Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Bilingual term resources are helpful on collaborative translation tasks. Firstly, we build an English-Chinese parallel term resource of social sciences (SocialSciTerm) based on … |
Chenxi Zhu; Lin Wang; Wuying Liu; | Advances in Natural Computation, Fuzzy Systems and … | 2021-01-01 |
450 | Unsupervised Neural Machine Translation for Similar and Distant Language Pairs Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Unsupervised neural machine translation (UNMT) has achieved remarkable results for several language pairs, such as French–English and German–English. Most previous studies have … |
HAIPENG SUN et. al. | ACM Transactions on Asian and Low-Resource Language … | 2021-01-01 |
451 | Fast Streaming Translation Using Machine Learning with Transformer Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine Translation is the usage of machine learning techniques in translation from one language to another. It has recently been applied to streaming translation, also known as … |
Jiabao Qiu; Melody Moh; Teng-Sheng Moh; | Proceedings of the 2021 ACM Southeast Conference | 2021-01-01 |
452 | Post-editing Guidelines for Korean-English Machine Translation of Informative Texts Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Kunyoung Park; | | 2021-01-01 |
453 | Enhancing Language Generation with Effective Checkpoints of Pre-trained Language Model Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: This work empirically explores effective exploiting of intermediate output from pretrained language models (PrLMs) for language generation tasks. For this purpose, we propose an … |
Jeonghyeok Park; Hai Zhao; | | 2021-01-01 |
454 | Exploring The Effectiveness of Employing Limited Resources for Deep Neural Pairwise Evaluation of Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: In this paper, a light resource learning schema, i.e. a schema that depends on limited resources, is introduced, which aims to choose the better translation between two machine … |
Despoina Mouratidis; Katia Lida Kermanidis; | 2021 12th International Conference on Information, … | 2021-01-01 |
455 | Multilingual Sequence to Sequence Convolutional Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: In this paper, to improve the translation quality of a sentence, Convolutional sequence to sequence architecture has been applied to English-Punjabi, Punjabi-English, … |
Mani Bansal; D. K. Lobiyal; | Multim. Tools Appl. | 2021-01-01 |
456 | Various Approaches of Machine Translation for Marathi to English Language Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine Translation (MT) is a generic term for computerised systems that generate translations from one natural language to another, with or without human intervention. Text may … |
Nilesh Shirsath; Aniruddha Velankar; Ranjeet Patil; Shilpa Shinde; | ITM Web of Conferences | 2021-01-01 |
457 | Neural Machine Translation Approach for Singlish to English Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Comprehension of “Singlish” (an alternative writing system for Sinhala language) texts by a machine had been a requirement for a long period. It has been a choice of many Sri … |
Dinidu Sandaruwan; Sagara Sumathipala; Subha Fernando; | International Journal on Advances in ICT for Emerging … | 2021-01-01 |
458 | THE TRANSLATION RESULTS OF GOOGLE TRANSLATE FROM INDONESIAN TO ENGLISH Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Google Translate is a free multilingual translation machine developed by Google that can assist translators to make their translation functions easier and faster. The aim of the … |
Tiara Noviarini; | | 2021-01-01 |
459 | Re-Transformer: A Self-Attention Based Model for Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine translation is one of the most popular and hardest tasks in Natural Language Processing. This paper proposes a self-attention based model for machine translation, … |
Huey-Ing Liu; Wei-Lin Chen; | Procedia Computer Science | 2021-01-01 |
460 | Automatic Evaluation of The Quality of Machine Translation of A Scientific Text: The Results of A Five-year-long Experiment Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: We report on various approaches to automatic evaluation of machine translation quality and describe three widely used methods. These methods, i.e. methods based on string matching … |
Ilya Ulitkin; Irina Filippova; Natalia Ivanova; Alexey Poroykov; | E3S Web of Conferences | 2021-01-01 |
461 | Improving Neural Machine Translation with Sentence Alignment Learning Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Neural machine translation (NMT) optimized by maximum likelihood estimation (MLE) usually lacks the guarantee of translation adequacy. To alleviate this problem, we … |
Xuewen Shi; Heyan Huang; Ping Jian; Yi-Kun Tang; | Neurocomputing | 2021-01-01 |
462 | Morphology Generation for English-Indian Language Statistical Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: When translating into morphologically rich languages, statistical MT approaches face the problem of data sparsity. The severity of the sparseness problem will be high when the … |
S Sreelekha; | Soft Comput. | 2021-01-01 |
463 | Parallel Corpora Preparation for English-Amharic Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Yohanens Biadgligne; Kamel Smaïli; | | 2021-01-01 |
464 | ATLASLang NMT: Arabic Text Language Into Arabic Sign Language Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: ATLASLang is a machine translation system from Arabic text language into Arabic sign language (ArSL). The first version of the system (Brour and Benabbou, 2019) is based … |
Mourad Brour; Abderrahim Benabbou; | J. King Saud Univ. Comput. Inf. Sci. | 2021-01-01 |
465 | Findings of The Second Workshop on Automatic Simultaneous Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: This paper presents the results of the shared task of the 2nd Workshop on Automatic Simultaneous Translation (AutoSimTrans). The task includes two tracks, one for text-to-text … |
Ruiqing Zhang; Chuanqiang Zhang; Zhongjun He; Hua Wu; Haifeng Wang; | | 2021-01-01 |
466 | Intelligent English Translation System Based on Evolutionary Multi-objective Optimization Algorithm Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: The difficulty of obtaining the characteristics of the corpus database of neural machine translation is a factor hindering its development. In order to improve the effect of … |
Xin Song; | J. Intell. Fuzzy Syst. | 2021-01-01 |
467 | Exploring Subword Segmentation Methods in English-Vietnamese Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Thang H. Nguyen-Vo; Duc Toan Truong; Long H. B. Nguyen; Dien Dinh; | | 2021-01-01 |
468 | Hindi to English: Transformer-Based Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine Translation (MT) is one of the most prominent tasks in Natural Language Processing (NLP) which involves the automatic conversion of texts from one natural language to … |
Kavit Gangar; Hardik Ruparel; Shreyas Lele; | Lecture Notes in Electrical Engineering | 2021-01-01 |
469 | Design and Research of English-Chinese Translation Platform Based on BP Neural Network Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Jinchun Zhang; Dongshui Zhang; | | 2021-01-01 |
470 | Tutorial Proposal: End-to-End Speech Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Speech translation is the translation of speech in one language typically to text in another, traditionally accomplished through a combination of automatic speech recognition and … |
Jan Niehues; Elizabeth Salesky; Marco Turchi; Matteo Negri; | | 2021-01-01 |
471 | English to Yoruba Short Message Service Speech and Text Translator for Android Phones Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine language translation (MLT) has acquired a substantial quantity of investigation consideration in Europe and Asia, but works on African languages, especially the Yoruba … |
AKINBOWALE NATHANIEL BABATUNDE et. al. | Int. J. Speech Technol. | 2021-01-01 |
472 | Video-guided Machine Translation with Spatial Hierarchical Attention Network Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Neural machine translation (NMT) has achieved high performance for domains where there is almost no ambiguity in data such as newspaper domain [1, 2]. However, for other domains … |
Weiqi Gu; Haiyue Song; Chenhui Chu; Sadao Kurohashi; | | 2021-01-01 |
473 | Translation Shifts on Reference By Machine Translation in Descriptive Text Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Translation shifts are one of strategy to get a high-quality translation. It’s also used to solve the absent meaning on the target text. The objectives of this research are to … |
Kammer Tuahman Sipayung; | | 2021-01-01 |
474 | The Inspiration of Effort Model to The Practice Teaching of Simultaneous Translation with Shorthand Typing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: With the rapid development of science and technology and global integration today, modern information technology plays an irreplaceable role in education. The course of … |
Beilei Chen; | | 2021-01-01 |
475 | Design of Text and Voice Machine Translation Tool for Presentations Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Thi-My-Thanh Nguyen; Xuan-Dung Phan; Ngoc-Bich Le; Xuan-Quy Dao; | | 2021-01-01 |
476 | Progress in Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: After more than 70 years of evolution, great achievements have been made in machine translation. Especially in recent years, translation quality has been greatly improved … |
Haifeng Wang; Hua Wu; Zhongjun He; Liang Huang; Kenneth Ward Church; | Engineering | 2021-01-01 |
477 | An Overview of The Basic NLP Resources Towards Building The Assamese-English Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine Translation (MT) is the process of automatically converting one natural language into another, preserving the exact meaning of the input text to the output text. It is one … |
Nibedita Roy; Apurbalal Senapati; | Proceedings of Intelligent Computing and Technologies … | 2021-01-01 |
478 | Translating Sentimental Statements Using Deep Learning Techniques Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Natural Language Processing (NLP) allows machines to know nature languages and helps us do tasks, such as retrieving information, answering questions, text summarization, … |
Yin-Fu Huang; Yi-Hao Li; | Electronics | 2021-01-01 |
479 | Source-side Reordering to Improve Machine Translation Between Languages with Distinct Word Orders Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: English and Hindi have significantly different word orders. English follows the subject-verb-object (SVO) order, while Hindi primarily follows the subject-object-verb (SOV) order. … |
Karunesh Kumar Arora; Shyam Sunder Agrawal; | Transactions on Asian and Low-Resource Language Information … | 2021-01-01 |
480 | Low Resource Neural Machine Translation from English to Khasi: A Transformer-Based Approach Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
N. Donald Jefferson Thabah; Bipul Syam Purkayastha; | | 2021-01-01 |
481 | Research on Uyghur-Chinese Neural Machine Translation Based on The Transformer at Multistrategy Segmentation Granularity Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: In recent years, machine translation based on neural networks has become the mainstream method in the field of machine translation, but there are still challenges of insufficient … |
Zhiwang Xu; Huibin Qin; Yongzhu Hua; | Mob. Inf. Syst. | 2021-01-01 |
482 | Optical Character Recognition and Neural Machine Translation Using Deep Learning Techniques Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Over the years, the applications of text detection and text translation have expanded across various fields. Many researchers have used several deep learning algorithms for text … |
K. Chandra Shekar; Maria Anisha Cross; Vignesh Vasudevan; | Innovations in Computer Science and Engineering | 2021-01-01 |
483 | Empirical Analysis of Performance of MT Systems and Its Metrics for English to Bengali: A Black Box-Based Approach Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: There are numerous use cases of machine translation (MT) systems. Therefore, it has become very important to evaluate the performance of MT which can help researchers design a … |
Goutam Datta; Nisheeth Joshi; Kusum Gupta; | | 2021-01-01 |
484 | Semantic and Syntactic Information for Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Introducing factors such as linguistic features has long been proposed in machine translation to improve the quality of translations. More recently, factored machine translation … |
Jordi Armengol-Estapé; Marta R. Costa-jussà; | Mach. Transl. | 2021-01-01 |
485 | A Comparative Study on The Quality of Translation in Korean-English Translation Using Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Gilja Byun; | The Journal of Mirae English Language and Literature | 2021-01-01 |
486 | Development of English-to-Bengali Neural Machine Translation Systems Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Anwesha Das; Thoudam Doren Singh; | | 2021-01-01 |
487 | NAIST English-to-Japanese Simultaneous Translation System for IWSLT 2021 Simultaneous Text-to-text Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: This paper describes NAIST’s system for the English-to-Japanese Simultaneous Text-to-text Translation Task in IWSLT 2021 Evaluation Campaign. Our primary submission is based on … |
RYO FUKUDA et. al. | | 2021-01-01 |
488 | Neural Machine Translation 2020, By Philipp Koehn, Cambridge, Cambridge University Press, ISBN 978-1-108-49732-9, Pages 393 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Neural Machine Translation delivers a thorough and well-structured walk through the core concepts of the field. The book is primarily aimed at students who will want to go on to … |
Alexandra Birch; | Natural Language Engineering | 2021-01-01 |
489 | Assessing Human Post-Editing Efforts to Compare The Performance of Three Machine Translation Engines for English to Russian Translation of Cochrane Plain Language Health Information: Results of A Randomised Comparison Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Cochrane produces independent research to improve healthcare decisions. It translates its research summaries into different languages to enable wider access, relying largely on … |
Liliya Eugenevna Ziganshina; Ekaterina V. Yudina; Azat I. Gabdrakhmanov; Juliane Ried; | Informatics | 2021-01-01 |
490 | Analysis of Errors in Machine Translation from Roger T. Bell’s Translation Process Model Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: There are enough literature reviews about machine translation, but the numbers of texts studied are not large enough, and there are very limited varieties of machine translation … |
Jianbin Zhu; Min Zhang; | | 2021-01-01 |
491 | English-Arabic Cross-language Plagiarism Detection Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: The advancement of the web and information technology has contributed to the rapid growth of digital libraries and automatic machine translation tools which easily translate texts … |
Naif Alotaibi; Mike Joy; | | 2021-01-01 |
492 | Dual Knowledge Distillation for Bidirectional Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Building strong and robust neural machine translation systems needs large amount of high-quality parallel corpora. However, most of language pairs are limited in quantity, … |
Huaao Zhang; Shigui Qiu; Shilong Wu; | 2021 International Joint Conference on Neural Networks … | 2021-01-01 |
493 | Translational Equivalence in Statistical Machine Translation or Meaning As Co-occurrence Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: In this paper, we will describe the current state-of-the-art of Statistical Machine Translation (SMT), and reflect on how SMT handles meaning. Statistical Machine Translation is a … |
Lieve Macken; Els Lefever; | Linguistica Antverpiensia, New Series – Themes in … | 2021-01-01 |
494 | Multilingual Simultaneous Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Simultaneous machine translation (SIMT) involves translating source utterances to the target language in real-time before the speaker utterance completes. This paper proposes the … |
Philip Arthur; Dongwon Ryu; Gholamreza Haffari; | | 2021-01-01 |
495 | Factors Behind The Effectiveness of An Unsupervised Neural Machine Translation System Between Korean and Japanese Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Korean and Japanese have different writing scripts but share the same Subject-Object-Verb (SOV) word order. In this study, we pre-train a language-generation model using a Masked … |
Yong-Seok Choi; Yo-Han Park; Seung Yun; Sang-Hun Kim; Kong-Joo Lee; | Applied Sciences | 2021-01-01 |
496 | Using Dependency-Based Contextualization for Transferring Passive Constructions from English to Spanish Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: We hypothesize that parallel corpora as well as machine translation outputs contain many literal translations that are the result of transferring the constructions of the source … |
Pablo Otero; Gorka Labaka Intxauspe; | Procesamiento Del Lenguaje Natural | 2021-01-01 |
497 | Recent Progress, Emerging Techniques, and Future Research Prospects of Bangla Machine Translation: A Systematic Review Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine Translation (MT), the way of translating texts or documents from a source language to a target language automatically without human intervention, has gained popularity in … |
M. A. H. Akhand; Arna Roy; Argha Chandra Dhar; Abdus Samad Kamal; | International Journal of Advanced Computer Science and … | 2021-01-01 |
498 | Should We Find Another Model?: Improving Neural Machine Translation Performance with ONE-Piece Tokenization Method Without Model Modification IF:3 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Most of the recent Natural Language Processing(NLP) studies are based on the Pretrain-Finetuning Approach (PFA), but in small and medium-sized enterprises or companies with … |
Chanjun Park; Sugyeong Eo; Hyeonseok Moon; Heuiseok Lim; | | 2021-01-01 |
499 | Considering Machine Translation (MT) As An Aid or A Threat to The Human Translator: Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: The present study aims to evaluate the output quality of an online MT; namely, Google Translate, from English into Persian and compare its output with the translations made by the … |
Hamidreza Abdi; | | 2021-01-01 |
500 | Investigating Usability in Postediting Neural Machine Translation: Evidence from Translation Trainees’ Self-perception and Performance Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: This is a report on an empirical study on the usability for translation trainees of neural machine translation systems when post-editing (mtpe). Sixty Chinese translation trainees … |
Xiangling Wang; Tingting Wang; Ricardo Muñoz Martín; Yanfang Jia; | Across Languages and Cultures | 2021-01-01 |
501 | Applying Machine Translation Methods in The Problem of Automatic Text Correction Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Wojciech Jarmosz; | | 2021-01-01 |
502 | Research on The Application of Artificial Intelligence in Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Under the influence of the development of big data and cloud computing technology, machine translation based on artificial intelligence has gradually entered people’s lives. … |
Wang LingZhi; | | 2021-01-01 |
503 | Improving Neural Machine Translation with Latent Features Feedback Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Most state-of-the-art neural machine translation (NMT) models progressively encode feature representation in a bottom-up feed-forward fashion. This traditional encoding … |
Yachao Li; Junhui Li; Min Zhang; | Neurocomputing | 2021-01-01 |
504 | On Knowledge Distillation for Translating Erroneous Speech Transcriptions Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Recent studies argue that knowledge distillation is promising for speech translation (ST) using end-to-end models. In this work, we investigate the effect of knowledge … |
Ryo Fukuda; Katsuhito Sudoh; Satoshi Nakamura; | | 2021-01-01 |
505 | A Case Study on User Evaluation of Scientific Publication Summarization By Japanese Students Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Summaries of scientific publications enable readers to gain an overview of a large number of studies, but users’ preferences have not yet been explored. In this paper, we conduct … |
Shintaro Yamamoto; Ryota Suzuki; Tsukasa Fukusato; Hirokatsu Kataoka; Shigeo Morishima; | Applied Sciences | 2021-01-01 |
506 | Transformer with Syntactic Position Encoding for Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: It has been widely recognized that syntax information can help end-to-end neural machine translation (NMT) systems to achieve better translation. In order to integrate dependency … |
Yikuan Xie; Wenyong Wang; Mingqian Du; Qing He; | | 2021-01-01 |
507 | The Quality of Machine Translation Assessment On Gender Markers Lingual Units Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine Translation (MT) is one of the most advanced and elaborate research fields within Translation Technology, the quality of MT output has always been a great concern, and MT … |
Hapni Nurliana H.D Hasibuan; | Lensa: Kajian Kebahasaan, Kesusastraan, dan Budaya | 2021-01-01 |
508 | The USYD-JD Speech Translation System for IWSLT2021 Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: This paper describes the University of Sydney & JD’s joint submission of the IWSLT 2021 low resource speech translation task. We participated in the Swahili->English direction and … |
Liang Ding; Di Wu; Dacheng Tao; | | 2021-01-01 |
509 | A Contrastive Study Between Machine Translation and Human Translation: Taking Japanese Translation Application As An Example Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: With the development of artificial intelligence, machine translation is making progress. This paper will take Japanese translation application as an example to analyze the … |
Guifang Zhang; Zixi Wei; Rui Zhao; | OALib | 2021-01-01 |
510 | A Study of English Translation Teaching from The Perspective of Ecology Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: This paper conducted a research into English translation teaching at China’s universities from the perspective of ecology. In the ecological environment of translation teaching, … |
Qiong Fang; | 2021 2nd International Conference on Computers, Information … | 2021-01-01 |
511 | Tag Assisted Neural Machine Translation of Film Subtitles Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: We implemented a neural machine translation system that uses automatic sequence tagging to improve the quality of translation. Instead of operating on unannotated sentence pairs, … |
Aren Siekmeier; WonKee Lee; Hongseok Kwon; Jong-Hyeok Lee; | | 2021-01-01 |
512 | An Improved English-to-Mizo Neural Machine Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Machine Translation is an effort to bridge language barriers and misinterpretations, making communication more convenient through the automatic translation of languages. The … |
Candy Lalrempuii; Badal Soni; Partha Pakray; | Transactions on Asian and Low-Resource Language Information … | 2021-01-01 |
513 | Variational Multimodal Machine Translation with Underlying Semantic Alignment Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Capturing the underlying semantic relationships of sentences is helpful for machine translation. Variational neural machine translation approaches provide an effective … |
Xiao Liu; Jing Zhao; Shiliang Sun; Huawen Liu; Hao Yang; | Inf. Fusion | 2021-01-01 |
514 | GX at SemEval-2021 Task 2: BERT with Lemma Information for MCL-WiC Task Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: This paper presents the GX system for the Multilingual and Cross-lingual Word-in-Context Disambiguation (MCL-WiC) task. The purpose of the MCL-WiC task is to tackle the challenge … |
Wanying Xie; | | 2021-01-01 |
515 | Neural Machine Translation for Amharic-English Translation Literature Review Related Patents Related Grants Related Orgs Related Experts Details |
Andargachew Mekonnen Gezmu; Andreas Nürnberger; Tesfaye Bayu Bati; | | 2021-01-01 |
516 | Machine Translation from Text to Sign Language: A Systematic Review Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: An equal opportunity for all is the basic right of every human being. The deaf society of the world needs to have access to all the information just like hearing people do. For … |
Navroz Kaur Kahlon; Williamjeet Singh; | Universal Access in the Information Society | 2021-01-01 |
517 | Corpora Compilation for Prosody-informed Speech Processing Literature Review Related Patents Related Grants Related Orgs Related Experts Details Abstract: Research on speech technologies necessitates spoken data, which is usually obtained through read recorded speech, and specifically adapted to the research needs. When the aim is … |
Alp Öktem; Mireia Farrús; Antonio Bonafonte; | Lang. Resour. Evaluation | 2021-01-01 |
518 | Transformer-IC: The Solution to Information Loss Abstract: With the development of information technology, machine translation technologies play a crucial role in cross-language communication. However, there is a problem of information … |
Zhigang Song; Jiazhao Chai; Wenqian Shang; Guo Yuning; | 2021 IEEE/ACIS 19th International Conference on Computer … | 2021-01-01 |
519 | Revealing Translation Techniques Applied in The Translation of Batik Motif Names in See Instagram Abstract: This article discusses one of the forms of machine translation, the Instagram translation feature called “see translation”. The research is focused on the translation techniques … |
Dyah Raina Purwaningsih; Ika Maratus Sholikhah; Erna Wardani; | Celt: A Journal of Culture, English Language Teaching & … | 2021-01-01 |
520 | Multilingual Translation from Denoising Pre-Training IF:3 Abstract: Recent work demonstrates the potential of training one model for multilingual machine translation. In parallel, denoising pretraining using unlabeled monolingual data as a … |
Yuqing Tang et al. | | 2021-01-01 |
521 | Transforming Term Extraction: Transformer-Based Approaches to Multilingual Term Extraction Across Domains Abstract: Automated Term Extraction (ATE), even though well-investigated, continues to be a challenging task. Approaches conventionally extract terms on corpus or document level and the … |
Christian Lang; Lennart Wachowiak; Barbara Heinisch; Dagmar Gromann; | | 2021-01-01 |
522 | Context Based Machine Translation with Recurrent Neural Network for English-Amharic Translation Abstract: Context-aware machine translation approaches improve the quality of translation by incorporating the context of the surrounding phrases in the translation of a phrase. So far, for … |
Yeabsira Asefa Ashengo; Rosa Tsegaye Aga; Surafel Lemma Abebe; | Mach. Transl. | 2021-01-01 |
523 | Probing Multi-modal Machine Translation with Pre-trained Language Model Abstract: Multi-modal machine translation (MMT) aimed at using images to help disambiguate the target during translation and improving robustness, but some recent works showed that the … |
Yawei Kong; Kai Fan; | | 2021-01-01 |
524 | Overview of Machine Translation Development Abstract: Access to information is increasingly global, which brings with it the growth in a non-English speaking public and, as such, a demand for tools that allow users to access this … |
Irene Rivera-Trigueros; María-Dolores Olvera-Lobo; Juncal Gutiérrez-Artacho; | | 2021-01-01 |
525 | Design and Testing of Automatic Machine Translation System Based on Chinese-English Phrase Translation Abstract: With the development of linguistics and the improvement of computer performance, the effect of machine translation is getting better and better, and it is widely used. The … |
Jing Ning; Haidong Ban; | Mobile Information Systems | 2021-01-01 |
526 | Improving German Image Captions Using Machine Translation and Transfer Learning |
Rajarshi Biswas; Michael Barz; Mareike Hartmann; Daniel Sonntag; | | 2021-01-01 |
527 | Improving Neural Machine Translation Using Gated State Network and Focal Adaptive Attention Network Abstract: The currently predominant token-to-token attention mechanism has demonstrated its ability to capture word dependencies in neural machine translation. This mechanism treats a … |
Li Huang; Wenyu Chen; Yuguo Liu; He Zhang; Hong Qu; | Neural Comput. Appl. | 2021-01-01 |
528 | A Comprehensive Survey on Machine Translation for English, Hindi and Sanskrit Languages Abstract: Transforming text from one language to another by using computer systems automatically or with little human interventions is known as Machine Translation System (MTS). Divergence … |
Sitender; Seema Bawa; Munish Kumar; Sangeeta; | Journal of Ambient Intelligence and Humanized Computing | 2021-01-01 |
529 | Leveraging Machine Translation to Support Distributed Teamwork Between Language-Based Subgroups: The Effects of Automated Keyword Tagging Abstract: Modern teamwork often happens between subgroups located in different countries. Members of the same subgroup prefer to communicate in their native language for efficiency, which … |
Yongle Zhang et al. | Extended Abstracts of the 2021 CHI Conference on Human … | 2021-01-01 |
530 | The Development of A Comprehensive Data Set for Systematic Studies of Machine Translation Abstract: This paper presents our on-going efforts to develop a comprehensive data set and benchmark for machine translation beyond high-resource languages. The current release includes … |
Jörg Tiedemann; | | 2021-01-01 |
531 | Improving Transformer-Based Neural Machine Translation with Prior Alignments Abstract: Transformer is a neural machine translation model which revolutionizes machine translation. Compared with traditional statistical machine translation models and other neural … |
Thien Nguyen; Lam Nguyen; Phuoc Tran; Huu Nguyen; | Complex. | 2021-01-01 |
532 | CoMeT: Towards Code-Mixed Translation Using Parallel Monolingual Sentences Abstract: Code-mixed languages are very popular in multilingual societies around the world, yet the resources lag behind to enable robust systems on such languages. A major contributing … |
Devansh Gautam et al. | | 2021-01-01 |
533 | Kannada to English Machine Translation Using Deep Neural Network Abstract: In this paper, we focus on the unidirectional translation of Kannada text to English text using Neural Machine Translation … |
Pushpalatha Kadavigere Nagaraj; Kshamitha Shobha Ravikumar; Mydugolam Sreenivas Kasyap; Medhini Hullumakki Srinivas Murthy; Jithin Paul; | Ingénierie des Systèmes d'Inf. | 2021-01-01 |
534 | IITP-MT at CALCS2021: English to Hinglish Neural Machine Translation Using Unsupervised Synthetic Code-Mixed Parallel Corpus Abstract: This paper describes the system submitted by IITP-MT team to Computational Approaches to Linguistic Code-Switching (CALCS 2021) shared task on MT for English→Hinglish. We submit a … |
Ramakrishna Appicharla; Kamal Kumar Gupta; Asif Ekbal; Pushpak Bhattacharyya; | | 2021-01-01 |
535 | Pipeline Signed Japanese Translation Focusing on A Post-positional Particle Complement and Conjugation in A Low-resource Setting Abstract: Because sign language is a visual language, the translation of it into spoken language is typically performed through an intermediate representation called gloss notation. In sign … |
Ken Yano; Akira Utsumi; | | 2021-01-01 |
536 | Preordering Encoding on Transformer for Translation Abstract: The difference in word orders between source and target languages is a serious hurdle for machine translation. Preordering methods, which reorder the words in a source sentence … |
Yuki Kawara; Chenhui Chu; Yuki Arase; | IEEE/ACM Transactions on Audio, Speech, and Language … | 2021-01-01 |
537 | MAIN DIFFICULTIES IN TRANSLATING CONTRACTUAL DOCUMENTATION (ENGLISH/RUSSIAN) Literature Review Related Patents Related Grants & |