Thu 12 Nov 2020 08:07 - 08:08 at Virtual room 2 - ML Testing 2

Machine translation software has become heavily integrated into our daily lives due to recent improvements in the performance of deep neural networks. However, machine translation software has been shown to regularly return erroneous translations, which can lead to harmful consequences such as economic loss and political conflicts. Additionally, due to the complexity of the underlying neural models, testing machine translation systems presents new challenges. To address this problem, we introduce a novel methodology called PatInv. The main intuition behind PatInv is that sentences with different meanings should not have the same translation. Under this general idea, we provide two realizations of PatInv that, given an arbitrary sentence, generate syntactically similar but semantically different sentences by (1) replacing one word in the sentence using a masked language model or (2) removing one word or phrase from the sentence based on its constituency structure. We then test whether the returned translations are the same for the original and modified sentences. We have applied PatInv to test Google Translate and Bing Microsoft Translator using 200 English sentences. Two language settings are considered: English-Hindi (En-Hi) and English-Chinese (En-Zh). The results show that PatInv can accurately find 308 erroneous translations in Google Translate and 223 erroneous translations in Bing Microsoft Translator, most of which cannot be found by state-of-the-art approaches.
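The check described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `drop_one_word` is a naive stand-in for the constituency-based removal step, `toy_translate` is a hypothetical translator invented for this example (it silently ignores the word "not"), and the comparison is plain string equality. Deleting an arbitrary word does not always change the meaning, so a real pipeline would need the masked language model or constituency parse mentioned above, plus filtering of the generated sentences.

```python
from typing import Callable, Iterable, List, Tuple


def drop_one_word(sentence: str) -> List[str]:
    """Naive variant generator (assumption): drop each word in turn."""
    words = sentence.split()
    return [" ".join(words[:i] + words[i + 1:]) for i in range(len(words))]


def suspicious_pairs(
    sentence: str,
    variants: Iterable[str],
    translate: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Return (variant, translation) pairs whose translation is identical
    to the translation of the original sentence."""
    reference = translate(sentence)
    return [
        (variant, reference)
        for variant in variants
        if variant != sentence and translate(variant) == reference
    ]


def toy_translate(text: str) -> str:
    """Hypothetical buggy translator: upper-cases words and ignores 'not'."""
    return " ".join(word.upper() for word in text.split() if word != "not")


if __name__ == "__main__":
    source = "I do not like tea"
    for variant, translation in suspicious_pairs(
        source, drop_one_word(source), toy_translate
    ):
        print(f"{source!r} and {variant!r} both translate to {translation!r}")
```

In this toy run the only flagged variant is "I do like tea", which differs in meaning from the source sentence yet receives the same output. Passing the translator as a callable keeps the check independent of any particular translation service.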

Thu 12 Nov

Displayed time zone: (UTC) Coordinated Universal Time

08:00 - 08:30
08:00
2m
Talk
DeepSearch: A Simple and Effective Blackbox Attack for Deep Neural Networks
Research Papers
Fuyuan Zhang MPI-SWS, Germany, Sankalan Pal Chowdhury MPI-SWS, Germany, Maria Christakis MPI-SWS
DOI
08:03
1m
Talk
Machine Learning Based Test Data Generation for Safety-critical Software
Paper Presentations
Ján Čegiň Faculty of Informatics and Information Technologies Slovak Technical University
08:05
1m
Talk
Machine Learning Testing: Survey, Landscapes and Horizons
Journal First
Jie M. Zhang University College London, UK, Mark Harman University College London, UK, Lei Ma Kyushu University, Yang Liu Nanyang Technological University, Singapore
08:07
1m
Talk
Machine Translation Testing via Pathological Invariance
Research Papers
Shashij Gupta IIT Bombay, India, Pinjia He ETH Zurich, Switzerland, Clara Meister ETH Zurich, Switzerland, Zhendong Su ETH Zurich
DOI
08:09
1m
Talk
Model-Based Exploration of the Frontier of Behaviours for Deep Learning System Testing
Research Papers
Vincenzo Riccio USI Lugano, Switzerland, Paolo Tonella USI Lugano, Switzerland
DOI
08:11
1m
Talk
PRODeep: A Platform for Robustness Verification of Deep Neural Networks
Tool Demos
Renjue Li Institute of Software at Chinese Academy of Sciences, China, Jianlin Li Institute of Software at Chinese Academy of Sciences, China, Cheng-Chao Huang Institute of Intelligent Software, China, Pengfei Yang Institute of Software at Chinese Academy of Sciences, China, Xiaowei Huang University of Liverpool, Lijun Zhang Institute of Software, Chinese Academy of Sciences, Bai Xue Institute of Software at Chinese Academy of Sciences, China, Holger Hermanns Saarland University
DOI
08:13
1m
Talk
Testing Machine Learning Code using Polyhedral Region
Visions and Reflections
Md Sohel Ahmed National Institute of Informatics, Japan, Fuyuki Ishikawa National Institute of Informatics, Mahito Sugiyama National Institute of Informatics, Japan
DOI
08:15
15m
Talk
Conversations on ML Testing 2
Paper Presentations
Fuyuan Zhang MPI-SWS, Germany, Ján Čegiň Faculty of Informatics and Information Technologies Slovak Technical University, Mark Harman University College London, UK, Renjue Li Institute of Software at Chinese Academy of Sciences, China, Shashij Gupta IIT Bombay, India, Vincenzo Riccio USI Lugano, Switzerland, M: Shin Yoo Korea Advanced Institute of Science and Technology