Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic capabilities through the analysis of their input-output behavior. Unfortunately, existing work on behavioral testing in Machine Translation (MT) is currently restricted to largely handcrafted tests covering a limited range of capabilities and languages. To address this limitation, we propose to use Large Language Models (LLMs) to generate a diverse set of source sentences tailored to test the behavior of MT models in a range of situations. We can then verify whether the MT model exhibits the expected behavior through matching candidate sets that are also generated using LLMs. Our approach aims to make behavioral testing of MT systems practical while requiring only minimal human effort. In our experiments, we apply our proposed evaluation framework to assess multiple available MT systems, revealing that while in general pass rates follow the trends observable from traditional accuracy-based metrics, our method was able to uncover several important differences and potential bugs that go unnoticed when relying only on accuracy.
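The sketch below illustrates the kind of test loop the abstract describes: an LLM generates source sentences probing one capability, the MT system under test translates them, and an LLM-generated candidate set is used to check the output. It is a minimal sketch under stated assumptions, not the paper's implementation: `llm_generate` and `mt_translate` are hypothetical wrappers around whichever LLM and MT APIs are available, the prompts are illustrative, and case-insensitive substring matching stands in for the paper's candidate-matching step.

```python
# Minimal sketch of the behavioral-testing loop. All helper names, prompts,
# and the substring-matching criterion are illustrative assumptions.

from typing import Callable, List


def generate_test_sources(llm_generate: Callable[[str], List[str]],
                          capability: str, n: int) -> List[str]:
    """Ask the LLM for n source sentences probing one linguistic capability."""
    prompt = (f"Write {n} English sentences that test the capability: "
              f"{capability}. One sentence per line.")
    return llm_generate(prompt)[:n]


def generate_candidates(llm_generate: Callable[[str], List[str]],
                        source: str, capability: str) -> List[str]:
    """Ask the LLM for acceptable target-side realizations of the tested unit."""
    prompt = (f"For the sentence '{source}', list acceptable translations "
              f"of the part that tests: {capability}. One per line.")
    return llm_generate(prompt)


def pass_rate(mt_translate: Callable[[str], str],
              llm_generate: Callable[[str], List[str]],
              capability: str, n: int = 100) -> float:
    """A test passes if the MT output matches any generated candidate."""
    sources = generate_test_sources(llm_generate, capability, n)
    passed = 0
    for src in sources:
        hypothesis = mt_translate(src)
        candidates = generate_candidates(llm_generate, src, capability)
        # Simplified matching: the paper's candidate sets could be compared
        # with any stricter criterion here.
        if any(c.lower() in hypothesis.lower() for c in candidates):
            passed += 1
    return passed / max(len(sources), 1)
```

Injecting the LLM and MT system as callables keeps the sketch independent of any particular API; a pass rate per capability, as computed here, is the quantity the experiments compare against traditional accuracy-based metrics.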