This paper was accepted at the Federated Learning in the Age of Foundation Models workshop at NeurIPS 2023.
While automatic speech recognition (ASR) has witnessed remarkable achievements in recent years, it has not garnered widespread attention within the federated learning (FL) and differential privacy (DP) communities. Meanwhile, ASR is also a well-suited benchmark for FL and DP as there is (i) a natural data split across users by using speaker information; (ii) heterogeneous data across speakers, close to practical settings; (iii) interplay between acoustic and language modeling; and (iv) it is a sequence-to-sequence task. Recent production-ready state-of-the-art models in ASR include \textit{large} conformer and transformer models, optimization of which is known to pose challenges even for central training. While the main trends and benchmarks in FL and DP focus on \textit{small} models, we show the necessity of disentangling optimization and model size: the behaviour of FL and DP for \textit{large} models is different from that for \textit{small} models. We speculate that FL and DP are harder for \textit{small} models due to a harder optimization problem, even in central training. In this paper, we analyze the key FL parameters (optimizers, training from scratch or from a seed model pre-trained centrally, cohort size, data heterogeneity) and propose the \textit{first} benchmark of \textit{FL with DP} in the context of \textit{large} models in ASR. We examine the applicability of prior results and present an overview of observed departures from the trends in prior works and from training different ASR models. Through this work, we provide researchers and practitioners in the fields of FL and DP with valuable insights into the fundamental differences that may arise when applying FL and DP research to large-scale ASR training.