VibraXX
Live Quiz Arena
🎁 1 Free Round Daily
⚡ Enter Arena
HomeCategoriesLanguage & CommunicationQuestion
Question
Language & Communication

Why does a language model trained on text-to-speech sometimes produce unexpected emphasis on certain words during spoken output?

A)Incomplete data augmentation degrades signal
B)Prosodic transfer induces pitch errors
C)WaveNet vocoders amplify spectral artifacts
D)Attention mechanisms reduce semantic coherence

💡 Explanation

Prosodic transfer explains the phenomenon because pitch patterns from the training data are inadvertently applied, leading to misplaced emphasis; therefore, the model emphasizes words according to learned, often inappropriate, melodic contours, rather than contextual semantics or syntax. The problem is prosodic transfer, rather than data augmentation.

🏆 Up to £1,000 monthly prize pool

Ready for the live challenge? Join the next global round now.
*Terms apply. Skill-based competition.

⚡ Enter Arena

Related Questions

Browse Language & Communication