
Hearing Impairments Translation Personal Assistant


HI-TransPA: Hearing Impairments Translation Personal Assistant, by Zhiming Ma and 12 other authors

Abstract: Hearing-impaired individuals often face significant barriers in daily communication due to the inherent challenges of producing clear speech. To address this, we introduce the Omni-Model paradigm into assistive technology and present HI-TransPA, an instruction-driven audio-visual personal assistant. The model fuses indistinct speech with lip dynamics, enabling both translation and dialogue within a single multimodal framework. To address the distinctive pronunciation patterns of hearing-impaired speech and the limited adaptability of existing models, we develop a multimodal preprocessing and curation pipeline that detects facial landmarks, stabilizes the lip region, and quantitatively evaluates sample quality. These quality scores guide a curriculum learning strategy that first trains on clean, high-confidence samples and progressively incorporates harder cases to strengthen model robustness. Architecturally, we employ a novel unified 3D-Resampler to efficiently encode the lip dynamics, which is critical for accurate interpretation. Experiments on the purpose-built HI-Dialogue dataset show that HI-TransPA achieves state-of-the-art performance in both literal accuracy and semantic fidelity. Our work establishes a foundation for applying Omni-Models to assistive communication technology, providing an end-to-end modeling framework and essential processing tools for future research.
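
The quality-guided curriculum described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the stages are built, so everything here is an assumption made for illustration: the function name `curriculum_stages`, the number of stages, and the choice to make each stage a cumulative "top fraction" of samples ranked by quality score. It is not the authors' implementation.

```python
import math
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def curriculum_stages(
    samples: Sequence[T],
    quality: Sequence[float],
    n_stages: int = 3,
) -> List[List[T]]:
    """Split samples into cumulative training stages, cleanest first.

    Hypothetical helper: stage k holds the top k/n_stages fraction of
    samples by quality score, so later stages add the harder cases
    while keeping the clean ones.
    """
    if len(samples) != len(quality):
        raise ValueError("samples and quality must have the same length")
    if n_stages < 1:
        raise ValueError("n_stages must be at least 1")

    # Rank sample indices from highest to lowest quality score.
    order = sorted(range(len(samples)), key=lambda i: quality[i], reverse=True)

    stages: List[List[T]] = []
    for k in range(1, n_stages + 1):
        # Cumulative cutoff: the final stage always covers every sample.
        cutoff = math.ceil(len(order) * k / n_stages)
        stages.append([samples[i] for i in order[:cutoff]])
    return stages


if __name__ == "__main__":
    clips = ["clip_a", "clip_b", "clip_c", "clip_d"]  # placeholder sample IDs
    scores = [0.9, 0.2, 0.7, 0.5]                     # placeholder quality scores
    for stage_no, stage in enumerate(curriculum_stages(clips, scores, 2), 1):
        print(stage_no, stage)
```

A training loop would then iterate over the stages in order, training for some number of epochs on each before moving to the next. Whether the actual pipeline uses cumulative stages, disjoint buckets, or a continuous sampling schedule is not stated in the abstract.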

Submission history

From: Peidong Wang
[v1] Thu, 13 Nov 2025 03:27:39 UTC (17,234 KB)
[v2] Fri, 14 Nov 2025 18:05:10 UTC (17,125 KB)
