Saturday, May 10, 2025

Assessing ASR efficiency with which means preservation

Which means preservation as a substitute metric

Our analysis leveraged the Mission Euphonia corpus, a repository of disordered speech encompassing over 1.2 million utterances from roughly 2,000 people with numerous speech impairments. To broaden information assortment to Spanish audio system, Mission Euphonia partnered with the Worldwide Alliance of ALS/MND Associations, which facilitated the contribution of speech samples from people residing with ALS in Mexico, Colombia, and Peru. Equally, Mission Euphonia expanded to French audio system by way of a partnership with Romain Gombert from the Paris Mind Institute to gather information from individuals with atypical speech in France.

For our experiments, we generated a dataset of 4,731 examples consisting of floor reality and transcription error pairs together with a human label figuring out whether or not these pairs could be which means preserving or not (see particulars in our paper). We cut up the dataset into coaching, take a look at, and validation units (80% / 10% / 10%, respectively) making certain the three units wouldn’t overlap on the bottom reality phrase stage.

With this information, we skilled a classifier for which means preservation on prime of a base LLM. Utilizing prompt-tuning — a parameter-efficient technique to adapt LLMs — we conditioned our base LLM on our coaching set to foretell the labels “sure” or “no” to point whether or not the which means has been preserved or not.

We use the next format to symbolize the information to the LLM:

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles