hessen.social is one of many independent Mastodon servers you can use to take part in the Fediverse.
hessen.social is the Mastodon community for all Hessians and everyone who feels connected to Hesse.


#voiceai


“Perplexity AI introduced a new voice interface, outshining ChatGPT's feature with a better user experience and more natural conversational flow.
Perplexity's voice feature is more reliable, in contrast to connection issues faced with ChatGPT, and its live transcription enhances the experience.”

howtogeek.com/perplexity-just-

#perplexity
#perplexityai
#aivoice
#voiceai
#voicechatbot

How-To Geek · Perplexity Just Stole ChatGPT's Best Feature and Is Doing a Better Job, by Dibakar Ghosh

For the past couple of years, as each new @mozilla #CommonVoice dataset of #voice #data is released, I've been using @observablehq to visualise the #metadata coverage across the 100+ languages in the dataset.

Version 17 was released yesterday (big ups to the team - EM Lewis-Jong, @jessie, Gina Moape, Dmitrij Feller) and there are some super interesting insights from the visualisation:

➡ Catalan (ca) now has more data in Common Voice than English (en) (!)

➡ The language with the highest average audio utterance duration at nearly 7 seconds is Icelandic (is). Perhaps Icelandic words are longer? I suspect so!

➡ Spanish (es), Bangla (Bengali) (bn), Mandarin Chinese (zh-CN) and Japanese (ja) all have a lot of recorded utterances that have not yet been validated. Albanian (sq) has the highest percentage of validated utterances, followed closely by Erzya / Arisa (myv).

➡ Votic (vot) has the highest percentage of invalidated utterances, but with 76% of utterances invalidated, I wonder if this language has been the target of deliberate invalidation activity (invalidating valid sentences, or recording sentences to be deliberately invalid) given the geopolitical instability in Russia currently.

See the visualisation here and let me know your thoughts below!

➡ observablehq.com/@kathyreid/mo
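
The percentages quoted in the post above (share of validated or invalidated utterances per language) can be illustrated with a minimal sketch. It assumes each locale has counts of validated, invalidated and not-yet-reviewed clips. The class name, field names, locale codes and numbers below are all hypothetical, chosen for illustration, and are not taken from the Common Voice `cv-dataset` repository or from the author's Observable notebook.

```python
from dataclasses import dataclass


@dataclass
class LocaleStats:
    """Hypothetical per-locale clip counts (field names are assumptions)."""
    validated: int
    invalidated: int
    other: int  # recorded but not yet reviewed

    @property
    def total(self) -> int:
        return self.validated + self.invalidated + self.other


def share(count: int, total: int) -> float:
    """Return count as a percentage of total, or 0.0 for an empty locale."""
    return 100.0 * count / total if total else 0.0


def rank_by(stats: dict, field: str) -> list:
    """Sort locale codes by the share of `field`, highest first."""
    return sorted(
        stats,
        key=lambda loc: share(getattr(stats[loc], field), stats[loc].total),
        reverse=True,
    )


# Made-up numbers for three placeholder locales, for illustration only.
stats = {
    "aa": LocaleStats(validated=900, invalidated=50, other=50),
    "bb": LocaleStats(validated=200, invalidated=760, other=40),
    "cc": LocaleStats(validated=300, invalidated=100, other=600),
}

most_validated = rank_by(stats, "validated")[0]
most_invalidated = rank_by(stats, "invalidated")[0]
most_unreviewed = rank_by(stats, "other")[0]
```

Scaling to a percentage rather than comparing raw counts is what makes low-resource languages comparable with large ones, which is the same reason the visualisation offers a view scaled to 100%.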

Observable · Mozilla Common Voice v17 dataset metadata coverage
This visualisation uses "@d3/stacked-horizontal-bar-chart" to visualise the Common Voice metadata coverage. The original data is taken from the Common Voice `cv-dataset` repository.
Table of contents:
Splits by age range - shows how many clips have been provided by speakers of different age ranges for each locale (language)
Splits by age range scaled to 100% - as above, but scaled to 100% so that the metadata coverage of low resource languages is more visible
Splits by gender - shows how many cl…

I use Eleven Labs to read my writing out loud to me in a natural voice, and I noticed a new feature today: Speech to Speech.

You can record or upload audio, and it will create new audio of what you said using one of its generated voices. It uses your intonations and even does laughter. Here's an 8-second example that turned me into "Adam."

The only advantage I can think of for this over plain AI voices is that it can do a wider range of emotions. What else?


Last week, as part of my #PhD program at the #ANU School of #cybernetics, I gave my final presentation, which is a summary of my methods and #research findings. I covered my interview work, the #dataset documentation analysis work I've been doing and my analysis work around #accents in @mozilla's #CommonVoice platform.

There were some insightful and thought-provoking questions from my panel and audience members, and of course - so many ideas for future research inquiry!

A huge thanks to my panel, chaired so well by Professor Alexandra Zafiroglu, to Dr Elizabeth Williams, my meticulous, methodical and always-encouraging Primary Supervisor, and to my co-supervisors Dr Jofish Kaye and Dr Paul Wong 黃仲熙 for their deep expertise in #HCI and #data respectively.

Similarly, a huge thank you to my #PhD cohort - Charlotte Bradley, Tom Chan, Danny Bettay and Sam Backwell - as well as the other cohorts in the School - for your encouragement and intellectual journeying.

#PhDlife #milestone #voiceAI

All the more reason to stop posting publicly about your kids and to stop letting kids have public social media accounts. It will be interesting to see whether this is just a passing scam or whether spoofing kids' voices becomes more prevalent. (Get your safe word prepped now.)
lifehacker.com/don-t-fall-for-

Lifehacker · Don’t Fall for This ‘Virtual Kidnapping’ Scam, by Daniel Oropeza