When you think of voice assistants, you probably think of Google Assistant, Apple's Siri, and Amazon's Alexa – but that could all change, thanks to Mozilla's Common Voice Library project.
The Common Voice Library (opens in new tab), which was released today, is an open source collection of human voices speaking in 18 different languages, including Dutch, Esperanto, Farsi, Welsh, and Hakha-Chin.
- Check out the best smart speakers of 2019
- The best Alexa skills and commands
- Our guide to whether you're better off with an Amazon Echo or Google Home.
While Mozilla has been working on the project for a few years now, the Common Voice Library has grown enormously over the last year – a report by Engadget (opens in new tab) says that "the collection is composed of 1,400 hours of recorded voice clips from 42,000 contributors."
When Mozilla launched the library in 2017, it explained that its goal was to “build a speech corpus that's free, open source, and big enough to create meaningful products with.”
Speaking your language
After introducing non-English languages last year, digital strategist Michael Henretty explained in a blog post (opens in new tab) how the project could help small or independent developers across the world to incorporate voice recognition technology into their products:
“Going multilingual marks a big step for Common Voice, and we hope that it’s also a big step for speech technology in general. Democratizing voice technology will not only lower the barrier for global innovation, but also the barrier for access to information.”
By providing a huge library of human voices in a range of languages for free, Mozilla could be opening the doors for companies that don't have the resources of Apple, Amazon, and Google, to develop their own voice assistants.
Whether this will lead to an influx of new voice assistants and voice-enabled devices remains to be seen, but having this open linguistic repository could be extremely useful for language researchers as well as tech developers in years to come.
Via Engadget (opens in new tab)