CommonVoice: Difference between revisions

From MozillaWiki
Jump to navigation Jump to search
(removed old logos)
 
(7 intermediate revisions by 2 users not shown)
Line 1: Line 1:
== What is Common Voice ==
== What is Common Voice ==


voice.mozilla.org
Mozilla Common Voice is an initiative to help teach machines how real people speak.


This project is an effort to bridge the digital speech divide. Voice recognition technologies bring a human dimension to our devices, but developers need an enormous amount of voice data to build them. Currently, most of that data is expensive and proprietary.


== Materials & Assets to Use and Remix ==
We want to make voice data freely and publicly available, and make sure the data represents the diversity of real people. Together we can make voice recognition better for everyone.
If you make something, please add it below!
 
You can contribute today on [https://commonvoice.mozilla.org/ Common Voice].
 
== How does Common Voice work? ==
 
We’re crowdsourcing an open-source dataset of voices, to start and support languages on Common Voice the following steps are made.
 
1. [https://github.com/common-voice/common-voice/issues/new?assignees=Heyhillary&labels=Type%3A+localization&template=language_request.md&title=LOCALIZATION+REQUEST%3A+ New language request] and localisation of Common Voice platform via Pontoon
 
2. Collecting and validating public domain sentences via the [https://commonvoice.mozilla.org/sentence-collector/#/ sentence collector], [https://github.com/common-voice/cv-sentence-extractor sentence extractor] or [https://common-voice.github.io/community-playbook/sub_pages/cc0waiver_process.html CC0 text waiver agreement]. 
 
3. Recording and validating the recordings of the sentences on the [https://commonvoice.mozilla.org/ Common Voice platform]
 
4. Repeating this process to grow the size of the dataset
 
5. [https://commonvoice.mozilla.org/en/datasets Generating a dataset] which is released by the [https://github.com/common-voice/cv-dataset Common Voice team]
 
This dataset can then be [https://discourse.mozilla.org/t/talk-to-us-how-are-you-using-common-voice/82005 used by developers] to create voice-enabled technologies.
 
== Common Voice Communities ==
 
Common Voice wouldn’t be possible without our language communities. As of September 2021, we have [https://commonvoice.mozilla.org/en/languages 80 languages] launched for voice data collection.
 
'''Community Playbook'''
 
Language community members and organisers; mobilise participation, provide valuable feedback and inspire us as a team. Our [https://common-voice.github.io/community-playbook/ Community Playbook] outlines how communities participate in Common Voice.
 
'''Communications Channels'''
 
To support our communities our two main channels are [https://discourse.mozilla.org/c/voice discourse] for group and topical discussions and [https://chat.mozilla.org/#/room/#common-voice:mozilla.org matrix] for community chats. Our communities also have their own communication channels to help with [https://github.com/common-voice/common-voice/blob/main/docs/COMMUNITIES.md self-organising].
 
We share [https://discourse.mozilla.org/t/weekly-update-thread-2021/84411 weekly updates] from the Common Voice Team on discourse, coordinated by Hillary, Common Voice Community Manager.


* Mini Business Card (90x40mm) | [https://www.dropbox.com/s/l121xkvc3ghlofa/NameCard%20-%20Common%20Voice_ref_qr.ai.pdf?dl=0 Illustrator CC (editable PDF)]
'''Community Sessions and Council'''
[[File:Common Voice - Mini Business Card.png|thumb|none]]


* Tabletop Tent Sign (294mm x 210mm Half fold) | [https://www.dropbox.com/s/e09k0nvnft9aln3/Common%20Voice%20-%20Tabletop%20Tent%20Sign.pages?dl=0 Apple Pages] [https://www.dropbox.com/s/c8vjs7oi08zxse8/Common%20Voice%20-%20Tabletop%20Tent%20Sign_300dpi.png?dl=0 PNG]
As part of our Community strategy, we seek to build on and create new ways to support our language communities. So far as part of this strategy we have; hosted [https://discourse.mozilla.org/t/common-voice-roadmap-2021/85340 Community Sessions on the Common Voice Roadmap], open discourse discussions on [https://discourse.mozilla.org/t/recognition-rewards-and-contribution-pathways/84408/8 Reward and recognition] and launched V1.2 Common Voice Community Playbook.  
[[File:Common Voice - Tabletop Tent Sign 300dpi.png|thumb|none]]


* X Banner Stand (60cm x 160cm) | [https://www.dropbox.com/s/870p10i0lwjttna/Common%20Voice%20%EF%BC%B8Banner%2060x160.pages?dl=0 Apple Pages] [https://www.dropbox.com/s/j51jc79sbfvvmj6/Common%20Voice%20%EF%BC%B8Banner%2060x160.pdf?dl=0 PDF] [https://www.dropbox.com/s/wraqe3lf5xqu4jx/Common%20Voice%20%EF%BC%B8Banner%2060x160.png?dl=0 PNG]
To further support communal voice, we would like to trial out a Common Voice reps council to support the community to have even more say in important decisions. Learn more about the Common Voice Reps programme and how you can apply today on the [https://discourse.mozilla.org/t/apply-to-be-a-common-voice-reps-expression-of-interest/85326 Common Voice Discourse].
[[File:Common Voice XBanner 60x160.png|120px|thumb|none]]


* X Banner Stand - Event (60cm x 160cm) | [https://www.dropbox.com/s/b7w0jjr8krwlu18/Common%20Voice%20%EF%BC%B8Banner%20-%20Event%2060x160.pages?dl=0 Apple Pages] [https://www.dropbox.com/s/4ev7l99thth7wtx/Common%20Voice%20%EF%BC%B8Banner%20-%20Event%2060x160.pdf?dl=0 PDF] [https://www.dropbox.com/s/2wix7e9st84fsj7/Common%20Voice%20%EF%BC%B8Banner%20-%20Event%2060x160.png?dl=0 PNG]
== Materials & Assets to Use and Remix ==
[[File:Common Voice XBanner - Event 60x160.png|120px|thumb|none]]


* (zh-tw) Community Channel Flyer (A5) | [https://www.dropbox.com/s/n63s14b01iqzfv9/Common%20Voice%20%E5%8F%B0%E7%81%A3%E9%A0%BB%E9%81%93%E6%A1%8C%E7%89%8C%20A5.pages?dl=0 Apple Pages] [https://www.dropbox.com/s/iwjz66jb30o9itk/Common%20Voice%20%E5%8F%B0%E7%81%A3%E9%A0%BB%E9%81%93%E6%A1%8C%E7%89%8C%20A5.pdf?dl=0 PDF]
Common Voice assets and presentations can be viewed on our [https://drive.google.com/drive/folders/15eh2FIlgDZSQgnWGt3JDqKMA1fWRpdnP shared drive]
[[File:Common Voice - Community Channnels Flyer.png|thumb|none]]

Latest revision as of 14:20, 21 September 2021

What is Common Voice

Mozilla Common Voice is an initiative to help teach machines how real people speak.

This project is an effort to bridge the digital speech divide. Voice recognition technologies bring a human dimension to our devices, but developers need an enormous amount of voice data to build them. Currently, most of that data is expensive and proprietary.

We want to make voice data freely and publicly available, and make sure the data represents the diversity of real people. Together we can make voice recognition better for everyone.

You can contribute today on Common Voice.

How does Common Voice work?

We’re crowdsourcing an open-source dataset of voices, to start and support languages on Common Voice the following steps are made.

1. New language request and localisation of Common Voice platform via Pontoon

2. Collecting and validating public domain sentences via the sentence collector, sentence extractor or CC0 text waiver agreement.

3. Recording and validating the recordings of the sentences on the Common Voice platform

4. Repeating this process to grow the size of the dataset

5. Generating a dataset which is released by the Common Voice team

This dataset can then be used by developers to create voice-enabled technologies.

Common Voice Communities

Common Voice wouldn’t be possible without our language communities. As of September 2021, we have 80 languages launched for voice data collection.

Community Playbook

Language community members and organisers; mobilise participation, provide valuable feedback and inspire us as a team. Our Community Playbook outlines how communities participate in Common Voice.

Communications Channels

To support our communities our two main channels are discourse for group and topical discussions and matrix for community chats. Our communities also have their own communication channels to help with self-organising.

We share weekly updates from the Common Voice Team on discourse, coordinated by Hillary, Common Voice Community Manager.

Community Sessions and Council

As part of our Community strategy, we seek to build on and create new ways to support our language communities. So far as part of this strategy we have; hosted Community Sessions on the Common Voice Roadmap, open discourse discussions on Reward and recognition and launched V1.2 Common Voice Community Playbook.

To further support communal voice, we would like to trial out a Common Voice reps council to support the community to have even more say in important decisions. Learn more about the Common Voice Reps programme and how you can apply today on the Common Voice Discourse.

Materials & Assets to Use and Remix

Common Voice assets and presentations can be viewed on our shared drive