Intellego: Difference between revisions

From MozillaWiki
(→‎Resources: Removed reference to GSOC project)
(+Obsolete flag)
 
(12 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{RELEASE_MANAGEMENT_OBSOLETE}}
Intellego is a machine translation project for the benefit of Mozilla and the Open Web.
Intellego is a machine translation project for the benefit of Mozilla and the Open Web.
__NOTOC__
__NOTOC__
== Project details ==
== Project details ==
Intellego is a machine translation (MT) platform that seeks to unify existing open MT projects by providing a single API for engine developers and a unified web service that hosts a number of different language pairs/engines/implementations in the back end. The Intellego platform will allow users to select from a number of open MT engines based on the most prominent MT methodologies in order to find the best target MT output for their on-the-fly translation.
Intellego is a machine translation (MT) initiative that seeks to unify existing open MT projects by providing a single platform for engine developers and a unified web service that hosts a number of different language pairs/engines/implementations in the back end. The Intellego platform will allow users to select from a number of open MT engines based on the most prominent MT methodologies in order to find the best target MT output for their on-the-fly translation. This will be accomplished by partnering with existing open MT projects and hooking into their infrastructures, freeing Mozilla of the requirement to develop and host MT engines and promoting the use of MT engines within the fragmented open ecosystem.
;Breaking closed ecosystems
The current machine translation ecosystem is dominated by proprietary, closed systems. This includes their code base, their data collection processes, and public accessibility to their language resources. Additionally, the open MT ecosystem suffers from being unable to reach the vast majority of participants on the web through web services or APIs.


MT users are limited to using engines that follow a single MT methodology for all language pairs and content types. Studies have shown that a one-size-fits-all approach in MT does not provide the user with optimal translation output. Users need a single access point to different MT engines following different MT methodologies that will produce the best quality output by selecting the right engine for the right language pair. An open MT ecosystem is the best way to accomplish this.
MT users are limited to using engines that follow a single MT methodology for all language pairs and content types. Studies have shown that a one-size-fits-all approach in MT does not provide the user with optimal translation output. Users need a single access point to different MT engines following different MT methodologies that will produce the best quality output by selecting the right engine for the right language pair. Intellego seeks to further establish an open MT ecosystem, as we feel it is the best way to quickly provide high-quality MT services to users on the web at the lowest cost and in a way that engages the open source community.
;Machine translation in Firefox
=== Intellego Platform ===
Google's ability to provide users with automatic translation of web content using Google Translate attracts global users to the Chrome browser. Intellego aims to be to Firefox what Google Translate is to Chrome by powering the automatic translation feature within the browser.
Intellego is a centralized machine translation platform that provides the following services:
;Machine translation at Mozilla
#A unified web service for open source machine translation engines, allowing them front-end accessibility on the Web. This service will function as follows:
Many Mozilla l10n teams consist of only 1-2 people. While they would love to be able to provide l10n coverage for all of the Mozilla support sites (and other projects) they do not have the time or resources to accomplish the task. Users, thus, have a localized Firefox, but lack documented product support in their language.
#*Users insert text into a text field and see it translated on the fly in an accompanying text field.
#*Users enter a URL into a text field; Intellego extracts the text nodes from the DOM, runs them through an MT engine, and returns a rendered page in a new tab containing the target-language text.
#*Users can select their preferred MT engine.
#*Intellego will intelligently determine the most appropriate MT engine given a language pair and content type.
#*Advanced mode allows users to select and arrange MT pipelines utilizing multiple MT engines.
#*Terminology processing utility (from GSOC project) will ensure accurate and consistent terminology translation regardless of MT method.
#An easy and simple API for the following services:
#*A widget for developers to add to their sites for automatic translation by Intellego MT engines.
#*A way for Mozilla l10n tools developers to link directly into hosted open MT services through Intellego.
#Users can post-edit MT output in context and submit it as feedback (leveraging Pontoon modules). The community can vote on suggested post-edits, and the highest-voted feedback is entered into the engine corpora.


Additionally, Mozilla's unique, linguistically diverse community frequently encounters language barriers to participation. Access to a Mozilla-operated MT project can lower the language barrier and create a more inclusive experience for the next million Mozillians.
Explore the wiki for more details about the [[Intellego/Mission|Intellego project's purpose and focus]] as well as the [[Intellego/Goals Milestones|goals and milestones]] for the platform's development and the [https://intellego.etherpad.mozilla.org/tech-spec-process-model platform technical spec (in progress)].
;Advancements in MT research
Language support selection for machine translation projects is driven, in part, by ROI and availability of resources. This often results in minority languages, and even some majority languages (see Indic languages), being under-represented in the machine translation ecosystem. As long as ROI continues to be a primary motivator for incorporating support for these languages, they will remain under-represented and unsupported.
 
 
Explore the wiki for more details about the [https://wiki.mozilla.org/Intellego/Mission Intellego project's purpose and focus.]
 
== Project meetings ==
 
The Intellego team [[/Meetings/Status|meets every week]] to discuss the progress of the project.
 
We also occasionally have [[/Meetings/Sprints|sprint meetings]], where we work on a particular aspect of the project for a long stretch of time.
 
For more information about meetings, see our [[/Meetings|meetings page]].


== Resources ==
== Resources ==
 
* [[Intellego/Goals_Milestones|Project goals and milestones]]
* [[Intellego/Research|MT Research]]
* [[Intellego/Research|Our MT Research]]
* [[Intellego/Mission|Project philosophy]]
* [http://intellego.etherpad.mozilla.org/ Intellego team etherpad]
* [http://intellego.etherpad.mozilla.org/ Intellego team etherpad]
* Related effort: [[mw:Content translation]], [[mw:Content translation/cxserver|cxserver]]


== Team ==
== Team ==
Line 45: Line 42:
; {{Mozillian|mekki|Mekki MacAulay}} (mekki)
; {{Mozillian|mekki|Mekki MacAulay}} (mekki)
: Strategic management, partnerships, grants, business collaboration, evangelism.
: Strategic management, partnerships, grants, business collaboration, evangelism.
== Project meetings ==
The Intellego team [[/Meetings/Status|meets every week]] to discuss the progress of the project.
We also occasionally have [[/Meetings/Sprints|sprint meetings]], where we work on a particular aspect of the project for a long stretch of time.
For more information about meetings, see our [[/Meetings|meetings page]].


== Discussion ==
== Discussion ==
Line 50: Line 55:
* IRC: {{IRC|intellego}}
* IRC: {{IRC|intellego}}
* Newsgroup: [http://www.mozilla.org/about/forums/#tools-l10n mozilla.tools.l10n]
* Newsgroup: [http://www.mozilla.org/about/forums/#tools-l10n mozilla.tools.l10n]
* [https://discourse.mozilla-community.org/c/intellego Forum]

Latest revision as of 13:48, 2 January 2019

Warning: The content of this page is obsolete and kept for archiving purposes of past processes.

Intellego is a machine translation project for the benefit of Mozilla and the Open Web.

Project details

Intellego is a machine translation (MT) initiative that seeks to unify existing open MT projects by providing a single platform for engine developers and a unified web service that hosts a number of different language pairs/engines/implementations in the back end. The Intellego platform will allow users to select from a number of open MT engines based on the most prominent MT methodologies in order to find the best target MT output for their on-the-fly translation. This will be accomplished by partnering with existing open MT projects and hooking into their infrastructures, freeing Mozilla of the requirement to develop and host MT engines and promoting the use of MT engines within the fragmented open ecosystem.

MT users are limited to using engines that follow a single MT methodology for all language pairs and content types. Studies have shown that a one-size-fits-all approach in MT does not provide the user with optimal translation output. Users need a single access point to different MT engines following different MT methodologies that will produce the best quality output by selecting the right engine for the right language pair. Intellego seeks to further establish an open MT ecosystem, as we feel it is the best way to quickly provide high-quality MT services to users on the web at the lowest cost and in a way that engages the open source community.

Intellego Platform

Intellego is a centralized machine translation platform that provides the following services:

  1. A unified web service for open source machine translation engines, allowing them front-end accessibility on the Web. This service will function as follows:
    • Users insert text into a text field and see it translated on the fly in an accompanying text field.
    • Users enter a URL into a text field; Intellego extracts the text nodes from the DOM, runs them through an MT engine, and returns a rendered page in a new tab containing the target-language text.
    • Users can select their preferred MT engine.
    • Intellego will intelligently determine the most appropriate MT engine given a language pair and content type.
    • Advanced mode allows users to select and arrange MT pipelines utilizing multiple MT engines.
    • Terminology processing utility (from GSOC project) will ensure accurate and consistent terminology translation regardless of MT method.
  2. An easy and simple API for the following services:
    • A widget for developers to add to their sites for automatic translation by Intellego MT engines.
    • A way for Mozilla l10n tools developers to link directly into hosted open MT services through Intellego.
  3. Users can post-edit MT output in context and submit it as feedback (leveraging Pontoon modules). The community can vote on suggested post-edits, and the highest-voted feedback is entered into the engine corpora.
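
As an illustration only, the engine selection and URL translation steps in the list above might be sketched in Python roughly as follows. Nothing here comes from the Intellego technical spec: the names EngineRegistry, TextNodeCollector and translate_text_nodes, the toy engines, and the "web" content type label are all hypothetical. The sketch assumes an engine is just a function taking text, a source language, and a target language.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical engine signature: (text, source_lang, target_lang) -> translation.
Engine = Callable[[str, str, str], str]


@dataclass
class EngineRegistry:
    """Picks an engine for a (source, target, content_type) triple (assumed design)."""
    engines: Dict[str, Engine]
    preferences: Dict[Tuple[str, str, str], str]
    default: str

    def select(self, source: str, target: str, content_type: str,
               user_choice: Optional[str] = None) -> Engine:
        # An explicit, known user choice wins over automatic selection.
        if user_choice in self.engines:
            return self.engines[user_choice]
        # Otherwise use the preferred engine for this triple, or the default.
        name = self.preferences.get((source, target, content_type), self.default)
        return self.engines[name]


class TextNodeCollector(HTMLParser):
    """Collects non-empty text nodes, skipping <script> and <style> content."""

    def __init__(self) -> None:
        super().__init__()
        self.nodes: List[str] = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and not self._skip_depth:
            self.nodes.append(text)


def translate_text_nodes(html: str, engine: Engine,
                         source: str, target: str) -> List[str]:
    """Extracts the text nodes from an HTML string and translates each one."""
    collector = TextNodeCollector()
    collector.feed(html)
    collector.close()
    return [engine(node, source, target) for node in collector.nodes]


# Toy stand-ins for real MT engines; they only tag the input text.
def rule_based(text: str, source: str, target: str) -> str:
    return f"[rule {source}->{target}] {text}"


def statistical(text: str, source: str, target: str) -> str:
    return f"[stat {source}->{target}] {text}"


if __name__ == "__main__":
    registry = EngineRegistry(
        engines={"rule_based": rule_based, "statistical": statistical},
        preferences={("es", "en", "web"): "statistical"},
        default="rule_based",
    )
    engine = registry.select("es", "en", "web")
    page = "<body><p>Hola</p><script>var x;</script><p>mundo</p></body>"
    print(translate_text_nodes(page, engine, "es", "en"))
```

A real service would also need to write the translated strings back into the DOM and render the page. The sketch stops at producing the translated text nodes.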

Explore the wiki for more details about the Intellego project's purpose and focus as well as the goals and milestones for the platform's development and the platform technical spec (in progress).

Resources

Team

These are the members of our Intellego team, with a brief overview of their relevant skills:

Jeff Beatty (gueroJeff)
Team lead.
Localization, organization, programming.
Majken Connor (Kensie)
Community outreach, evangelism.
Gordon P. Hemsley (GPHemsley)
Linguistics, programming, BCP 47 (language tags).
Mekki MacAulay (mekki)
Strategic management, partnerships, grants, business collaboration, evangelism.

Project meetings

The Intellego team meets every week to discuss the progress of the project.

We also occasionally have sprint meetings, where we work on a particular aspect of the project for a long stretch of time.

For more information about meetings, see our meetings page.

Discussion