Community:SummerOfCode11:MediaWikiParser: Difference between revisions

Link to updated design doc.
(Created page with "MediaWiki is the dominant wiki syntax in the wild, largely due to the influence of Wikipedia. It was chosen as the language for SUMO's knowledge base during our migration from Ti...")
Line 4:
# Right now, it's impossible to translate wiki snippets (forum posts, for example) to plain text for inclusion in emails.

We propose to implement a new MediaWiki parser using proper parsing techniques: generating a parse tree, manipulating it, and then outputting (at least initially) HTML. Erik Rose has already done some research toward this: see the continually developing [http://github.com/erikrose/mediawiki-parser/blob/master/design.txt design document] and some [http://github.com/erikrose/mediawiki-parser/blob/master/lexer.py initial] [http://github.com/erikrose/mediawiki-parser/blob/master/parser.py code].
We propose to implement a new MediaWiki parser using proper parsing techniques: generating a parse tree, manipulating it, and then outputting (at least initially) HTML. Erik Rose has already done some research toward this: see the continually developing [http://github.com/erikrose/mediawiki-parser/blob/master/README.rst design document] and some [http://github.com/erikrose/mediawiki-parser/blob/master/lexer.py initial] [http://github.com/erikrose/mediawiki-parser/blob/master/parser.py code].
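
To make the proposed pipeline concrete, here is a minimal sketch of the "build a parse tree, then render it" approach. It is not taken from the linked lexer.py or parser.py; the names <code>Node</code>, <code>parse</code>, <code>to_html</code>, and <code>to_plain_text</code> are hypothetical, and the grammar covers only a toy subset (<code><nowiki>'''bold'''</nowiki></code> and <code><nowiki>''italic''</nowiki></code>) with none of MediaWiki's real quote-disambiguation rules.

```python
import re
from dataclasses import dataclass, field
from html import escape

# Toy lexer (an assumption for illustration): bold marker, italic marker,
# a run of non-quote characters, or a lone apostrophe.
TOKEN_RE = re.compile(r"'''|''|[^']+|'")

MARKERS = {"'''": "bold", "''": "italic"}
HTML_TAGS = {"bold": "b", "italic": "i"}


@dataclass
class Node:
    kind: str                                   # "root", "bold", "italic", or "text"
    text: str = ""                              # used only by "text" nodes
    children: list = field(default_factory=list)


def parse(source):
    """Build a parse tree from the toy markup using a stack of open nodes."""
    root = Node("root")
    stack = [root]
    for token in TOKEN_RE.findall(source):
        kind = MARKERS.get(token)
        if kind is None:
            # Plain text (or a lone apostrophe) becomes a leaf.
            stack[-1].children.append(Node("text", token))
        elif stack[-1].kind == kind:
            # Same marker as the innermost open node: treat it as the close.
            stack.pop()
        else:
            # Otherwise open a new formatting node nested in the current one.
            node = Node(kind)
            stack[-1].children.append(node)
            stack.append(node)
    return root


def to_html(node):
    """Render the tree as HTML, escaping text leaves."""
    if node.kind == "text":
        return escape(node.text)
    inner = "".join(to_html(child) for child in node.children)
    tag = HTML_TAGS.get(node.kind)
    return f"<{tag}>{inner}</{tag}>" if tag else inner


def to_plain_text(node):
    """Render the same tree as plain text by dropping all formatting nodes."""
    if node.kind == "text":
        return node.text
    return "".join(to_plain_text(child) for child in node.children)


tree = parse("A '''bold''' and ''italic'' <word>")
print(to_html(tree))
print(to_plain_text(tree))
```

Because both renderers walk the same tree, adding an output format such as plain text for emails only requires another small function, not a second parser. Mis-nested or unclosed markers are handled only crudely in this sketch.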