Editor retention: Editing tools
In November, the team worked primarily on finalizing the code re-engineering of VisualEditor so that it is more modular and easier to extend, and on the integration ahead of deploying it for wider testing in December. The early version of the VisualEditor on mediawiki.org was updated twice (1.21-wmf4
), fixing a number of bugs and missing wikitext compatibility, and wide-spread improvements to much of the user interface code so that it will be easier to change in future.
In preparation for the upcoming deployment on the English Wikipedia, the Parsoid team concentrated on the preservation of existing content. Automated round-trip testing on 100,000 randomly chosen pages from the English Wikipedia using distributed test runners helped to identify many issues, which were fixed and often resulted in new minimal test cases being added to the parser test suite. Currently, 79.4% test articles (up from about 65% last month) round-trip without any differences at all, an additional 18% round-trip with only minor (whitespace, quote style etc) differences, and the remaining 2.6% of pages have differences that still need fixing (down from about 15% last month). Selective serialization will further avoid dirty diffs in unmodified parts of a page by using the original wikitext for those. This will help further fix the 20% of pages that had any kind of difference in wikitext. The implementation of this algorithm is currently being finalized.
This month, we continued to develop final features for Article Feedback
, and researched how people are using this tool on the English Wikipedia. With the help of community members, we designed new features
to reduce the editor workload, including improved moderation tools and a more prominent feedback link. These features will be developed next month, once we’ve completed code re-factoring to improve database performance. We also analyzed new research data to track how moderators use the feedback page
, and measure how many readers who post feedback become editors or registered users. Next month, we will invite Wikipedians to evaluate the usefulness of feedback posts and the effectiveness of our new moderation tools. Once these tasks are done, we plan to release Article Feedback v5 to 100% of the English Wikipedia in early 2013. For more information about this tool, check our project overview
is now in ‘maintenance mode’, following its release on the English Wikipedia in September 2012. We have been tracking the impact of this tool with a metrics dashboard
, which confirms that it is being used actively, with over 27,000 pages reviewed since launch. To learn more, visit our introduction page
, watch this video tour
or read this tutorial
. If you are an experienced editor, try out the final version
on the English Wikipedia.
The Agora extension moves ever-closer to completion, with help from Munaf Assaf, Trevor Parscal, Rob Moen and Vibha Bamba. Several templates on the English-language Wikipedia have been redesigned to reduce interface clutter, with some already implemented.
Editor engagement experiments
In November, the Editor Engagement Experiments team (E3) deployed the third and final A/B test of the new account creation page
, including client-side validation. Results from basic data analysis of all three tests were published on Meta, and the project will now move to the productization stage. Extension:PostEdit
was put in maintenance mode after being deployed to a further seven Wikipedias, including French and Portuguese. On the analytics side, E3 transitioned permanently to Extension:EventLogging
for data collection purposes, and collaborated with the mobile team to track activity on Wikipedia’s mobile beta. Last but not least, the team also deployed a small design improvement to the personal tools
menu in MediaWiki core.
The work of Ankur Anand (a.k.a drecodream) on Flickr integration, done during GSoC, has now been merged, and Wikimedia engineers are working towards its deployment in the near future. Specifically, several bugs related to Internet Explorer were fixed. Once all the bug fixes are deployed, the feature will be turned on for Commons (hopefully in early December). Initially it will only be available to administrators.
Architecture & Platform support
This month, we designed and started building key features of the Notifications project (code-named ‘Echo’), towards a first experimental deployment in early 2013. Fabrice Florin wrote detailed feature requirements
for our first release, and Vibha Bamba designed the first components of the user experience
. Ryan Kaldari and Benny Situ developed the main features of this application, including the notifications flyout, the all-notifications archive, as well as email notifications and preferences. To test our work in progress, visit our first prototype
(create an account and post on your talkpage from a separate account). New employee Luke Welling
is also starting work on an HTML email module for this project. For more information, visit our project hub
, or check our overview slides
The official start of Flow will follow Echo
development. An initial team will be forming next month to explore solutions here.
2012 Wikimedia fundraiser
November has been a busy month for Fundraising as the team helped to kick-off the annual 2012 fundraiser on November 26th with heavy testing before then. So far the 2012 fundraiser has been a resounding success raising over $12M in the 5 full days and limited testing days since November 15th. For current information, see the live stats
Shortly before the full launch, it was announced that the annual fundraiser would be splitting into an English-language fundraiser in Australia, Canada, Great Britain, the United States and New Zealand during the traditional November/December period with other languages and all countries in April. For more details see the announcement on wikimedia-l
Wikimedia engineers deployed 1.21wmf3
to all Wikimedia sites, and began deploying 1.21wmf5
(with a momentary breakage
). These updates included many significant improvements, including one-click (AJAX) patrolling, for both new page and diff patrol, and a Template Sandbox
, which lets users preview changes to a template by previewing an example page where it’s used.
We’re still very much looking forward to deploying the latest version of Gerrit (see last month’s update), but unfortunately remain blocked on a complicated LDAP propagation issue. Chad Horohoe is working with the Gerrit developers on finalizing the fix for this issue. Chad also attended the Gerrit Developer Summit in November, and both Chad and Rob Lanphier attended the Gerrit Users Summit (notes
We have deployed TimedMediaHandler to all wikis. Jan Gerber and Michael Dale continue to fix bugs. Jan Gerber and Aaron Schulz are working on an improved file upload mechanism in UploadWizard to make larger file uploads more practical.
Chris Steipp and Chad Horohoe have reviewed the Wikibase set of extensions, as well as DataValues. Deployment of these extensions is planned for December.
Wikivoyage was launched into public beta on November 10. The site is running on Wikimedia servers, and accounts and text content was migrated. Images from the old site have not been automatically imported, because some contain non-free content, and need to be added to each language wiki in accordance with the Exemption Doctrine Policy
for that site. Public announcement and promotion of the site is delayed while the community is working on the image transfer.
Thumbnails (and math/timeline files) are now written to nas1 and Swift. More improvements have been made to FileBackend to avoid extra HEAD requests for 404 errors. Webm thumbnails use temporary Swift URLs to support range requests. Feature requests and bugs reports are filed against Ceph as MediaWiki takes advantage of other Swift features.
Brad Jorsch and Chad Horohoe have joined Tim Starling on this project. Brad has built a template sandbox
which will help in debugging both Lua scripts and regular templates. Chad is working on a shared repository for scripts, and Tim has been extending the API. His latest work has been around adding multilingual APIs for handling things like plurals within Lua. We’re currently seeking a volunteer product manager
to help out with the roll-out of this.
Various improvements to the job queue have been made to avoid CPU time wasted on duplicate jobs and redundant page cache purges. Changes have also been made to make it possible to edit heavily used templates without timeouts.
Incremental architectural improvements
Support needed for more complex data structures (lists, sets) in memcached (with atomic updates) is awaiting more review and testing. The coding is essentially done (changeset
Admin tools development
The team’s work continued on writing an interface for Stewards to mass-lock user accounts
and on being able to use AbuseFilter extension
across all wikis at once.
Mark Hershberger published the MediaWiki 1.20
stable tarball on November 7th. Chris Steipp published a security update (1.20.1) on November 29th.
Wikia wants to attract motivated app developers and companies using Wikia’s products to use the API. They also want to make the API more standards-compliant (a RESTful interface, using HTTP verbs), but that’s a high-level goal. Mobile-related work is first, but this redesign would improve the whole platform, including the enterprise. The Wikimedia Foundation and Wikia want to work together on this; The Wikimedia Foundation also wants to avoid boxing ourselves into special-purpose, specific apps. Wikia developer Federico Lucignano is currently working on a Request for comments on the REST proposal.
Security auditing and response
The team continued to respond to several reported vulnerabilities, and released new versions of all supported MediaWiki branches (1.20.1, 1.19.3, 1.18.6) to address vulnerabilities in core. Significant security reviews continued for Wikidata and Wikivoyage extensions.
QA and testing
The team contributed to the community QA draft strategy
and presented the Acceptance Test-Driven Development concept to Wikimedia Product/Project managers. Regression testing of software deployments is ongoing.
We deployed ArticleFeedbackv5
to the beta cluster, which is the primary host for AFTv5 testing, including browser test automation. New Page Patrol
is being maintained there as well. We are still working on issues of ongoing maintenance, and this cluster played a role
in catching a defect that recently escaped to production.
A continuous integration summit occurred during the Netherlands Hackathon. integration-jenkins2 is now fully operational with Jenkins / Gerrit and a Zuul installation. Antoine Musso has generated the new MediaWiki core Jenkins jobs. Zuul has been deployed in production successfully. It triggers a new set of Jenkins jobs that will eventually replace the old MediaWiki-.* ones. The new Jenkins jobs for MediaWiki core (triggered by Zuul) have been tested in production and are successful. The new workflow has been documented.
In November, the QA team created a backlog of tests to be automated
, ported existing tests from RSpec to Cucumber, and is now working on browser testing architecture, creating basic new tests (see the
repository in Gerrit), and refactoring tests for cleanliness. Chris McMahon began discussing automated browser tests with Wikimedia tech managers to get developers writing those tests as they develop extensions deployed on Wikimedia sites; public announcement will be coming very soon, when the existing example tests are in final or near-final form. Noisy tests failing for known reasons have been removed from the suite, which is now completely green (that is, passing); the team will soon be writing and adding more tests. Browser tests in November identified a serious regression in UploadWizard running on test2
and prevented its release to production.
Kraken (Analytics Cluster)
The Analytics team has received all of the hardware purchased back in the Spring. The Hadoop nodes have been moved onto their final homes. Evan Rosen from the Global Development team is helping us test this setup with real use cases for his team. Kafka has been puppetized and installed. It is currently consuming all of Banner Impression- and Wikpedia Zero-related logs. As a proof of concept, the Zero logs are being fed daily into Hadoop for analysis by the Global Development team. Debs for Storm have been built. Storm has been puppetized and is running on several of the Cisco nodes.
David Schoonover and Dan Andreescu are working on a major rework of Limn, using Knockout.js and d3.js. The team hopes to have this ready to present the metrics for the December 6 metrics meeting at the Wikimedia Foundation.
improved and cleaned up updated large parts of the bug management
and Bugzilla documentation. This includes the beginnings of a triage guide
. He also published his Greasemonkey scripts in a Git repository
and went through obsolete extensions and updated their Bugzilla descriptions. Andre started analyzing how Wikimedia engineering teams use Bugzilla and their related workflows. He also investigated a potential upgrade of Bugzilla to version 4.2 by doing some basic testing. Furthermore, a wikitech-l discussion
on standardizing the meaning of “highest priority” in Bugzilla resulted in creating a new “Immediate” priority status.
The first phase of the Outreach Program for Women
(OPW) has been completed, receiving the submissions of more than 15 firm candidates, delivered to 8 mentors available. The Wikimedia Foundation is funding 4 full-time internship positions between January and March 2013. There is a possibility to obtain more, depending on external sponsors of the program. The selected candidates will be announced on December 11. The OPW is organized by the GNOME Foundation and 11 FLOSS projects are taking part.
Management reviewed options to determine the direction this activity would follow in future months. In the meantime, Guillaume Paumier
cleaned up and expanded the Wikimedia glossary
with terms related to Wikimedia technology and engineering, and volunteers & engineers came to expand it further. He also followed up on the consultation process
initiated in October to identify how to improve dialogue between technical communities and user communities. He’s now in the process of widening this discussion to more communities. Sumana Harihareswara
sent a call for volunteers
to lead or advise Wikimedia engineering staff on select activities, and followed up on the offers.
Volunteer coordination and outreach
started sharing new volunteer coordination tasks with Quim Gil
, the new technical contributor coordinator who started working with the Wikimedia foundation in November. They continued to follow up on contacts (such as those gained at October’s Grace Hopper Celebration of Women in Computing), recruit new contributors to the Wikimedia tech community, and mentor newer contributors. The weekly online tech chats
continued on Thursdays. Sumana and others continued to grant developer access
and work on Gerrit project ownership requests