- the start of recruitment for a multimedia engineering team;
- the deployment of a better translation interface;
- the release of Kiwix of Android, an app to download and view Wikimedia content offline;
- the milestone of 500 million monthly unique visitors reached in March;
- improvements to the main page of translatewiki.net;
- the migration of Wikidata, and the English and German Wikipedias, to MariaDB 5.5;
- the second phase of Wikidata, whose structured data can now be displayed in Wikipedia articles;
- the deployment of VisualEditor’s alpha version to 14 more language versions of Wikipedia;
- a proposed replacement for the login and account creation interface on Wikimedia projects;
- the launch of the Language Mavens program, an advocacy and advisory body in the domain of language engineering;
- the ramp-up of technical mentorship programs;
- the launch of an official Wikimedia Commons app for iOS and Android.
Note: We’re also providing a shorter, simpler and translatable version of this report that does not assume specialized technical knowledge.
- 116 unique committers contributed patchsets of code to MediaWiki.
- The total number of unresolved commits remained stable around 815.
- About 49 shell requests were processed.
- Wikimedia Labs now hosts 163 projects and 1224 users; to date 1782 instances have been created.
Personnel
Work with us
Are you looking to work for Wikimedia? We have a lot of hiring coming up, and we really love talking to active community members about these roles.
- Director of Analytics
- Software Engineer – Parser
- Software Engineer – Fundraising
- Software Engineer – Language Engineering
- Software Engineer – Mobile
- Software Engineer – Multimedia Systems
- Software Engineer – Multimedia User Interfaces
- Software Engineer – Search
- Product Manager – Mobile
- UX Designer
- Dev-Ops Engineer – SRE
- MySQL Database Administrator
- Director of Technical Operations
Announcements
- Monte Hurd joined the Mobile engineering group as Software Engineer in the Apps team (announcement).
- Brandon Black joined the Operations team as Dev/Ops Engineer (announcement).
- Erik Bernhardson joined the Features team as Features Engineer (announcement).
- Nischay Nahata joined the Features team as Features Contractor (announcement).
Technical Operations
Site infrastructure
- Several large wikis were migrated to MariaDB, with positive results. A new class of redis servers were deployed in support of the migration of our asynchronous job queuing infrastructure from MySQL to redis, enabling us to better meet the demands of Wikidata and Echo. New file uploads are now being written to Ceph in Eqiad, in addition to Swift in pmtpa, in support of a potential migration. The current plan is to open up the Eqiad Ceph cluster for ‘reads’ the second week of May. Currently ‘reads’ are served by the Tampa Swift cluster.
- With the core cluster migrated to Eqiad, we are now working on the miscellaneous server cluster. As part of the cleanup, we retired servers as well.
- New tools for import of partial or full content into a new wiki have been released. A step-by-step walkthrough of their use has been added to the documentation for users of the dumps on Meta.
- True incremental dumps are now a GSOC proposal and several students have applied for this project.
- The logging table XML dump on Wikidata was taking days to run, due in part to the high volume of edits there, much more than even the English Wikipedia. Most of those edits wind up being recorded as autopatrol in the log, making it already about half the size of the logging table for the English Wikipedia. Breaking up the database query into smaller batches works around the issue.
- Work on tool labs is progressing nicely. 32 bots/tools have been added to the tools project. Most of the functionality of Toolserver should now be available in tool labs. Database replication is still being worked on, but is progressing well. The pre-labs replication databases are being replicated to, and the Redactatron application has been finished, allowing us to mark tables as ok to replicate. Our current roadmap is for database replication to be accessible by the time of the Amsterdam Hackathon. Instance creation performance greatly improved this month by replacing the generic Ubuntu cloud images with our own custom images that pre-installs and pre-configures most of what an initial puppet run would handle. Work on single-instance MediaWiki continued this month, making the initial MediaWiki installation more robust and handling a number of legal issues (such as showing the terms of use, using proper logos, linking to a proper privacy policy, etc.). Work began on adding Ajax interactivity to the OpenStackManager interface. Currently changes are in for reboot and get console output actions for managing instances. A more reasonable project filter change using jQuery Chosen has been added as well. Work on replacing glusterfs is mostly done. Two projects have been switched to use the new NFS server and the rest will be switched next month. Work has begun on upgrading OpenStack from the essex to the folsom release. Our testing environment has been upgraded and production tests are currently ongoing. During the OpenStack summit, work was done to push the Moniker DNS application into OpenStack incubation to be added as a supported OpenStack project. Ryan Lane gave a talk during the OpenStack summit about the state of OpenStack’s user committee, along with Tim Bell of CERN and JC Martin of eBay. Work on the user committee is in hopes of making OpenStack easier to use an upgrade, which should increase the frequency of updates in Labs.
Features Engineering
Editor retention: Editing tools
Several other features for the July release are on track. The specification for extensions containing templates and templates containing extensions were fleshed out and are currently being implemented. Similarly, our specs for images and thumbnails were vastly improved so that we will soon support full editing for all parameters.
We also improved our code quality and testing infrastructure.
In preparation for the July release, we did more benchmarking and capacity planning. A caching strategy that avoids overwhelming the API with requests was developed, hardware to run Parsoid was ordered and work on the implementation started.
Editor engagement features
Editor engagement experiments
For the team’s Onboarding new Wikipedians project, we completed quantitative analysis of the latest version of the GettingStarted landing page, and began prototyping a new landing page and navigation system for usability testing prior to further development and launch, which is expected in early May as well.
On the analytics and infrastructure front, the team handed off the product roadmap for the User Metrics API to the Analytics team and colleagues in the Grantmaking and Programs department. Ori Livneh, in support of the data analysis needs on the team, began work supporting a Foundation instance of IPython Notebook.
Last but not least, the E3 team held its second Quarterly Review session, and began work planning its next high-level goals for the April–June quarter.
Support
Language engineering
Language engineering communications and outreach
Mobile
Platform Engineering
MediaWiki Core
Site performance and architecture
Security auditing and response
Quality assurance
Timo Tijhof overhauled the automatically generated MediaWiki documentation for Javascript and PHP with Doxygen 1.7. He also fixed the duplicate test runs that happened in specific cases (bug 43391). Finally he set up QUnit tests for the VisualEditor extension; if this proves successful, QUnit runs will be generalized to all extensions.
Mark Holmquist improved the Jenkins jobs that track Parsoid regressions tests.
Finally, we now have linters for several languages: PHP, Python, Ruby and even Yaml. If your git repositories are missing a lint check, please contact us or file in a bug against Wikimedia > Continuous Integration.
Analytics
We’re also now importing 1:1000 traffic streams, enabling us to migrate reports from our legacy analytics platform, WikiStats, onto our big data cluster, Kraken. In the future, this will make it easier for us to publish data and visualize reports using our newer infrastructure.
We have implemented secure login to the User Metrics API via SSL. We’ve also introduce a new metric called <code|pages_created, allowing us to count the number of pages created by a specific editor.
We improved the accuracy of the udp2log monitoring and upgraded the machines to Ubuntu Precise in order to make the system more robust.
Analytics Visualization, Reporting & Applications
- We’ve started to analyze mobile site pageviews by device class, in order to determine how we will invest in building applications and sites that support various device formats.
- We’ve also started to perform session analysis of mobile site visits, in order to help us understand user behavior when using the mobile sites, which will inform decisions about ongoing development efforts. At present, this data is only for internal consumption by the Mobile team.
- A new overall mobile pageviews report is now available, which has improved the accuracy of our reporting due to changes in how the MobileFrontend extension requests a wiki article (improving performance).
- More information about how we’re calculating mobile pageviews is available in our documentation.
We also introduced new dashboards for our Editor engagement team, that will help them monitor the usage of the new Notifications system. Finally, we’ve added pageview stats for the Hungarian and Ukranian Wikivoyages.
Engineering community team
Volunteer coordination and outreach
Kiwix
The Kiwix project is funded and executed by Wikimedia CH.
- In April, we released for the first time Kiwix for Android. This version doesn’t provide as many features as the desktop app, but it works well with all ZIM files. Two Kiwix developers will attend Wikimania and have started preparing for a a small hackathon, two presentations and a permanent booth.
Wikidata
The Wikidata project is funded and executed by Wikimedia Deutschland.
- The team hit a big milestone with the deployment of the first iteration of phase 2 of Wikidata on all remaining Wikipedias (it had been enabled on 11 Wikipedias previously). Qualifiers were also enabled on Wikidata, making it possible to add additional information to certain data. Wikipedians are now able to make use of the data available on Wikidata in articles, allowing the data to be collaboratively collected, curated and used by all Wikipedias.
- The team also fixed a few issues to make it possible to use Wikidata with Internet Explorer 8, and worked on the time datatype. Together with bot owners, they massively improved the time it takes for Wikidata changes to show up in the recent changes and watchlists on Wikipedia sites. The code and architecture got an external professional review; the reviewers were quite happy with the quality of the code base and gave useful tips for improvements.
Future
- The engineering management team continues to update the Deployments page weekly, providing up-to-date information on the upcoming deployments to Wikimedia sites, as well as the engineering roadmap, listing ongoing and future Wikimedia engineering efforts.
This article was written collaboratively by Wikimedia engineers and managers. See revision history and associated status pages. A wiki version is also available.
Can you help us translate this article?
In order for this article to reach as many people as possible we would like your help. Can you translate this article to get the message out?
Start translation