ReleaseEngineering/Applications/Mapper: Difference between revisions
Line 34: | Line 34: | ||
The mapper process is managed by supervisord which will ensure it is started up if the machine is ever rebooted, or if the process crashes. (configured in /etc/supervisord.conf) | The mapper process is managed by supervisord which will ensure it is started up if the machine is ever rebooted, or if the process crashes. (configured in /etc/supervisord.conf) | ||
Source is available at [https://github.com/catlee/mapper] | |||
= Mapper Data Source = | = Mapper Data Source = |
Revision as of 18:52, 30 May 2014
What is it?
When we convert hg repositories to git, and vice versa, the hg changeset SHA (the 40 character hexadecimal string that you get when you commit a change) is different to the git commit id (the equivalent SHA used by git).
In order to keep track of which hg changeset SHAs relate to which git commit SHAs, we keep a database of the mappings, together with details about the project the SHAs come from, and what time they were inserted into the database.
The vcs sync tool (checked into mozharness) is the tool which performs the conversion between hg repos and git repos, and this is documented separately. It is responsible for performing the conversion - this is outside the scope of mapper.
Mapper is a rest api, that allows:
- insertion of new mappings and projects (a "project" is essentially the name of the repo - e.g. build-tools) (HTTP POST)
- insertion of git/hg mappings for a given project (HTTP POST)
- retrieval of mappings for a given project (HTTP GET)
Behind the scenes, it is reading/writing from the database (using sqlalchemy).
Note: the vcs sync tool is a client of the mapper: it is vcs sync that inserts into mapper (i.e. uses the HTTP POST methods). The other clients of mapper will be:
- people / developers - wanting to query mappings
- b2g_build.py - the build script for b2g - since this needs to lookup shas in order to reference frozen commit versions in manifests
Mapper is written as a RelEng API blueprint - please note RelEng API has its own documentation too.
Source
mapper's source is currently hosted at https://github.com/petemoore/mapper
This will be moving as soon as it is ready to go to production. Currently it is in staging (see bug 847640)
Old Mapper
Until the "New Mapper" goes live, this is the information about the "old mapper" (which is being superceded).
mapper only requires bottle to run. It's recommended to run inside a virtual environment
Our current production deployment of mapper lives on cruncher under /home/buildduty/mapper. It listens locally on port 8888 (specified in mapper/app.py). The apache instance on cruncher is configured to forward requests to http://cruncher/mapper/* to http://locahost:8888/* (configured in /etc/httpd/conf/httpd.conf)
The mapper process is managed by supervisord which will ensure it is started up if the machine is ever rebooted, or if the process crashes. (configured in /etc/supervisord.conf)
Source is available at [1]
Mapper Data Source
This section describes (roughly) how vcs-sync provides the map files served by mapper.
The map files are generated and combined on the vcs-sync machines, then pulled onto cruncher and used by `mapper`. See the source and docs for vcs-sync for more details. (Especially the mapper support section.)
mapper expects hggit map files to be available under the 'mapfiles' directory of the application. On cruncher, these are in /home/buildduty/mapper/mapfiles. Each subdirectory of mapfiles corresponds loosely to a different repository being tracked. On cruncher, the mapfiles for each of these projects are symlinked to the mapfiles being published to cruncher via the process outlined above.
Note that the above docs will be integrated into the wiki after vcs-sync development stabilizes.