Diff for "PolicyAndProcess/DatabaseSchemaChangesProcess"

Not logged in - Log In / Register

Differences between revisions 10 and 29 (spanning 19 versions)
Revision 10 as of 2010-02-12 21:46:53
Size: 9512
Editor: flacoste
Comment:
Revision 29 as of 2011-02-16 11:47:55
Size: 10847
Editor: stub
Comment:
Deletions are marked like this. Additions are marked like this.
Line 3: Line 3:
We change the schema through database patches. These live in branches that go through review and landing, just like code changes.
Line 4: Line 5:
== Overview == A schema change needs a review of type "db" from the DBA (`stub`) or, in his absence, the technical architect (`lifeless`). But ''request db reviews from both'' these people, so that they can both track changes. A database patch also needs an identifying "patch number"; the db reviewer will assign one.

Changes to the permissions in `database/schema/security.py` or the comments in `database/schema/comments.sql` do not require db review. If you get both code and db reviews, the code reviewer should look at them. (But if nothing else in your branch requires code review, then they're for the db reviewer to inspect).


== Procedure ==
Line 10: Line 16:
 1. Submit a merge proposal for your branch into lp:launchpad (lp:~launchpad-pqm/launchpad/db-devel), requesting two '''db''' reviews from both the technical architect (BjornT) and the DBA (stub). If the branch also contains code changes, a third review request is needed same as any other branch with code changes (no type, launchpad team or occasionally an explicit code reviewer).  1. Submit a merge proposal for your branch into `lp:launchpad/db-devel` (`lp:~launchpad-pqm/launchpad/db-devel`), requesting two '''db''' reviews from respectively the technical architect (lifeless) and the DBA (stub).
Line 12: Line 18:
 1. Follow instructions given in the review feedback.  1. If your branch also contains code changes, then request a separate code review as well.

 1. Follow your reviewers' instructions. The DBA or Technical Architect will allocate a database patch number as required. The schema change is approved for landing when you have an 'Approved' vote from the DBA - no need to block waiting on the Technical Architect review.
Line 18: Line 26:
 1. Land the branch on lp:launchpad (lp:~launchpad-pqm/launchpad/db-devel) unless someone has stated it is being landed it on your behalf.  1. Land the branch on `lp:launchpad/db-devel` (`lp:~launchpad-pqm/launchpad/db-devel`) unless someone has stated it is being landed it on your behalf.
Line 31: Line 40:
    1. Comments on new tables and columns need to be added to `database/schema/comments.sql`.
Line 41: Line 49:
 1. In `database/schema/` run `make newsampledata`.  1. Run `psql launchpad_ftest_playground -f your-patch.sql` as the ftest playground db is used to generate newsampledata.sql in the following step. Also run `psql launchpad_dev -f your-patch.sql` to update the dev database's sampledata.
Line 43: Line 51:
 1. Review the sample data changes that occured using `diff current.sql newsampledata.sql`. This diff can be hard to review as-is. You might want to use a graphical diff viewer like `kompare` or `meld` which will make it easier. Make sure that you understand all the changes you see.  1. Run `make newsampledata`.

 1. R
eview the sample data changes that occured using `diff database/sampledata/current.sql database/sampledata/newsampledata.sql`. This diff can be hard to review as-is. You might want to use a graphical diff viewer like `kompare` or `meld` which will make it easier. Make sure that you understand all the changes you see.
Line 52: Line 62:

 1. Comments on new tables and columns need to be added to `database/schema/comments.sql`.
Line 144: Line 156:
 2. DBA (in our case, Stuart)
 3. Technical Archict (in our case, BjornT)
 2. DBA (in our case, StuartBishop)
 3. Technical Architect (in our case, RobertCollins)
Line 167: Line 179:

== Current db patch number ==

This is the highest allocated patch number, which may not have landed yet.

{{{2208-44-0}}}

We change the schema through database patches. These live in branches that go through review and landing, just like code changes.

A schema change needs a review of type "db" from the DBA (stub) or, in his absence, the technical architect (lifeless). But request db reviews from both these people, so that they can both track changes. A database patch also needs an identifying "patch number"; the db reviewer will assign one.

Changes to the permissions in database/schema/security.py or the comments in database/schema/comments.sql do not require db review. If you get both code and db reviews, the code reviewer should look at them. (But if nothing else in your branch requires code review, then they're for the db reviewer to inspect).

Procedure

  1. Discuss non-trivial or controversial changes.
  2. Prepare a branch containing your database patch for review. Feel free to request assistance from or outsource work to the DBA (stub).
  3. Submit a merge proposal for your branch into lp:launchpad/db-devel (lp:~launchpad-pqm/launchpad/db-devel), requesting two db reviews from respectively the technical architect (lifeless) and the DBA (stub).

  4. If your branch also contains code changes, then request a separate code review as well.
  5. Follow your reviewers' instructions. The DBA or Technical Architect will allocate a database patch number as required. The schema change is approved for landing when you have an 'Approved' vote from the DBA - no need to block waiting on the Technical Architect review.
  6. Ensure the sampledata is up to date on your branch.
  7. Ensure the test suite still passes.
  8. Land the branch on lp:launchpad/db-devel (lp:~launchpad-pqm/launchpad/db-devel) unless someone has stated it is being landed it on your behalf.

Making a database patch

You need to run these steps whenever you make a schema change, regardless of whether you intend to add new sample data or not. For example, if you are adding a new column to the Person table, these steps ensure that the new sample data will include this new column.

  1. Run make schema to get a pristine database of sample data.

  2. Create a SQL file in database/schema/pending/ containing the changes you want.

    1. The first line of your file should be SET client_min_messages=ERROR;.

    2. Don't use the TRUNCATE or DROP TABLE statements as they don't work with Slony-I replication.
      1. To drop a table, rename it into the todrop namespace using a statement like ALTER TABLE FooBar RENAME TO todrop.FooBar.

    3. If you are doing data initialization or data migration, don't use indeterminate values that might give different results on different databases, such as random() or CURRENT_TIMESTAMP. This would create skew in our data between the master and slave databases.
      1. For a new column with an indeterminate DEFAULT, you may need to manually set the initial values to something determinate before adding the DEFAULT to the column. eg. initialize the new date_eviscerated column to a fixed, known timestamp before setting its default to CURRENT_TIMESTAMP AT TIME ZONE 'UTC'.
  3. Run your new SQL patch on the development database to ensure that it works. Do this by running psql launchpad_dev -f your-patch.sql

  4. (Optional): Interact with launchpad.dev to add any additional sample data you want to demonstrate in the web u/i.

  5. Run psql launchpad_ftest_playground -f your-patch.sql as the ftest playground db is used to generate newsampledata.sql in the following step. Also run psql launchpad_dev -f your-patch.sql to update the dev database's sampledata.

  6. Run make newsampledata.

  7. Review the sample data changes that occured using diff database/sampledata/current.sql database/sampledata/newsampledata.sql. This diff can be hard to review as-is. You might want to use a graphical diff viewer like kompare or meld which will make it easier. Make sure that you understand all the changes you see.

  8. In database/sampledata/, move newsampledata.sql to current.sql, replacing the latter.

  9. In database/sampledata/, move newsampledata-dev.sql to current-dev.sql, replacing the latter.

  10. Move your pending SQL file into database/schema/ with a name like patch-xx-99-0.sql (where xx matches the existing patches), and ending with the line INSERT INTO LaunchpadDatabaseRevision VALUES (xx, 99, 0);. When your patch is reviewed and approved, you will be assigned an official patch number, which you will use instead of 99 in both the name of the file and this last line.

  11. Run make schema again to ensure that it works, and that you now have a pristine database with the new sample data.

  12. Comments on new tables and columns need to be added to database/schema/comments.sql.

  13. Make any necessary changes to database/schema/fti.py, database/schema/security.cfg, and at least the minimum code changes required to keep the test suite passing.

  14. Make any necessary changes to the SQL patch to reflect new default values.
  15. Run the full test suite to ensure that your new sample data doesn't break any existing tests by side effect.

  16. Go have lunch.

Note that if you make subsequent additional changes, you may be able to skip straight to step 5.

Sample data

Let's say your branch needs to make changes to the database schema. You need to follow the steps on this page to ensure that the sample data is updated to match your schema changes.

In fact, there are now two sets of sampledata that need to be updated.

We use sample data to provide well-known baseline data for the test suite, and to populate a developer's Launchpad instance so that launchpad.dev can display interesting stuff. There are some guidelines and recommendations you should be aware of before you make changes to the test suite sample data, or you may break the tests for yourself or others.

Please note that sample data is for developer's instances only. It would make no sense to use the sample data on production systems!

If your tests require new data, you should strongly consider creating the data in your test's harness instead of adding new sample data. This will often make the tests themselves more readable because you're not relying on magical values in the sample database. Doing it this way also reduces the chance that your changes will break other tests by side-effect. Add the new data in your test's setUp() or in the narrative of your doctest. Because the test suite uses the launchpad_ftest database, there is no chance that running the test suite will accidentally add new sample data.

However, if you interact with the web U/I for launchpad.dev your changes will end up in the launchpad_dev database. This database is used to create the new sample data, so it is imperative that you run make schema to start with a pristine database before generating new sample data. If in fact you do want the effects of your u/i interactions to land in the new sample data, then the general process is to

  • run make schema

  • interact with launchpad.dev

  • follow the make newsampledata steps above.

Be aware though that your generation of new sample data will probably have an effect on tests not related to your changes! For example, if you generate new karma events, you will probably break the karma_sample_data tests because they expect all karma events to be dated prior to the year 2002. If you make changes to the sample data, you must run the full test suite and ensure that you get no failures, otherwise there is a very high likelihood that you will break the trunk.

Resolving schema conflicts

Resolving conflicts in current.sql manually is usually more trouble than it's worth. Instead, first resolve any conflicts in comments.sql, then:

cd database/schema/
mv {patch-in-question}-0.sql comments.sql pending/
cp {parent branch, e.g. rocketfuel}/database/schema/comments.sql ./
cp ../sampledata/current.sql.OTHER ../sampledata/current.sql
make
psql launchpad_dev -f pending/patch-xx-99-0.sql
make newsampledata
mv ../sampledata/newsampledata.sql ../sampledata/current.sql
mv pending/{patch-in-question}-0.sql pending/comments.sql ./
make   # Just to make sure everything works
cd ../..
bzr resolve database/sampledata/current.sql

Notes on Changing security.cfg

It is possible to land changes to security.cfg on lp:~launchpad-pqm/launchpad/devel rather than lp:~launchpad-pqm/launchpad/db-devel. These changes can cause OOPSes on edge.launchpad.net if edge requires a permission that has not been granted to the production DB. So if you are landing a security.cfg change that needs to land on lp:~launchpad-pqm/launchpad/devel, you need to email stub & the losas, CC: launchpad@ asking them to apply the manual change to the production databases as well. Reviewers should remind about this when seeing a security.cfg change.

Triggers

  1. Developer needs a database schema change implemented as part of his/her development activities
  2. Developer needs a database schema change implemented as part of his/her bug fixing activities

Inputs

  1. Developer originated schema change concept

Outputs

  1. SQL Patch File

Participants

  1. Developer
  2. DBA (in our case, StuartBishop)

  3. Technical Architect (in our case, RobertCollins)

Subprocesses

  • N/A.

Standard Path Events/Activities

  1. When a developer has a change they want to make to the database schema, they write a database patch (see database/schema in the source tree).
  2. They can give themselves a provisional patch number on their development tree.
  3. A merge proposal is submitted, with reviews requested from the Launchpad Product Strategist and the DBA.
  4. When it has passed review, the DBA gives them a database patch number, which becomes the filename of the SQL.
  5. The DBA will also issue further instuctions on how to proceed.

Notes:

  • Because each database patch number is unique, various database schema changes can be worked on by different members of the team in parallel, and successfully merged together in different ways.
  • This is most often used just for schema changes.
  • Sometimes schema changes have accompanying data updates.

Alternative Path Events/Activities

Comments

Current db patch number

This is the highest allocated patch number, which may not have landed yet.

2208-44-0

PolicyAndProcess/DatabaseSchemaChangesProcess (last edited 2022-09-29 10:26:50 by cjwatson)