Diff for "Foundations/QA/OOPSToolsMiniSprint"

Differences between revisions 10 and 11
Revision 10 as of 2010-09-23 20:42:16
Size: 2926
Editor: ursinha
Comment:
Revision 11 as of 2010-09-28 20:57:06
Size: 1926
Editor: matsubara
Comment:
Line 1: Line 1:
Mini OOPS Tools sprint, taking place in Campinas, SP, Brazil, 20-24 September 2010. = OopsTools Sprint =
Line 3: Line 3:
== We're discussing == Where: Campinas, SP, Brazil
When: 20-24 September 2010
Line 5: Line 6:
 * Bug Bug:461269: oops reports should be grouped by oops signature not exception type and exception value (better grouping of oopses; better relation oopses-real problems)
 * How to move this information to be easily accessible via web
 * Improve oops-tools to handle queries for a given pageid
Ursula and Diogo got together for a week to discuss improvements for [[oops-tools|https://edge.launchpad.net/oops-tools/]]
Line 9: Line 8:
=== How to reduce all OOPS emails into one with grouped information === Things discussed:
Line 11: Line 10:
Send only one email to the list including information broken down by team:     * how to get rid of the reports sent to the launchpad@ list, at most have
      just a single email sent to the launchpad list
    * improve the content of the reports
    * provide web ui so developers can generate a customized report with
      oopses only interesting to them
    * how to fix bug Bug:461269 in a way that new oops attributes can be used
      to uniquely identify an infestation
Line 13: Line 18:
 * stats section is global, for all production instances
 * each team section shows top exception and top timeout and the percentage of oops reports the team is responsible for.
    * maybe show top offending pageid (i.e. the pageid with the greatest number of oopses) instead?
 * all team sections show link to a full team report
 * remove unnecessary sections from the report, such as soft time out, informational only, user generated errors
 * provide a way to find oopses that would appear in those sections through the web ui
 * remove unnecessary oopses from the report (e.g. bug Bug:540890 and Bug:251896)
== User Stories ==
Line 21: Line 20:
{{{     * Deryck wants to see all OOPSes related to the Bugs team, but without the
      checkwatches noise.
    * Danilo wants to see all OOPSes related to Translations in a single
      report.
    * Robert wants to see reports grouped by pageid
    * Julian doesn't want to receive any more email
    * Francis, Robert and Jono want to see an overall state of the production
      instances
    * Gary wants the connection between infestations and Launchpad bug report
      to be very reliable (i.e. once the tool is taught about a false
      positive, it should do the right thing the next time)
Line 23: Line 32:
Subject: Oops report for 2010-09-22 == Action items ==
Line 25: Line 34:
= Stats for 2010-09-22 =

* 10000 Exceptions
* 50000 Time Outs

== Top 5 Exceptions per Page ID ==

* 300 Person:+index - ConjoinedBugTaskError: 'Foo bar is foobared'
* 120 BugTask:+index - ConjoinedBugTaskError: 'Foo bar is foobared'
* 100 Distribution:+search - ConjoinedBugTaskError: 'Foo bar is foobared'
* 80 DistributionSourcePackage:+filebug - ConjoinedBugTaskError: 'Foo bar is foobared'
* 78 Archive:+packages - ConjoinedBugTaskError: 'Foo bar is foobared'
    * Bug Bug:XXX: change ErrorSummary object to accept sections so it can be
      built dynamically
    * Bug Bug:XXX: web ui so developers can generate reports customized to
      what they need (http://ubuntuone.com/p/HvI/)
    * Bug Bug:461269: new oops attributes, such as pageid, should be used to
      uniquely identify an infestation
    * File RT to have lp-production-configs on devpad automatically updated
Line 39: Line 43:
== Bugs 20 % ==
Full report: https://lp-oops.canonical.com/summary/?team=Bugs&date=2010-09-22
== Bugs fixed during the sprint ==
Line 42: Line 45:
 * 230 ConjoinedBugTaskError: 'Foo bar is foobared'
 * 440 TimeOut: 'some broken page'

== Code 50 % ==
Full report: https://lp-oops.canonical.com/summary/?team=Code&date=2010-09-22

 * 100 TooNewRecipeError: 'recipe is foobared'
 * 1140 TimeOut: 'some broken page'


== Foundations 10 % ==
Full report: https://lp-oops.canonical.com/summary/?team=Code&date=2010-09-22

 * 330 LibrarianDiskError: 'librarian is down'
 * 666 TimeOut: 'some other page is broken'

}}}

 * Better team reports
   * !PageId to become an object, not just an attribute
     * Find out !PageId 'owners'
     * Use django admin interface to set 'orphan' PageIds teams
     * Will have to migrate data to the new model
   * Change how an oops' team is calculated:
     * Prefix > !PageId > vhost


== We're doing ==

 * Bug Bug:540890: exclude robot posts from reports


== We did ==

 * Bug Bug:612354: fix oops-tools bootstrapping
 * Bug Bug:251896: oops-tools should filter out not found errors referred from non-local domains
    * Bug Bug:612354: fix oops-tools bootstrapping
    * Bug Bug:251896: oops-tools should filter out not found errors referred
      from non-local domains

OopsTools Sprint

Where: Campinas, SP, Brazil
When: 20-24 September 2010

Ursula and Diogo got together for a week to discuss improvements for https://edge.launchpad.net/oops-tools/

Things discussed:

  • how to get rid of the reports sent to the launchpad@ list, at most have just a single email sent to the launchpad list
  • improve the content of the reports
  • provide web ui so developers can generate a customized report with oopses only interesting to them
  • how to fix bug 461269 in a way that new oops attributes can be used to uniquely identify an infestation

User Stories

  • Deryck wants to see all OOPSes related to the Bugs team, but without the checkwatches noise.
  • Danilo wants to see all OOPSes related to Translations in a single report.
  • Robert wants to see reports grouped by pageid
  • Julian doesn't want to receive any more email
  • Francis, Robert and Jono want to see an overall state of the production instances
  • Gary wants the connection between infestations and Launchpad bug report to be very reliable (i.e. once the tool is taught about a false positive, it should do the right thing the next time)

Action items

  • Bug XXX: change ErrorSummary object to accept sections so it can be built dynamically
  • Bug XXX: web ui so developers can generate reports customized to what they need (http://ubuntuone.com/p/HvI/)
  • Bug 461269: new oops attributes, such as pageid, should be used to uniquely identify an infestation
  • File RT to have lp-production-configs on devpad automatically updated
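
The Bug 461269 item above asks for infestations to be identified by more than exception type and value. A minimal sketch of that idea follows. It assumes an OOPS is a plain dict with hypothetical keys `exception_type`, `exception_value` and `pageid`; the function names are illustrative and are not taken from oops-tools.

```python
from collections import defaultdict


def oops_signature(oops):
    """Build a grouping key from an OOPS report (hypothetical dict layout)."""
    # Assumption: pageid is part of the key, so the same exception raised on
    # two different pages counts as two separate infestations.
    return (
        oops["exception_type"],
        oops["exception_value"],
        oops.get("pageid"),
    )


def group_by_signature(oopses):
    """Map each signature to the list of OOPS reports that share it."""
    groups = defaultdict(list)
    for oops in oopses:
        groups[oops_signature(oops)].append(oops)
    return dict(groups)
```

Keeping the signature in one function means further attributes could be added to the key later without touching the grouping code.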

Bugs fixed during the sprint

  • Bug 612354: fix oops-tools bootstrapping

  • Bug 251896: oops-tools should filter out not found errors referred from non-local domains
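
The Bug 251896 filter can be pictured roughly as below. This is only a sketch under stated assumptions, not the fix that landed: the `NotFound` type name, the `referrer` key, and the `LOCAL_DOMAINS` tuple are all hypothetical, and the notes do not say how reports without a referrer are handled (here they are kept).

```python
from urllib.parse import urlparse

# Hypothetical list of domains treated as "local"; the real list is not
# given in these notes.
LOCAL_DOMAINS = ("launchpad.net",)


def is_local_host(host, local_domains=LOCAL_DOMAINS):
    """Return True if host is a local domain or one of its subdomains."""
    return any(host == d or host.endswith("." + d) for d in local_domains)


def keep_in_report(oops, local_domains=LOCAL_DOMAINS):
    """Drop not-found errors whose referrer is a non-local domain."""
    if oops.get("exception_type") != "NotFound":
        return True
    host = urlparse(oops.get("referrer") or "").hostname
    if host is None:
        # Assumption: with no usable referrer, keep the report.
        return True
    return is_local_host(host, local_domains)
```

A report list could then be narrowed with `[o for o in oopses if keep_in_report(o)]`.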

Foundations/QA/OOPSToolsMiniSprint (last edited 2010-10-01 22:12:01 by gary)