## Template for LP Production Meeting logs. Just paste xchat log below and the format IRC line will take care of formatting correctly #format IRC #startmeeting Meeting started at 10:00. The chair is matsubara. Commands Available: [TOPIC], [IDEA], [ACTION], [AGREED], [LINK], [VOTE] Welcome to this week's Launchpad Production Meeting. For the next 45 minutes or so, we'll be coordinating the resolution of specific Launchpad bugs and issues. [TOPIC] Roll Call New Topic: Roll Call me Not on the Launchpad Dev team? Welcome! Come "me" with the rest of us! me me me rockstar, hi al-maisan, hi * stub (n=stub@canonical/launchpad/stub) has joined #launchpad-meeting flacoste, hi me (so-so) ok, foundations, code and soyuz missing. they can join in later [TOPIC] Agenda New Topic: Agenda * Actions from last meeting * Oops report & Critical Bugs * Operations report (mthaddon/herb/spm) * DBA report (stub) [TOPIC] * Actions from last meeting New Topic: * Actions from last meeting me me * matsubara to file a bug about the missing select permissions that delayed the rollout * https://bugs.edge.launchpad.net/launchpad-foundations/+bug/353926 * cprov to look up soyuz bugs 353568 * matsubara to include francis suggestion to bug 353530 and ursinha to summarize what spm told her * matsubara commented on the bug. * salgado to debug and fix bug 353863 * sinzui to email the list how we should address critical bugs on unmaintained apps (e.g. blueprint) * matsubara to talk to mrevell to announce a maintenance in the DB for about 10 min outage in the next 2 weeks. ask mrevell to talk to stub about it * matsubara emailed mrevell about this. Launchpad bug 353568 in soyuz "ubuntu/source/package/+index timing out" [High,Fix released] https://launchpad.net/bugs/353568 Launchpad bug 353530 in malone "OOPS filing a bug using the email interface " [Undecided,Fix released] https://launchpad.net/bugs/353530 Error: This bug is private Launchpad bug 353863 in launchpad-registry "TypeError when finishing creating user account in lpnet" [Critical,Fix released] https://launchpad.net/bugs/353863 matsubara: not done the info I had was useless to the bug report very superficial and not helpful, so I didn't add me cprov and salgado bugs are fix released. so that's done sinzui, do you want me to add another action for your item? for next week matsubara: please do [action] sinzui to email the list how we should address critical bugs on unmaintained apps (e.g. blueprint) ACTION received: sinzui to email the list how we should address critical bugs on unmaintained apps (e.g. blueprint) [TOPIC] * Oops report & Critical Bugs New Topic: * Oops report & Critical Bugs go ahead Ursinha all right! one puzzle for losas/stub, three bugs for foundations, three bugs for registry flacoste, bug 354593, bug 353926, openid resetting password, bug 358498 sinzui: bug 357307, bug 358486, bug 358492 herb/stub: we're having *lots* of oopses like https://lp-oops.canonical.com/oops.py/?oopsid=OOPS-1194D1005. I've sent one email to lp list and spoken with jtv about that, something is killing the db connections, so when a request tries to re-use a connection that died it oopses like that. So herb/stub: do you know what can be possibly happening to the db? Launchpad bug 354593 in launchpad-foundations "SSO exceptions views need proper branding" [High,Triaged] https://launchpad.net/bugs/354593 Bug 353926 on http://launchpad.net/bugs/353926 is private Launchpad bug 358498 in launchpad-foundations "AssertionError OOPS on openid when resetting password " [Undecided,New] https://launchpad.net/bugs/358498 Launchpad bug 357307 in launchpad-foundations "TypeError when creating new account in lpnet" [Undecided,New] https://launchpad.net/bugs/357307 Launchpad bug 358486 in launchpad-registry "AttributeError when user is confirming new account and LP is checking if it was a suspended one" [Undecided,New] https://launchpad.net/bugs/358486 Launchpad bug 358492 in launchpad-foundations "ProgrammingError OOPS resetting password" [Undecided,New] https://launchpad.net/bugs/358492 https://devpad.canonical.com/~jamesh/oops.cgi/1194D1005 Ursinha: Account and LoginToken are Foundations issues Ursinha: looking at the oops now... sinzui, looking.. sinzui, thanks for changing that Ursinha: The OOPS is showing that the database reconnection isn't working as it should. Why that connection died isn't on that OOPS - it would have happened on the previous request handled by that thread. so that's flacoste's too * sinzui did it a few minutes ago sinzui, for AuthToken: +resetpassword, do you think salgado could look into this one? flacoste: Not this week, He is gone. I can take the +resetpassword in a few hours Ursinha: for the branding bug, how often does it happen? Ursinha: reworking these templates is going to happen, but it's not easy to fix now Ursinha: We have watchdogs that kill bad connections. I don't recall seeing any reaped connections from the appservers recently. flacoste, about 6,7 a day stub, we had 2 thousand oopses like this one I showed Ursinha: actually, i think these are related to the DisallowedStore error i'm seeing Ursinha: because these links don't appear on normal SSO pages stub: what the permission bug in 358492? nm, i know what this is about stub, jtv said he saw a lot of "administrator terminated connection" errors on lp-errors-report list When? yesterday I couldn't find them Anyway - the reason the OOPS count is so high is the appserver isn't recovering like it should. stub: could you look at this bug tomorrow? I can look at the oops - I don't know if there is a bug yet (I think there is - not sure though) for the db killing spree no, there isn't What db killing spree? at least I didn't open one, I'll do now ah I mean, the oopses the lots of oopses because of the appserver isn't recovering like it should ok anyway, it's that bug you're talking about flacoste? yes I'll open one and let you know that's all for me we have one critical bug, in progress so, if matsubara has nothing else to say, oops section is closed [action] ursinha to file a bug about "appserver isn't recovering like it should causing too many oopses" ACTION received: ursinha to file a bug about "appserver isn't recovering like it should causing too many oopses" thanks sinzui, flacoste, stub and herb and matsubara, of course intellectronica, can you move bug 269538 from fix committed to fix released? Launchpad bug 269538 in bugzilla-launchpad/bugzilla-3.2 "Compilation error in plugin when authenticating" [Critical,Fix committed] https://launchpad.net/bugs/269538 or at least chase why it's not fix released yet? that bug been in fix committed for ages matsubara: i have no idea what's going on with that. i'll talk to gmb about it thanks intellectronica thanks everyone let's move on [action] intellectronica to talk to gmb about bug 269538 ACTION received: intellectronica to talk to gmb about bug 269538 Launchpad bug 269538 in bugzilla-launchpad/bugzilla-3.2 "Compilation error in plugin when authenticating" [Critical,Fix committed] https://launchpad.net/bugs/269538 [TOPIC] * Operations report (mthaddon/herb/spm) New Topic: * Operations report (mthaddon/herb/spm) 2009-04-04 - Launchpad experienced an outage most likely due to hitting some connection limits on the DB. Some users may have experienced issues for up to 90 minutes. 2009-04-08 - Deployed r7947 to soyuz and xmlrpc servers. Bug 156453 and bug 118625 continue to be problematic for us. Just want to make sure I'm keeping it on your radar. Launchpad bug 156453 in loggerhead "production loggerhead branch leaks memory" [Critical,In progress] https://launchpad.net/bugs/156453 Launchpad bug 118625 in launchpad-bazaar "codebrowse sometimes hangs" [High,Triaged] https://launchpad.net/bugs/118625 That's all for this week, unless there are questions. herb, I have something to report here again! woohoo! herb, so we've identified the real memory pig. Unfortunately, it won't be trivial to change. cool! herb, so we know where the issue is, and now we just need to schedule about two weeks and re-write loggerhead. haha hehe the problem is that he is serious :-/ mine was a laugh of despair rockstar, so are you tackling that for 2.2.4 and maybe 2.2.5? flacoste: I believe the problem with +resetpassword is that it sends logintokens to users who have not setup a person yet. matsubara, well, I doubt it'll be 2.2.4, because mwhudson is on leave for so much of it. matsubara, what really needs to happen is that we need to be sequestered again for a week to do nothing but fix it. sinzui: sounds about right, that shouldn't happen :-) flacoste: I'll get this fix today by we, you mean you and mwhudson or the whole code team? sinzui: thanks a lot matsubara, mwhudson and I. matsubara, we got some really good work done at the Pycon sprints last week. there's all hands and uds coming, maybe during that? anyway, that's beyond the scope of this meeting. I think that's all. anything else for herb? [TOPIC] * DBA report (stub) New Topic: * DBA report (stub) thanks herb Can you describe the memory leak? thanks matsubara During the last rollout, one of the database patches turned out to be relying on database row ordering for some data migration, with the end result being some newly created rows on the slaves had different primary key values to the master and each other. This caused replication to block later when changes to the data on the master could not be duplicated on the slaves due to constraint violations, alerting us to the problem. We rebuild the slave databases to correct the problem (the safest way of recovering the situation). The corruption was not noticable to end users and did not infect the master, as only the internal database ids where affected. I was hoping to switch our master to the 16 core box, but public holidays and illness have put a hold on that this week. On the 6th and 7th, some batch jobs erroneously had their database connections terminated. Sorry about that. It is unlikely this was end user visible. echo... echo... oi oi stub, you're coordinating the downtime annoucement with mrevell, right? I will stub, ok. thanks. anything else for stub? thanks stub. I think that's all for today. Thank you all for attending this week's Launchpad Production Meeting. #endmeeting