## Template for LP Production Meeting logs. Just paste xchat log below and the format IRC line will take care of formatting correctly #format IRC #startmeeting Meeting started at 09:00. The chair is matsubara. Commands Available: [TOPIC], [IDEA], [ACTION], [AGREED], [LINK], [VOTE] * salgado is now known as salgado-lunch Welcome to this week's Launchpad Production Meeting. For the next 45 minutes or so, we'll be coordinating the resolution of specific Launchpad bugs and issues. * Notify: salgado is offline (simmons.freenode.net). [TOPIC] Roll Call New Topic: Roll Call me me me me me I'm standing in for henninge today, since he's on sprint. thanks jtv me me so, if any of you can't make the meeting next week, please coordinate with another teammate to cover for you and add a notice in the Apologies section in the MeetingAgenda page. flacoste: ping me bigjools: ping me me * henninge (n=henning@port-213-160-23-156.static.qsc.de) has joined #launchpad-meeting ok, let's move on, stub can join later [TOPIC] Agenda New Topic: Agenda * Actions from last meeting * Oops report & Critical Bugs * Operations report (mthaddon/herb/spm) * DBA report (DBA contact) [TOPIC] * Actions from last meeting New Topic: * Actions from last meeting * bac to check with barry if there's an open bug for OOPS-1125A1096, if not, Ursinha to file one - there was, bug 280925 * intellectronica to work on bug 279561 * rockstar to check OOPS-1125CEMAIL1 * bac to take a look at OOPS-1125A165 - bac filed bug 322792 * Ursinha to check with kiko if any other rollouts will happen this week Launchpad bug 280925 in launchpad-registry "Project overview page shows obsolete series" [Low,Fix released] https://launchpad.net/bugs/280925 https://devpad.canonical.com/~jamesh/oops.cgi/1125A1096 Error: Could not parse data returned by Launchpad: The read operation timed out (https://launchpad.net/bugs/279561/+text) Launchpad bug 322792 in launchpad-bazaar "Attempting traversal past an unknown object causes an OOPS" [Undecided,Fix released] https://launchpad.net/bugs/322792 https://devpad.canonical.com/~jamesh/oops.cgi/1125A165 holy crap * stub (n=stub@canonical/launchpad/stub) has joined #launchpad-meeting that bug is timing out or what? anyway intellectronica: any news about the api bug? my items were done rockstar: what's up about that oops? matsubara: sorry, no news yet matsubara, he landed a fix for that Ursinha: who's he? :-) matsubara, rockstar :) sorry matsubara, I got an RC in for that one. ok, I remember that one. so I guess the only pending one is the api bug which is a mistery to everyone the good news is that intellectronica found out another thing about that bug that might lead to its root cause intellectronica: thanks for keeping us posted in the report let's move on [TOPIC] * Oops report & Critical Bugs New Topic: * Oops report & Critical Bugs so in today's oops section I'd like to talk about the timeout bugs you guys are working for the LPW https://dev.launchpad.net/PerformanceWeeks/February2009 I'm going to review all the landings related to LPW work and add to that page. so if you wanna help, point me to revision numbers on RF related to that work matsubara: bug 324264 is now Fix Committed. Launchpad bug 324264 in rosetta "Speed up +translations" [High,Fix committed] https://launchpad.net/bugs/324264 matsubara: r7705 for Bug: 325321 jtv, great intellectronica, BjornT : any news about the bug number 1 time out? matsubara: EdwinGrubbs will land his branch today work on bug 302798 is in progress in different ways (there's a commit from 323something which disabled external suggestions to give us a better idea on how stuff is working, and henning is working on removal of obsolete translations which will reduce our DB size by ~33%) Launchpad bug 302798 in rosetta "Timeout on +translate page" [High,Triaged] https://launchpad.net/bugs/302798 matsubara: me, intellectronica, and allenap are working on it danilos: hey, I was putting that paragraph together! matsubara: allenap is looking at reducing the time it take to render the comments, possibly by not showing all by default jtv: ok, I'll let you handle it all from now on :) matsubara: intellectronica is working on loading the subscribers portlet in a different request matsubara: you can have full trust in jtv as far as PW is concerned :) *cough* matsubara: and i'm working on optimizing code, based off profiling information danilos: I do! he's been very helpful with the status updates matsubara: also, intellectronica's inital tests on dogfood were successful, reducing the time quite a lot :) great matsubara: i'm working on bug 316881 (which isn't an OOPS per-se, but related to performance anyhow) Launchpad bug 316881 in launchpad "Page headers not suitable for HTTP caching" [High,In progress] https://launchpad.net/bugs/316881 stub: there's an email from jono asking for some help with the +project-cloud oops, so if you could help out there, would be awesome thanks flacoste, i'll add it to the page I've replied on the bug. Not sure if I'm helpful though. stub: cool. thank you bigjools: news in soyuz. how about the one muharem is taking care of? s/./?/ matsubara: it's not going so well unfortunately I don't expect any progress this week [action] matsubara to add 316881 to foundations section in LPW wiki page ACTION received: matsubara to add 316881 to foundations section in LPW wiki page bigjools: why is that? matsubara: the first attempt to fix it failed. he's also been at the distro sprint this week bigjools: oh right. well, you guys are excused since the whole team is sprinting and you already landed 2(?) timeout fixes :-) matsubara: thanks :) I guess that's it from me. Ursinha, anything else? matsubara, no, the pending oops for soyuz I already talked with bigjools yeah, get edge updating again and we'll see how it went great. thanks everyone! [TOPIC] * Operations report (mthaddon/herb/spm) New Topic: * Operations report (mthaddon/herb/spm) - 2009-01-30 - Friday we updated lpnet, edge and the scripts servers to to r7676. - 2009-02-04 - Yesterday we updated codebrowse to r43 - I feel like I'm starting to sound like a broken record... But we're still being bothered daily, often multiple times, by bug #156453 and bug #118625 which seem to be related. Launchpad bug 156453 in loggerhead "production loggerhead branch leaks memory" [Critical,In progress] https://launchpad.net/bugs/156453 - We also continue to run into issues associated with bug #260171 Launchpad bug 118625 in launchpad-bazaar "codebrowse sometimes hangs" [High,Triaged] https://launchpad.net/bugs/118625 Bug 260171 on http://launchpad.net/bugs/260171 is private herb: i herd mwhudson was working on loggerhead performance this week as part of LPW yes! herb: There is a sprint in the planning to fix that too yes, he is mwhudson is indeed working on that herb: and well-placed sources also told me that a sprint is being organized to fix those damn issues herb: so there is hope! excellent. a fix would be huge for the LOSAs herb, mwhudson is on the verge of insanity tracking loggerhead issues down. thanks for all the work on that. it is much appreciated. thanks herb [TOPIC] * DBA report (stub) New Topic: * DBA report (stub) The production dbs seem to be ticking away nicely. Staging db updates are being disabled by the losas for some testing, so expect a drop in timeout OOPSES. I've had some db patches for this cycle come through already, which is great. 3 2 1 all right. thanks stub what about the staging issues? herb: ^ staging isn't available at the moment it's restoring but we didn't understand why it failed? or did we? herb: how long will it take to restore? and it's just that gmb is done with the testing and we can resume normal staging updates? matsubara: should be back up within the next couple of hours. herb, about 10 hours ago I was talking with spm and he was unsuccessful to put staging on again flacoste: upgrade.py failed because there will still connections open to the staging db. This left the DB in an indeterminate state, so we had to go through the full restore process with a new copy of the staging DB. flacoste: I'm not done yet. flacoste: I need staging to be up for that so this means that we are still screwed? This is all useful information for read only launchpad btw. we either need to fix upgrade.py to work without a db restore or to turn off staging ugprades for the duration of gmb's test flacoste: upgrade.py works fine and we'll need to turn off upgrades while gmb's testing, per stubs note above. why were there still connections open? upgrade.py cannot work when there are active db connections, as upgrade requires exclusive locks on all the replicated db tables. flacoste: the rollout process shuts down the app servers, but there are still potentially cron scripts running, etc. why usually is this not a problem then? Because the restore, replicate and upgrade process is done as a fresh db. Once it is finished, it is swapped into place. ah right! * bigjools has to run, will catch scrollback later if you need anything from me so when I said fix upgrade.py, i should have said 'fix rollout process' so I guess it's best to simply disbale upgrade for now i also heard that gmb might do his tests elsewhere so that might becomes a moot issue but like stub said, very instructive for read-only launchpad flacoste: Well, that's an embryonic idea right now. I still need a staging / demo machine for the forseeable future. I want to make an action item for the fix rollout process thing not sure who would be responsible for that losas? whoa with the help of stub probably [action] losas and stub to fix rollout process to avoid the staging restore problems ACTION received: losas and stub to fix rollout process to avoid the staging restore problems there isn't anything inherently wrong with the rollout process. but ok thanks everyone anythign else before I close? well, it's currently not reliable if we don't do a DB restore it seems nope Thank you all for attending this week's Launchpad Production Meeting. See the channel topic for the location of the logs. #endmeeting Meeting finished at 09:36.