Minutes

Minutes of the LXCERT meeting 09.03.04

Agenda:

  • status update of the situation with Red Hat
  • formal certification decision / planned end date
  • Planned changes in the next release:
    Linux support sent around a document with our change proposals for the next release before the meeting (archived copy), and asked the other participants to do so as well.
  • communication improvements:
    Discussion on how to improve communication with the user community.
  • AOB

Points 2 and 3 were swapped to first have Jarek's presentation, then the certification decision. No AOB points were added.

Attendance:

present
Bruce M Barnett ATLAS-online (replaced by Marc Dobson)
Alastair Bland AB/CO
Eric Cano CMS-online
Marco Cattaneo LHCb (replaced by Jo´┐Żl Closier)
Nicolas De Metz-Noblat AB/CO
Benigno Gobbo non-LHC experiments
Jan Iven IT-services catchall (chair)
Jarek Polok general Desktops (secretary)
Thorsten Kleinwort IT PLUS/BATCH service (replaced by G.Cancio and H.Renshall)
Stephan Wynhoff CMS-offline
Helge Meinhard CLUG
Alan Silverman (Red Hat negotiations, present during first part only)
absent
Alberto Aimar LCG Application Area
Gilbert Poulard ATLAS-offline
Fons Rademakers ALICE

Red Hat situation overview:

(slides from the presentation)

Questions:

Q: why pay at all for these 200 nodes if we do CEL3 anyway?
A: test drive their support line for next round, keep momentum from negotiations

Q: who will use these nodes?
A: should be binary compatible to CEL3, so can be used during Data Challenges. No silent introduction into LXBATCH, always ask explicit user agreement to run on these.

Q: What is the message for the outside institutes?
A: CEL3 is freely available for this year. Come to HEPiX and discuss what should be done next year.

Q: pricing details?
A: no need to know in this group, but affordable. Should be available at HEPiX. RHE-3-ES (for ORACLE): "few hundred EUR", talk to Alan for details and orders.

CEL3 feature summary:

(slides from the presentation)

First there were some questions on the cluster management tools QUATTOR and functionality of apt-rpm. The discussion then turned to the announced removal of "fvwm2" (which is still the default desktop), it was stressed that the decision lies with the DTF, and that increased helpdesk calls are to be expected after the transition. An explicit migration strategy may have to be proposed. It was suggested to provide a similarly low-footprint window manager.

Some products have trouble if run from AFS or in multiple instances (GNOME, evolution), and the effect and usefulness of browser caches on AFS was discussed. Similarly, German pointed out the remaining 160 ASIS packages, but it was felt that most of them were obsolete and the rest could easily be provided at system install time or via the new package selection tool. The ASIS framework for compilation was not requested anymore.

LXPLUS service managers agreed to switch (a part of) LXPLUS and BATCH to the new system as quickly as possible to CEL3 after certification. Further migration would be demand-driven, which worked well in the 6.1.1->7.2 transition. Some "legacy" systems will have to be kept on 7.3.

Certification decision:

After discussion, a tentative end date for the certification could not be agreed on, since the experiments pointed out that they would in fact be blocked by missing dependencies for an unknown amount of time. However, it was agreed that the LHC experiments would certify 1 month after the required libraries from LCG/SEAL and POOL would be available.

Similarly, AB/CO pointed out that a working version of ORACLE was a hard requirement for their users (specifically, they need a working ORACLE Pro*C to use Oracle embedded SQL with gcc-3.2.3), and that they expect support from IT even if ORACLE would not certify the new platform directly. The non-LHC experiments (namely HARP and COMPASS) also depend strongly on the ORACLE client libraries, since all the metadata is stored in ORACLE.

The dependency on PVSS seems to be a minor issue as ETM (PVSS producer) has agreed to collaborate closely with CERN on the issue.

The non-LHC experiments are most interested in CERNLIB being available soon and can start certification as soon as it is available.

Communication with users

(Problem summary: information does not reach individual users that are not in a formal role. The Linux team has mailing lists, but no forced subscription. No user-at-large communication exists since CLUG's decline)

Discussion: Information is perhaps being lost inside the experiment hierarchy, communication from link people is not always perfect. However, experiments at least do have formal structures. Users don't and probably don't get information at all.

Some of the suggestions that came out were to use /etc/motd information (on login) and to clearly establish that http://cern.ch/linux is the reference source for official announcements.

Addition by M.Cattaneo: For everything related to the desktop, the DTF is the official communication channel - so migration plans affecting the desktop environment should be discussed there.