Difference between revisions of "New SGE cluster installation"
From GridPP Wiki
(No difference)
|
Latest revision as of 14:44, 15 January 2007
CE installation
- Install lcg-CE rpms
- up2date-nox -u lcg-CE
- install perl-Net-LDAP
- Re-try lcg-CE installation
[root@ce00 root]# up2date-nox -u lcg-CE --nosig Fetching Obsoletes list for channel: rhel-i386-as-3... Fetching Obsoletes list for channel: rhel-i386-as-3-extras... Fetching Obsoletes list for channel: ic-hep-as3-i386... Fetching Obsoletes list for channel: rhel-i386-as-3-fastrack... Fetching Obsoletes list for channel: glite3... Fetching Obsoletes list for channel: lcg2_CA... Fetching rpm headers... ######################################## Name Version Rel ---------------------------------------------------------- lcg-CE 3.0.5 0 noarch Testing package set / solving RPM inter-dependencies... Downloading headers to solve dependencies... ####################################### Downloading headers to solve dependencies... ####################################### Downloading headers to solve dependencies... ####################################### Downloading headers to solve dependencies... ######################################## The following packages were added to your selection to satisfy dependencies: Name Version Release -------------------------------------------------------------- CASTOR-client 1.7.1.5 1.longname CGSI_gSOAP_2.3 1.1.5 1 CGSI_gSOAP_2.6 1.1.15 6 MySQL-client 4.1.11 0 MySQL-devel 4.1.11 0 MySQL-server 4.1.11 0 MySQL-shared 4.0.25 sl3 ares-devel 1.1.1 cel3 bdii 3.8.1 1_sl3 boost-g3 1.29.1 06vh_sl3 bouncycastle-jdk14 1.19 2 classads-g3 0.9.4 vh7_sl3 classads-jar 1.1 2 cleanup-grid-accounts 1.0.1 1 cog-jar 1.1 1 commons-cli 1.0_beta2_edg 2edg commons-logging 1.0.2 12 cppunit 1.10.2 3 edg-allschema-config 0.2.1 1 edg-brokerinfo_gcc3_2_2 2.1 5_sl3 edg-fabricMonitoring 2.5.4 4 edg-gpt-profile 1.0.0 1 edg-gridftp-client 1.2.5 1 edg-gridftpd 1.1.2 1_sl3 edg-info-ce lcg2.6.39 1_sl3 edg-info-main lcg3.0.23 1_sl3 edg-info-service 1.0.0 1 edg-java-data-util 1.3.22 1_sl3 edg-java-security 1.5.11 1_sl3 edg-java-security-client 1.5.11 1_sl3 edg-java-security-test 1.5.11 1_sl3 edg-lcas_gcc3_2_2 1.1.22 1_sl3 edg-lcas_gcc3_2_2-interface 1.0.3 1_sl3 edg-lcas_gcc3_2_2-voms_plugins 1.1.22 1_sl3 edg-lcmaps_gcc3_2_2 0.0.30 1_sl3 edg-lcmaps_gcc3_2_2-basic_plugins 0.0.30 1_sl3 edg-lcmaps_gcc3_2_2-dummy_plugins 0.0.30 1_sl3 edg-lcmaps_gcc3_2_2-interface 0.0.1 1_sl3 edg-lcmaps_gcc3_2_2-voms_plugins 0.0.30 1_sl3 edg-mkgridmap 2.6.1 1_sl3 edg-mkgridmap-conf 2.6.1 1_sl3 edg-netconf 1.1.3 1_sl3 edg-netmon-info-provider 1.0.8 1_sl3 edg-pool2info 1.0.1 1_sl3 edg-profile 2.0.9 1 edg-wl-bypass_gcc3_2_2 lcg2.5.3 29_sl3 edg-wl-chkpt-api_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-common-api-java-interface_gcc3_2_2lcg2.1.74 3_sl3 edg-wl-common-api-java_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-common-api_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-config_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-locallogger_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-logging-api-c_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-logging-api-cpp_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-logging-api-sh_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-services-common_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-ui-api-cpp_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-ui-api-java-interface_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-ui-api-java_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-ui-cli_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-ui-config_gcc3_2_2 lcg2.1.74 3_sl3 edg-wl-ui-gui_gcc3_2_2 lcg2.1.74 3_sl3 edg_gatekeeper_gcc3_2_2-gcc32dbg_pgm 2.2.15 1_sl3 fetch-crl 2.0 1 gacl 0.9.2 1_gcc3_2_2_sl3 glite-apel-core 1.0.1 0 glite-apel-lsf 1.0.0 1 glite-apel-pbs 1.0.0 1 glite-apel-publisher 1.0.0 1 glite-essentials-cpp 1.1.1 1_EGEE glite-essentials-java 1.2.0 2_EGEE glite-rgma-api-c 5.0.8 1 glite-rgma-api-cpp 5.0.13 1 glite-rgma-api-java 5.0.3 1 glite-rgma-api-python 5.0.7 1 glite-rgma-base 5.0.6 1 glite-rgma-command-line 5.0.3 1 glite-rgma-gin 5.0.7 1 glite-rgma-log4cpp 5.0.3 1 glite-rgma-log4j 5.0.2 1 glite-rgma-stubs-servlet-java 5.0.5 1 glite-security-trustmanager 1.8.3 1 glite-security-util-java 1.3.4 1 glite-security-voms-admin-client 1.2.13 1 glite-security-voms-admin-interface 1.0.3 1 glite-security-voms-api 1.6.16 3 glite-security-voms-api-c 1.6.16 4 glite-security-voms-api-cpp 1.6.16 4 glite-security-voms-clients 1.6.16 2 globus-config 0.23 1.lcg globus-initialization 2.2.4 5 glue-schema 1.2.2 1_sl3 gpt VDT1.2.2rh9 1 gridice-sensor 1.6.0 23 gsiopenssh VDT1.2.2rh9 1 gssklog-cern 0.10 1 j2sdk_profile 1.4.2_08 sl3 jakarta-axis 1.1rc2 3 jakarta-commons-logging 1.0.2 lcg1_sl3 jas-jar 1.0.0 1 jug 1.0.2_edg edg2 jxUtil-jar 1.0.1 1 lcg-auditlog 1.1.1 1_sl3 lcg-expiregridmapdir 2.0.0 1 lcg-extra-jobmanagers 1.1.8 1_sl3 lcg-info-dynamic-condor 1.1.1 1_sl3 lcg-info-dynamic-lsf 1.0.9 3_sl3 lcg-info-dynamic-pbs 1.0.12 1_sl3 lcg-info-dynamic-scheduler-generic 1.6.1 1 lcg-info-dynamic-scheduler-pbs 1.6.0 1 lcg-info-dynamic-software 1.0.3 1_sl3 lcg-info-generic 1.0.22 1_sl3 lcg-info-provider-software 1.0.5 1_sl3 lcg-info-templates 1.0.15 1_sl3 lcg-lcas-lcmaps 1.1.1 1 lcg-pbs-utils 1.0.0 1 lcg-schema 1.2.1 1_sl3 lcg-tank-gcc32dbg 2.0 1_sl3 lcg-tankspark-conf 2.0 2_sl3 lcg-version 3.0.2 1 lcg-vomscert-na48 1.0.0 1 lcg-vomscerts 4.2.0 1 libstdc++-ssa 3.5ssa 0.20030801.48 log4j 1.2.6 1jpp mm.mysql 2.0.14 1edg mpich 1.2.6 1.sl3.cl mpiexec 0.77 3.sl3 myproxy VDT1.2.2rh9 1 myproxy-config 1.1.8 13.edg1 mysql++_1.7.9_mysql.4.0.13__LCG_rh73_gcc321 1 netlogger-jar 1.0.0 1 perl-Crypt-SSLeay 0.51 4 perl-File-Tail 0.98 cel3 perl-IO-Socket-SSL 0.96 sl3 perl-Net-SSLeay 1.23 0.dag.rhel3 perl-SOAP-Lite 0.55 sl3 perl-TermReadKey 2.20 12 perl-Tie-Syslog 1.07 1 perl-Time-HiRes 1.38 3 perl-TimeDate 1.16 3_1.el3.at perl-XML-SAX-Base 1.04 1 python-logging 0.4.6 1 swig-runtime 1.3.21 1_EGEE torque 1.0.1p6 11.SL30X.st uberftp-client VDT1.2.2rh9_LCG2 vdt_globus_data_server VDT1.2.2rh9_LCG1 vdt_globus_essentials VDT1.2.2rh9_LCG2 vdt_globus_info_client VDT1.2.2rh9 1 vdt_globus_info_essentials VDT1.2.2rh9 1 vdt_globus_info_server VDT1.2.2rh9 1 vdt_globus_jobmanager_condor VDT1.2.2rh9 1 vdt_globus_jobmanager_lsf VDT1.2.2rh9 1 vdt_globus_jobmanager_pbs VDT1.2.2rh9 1 vdt_globus_rls_client VDT1.2.2rh9 1 vdt_globus_rm_client VDT1.2.2rh9 1 vdt_globus_rm_essentials VDT1.2.2rh9 1 vdt_globus_rm_server VDT1.2.2rh9 1 vdt_globus_sdk VDT1.2.2rh9_LCG2 voms-client_gcc3_2_2 1.5.4 2_sl3 xerces-c 1.7.0 sl3 xerces-j1 1.4.4 12jpp xml-commons 1.0 0.b2.3jpp_sl3 xml-commons-apis 1.0 0.b2.3jpp_sl3 libgcj-ssa 3.5ssa 0.20030801.48 redhat-java-rpm-scripts 1.0.2 2 ....
- Install latest version of yaim [ glite-yaim-3.0.0-11.noarch.rpm ]
- Install lcg-CA rpms (up2date-nox -u lcg-CA --nosig)
- Change users to lt2- prefix in yaim functions.
- Rest of the installation same as IC-LeSC [1]
- For WN's just change the environment settings for SGE to different location.
Site problems
SFT problem
- SAM tests seems to be using 1.8GB of virtual memory. The job was killed by SGE.
- Solution : Removed memory limit from SGE. Also added email-address in sge.pm for aborted jobs.
Information system unstable
- Change rgma-gin config on the site CE [2]
- Setup priority for processes using renice.
#!/bin/bash for i in `pgrep -f "grid_manager_monitor_agent"`; do renice +18 -p $i; done for i in `pgrep -f "grid-monitor-job-status"`; do renice +18 -p $i; done for i in `pgrep -f "globus-job-manager"`; do renice +18 -p $i; done
- Check the information plugin for the new version.
- Change bdii.conf
BDII_SEARCH_TIMEOUT=120 BDII_BREATHE_TIME=240 Restart bdii service.
- Change the cachetime in /opt/globus/etc/grid-info-resource-ldif.conf
# This file was automatically generated by globus-mds startup script. Do not modify. dn: Mds-Vo-name=local,o=grid objectclass: GlobusTop objectclass: GlobusActiveObject objectclass: GlobusActiveSearch type: exec path: /opt/lcg/libexec/ base: lcg-info-wrapper args: cachetime: 60 timelimit: 20 sizelimit: 250