New SGE cluster installation

From GridPP Wiki
Revision as of 14:44, 15 January 2007 by Aggarwa (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

CE installation

  • Install lcg-CE rpms
  • up2date-nox -u lcg-CE
  • install perl-Net-LDAP
  • Re-try lcg-CE installation
[root@ce00 root]# up2date-nox -u lcg-CE --nosig

Fetching Obsoletes list for channel: rhel-i386-as-3...

Fetching Obsoletes list for channel: rhel-i386-as-3-extras...

Fetching Obsoletes list for channel: ic-hep-as3-i386...

Fetching Obsoletes list for channel: rhel-i386-as-3-fastrack...

Fetching Obsoletes list for channel: glite3...

Fetching Obsoletes list for channel: lcg2_CA...

Fetching rpm headers...
########################################

Name                                    Version        Rel     
----------------------------------------------------------
lcg-CE                                  3.0.5          0                 noarch


Testing package set / solving RPM inter-dependencies...

Downloading headers to solve dependencies...
#######################################
Downloading headers to solve dependencies...
#######################################
Downloading headers to solve dependencies...
#######################################
Downloading headers to solve dependencies...
########################################
The following packages were added to your selection to satisfy dependencies:

Name                                    Version        Release
--------------------------------------------------------------
CASTOR-client                           1.7.1.5        1.longname          
CGSI_gSOAP_2.3                          1.1.5          1                   
CGSI_gSOAP_2.6                          1.1.15         6                   
MySQL-client                            4.1.11         0                   
MySQL-devel                             4.1.11         0                   
MySQL-server                            4.1.11         0                   
MySQL-shared                            4.0.25         sl3                 
ares-devel                              1.1.1          cel3                
bdii                                    3.8.1          1_sl3               
boost-g3                                1.29.1         06vh_sl3            
bouncycastle-jdk14                      1.19           2                   
classads-g3                             0.9.4          vh7_sl3             
classads-jar                            1.1            2                   
cleanup-grid-accounts                   1.0.1          1                   
cog-jar                                 1.1            1                   
commons-cli                             1.0_beta2_edg  2edg                
commons-logging                         1.0.2          12                  
cppunit                                 1.10.2         3                   
edg-allschema-config                    0.2.1          1                   
edg-brokerinfo_gcc3_2_2                 2.1            5_sl3               
edg-fabricMonitoring                    2.5.4          4                   
edg-gpt-profile                         1.0.0          1                   
edg-gridftp-client                      1.2.5          1                   
edg-gridftpd                            1.1.2          1_sl3               
edg-info-ce                             lcg2.6.39      1_sl3               
edg-info-main                           lcg3.0.23      1_sl3               
edg-info-service                        1.0.0          1                   
edg-java-data-util                      1.3.22         1_sl3               
edg-java-security                       1.5.11         1_sl3               
edg-java-security-client                1.5.11         1_sl3               
edg-java-security-test                  1.5.11         1_sl3               
edg-lcas_gcc3_2_2                       1.1.22         1_sl3               
edg-lcas_gcc3_2_2-interface             1.0.3          1_sl3               
edg-lcas_gcc3_2_2-voms_plugins          1.1.22         1_sl3               
edg-lcmaps_gcc3_2_2                     0.0.30         1_sl3               
edg-lcmaps_gcc3_2_2-basic_plugins       0.0.30         1_sl3               
edg-lcmaps_gcc3_2_2-dummy_plugins       0.0.30         1_sl3               
edg-lcmaps_gcc3_2_2-interface           0.0.1          1_sl3               
edg-lcmaps_gcc3_2_2-voms_plugins        0.0.30         1_sl3               
edg-mkgridmap                           2.6.1          1_sl3               
edg-mkgridmap-conf                      2.6.1          1_sl3               
edg-netconf                             1.1.3          1_sl3               
edg-netmon-info-provider                1.0.8          1_sl3               
edg-pool2info                           1.0.1          1_sl3               
edg-profile                             2.0.9          1                   
edg-wl-bypass_gcc3_2_2                  lcg2.5.3       29_sl3              
edg-wl-chkpt-api_gcc3_2_2               lcg2.1.74      3_sl3               
edg-wl-common-api-java-interface_gcc3_2_2lcg2.1.74      3_sl3               
edg-wl-common-api-java_gcc3_2_2         lcg2.1.74      3_sl3               
edg-wl-common-api_gcc3_2_2              lcg2.1.74      3_sl3               
edg-wl-config_gcc3_2_2                  lcg2.1.74      3_sl3               
edg-wl-locallogger_gcc3_2_2             lcg2.1.74      3_sl3               
edg-wl-logging-api-c_gcc3_2_2           lcg2.1.74      3_sl3               
edg-wl-logging-api-cpp_gcc3_2_2         lcg2.1.74      3_sl3               
edg-wl-logging-api-sh_gcc3_2_2          lcg2.1.74      3_sl3               
edg-wl-services-common_gcc3_2_2         lcg2.1.74      3_sl3               
edg-wl-ui-api-cpp_gcc3_2_2              lcg2.1.74      3_sl3               
edg-wl-ui-api-java-interface_gcc3_2_2   lcg2.1.74      3_sl3               
edg-wl-ui-api-java_gcc3_2_2             lcg2.1.74      3_sl3               
edg-wl-ui-cli_gcc3_2_2                  lcg2.1.74      3_sl3               
edg-wl-ui-config_gcc3_2_2               lcg2.1.74      3_sl3               
edg-wl-ui-gui_gcc3_2_2                  lcg2.1.74      3_sl3               
edg_gatekeeper_gcc3_2_2-gcc32dbg_pgm    2.2.15         1_sl3               
fetch-crl                               2.0            1                   
gacl                                    0.9.2          1_gcc3_2_2_sl3      
glite-apel-core                         1.0.1          0                   
glite-apel-lsf                          1.0.0          1                   
glite-apel-pbs                          1.0.0          1                   
glite-apel-publisher                    1.0.0          1                   
glite-essentials-cpp                    1.1.1          1_EGEE              
glite-essentials-java                   1.2.0          2_EGEE              
glite-rgma-api-c                        5.0.8          1                   
glite-rgma-api-cpp                      5.0.13         1                   
glite-rgma-api-java                     5.0.3          1                   
glite-rgma-api-python                   5.0.7          1                   
glite-rgma-base                         5.0.6          1                   
glite-rgma-command-line                 5.0.3          1                   
glite-rgma-gin                          5.0.7          1                   
glite-rgma-log4cpp                      5.0.3          1                   
glite-rgma-log4j                        5.0.2          1                   
glite-rgma-stubs-servlet-java           5.0.5          1                   
glite-security-trustmanager             1.8.3          1                   
glite-security-util-java                1.3.4          1                   
glite-security-voms-admin-client        1.2.13         1                   
glite-security-voms-admin-interface     1.0.3          1                   
glite-security-voms-api                 1.6.16         3                   
glite-security-voms-api-c               1.6.16         4                   
glite-security-voms-api-cpp             1.6.16         4                   
glite-security-voms-clients             1.6.16         2                   
globus-config                           0.23           1.lcg               
globus-initialization                   2.2.4          5                   
glue-schema                             1.2.2          1_sl3               
gpt                                     VDT1.2.2rh9    1                   
gridice-sensor                          1.6.0          23                  
gsiopenssh                              VDT1.2.2rh9    1                   
gssklog-cern                            0.10           1                   
j2sdk_profile                           1.4.2_08       sl3                 
jakarta-axis                            1.1rc2         3                   
jakarta-commons-logging                 1.0.2          lcg1_sl3            
jas-jar                                 1.0.0          1                   
jug                                     1.0.2_edg      edg2                
jxUtil-jar                              1.0.1          1                   
lcg-auditlog                            1.1.1          1_sl3               
lcg-expiregridmapdir                    2.0.0          1                   
lcg-extra-jobmanagers                   1.1.8          1_sl3               
lcg-info-dynamic-condor                 1.1.1          1_sl3               
lcg-info-dynamic-lsf                    1.0.9          3_sl3               
lcg-info-dynamic-pbs                    1.0.12         1_sl3               
lcg-info-dynamic-scheduler-generic      1.6.1          1                   
lcg-info-dynamic-scheduler-pbs          1.6.0          1                   
lcg-info-dynamic-software               1.0.3          1_sl3               
lcg-info-generic                        1.0.22         1_sl3               
lcg-info-provider-software              1.0.5          1_sl3               
lcg-info-templates                      1.0.15         1_sl3               
lcg-lcas-lcmaps                         1.1.1          1                   
lcg-pbs-utils                           1.0.0          1                   
lcg-schema                              1.2.1          1_sl3               
lcg-tank-gcc32dbg                       2.0            1_sl3               
lcg-tankspark-conf                      2.0            2_sl3               
lcg-version                             3.0.2          1                   
lcg-vomscert-na48                       1.0.0          1                   
lcg-vomscerts                           4.2.0          1                   
libstdc++-ssa                           3.5ssa         0.20030801.48       
log4j                                   1.2.6          1jpp                
mm.mysql                                2.0.14         1edg                
mpich                                   1.2.6          1.sl3.cl            
mpiexec                                 0.77           3.sl3               
myproxy                                 VDT1.2.2rh9    1                   
myproxy-config                          1.1.8          13.edg1             
mysql++_1.7.9_mysql.4.0.13__LCG_rh73_gcc321              1                   
netlogger-jar                           1.0.0          1                   
perl-Crypt-SSLeay                       0.51           4                   
perl-File-Tail                          0.98           cel3                
perl-IO-Socket-SSL                      0.96           sl3                 
perl-Net-SSLeay                         1.23           0.dag.rhel3         
perl-SOAP-Lite                          0.55           sl3                 
perl-TermReadKey                        2.20           12                  
perl-Tie-Syslog                         1.07           1                   
perl-Time-HiRes                         1.38           3                   
perl-TimeDate                           1.16           3_1.el3.at          
perl-XML-SAX-Base                       1.04           1                   
python-logging                          0.4.6          1                   
swig-runtime                            1.3.21         1_EGEE              
torque                                  1.0.1p6        11.SL30X.st         
uberftp-client                          VDT1.2.2rh9_LCG2                   
vdt_globus_data_server                  VDT1.2.2rh9_LCG1                   
vdt_globus_essentials                   VDT1.2.2rh9_LCG2                   
vdt_globus_info_client                  VDT1.2.2rh9    1                   
vdt_globus_info_essentials              VDT1.2.2rh9    1                   
vdt_globus_info_server                  VDT1.2.2rh9    1                   
vdt_globus_jobmanager_condor            VDT1.2.2rh9    1                   
vdt_globus_jobmanager_lsf               VDT1.2.2rh9    1                   
vdt_globus_jobmanager_pbs               VDT1.2.2rh9    1                   
vdt_globus_rls_client                   VDT1.2.2rh9    1                   
vdt_globus_rm_client                    VDT1.2.2rh9    1                   
vdt_globus_rm_essentials                VDT1.2.2rh9    1                   
vdt_globus_rm_server                    VDT1.2.2rh9    1                   
vdt_globus_sdk                          VDT1.2.2rh9_LCG2                   
voms-client_gcc3_2_2                    1.5.4          2_sl3               
xerces-c                                1.7.0          sl3                 
xerces-j1                               1.4.4          12jpp               
xml-commons                             1.0            0.b2.3jpp_sl3       
xml-commons-apis                        1.0            0.b2.3jpp_sl3       
libgcj-ssa                              3.5ssa         0.20030801.48       
redhat-java-rpm-scripts                 1.0.2          2                   


....


  • Install latest version of yaim [ glite-yaim-3.0.0-11.noarch.rpm ]
  • Install lcg-CA rpms (up2date-nox -u lcg-CA --nosig)
  • Change users to lt2- prefix in yaim functions.
  • Rest of the installation same as IC-LeSC [1]
  • For WN's just change the environment settings for SGE to different location.

Site problems

SFT problem

  • SAM tests seems to be using 1.8GB of virtual memory. The job was killed by SGE.
  • Solution : Removed memory limit from SGE. Also added email-address in sge.pm for aborted jobs.

Information system unstable

  • Change rgma-gin config on the site CE [2]
  • Setup priority for processes using renice.
#!/bin/bash
for i in `pgrep -f "grid_manager_monitor_agent"`; do renice +18 -p $i; done
for i in `pgrep -f "grid-monitor-job-status"`; do renice +18 -p $i; done
for i in `pgrep -f "globus-job-manager"`; do renice +18 -p $i; done
  • Check the information plugin for the new version.
  • Change bdii.conf
BDII_SEARCH_TIMEOUT=120
BDII_BREATHE_TIME=240
Restart bdii service.
  • Change the cachetime in /opt/globus/etc/grid-info-resource-ldif.conf
# This file was automatically generated by globus-mds startup script. Do not modify.

dn: Mds-Vo-name=local,o=grid
objectclass: GlobusTop
objectclass: GlobusActiveObject
objectclass: GlobusActiveSearch
type: exec
path: /opt/lcg/libexec/
base: lcg-info-wrapper
args:
cachetime: 60
timelimit: 20
sizelimit: 250