Working Schedule for UPS test on July 14th, 2008 The "Plan" This is a rough plan of actions which need to be performed on that day (all times are CEST=UTC 0200) ...
How to add a new host (salt era) This example will use einstein12 as a sample machine which before was known as ra15. Before you begin, you need to have ssh agent...
HTCondor configuration updates in 2015 (1) Using cgroups to softly enforce memory and core limits Reasoning In the past, we either relied on users' jobs to obey...
Configuration Management (primer/summary/brainstormer) What's out there? These are not really meant for configuration mgmt (alone) and have their strengths somew...
Principle The node number a particular file lives on is determined by its filename: take the first six digits of the md5sum of the name, take this as a hex number...
With 1340 nodes it might be wise to split services among many boxes to ensure that not half of the cluster is waiting to a single server to serve data through its...
This explains how to get FreeDOS working with TCP/IP and ssh. The Problem Sometimes, it is necessary to flash the BIOS, the IPMI card or to set the BIOS. For som...
FAI Installation Category:fai Fai Installation at the Max Planck Institute This command install FAI with all dependences..!!! aptitude install fai quickstart P...
what happens before the FAI installation * client * server * provides * * admin action * NICs do a DHCP request (BIOS default ) DHCP server IP addre...
FAI Jessie set up 1 base install via old fai jessie 1 base minimal config via salt 1 echo 'deb http://repo.atlas.local/reprepro fai contrib' /etc/apt/...
RFC: Classes list for FAI/Lenny The Etch installation scheme with our FAI server used one class for each type of node (NODE_COMPUTE, STORAGE, ...) but this is som...
Faimond faimond catches installation messages on port 4711 sent by the clients. The clients use natcat deliver the current status of the installation tasks like...
For Users * General Introduction for Users * Useful Items * How ATLAS stores files * ErrorMessages and how to fix them (not updated) General Document...
Collection of HowTos OS hangs Try to reset node. OS hangs, even after reset Possible causes 1. hdd broken look if everything is well wired, change hdd, ma...
Create Service Data File for 6780 In case a hard drive is about to fail (or has already failed), Oracle support needs one special file collection, to create this,...
HSM file system check stats (last update: 2016 07 26T18:42Z) Planned steps (starting at 2016 07 26T11:00Z): 1 Issuing condor_hold to all jobs on all submit hos...
Main.HenningFehrmann 24 Jul 2008 abstract This page contains detected symptoms and the corresponding hardware problems. It is based on experiences. See the list ...
HSM upgrade Current status 2014 02 01T12:33Z: samfsdump/final backup stopped due to too many non archived files. Rushing to archive those as fast as possible. 20...
Logcheck mail locations, related scripts and other mail locations on postfix server Log mail location on postfixserver logadmin account 1. Normal logcheck mail...
Trying to get iPXE as the default method to netinstalls working (based on http://ipxe.org/howto/chainloading and https://doc.rogerwhittaker.org.uk/ipxe installati...
Netboot This is a simple description how to boot over a network using kernel on the remote server. Server side configuration To proivde net boot capabilities, yo...
NodeTests Tasks to do: Initial work (HP) Manual work * Blank disk of node, wipe by: dd if=/dev/zero of=/dev/sda; sync * Put MAC address into DHCP table o...
Abstract This documentation describes server, the corresponding functions and the location. Table of Server name location function * ip * FAI manage...
IP Scheme for Servers (Lenny) This is an IP Scheme for Lenny Servers. For an IP Scheme of the Management and Data Network, refer here. For a scheme of Etch server...
IP Scheme for Servers (Squeeze) This is an IP Scheme for Squeeze Servers. For an IP Scheme of the Management and Data Network, refer here. For a scheme of Etch se...
IP Scheme for Servers (Etch) This is an IP Scheme for Servers. For an IP Scheme of the Management and Data Network, refer here. For a scheme of Lenny Servers, re...
Services we want/need to offer Please write down a list of services which we want/need to host along with the contact person who requested this and a brief descri...
Shutdown priorities The following list puts priorities on computers, equipment and other items of interest. Computers in racks, which will stay powered up, should...
Softupdate Softupdate runs through the fai installation and performs all the changes which have been made after the installation process. On the client side f...
Work planned for cluster shutdown on 2013 01 15 shutdown plan The following services will be shut down 1 all compute nodes possibly with the exception of "r...