Canonical
Environment of an Enstore System:
5 Server
PCs
0: Runs the file and volume databases –
requires 4 large SCSI disks, software raid
1: Runs the pnfs servers – requires 4 large
SCSI disks, software raid
2: Runs config, alarm, log, inquisitor server
(+other related tasks)
runs the web server for the
system
requires 2 large ide disks
3: Runs the
console server and admin integrity checks
4: Runs the
library manager and media changes
1 Mover PC per tape drive – scalable unit.
1 PC/drive is
simpler to manage operationally than multiple drives and is not a big expense
Misc. Equipment necessary for operating a robust system
Power
switches, Console monitor, Cyclades pods, 2 serial cables per node,
blowers, etc…
All PCs are currently Intel Lancewood systems:
PIII-800 L440GX System: Intel L440GX MB,
2 x 800Mhz PIII Proc., (450 or better is fine)
1 GB ECC SDRAM MEM, (memory is so cheap now, why get
less?)
Sys Disk 8 to 15 GB EIDE,
Mid Tower Case, (this is not crucial, 1-2U unit are fine)
Floppy Disk Drive
No Keyboard (ok if there is one)
No CDROM, (ok if there is one)
Minimum 300 W Power Supply.
They were last ordered on Sep 26. 2001 on PO #540646. ($1645/PC was last
price.) We are aware that this hardware
is end-of-life and are researching a stable replacement.
All nodes
support Fast Ethernet. We use the
on-board FE port. We are exploring
replacing fast ethernet with gigabit ethernet on a new motherboard.
All tapes drives are the low voltage differential SCSI
variety. We use the on-board SCSI
device.
All PCs
are running a 2.2.16 kernel with the following changes:
1.
Multiple
interface arp change – only arp on selected interface
2.
Reboot
after memory error
3.
Large
IDE support
4.
Stop
after a non-initiated scsi bus reset
5.
Raid
patches, on nodes that need them
6.
St
patches to allow operation of LTO rives, on nodes that need them
ACSLS, the
STK robot controller and server, runs on a Sun node and is completely managed
by STK, including backups of the system software and tape database. The Sun node is on a back-end network to
prevent hacking or malicious tampering.
All PCs
are monitored for temperature, node configuration, network compliance, etc.
The rest
of the software is a hybrid of Fermi RH 5.2 and 6.2 and directly built from
source code. We are currently
investigating an upgrade to Fermi RH 7.1. (and beyond) to reach a new stable
base-code platform.
All system
software is backed up to remote disks and occasionally to tape. There is a clone procedure to bring new PCs
into the system.
Enstore
mover and server software is updated via CVS from the hppc cvs repository. When bugs are found, they are fixed, tested
on a test stand and then put into production as rapidly as possible.
Enstore
depends on these external products:
Python – we use our version, downloaded from python.org. The major changes are (1)No built-in libdb support, (2) static options to allow freeze process for encp to work
FTT – we use v2_10
SWIG – we use v1_1p5 (primarily for
our interface to robot software and tape drives)
Other standard products used for
producing documentation
Internally,
Enstore uses BerkeleyDB for databases, HTMLgen for web pages, gadfly as a
linter. All of these packages come with
the enstore cvs distribution.