[wplug] FileSystem problems

Vanco, Donald VANCOD at PIOS.com
Tue Jul 15 13:31:47 EDT 2003


Mike Griffin <mailto:mike at dmrnetworks.com> wrote:
> I seem to be having some problems with one of my servers and was
> wondering where to start troubleshooting the machine. I would imagine
> that it's at the hardware level.
> 
> A fileserver crashed yesterday with a kernel panic. This machine has
> been running for nearly a year, solid. I started getting errors on the
> hardrive during writes to the drive, and accessing was very slow. I
> ran maxtor utilites on the drive and it found a few problems that the
> software fixed. I reinstalled the server OS (RH7.3) and performed my
> data recovery from backups. My backups are stored on the same system
> but on a different HDD, which gets mounted with a script everynight
> and has tarballs written to it, I also ran fsck -t ext3 on this drive
> (/dev/hdb1).  I checked for a new backup this morning, and all was
> well. I just tried mounting the drive a few minutes ago and had a ton
> of bad sector attempt timeouts saying it cannot find a valid FAT
> partition. I tried to run fsck on this partition I get this as a
> result: 
> 
> [root at fileserver root]# fsck /dev/hdb1
> fsck 1.27 (8-Mar-2002)
> e2fsck 1.27 (8-Mar-2002)
> fsck.ext2: Attempt to read block from filesystem resulted in short
> read while trying to open /dev/hdb1
> Could this be a zero-length partition?
> [root at fileserver root]# fsck -t ext3 /dev/hdb1
> fsck 1.27 (8-Mar-2002)
> e2fsck 1.27 (8-Mar-2002)
> fsck.ext3: Attempt to read block from filesystem resulted in short
> read while trying to open /dev/hdb1
> Could this be a zero-length partition?
> 
> These are two different ATA drives. One is a 20G and one is a 10G. I
> thought it was kind of weird that this would happen to both drives one
> day apart. possibly a controller problem on the motherboard?

	Are all your cables (power and data) in good shape and well seated?
	Any chance there's been a change in quality of the power?
	Do BIOS and kernel see the drive geometry in like fashion? (there
was a time, circa RH6.1 or .2, that fdisk added an "extra" cylinder to the
drive - fun!)
	Do you have a "like system" you can swap the drives into to see how
they behave?

	IMHO - using IDE HDD tools to "fix" a drive that's reporting media
errors is like putting a band-aid on a leper.  Underneath, it's still
rotten.  If you've got a drive spitting bad sector errors it's time to
$h!tcan the drive and get a new one... MaxTools may delay death, but it's
still terminal, and data loss is almost assured.

Don



More information about the wplug mailing list