why do my hard drives keep vanishing?

Currently reading:
why do my hard drives keep vanishing?

arc

this is where i stand
Joined
Oct 8, 2003
Messages
19,727
Points
3,337
Location
Manchester
Hi all.

I have;

Gigabyte 965P-DS3 (rev 3)
2x 1TB Samsung HD103UJ'S
2x WD 300gb drives in RAID0

The array holds my OS, Win7 Pro (x64). When copying large amounts of data from one 1TB drive to the other, it'll work for a while and then drive will vanish from the system. The only way to get it back is to power cycle the system.

It doesn't matter which drive i copy from / to - it is always the destination drive that will vanish.

In event viewer I get this error under Disk;

The device, \Device\Harddisk1\DR1, is not ready for access yet.

and this one under atapi;

The driver detected a controller error on \Device\Ide\IdePort4.

Obviously the numbers in the errors change to reflect which drive has vanished.

So far I have set the system back to default clock, and aimed a large fan at the chipset area as it was rather warm.

Below is the SMART data from the two drives.

Code:
----------------------------------------------------------------------------
CrystalDiskInfo 3.5.6 (C) 2008-2010 hiyohiyo
                                Crystal Dew World : http://crystalmark.info/
----------------------------------------------------------------------------

    OS : Windows 7  [6.1 Build 7600] (x64)
  Date : 2010/05/16 19:59:52

-- Controller Map ----------------------------------------------------------
 + Standard Dual Channel PCI IDE Controller [ATA]
   + ATA Channel 0 (0)
     - SAMSUNG HD103UJ ATA Device
   + ATA Channel 1 (1)
     - SAMSUNG HD103UJ ATA Device
 + Standard AHCI 1.0 Serial ATA Controller [ATA]
   + ATA Channel 0 (0)
     - PIONEER DVD-RW  DVR-112D ATA Device
   - ATA Channel 1 (1)
   - ATA Channel 4 (4)
   - ATA Channel 5 (5)
 + JMicron JMB36X Controller [SCSI]
   - GRAID  SCSI Disk Device
 + Virtual CloneDrive [SCSI]
   - ELBY CLONEDRIVE SCSI CdRom Device
   - ELBY CLONEDRIVE SCSI CdRom Device

-- Disk List ---------------------------------------------------------------
 (1) SAMSUNG HD103UJ : 1000.2 GB [1-3-0, pd1]
 (2) SAMSUNG HD103UJ : 1000.2 GB [2-4-0, pd1]

----------------------------------------------------------------------------
 (1) SAMSUNG HD103UJ
----------------------------------------------------------------------------
           Model : SAMSUNG HD103UJ
        Firmware : 1AA01113
   Serial Number : S13PJ1CQ727059
       Disk Size : 1000.2 GB (8.4/137.4/1000.2)
     Buffer Size : 32767 KB
     Queue Depth : 32
    # of Sectors : 1953523055
   Rotation Rate : Unknown
       Interface : Serial ATA
   Major Version : ATA/ATAPI-7
   Minor Version : ATA8-ACS version 3b
   Transfer Mode : SATA/300
  Power On Hours : 6172 hours
  Power On Count : 785 count
     Temparature : 28 C (82 F)
   Health Status : Good
        Features : S.M.A.R.T., APM, AAM, 48bit LBA, NCQ
       APM Level : 0000h [OFF]
       AAM Level : FE00h [OFF]

-- S.M.A.R.T. --------------------------------------------------------------
ID Cur Wor Thr RawValues(6) Attribute Name
01 100 100 _51 000000000000 Read Error Rate
03 _77 _77 _11 000000001E28 Spin-Up Time
04 _99 _99 __0 000000000320 Start/Stop Count
05 100 100 _10 000000000000 Reallocated Sectors Count
07 100 100 _51 000000000000 Seek Error Rate
08 100 100 _15 000000000000 Seek Time Performance
09 _99 _99 __0 00000000181C Power-On Hours
0A 100 100 _51 000000000000 Spin Retry Count
0B 100 100 __0 000000000000 Recalibration Retries
0C _99 _99 __0 000000000311 Power Cycle Count
0D 100 100 __0 000000000000 Soft Read Error Rate stab
B7 100 100 __0 000000000000 Unknown
B8 __1 __1 _99 000000000176 End-to-End Error
BB 100 100 __0 000000000000 Reported Uncorrectable Errors
BC 100 100 __0 000000000000 Command Timeout
BE _71 _65 __0 00001D1D001D Airflow Temperature
C2 _72 _64 __0 00001D1C001C Temperature
C3 100 100 __0 0000000038E2 Hardware ECC recovered
C4 100 100 __0 000000000000 Reallocation Event Count
C5 100 100 __0 000000000000 Current Pending Sector Count
C6 100 100 __0 000000000000 Uncorrectable Sector Count
C7 100 100 __0 00000000000B UltraDMA CRC Error Count
C8 100 100 __0 000000000005 Write Error Rate
C9 253 253 __0 000000000000 Soft Read Error Rate

----------------------------------------------------------------------------
 (2) SAMSUNG HD103UJ
----------------------------------------------------------------------------
           Model : SAMSUNG HD103UJ
        Firmware : 1AA01113
   Serial Number : S13PJ1CQ727058
       Disk Size : 1000.2 GB (8.4/137.4/1000.2)
     Buffer Size : 32767 KB
     Queue Depth : 32
    # of Sectors : 1953525168
   Rotation Rate : Unknown
       Interface : Serial ATA
   Major Version : ATA/ATAPI-7
   Minor Version : ATA8-ACS version 3b
   Transfer Mode : SATA/300
  Power On Hours : 6167 hours
  Power On Count : 783 count
     Temparature : 28 C (82 F)
   Health Status : Good
        Features : S.M.A.R.T., APM, AAM, 48bit LBA, NCQ
       APM Level : 0000h [OFF]
       AAM Level : FE00h [OFF]

-- S.M.A.R.T. --------------------------------------------------------------
ID Cur Wor Thr RawValues(6) Attribute Name
01 100 100 _51 000000000000 Read Error Rate
03 _77 _77 _11 000000001E3C Spin-Up Time
04 _99 _99 __0 00000000031B Start/Stop Count
05 100 100 _10 000000000000 Reallocated Sectors Count
07 253 253 _51 000000000000 Seek Error Rate
08 100 100 _15 000000000000 Seek Time Performance
09 _99 _99 __0 000000001817 Power-On Hours
0A 100 100 _51 000000000000 Spin Retry Count
0B 100 100 __0 000000000000 Recalibration Retries
0C _99 _99 __0 00000000030F Power Cycle Count
0D 100 100 __0 000000000000 Soft Read Error Rate stab
B7 100 100 __0 000000000000 Unknown
B8 __1 __1 _99 000000000096 End-to-End Error
BB 100 100 __0 000000000000 Reported Uncorrectable Errors
BC 100 100 __0 000000000000 Command Timeout
BE _72 _66 __0 00001C1C001C Airflow Temperature
C2 _72 _65 __0 00001D1C001C Temperature
C3 100 100 __0 000000099097 Hardware ECC recovered
C4 100 100 __0 000000000000 Reallocation Event Count
C5 100 100 __0 000000000000 Current Pending Sector Count
C6 100 100 __0 000000000000 Uncorrectable Sector Count
C7 100 100 __0 000000000001 UltraDMA CRC Error Count
C8 100 100 __0 000000000002 Write Error Rate
C9 253 253 __0 000000000000 Soft Read Error Rate

anyone any idea why this might be happening?

ok i've run HDTune and on (B8) (unknown attribute) Its saying Failed. Threshold is 99, data is 374 on one drive 150 on the other.

translates too; 184 End-to-End error Number of parity errors during transfer between the cache RAM and the host.
 
if is there in disk managment, can you not just re-make the partition tables etc on it? and reallocate it a drive letter?

dunno,i was wary of messing round with a working boot partition
im dual boot with vista/W7 on a partitioned drive
drivbe shows in disk management
if i boot vista then then all the drives show under vista
so it must be a W7 issue
 
I had this problem and it drove me mental for a few months. Turns out the cheap orange SATA cables that come with the gigabyte board where just not upto the job. Ive got some adaptec server cables on now and it never misses a beat.
 
Last edited:
Would suggest either cable failure (remember I used to get them all the time on IDE) and or Ram issue from the comments you mentioned about the error log.
 
I tried using the othe two ports, same thing happened.

i've moved one of the drivers onto a usb > sata adaptor and just transfered ~500gb onto the drive. It took an age (20mb/s is slooww), but the drive didn't drop out.

Gonna strip the case down, give it all a good clean and fit new cables I think.

I don't think its RAM as it's disk cache / buffer errors, and it doesn't do it on the array at all.
 
ok, new cable. Same deal.

Tried another cable, same deal.

Tried running the drive off a seperate powersupply, same deal.

So the drive seems fine (it works fine when copying over USB>sata). So i can only conclude the controller is messing it up.
 
i've put another drive on the controller, but it's slow (20mb/s) but its copying to it fine. but, my normal drives run around 80-90mb/s.. so this isn't much of a test
 
looks like a driver issue maybe. Could download the latest drivers, but even installing the new drivers might not clear it without a full rebuild. You know the script, files just linger about like a bad smell.

Thats why its handy to have an array fail from time to time. Forces a spring clean :) Plus you mobo is like how old?
 
it's around 3 years old, can't find any updated drivers (or drivers for that matter for the AHCI controller - these drives are not on RAID controller)

I don't need a spring clean either, i'm not Chris! This install is around 3months old!

Am going to ignore the issue for now, and next time i'm back - i'll bring a drive round to yours and we can test it?
 
Does it happen for single large files?
Does it happen to the same drive every time?
 
Not single large files, no. It happens when transfering large amounts of ~500mb files.

And no, it's the same drive. It's always when copying to either of the 1TB Samsung HD103UJ drives. The two drives were bought at the same time, and the serial numbers are consecutive. They are the only drives that are using the 'normal' AHCI onboard controller. The other two drives are on the RAID controller.
 
Back
Top