Issue with two Samsung HD400LJ in a RAID1 configur

maximehilaire
New Member

Posts: 15

Issue with two Samsung HD400LJ in a RAID1 configur Jan 3, 2007 12:25:38 GMT 7

Quote

Post by maximehilaire on Jan 3, 2007 12:25:38 GMT 7

I have two Samsung HD400LJ in a RAID1 configuration (381,554Mb).

They seemed to be ok... until i changed to 2.1.03, yesterday.
(I got my N2100 3 days ago)
Now, the status is "Degraded"

hereunder , what i did :
1 - Change Firmware to 2.1.03 (OK)
2 - Try to reconstruct RAID 1 (rmq 32 from the 'Thecus download center')
3 - The result : always stop at 45.1% and automaticly re-try
4 - i Suppressed the partitions (thanks to my pc)
5 - put back the disks in my N2100
6 - try to reconstruct RAID 1
7 - result : wait a long time at the same place, but finished with the status 'Healthy'
8 - Unfortunately, a short time later, the status was 'DEGRATED'

Did you exprecienced the same issue ? Did you think it might come from the disk ?

maximehilaire
New Member

Posts: 15

Issue with two Samsung HD400LJ in a RAID1 configur Jan 3, 2007 12:45:29 GMT 7

Quote

Post by maximehilaire on Jan 3, 2007 12:45:29 GMT 7

getmythe said:

In case you have access via ssh try this command "mdadm --detail /dev/md0". It will give you a more detailed answer of what might be wrong with your drives. What does the Status line report?

Hi,
The result of this command is below...
At this moment my situation is :
- reinstall all (it's the second time)
- succeeded to obtain RAID 1 'Healthy' (as at the first time)
- but ... at first reboot N2100 detects 'DEGRADED' and try to recovery and stop à 2.2 % ... and try again and again
mdadm --detail /dev/md0
/dev/md0:
Version : 00.90.01
Creation Time : Tue Jan 2 17:46:16 2007
Raid Level : raid1
Array Size : 389198656 (371.17 GiB 398.54 GB)
Device Size : 389198656 (371.17 GiB 398.54 GB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 0
Persistence : Superblock is persistent

Update Time : Wed Jan 3 06:32:04 2007
State : clean, degraded, recovering
Active Devices : 1
Working Devices : 2
Failed Devices : 0
Spare Devices : 1

Rebuild Status : 1% complete

UUID : 5f36a92f:85262024:da0ed1c8:e2f3f494
Events : 0.5386

Number Major Minor RaidDevice State
0 0 0 - removed
1 8 18 1 active sync /dev/sdb2

2 8 2 0 spare rebuilding /dev/sda2

madru
New Member

Posts: 5

Issue with two Samsung HD400LJ in a RAID1 configur Jan 3, 2007 14:28:44 GMT 7

Quote

Post by madru on Jan 3, 2007 14:28:44 GMT 7

I use use 2.1.03 with the same disk configuration without any problems (OK, the n2100 is brand new

)

I reconstructed the RAID 3 times yesterday for test purpose, is it possible that your drives have different FW on board and/or the bad block marking does not work ....... ?

N2100:~# mdadm --detail /dev/md0
/dev/md0:
Version : 00.90.01
Creation Time : Tue Jan 2 18:35:39 2007
Raid Level : raid1
Array Size : 389198656 (371.17 GiB 398.54 GB)
Device Size : 389198656 (371.17 GiB 398.54 GB)
Raid Devices : 2
Total Devices : 2
Preferred Minor : 0
Persistence : Superblock is persistent

Update Time : Wed Jan 3 08:19:08 2007
State : clean
Active Devices : 2
Working Devices : 2
Failed Devices : 0
Spare Devices : 0

UUID : 4e90ac16:873036be:958c91c3:8299d8cc
Events : 0.8570

Number Major Minor RaidDevice State
0 8 2 0 active sync /dev/sda2
1 8 18 1 active sync /dev/sdb2
N2100:~#

oreos
Junior Member

Posts: 57

Issue with two Samsung HD400LJ in a RAID1 configur Jan 3, 2007 16:54:05 GMT 7

Quote

Post by oreos on Jan 3, 2007 16:54:05 GMT 7

I have the same discs in raid 1 too and had do issues whatsoever since FW 2.1.00, I now run also the latest one. The only thing that comes to mind could be perhaps incorrect jumper settings (M, S, CS) on the HDs? If you have no important data I would also try to delete the Raid completely and build a new raid again.

Good luck.

maximehilaire
New Member

Posts: 15

Issue with two Samsung HD400LJ in a RAID1 configur Jan 5, 2007 17:15:47 GMT 7

Quote

Post by maximehilaire on Jan 5, 2007 17:15:47 GMT 7

madru said:

I use use 2.1.03 with the same disk configuration without any problems (OK, the n2100 is brand new

)

Thank madru,
My N2100 is a new one (I bought it as a new one ;-) ) :SN : 00-14-FD-10-0B-AE AF or N2100S00632)

Hereunder the thecus support answer ... and my new questions to them :

Thecus Support Tech a écrit :
> Dear Customer,
> Thank you for purchased Thecus N2100!!
> In case your RAID status was always "DEGRADE", we would suggest you should remove the faulty hard
> drive and check the status of the faulty drive on the PC through HDD diagnostic tool.
[/qoute]
Hi and happy new year,
Thank for your advise.... but I already did that and didn't find special issues with the HDDs.
I suppressed the partitions and tryed again, but the result is always the same :
1 - Healthy at the end of the RAID 1 construction
2 - Degraded after the first reboot
This morming I tryed and succeeded in RAID 0 Creation. It seems to run well after the first reboot and i put already 25GB as a test.
... but i bought N2100 for its RAID 1 option (not for RAID 0)
1 That is your advise to get the RAID 1 operational ?
2 If you though it may came from Hardware, I probably can obtain the remplacement of the N2100
3 If you thought it came from the HDD models, could you advise me for supported models
4 if you thought it came from the firmware, did you believe that 2.0.013beta might be a solution
Thanks for yours answers.

maximehilaire
New Member

Posts: 15

Issue with two Samsung HD400LJ in a RAID1 configur Jan 5, 2007 17:19:10 GMT 7

Quote

Post by maximehilaire on Jan 5, 2007 17:19:10 GMT 7

oreos said:

I have the same discs in raid 1 too and had do issues whatsoever since FW 2.1.00, I now run also the latest one. The only thing that comes to mind could be perhaps incorrect jumper settings (M, S, CS) on the HDs? If you have no important data I would also try to delete the Raid completely and build a new raid again.
Good luck.

Thanks oreos and happy new year,
I'm afraid there is no more jumper on my S-ata disks ....

oreos Junior Member Posts: 57	Issue with two Samsung HD400LJ in a RAID1 configur Jan 6, 2007 18:16:42 GMT 7 Quote Select Post Deselect Post Link to Post Member Give Gift Back to Top Post by oreos on Jan 6, 2007 18:16:42 GMT 7 So true, stupid comment on my side... Had this issue in the past with IDE and forgot that SATA were more intelligent on that side.

dbridges
Senior Member

Posts: 448

Issue with two Samsung HD400LJ in a RAID1 configur Jan 6, 2007 18:52:41 GMT 7

Quote

Post by dbridges on Jan 6, 2007 18:52:41 GMT 7

This is probably a bit out there but have you tried building a raid0 array on the disks and then rebuilding the raid 1 from scratch.

The raid 0 build will completely wipe the disks and allow you to build the raid 1 from fresh disks rather than trying to recover a degraded system (which obviously isn't working)

Out of curiosity... Have you got any modules installed??? and which ones???

Edit:
Just re-read your first post. Is it always the same disk that's failing? What happened when you swapped them over?

Last Edit: Jan 6, 2007 18:55:07 GMT 7 by dbridges

maximehilaire
New Member

Posts: 15

Issue with two Samsung HD400LJ in a RAID1 configur Jan 8, 2007 5:39:31 GMT 7

Quote

Post by maximehilaire on Jan 8, 2007 5:39:31 GMT 7

Hi dbridges,

dbridges said:

This is probably a bit out there but have you tried building a raid0 array on the disks and then rebuilding the raid 1 from scratch.

The raid 0 build will completely wipe the disks and allow you to build the raid 1 from fresh disks rather than trying to recover a degraded system (which obviously isn't working)

Yes, i did.

dbridges said:

Edit:
Just re-read your first post. Is it always the same disk that's failing? What happened when you swapped them over?

no it changes when i swap them over ;
Today i did two experiences :
i put them in my linux PC (fedora core 5) :
- I create one partition via fdisk
- then a " mkfs -V -t ext3 -c /dev/sdb1" (-c : verify corrupted blocks.)
for the 2 disks the mkfs stops at the same place as in the N2100 (one at 45 % , the other at 2,5 %). they didn't stop completly but the progress was very very slow (and i have to stop them)

I put then in my XP PC :
- i create a partition : no problem
- i forced a CHKDSK : no problem

My diag, at this moment, is ..... not clear

dbridges said:

Out of curiosity... Have you got any modules installed??? and which ones???

- ROOTPSW.mod
- SSH.mod

dbridges Senior Member Posts: 448	Issue with two Samsung HD400LJ in a RAID1 configur Jan 8, 2007 6:39:07 GMT 7 Quote Select Post Deselect Post Link to Post Member Give Gift Back to Top Post by dbridges on Jan 8, 2007 6:39:07 GMT 7 I was asking about modules as a long shot but your FC tests would suggest that there's something wrong with the drives.

maximehilaire
New Member

Posts: 15

Issue with two Samsung HD400LJ in a RAID1 configur Jan 9, 2007 2:03:35 GMT 7

Quote

Post by maximehilaire on Jan 9, 2007 2:03:35 GMT 7

dbridges said:

I was asking about modules as a long shot but your FC tests would suggest that there's something wrong with the drives.

I just got two drives 40GB for a test period, thanks to my company.
I will try with this night ....

Result (this morning) :
With those Western Digital 40GB HDD, all works fine ; I tested in a RAID 1 config
Western Digital 40GB WD400.
Drive parameters : LBA 78125000

=============================
Today I got a mail from the Thecus Support. They proposed me to get control on my N2100 through internet.
I hope they could find what is wrong with my 400GB Samsung HDD !

Last Edit: Jan 10, 2007 5:24:15 GMT 7 by maximehilaire

maximehilaire
New Member

Posts: 15

Issue with two Samsung HD400LJ in a RAID1 configur Feb 1, 2007 3:54:27 GMT 7

Quote

Post by maximehilaire on Feb 1, 2007 3:54:27 GMT 7

maximehilaire said:

Today I got a mail from the Thecus Support. They proposed me to get control on my N2100 through internet.
I hope they could find what is wrong with my 400GB Samsung HDD !

20 days later .... Thecus Support helped me to diag : many tests, boots and reboots, disks position exchanges ...

Finally they concluded that one of my new disks has some defaults and the best were to run diag from Samsung.
For Information, they used smartctl (http://smartmontools.sourceforge.net) to do their diags.
I got the samsung disk utility(HUTIL.exe) , but the only useful thing i found to do with this utility was a "low level format" (15 hours each !!!).
After that, I try to obtain from Thecus the smartctl parameters to re-do the smartctl tests by myself...

But, without answer, I put back, a week ago, the 2 disks inside my N2100 ... and now It's OK

Last Edit: Feb 1, 2007 4:01:45 GMT 7 by maximehilaire

Issue with two Samsung HD400LJ in a RAID1 configur

Post by maximehilaire on Jan 3, 2007 12:25:38 GMT 7

Post by maximehilaire on Jan 3, 2007 12:45:29 GMT 7

Post by madru on Jan 3, 2007 14:28:44 GMT 7

Post by oreos on Jan 3, 2007 16:54:05 GMT 7

Post by maximehilaire on Jan 5, 2007 17:15:47 GMT 7

Post by maximehilaire on Jan 5, 2007 17:19:10 GMT 7

Post by oreos on Jan 6, 2007 18:16:42 GMT 7

Post by dbridges on Jan 6, 2007 18:52:41 GMT 7

Post by maximehilaire on Jan 8, 2007 5:39:31 GMT 7

Post by dbridges on Jan 8, 2007 6:39:07 GMT 7

Post by maximehilaire on Jan 9, 2007 2:03:35 GMT 7

Post by maximehilaire on Feb 1, 2007 3:54:27 GMT 7