Need help recovering from 6404 Raid controller failure

View: New views
6 Messages — Rating Filter:   Alert me  

Need help recovering from 6404 Raid controller failure

by Barry Friedman :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

OS:  FreeBSD 7.0-RELEASE-p2
Machine:  HP Proliant DL380G4
SCSI Raid Controller: 6404

The situation is that one disk of a raid5 array failed.  When the
machine boots, the controller tries to initiate a rebuild onto the spare
disk.  The machine boots and within a short time the system resets.
(This is the same problem that I had with this machine a while back,
however, reseating cards does not help)

I need to find out how to run the disk rebuild off line, i.e. what
software is required and what is the procedure.  HP only lists ACU
for Linux and windows systems.  What do you use with a FreeBSD machine?

Also,  if I can get access to another system is it possible to do an
image copy of the good disks so that I can get a backup before proceeding
with the rebuild?

Any help appreciated.

Regards,
--
Barry Friedman                        
Emax Computer Systems Inc., 480 Tweedsmuir Ave., Ottawa, Ont. Canada K1Z 5N9
bfriedman@...    Phone: (613) 725-3198  Fax: 725-0298  
_______________________________________________
freebsd-proliant@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-proliant
To unsubscribe, send any mail to "freebsd-proliant-unsubscribe@..."

Parent Message unknown RE: Need help recovering from 6404 Raid controller failure

by RA Cohen-2 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

I would try and move the entire array to another functioning machine and
let it rebuild there. I wish there were some offline utilities to help
but I'm not aware of any. Good Luck!

--
Roy A Cohen
Network Advantage LLC
www.net-vantage.com
413.223.9007 option 1
--------------------------------------------------
"Bringing Cost-Saving, State-of-the-Art Technology
Solutions to Small and Mid-Size Organizations"

_______________________________________________
freebsd-proliant@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-proliant
To unsubscribe, send any mail to "freebsd-proliant-unsubscribe@..."

Re: Need help recovering from 6404 Raid controller failure

by Ulf Zimmermann :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

On Wed, Oct 21, 2009 at 09:22:17AM -0400, Barry Friedman wrote:
> OS:  FreeBSD 7.0-RELEASE-p2
> Machine:  HP Proliant DL380G4
> SCSI Raid Controller: 6404
>
> The situation is that one disk of a raid5 array failed.  When the
> machine boots, the controller tries to initiate a rebuild onto the spare
> disk.  The machine boots and within a short time the system resets.
> (This is the same problem that I had with this machine a while back,
> however, reseating cards does not help)

You can just let it sit in the BIOS of the SmartArray controller it will
rebuild there too. Or you can download the SmartStart CD and boot it.
Run the Raid management tool from there to see progress.

>
> I need to find out how to run the disk rebuild off line, i.e. what
> software is required and what is the procedure.  HP only lists ACU
> for Linux and windows systems.  What do you use with a FreeBSD machine?
>
> Also,  if I can get access to another system is it possible to do an
> image copy of the good disks so that I can get a backup before proceeding
> with the rebuild?

The disks itself carry the information about the raid, so if you have
another server with any SmartArray supporting raid5, you should be
able to plug them in there and let it rebuild. It might warn about
drive position movement and cache memory changes.

>
> Any help appreciated.
>
> Regards,
> --
> Barry Friedman                        
> Emax Computer Systems Inc., 480 Tweedsmuir Ave., Ottawa, Ont. Canada K1Z 5N9
> bfriedman@...    Phone: (613) 725-3198  Fax: 725-0298  
> _______________________________________________
> freebsd-proliant@... mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-proliant
> To unsubscribe, send any mail to "freebsd-proliant-unsubscribe@..."
>

--
Regards, Ulf.

---------------------------------------------------------------------
Ulf Zimmermann, 1525 Pacific Ave., Alameda, CA-94501, #: 510-865-0204
You can find my resume at: http://www.Alameda.net/~ulf/resume.html
_______________________________________________
freebsd-proliant@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-proliant
To unsubscribe, send any mail to "freebsd-proliant-unsubscribe@..."

Re: Need help recovering from 6404 Raid controller failure

by Barry Friedman-2 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

On Wed, Oct 21, 2009 at 7:38 PM, Ulf Zimmermann <ulf@...> wrote:

> You can just let it sit in the BIOS of the SmartArray controller it will

How do you do this?

> rebuild there too. Or you can download the SmartStart CD and boot it.
> Run the Raid management tool from there to see progress.

Thanks Ulf.   The machine is stable again after reseating all the
cards for the third time.
Will the 6404 rebuild while the machine is running or is it necessary
to run SmartStart?

dmesg shows:
ciss1: *** Physical drive failure: SCSI port 1 ID 4
ciss1: *** State change, logical drive 1
ciss1: logical drive 1 (da1) changed status recovering->OK, spare status
       0x15<configured,failed,available>

Any idea how to translate this?


Regards,
--
Barry Friedman
Emax Computer Systems Inc.,
480 Tweedsmuir Ave.,
Ottawa, Ont. Canada K1Z 5N9
Phone: (613) 725-3198  Fax: 725-0298
_______________________________________________
freebsd-proliant@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-proliant
To unsubscribe, send any mail to "freebsd-proliant-unsubscribe@..."

Re: Need help recovering from 6404 Raid controller failure

by Ulf Zimmermann :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

On Wed, Oct 21, 2009 at 07:57:40PM -0400, Barry Friedman wrote:
> On Wed, Oct 21, 2009 at 7:38 PM, Ulf Zimmermann <ulf@...> wrote:
>
> > You can just let it sit in the BIOS of the SmartArray controller it will
>
> How do you do this?

As the server goes through the bios, press F8 when it gets to the 6404 display.
You should be able to select the Info option to see the progress.

>
> > rebuild there too. Or you can download the SmartStart CD and boot it.
> > Run the Raid management tool from there to see progress.
>
> Thanks Ulf.   The machine is stable again after reseating all the
> cards for the third time.
> Will the 6404 rebuild while the machine is running or is it necessary
> to run SmartStart?

It should automatic rebuild in any powered on state.

>
> dmesg shows:
> ciss1: *** Physical drive failure: SCSI port 1 ID 4
> ciss1: *** State change, logical drive 1
> ciss1: logical drive 1 (da1) changed status recovering->OK, spare status
>        0x15<configured,failed,available>

Hmm, not 100% sure, I haven't run Proliants with FreeBSD in a while (unfortunatly).

But my guess is it marked the standby failed? If you enter the BIOS via F8
at the 6404 prompt and display the logical drive, it should show the exact
status there too.

>
> Any idea how to translate this?
>
>
> Regards,
> --
> Barry Friedman
> Emax Computer Systems Inc.,
> 480 Tweedsmuir Ave.,
> Ottawa, Ont. Canada K1Z 5N9
> Phone: (613) 725-3198  Fax: 725-0298
>

--
Regards, Ulf.

---------------------------------------------------------------------
Ulf Zimmermann, 1525 Pacific Ave., Alameda, CA-94501, #: 510-865-0204
You can find my resume at: http://www.Alameda.net/~ulf/resume.html
_______________________________________________
freebsd-proliant@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-proliant
To unsubscribe, send any mail to "freebsd-proliant-unsubscribe@..."

Re: Need help recovering from 6404 Raid controller failure

by kama-4 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message



On Wed, 21 Oct 2009, Barry Friedman wrote:

> On Wed, Oct 21, 2009 at 7:38 PM, Ulf Zimmermann <ulf@...> wrote:
>
> > You can just let it sit in the BIOS of the SmartArray controller it will
>
> How do you do this?
>
> > rebuild there too. Or you can download the SmartStart CD and boot it.
> > Run the Raid management tool from there to see progress.
>
> Thanks Ulf.   The machine is stable again after reseating all the
> cards for the third time.
> Will the 6404 rebuild while the machine is running or is it necessary
> to run SmartStart?
>
> dmesg shows:
> ciss1: *** Physical drive failure: SCSI port 1 ID 4
> ciss1: *** State change, logical drive 1
> ciss1: logical drive 1 (da1) changed status recovering->OK, spare status
>        0x15<configured,failed,available>
>
> Any idea how to translate this?

For what I can read out about it it says: (Its the default behavour)

Recovery to spare disk was successful. (recovering->OK)

The failed disk that was in the raid becomes spare and are now marked
failed.

When you replace the faulty disk, it will become the new spare. It will
not copy the data back to the disk in the possition that failed.

/Bjorn
_______________________________________________
freebsd-proliant@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-proliant
To unsubscribe, send any mail to "freebsd-proliant-unsubscribe@..."