problems with gmirror on ggate over slow link

View: New views
6 Messages — Rating Filter:   Alert me  

problems with gmirror on ggate over slow link

by Pete French-2 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

[ originally sent to geom, but am throwing it open to a wider
audience as I didn;t get any replies there]

I am using 7.2-STABLE from October 7th on all amchines, but this
has been going on a while. Very simply I am mirroring together a pair
of discs, one local, one remote. The remote disc is accessed using ggate.

If the remote diisc is actually on a very close machine - e.g. a server
plugged into the same ether net - then all works fine. If I make
the remote disc somewhere actually substantially further away on the
nbetwork, however, then when I attach the disc it starts to rebuild the
mirror but then fails a fraction of a second later thus:

GEOM_MIRROR: Device mysql0: rebuilding provider ggate1a.
GEOM_MIRROR: Synchronization request failed (error=5). ggate1a[WRITE(offset=1310720, length=131072)]
GEOM_MIRROR: Device mysql0: provider ggate1a disconnected.
GEOM_MIRROR: Device mysql0: rebuilding provider ggate1a stopped.

The interesting this is that the problem is only with gmirror, not with
the underlying ggate disc which remains attached and accessible. I tested
this by adding a second partition (ggate1b in the example above) and
mounting a UFS filesystem on that.

I've looked at the kernel code briefly, but it is not clear to me
what is causing that write to fail. My conjecture would be that a buffer
somewhere is filling up, causing a write to fail, and instead of gmirror
waiting and retrying, instead it just fails the synchronisation.

Any ideas ? Is this actually a bug ? I am wondering if it would also happen
if mirroring a very fast disc against a very slow one (i.e. maybe it is
independent of ggate)

-pete.

_______________________________________________
freebsd-stable@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@..."

Re: problems with gmirror on ggate over slow link

by Oliver Brandmueller :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Hi,

On Fri, Oct 23, 2009 at 11:56:24AM +0100, Pete French wrote:

> If the remote diisc is actually on a very close machine - e.g. a server
> plugged into the same ether net - then all works fine. If I make
> the remote disc somewhere actually substantially further away on the
> nbetwork, however, then when I attach the disc it starts to rebuild the
> mirror but then fails a fraction of a second later thus:
>
> GEOM_MIRROR: Device mysql0: rebuilding provider ggate1a.
> GEOM_MIRROR: Synchronization request failed (error=5). ggate1a[WRITE(offset=1310720, length=131072)]
> GEOM_MIRROR: Device mysql0: provider ggate1a disconnected.
> GEOM_MIRROR: Device mysql0: rebuilding provider ggate1a stopped.
>
> The interesting this is that the problem is only with gmirror, not with
> the underlying ggate disc which remains attached and accessible. I tested
> this by adding a second partition (ggate1b in the example above) and
> mounting a UFS filesystem on that.

Just a wild guess, have you tried to set kern.geom.mirror.timeout to a
higher value?

- Olli

--
| Oliver Brandmueller          http://sysadm.in/         ob@... |
|                        Ich bin das Internet. Sowahr ich Gott helfe. |
_______________________________________________
freebsd-stable@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@..."

Re: problems with gmirror on ggate over slow link

by Pete French-2 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

> Just a wild guess, have you tried to set kern.geom.mirror.timeout to a
> higher value?

Yes, I tried values all the way up to 600, no effect at all - plus the
failure comes way before that timeout value (which is in seconds I assume).

-pete.
_______________________________________________
freebsd-stable@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@..."

Re: problems with gmirror on ggate over slow link

by Oliver Fromme :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Pete French wrote:
 > [...]
 > > Just a wild guess, have you tried to set kern.geom.mirror.timeout to a
 > > higher value?
 >
 > Yes, I tried values all the way up to 600, no effect at all - plus the
 > failure comes way before that timeout value (which is in seconds I assume).

Have you done any sockets tuning?
In an older posting the following values were recommended:

/etc/sysctl.conf:
net.inet.tcp.sendspace=1048576
net.inet.tcp.recvspace=1048576
kern.ipc.maxsockbuf=2049152

/boot/loader.conf:
kern.ipc.nmbclusters="32768"

Command line options to ggate[cd]:
ggate[dc]_buf_size="1310720"
ggatec_timeout="5"
ggatec_queue_size="2048"

Best regards
   Oliver

--
Oliver Fromme, secnetix GmbH & Co. KG, Marktplatz 29, 85567 Grafing b. M.
Handelsregister: Registergericht Muenchen, HRA 74606,  Geschäftsfuehrung:
secnetix Verwaltungsgesellsch. mbH, Handelsregister: Registergericht Mün-
chen, HRB 125758,  Geschäftsführer: Maik Bachmann, Olaf Erb, Ralf Gebhart

FreeBSD-Dienstleistungen, -Produkte und mehr:  http://www.secnetix.de/bsd

"C++ is over-complicated nonsense. And Bjorn Shoestrap's book
a danger to public health. I tried reading it once, I was in
recovery for months."
        -- Cliff Sarginson
_______________________________________________
freebsd-stable@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@..."

Re: problems with gmirror on ggate over slow link

by Pete French-2 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

> Have you done any sockets tuning?
> In an older posting the following values were recommended:

Yes, I need that to get the speed out of it for normal use
to a disc on a machine on the same ether - but even so, surely
it should block on a slow disc, not just abandon the mirroring ?

-pete.

_______________________________________________
freebsd-stable@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@..."

Re: problems with gmirror on ggate over slow link

by Yoshihiro Ota :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

I think you hit the same bug as I did a while ago.

http://www.freebsd.org/cgi/query-pr.cgi?pr=bin/132798

You can get a patch at PR and give a try.
Make sure you update both server and client;
otherwise, it will cause a panic or so.

Hiro

On Tue, 03 Nov 2009 16:23:24 +0000
Pete French <petefrench@...> wrote:

> > Have you done any sockets tuning?
> > In an older posting the following values were recommended:
>
> Yes, I need that to get the speed out of it for normal use
> to a disc on a machine on the same ether - but even so, surely
> it should block on a slow disc, not just abandon the mirroring ?
>
> -pete.
>
> _______________________________________________
> freebsd-stable@... mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscribe@..."
_______________________________________________
freebsd-stable@... mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscribe@..."