
Re: status of the Richmond cluster



Greetings,

I believe I have all of the library issues dealt with.

I noticed a possibly confusing behaviour that might have been the root of
some of this.

Perl depends on several libraries in /lib to run. Unlike those in
/usr/lib, they were being managed by caching rather than just being
available from NFS. It can take about a minute for the libs to be fetched
from the master. During that time, the app will appear hung, but will
eventually start.

I have pre-cached the files onto the node's local drive to try to avoid
that delay.

Since the libs are cached, once that startup penalty is paid, it doesn't
happen again for those libs on that node until reboot.
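
If you want to check whether a node has already paid that penalty, timing
two starts in a row should show it: the first run is slow only if the libs
were not yet cached. The Python sketch below is just an illustration. It
assumes perl is on the PATH of the node, and the function name is made up
for this example.

```python
import subprocess
import sys
import time


def time_startup(cmd):
    """Return the wall-clock seconds needed to run cmd to completion."""
    start = time.monotonic()
    subprocess.run(cmd, check=True, stdout=subprocess.DEVNULL)
    return time.monotonic() - start


if __name__ == "__main__":
    # Assumption: a trivial perl program is enough to pull in the libs
    # from /lib. Pass a different command on the command line if not.
    cmd = sys.argv[1:] or ["perl", "-e", "1"]
    first = time_startup(cmd)
    second = time_startup(cmd)
    print(f"first start:  {first:.2f} s")
    print(f"second start: {second:.2f} s")
```

A first start that is much slower than the second would suggest the libs
were being fetched; two quick starts would suggest they were already
cached on that node.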

You can see this happen using tcpdump (I have a binary of it in my home
directory). The libs are transferred as a stream of multicast packets.
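
As a rough sketch of how to watch for that traffic, the Python wrapper
below builds a tcpdump command line restricted to IP multicast packets.
The interface name "eth0" and the binary location are assumptions (use
the path to whichever tcpdump binary you have), and tcpdump normally
needs root to capture.

```python
import subprocess
import sys


def tcpdump_command(interface, count=20, binary="tcpdump"):
    """Build a tcpdump command line that shows only IP multicast packets."""
    return [
        binary,
        "-n",                # do not resolve addresses to names
        "-i", interface,     # interface to listen on
        "-c", str(count),    # stop after this many packets
        "ip", "multicast",   # pcap filter: IP multicast traffic only
    ]


if __name__ == "__main__":
    # Assumption: eth0 is the interface facing the master.
    interface = sys.argv[1] if len(sys.argv) > 1 else "eth0"
    subprocess.run(tcpdump_command(interface), check=True)
```

Start it on a freshly booted node, then launch perl in another shell; a
burst of packets during the apparent hang would be the libs arriving.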

Please let me know if this gets it going. If problems remain, a good
approach might be for me to make a copy of your test data and try the runs
myself until the expected results come up.

G'day,
sjames




On Thu, 14 Nov 2002, gilfoyle wrote:

> hi steven,
> 
>    i'm checking in (when there is no beam) to find out the
> status of the cluster. have the library issues been resolved?
> if so, what was the solution? i'm itching to let this thing
> get cooking.
> 
> jerry
> 
> 

-- 
-------------------------steven james, director of research, linux labs
... ........ ..... ....                     230 peachtree st nw ste 701
the original linux labs                             atlanta.ga.us 30303
      -since 1995                              http://www.linuxlabs.com
                                   office 404.577.7747 fax 404.577.7743
-----------------------------------------------------------------------