[ome-devel] CellProfiler on the cluster crashes OMERO

Frederik Grüll frederik.gruell at unibas.ch
Wed Dec 14 11:26:41 GMT 2016


Hi Simon,

Thank you for your offer to have a look. I uploaded all the logs and the
output of the commands in a zip file "debug.zip".

I started my cluster jobs at 9:14, and OMERO was restarted at 10:49 on
Dec 14 2016. All times are CET.

Best regards

Frederik


On 13.12.2016 14:41, Simon Li wrote:
> Hi Frederik
>
> Could you give us your server configuration and diagnostics:
>
>     omero config get --hide-password
>     omero admin diagnostics
>
> It would also be helpful if we could see your logs for all OMERO
> services, not just Blitz. Would you mind uploading them to
> https://www.openmicroscopy.org/qa2/qa/upload/
> <https://www.openmicroscopy.org/qa2/qa/upload/> and giving us the
> timestamp of when the problem first arises following a restart?
>
> Best wishes
>
> Simon
>
>
> On 13 December 2016 at 10:49, Frederik Grüll
> <frederik.gruell at unibas.ch <mailto:frederik.gruell at unibas.ch>> wrote:
>
>     Dear all,
>
>     I am using CellProfiler on our cluster to process plates for
>     screening.
>     The images are fetched from OMERO with the CellProfiler-OMERO
>     integration. A typical job consists of a command like this:
>
>     cellprofiler -b -p Entry-pipeline_omero.cpproj -c -r -o $OUT_DIR -t
>     $TMPDIR -f $FIRST_IMAGE_SET -l $LAST_IMAGE_SET --data-file
>     plate_303_iids.csv -d $DONE_FILE --omero-credentials
>     host=omero.biozentrum.unibas.ch
>     <http://omero.biozentrum.unibas.ch>,port=4064,session-id=33c6118d-f8b2-4ac2-adb2-12d48ae37a2f
>
>     When I run about 20 jobs in parallel, performance looks good at the
>     beginning, only limited by the performance of CellProfiler and not by
>     the I/O with OMERO. The plate I am processing has 2400 sites with
>     three
>     channels and the OMERO IDs are in the CSV file plate_303_iids.csv
>     that I
>     generated before. A job processes 50 image sets, selected with
>     $FIRST_IMAGE_SET and $LAST_IMAGE_SET. The results of the pipeline are
>     correct.
>
>     However, after about 4/5 of the images have been processed, OMERO
>     becomes very slow. The load on the OMERO server reaches 10, with the
>     Java process for Blitz consuming 10 cores. Eventually, my CellProfiler
>     jobs will loose connection ("JavaException:
>     Ice.ConnectionLostException"), and OMERO recovers in a few cases or
>     otherwise the CPU load falls back to normal, but OMERO needs to be
>     restarted anyway.
>
>     If I run more than 20 jobs in parallel, I would occasional get an
>     error
>     message "ome.conditions.OverUsageException: servantsPerSession reached
>     for 05dbc314-3030-40af-8e72-68b3688e8c94: 10000" after CellProfiler
>     processed only 1665 single-channel images, implying 6 servants per
>     image
>     per channel.
>
>     I have already had a look into the logs, especially Blitz-0.log, but
>     could not find a reason why OMERO would become so slow after a while.
>     Jstat indicates that all time is spend on garbage collection. Our
>     OMERO
>     server has 250GB of RAM with omero.jvmcfg.percent.blitz=40.
>
>     Where else could I look into to find the cause and prevent the
>     degradation in performance? I use OMERO.server 5.2.5 with OpenJDK
>     version 1.8.0_65 and CellProfiler 2.2.0 with Oracle Java 1.8.0_92.
>
>     Cheers and thank you for your time,
>     Frederik
>
>     --
>     Dr. Frederik Grüll | Image Analysis Specialist | G1055, Biozentrum,
>     University of Basel | Klingelbergstr. 50/70 | CH-4056 Basel Phone: +41
>     (61) 207 2250 | frederik.gruell at unibas.ch
>     <mailto:frederik.gruell at unibas.ch> | www.biozentrum.unibas.ch
>     <http://www.biozentrum.unibas.ch>
>
>
>     _______________________________________________
>     ome-devel mailing list
>     ome-devel at lists.openmicroscopy.org.uk
>     <mailto:ome-devel at lists.openmicroscopy.org.uk>
>     http://lists.openmicroscopy.org.uk/mailman/listinfo/ome-devel
>     <http://lists.openmicroscopy.org.uk/mailman/listinfo/ome-devel>
>
>
>
> The University of Dundee is a registered Scottish Charity, No: SC015096
>
>
> _______________________________________________
> ome-devel mailing list
> ome-devel at lists.openmicroscopy.org.uk
> http://lists.openmicroscopy.org.uk/mailman/listinfo/ome-devel

-- 
Dr. Frederik Grüll | Image Analysis Specialist | G1055, Biozentrum,
University of Basel | Klingelbergstr. 50/70 | CH-4056 Basel Phone: +41
(61) 207 2250 | frederik.gruell at unibas.ch | www.biozentrum.unibas.ch
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openmicroscopy.org.uk/pipermail/ome-devel/attachments/20161214/294c2480/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 473 bytes
Desc: OpenPGP digital signature
URL: <http://lists.openmicroscopy.org.uk/pipermail/ome-devel/attachments/20161214/294c2480/attachment.asc>


More information about the ome-devel mailing list