[ome-users] OMERO with multiple data directories

Josh Moore josh at glencoesoftware.com
Tue Mar 27 10:51:51 BST 2018


Hi Bene,


On Tue, Mar 27, 2018 at 11:03 AM, Benjamin Schmid
<benjamin.schmid at fau.de> wrote:
> Hi again,
>
> as I have no experience with LVM, are there any caveats using/configuring
> it?
> The only parameter I came across is the PE size, and obviously it doesn't
> matter too much in lvm2.
>
> Is there anything else I should pay attention to?

You can see the LVM configuration for the Demo server in
https://github.com/openmicroscopy/prod-playbooks/blob/6c42f443ede53c828b88f65901fd0a111b9b9b89/ome-demoserver.yml#L84

Previously, we were running the Dundee production server on LVM, but
now only the database is on LVM since a move of the data directories
to GPFS. See https://github.com/openmicroscopy/prod-playbooks/blob/6c42f443ede53c828b88f65901fd0a111b9b9b89/ome-dundeeomero.yml#L26
if interested.
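
Since hard linking on LVM comes up further down the thread, one cheap sanity check on any freshly created volume is to create a file, hard-link it, and confirm that both names share an inode. The following is only a minimal Python sketch, assuming the volume is already mounted and writable by the current user; the function name `hardlinks_work` is made up for illustration and is not part of OMERO.

```python
import os
import tempfile


def hardlinks_work(directory):
    """Return True if a hard link can be created inside `directory`.

    Hypothetical helper for illustration; it only touches a scratch
    subdirectory that is removed again afterwards.
    """
    with tempfile.TemporaryDirectory(dir=directory) as scratch:
        source = os.path.join(scratch, "source.dat")
        link = os.path.join(scratch, "link.dat")
        with open(source, "wb") as handle:
            handle.write(b"test")
        try:
            os.link(source, link)
        except OSError:
            # The filesystem refused the link (unsupported or not permitted).
            return False
        src_stat = os.stat(source)
        link_stat = os.stat(link)
        # Both names must refer to the same inode on the same device.
        return (src_stat.st_ino == link_stat.st_ino
                and src_stat.st_dev == link_stat.st_dev
                and src_stat.st_nlink == 2)
```

Calling `hardlinks_work()` with the mount point of the new volume should return True before any data is copied onto it. This only tests linking within one directory; it says nothing about linking between two different mounts.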


> On 26.03.2018 at 16:39, Benjamin Schmid wrote:
...
>> Thanks for your answers. Good to hear that you have experience (and
>> obviously not bad ones) with LVM. This is somewhat my favourite option
>> since it ensures scalability in the future.
>>
>> The size of the Dropbox is currently 4.7 TB, the ManagedRepository is 11
>> TB.
>>
>> After the expansion (and after creating a 2nd iSCSI LUN), I have basically
>> 2 partitions with 16 TB each, one is more or less full, the other one is
>> more or less empty.
>>
>> There are ca. 200 user folders.

Hmmmm.... if LVM doesn't provide you with the DropBox notifications
and the 2 partitions MUST be separate, then I'd probably try to
distribute _groups_ over the two separate mounts.

/mount1/group1a
/mount1/group1b
...
/mount2/group2a
/mount2/group2b

/OMERO/ManagedRepository/group1a -> /mount1/group1a
/OMERO/ManagedRepository/group2a -> /mount2/group2a
...

As long as you place the DropBox for the user importing into the given
group on the same mount, the hardlinks should succeed despite the
intermediate softlinks. If you have users in multiple groups, this
becomes much more complicated.
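
To make the layout above a bit more concrete, here is a rough Python sketch under stated assumptions: the function names are hypothetical, and the paths in the usage note are the placeholder ones from the listing. It creates one symlink per group under ManagedRepository and compares device numbers to see whether a DropBox directory ends up on the same filesystem as the group's real directory, which is the precondition for the hard links.

```python
import os


def link_group(managed_repo, group, mount):
    """Create managed_repo/group as a symlink to mount/group.

    Raises FileExistsError if managed_repo/group already exists as a
    real directory, so existing data is never silently replaced.
    """
    target = os.path.join(mount, group)
    os.makedirs(target, exist_ok=True)
    link = os.path.join(managed_repo, group)
    if not os.path.islink(link):
        os.symlink(target, link)
    return link


def same_filesystem(path_a, path_b):
    """True if both paths, after following symlinks, share a device number."""
    return os.stat(path_a).st_dev == os.stat(path_b).st_dev
```

For example, `link_group("/OMERO/ManagedRepository", "group1a", "/mount1")` followed by `same_filesystem()` on the user's DropBox directory and the new link. Matching device numbers are necessary but not a guarantee (bind mounts of one filesystem can still refuse a link), so an actual test import is the final word.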

~J.



>> Thanks a lot,
>> Bene
>>
>> On 26.03.2018 at 16:22, Josh Moore wrote:
>>>
>>> On Mon, Mar 26, 2018 at 10:12 AM, Benjamin Schmid
>>> <benjamin.schmid at fau.de> wrote:
>>>>
>>>> Dear all,
>>>
>>> Hi Bene,
>>>
>>>
>>>> Sorry for the lengthy mail, and thanks to those who read it fully to the
>>>> end ;)
>>>
>>> Exposition in emails is encouraged.
>>>
>>>
>>>> So far, we have been running OMERO (and the Dropbox) on an Ubuntu server
>>>> with an attached Thecus storage array (N16000pro) with 16 TB storage space.
>>>> The Thecus machine is connected to the server via iSCSI and hosts (amongst
>>>> others) the users' home and OMERO Dropbox folders. These are shared via
>>>> Samba to the microscope computers.
>>>>
>>>> Because storage was filling up, we expanded the RAID volume on the Thecus
>>>> machine. Afterwards, I also wanted to expand the iSCSI LUN. That's where
>>>> trouble started because I realized that the maximum LUN size on the Thecus
>>>> system is 16 TB. I can create another LUN, but this will basically end up
>>>> as a second partition on the server. My question is now whether OMERO can
>>>> use multiple data directories, or if there is another solution to this
>>>> problem.
>>>
>>> In general, yes, OMERO can use multiple directories but there are, as
>>> always, caveats. Regarding other technical, non-OMERO solutions, I'll
>>> defer to the community.
>>>
>>>
>>>
>>>> What I thought about so far:
>>>>
>>>> * Put some of the users (the ones that occupy most storage space) on the
>>>> 2nd partition
>>>> Create symbolic links in both the Dropbox and ManagedRepository folders
>>>> that point to the respective folders for these users on the 2nd partition:
>>>> ManagedRepository/user1 -> /partition2/ManagedRepository/user1
>>>> DropBox/user1 -> /partition2/DropBox/user1
>>>> However, it seems OMERO Dropbox does not follow symbolic links (is there
>>>> any way to make this work?)
>>>
>>> DropBox needs to have notifications of what's going on. So if DropBox
>>> is watching /old_storage and the notification comes in on
>>> /new_storage, then yes, DropBox won't be aware of it. One option would
>>> be to run multiple DropBox servers. Another would be to configure
>>> DropBox for the new location on a per-user basis, which is
>>> theoretically doable but neither well-tested nor sysadmin-friendly.
>>>
>>>
>>>> * Leaving the DropBox in the primary OMERO data folder, and only moving
>>>> the ManagedRepository folder of some users to the 2nd partition
>>>> Does not work because importing from the DropBox is done via hard links
>>>> (which I very much appreciate), and hard links cannot cross file system
>>>> boundaries.
>>>
>>> Understood. This is valuable feedback.
>>>
>>>
>>>> * mhddfs (https://romanrm.net/mhddfs)
>>>> I found this small Linux tool that basically joins several filesystems
>>>> into a (virtual) large partition. However, it seems to have some impact on
>>>> performance and, more seriously, also does not create correct hard links
>>>> when importing via OMERO DropBox.
>>>
>>> This is new for me.
>>>
>>>
>>>> * LVM
>>>> This could also be used for combining several partitions into a single big
>>>> one. The advantage over mhddfs is that it's integrated in the Linux kernel.
>>>> The disadvantage is that the partitions used need to be formatted for LVM,
>>>> so unlike mhddfs, it doesn't work with existing partitions (with existing
>>>> data). I could however initialize the new partition with LVM, copy data
>>>> from the existing LUN onto it, free the first partition and then add it to
>>>> the LVM-managed volume. Downside: data copying will take a lot of time, and
>>>> I have no experience with LVM; in particular, I do not know whether hard
>>>> linking will work properly. Also, I have no idea how LVM would impact
>>>> performance. Maybe somebody can provide some information about this.
>>>
>>> Perhaps someone will chime in with the specifics of LVM in your
>>> scenario, but we *do* use LVM on most if not all OME team-managed
>>> systems. I haven't seen any hard-linking issues with LVM.
>>>
>>>
>>>> * Going away from iSCSI, instead share the entire RAID volume via NFS,
>>>> which is then mounted on the server (and re-shared via Samba to the
>>>> microscope computers)
>>>> However, I read a couple of times that re-sharing an NFS mount via Samba
>>>> causes trouble and is not recommended. Can anybody confirm this?
>>>
>>> Without knowing more, I'd be concerned that you wouldn't get the
>>> DropBox notifications that you need.
>>>
>>>
>>>> * Giving up on the hard-linking import and making users delete their data
>>>> in their DropBox folders once it's imported.
>>>> Not really nice.
>>>
>>> Have you looked at any of the "move" strategies? Do you have something
>>> internally that would work like `rsync --remove-source-files` that you
>>> would trust?
>>>
>>>
>>>> * Giving up OMERO Dropbox and making users use OMERO.insight to import the
>>>> acquired data.
>>>> Not really nice.
>>>>
>>>>
>>>> Has anybody had a similar problem in the past?
>>>> What is the preferred way to solve this?
>>>> Have I overlooked anything obvious to make this work?
>>>> I'm not really happy with any of the things I outlined above.
>>>
>>> If you don't mind, could you share with us approximate sizes for
>>> ManagedRepository and DropBox (and other large directories) that you
>>> are looking to re-arrange? That along with how many users and
>>> used/free sizes of your various mount points might help to suggest
>>> something.
>>>
>>> All the best,
>>> ~Josh.
>>>
>>>
>>>
>>>> Thank you very much in advance,
>>>> Benjamin
>>>
>>> _______________________________________________
>>> ome-users mailing list
>>> ome-users at lists.openmicroscopy.org.uk
>>> http://lists.openmicroscopy.org.uk/mailman/listinfo/ome-users
>>
>>
>
> --
> Optical Imaging Centre Erlangen
>
> Hartmannstr. 14
> 91052 Erlangen, Germany
>
> http://www.oice.erlangen.de

