Histochemistry and Cell Biology
1 3
Discussion
Over the last twenty years, the number of bioimaging file
formats has been a constant source of confusion and frustra-
tion. While that has often been a struggle that each user man-
ages in isolation, increasingly data sizes from more sophis-
ticated hardware and more advanced modalities are leaving
users with significant infrastructure burdens for efficiently
converting and sharing their imaging data. Data acquisi-
tion systems often design formats specifically for writing
data quickly to timely capture the scale and the breadth of
modern experiments. The tension between the requirements
of quickly writing and quickly reading bioimaging datasets
force both data providers and consumers to be aware of the
costs of converting, relinking, downsampling, or otherwise
modifying datasets for reuse. A one-time conversion of such
“write-optimized” data can lower the overhead of repeated
analysis and visualization of the data, but requires a widely
adopted target format. With proper support, a small suite of
storage file formats like HDF5, TIFF and Zarr can cover the
essential use cases for optimally achieving the community’s
scientific goals as has been achieved by other projects [e.g.,
PDB (Berman etal. 2012) and NetCDF (Unidata Ltd 1973)].
The strategy outlined in this paper is to encourage com-
munity cooperation towards a common representation.
Increased focus from the community of developers acceler-
ates the features delivered to the user community. Increases
in the expressive power of the format through specifications,
the number and ability of available tools, and data publicly
available for end users all motivate further developments in
each of the other areas. In turn, this progress drives the abil-
ity of the bioimaging community to better enact the FAIR
principles. The growth of OME-Zarr tools, resources, and
specifications, however, should not be taken as a reason
to wait on adoption. The opposite is true. The hope is that
more users and more developers will drive further growth,
further unifying the bioimaging ecosystem. Users should
identify whether or not the advancements detailed here will
simplify and accelerate their scientific practice, and if so, are
encouraged to start using OME-Zarr today. The community
is growing and membership is open, free and encouraged.
User feedback is critical to help make the most FAIR repre-
sentation of bioimaging data possible.
Author contributions Conceptualization: JM. Software: DB-L, SB, JB,
JB, EMB, J-MB, GdM, DG, SSG, IG, YOH, MH, DH, NH, MSK, GK,
AKY, KK, ML, TL, PL, DL, ML, JL, JM-S, TM, LM, MM, KM, JM,
WM, WO, BÖ, GP, CP, LP, TP, SP, NR, StS, SaS, NS, HS, ACS, DRS,
JS, CT, DT, IV, AMW, EW, KAY. Resources: SB, MH, KK, SO, MP.
Data curation: AKY, FW. Visualization: EED, SSG, GK, AlTdlV, JL,
MM, JM, WM, MP, CT, DT, AMW. Writing—original draft prepara-
tion: DB-l, SB, JB, EMB, J-MB, GdM, EED, SSG, IG, YOH, MH, DH,
MSK, GK, AKY, TL, PL, JL, JM-S, TM, MM, JM, BÖ, CP, LP, TP,
SP, NR, CT, DT, PW, AMW, FW, KAY. Writing—review and editing:
J-MB, XCM, BAC, YOH, NH, MSK, TM, JM, WM, NN, SP, NR, StS,
JS, JRS, EW. Supervision: OB, BAC, NG, MH, NH, SO, LAR, StS,
OS, JRS, FJT.
Funding J.M. was supported by Chan Zuckerberg Initiative DAF
for work on OME-NGFF by grant numbers 2019-207272 and 2022-
310144 and on Zarr by grant numbers 2019-207338 and 2021-237467.
S.S.G. and Y.O.H were supported by US National Institutes of Health
BRAIN Initiative award R24MH117295. The development of the Bio-
Image Archive has been supported by European Molecular Biology
Laboratory member states, Wellcome Trust grant 212962/Z/18/Z and
UKRI-BBSRC grant BB/R015384/1. The EMBL-EBI IT infrastructure
supporting the IDR and the BioImage Archive is funded by the UK
Research and Innovation Strategic Priorities Fund. M.K. was supported
by NHGRI 5T32HG002295. J.L was supported by grant 2022-252401
of the Chan Zuckerberg Initiative DAF, an advised fund of Silicon
Valley Community Foundation. T.M. was supported by the National
Science Foundation Graduate Research Fellowship under Grant No.
(DGE1745303). M.M. was supported by the US BRAIN Initiative
National Institutes of Health under award number 1RF1MH126732-
01. B.Ö. was supported by the EOSC Future project grant agreement
number: 101017536. S.O. was supported by JST NBDC Grant Number
JPMJND2201 and JST CREST Grant Number JPMJCR1926. N.A.H.,
N.J.S., S.B.S. and H.S. were funded via NCATS intramural research
fund. G.P. is supported by the Helmholtz Association under the joint
research school Munich School for Data Science and by the Joachim
Herz Foundation. L.M. is supported by the EMBL International PhD
Programme. M.L., J.Br., A.C.S. and L.A.R. were supported by the
Chan Zuckerberg Biohub San Francisco. N.N. was supported by Vin-
nova, grant number 2020-04702. T.P. was supported by grant number
2021-237557 from the Chan Zuckerberg Initiative DAF, an advised
fund of Silicon Valley Community Foundation. C.T. was funded by
grant number 2020-225265 from the Chan Zuckerberg Initiative DAF,
an advised fund of Silicon Valley Community Foundation. A.M.W was
supported by the US BRAIN Initiative National Institutes of Health
under award R24MH114793 and the Chan Zuckerberg Initiative for the
Brain Image Library Data Viewer Plugin Enhancement award 2022-
309651K.A.Y. was supported by the Open Research Data Program of
the ETH Board. Wellcome (Senior Clinical Research Fellowship, Well-
come Science Strategic Award) Work on OME-NGFF and IDR was
supported by the Wellcome Trust (ref. 212962/Z/18/Z), BBSRC (ref.
BB/R015384/1) and the National Institutes of Health Common Fund
4D Nucleome Program grant UM1HG011593. E.W. was supported by
Calico Life Sciences LLC; B.A.C was funded by NIH P41 GM135019,
and grant number 2020-225720 from the Chan Zuckerberg Initiative
DAF, an advised fund of Silicon Valley Community Foundation. O.W.
was supported by the SciLifeLab & Wallenberg Data Driven Life Sci-
ence Program (grant: KAW 2020.0239). N.G. was supported by NIH
OT2OD033758 and NIH R33CA263666.
Data availability All data shown in the figures is available publicly
under permissive licenses.
Code availability All code described is available publicly under free
and open-source licenses.
Declarations
Conflict of interest S.B., E.D., M.L., D.R.S. and J.R.S. are affiliated
with Glencoe Software, a commercial company that builds, deliv-
ers, supports and integrates image data management systems across
academic, biotech and pharmaceutical industries; J.M. and W.M. also
hold equity in Glencoe Software. M.M. is affiliated with Kitware, Inc.,
a commercial company built around open-source platforms that pro-
vides advanced technical computing, state-of-the-art AI, and tailored