FAIR Publishing of large scale datasets

Providing scalable and sustainable solutions for scientific data management and publications in large consortia

The Pennsieve Platform is used as the primary data management solution for several large scale NIH funded programs including the NIH SPARC program, NIH HEAL RE-JOIN, and NIH HEAL PRECISION. Over 80 research groups from around the world are using the Pennsieve platform to submit, curate and publish their datasets as part of these efforts.

The Pennsieve platform provides advanced mechanisms to support the generation of high quality scientific public and private datasets. This enables scores of scientist to reuse, and interrogate scientific findings as the published data is optimized for downstream analysis and visualization. Pennsieve works closely with the NIH and other consortia to ensure that data remains available at scale in a sustainable way and that data adheres to all required and recommended guidelines for FAIR sharing of data.

At this time, over 300 datasets have been made publicly available through the Pennsieve Platform ranging from smaller datasets (<10GB) to larger datasets (>5TB) which contain significantly complex data-structures and meta-data graphs.

For more information and to browse public datasets, visit: https://sparc.science