Echodataflow: Recipe-based Fisheries Acoustics Workflow Orchestration
-
2024
-
Details
-
Journal Title: Proceedings of the Python in Science Conference
-
Description: With the influx of large volumes of data from multiple instruments and experiments, scientists are wrangling complex data pipelines that are context-dependent and non-reproducible. We demonstrate how we leverage Prefect (Prefect, 2024), a modern orchestration framework, to facilitate fisheries acoustics data processing. We built a Python package, Echodataflow (Echodataflow, 2024), which 1) allows users to specify workflows and their parameters by editing text “recipes” that provide transparency and reproducibility of the pipelines; 2) supports scaling of the workflows while abstracting the computational infrastructure; and 3) provides monitoring and logging of workflow progress. Under the hood, Echodataflow uses Prefect to execute the workflows while providing a domain-friendly interface to facilitate diverse fisheries acoustics use cases. We demonstrate the features through a typical ship survey data processing pipeline.
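The recipe idea in the description can be illustrated with a minimal, dependency-free sketch. It is not Echodataflow's or Prefect's API: the recipe layout, the stage names (`open_raw`, `compute_sv`, `bin_average`), their parameters, and the file name are all hypothetical placeholders, and each stage only tags a string instead of processing acoustic data. The sketch shows only the general pattern of a workflow declared as data, executed stage by stage, with each step logged.

```python
import logging
from typing import Any, Callable, Dict, List

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("recipe_sketch")

# Hypothetical recipe. In a real setup this would live in a user-edited
# text file; it is inlined as a dict here to keep the sketch self-contained.
RECIPE: Dict[str, Any] = {
    "name": "ship_survey_sketch",
    "stages": [
        {"name": "open_raw", "params": {"sonar_model": "EK60"}},
        {"name": "compute_sv", "params": {}},
        {"name": "bin_average", "params": {"range_bin_m": 20}},
    ],
}

# Registry mapping stage names used in the recipe to Python callables.
STAGES: Dict[str, Callable[..., List[str]]] = {}


def stage(func: Callable[..., List[str]]) -> Callable[..., List[str]]:
    """Register a function under its own name so recipes can refer to it."""
    STAGES[func.__name__] = func
    return func


# Placeholder stages: each one appends a tag to every item to record
# that it ran. No acoustic processing happens here.
@stage
def open_raw(files: List[str], sonar_model: str) -> List[str]:
    return [f"{f}|raw:{sonar_model}" for f in files]


@stage
def compute_sv(files: List[str]) -> List[str]:
    return [f"{f}|Sv" for f in files]


@stage
def bin_average(files: List[str], range_bin_m: int) -> List[str]:
    return [f"{f}|binned:{range_bin_m}m" for f in files]


def run_recipe(recipe: Dict[str, Any], files: List[str]) -> List[str]:
    """Run the recipe's stages in order, logging each step."""
    data = list(files)
    for step in recipe["stages"]:
        func = STAGES[step["name"]]
        log.info("stage=%s params=%s", step["name"], step["params"])
        data = func(data, **step["params"])
    return data


result = run_recipe(RECIPE, ["survey_001.raw"])
print(result)
```

Because the pipeline is described by data, not code, changing a parameter or reordering stages is a text edit that can be versioned and shared. An orchestration framework would add scheduling, retries, and distributed execution on top of this pattern.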
-
Source: Proceedings of the Python in Science Conference (2024)
-
ISSN: 2575-9752
-
Rights Information: CC BY
-
Compliance: Submitted
-
Main Document Checksum:urn:sha-512:1939de6037e4b6f54725d9dbd1c616dca33f7521b0497470d51284611190fe8742213ca035093e1f76afcb1e79fd4185de6a37bedf9d1bc3259eef3c3a84519f