HARVESTER
Contents
Data products
Data flow
Sky subtraction
The ARRAKIHS Harvester is the dedicated data reduction pipeline of the ARRAKIHS mission, designed specifically to preserve and accurately recover extremely low surface brightness (LSB) signals that are typically lost in standard astronomical processing. LSB science requires exceptional control of sky modelling, background subtraction, and instrumental systematics. Even small residual errors can remove or distort the faint structures that trace galaxy formation, accretion processes, and diffuse stellar halos. The Harvester is therefore optimised to maintain these faint signals throughout the full processing chain.
Data Products
The Harvester processes the full ARRAKIHS data flow, from calibrated single exposures to final science products:
- Level1 (L1): Calibrated single exposures (input, similar format to ATREIDS outputs)
- Level2 (L2): Cleaned and calibrated science frames
- Level3 (L3): Deep co-added images per target
- Level4 (L4): Scientific catalogs and derived products
Starting from L1, the pipeline generates L2 products through a full calibration sequence including bad pixel masking, bias and dark subtraction, flat-fielding, non-linearity corrections, cosmic ray and satellite trail removal, astrometric refinement, and photometric calibration. A key aspect of the processing is the treatment of extended point spread functions (PSF) and optical artefacts such as internal reflections (ghosts), which are modelled and corrected to avoid contamination of faint diffuse emission. L3 products are built by co-adding multiple L2 exposures of the same target, with different stacking strategies available (e.g. PSF-matched or optimised for low surface brightness recovery). Finally, L4 products consist of scientific catalogues and derived measurements.
Besides the data products derived by Harvester, the AMC will produce and additional low surface brightness optimized images as high level data products (Level 5).
Sky subtraction for LSB science.
A central goal of the Harvester is the robust recovery of extremely faint, extended emission. This requires a careful separation between instrumental background and true astrophysical signal.
Sky modelling and subtraction are therefore implemented with particular care to avoid over-subtraction, which can artificially remove diffuse structures such as galaxy halos or tidal streams. The pipeline is designed to minimise these effects by combining instrument modelling, masking strategies, and large-scale background estimation techniques.
The resulting processing strategy is continuously validated using simulations and representative observational datasets, ensuring consistency and stability across different observing conditions.
Software framework
The Harvester is developed as open-source software under the GNU GPL v3 license and will be released alongside ARRAKIHS data products. It follows a modular and reproducible workflow design, allowing individual stages to be executed independently or as part of full end-to-end processing. The pipeline is built on established tools such as Gnuastro and the Maneage environment system, ensuring long-term reproducibility and portability.
