SITCOMTN-128
Unrecognized Blends in Operations Rehearsal 3#
Abstract
Unrecognized blends are a class of blended objects that are mistakenly identified as a single object. We can identify such objects using higher resolution imaging from a space based telescope that is not affected by ground based seeing or in the Operations Rehearsal 3, use the simulation truth catalog. Using RA and DEC we can match between the two catalogs to label observed objects as isolated objects, recognized blends, or unrecognized blends. This allows us to investigate the fraction of unrecognied blends present and how different factors influence that rate. We have performed a variety of tests into unrecognized blends on the simulated data which can act as a benchmark for upcoming commissioning where we will have to use a space based catalog to match.
Data#
The third Operations Rehearsal (OR3) simulated 3 nights of observation that were then processed by the science pipelines.
The simulations are built on top of a set of DC2 patches which were then processed to mimic realistic observational conditions.
We use both the nightly catalog and the cumulative co-add catalog.
The catalog can be accessed using the /repo/embargo
repo and found in the LSSTComCamSim/runs/intermittentcumulativeDRP/20240402_03_04/d_2024_03_29/DM-43865
collection.
The observed catalog goes through a detection and deblending step which can deal with recognized blends.
In the science pipeline, the process of deblending produces children objects from a given parent object with both existing in the catalog.
In order to remove duplicates from the catalog we work only with the deblended children and not parents along with two other quality control flags — one to ensure that an object is not a sky object and another to ensure that the object is in the inner part of an image.
All three flags are combined into the detect_isPrimary
flag which we apply to the observed catalog.
In the observed catalog, a loose proxy for star-galaxy classification can be done via the extendedness
flag.
In this technote we will use “observed galaxy” and “extended object” interchangibly.
To label detected objects as isolated galaxies or unrecognized blends we require a higher resolution catalog which corresponds to space based data for actual operations and input truth for simulations.
The truth catalogs for this run include galaxies, stars, and solar system objects of which we only consider the galaxies and stars.
The input truth catalog for solar system objects does not include an i-magnitude and while they will cause blending, difference imaging is likely to offer much better mitigation of these sources than deblending.
The simulated images are built on top of a set of DC2 patches with input catalogs in /sdf/data/rubin/shared/ops-rehearsals/ops-rehearsal-3/imSim_catalogs
.
Matching#
Observed and truth catalogs in hand, we can start to label objects in the observed catalog as isolated objects (pure), recognized blends, or unrecognized blends by matching between the two catalogs. After aligning astrometry, we can use position information (RA and DEC) to match objects between the two catalogs. If we query around the ground based RA and DEC using a small radius we would expect to get a single match in the truth catalog for a pure galaxy whereas we would expect to see multiple for a blended object. To differentiate between recognized and unrecognized blends, we would also need to query a small radius in the ground catalog with the recognized blends having an equal number of objects to that in the truth query while unrecognized blends will have fewer. Formally, let \(N_g\) be the number of ground catalog objects in some radius \(r\) centered on the position of a ground object (\(N_g > 1\) always), \(N_s\) be the number of space catalog objects in some radius \(r\) centered on the same position. If \(N_s > N_g\) we have a candidate unrecognized blend whereas the converse \(N_s \leq N_g\) can be a pure galaxy, recognized blend, or spurious detection.
Once we have a set of candidate blends, we can apply magnitude cuts on these to isolate to the blends we anticipate to be problematic. For each candidate, we require that the set of truth objects pass 2 cuts:
The difference between a truth object and the brightest in the set in band X should be small \(\Delta_X < m_\Delta\)
The truth objects should not be too dim in band X \(m_X > m_b\)
Once applying the cuts on the truth catalog we recount the number of objects in the sets and promote any surviving candidate unrecognized blends to candidate blends. In summary we have the following process:
- To generate a candidate blend
Querying a radius \(r\) around the ground RA and DEC in the ground catalog gives \(N_g\) ground objects
Querying a radius \(r\) around the ground RA and DEC in the space catalog gives \(N_s\) space objects
If \(N_s > N_g\) we have a candidate unrecognized blend
If \(N_s \leq N_g\) we have a recognized blend or a spurious detection.
- Promoting a candidate unrecognized blend requires the truth objects to pass 2 cuts:
\(\Delta_X < m_\Delta\)
\(m_X > m_b\)
It is important to note that this is not the only option to match for unrecognized blends. More complicated schemes include Shuang’s ellipse matching algorithm and Manon’s friendly
producing a blend entropy for each object.
friendly
is being implemented into the science pipelines but for comparison we present some results when used on DC2 simulations on NERSC.
One benefit to our naive matching scheme is that it allows us to use KDtrees which are fast and efficient.
Unrecognized Blends#
Using the kd-tree matching algorithm we can label unrecognized blends in the ground catalog. Our fiducial settings have \(r = 1''\), \(m_\Delta = 2.5\), and \(m_b = 28\). We then investigate how unrecognized blends correlate with various parameters such as object i-magnitude, shape parameters, and local density. During commissioning we can repeat this process on real data allowing for more robust tests of the science pipeline.
Magnitude Dependence#
We expect that blending will increase at the higher magnitudes as dimmer objects are easier to avoid detection and there are more galaxies. Other studies of unrecognized have found a similar trend which we show below.
The fraction of unrecognized blends as a function of the observed i-mag is shown below including the results using friendly
on DC2.
If we restrict to only detected galaxies we se a slight increase
Shape Parameters#
Accurate shape measurements is necessary for weak lensing studies and it is expected that unrecognized blends will greatly impact shear. The inverse question, if certain shapes will impact unrecognized blends is not as well studied. It is possible that there would be a bias due to the orientation of the pixel grid which we investigate below.
We look at the second moment, \(Q_{ij}\), of extended objects. We combine the second moments via
We create \(Q_{rr} = \sqrt{Q_{xx}^2 + Q_{yy}^2}\).
Given that there is little to no difference among the shape parameters, this gives good confidence that the pixel grid is not impacting shape measurements and unrecognized blends in strange ways. The wing structure is not necessarily cause for concern but it is interesting that objects with larger shapes tend to be unrecognized blends. The bins at the edge of the torus have \(N\approx O(10)\) so a larger sample is required for any definitive claim.
Local Density#
Finally, we know clusters and other dense fields (like the deep fields) are expected to be extremely blended motivating looking into how local density affects unrecognized blend fraction.
To estimate the local density, \(\sum(r_i)\), we use Equation 7 from Darvish et al.
Where \(d_{ij}\) is the distance between object \(i\) and \(j\). When querying for neighbors, we can either look at the object catalog when testing the pipeline or the truth catalog when testing for science. There will likely be some underlying science that can be extracted by using the truth catalog density but we limit our focus to the detected catalog to test the pipeline.
The distribution of density and the relationship with unrecognized blends are shown below
As expected, the fraction of unrecognized blends monotonically increases with true density however the observed density flat-lines.
Note
Removed the heatmaps section since I’m not sure what the actual take away is…
Conclusion#
We have created a set of tools that enable us to match between catalogs to label unrecognized blends and investigate how the rate of unrecognized blends vary with object properties. This technote has the ideal case using simulated data along with true input catalogs which gives a good goalpost for commissioning data. During commissioning and observation we intend to re-do this analysis using space based data which will enable future studies on unrecognized blends and how to mitigate them.