-
Notifications
You must be signed in to change notification settings - Fork 0
/
dm.tex
158 lines (123 loc) · 12.7 KB
/
dm.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
% Audience Ed Ajhar and Kathy Turner, Helmut Meineke, Matt Mountain
% Start each section with a 2-3 line intro to what it is
% dCould drop ORR and Obj subsections
% Sentence ... prompt processing is .. see sec X in doc Y
%% PP/DRP consistes of the following functional elements - complete when these are complete
\section{Verification of Data Processing, Products and User Services} \label{sec:dm}
The Data Management System provides the functionality necessary to process the raw image data into usable data products, and to make those data products accessible to the Rubin scientific community.
\subsection{Operations Readiness Requirement}
The project team shall demonstrate that the integrated LSST Data Management Subsystem has met its technical specifications as enumerated in the Data Management System Requirements \citeds{LSE-61}, specifically those designated as 1a and 1b priority.
\subsection{Objectives}
The objective of this operational requirement is to ensure that the integrated as-delivered Data Management System (DMS), including all supporting infrastructure, has been verified against its requirements.
The top-level requirements for the DMS are given in the Data Management System Requirements \citeds{LSE-61}, and are derived from the Observatory System Specifications (OSS), \citeds{LSE-30}, which in turn are derived from the LSST System Requirements (LSR), \citeds{LSE-29} and the Science Requirements Document (SRD) \citeds{LPM-17}.
The DMSR is complemented by the Data Product Definition Document (DPDD), \citeds{LSE-163}, which describes the data products to be delivered by the Large Synoptic Survey Telescope (LSST).
\subsubsection{Approach to verification and validation}\label{sec:dm-approach}
The approach to verification and validation adopted by the LSST Data Management Subsystem is described in detail in the DM Test Plan (\citeds{LDM-503}), which provides a series of high-level milestones and the accompanying the test schedule.
Broadly, this approach consists of three aspects:
\begin{enumerate}
\item Verification that the Data Management system as delivered meets the requirements placed upon it;
\item Validation that the system as delivered meets the needs of the scientific community;
\item Rehearsing the sustained operation of the system in operational scenarios.
\end{enumerate}
The approach to verifying each individual requirement is described in the DM Acceptance Test Specification, (\citeds{LDM-639}), which provides the dedicated test specifications.
Prior to start of commissioning and Operations, the data processing system will be verified to the extent possible using precursor data.
Final verification and construction completeness will be determined with data obtained during the commissioning phase of the project and in collaboration with the commissioning team, \ref{sec:svs}.
Functional verification will be achieved through a series of operations rehearsals and data challenges.
All requirements in the DMSR have been prioritized as follows:
\begin{enumerate}
\item "This must be done to enter commissioning (a) or Operations (b); no waivers will be granted if not met."
\begin{itemize}
\item 1a: Must be demonstrated to be working before the start of the commissioning period.
\item 1b: Must be demonstrated to be working before the start of the observing.
\end{itemize}
\item ``Should be done to enter Operations; but waiver likely to be granted if not met,'' i.e., we could enter Operations without this fulfilled, for first 3 years.
\item ``Overall capability/efficiency/ease of use/etc., may be reduced but science will not critically suffer if not done.'' Could enter Operations without this requirement fulfilled, and have the soperations team decide whether they want to pursue it.
\end{enumerate}
The verification status of requirements in the DMSR will be reported at each of the Construction Closeout Reviews.
Most priority 1a requirements are expected to be verified by CCR1, coinciding with the start of on-sky commissioning with ComCam. Those that are not will be verified by the end of ComCam on-sky commissioning and ready for the start of on-sky commissioning with LSSTCam.
Priority 1b requirements are expected to be fully verified by CCR3 to demonstrate readiness for the handover to Operations.
Most priority 2 requirements are expected to be verified during the course of on-sky commissioning and early operations. For those that are not, a waiver will be sought to enter Operations and they will be completed within the first three years of Operations.
\subsection{Criteria for Completeness} \label{sec:dm-completeness}
The DM system will be considered successfully complete when all high-level requirements in the DMSR have been verified.
At a minimum, all priority 1 requirements must be verified at the end of construction.
The DM Verification Control Document, \citeds{LDM-692}, provides an overview of the verification status of the Data Management Subsystem with respect to its requirements.
\subsection{Pre-Operations Interaction}
None, unless there are non-compliance issues against the CCR requirements and specifications.
\subsection{Artifacts for Completion} \label{sec:dm-artifacts}
The following artifacts will be provided:
\begin{itemize}
\item A verification matrix containing entries for all DMSR requirements (\citeds{LSE-61}) and specifications (\citeds{LDM-639}). Methods, inspections, demonstration, analysis or test, shall be identified for every DMSR requirement. This verification matrix is provided by the DM Verification Control Document (\citeds{LDM-692});
\item Final compliance status, including all non-compliance reports;
\item All Data Management test plans and reports for all test campaigns;
\item A Performance characterization report;
\item System documentation and code repositories;
\item Drafts of all construction papers.
\end{itemize}
%%%% Prompt Processing %%%%%%
% -prompt needs nw with a given latency - need moderate throughput, high reliabilty (1 visit at a time)
\subsection{Prompt Processing}
% {\it Responsible: Eric Bellm (and Leanne Guy)}
% Was in ORC -- move about
The Project shall demonstrate the Prompt (Alert) Processing meets its requirements as defined in the DMSR (\citeds{LSE-61}) and the DPDD (\citeds{LSE-163}). In particular the Prompt (Alert) Processing shall demonstrate its technical ability to meet the 60--second latency requirement for the transfer of data, processing difference images, and publishing detect sources from the Difference Imaging Analysis (DIA).
Additionally, we shall demonstrate that nightly Solar System Processing (SSP) meets the DMSR requirements for identification of Solar System Objects.
\subsubsection{Objectives}
The objective of this Operational Requirement is to ensure that the Prompt Processing pipelines have been verified against requirements and produce the Prompt data products necessary for LSST Transient, Variable, and Solar System science, and to enable rapid follow-up of time domain events.
Demonstration of an integrated LSST system for Prompt Processing must include, at some level, testing interfaces to the Minor Planet Center (MPC) for Solar System Data products and with Community Brokers (\citeds{LDM-612}) for Alerts.
%% All of this is details of the verification process that should be captured in the requirements/test spec - or in the criteria
%Prompt Processing requires templates images to enable Difference Image Analysis.
%During normal survey operations, templates will be produced as part of Data Release Processing.
%During commissioning and early operations, however, templates can only produced incrementally through separate execution of the Template Generation Payload as data of sufficient quality is taken in various areas of the sky.
Given the dependence of Prompt Processing on the availability of templates, validating DM's template generation capability is an important objective for Operations Readiness.
Where and when templates are available, we expect Prompt Processing to proceed normally.
%The Prompt Products Database should be populated and alerts generated.
%In the alert packets, there would be less than 12 months of previous DIASource records available, and, as there will be no available DR in commissioning, providing matching Object IDs would depend on what DRP data products were available.
We expect to provide a machine-learned spuriousness classifier for \DIASources.
Good performance of such classifiers requires a large sample of labeled data representative of the entire survey, which may not be available prior to routine survey operations.
Accordingly, initial validation of the spuriousness classifier and a plan for incremental retraining in operations is sufficient for operational readiness.
We will run Solar System Processing in commissioning to validate the solar system products pipelines, generate some solar system data products, and test the interfaces with the MPC.
We should be able to attribute Solar System objects known from other surveys and previously catalogued by the MPC with single-apparition LSST \DIASources.
Once the astrometry is sufficiently good (for asteroids, $\sim0.05--0.1^{\prime\prime}$), we can start regularly submitting to the MPC and testing the linking software.
It should be clear, that at least in early commissioning, alert distribution and submission to the MPC will be with substantial latency with respect to the SRD operations-era latencies.
Similarly, OSS completeness and purity metrics for both transients and solar system objects may not be achievable prior to the availability of DR1 templates.
%%
\subsubsection{Criteria for Completeness}
The criteria for completeness are described in \ref{sec:dm-completeness}.
\subsubsection{Pre-Operations Interactions}
Validation and operations readiness will be assessed via the operations rehearsals and the DPs.
Distribution of DPs by the early operations teams
Results will be made available to the community - early operations team
Through the planned data previews
\subsubsection{Artifacts for Completion}
The high-level artifacts for completion of the Prompt Processing pipelines are detailed in \ref{sec:dm-artifacts}.
%%%%%%%%%% DRP %%%%%%%%%%%%%%%%%
\subsection{Data Release Processing}
% {\it Responsible: Jim Bosch (and Leanne Guy)}
\subsubsection{Objectives}
The objective of this Operational Requirement is to ensure that the Data Release Processing pipelines have been verified against requirements and produce the Data Release data products necessary for LSST science.
Data Release Production involves not just the image processing pipeline, which is the component most visible to scientists, but integration with and usage of many other DM deliverables as well, including:
\begin{itemize}
\item data access middleware that archives and organizes both raw data from the observatory and processing outputs;
\item process control middleware that provides a harness for running the pipelines at scale;
\item systems for transferring processing outputs to components of the Rubin Science Platform (RSP) for user access, including database ingest;
\item hardware, operators, and other production services.
\end{itemize}
\subsubsection{Criteria for Completeness}
The criteria for completeness are described in \ref{sec:dm-completeness}.
The project team shall process the data from the one (or more) of the Science Validation Surveys to produce a Data Preview and make it available to the Commissioning Team through the Rubin Science Platform as well as a subset for the EPO Public User Interface.
\subsubsection{Pre-Operations Interactions}
None, unless there are non-compliance issues against the DMSR requirements and specifications.
\subsubsection{Artifacts for Completion}
The high-level artifacts for completion of the DRP pipelines are detailed in \ref{sec:dm-artifacts}.
%%%%%% RSP %%%%%%%%%%%
\subsection{Rubin Science Platform}
% {\it Responsible: Gregory Dubois-Felsmann (and Leanne Guy)}
\subsubsection{Objectives}
The objectives of this Operational Requirement are to ensure that the Rubin Science Platform (RSP) has been verified against requirements, and that the LSST science community can access, visualize, interact with, and analyze LSST data products.
The high-level vision of the Rubin Science Platform describing an integrated platform of three distinct aspects is described in \citeds{LSE-319}
\subsubsection{Criteria for Completeness}
The high-level criteria for completeness are detailed in \ref{sec:dm-completeness}.
Specifically for the RSP, this means that the scientific community can retrieve the Rubin data products with a reasonable latency. The RSP will not be complete at the stage of commissioning.
\subsubsection{Pre-Operations Interactions}
None, unless there are non-compliance issues against the DMSR requirements and specifications.
\subsubsection{Artifacts for Completion}
The high-level artifacts for completion of the RSP are detailed in \ref{sec:dm-artifacts}.