-
Notifications
You must be signed in to change notification settings - Fork 1
/
coherency.tex
executable file
·1481 lines (1316 loc) · 77.7 KB
/
coherency.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
%% LyX 2.0.5 created this file. For more info, see http://www.lyx.org/.
%% Do not edit unless you really know what you are doing.
\documentclass[a4paper,useAMS,usenatbib]{mn2e}
% \usepackage[T1]{fontenc}
% \usepackage[latin9]{inputenc}
\setcounter{secnumdepth}{3}
\setcounter{tocdepth}{3}
% \usepackage{textcomp}
\usepackage{amsmath}
\usepackage{amssymb}
\usepackage{graphicx}
\usepackage{esint}
\usepackage[authoryear]{natbib}
%\inputencoding{utf8}
\PassOptionsToPackage{caption=false}{subfig}
\usepackage{subfig}
\makeatletter
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% LyX specific LaTeX commands.
\pdfpageheight\paperheight
\pdfpagewidth\paperwidth
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% User specified LaTeX commands.
% mn2esample.tex
%
% v2.1 released 22nd May 2002 (G. Hutton)
%
% The mnsample.tex file has been amended to highlight
% the proper use of LaTeX2e code with the class file
% and using natbib cross-referencing. These changes
% do not reflect the original paper by A. V. Raveendran.
%
% Previous versions of this sample document were
% compatible with the LaTeX 2.09 style file mn.sty
% v1.2 released 5th September 1994 (M. Reed)
% v1.1 released 18th July 1994
% v1.0 released 28th January 1994
% If your system does not have the AMS fonts version 2.0 installed, then
% remove the useAMS option.
%
% useAMS allows you to obtain upright Greek characters.
% e.g. \umu, \upi etc. See the section on "Upright Greek characters" in
% this guide for further information.
%
% If you are using AMS 2.0 fonts, bold math letters/symbols are available
% at a larger range of sizes for NFSS release 1 and 2 (using \boldmath or
% preferably \bmath).
%
% The usenatbib command allows the use of Patrick Daly's natbib.sty for
% cross-referencing.
%
% If you wish to typeset the paper in Times font (if you do not have the
% PostScript Type 1 Computer Modern fonts you will need to do this to get
% smoother fonts in a PDF file) then uncomment the next line
% \usepackage{Times}
%%%%% AUTHORS - PLACE YOUR OWN MACROS HERE %%%%%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\title[Generalized Formalisms of the RIME]{Generalized Formalisms of the Radio Interferometer Measurement Equation}
\author[D.C.~Price and O.~Smirnov]{
D. C. Price$^{1}$\thanks{E-mail: [email protected]}
and O. M. Smirnov$^{2,3}$\\
$^1$Harvard-Smithsonian Center for Astrophysics, MS42, 60 Garden Street, Cambridge MA, 01243 United States\\
$^2$Department of Physics and Electronics, Rhodes University, PO Box 94, Grahamstown, 6140, South Africa\\
$^3$SKA South Africa, 3rd Floor, The Park, Park Road, Pinelands, 7405, South Africa
\\}
\makeatother
\begin{document}
\date{Accepted 2015 Mmm dd. Received 2014 Nov 04}
\maketitle
\pagerange{\pageref{firstpage}--\pageref{lastpage}} \pubyear{2015}
\label{firstpage}
\begin{abstract}
The Radio Interferometer Measurement Equation (RIME) is a matrix-based mathematical
model that describes the response of a radio interferometer. The Jones
calculus it employs is not suitable for describing the analogue components
of a telescope. This is because it does not consider the effect of impedance
mismatches between components. This paper aims to highlight the limitations
of Jones calculus, and suggests some alternative methods that are
more applicable.
We reformulate the RIME with a different basis that includes magnetic and mixed coherency statistics.
We present a microwave network inspired 2N-port version of the RIME, and a tensor formalism based upon
the electromagnetic tensor from special relativity.
We elucidate the limitations of the Jones-matrix-based RIME for describing analogue components. We show how
measured scattering parameters of analogue components can be used in a 2N-port version of the RIME. In addition, we
show how motion at relativistic speed affects the observed flux.
We present reformulations of the RIME that correctly account for magnetic field coherency.
These reformulations extend the standard formulation, highlight its limitations, and may have applications in
space-based interferometry and precise absolute calibration experiments.
\end{abstract}
\begin{keywords}
Methods: data analysis
--- Techniques: interferometric
--- Techniques: polarimetric
\end{keywords}
\section{Introduction}
Coherency in electromagnetic fields is a fundamental topic within
optics. Its importance in fields such as radio astronomy can not be
overstated: interferometry and synthesis imaging techniques rely heavily
upon coherency theory \citep{Taylor1999,BookMandelWolf,ThompsonMoranSwenson2004}.
Of particular importance to radio astronomy is the Van-Cittert-Zernicke
theorem (vC-Z, \citealp{VanCittertZernicke1938}) and the radio interferometer
Measurement Equation (RIME, \citealp{Hamaker:1996p5735}). The vC-Z
relates the brightness of a source to its mutual coherency as measured
by an interferometer, and the RIME provides a polarimetric framework
to calibrate out corruptions caused along the signal's path.
While the vC-Z theorem dates back to 1938, more recent work such as
that of \citet{Carozzi:2009hf} extends its applicability to polarized
measurements over wide fields of view. The RIME has a much shorter
history: it was not formulated until 1996 \citep{Hamaker:1996p5735}.
Before the RIME, calibration was conducted in an ad-hoc manner, with
each polarization essentially treated separately. The framework was
expounded in a series of follow-up papers \citep{Sault:1996p5731,Hamaker:1996p5733,Hamaker:2000p7625,Hamaker:2006p7626}; recent work by Smirnov extends the formalism to the full sky case,
and reformulates the RIME using tensor algebra \citep{Smirnov:2011a,Smirnov:2011d}.
This article introduces two reformulations of the RIME, that extend its applicability and demonstrates
limitations with the Jones-matrix-based formalism.
Both of these reformulations consider full electromagnetic coherency statistics
(i.e. electric, magnetic and mixed coherencies). The first is inspired by transmission matrix methods
from microwave engineering. We show that this reformulation better accounts for the changes in
impedance between analogue components within a radio telescope.
The second reformulation is a relativistic-aware formulation of the RIME,
starting with the electromagnetic field tensor. This formalism allows for relativistic motion to be treated
as instrumental effect and incorporated into the RIME.
This article is organized as follows. In Section 2, we review existing formalisms of the RIME and methods from
microwave engineering. Section 3 defines coherency matrices, which are used in Section 4 to formulate
general coherency relationships for radio astronomy. In Section 5, we introduce a tensor formulation of the RIME based upon
the electromagnetic tensor from special relativity. Discussion and example applications are given in Section 6;
concluding remarks are given in Section 7.
\section{Jones and Mueller RIME formulations}
Before continuing on to derive a more general relationship between
the two-point coherency matrix and the voltage-current coherency,
we would like to give a brief overview and derivation of the radio
interferometer Measurement Equation of \citet{Hamaker:1996p5735}.
Our motivation behind this is to highlight that Hamaker et. al.'s
RIME is a special case of a more general (and thus less limited) coherency
relationship.
In their seminal Measurement Equation (ME) paper, \citet{Hamaker:1996p5735}
showed that Mueller and Jones calculuses provide a good framework for
modelling radio interferometers. In optics, Jones and Mueller matrices
are used to model the transmission of light through optical elements
\citep{Jones1941,Mueller1948}. Mueller matrices are $\mbox{4}\times\mbox{4}$
matrices that act upon the Stokes vector
\begin{equation}
\mbox{\textbf{\emph{s}}}=\begin{pmatrix}I & Q & U & V\end{pmatrix}^{T},
\end{equation}
whereas Jones matrices are only $\mbox{2}\times\mbox{2}$ in dimension
and act upon the `Jones vector': the electric field vector in a coordinate
system such that z-axis is aligned with the Poynting vector
\begin{equation}
\mbox{\textbf{\emph{e}}}(\mbox{\emph{\textbf{r}}},t)=\begin{pmatrix}e_{x}(\mbox{\emph{\textbf{r}}},t) & e_{y}(\mbox{\emph{\textbf{r}}},t)\end{pmatrix}^{T}.
\end{equation}
Jones calculus dictates that along a signal's path, any (linear) transformation
can be represented with a Jones matrix, \textbf{\emph{J}}:
\begin{equation}
\mbox{\textbf{\emph{e}}}_{\rm{out}}(\mbox{\textbf{\emph{r}}},t)=\mbox{\textbf{\emph{J}}}\mbox{\textbf{\emph{e}}}_{\rm{in}}(\mbox{\textbf{\emph{r}}},t)\label{eq:jones-transmission}
\end{equation}
A useful property of Jones calculus is that multiple effects along
a signal's path of propagation correspond to repeated multiplications:
\begin{equation}
\mbox{\textbf{\emph{e}}}_{\mathrm{out}}(\mbox{\textbf{\emph{r}}},t)=\emph{\textbf{J}}_{n}\cdots\emph{\textbf{J}}_{2}\emph{\textbf{J}}_{1}\mbox{\textbf{\emph{e}}}_{\rm{in}}(\mbox{\textbf{\emph{r}}},t),
\end{equation}
which can be collapsed into a single matrix when required.
The RIME uses Jones matrices to model the various corruptions and
effects during a signal's journey from a source right though to the
correlator. A block diagram for a (simplified) two-element interferometer
is shown in Figure~\ref{fig:RIME-cartoon}. From left to right, the
figure shows the journey of a signal from a source right through to
the correlator. The radiation from the source is picked up by two
antennas, which we have denoted with subscript $p$ and $q$. The
radiation follows a unique path to both of these antennas; each antenna
also has associated with it a unique chain of analogue components
that amplify and filter the signal to prepare it for correlation.
Each of these subscripted boxes may be represented by a Jones matrix;
alternatively an overall Jones matrix can be formed for the $p$ and
$q$ branches (the dashed areas).
\begin{figure}
\begin{centering}
\includegraphics[width=0.95\columnwidth]{figures-rime-diagram}
\par\end{centering}
\caption{Block diagram showing a simple model of an interferometer that can
be modelled with the RIME. Radiation from a source propagates through
free space to two telescopes, \emph{p }and \emph{q}. After passing
through the telescope's analogue chain, the two signals are interfered
in a cross-correlator.\label{fig:RIME-cartoon}}
\end{figure}
\subsection{Hamaker's RIME derivation}
The derivation of the RIME is remarkably simple and elegant. For a
single point source of radiation, the voltages induced at the terminals
of a pair of antennas, $p$ and $q$ are
\begin{align}
\mbox{\textbf{\emph{v}}}_{p}(t) & =\mbox{\textbf{\emph{J}}}_{p}\mbox{\textbf{\emph{e}}}_{0}(t)\\
\mbox{\textbf{\emph{v}}}_{q}(t) & =\mbox{\textbf{\emph{J}}}_{q}\mbox{\textbf{\emph{e}}}_{0}(t)
\end{align}
In its simplest form, the RIME is formed by taking the outer product
of these two relationships. Note that in their original paper, the
authors use the Kronecker incarnation of the outer product, which
we will denote with $\star$. We reserve the symbol $\otimes$ for
the matrix outer product of two matrices $\emph{\textbf{A}}\otimes\emph{\textbf{B}}=\mbox{\textbf{\emph{A}}}\emph{\textbf{B}}^{H},$
where \emph{H} denotes the Hermitian conjugate transpose%
\footnote{For a discussion on the subtleties of outer product definition, see
\citealt{Smirnov:2011d}, §A.6.1%
}. Using the Kronecker outer product, the RIME is given by
\begin{equation}
\left\langle \mbox{\textbf{\emph{v}}}_{p}\star\mbox{\textbf{\emph{v}}}_{q}\right\rangle =\left(\mbox{\textbf{\emph{J}}}_{p}\star\mbox{\textbf{\emph{J}}}_{q}\right)\left\langle \mbox{\textbf{\emph{e}}}_{0}\star\mbox{\textbf{\emph{e}}}_{0}\right\rangle =\left(\mbox{\textbf{\emph{J}}}_{p}\star\mbox{\textbf{\emph{J}}}_{q}\right)\mbox{\textbf{\emph{e}}}_{00}\label{eq:mueller-rime}
\end{equation}
where $\mbox{\textbf{\emph{J}}}_{p}$ and $\mbox{\textbf{\emph{J}}}_{q}$
are the Jones matrices representing all transformations along the
two signal paths, and $\left(\mbox{\textbf{\emph{J}}}_{p}\star\mbox{\textbf{\emph{J}}}_{q}\right)$
is a $\mbox{4}\times\mbox{4}$ matrix. Here, $\mbox{\textbf{\emph{e}}}_{00}$
is the sky brightness of a single point source of radiation. For
a multi-element interferometer, every antenna has its own unique Jones
matrix, and a RIME may be written for every pair of antennas.
Due to their choice of outer product, Hamaker et. al. arrive at a
coherency vector
\begin{equation}
\mbox{\textbf{\emph{e}}}_{pq}(\mbox{\textbf{\emph{r}}}_{p},\mbox{\textbf{\emph{r}}}_{q},\tau)=\left\langle \mbox{\textbf{\emph{e}}}_{p}(\mbox{\textbf{\emph{r}}}_{p},t)\star\mbox{\textbf{\emph{e}}}_{q}(\mbox{\textbf{\emph{r}}}_{q},t+\tau)\right\rangle =\begin{pmatrix}\left\langle e_{px}e_{qx}^{*}\right\rangle \\
\left\langle e_{px}e_{qy}^{*}\right\rangle \\
\left\langle e_{py}e_{qx}^{*}\right\rangle \\
\left\langle e_{py}e_{qy}^{*}\right\rangle
\end{pmatrix},
\end{equation}
as opposed to the coherency matrix of \citet{Wolf1954}; this is introduced
in §\ref{sub:Electromagnetic-coherency} below. The column vector
of a point source at $\mbox{\textbf{\emph{r}}}_{0}$ is then $\mbox{\textbf{\emph{e}}}_{00}$;
that is, $p=q$ and $\tau=0$. The vector $\mbox{\textbf{\emph{e}}}_{00}$
is related to the Stokes vector by the transform\footnote{Here, $j=\sqrt{-1}$, to avoid confusion with current, $i$, used later.}.
\begin{equation}
\begin{pmatrix}I\\
Q\\
U\\
V
\end{pmatrix}=\begin{pmatrix}1 & 0 & 0 & 1\\
1 & 0 & 0 & -1\\
0 & 1 & 1 & 0\\
0 & -j & j & 0
\end{pmatrix}\begin{pmatrix}\left\langle e_{0x}e_{0x}^{*}\right\rangle \\
\left\langle e_{0x}e_{0y}^{*}\right\rangle \\
\left\langle e_{0y}e_{0x}^{*}\right\rangle \\
\left\langle e_{0y}e_{0y}^{*}\right\rangle
\end{pmatrix}.
\end{equation}
The quantity $\left(\mbox{\textbf{\emph{J}}}_{p}\star\mbox{\textbf{\emph{J}}}_{q}\right)$
in Eq.~\ref{eq:mueller-rime} can therefore be viewed as a Mueller
matrix. That is, Eq.~\ref{eq:mueller-rime} can be considered a Mueller-calculus-based ME for a radio interferometer. To
summarize, Hamaker et. al. showed that:
\begin{itemize}
\item Jones matrices can be used to model the propagation of a signal from
a radiation source through to the voltage at the terminal of an antenna.
\item A Mueller matrix can be formed from the Jones terms of a pair of antennas,
which then relates the measured voltage coherency of that pair to
a source's brightness.
\end{itemize}
Showing that these calculuses were applicable and indeed useful for modelling
and calibrating radio interferometers was an important step forward
in radio polarimetry.
\subsection{The 2$\times$2 RIME}
In a later paper, \citet{Hamaker:2000p7625} presents a modified formulation
of the RIME, where instead of forming the coherency vector from the Kronecker
outer product ($\star$), the coherency matrix is formed from the matrix
outer product ($\otimes$):
\begin{equation}
\mbox{\textbf{\emph{E}}}_{pq}=\left\langle \emph{\textbf{e}}_{p}\otimes\emph{\textbf{e}}_{q}\right\rangle =\begin{pmatrix}\left\langle e_{px}e_{qx}^{*}\right\rangle & \left\langle e_{px}e_{qy}^{*}\right\rangle \\
\left\langle e_{px}e_{qx}^{*}\right\rangle & \left\langle e_{py}e_{qy}^{*}\right\rangle
\end{pmatrix}
\end{equation}
The resulting coherency matrix is then shown to be related to the
Stokes parameters by $\mbox{\textbf{\emph{E}}}_{00}=\mbox{ \textsf{B} }$,
where
\begin{align}
\mathsf{B} & =\begin{pmatrix}I+Q & U+jV\\
U-jV & I-Q
\end{pmatrix}.
\end{align}
The equivalent to the RIME of Eq.~\ref{eq:mueller-rime} is
\begin{equation}
\left\langle \mbox{\textbf{\emph{v}}}_{p}\otimes\mbox{\textbf{\emph{v}}}_{q}\right\rangle =2\left\langle \mbox{(\ensuremath{\mbox{\textbf{\emph{J}}}_{p}\textbf{\emph{e}}}}_{0})\otimes(\mbox{\textbf{\emph{J}}}_{q}\mbox{\textbf{\emph{e}}}_{0})\right\rangle =2\mbox{\textbf{\emph{J}}}_{p}\left\langle \mbox{\textbf{\emph{e}}}_{0}\otimes\mbox{\textbf{\emph{e}}}_{0}\right\rangle \mbox{\textbf{\emph{J}}}_{q}^{H}\label{eq:jones-rime}
\end{equation}
or more simply,
\begin{equation}
\mathsf{V}_{pq}=\mbox{\textbf{\emph{J}}}_{p}\mbox{\textbf{\emph{B}}}\mbox{\textbf{\emph{J}}}_{q}^{H}.\label{eq:jones-rime-2}
\end{equation}
This approach avoids the need to use $\mbox{4}\times\mbox{4}$ Mueller
matrices, so is both simpler and computationally advantageous. This
form is also cleaner in appearance, as fewer indices are required.
\citet{Smirnov:2011a} takes the $\mbox{2}\times\mbox{2}$ version
of the RIME as a starting point and extends the RIME to a full sky
case. By treating the sky as a brightness distribution $\mathsf{B}$($\sigma$),
where $\sigma$ is a direction vector, each antenna has a Jones term
$\mbox{\textbf{\emph{J}}}_{p}(\sigma)$ describing the signal path
for a given direction. The visibility matrix $\mathsf{V}_{pq}$ is
then found by integrating over the entire sky:
\begin{equation}
\mathsf{V}_{pq}=\int_{4\pi}\mbox{\textbf{\emph{J}}}_{p}(\sigma)\mathsf{B}\mbox{(\ensuremath{\sigma})}\mbox{\textbf{\emph{J}}}_{q}^{H}(\sigma)d\Omega.
\end{equation}
This is a more general form of\emph{ Zernicke's propagation law}.
Smirnov goes on to derive the Van-Cittert Zernicke theorem from this
result; we return to vC-Z later in this article.
\subsection{A generalized tensor RIME}
In \citet{Smirnov:2011d}, a generalized tensor formalism of the
RIME is presented. The coherency of two voltages is once again defined
via the outer product $e^{i}\bar{e}_{j}$, giving a (1,1)-type
tensor expression:
\begin{equation}
\mbox{\emph{V}}_{qj}^{pi}=\mbox{\emph{J}}_{\alpha}^{pi}\mbox{\emph{B}}_{\beta}^{\alpha}\bar{\mbox{\emph{J}}}_{qj}^{\beta}.
\end{equation}
This formalism is better capable of describing mutual coupling between
antennas, beamforming, and wide field polarimetry. $ $ In this paper,
we focus on a matrix based formalism which considers the propagation
of the magnetic field in addition to the electric field. We then show
that this formulation is equivalent to the tensor formalism presented
in \citet{Smirnov:2011d}, but is instead in the vector space $\mathbb{C}^{6}$.
\subsection{Microwave engineering transmission matrix methods}
All formulations of the RIME to date --- including the tensor formulation
--- do not consider the propagation of the magnetic field. In free
space, magnetic field coherency can be easily derived from the electric
field coherency. However, at the boundary between two media, the magnetic
field must be considered. Here, we introduce some results from microwave
engineering which contrast with the Jones formalism.
In circuit theory, the well-known impedance relation, $V=ZI$ relates
current and voltage over a terminal pair (or `port'). However, this
relation is specific to a 1-port network; for microwave networks with
more than one port, the matrix form $[V]=[Z][I]$ must be used:
\begin{equation}
\begin{pmatrix}v_{1}\\
v_{2}\\
\vdots\\
v_{N}
\end{pmatrix}=\begin{pmatrix}Z_{11} & Z_{12} & \cdots & Z_{1N}\\
Z_{21} & \ddots & & \vdots\\
\vdots & & & \vdots\\
Z_{N1} & \cdots & \cdots & Z_{NN}
\end{pmatrix}\begin{pmatrix}i_{1}\\
i_{2}\\
\vdots\\
i_{N}
\end{pmatrix},
\end{equation}
where $Z_{ab}$ is the port-to-port impedance from port \emph{a }to
port \emph{b}, $v_{n}$ is the voltage on port $n$, and $i_{n}$
is the current. A common example of a 2-port network is a coaxial cable, and a common
3-port network is the Wilkinson power divider.
The analogue components of most radio telescopes can be considered
2-port networks. The 2-port transmission, or $ABCD$ matrix, relates
the voltages and currents of a 2-port network:
\begin{equation}
\begin{pmatrix}v_{1}\\
i_{1}
\end{pmatrix}=\begin{pmatrix}A & B\\
C & D
\end{pmatrix}\begin{pmatrix}v_{2}\\
i_{2}
\end{pmatrix},
\end{equation}
this is shown in Figure~\ref{fig:transmission-cascade}. If two 2-port
networks are connected in cascade (see Figure~\ref{fig:transmission-cascade}),
then the output is equal to the product of the transmission matrices
representing the individual components:
\begin{equation}
\begin{pmatrix}v_{1}\\
i_{1}
\end{pmatrix}=\begin{pmatrix}A_{1} & B_{1}\\
C_{1} & D_{1}
\end{pmatrix}\begin{pmatrix}A_{2} & B_{2}\\
C_{2} & D_{2}
\end{pmatrix}\begin{pmatrix}v_{2}\\
i_{2}
\end{pmatrix},
\end{equation}
as is shown in texts such as \citet{Pozar2005}. The elements in the
2-port transmission matrix are related to port-to-port impedances
by%
\footnote{Note that $Z_{21}=0$ is zero impedance, which is never satisfied
in real components, and $Z_{21}=\infty$ represents an open circuit%
}
\begin{align}
A & =Z_{11}/Z_{21}\\
B & =\frac{Z_{11}Z_{22}-Z_{12}Z_{21}}{Z_{21}}\\
C & =1/Z_{21}\\
D & =Z_{22}/Z_{21}.
\end{align}
Like the Jones matrix, the $ABCD$ matrix allows a signal's path to
be modelled through multiplication of matrices representing discrete
components. While the Jones matrix acts upon a pair of orthogonal
electric field components, the $ABCD$ matrix acts upon a voltage-current
pair at a single port. As Jones matrices do not consider changes in
impedance (free space impedance is implicitly assumed), it is not
suitable for describing analogue components. Conversely, the $2\times2$
$ABCD$ matrix cannot model cross-polarization response of a telescope.
In the section that follows, we derive a more general coherency relationship
which weds the advantages of both approaches.
\begin{figure}
\begin{centering}
\includegraphics[width=0.8\columnwidth]{figures-transmission-matrix}
\par\end{centering}
\caption{Top: The ABCD matrix for a 2-port network. In this diagram, voltage
is denoted with $V$, and current with $I$. Bottom: Connecting two
components in cascade. Diagram adapted from \citet{Pozar2005}\label{fig:transmission-cascade}}
\end{figure}
\section{Coherency in radio astronomy}
We now turn our attention to formulating a more general RIME that
is valid in a larger range of cases. In this section, we introduce
the coherency matrices of \citet{Wolf1954}, along with voltage-current
coherency matrices. The following section then formulates relationships
between source brightness and measured coherency based upon these
matrices.
\subsection{Electromagnetic coherency\label{sub:Electromagnetic-coherency}}
To begin, we introduce the coherency matrices of \citet{Wolf1954},
that fully describe the coherency statistics of an electromagnetic
field. We may start by introducing $\mbox{\textbf{\emph{e}}}(\mbox{\textbf{\emph{r}}},t)$
and $\mbox{\textbf{\emph{h}}}(\mbox{\textbf{\emph{r}}},t)$ as the
complex analytic representations of the electric and magnetic field
vectors at a spacetime point $(\mbox{\textbf{\emph{r}}},t)$:
\begin{align}
\mbox{\textbf{\emph{e}}}(\mbox{\textbf{\emph{r}}},t) & =\begin{pmatrix}e_{x}(\mbox{\textbf{\emph{r}}},t) & e_{y}(\mbox{\textbf{\emph{r}}},t) & e_{z}(\mbox{\textbf{\emph{r}}},t)\end{pmatrix}^{T}\label{eq:elec-vec}\\
\mbox{\textbf{\emph{h}}}(\mbox{\textbf{\emph{r}}},t) & =\begin{pmatrix}h_{x}(\mbox{\textbf{\emph{r}}},t) & h_{y}(\mbox{\textbf{\emph{r}}},t) & h_{z}(\mbox{\textbf{\emph{r}}},t)\end{pmatrix}^{T}
\end{align}
The coherency matrices are then defined by the formulae
\begin{align}
\mbox{\textbf{\emph{E}}}_{pq}(\mbox{\textbf{\emph{r}}}_{p},\mbox{\textbf{\emph{r}}}_{q},\tau) & =\begin{pmatrix}\left\langle e_{k}(\mbox{\textbf{\emph{r}}}_{p},t)e_{l}^{*}(\mbox{\textbf{\emph{r}}}_{q},t+\tau)\right\rangle \end{pmatrix}\\
\mbox{\textbf{\emph{H}}}_{pq}(\mbox{\textbf{\emph{r}}}_{p},\mbox{\textbf{\emph{r}}}_{q},\tau) & =\begin{pmatrix}\left\langle h_{k}(\mbox{\textbf{\emph{r}}}_{p},t)h_{l}^{*}(\mbox{\textbf{\emph{r}}}_{q},t+\tau)\right\rangle \end{pmatrix}\\
\mbox{\textbf{\emph{M}}}_{pq}(\mbox{\textbf{\emph{r}}}_{p},\mbox{\textbf{\emph{r}}}_{q},\tau) & =\begin{pmatrix}\left\langle e_{k}(\mbox{\textbf{\emph{r}}}_{p},t)h_{l}^{*}(\mbox{\textbf{\emph{r}}}_{q},t+\tau)\right\rangle \end{pmatrix}\\
\mbox{\textbf{\emph{N}}}_{pq}(\mbox{\textbf{\emph{r}}}_{p},\mbox{\textbf{\emph{r}}}_{q},\tau) & =\begin{pmatrix}\left\langle h_{k}(\mbox{\textbf{\emph{r}}}_{p},t)e_{l}^{*}(\mbox{\textbf{\emph{r}}}_{q},t+\tau)\right\rangle \end{pmatrix}.
\end{align}
Here, $k$ and $l$ are indices representing the $x,y,z$ subscripts
from Cartesian coordinates. $\mbox{\textbf{\emph{E}}}_{pq}$ and $\mbox{\textbf{\emph{H}}}_{pq}$
are called the electric and the magnetic coherency matrices, and $\mbox{\textbf{\emph{M}}}_{pq}$
and $\mbox{\textbf{\emph{N}}}_{pq}$ are called the mixed coherency
matrices. The subscripts $p$ and $q$ correspond to the spacetime
points $(\mbox{\textbf{\emph{r}}}_{p},t)$ and $(\mbox{\textbf{\emph{r}}}_{q},t+\tau)$,
respectively. We may arrange these into a single $\mbox{6}\times\mbox{6}$
matrix \textbf{$\mathbb{B}_{pq}$} that is equivalent to the time
averaged outer product of the electromagnetic field column vectors
at spacetime points $(\mbox{\textbf{\emph{r}}}_{p},t)$ and $(\mbox{\textbf{\emph{r}}}_{q},t+\tau)$:
\begin{equation}
\mathbb{B}_{pq}=\left\langle \begin{pmatrix}\mbox{\textbf{\emph{e}}}_{p}\\
\mbox{\textbf{\emph{h}}}_{p}
\end{pmatrix}\otimes\begin{pmatrix}\mbox{\textbf{\emph{e}}}_{q}\\
\mbox{\textbf{\emph{h}}}_{q}
\end{pmatrix}\right\rangle =\begin{pmatrix}\mbox{\textbf{\emph{E}}}_{pq} & \mbox{\textbf{\emph{M}}}_{pq}\\
\mbox{\textbf{\emph{N}}}_{pq} & \mbox{\textbf{\emph{H}}}_{pq}
\end{pmatrix}\label{eq:coherency-matrix}
\end{equation}
This matrix fully describes the coherency properties of an electromagnetic
field at two points in spacetime. We will refer to this matrix as
the \emph{two point coherency matrix}. It is worth noting that:
\begin{itemize}
\item When $\mbox{\textbf{\emph{r}}}_{p}=\mbox{\textbf{\emph{r}}}_{q}$
and $\tau=0$ we retrieve what \citet{Bergman:2008p7859} refer to
as the `EM sixtor matrix'. \citet{Bergman:2008p7859} show this sixtor
matrix is related to what they refer to as `canonical electromagnetic
observables': a unique set of Stokes-like parameters that are irreducible
under Lorentz transformations. These are used in the the analysis
of electromagnetic field data from spacecraft.
\item For monochromatic plane waves, when $\mbox{\textbf{\emph{r}}}_{p}=\mbox{\textbf{\emph{r}}}_{q}$
and $\tau=0$, and we choose a coordinate system with $z$ in the
direction of propagation (i.e. along the Poynting vector), $\textbf{\emph{E}}_{pq}$
becomes what \citet{Smirnov:2011a} refers to as the \emph{brightness
matrix}, $\mathsf{B}$.
From here forward, we drop the subscript $\mathbb{B}=\mathbb{B}_{00}$ and shall refer to
this as the \emph{brightness coherency} to highlight its relationship with $\mathsf{B}$.
\end{itemize}
\subsection{Voltage and current coherency}
A radio telescope converts a free space electromagnetic field into
a time varying voltage, which we then measure after signal conditioning
(e.g. amplification and filtering). As such, radio interferometers
measure coherency statistics between time varying voltages.
One may model the analogue components of a telescope as a 6-port network,
with three inputs ports and three output ports. We propose this so
that there is an input-output pair of ports for each of the orthogonal
components of the electromagnetic field. We can then define a set
of voltages \textbf{\emph{v}}\emph{(t)} and currents $\mbox{\textbf{\emph{i}}}(t)$
\begin{align}
\mbox{\textbf{\emph{v}}}(t) & =\begin{pmatrix}v_{x}(t) & v_{y}(t) & v_{z}(t)\end{pmatrix}^{T}\\
\mbox{\textbf{\emph{i}}}(t) & =\begin{pmatrix}i_{x}(t) & i_{y}(t) & i_{z}(t)\end{pmatrix}^{T}.
\end{align}
In practice, most telescopes are single or dual polarization, so only
the $x$ and $y$ components are sampled. Nonetheless, it is possible
to sample all three components with three orthogonal antenna elements
\citep{Bergman:2005p7825}. The voltage response of an antenna is
linearly related to the electromagnetic field strength \citep{Hamaker:1996p5735},
and the current is linearly related to voltage by Ohm's law, so we
may write a general linear relationship
\begin{equation}
\begin{pmatrix}\mbox{\textbf{\emph{v}}}(t)\\
\mbox{\textbf{\emph{i}}}(t)
\end{pmatrix}=\begin{pmatrix}\mbox{\textbf{\emph{A}}} & \mbox{\textbf{\emph{B}}}\\
\mbox{\textbf{\emph{C}}} & \mathbf{\mbox{\textbf{\emph{D}}}}
\end{pmatrix}\begin{pmatrix}\mbox{\textbf{\emph{e}}}(\mbox{\textbf{\emph{r}}},t)\\
\mbox{\textbf{\emph{h}}}(\mbox{\textbf{\emph{r}}},t)
\end{pmatrix},
\end{equation}
where \textbf{\emph{A}}, \textbf{\emph{B}},\textbf{ }\textbf{\emph{C}}\textbf{
}and\textbf{ }\textbf{\emph{D}}\textbf{ }are block matrices forming
an overall transmission matrix $\mathbb{T}'$. We can now define a
matrix of voltage-current coherency statistics that consists of the
block matrices
\begin{align}
\mbox{\textbf{\emph{V}}}_{pq}(\tau) & =\begin{pmatrix}\left\langle v_{k}(t)v_{l}^{*}(t+\tau)\right\rangle \end{pmatrix}\\
\mbox{\textbf{\emph{W}}}_{pq}(\tau) & =\begin{pmatrix}\left\langle i_{k}(t)i_{l}^{*}(t+\tau)\right\rangle \end{pmatrix}\\
\mbox{\textbf{\emph{K}}}_{pq}(\tau) & =\begin{pmatrix}\left\langle v_{k}(t)i_{l}^{*}(t+\tau)\right\rangle \end{pmatrix}\\
\mbox{\textbf{\emph{L}}}_{pq}(\tau) & =\begin{pmatrix}\left\langle i_{k}(t)v_{l}^{*}(t+\tau)\right\rangle \end{pmatrix},
\end{align}
these are analogous to (and related to)\textbf{ }the electromagnetic
coherency matrices above%
\footnote{Spatial location $\mathbf{r}$ is no longer relevant as the voltage
propagates through analogue components with clearly defined inputs
and outputs.%
}. In a similar manner to the two-point coherency matrix, we define
$\mathbb{V}_{pq}$
\begin{equation}
\mathbb{V}_{pq}=\left\langle \begin{pmatrix}\mbox{\textbf{\emph{v}}}_{p}\\
\mbox{\textbf{\emph{i}}}_{p}
\end{pmatrix}\otimes\begin{pmatrix}\mbox{\textbf{\emph{v}}}_{q}\\
\mbox{\textbf{\emph{i}}}_{q}
\end{pmatrix}\right\rangle =\begin{pmatrix}\mbox{\textbf{\emph{V}}}_{pq} & \mathbf{\mbox{\textbf{\emph{K}}}}_{pq}\\
\mbox{\textbf{\emph{L}}}_{pq} & \mbox{\textbf{\emph{W}}}_{pq}
\end{pmatrix},
\end{equation}
which we will refer to as the \emph{voltage-current coherency matrix}.
This is analogous to the `visibility matrix', $\mathsf{V}_{pq}$,
of \citet{Smirnov:2011a}.
\section{Two point coherency relationships\label{sec:2pt-coherency}}
Now we have introduced the two-point coherency matrix $\mathbb{B}_{pq}$
and the voltage-current coherency matrix $\mathbb{V}_{pq}$, we can
formulate relationships between the two. In this section, we first
formulate a general coherency relationship describing propagation
from a source of electromagnetic radiation to two spacetime coordinates.
We then show that this relationship underlies both the RIME and the
vC-Z relationship.
\subsection{A general two point coherency relationship}
Suppose we have two sensors, located at points $\mbox{\textbf{\emph{r}}}_{p}$
and $\mbox{\textbf{\emph{r}}}_{q}$, which fully measure all components
of the electromagnetic field. Assuming linearity, the propagation
of an electromagnetic field $\mbox{\textbf{\emph{f}}}_{0}=\begin{pmatrix}\mbox{\textbf{\emph{e}}}(\mbox{\textbf{\emph{r}}}_{0},t) & \mbox{\textbf{\emph{h}}}(\mbox{\textbf{\emph{r}}}_{0},t)\end{pmatrix}^{T}$
from a point $\mbox{\textbf{\emph{r}}}_{0}$ to $\mbox{\textbf{\emph{r}}}_{p}$
and $\mbox{\textbf{\emph{r}}}_{q}$ can be encoded into a 6$\times$6
matrices, $\mathbb{T}_{p}$ and $\mathbb{T}_{q}$:
\begin{align}
\mbox{\textbf{\emph{f}}}_{p} & =\mathbb{T}_{p}\mbox{\textbf{\emph{f}}}_{0}\\
\mbox{\textbf{\emph{f}}}_{q} & =\mathbb{T}_{q}\mbox{\textbf{\emph{f}}}_{0}
\end{align}
The coherency between the two signals $\mbox{\textbf{\emph{f}}}_{p}$
and $\mbox{\textbf{\emph{f}}}_{q}$ is then given by the matrix $\mathbb{B}_{pq}$:
\begin{align}
\mathbb{B}_{pq} & =\left\langle \mbox{\textbf{\emph{f}}}_{p}\otimes\mbox{\textbf{\emph{f}}}_{q}\right\rangle \label{eq:tp_tq_outer}\\
& =\left\langle (\mathbb{T}_{p}\mbox{\textbf{\emph{f}}}_{0})\otimes(\mathbf{\mathbb{T}}_{q}\mbox{\textbf{\emph{f}}}_{0})\right\rangle \\
& =\left\langle \mathbf{\mathbb{T}}_{p}(\mbox{\textbf{\emph{f}}}_{0}\otimes\mbox{\textbf{\emph{f}}}_{0})\mathbb{T}_{q}^{H}\right\rangle \\
& =\mathbf{\mathbb{T}}_{p}\mathbb{B}\mathbf{\mathbb{T}}_{q}^{H}
\end{align}
we can write this in terms of block matrices
\begin{equation}
\begin{pmatrix}\mbox{\textbf{\emph{E}}}_{pq} & \mbox{\textbf{\emph{M}}}_{pq}\\
\mbox{\textbf{\emph{N}}}_{pq} & \mbox{\textbf{\emph{H}}}_{pq}
\end{pmatrix}=\begin{pmatrix}\mbox{\textbf{\emph{A}}}_{p} & \mbox{\textbf{\emph{B}}}_{p}\\
\mbox{\textbf{\emph{C}}}_{p} & \mbox{\textbf{\emph{D}}}_{p}
\end{pmatrix}\begin{pmatrix}\mbox{\textbf{\emph{E}}}_{00} & \mbox{\textbf{\emph{M}}}_{00}\\
\mbox{\textbf{\emph{N}}}_{00} & \mbox{\textbf{\emph{H}}}_{00}
\end{pmatrix}\begin{pmatrix}\mbox{\textbf{\emph{A}}}_{q} & \mbox{\textbf{\emph{B}}}_{q}\\
\mbox{\textbf{\emph{C}}}_{q} & \mbox{\textbf{\emph{D}}}_{q}
\end{pmatrix}^{H}\label{eq:two-pt-coherency}
\end{equation}
This is the most general form that relates the coherency at two points
$\mbox{\textbf{\emph{r}}}_{p}$ and $\mbox{\textbf{\emph{r}}}_{q}$,
to the electromagnetic energy density at point $\mbox{\textbf{\emph{r}}}_{0}$.
In radio astronomy, antennas are used as sensors to measure the electromagnetic
field. Following from Eq.~\ref{eq:two-pt-coherency}, we may write
an equation relating voltage and current coherency:
\begin{equation}
\mathbb{V}_{pq}=\mathbf{\mathbb{T}}{}_{p}^{'}(\mathbf{\mathbb{T}}_{p}\mathbb{B}\mathbb{T}_{q}^{H})\mathbf{\mathbb{T}}{}_{q}^{'H}.\label{eq:measurement-eq-non-collapsed}
\end{equation}
As the $\mathbb{T}'$ and \textbf{$\mathbb{T}$} matrices are both
$\mbox{6}\times\mbox{6}$ , we can are both collapse these matrices
into one overall matrix. Eq.~\ref{eq:measurement-eq-non-collapsed}
then becomes
\begin{equation}
\mathbb{V}_{pq}=\mathbf{\mathbb{T}}{}_{p}\mathbb{B}\mathbf{\mathbb{T}}_{q}^{H},\label{eq:RIME-basic}
\end{equation}
which is the general form that relates the voltage-coherency matrix
$\mathbb{V}_{pq}$ to the brightness coherency $\mathbb{B}.$
Equation\ \ref{eq:RIME-basic} is a central result of this paper.
It is a general case which relates the EM field at a given point in
space-time to the voltage and current coherencies in between pairs
of telescopes. In the sections that follow, we show that generalized
versions of the Van-Cittert-Zernicke theorem and RIME may be formulated based upon this coherency relationship,
and that the common formulations can be derived from these general
results.
\subsection{The Radio Interferometer Measurement Equation}
By comparing Eq.~\ref{eq:jones-rime-2} with Eq.~\ref{eq:two-pt-coherency},
it is apparent that the Jones formulation of the RIME is retrieved
by setting all but the top left block matrices to zero, such that
we have
\begin{equation}
\mbox{\textbf{\emph{V}}}_{pq}=\mbox{\textbf{\emph{A}}}_{p}\mbox{\textbf{\emph{E}}}_{00}\mbox{\textbf{\emph{A}}}_{q}^{H}.
\end{equation}
But under what assumptions may we ignore the other entries of Eq.~\ref{eq:two-pt-coherency}?
To answer this, we may note that monochromatic plane waves in free
space have \textbf{\emph{E}}\textbf{ }and \textbf{\emph{H}}\textbf{
}are in phase and mutually perpendicular:
\begin{align}
\mbox{\textbf{\emph{e}}}(\mbox{\textbf{\emph{r}}},t) & =\begin{pmatrix}e_{x}(\mbox{\textbf{\emph{r}}},t) & e_{y}(\mbox{\textbf{\emph{r}}},t)\end{pmatrix}^{T}\nonumber\\
\mbox{\textbf{\emph{h}}}(\mbox{\textbf{\emph{r}}},t) & =\frac{1}{c_{0}}\begin{pmatrix}-e_{y}(\mbox{\textbf{\emph{r}}},t) & e_{x}(\mbox{\textbf{\emph{r}}},t)\end{pmatrix}^{T}
\label{eq:MPWZ}
\end{align}
Where $c_{0}$ is the magnitude of the speed of light. In such a case,
all coherency statistics can be derived from the $\mbox{2}\times\mbox{2}$
brightness matrix~$\mathsf{B}$. \citet{Carozzi:2009hf} show that
the field coherencies can be written
\begin{align}
\mathbb{B} & =\begin{pmatrix}\mbox{\textbf{\emph{E}}}_{pq} & \mbox{\textbf{\emph{M}}}_{pq}\\
\mbox{\textbf{\emph{N}}}_{pq} & \mbox{\textbf{\emph{H}}}_{pq}
\end{pmatrix}=\begin{pmatrix}\mathsf{B} & \mathsf{B}\mbox{\textbf{\emph{F}}}^{T}\\
\mbox{\textbf{\emph{F}}}\mathsf{B} & \mbox{\textbf{\emph{F}}}\mathsf{B}\mbox{\textbf{\emph{F}}}^{T}
\end{pmatrix}\label{eq:Carozzi-matrix}
\end{align}
where $\mathbf{F}$ is the matrix
\begin{equation}
\mbox{\textbf{\emph{F}}}=\frac{1}{c_{0}}\begin{pmatrix}0 & 1\\
-1 & 0
\end{pmatrix}.
\end{equation}
Under these conditions, the rank of $\mathbb{B}$ is 2, so the relationship
in Eq.~\ref{eq:two-pt-coherency} is over constrained. It follows
that the $\mbox{2}\times\mbox{2}$ RIME is perfectly acceptable ---
and indeed preferable to Eq.~\ref{eq:two-pt-coherency} --- for describing
coherency of plane waves that propagate through free space.
There are numerous situations in which we cannot assume that we have
monochromatic plane waves. This includes near field sources where
the wavefront is not well approximated by a plane wave; propagation
through ionized gas; and situations where we choose not to treat our
field as a superposition of quasi-monochromatic components. Most importantly,
the assumptions that underlie the $\mbox{2}\times\mbox{2}$ RIME do
not hold within the analogue components of a telescope, where the
signal does not enjoy free space impedance.\emph{ }
\subsection{A 2N-port transmission matrix based RIME\label{sub:2N-port-trans-RIME}}
For a dual polarization telescope, a 4-port description (2-in 2-out)
of our analogue system is more appropriate than the general 6-port
description. Using the result \ref{eq:Carozzi-matrix} above and Eq.~\ref{eq:RIME-basic},
we can write a relationship
\begin{equation}
\mbox{\textbf{\emph{V}}}_{pq}=\begin{pmatrix}\mbox{\textbf{\emph{A}}}_{p} & \mbox{\textbf{\emph{B}}}_{p}\\
\mbox{\textbf{\emph{C}}}_{p} & \mbox{\textbf{\emph{D}}}_{p}
\end{pmatrix}\begin{pmatrix}\mathsf{B} & \mathsf{B}\mbox{\textbf{\emph{F}}}^{T}\\
\mbox{\textbf{\emph{F}}}\mathsf{B} & \mbox{\textbf{\emph{F}}}\mathsf{B}\mbox{\textbf{\emph{F}}}^{T}
\end{pmatrix}\begin{pmatrix}\mbox{\textbf{\emph{A}}}_{q} & \mbox{\textbf{\emph{B}}}_{q}\\
\mbox{\textbf{\emph{C}}}_{q} & \mbox{\textbf{\emph{D}}}_{q}
\end{pmatrix}^{H},\label{eq:2n-port-RIME}
\end{equation}
Here, all block matrices have been reduced in dimensions from $\mbox{3}\times\mbox{3}$
to $\mbox{2}\times\mbox{2}$. This version of the RIME retains the
ability to model analogue components, but uses the approximations
of the vC-Z to express $\mathbb{B}$ in terms of the regular $\mbox{2}\times\mbox{2}$
brightness matrix B. The transmission matrices matrices here are similar
to the 2N-port transmission matrices as defined by \citet{BrandaoFaria2002}.%
\footnote{We note that our definition here is the inverse of that of Faria (input
and output are swapped).%
}
The transmission matrices may be broken down into a chain of cascaded components. That is, for
a cascade of $n$ components, we may write the overall transmission matrix as a product of matrices
representing the individual components:
\begin{equation}
\begin{pmatrix}\mbox{\textbf{\emph{A}}}_{p} & \mbox{\textbf{\emph{B}}}_{p}\\
\mbox{\textbf{\emph{C}}}_{p} & \mbox{\textbf{\emph{D}}}_{p}
\end{pmatrix}=\begin{pmatrix}\mbox{\textbf{\emph{A}}}_{np} & \mbox{\textbf{\emph{B}}}_{np}\\
\mbox{\textbf{\emph{C}}}_{np} & \mbox{\textbf{\emph{D}}}_{np}
\end{pmatrix}\cdots\begin{pmatrix}\mbox{\textbf{\emph{A}}}_{1p} & \mbox{\textbf{\emph{B}}}_{1p}\\
\mbox{\textbf{\emph{C}}}_{1p} & \mbox{\textbf{\emph{D}}}_{1p}
\end{pmatrix}
\end{equation}
This is similar to, but more general than, Jones calculus. In the following section we will explore the difference.
\subsection{Limitations of Jones calculus}
Jones calculus essentially asserts two things. Firstly, it asserts that the voltage 2-vector at the output of a
dual-polarization system $\bmath{v}_p=(v_{p1},v_{p2})^T$
is linear with respect to the EMF 2-vector $\bmath{e}=(e_x,e_y)^T$ at the input of the system:
\begin{equation}
\bmath{v}_p = \bmath{J}_p \bmath{e},
\end{equation}
where the Jones matrix $\bmath{J}_p$ describes the voltage transmission properties of the system.
The second assertion is that for a system composed of $n$ components, the effective Jones matrix
is a product of the Jones matrices of the components:
\begin{equation}
\bmath{v}_p = \bmath{J}_{np} \cdots \bmath{J}_{1p} \bmath{e}.
\end{equation}
With 2N-port transmission matrices, we instead describe the input of the system by the 4-vector $[\bmath{e},\bmath{h}]^T$,
which for a monochromatic plane wave is equal to
\begin{equation}
\begin{pmatrix}\bmath{e}\\ \bmath{h}\end{pmatrix} =
(e_x,e_y,-e_y/c_0,e_x/c_0)^T =
\begin{pmatrix}\bmath{e}\\ \bmath{F}\bmath{e}\end{pmatrix},
\end{equation}
and the output of the system is a 4-vector of 2 voltages and 2 currents $(\bmath{v}_p,\bmath{i}_p)^T$, which is
linear with respect to the input:
\begin{equation}
\begin{pmatrix}\bmath{v}_p\\ \bmath{i}_p\end{pmatrix} =
\begin{pmatrix}\bmath{A}_p & \bmath{B}_p\\ \bmath{C}_p & \bmath{D}_p \end{pmatrix}
\begin{pmatrix}\bmath{e}\\ \bmath{F}\bmath{e}\end{pmatrix}.
\end{equation}
Note that the output voltage is still linear with respect to the input $\bmath{e}$. Indeed, if one is only interested
in the voltage, the above becomes
\begin{equation}
\bmath{v}_p = (\bmath{A}_p+\bmath{B}_p\bmath{F})\bmath{e},
\end{equation}
i.e. the system has an effective Jones matrix (i.e. a voltage transmission matrix) of
\begin{equation}
\bmath{J}_p = \bmath{A}_p+\bmath{B}_p\bmath{F}.
\end{equation}
However, the Jones formalism breaks down when the system is composed of multiple components. For example, with 2
components, we may naively attempt to apply Jones calculus, and describe the voltage transmission matrix of the
system as a product of the components' voltage transmission matrices:
\begin{equation}
\bmath{J}_p = (\bmath{A}_{2p}+\bmath{B}_{2p}\bmath{F})(\bmath{A}_{1p}+\bmath{B}_{1p}\bmath{F}),
\end{equation}
i.e.
\begin{equation}
\bmath{v}_p = (
\bmath{A}_{2p}\bmath{A}_{1p} + \bmath{B}_{2p}\bmath{F}\bmath{A}_{1p} +
\bmath{A}_{2p}\bmath{B}_{1p}\bmath{F} + \bmath{B}_{2p}\bmath{F}\bmath{B}_{1p}\bmath{F})\bmath{e}.
\label{eq:jones-naive}
\end{equation}
This, however, completely neglects the current transmission properties. Applying the 2N-port transmission matrix
formalism, we can see the difference:
\begin{equation}
\begin{pmatrix}\bmath{v}_p\\ \bmath{i}_p\end{pmatrix} =
\begin{pmatrix}\bmath{A}_{2p} & \bmath{B}_{2p}\\ \bmath{C}_{2p} & \bmath{D}_{2p} \end{pmatrix}
\begin{pmatrix}\bmath{A}_{1p} & \bmath{B}_{1p}\\ \bmath{C}_{1p} & \bmath{D}_{1p} \end{pmatrix}
\begin{pmatrix}\bmath{e}\\ \bmath{F}\bmath{e}\end{pmatrix},
\end{equation}
from which we can derive an expression for the voltage vector:
\begin{equation}
\bmath{v}_p =
(\bmath{A}_{2p}\bmath{A}_{1p} + \bmath{B}_{2p}\bmath{C}_{1p} +
\bmath{A}_{2p}\bmath{B}_{1p}\bmath{F} + \bmath{B}_{2p}\bmath{D}_{1p}\bmath{F})\bmath{e},
\label{eq:jones-2N}
\end{equation}
which differs from Eq.~\ref{eq:jones-naive} in the second and fourth term of the sum, since in general
\begin{equation}
\bmath{F}\bmath{A}_{1p} \neq \bmath{C}_{1p},~~\bmath{F}\bmath{B}_{1p} \neq \bmath{D}_{1p}.
\label{eq:jones-difference}
\end{equation}
To summarize, because Jones calculus operates on voltages alone, and ignores impedance matching,
we cannot use it to accurately represent the voltage response of an analogue system by a product of the voltage
responses of its individual components. The difference is summarized in Eqs.~\ref{eq:jones-naive}--\ref{eq:jones-difference};
a practical example is given in Sect.~\ref{sub:Modelling-real-analogue}.
By contrast, the 2N-port transmission matrix formalism does allow us to break down the overall system response
into a product of the component responses, by taking both voltages and currents into account.
Since we have now shown the $2\times2$ form of the RIME to be insufficient, the obvious question arises, why have
we been getting away with using it? Historically, practical applications of the RIME have tended to follow
the formulation of \citet{JEN:note185}, rolling the electronic response of the overall system (as well as
tropospheric phase, etc.) into a single `$G$-Jones' term that is solved for during calibration. Under these
circumstances, the $2\times2$ formalism is perfectly adequate -- it is only when we attempt to model the individual
components of the analogue receiver chain that its limitations are exposed. On the other hand, \citet{Carozzi:2009hf}
have highlighted the limitations of Jones calculus in the wide-field polarization regime.
\section{Tensor formalisms of the RIME}
Up until now, we have presented our $6\times6$ RIME using matrix
notation. We now briefly discuss how the work presented here is closely
related to the tensor formalism presented in \citet{Smirnov:2011d}.
As is discussed in \citet{Smirnov:2011d}, the classical Jones formulation
of the RIME is in the vector space $\mathbb{C}^{2}$. The formulation
proposed by \citet{Carozzi:2009hf} is instead in $\mathbb{C}^{3}$.
In contrast, our Eq.\ \ref{eq:RIME-basic} can be considered to work
in $\mathbb{C}^{6}$; that is, our EMF vector has 6 components:
\begin{eqnarray}
e^{i} & = & \sum_{j=1}^{3}e_{j}x^{j}+\sum_{k=1}^{3}h_{j}x^{j}\equiv\sum_{j=1}^{6}e_{j}x^{j}.
\end{eqnarray}
The coherency of two voltages is once again defined via the outer
product $e^{i}\bar{e}_{j}$, quite remarkably giving a (1,1)-type
tensor expression identical Eq.\ 9 of \citet{Smirnov:2011d}:
\begin{equation}
\mbox{\emph{V}}_{qj}^{pi}=\mbox{\emph{J}}_{\alpha}^{pi}\mbox{\emph{B}}_{\beta}^{\alpha}\bar{\mbox{\emph{J}}}_{qj}^{\beta}.
\end{equation}
In the next section, we show an alternative RIME based upon a (2,0)-type tensor commonly encountered in special relativity.
\subsection{A relativistic RIME}
Another potential reformulation of the RIME involves the electromagnetic tensor of special relativity. The classical RIME formulation
implicitly assumes that both antennas measure the EMF in the same inertial reference frame. If this is not the
case (consider, e.g., space VLBI), then we must in principle account for the fact that the observed EMF is
altered when moving from one reference frame to another, and in particular, that the $\bmath{e}$ and $\bmath{h}$
components intermix. In special relativity, this can be elegantly formulated in terms of the \emph{electromagnetic
field tensor}, which represents the 6 independent components of the EMF by a (2-0)-type tensor:
\begin{equation}
F^{\alpha\beta}=\left(\begin{array}{cccc}
0 & -e_{x}/c & -e_{y}/c & -e_{z}/c\\
e_{x}/c & 0 & -h_{z} & h_{y}\\
e_{y}/c & h_{z} & 0 & -h_{x}\\
e_{z}/c & -h_{y} & h_{x} & 0
\end{array}\right),
\label{eq:F-general}
\end{equation}
The advantage of this formulation is that the EMF tensor follows standard coordinate transform laws of special
relativity. That is, for a different inertial reference frame given by the Lorentz transformation tensor
$\Lambda^\alpha_{\alpha'}$, the EMF tensor transforms as:
\begin{equation}
F'^{\alpha\beta} = \Lambda^\alpha_{\alpha'} \Lambda^\beta_{\beta'} F^{\alpha'\beta'}.
\end{equation}
The measured 2-point coherency between $[F_p]$ and $[F_q]$ can be formally defined as the average of the
outer product:
\begin{equation}
[V_{pq}]^{\alpha\beta\gamma\delta} = 2 c^2 \langle [F_p]^{\alpha\beta}[F^H_q]^{\gamma\delta} \rangle,
\end{equation}
where $\cdot^H$ represents the conjugate tensor, i.e. $[F^H]^{\gamma\delta}=\bar{F}^{\delta\gamma}$. A factor of 2
is introduced for the same reasons as in \citet{Smirnov:2011a}, and the reason for $c^2$ will be apparent below.
Note that the indices in the brackets should be treated as labels, while those outside the brackets are proper tensor indices.
Let us now pick a reference frame for the signal (`frame zero'), and designate the EMF tensor in that frame
by $[F_0]$, or $[F_0(\bmath{\bar{x}})]$ to emphasize that this is a function of the four-position
$\bmath{\bar{x}}=(ct,\bmath{x})$. The $[F_0(\bmath{\bar{x}})]$ field follows Maxwell's equations; in the case
of a monochromatic plane wave propagating along direction $\bmath{\bar{z}}=(1,\bmath{z})$, this has a particularly simple solution
of
\begin{equation}
[F_0(\bmath{\bar{x}})] = [F_0(\bmath{\bar{x}}_0)]e^{-2\pi i \lambda^{-1} (\bmath{\bar{x}}-\bmath{\bar{x}}_0)\cdot \bmath{\bar{z}}}.
\label{eq:F0-x}
\end{equation}
Let us now consider two antennas located at $\bmath{p}$ and $\bmath{q}$. The 2-point coherency measured in frame zero
becomes
\begin{eqnarray}
[V_{pq}]^{\alpha\beta\gamma\delta} & = &
2 c^2 \langle [F_0(\bmath{\bar{p}})]^{\alpha\beta}[F^H(\bmath{\bar{q}})]^{\gamma\delta} \rangle \nonumber\\
&=& K_p \left [ 2 c^2 \langle [F_0(\bmath{\bar{x}}_0)]^{\alpha\beta}[F^H(\bmath{\bar{x}}_0)]^{\gamma\delta} \rangle \right ] K^H_q,
\end{eqnarray}
where $K_p$ is the complex exponent of Eq.~\ref{eq:F0-x}, and is the direct equivalent of the $K$-Jones
term of the RIME \citep{Smirnov:2011a}. The quantity in the square brackets is the equivalent of the
brightness matrix $\mathsf{B}$, which we'll call the \emph{brightness tensor}:
\begin{equation}
B^{\alpha\beta\gamma\delta} = [B_0]^{\alpha\beta\gamma\delta} = 2c^2 \langle [F_0(\bmath{\bar{x}}_0)]^{\alpha\beta}[F_0^H(\bmath{\bar{x}}_0)]^{\gamma\delta} \rangle.
\end{equation}
Each element of the brightness tensor gives the coherency between two components of the EMF observed in the chosen
reference frame (`frame zero'). Nominally, the brightness tensor has $4^4=256$ components, but only 36 are unique
and non-zero (given the 6 components of the EMF). \citet{Carozzi:2006bj} show that the brightness tensor may be
decomposed into a set of antisymmetric second rank tensors (`sesquilinear-quadratic tensor concomitants')
that are irreducible under Lorentz transformations. The physical interpretation of the 36 unique quantities within
the brightness matrix is discussed in \citet{Bergman:2008p7859}, with regards to the aforementioned irreducible tensorial set.
While the brightness tensor has redundancy not present in the tensorial set of \citet{Carozzi:2006bj}, we shall continue to use it
as a basis to our relativistic RIME for clarity of analogy to the brightness coherency matrix of Eq.~\ref{eq:RIME-basic}, and as
it leads to a relativistic RIME for which we can define transformation tensors analogous to Jones matrices. The redundancy
can be described by a number of symmetry properties of the brightness tensor: it is
(a) \emph{Hermitian} with respect to swapping the first and second pair of indices:
\begin{equation}
B^{\alpha\beta\gamma\delta} = \bar{B}^{\gamma\delta\alpha\beta},
\label{eq:BT-herm}
\end{equation}
(b) \emph{antisymmetric} within each index pair (since the EMF tensor itself is antisymmetric, i.e. $F^{\alpha\beta}=-F^{\beta\alpha}$):
\begin{equation}
B^{\alpha\beta\gamma\delta} = -B^{\beta\alpha\gamma\delta} = -B^{\alpha\beta\delta\gamma}
\label{eq:BT-antisym}
\end{equation}
To see the direct analogy to the brightness matrix, consider again the case of the monochromatic plane wave propagating
along $\bmath{z}$ (Eq.~\ref{eq:MPWZ}). The EMF tensor then takes a particularly simple form:
\begin{equation}
\label{eq:F-planewave}
F^{\alpha\beta} = \frac{1}{c} \left(\begin{array}{cccc}
0 & -e_{x} & -e_{y} & 0 \\
e_{x} & 0 & 0 & e_x \\
e_{y} & 0 & 0 & e_y \\
0 & -e_{x} & -e_y & 0
\end{array}\right),
\end{equation}
and the brightness tensor has only $8^2=64$ non-zero components, with the additional `0-3' symmetry property:
\begin{eqnarray}
B^{0\beta\gamma\delta} = B^{3\beta\gamma\delta} && B^{\alpha0\gamma\delta} = B^{\alpha3\gamma\delta} \nonumber\\
B^{\alpha\beta0\delta} = B^{\alpha\beta3\delta} && B^{\alpha\beta\gamma0} = B^{\alpha\beta\gamma3}
\label{eq:BT-sym}
\end{eqnarray}
Four unique components can be defined in terms of the Stokes parameters:
\begin{eqnarray}
B^{0110} = I+Q && B^{0220} = I-Q \nonumber\\
B^{0120} = U+iV && B^{0210} = U-iV
\label{eq:BT-IQUV}
\end{eqnarray}
and conversely,
\begin{eqnarray}
I = \frac{B^{0110}+B^{0220}}{2} && Q = \frac{B^{0110}-B^{0220}}{2}, \nonumber\\
U = \frac{B^{0120}+B^{0210}}{2} && V = \frac{B^{0120}-B^{0210}}{2i}.
\label{eq:IQUV-BT}
\end{eqnarray}
The other non-zero components of the brightness tensor can be derived using the Hermitian, antisymmetry and
0-3 symmetry properties. Finally, for an unpolarized plane wave, only 32 components of the brightness tensor
are non-zero and equal to $\pm I$. As these will be useful in further calculations, they are summarized in Table~\ref{tab:BT}.
\begin{table}
\begin{centering}
\begin{tabular}{c|rrrrrrrr}
& 01 & 02 & 10 & 13 & 20 & 23 & 31 & 32 \\
\hline
01 & $-I$ & & $I$ & $I$ & & & $-I$ & \\
02 & & $-I$ & & & $I$ & $I$ & & $-I$ \\
10 & $I$ & & $-I$ & $-I$ & & & $I$ & \\
13 & $I$ & & $-I$ & $-I$ & & & $I$ & \\
20 & & $I$ & & & $-I$ & $-I$ & & $I$ \\
23 & & $I$ & & & $-I$ & $-I$ & & $I$ \\
31 & $-I$ & & $I$ & $I$ & & & $-I$ & \\
32 & & $-I$ & & & $I$ & $I$ & & $-I$ \\
\end{tabular}
\end{centering}
\caption{The non-zero components of the brightness tensor $B^{\alpha\beta\gamma\delta}$ for an unpolarized
plane wave. Rows correspond to $\alpha\beta$, columns to $\gamma\delta$.}
\label{tab:BT}
\end{table}
So far this has been nothing more than a recasting of the RIME using EMF tensors. Consider, however, the case where antennas $p$ and $q$ measure the signal in different inertial reference frames. The EMF tensor observed by