Low Level RF
Part 3: Implementation

8. Design Example: Linac4
   - Presentation of Linac4
   - Cavity Controller Functionalities
   - Architecture
   - Modeling
   - Hardware
   - A few Key Building Blocks
   - PCBs
   - Installation

CAS RF
P. Baudrenghien  CERN-BE-RF
Linac4

- 160 MeV H⁻ Pulsed Linac
- PSB injector
- 40 mA average current (nominal)
- Normal Conducting cavities, 352.2 MHz operation
- ~ 1 MW/cavity (except DTL2 and DTL3 ~ 2 MW/cavity) with 40 mA
- 2 Hz rep rate
- Commissioning in 2013. Injection in PSB in 2015
- Proposed LLRF is much inspired from SNS

20-30% klystron power margin for regulation
For each cavity:

- A Tuner Loop to keep the structure on resonance
- An RF Feedback, and an Adaptive Feedforward (AFF) to keep the accelerating voltage at the desired value in the presence of beam transient
- A Klystron Polar Loop to compensate the variation of klystron gain and phase shift caused by High Voltage (HV) supply fluctuations and droop
- A Conditioning System monitoring the Cavity Vacuum while feeding the Line with Frequency Modulated bursts of RF power of increasing amplitude
- A Klystron Drive Limiter that prevents from driving the klystron over the saturation limit during loop transients.
Architecture
LLRF Block Diagram

- **Main Coupler**
  - Vacuum
  - IC rev
  - From Tuner Loop

- **Conditioning Loop**
  - DDS
  - AM Chopper

- **Klystron Polar Loop**
  - Gain & Phase
  - Limiter to prevent from overdriving the klystron

- **TUNER LOOP**
  - Tuner Control
  - Digital IQ pair
  - Analog baseband
  - RF at 352 MHz

- **Linac Module Servo Controller**
  - Simplified Block Diagram
  - Signals:
    - Digital
    - Analog baseband
  - Technology:
    - DSP
    - CPLD or FPGA
    - Analog RF

**Version:** 20100524
Klystron Loop

- **Klystron HV pulse**
  - Flat-top 1.2 ms
  - Precision 1%
  - Ripple 0.1%

- **LEP klystron:**
  - 1% HV -> 8.4 degree phase shift @ RF

- **Linac4 Klystron loop:**
  - Expect < 50 μs risetime

---

3 MeV Test Stand. RF Off. Pulsing the P.S: 100 kV/23 A at 2 Hz rate. Zoom at flat-top: Ch1 – Cathode voltage (2kV/div); Ch2 – Cathode current (0.8A/div); Ch3 – Anode voltage to ground (1 kV/div).

- **Improved PS understudy**
  - Reduced ripples…but at higher frequency
  - Simulations must tell whether it is desirable.

- **HV ripple hopefully reduced by AFF if synchronized with rep rate**

LHC: Klystron loop phase response, 20 μs risetime
RF feedback

- **Loop Delay:** 1100 ns
  - 160 m Cable/Waveguide = 550 ns
  - Klystron 250 ns
  - Circulator 50 ns
  - Driver 50 ns
  - LLRF 200 ns

- **With Fdbk Closed we expect**
  - ~150 KHz single-sided Loop bandwidth
  - <10 μs response time

![Graph showing cavity impedance with and without feedback](image)

**PIM:** Cavity impedance without feedback, QL=7000 (red) and with PI feedback (blue). P gain ~3.

**Closed Loop response with PI RF feedback:** Same as above. Single-sided -3 dB BW ~ 150 kHz

**LHC:** 100 kV in 3 μs, 2 μs per div, T=650 ns
Linac4 reaction should be two times slower.
Pulsing

- The beam pulse is 400 μs long but it can be later extended to 1.2 ms
- We have 300 μs for LLRF loops stabilization. We will use a sequence as at SNS
  1. Filling of the cavity open-loop but with feedforward from the result of the previous pulses. Cavity filling time < 20 μs (Q_L<20k)
  2. Switch the RF feedback ON. Ramp fdbk gain. Allow ~ 100μs for stabilization.
  3. Switch the AFF ON and the Chopping Compensation ON
  4. Beam ON
- 2 Hz rep rate
Step 1: Is the klystron power sufficient for the intended voltage/phase manipulations (set point)?

Step 2: Coarse adjustment of the loop parameters (Feedback, Klystron Polar Loop, AFF) to keep the ensemble “well behaved”

Step 3: Introduce perturbations: klystron noise, beam loading, source current ripples. Derive expected field stability. Optimization of the algorithms. Modifications to other parts of the machines: Power Converter, source, beam current, voltage distribution among the structures (beam dynamics)…

If interested, contact Anirban Krishna Battacharyya, present at the course
Hardware
Digital I/Q Demodulator

- In a feedback system with high loop gain, the quality of the measurement front-end sets the quality of the ensemble: In the frequency band where the loop gain is high, the LLRF will adjust the klystron drive so that the Measured Cavity Field is equal to the set-point. Measurement error will be directly reflected in the cavity field.

- In large Synchrotrons and Linacs, the RF frequency is too high to allow for direct sampling of the cavity field and waveguide coupler signals. But the needed BW is typically very small as the signals come from narrow-band elements such as cavities or klystrons.

- A typical front-end will therefore contain a Mixer that shifts the RF signal to an IF frequency that can be conveniently sampled by an ADC.

- Amplitude/phase loops can be used in low current machines. In a high beam loading configuration they become unstable and (I,Q) implementations are preferred.
We consider a narrowband signal $V(t)$ (cavity field for example) at the RF frequency. It can be represented either in polar $(A, \phi)$, or cartesian $(I,Q)$ notations.

$$V(t) = A(t) \sin(2\pi f_{RF} t + \phi(t)) = I(t) \sin(2\pi f_{RF} t) + Q(t) \cos(2\pi f_{RF} t)$$

Amplitude $A(t)$, phase $\phi(t)$ and cartesian coordinates $(I(t), Q(t))$ are slowly varying signals. Their BW is much smaller than the RF frequency. They are related by

$$I(t) = A(t) \cos \phi(t)$$
$$Q(t) = A(t) \sin \phi(t)$$
The RF signal is shifted to the IF frequency band using a mixer

\[ V_{RF}(t) = A(t)\sin(2\pi f_{RF} t + \phi(t)) \]
\[ V_{IF}(t) = A(t)\sin(2\pi f_{IF} t + \phi(t)) \]

The sampling frequency is chosen to be four times the IF. Let \( x_n \) be the sampled signal, with \( n \) the time index, we get

\[ x_n = A(nT_{CK})\sin(2\pi n f_{IF} T_{CK} + \phi(nT_{CK})) \]

\[ x_n = A_n \sin\left(\frac{n\pi}{2} + \phi_n\right) \]

\[ x_n = (-1)^{\frac{n}{2}} A_n \sin(\phi_n) = (-1)^{\frac{n}{2}} Q_n \quad \text{for } n \text{ even} \]
\[ x_n = (-1)^{\frac{n+1}{2}} A_n \cos(\phi_n) = (-1)^{\frac{n+1}{2}} I_n \quad \text{for } n \text{ odd} \]
The demodulator generates the data \{Q_0, -l_1, -Q_2, l_3, Q_4, \ldots\} from which we derive, after sign correction and decimation, the data flow \{(I_n, Q_n)\} at the rate \(f_{CK}/2\).

Note that the I and Q are measured at alternate samples. The decimation introduces an error that increases with frequency. The I/Q demodulator is therefore narrow-band.

For Linac4 we use:

\[
\begin{align*}
  f_{RF} &= 352.2 \text{ MHz} \\
  f_{LO} &= 330.1875 \text{ MHz} \\
  f_{IF} &= 22.0125 \text{ MHz} \\
  f_{CK} &= 88.05 \text{ MHz}
\end{align*}
\]

and we get an (I,Q) pair at 44.025 MHz rate that we interpolate to do the Signal Processing at the full 88.05 MHz FPGA clock.

Except for its narrow BW, the I/Q demodulator has many advantages: The effect of ADC offset can be nulled. The ADC gain is not critical as I and Q are affected in the same way. For slow applications (tuning for example), the very large oversampling leads to extremely good SNR after decimation.

It was first proposed for PEP2 [Ziomek]. It is now used most everywhere…

Much more on I/Q Demodulator in *CAS Digital Signal Processing, June 2007, T. Schilcher, RF Applications in digital signal processing*.

[Ziomek] C. Ziomek, P. Corredoura, Digital I/Q Demodulator, PAC95
Cavity Loops Module

Implements:
RF Feedback,
Adaptive FF and
Klystron Polar Loop

System Independent of LO phase

IQ Demodulators synchronized by Fc (uses same demux signals as Ref-line demodulator).

De-serialized Reference-line RF Phase used to normalize the phase of the RF Measurements.

Courtesy of John Molendijk

System Independent of LO phase

$f_{RF} = 352.2$ MHz
$f_{LO} = 330.1875$ MHz
$f_{IF} = 22.0125$ MHz
$f_{CK} = 88.05$ MHz

IQ Mod* $Iout = Iin \cdot \cos(x)$, $[1, 0, -1, 0]$
$Qout = Qin \cdot \sin(x)$, $[0, 1, 0, -1]$
A few Key Building Blocks

For more info on this subject, contact:

John.Molendijk@cern.ch

or

Gregoire Hagman and Jose Noirjean, present at this course
FPGA design and Visual Elite

- **Signal Processing in FPGA**
  - Fixed Point Arithmetic
  - Caution: scaling to avoid overflow but preserve SNR
  - Use embedded fixed point multipliers
  - Much “inspiration” from *U. Meyer-Baese, Digital signal Processing with Field Programmable Gate arrays, Springer Verlag, 2001*

- **Design with Visual Elite. Advantages:**
  - Generates Device independent portable VHDL code.
  - Code reusability
  - Graphical Interface. Accept State-Diagram, Truth Tables, Processing Blocks (Adder, Logics, Multiplier, Memory) or VHDL code
  - Hierarchical
  - Graphics to text, text to graphics conversion
  - Powerful auto-documenting creates html
Digital I/Q Demodulator

Visual Elite
Graphical tool for FPGA design/simulation

Courtesy of John Molendijk
Phase Rotator

\[ i_{\text{out}} = i_{\text{in}} \cdot \cos \varphi - q_{\text{in}} \cdot \sin \varphi \]
\[ q_{\text{out}} = q_{\text{in}} \cdot \cos \varphi + i_{\text{in}} \cdot \sin \varphi \]

Bit Pn(31) dropped since \( X"C000FFFF" \leq Pn \leq X"3FFF0001" \)

Rotator Gain = 1
\[ 0x8001 \leq \cos(\varphi) \leq 0x7fff \]
\[ 0x8001 \leq \sin(\varphi) \leq 0x7fff \]
FS = 0x7fff

Used to adjust Open Loop phase

 Courtesy of John Molendijk
Decimating filters

- The signals are sampled at 88.05 MHz. But in some application (tuning) the closed loop response is a few seconds only (mechanical part)
- **Oversampling is good for Signal to Noise Ratio (SNR):** The quantization noise of the ADC is (assumed to be) white noise extending from DC to the Sampling Frequency, and with a Standard Deviation of 1-2 Least Significant bits.
- After filtering, the noise power is reduced by the ratio filter BW/sampling frequency. **Potentially a very significant improvement**…
- But brute-force implementation of a few Hz LPF FIR clocked at 88.05 MHz will ask for a lot of resources (taps -> multipliers)
- The classic solution is **Decimation:** We Low-Pass filter the signal and reduce the sampling rate at the same time.
- Good fixed-point architectures are the Cascaded Integrator Comb (CIC) filter and the Half-Band filters
2 Stage CIC16


Used as "not too demanding" decimating filter.

Courtesy of John Molendijk
Cross Product as Phase Discriminator

- \( \mathbf{u} \times \mathbf{v} = u_x v_y - u_y v_x \)
- \( |\mathbf{u} \times \mathbf{v}| = |\mathbf{u}| \times |\mathbf{v}| \sin \theta = \varepsilon \ \text{IcFwd} \)
- \( \mathbf{u} = \mathbf{Vacc}, \mathbf{v} = \text{IcFwd} \)

Used to measure phase shift \( V_{cav-IcFwd} \) for Tuning

Cross Product for 2D vectors \( \mathbf{U}(x,y) \) & \( \mathbf{V}(x,y) \)

Courtesy of John Molendijk
Limiting Adder

Very important: elegant saturation in case of adder output overflow

In case both A and B have a similar sign but the Sum output is of the other sign the output is set to be either MaxInt or MinInt depending on the input signal sign.

Courtesy of John Molendijk
Coordinate Rotation Digital Computer (CORDIC)

Imagine computing an Arctan[x] in fixed-point arithmetics….and with any desired precision...
No problem with the CORDIC….iterative algorithm…you set the number of digits...

- Iterative hardware algorithm for the computation of trigonometric functions
- Can also be used to compute square roots and divisions
- Uses shifts and adds only -> well suited for fixed-point FPGA implementation
- Pipe-lined implementation possible for higher throughput
- Proposed in late 1950s by J.Volder
- (To my knowledge) first use in an accelerator LLRF: RHIC (BNL) Beam Phase loop (M. Brennan, J. Delong, early 2000s)
Consider a rotation by angle $\phi$

\[
x' = x \cos \phi - y \sin \phi = \cos \phi [x - y \tan \phi]
\]

\[
y' = x \sin \phi + y \cos \phi = \cos \phi [y + x \tan \phi]
\]

If the rotation angle is restricted to $\pm \arctan[2^{-i}]$ the multiplication by the tangent term is reduced to a simple shift operation.

Arbitrary rotation can be achieved by a series of rotations of amplitudes $\arctan[1], \arctan[2^{-1}], \arctan[2^{-2}], \ldots$.

Let $d_i$ be the decision whether to rotate in the positive or negative direction at iteration $i$ ($d_i = +1$) then the $i^{th}$ iteration can be written

\[
x_{i+1} = K_i \left[ x_i - y_i \, d_i \, 2^{-i} \right]
\]

\[
y_{i+1} = K_i \left[ y_i + x_i \, d_i \, 2^{-i} \right]
\]

\[
K_i = \cos \left[ \arctan \left( 2^{-i} \right) \right] = \frac{1}{\sqrt{1 + 2^{-2i}}}
\]

with $d_i = \pm 1$

Let $z_i$ be the rotation left to be done before iteration $i$ we get

\[
d_i = 1 \quad \text{if} \quad z_i \geq 0
\]

\[
d_i = -1 \quad \text{otherwise}
\]

\[
z_{i+1} = z_i - d_i \, \arctan[2^{-i}]
\]
The Rotation Form of CORDIC

Example: rotation by 60.35 deg

<table>
<thead>
<tr>
<th>i</th>
<th>Z (deg)</th>
<th>K</th>
<th>d</th>
<th>Arctan[2^i] (deg)</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>60.35</td>
<td>0.707</td>
<td>1</td>
<td>45</td>
</tr>
<tr>
<td>1</td>
<td>15.35</td>
<td>0.894</td>
<td>1</td>
<td>26.56</td>
</tr>
<tr>
<td>2</td>
<td>-11.21</td>
<td>0.970</td>
<td>-1</td>
<td>14.03</td>
</tr>
<tr>
<td>3</td>
<td>2.82</td>
<td>0.992</td>
<td>1</td>
<td>7.12</td>
</tr>
<tr>
<td>4</td>
<td>-4.30</td>
<td>0.998</td>
<td>-1</td>
<td>3.57</td>
</tr>
<tr>
<td>5</td>
<td>-0.73</td>
<td>0.999</td>
<td>-1</td>
<td>1.79</td>
</tr>
<tr>
<td>6</td>
<td>1.06</td>
<td>0.999</td>
<td>1</td>
<td>0.89</td>
</tr>
<tr>
<td>7</td>
<td>0.17</td>
<td>1</td>
<td>1</td>
<td>0.45</td>
</tr>
</tbody>
</table>

Load the rotation angle in \( z_0 \)
Iterate on \( i \):
\[
d_i = 1 \quad \text{if} \quad z_i \geq 0
\]
\[
d_i = -1 \quad \text{otherwise}
\]
\[
x_{i+1} = K_i \left[ x_i - y_i \cdot d_i \cdot 2^{-i} \right]
\]
\[
y_{i+1} = K_i \left[ y_i + x_i \cdot d_i \cdot 2^{-i} \right]
\]
\[
z_{i+1} = z_i - d_i \cdot \arctan(2^{-i})
\]
\[
K_i = \cos \left[ \arctan(2^{-i}) \right] = \frac{1}{\sqrt{1 + 2^{-2i}}}
\]
untill \( z_n = 0 \) or \( n \geq \text{max

The values of \( \arctan[2^i] \) must be stored in a table
The Rotation Form can be used to compute Cosine and Sine
The Vectorial Form of the CORDIC starts with a vector \((x,y)\) and compute its angle with respect to the x-axis.

Both forms are used in the Klystron Polar Loop
The Rotation Form can be used to compute Cosine and Sine

The Vectorial Form of the CORDIC starts with a vector \((x, y)\) and keeps on applying the series of decreasing amplitude rotations until the resulting vector is on the x-axis \((y_n = 0)\). It is used to compute magnitude and angle of a vector.

Both forms are used in the Klystron Polar Loop

Rotation form with the original vector unit-length on the x-axis. Result gives \((\cos \phi, \sin \phi)\)

Vectorial form: the original vector is rotated until \(y_n = 0\). The sum of the rotation angles gives the original \(\phi\) (inverted)
PCBs
Tuner module: RF front-end

8 channels is the maximum among Linac4 structures. It matches the DTL2-3 that have 3 antennas and 2 windows (2 fwd and 2 ref) plus one ref line.

LO distri
RF demodulators (8 channels)
IF out (8 channels, differential)
Tuner module: Processing card

Logging memory
Post-Mortem memory
Tiger Shark DSP. Slow Post-processing (floating-point)
FPGA Fast Pre-Processing (fixed-point arithmetic), controls of the memories and VME interface

If interested, contact John Molendijk or Jose Noirjean
Installation
Custom-designed VME crates used in the LHC and Linac4 (same crates)
VXI is also a popular standard (PEP2, SNS)
Thank you for your attention.

If you have liked it, join the LLRF club...Come to the next LLRF workshop in Autumn 2011

LLRF02
LLRF05 Cern
LLRF07 Knoxville
LLRF09 Tsukuba
LLRF11