Operational Amplifier Design Optimization [Settling Time and Power Dissipation]

Published by

on

Common Design

Design Constraints Table

Figure 1: Illustrated Proposed Amplifier Design

The goal of this project is to design a fully-differential high-speed high-precision amplifier with the following design specifications shown in the table to the right. 

Keeping this in mind, the project also specifics two different designs that satisfy different objectives.

Design 1: Minimize Large-Signal Settling Time to 0.5% accuracy, Power Budget of 20mW

Design 2: Minimize power dissipation, 0.5% large-signal settling time of 40ns

As a first round pass, our team decided to start with a 5 transistor operational transconductance amplifier and build off of whatever deficits exist in the design. 

In order to properly bias the amplifier, it’s necessary to know the relevant values of the figures of merit that we must design to. In a closed-loop negative feedback system we know that VoutVin = H(s)1+G(s)H(s), where H(s) is the open loop transfer function and G(s) is feedback transfer function . Referencing the illustrated representation of a finalized differential amplifier design in figure 1, we can draw out symbolic expressions for open loop gain (A0) and closed-loop gain, and come out with the following expression:

With a gain error of less than 1%, we know that [ (1+Z2/Z1)(1/A0) ] should not exceed 0.01, and with the closed loop gain requirement, we know that (Z2 / Z1) = 8, meaning that A0 is greater than or equal to 900.

Design Exploration

5 Transistor Operational Transconductance Amplifier 1st Stage Design

The absolute maximum power dissipation of both designs is 20mW, with VDD = 1.8V that leaves ~11mA of supply current total for the amplifier design. As a first-pass conservative design, we go for a current mirror reference of 500uA for NMOS current mirror biasing, and 500uA for PMOS current mirror biasing; leaving 5mA maximum for each branch of the differential pair. Additionally, a differential swing requirement of 1.8V implies a single-ended swing requirement of 0.9V. Given the single-ended swing requirement, we are left with 0.9V that must be distributed among the transistors in the current path; leading us to aim for a drain-source voltage of 300mV per transistor (300mV for current source load, amplifying transistor, and tail current source)

Figure 2: Transistor Sizing Method

With a requirement for drain-source voltage, the following circuits (Figure 2) were built in cadence in order to determine the required widths for the transistors to support the required current (we will employ a conservative 1mA for differential legs and 2mA for the tail current source in this design). 

In order to size the transistors according to the method shown in figure 2, we start with a desired current (5mA for differential leg transistors and 10mA for the tail transistor). The resistor value is chosen such that R*Ibias=Vth, guaranteeing saturation because the gate voltage will be one threshold above the drain. Transistor sizing is done by increasing the width such that the drain-source voltage is equivalent to our desired value (300mV per transistor in this design example).

The design procedure described above yields the following result (Figure 3):

              Figure 3: 5 Transistor OTA Gain

It’s clear to see that the 5-transistor OTA that we designed is  nowhere near our desired open loop gain of 900 that we need in order to meet our gain error criteria. In this case it would be beneficial to switch the topology to that of a telescopic cascode, to improve the differential gain at the cost of differential output swing.

Design Exploration


Figure 4: Telescopic Cascode 1st Stage Differential Amplifier

In order to increase the amplifier gain from the previously designed 5 Transistor OTA, the approach of using a telescopic cascode was explored. The Telescopic Cascode has a gain defined by: |Av| = gm_n(gm_n*ro_n2 || gm_p*ro_p2) which is considerably higher that the previous gain of |Av| = gm_n(ro_n || ro_p) realized by the 5 Transistor OTA. In addition to the higher intrinsic gain of the amplifier topology, additional gain can also be realized by dropping the bias currents as well. The major drawback of the telescopic cascode is its lack of output swing so we based this design on the assumption that we will require a second stage to meet our design requirements.

With an open-loop gain requirement of 900, we will aim for 100 in the first stage (telescopic cascode) and 9 in the second stage. A gain of 9 in the second stage allows us to restrain our single-ended swing in the first stage to 100mV. The sum of 5 drain-source voltages in a single leg of the first stage must now add up to 1.8V – 100mV = 1.7V. In this new topology we distribute the headroom among the transistors in the following manner: 400mV per PMOS and 300mV per NMOS. In this design case we also design for a conservative 2mA tail current, meaning a bias current of 1mA per leg.

In order to properly bias this topology, M7/M8 were biased with a standard PMOS current mirror, M5/M6 were biased with a PMOS variation of a low-voltage cascode mirror, M3/M4 were biased with a NMOS variation of a low-voltage cascode mirror, and M9 was biased with a standard NMOS current mirror

Upon simulating the results of the telescopic cascode stage, we began to notice signs of instability. In the DC operating point of the Telescopic Cascode Design, we can observe that due to mismatches, the bias currents in each differential leg are not exactly equal to Iss/2, which forces the output common-mode level to rail up to VDD or down to ground. In order to combat this phenomenon, we look to employ a form of common-mode feedback for compensation. The issue of common mode feedback stems from differences in the currents from the PMOS and NMOS cascode sections of each telescopic cascode branch. Acting as ideal sources, the cascode sections try to impose a certain current through the branch, but process variations and temperature changes cause them to fight against each other, saturating the output common mode level high or low. To fix this issue, we employ common-mode feedback in the described manner (Figure 5):

Figure 5: Telescopic Cascode with CMFB

The common-mode feedback in this topology (Figure 5) works by having M9 and M10 sense the output common mode level through their gate terminals. Increases/decreases that would alter the output common-mode level are negated by increases/decreases in the tail current. This feedback topology allows common-mode sensing without loading the high impedances at the output nodes that allow for a high gain. 

      Figure 6: Open Loop Gain of Telescopic Cascode with CMFB

Figure 6 shows that our initial design was successful in realizing a large open loop gain; and with a desired two-stage open loop gain of 900, we decrease our expectations of the 2nd stage gain to only ~2.2 which is easily achieved with a 5 transistor OTA as shown in figure 3. The final device sizes for our designs can be found in the table below. 

*C1/C2 DC Blocking Caps , **C9/C10 and **C5/C6 are 10% parasitic caps

Device Initial TC designSettling Optimized Power Optimized
M1/2 120um/0.3um120um/0.3um120um/0.3um
M3/M4120um/0.3um120um/0.3um120um/0.3um
M5/M6180um/0.3um180um/0.3um180um/0.3um
M6/M7180um/0.3um180um/0.3um180um/0.3um
M8/M9180um/0.3um180um/0.3um180um/0.3um
M9/10120um/0.3um120um/0.3um120um/0.3um
M11/M12N/A300um/0.3um90um/0.3um
M13/M14N/A50um/0.3um15um/0.3um
M15N/A3um/0.3um3um/0.3um
M16N/A1um/0.3um1um/0.3um
M17N/A1um/0.3um1um/0.3um
M18N/A1um/0.3um1um/0.3um
M19N/A1um/0.3um1um/0.3um
M20N/A1um/0.3um1um/0.3um
M21N/A1um/0.3um1um/0.3um
M22N/A8um/0.3um8um/0.3um
M23N/A1um/0.3um1um/0.3um
M24N/A1um/0.3um1um/0.3um
M25N/A1um/0.3um1um/0.3um
M26N/A1um/0.3um1um/0.3um
I_refN/A90uA50 uA
C1/C2*N/A1 F1 F
C3/C4N/A5 pF2.4 pF
C5/C6**N/A0.5 pF0.24 pF
C7/C8N/A625 fF300 fF
C9/C10**N/A62.5 fF30 fF
C11/C12 (Load)N/A2 pF2 pF
C13/C14 (M-Comp)N/A1 pF2 pF
R1/R2N/A400 KOhms50M
R3/R4N/A600 KOhms60M
R5/R6 (M-Comp)N/A600 Ohms250 Ohm
R7/R8N/A1 KOhms1 KOhms

Currents

Single Branch Bias CurrentSettling OptimizedPower Optimized
Stage 11.536 mA802 uA
Stage 2679.6 uA424 uA
Total Amplifier4.43 mA2.452 mA
Total (including biases)4.859 mA2.85 mA
Total Power8.74 mW5.13 mW

Design Exploration

Settling Time Optimized Closed Loop Design

Figure 7: Two Stage Amplifier Design for Settling Time Optimization

With the first stage design finished, efforts switched to the second amplifier stage as well as the closed loop design in order to reach the other design requirements. 

The outcome of the first stage telescopic cascode design was a gain of roughly ~400, meaning that the 2nd stage design can be a 5 transistor OTA with a gain of roughly ~2.2.

Figure 8: Open Loop Response of Two Stage Design (Settling Time Optimized)

Referencing Figure 8, by the end of the design process, we were left with an open loop DC gain of 69.74dB (roughly a gain magnitude of 3000) which is way higher than the initial minimum constraint of 900 open-loop gain that was set in order to achieve a 1% gain error. 

In the case of settling time optimization, the main parameter to tune would be the open loop bandwidth. In building out the final design, it was easy to see that changes in bias current, feedback capacitor size, and device sizing could yield increases in the bandwidth of the amplifier’s closed-loop response. Tweaking these parameters also led to instability in the closed-loop output that would manifest in the transient response as oscillations, and in the open-loop frequency response as negative phase margin. In order to combat the instability, we used miller compensation to change the location of the dominant pole.

The size of our miller capacitance and resistance would result in a stable response, but would also affect the bandwidth of the amplifier, a metric that we wished to maximize in the settling-time optimized design. After some rounds of parameter tweaking, the following closed response was obtained.

Figure 9: Settling Time Optimized Design Closed Loop Frequency Response (Top) and Transient Step Response (Bottom)

In Figure 9 we can notice that the closed loop gain is at ~8dB and the -3dB bandwidth is in the 10’s of megahertz. The closed loop gain of this design meets the requirement, and the impact of the wide bandwidth is encapsulated in Figure 9 where we observe the step response of the amplifier. To test step response, we configured the system with a differential input step of 112.5mV (the single-ended swing requirement of 900mV / 8) and we expect a 900mV differential output. In Figure 9, the steady state value of the signal is at 894.85mV which is well within the gain error margin of 1% ( 891mV < Output < 909mV), and the signal is able to reach 0.5% large signal settling time accuracy within 5.85ns; showing that we meet all requirements of the design whilst also maintaining a fast settling time. 

It’s always important to understand the tradeoffs that we make in our designs, especially in this case where we optimize for speed. In order to maximize bandwidth and achieve a fast settling time, Figure 8 will show that a big cost of this approach is stability. Increasing the gain bandwidth allows for a higher chance that the phase response will cross 180 degrees before the amplifier’s unity gain frequency. During the design process, we were able to see this outcome clearly as the faster we made this amplifier, the more oscillations we saw at the output, leading to the final design specification having a 3 degree phase margin. 

In addition, tweaking parameters like bias currents, the miller compensation components, and the size of the feedback capacitors led to biasing issues that would show up in dc operating points of the amplifier transistors, forcing us to go back and renegotiate transistor sizes to support these changes. 

In the end, there is definitely room for improvement in this design, the final power consumption was 8.75mW, well within range of the maximum limit of 20mW for this design. Speed gains can be further realized by increasing the current consumption of the 2nd stage and compensating for the stability issues as they arise. A byproduct of this approach would be the significantly larger device widths that would add capacitance to the circuit and reduce the “return on investment” if pushed too far.

Design Exploration

Power Dissipation Optimized Closed Loop Design

The second portion of the design aims to minimize power while constraining settling time to 40ns. Essentially, when we reduce the power (current) running through each branch, we can expect to see our settling time increase. 

Since we want a total closed loop gain of 8, with a maximum error of 1%, our total open loop gain needs to be at least 900. We achieved a gain of roughly 400 from our first stage, therefore we only require a minimum gain of 2.25 for the second stage. The 5 OTA designed from our initial exploration phase indicates that this is easily achievable. As we’ll see later on, we indeed achieve a total open loop gain of over 2000. 

Figure 10: Open Loop Response (Top) and Closed Loop Frequency Response (Bottom) of 2 Stage Amplifier – Power Optimized

After the first iteration of this design, the amplifier showed extreme instability, with a large spike in closed-loop frequency response where the gain and phase crossover frequencies coincide. Additionally, during transient analysis (step response), the output voltage showed extreme and continuous oscillations. This was easily addressed using “miller’s compensation” by adding R5, R6, C13, and C14 between the output nodes of stage one and two. With additional tweaking to the size of these components, we achieve a more stable response with a comfortable phase margin just over 20 degrees, as shown in Figure 10. 

Also shown in Figure 10, the final closed-loop gain is within 1% of 8. Furthermore, Figure 11 shows the final transient response, with an input step of 112.5 mV (⅛ of 900mV), the final settling voltage is about 894 mV (within 1% of 900) and thus hits the requires settling voltage (99.5% → 890.4mV) in about 15 ns. 

Figure 11: Transient Step Response (Top) and Zoomed View (Bottom)

Overall, this design has the same configuration as shown in Figure 7, as well as the same bias network. The key difference being the sizing of devices, see Table 1. The devices in this design typically have smaller dimensions for overall less power consumption. Furthermore, Table 2 shows the branch currents plus bias currents, totalling 2.85 mA, giving this design a total power consumption of 5.15mW. With a final settling time of about 15ns, this design could have afforded to trade off more speed for smaller devices to achieve even less power consumption but still falls within the original target specifications. 

Leave a comment

Previous Post