Low-Power Two-Phase Clock Generator Architectures for CMOS Circuits
Introduction
Two-phase clock generators produce two non-overlapping clock signals (φ1, φ2) used widely in dynamic logic, charge-coupled devices, and clocked storage elements. Low-power design is critical for battery-powered and energy-constrained CMOS systems. This article explains low-power objectives, common architectures, key design techniques, and practical trade-offs for implementing efficient two-phase clock generators.
Low-power design objectives
- Minimize dynamic switching: Reduce capacitor charging/discharging energy and switching frequency where possible.
- Reduce leakage: Use transistor sizing, stacking, and power-gating to limit subthreshold and gate leakage.
- Maintain timing integrity: Ensure non-overlap, duty-cycle control, and low jitter while conserving energy.
- Area and complexity: Favor compact implementations with few analog components when integration cost matters.
Basic architecture categories
- Ring-oscillator-based generators
- PLL/DLL-based solutions
- Delay-line/edge-shaping circuits
- Charge-pump / bootstrapped switches for low-loss transmission
1. Ring-oscillator-based generators
- Structure: Odd-numbered inverter chain (ring oscillator) followed by logic to derive two phases.
- Strengths: Simple, fully digital, easy to integrate, low area.
- Low-power techniques:
- Use fewer stages and low-supply-voltage design to reduce switching energy.
- Implement current-starving or digitally-controlled delay elements to slow oscillation when full speed isn’t needed.
- Power-gate sections during idle periods.
- Trade-offs: Higher phase noise/jitter versus PLLs; frequency control coarse without additional control loops.
2. PLL/DLL-based solutions
- Structure: Phase-locked loop (PLL) or delay-locked loop (DLL) generates precise frequency and phase; two phases derived from quadrature or controlled divider/phase interpolator.
- Strengths: Tight frequency/phase control, low jitter, stable duty cycle.
- Low-power techniques:
- Use DLL instead of PLL for lower power when only phase alignment is required.
- Use low-power phase detectors (e.g., bang-bang or sampled-data detectors) and simplified loop filters.
- Employ duty-cycled or adaptive biasing for VCO/VCDL to reduce static current when not locked or at lower speeds.
- Use coarse/fine control: coarse digital divider for large adjustments, fine voltage control only when necessary.
- Trade-offs: Higher complexity and area; analog components may dominate power if not carefully optimized.
3. Delay-line and edge-shaping circuits
- Structure: A single high-quality clock edge is split and passed through matched delay cells; complementary phases formed and non-overlap enforced via logic gates.
- Strengths: Moderate complexity, good control of non-overlap, avoids continuous oscillator power draw.
- Low-power techniques:
- Use sparse or event-driven delay tuning: enable delay elements only when switching or during calibration.
- Implement gate-level logic with minimum drive strength sufficient for downstream loads.
- Exploit local resonant or adiabatic techniques for very low-energy charge transfer (specialized).
- Trade-offs: Delay element accuracy and matching affect phase timing; calibration may be needed over PVT.
4. Charge-pump and bootstrapped switch techniques
- Purpose: Reduce energy loss during switch transitions, critical when using transmission gates or pass transistors for clock distribution.
- Techniques:
- Bootstrapped transmission gates hold near-constant gate-source voltage to reduce on-resistance and switching energy.
- Charge-pump circuits recycle charge between phases instead of dissipating it to rails.
- Implement adiabatic switching where feasible to approach reversible logic energy savings.
- Trade-offs: Added circuit complexity and control; may require careful sizing and layout to avoid reliability issues.
Key practical design techniques
- Clock gating and adaptive frequency scaling: Gate clocks to inactive regions and lower frequency or voltage when workload permits.
- Non-overlap tuning: Use digitally adjustable delays or feedback to set minimum necessary non-overlap, minimizing dead time while preventing short-circuit currents.
- Duty-cycle and phase balancing: Use symmetric layout, matched loadings, and balanced routing to preserve amplitude and timing.
- Low-voltage operation: Design transistors and logic to operate at reduced VDD; consider body-biasing to control thresholds.
- Transistor sizing: Optimize for capacitive loading — smaller devices reduce dynamic power but increase delay; size drivers for minimum energy per transition.
- Calibration and PVT compensation: Digital calibration loops or background tuning to maintain timing across process, voltage, temperature variations.
- Spatial clock distribution: Local generation of two-phase clocks near the load reduces distribution capacitance and energy.
Design examples (concise)
- Low-power ring-based: 3-stage ring with current-starved inverters + edge-extractor producing φ1/φ2 with programmable delay elements. Use power gating when idle.
- DLL-based generator: Reference clock -> VCDL with digitally controlled taps -> phase-select logic produces φ1/φ2; VCDL bias is adaptive based on required frequency.
- Charge-recycle splitter: Single clock drives capacitor network that redistributes charge to form complementary phases, reducing net energy drawn from supply.
Trade-offs summary
- Simplicity vs. precision: Ring oscillators are simple/compact but noisy; PLL/DLL are precise but cost more power/area.
- Continuous vs. event-driven power: Continuous oscillators consume standby power; event-driven or on-demand generators save energy when idle.
- Non-overlap length: Longer non-overlap prevents short circuits but increases inactive time for dynamic nodes, potentially increasing refresh energy.
Layout and implementation tips
- Place two-phase generator close to the load to reduce routing capacitance.
- Match rising/falling path parasitics to maintain phase symmetry.
- Shield sensitive analog blocks and use separate well ties where needed to minimize substrate noise.
- Simulate across corners and include Monte Carlo for mismatch sensitivity of non-overlap and phase jitter.
Conclusion
Selecting a low-power two-phase clock generator requires balancing precision, area, and energy. For ultra-low-power or bursty workloads, event-driven ring or delay-based generators with aggressive clock gating and charge-recycle techniques excel. For applications demanding tight timing and low jitter, a carefully optimized DLL with adaptive biasing is preferable. Combining techniques—local generation, charge recycling, and calibration—typically yields the best energy efficiency for CMOS two-phase clock generation.
Leave a Reply