Upload
others
View
5
Download
0
Embed Size (px)
Citation preview
Arend Huysman
for frequency synthesisIntegrated Circuit design of a Mixed Analog Digital PLL
Academic year 2013-2014Faculty of Engineering and ArchitectureChairman: Prof. dr. ir. Jan Van CampenhoutDepartment of Electronics and Information Systems
Master of Science in Electrical EngineeringMaster's dissertation submitted in order to obtain the academic degree of
Counsellor: Dr. ir. Johan RamanSupervisor: Prof. dr. ir. Pieter Rombouts
Arend Huysman
for frequency synthesisIntegrated Circuit design of a Mixed Analog Digital PLL
Academic year 2013-2014Faculty of Engineering and ArchitectureChairman: Prof. dr. ir. Jan Van CampenhoutDepartment of Electronics and Information Systems
Master of Science in Electrical EngineeringMaster's dissertation submitted in order to obtain the academic degree of
Counsellor: Dr. ir. Johan RamanSupervisor: Prof. dr. ir. Pieter Rombouts
i
Preamble
Today’s society relies more than ever on mobile high speed communications and wireless devices.
Everyone expects having a 4G internet connection in the most deserted places or use high
performance smartphones that can run the most demanding apps. Furthermore people expect
these devices to be as small as possible and use as little power as possible. While technology
evolves, designers of these devices are faced with many conflicting specifications. Constant
research into new architectures and new ideas is an absolute must to keep up with the rising
demands of the world around us. This thesis hopes to give a small contribution is this quest for
high speed communications, or at least formulate some ideas and analyses that might one day
help developing the future.
I would like to thank Pieter Rombouts, my promoter,who proposed this subject and was himself
also always willing to help. Furthermore this thesis would not have been possible without the
help of Johan Raman, who was always willing to answer any question or discuss new ideas, I
would like to thank him for his help during the year. Also the help of Amir and Dries, who
are part the team, and partook in some discussions is acknowledged. Furthermore I should also
thank friends and family, who o↵ered all the necessary support during this year.
Arend Huysman, juni 2014
iii
Permission for usage
“The author gives permission to make this master dissertation available for consultation and
to copy parts of this master dissertation for personal use. In the case of any other use, the
limitations of the copyright have to be respected, in particular with regard to the obligation to
state expressly the source when quoting results from this master dissertation.”
Arend Huysman, juni 2014
Integrated Circuit design
of a Mixed Analog Digital PLL
for frequency synthesis.
by
Arend Huysman
Master’s dissertation submitted in order to obtain the academic degree of
Master of Science in Electrical Engineering
Academic year 2013-2014
Supervisor: Prof. dr. ir. Pieter Rombouts
Counsellor: Dr. ir. Johan Raman
Faculty of Engineering and Architecture
Ghent university
Department of Electronics and Information Systems
Chariman: Prof. Dr. Ir. Jan Van Campenhout
Summary
This work tries to investigate the possibilities of using a ring oscillator in an ADPLL. By analysisof multirate phase noise, the output noise spectrum can be shaped as desired by the designerby changing the properties of digital filters. Furthermore a new TDC architecture is discussed,integrated fully within the ring oscillator. The proposed design is compared to theoretical phasenoise.
Trefwoorden
ADPLL, TDC, DCO, frequency-synthesis
Integrated Circuit design of a Mixed
Analog Digital PLL for frequency
synthesis
Arend Huysman
Supervisor: Pieter RomboutsCounsellor: Johan Raman
Abstract– This article presents an ALL
Digital Phase Locked Loop, used as fre-
quency synthesiser for frequency modula-
tion in the bluetooth range. Both a the-
oretical analysis of the loop behaviour as
an actual chip design of the analog inten-
sive parts are discussed. New design pos-
sibilities combining a ring oscillator and
TDC are explored. Spectre simulations
confirm the analysed loop behaviour.
I. Introduction
MOST high performance PLL’s today usea classical charge pump architecture. Fre-
quency synthesis is performed by dividing theoutput frequency by a digital counter. Thephase detector compares the in and outputclocks and generates a phase difference usingthe low pass charge pump (figure 1). Fractionalfrequency division is performed by dithering thedivision factor at high speed with a ⌃� modu-lator in a structure called an N-fractional PLL(figure 2). This architecture becomes harderto implement in modern CMOS technologies.This is because the voltage headroom decreases,meaning lower dynamic range for the analogsignals. Furthermore on modern day chips, alarge part is digital logic which can inject switch-ing noise into the analog parts. The sometimesvery large capacitance that are required for thecharge pump also contribute to the fact thatnew architectures are explored. A fairly recentarchitecture is the ADPLL. It will become in-creasingly interesting, since the shift towards
more digital integration will only grow in thefuture.
D Q
D Q
A
B
1
1rst
QA
QB
Figure 1: Phase detector with charge pump
PD Loop Filter VCO
Divider
Frequency control
f
out
(e.g. 2.4GHz)f
in
(50MHz)
Figure 2: Fractional-N PLL structure
II. ADPLL overview
The basic ADPLL architecture is presented infigure 3. A few important differences can im-mediately be noted. The input is a frequencycontrol word FCW, this is an exact digital word,consisting out of a defined number of bits. Ithas the same function as the division factor Nof the fractional PLL, with the difference thatit can also be a fractional number. The inputclock is not a direct input, but all digital blocksuse this reference frequency as clock. For exam-ple the FCW is accumulated at the referenceclock rate to determine the input phase. Phaseis normalised to the output oscillator frequency,
1
meaning a phase increase of 1 equals one extraclock transition. The output phase has to besampled at the reference clock transitions. It isthan substracted from the input phase. Phasedetection in the digital domain is just a subtrac-tion. A benefit of the ADPLL is that all phaseinformation is directly represented by a digitalword. Making filtering very straightforward toimplement.Analysis of loop dynamics will alsobe performed in the digital domain. Only theDCO and some parts of the phase detector arevery analog intensive and have to be designedmanually. The digital control part can be de-signed with a hardware description languageand synthesised automatically. Significantly re-ducing the design time.
⌃ Loop filterFCW �
in
��
�
out
f
n+ -
clk
ref
clk
ref
Analog time blocks
clk
ref
DCO PD
Figure 3: Basic ADPLL structure in the phase do-main
I. DCO
The digital control logic of the ADPLL gener-ates an oscillator tuning word that is also digital.The oscillator that is used in most ADPLL loopsis a DCO, or digitally controlled oscillator.Its fre-quency is controlled by switching on and off dis-crete elements using the bits of the tuning word.In most literature about ADPLL’s for frequencysynthesis in communications, the used oscillatoris a LC-oscillator. Which gives supreme resultsin terms of phase noise. However in this work adesign based on a ring oscillator (figure 4) willbe used. The tuning happens by changing thecurrent trough the ring.
V
DD
TuningDCW
Figure 4: Structure of the used ring oscillator withcurrent tuning block
Tuning is done by dynamically switching offcurrent sources (figure 5), directly using the dig-ital word. A larger current results in a fasterfrequency of the oscillator
�I
tune
I
min
2Imin
4Imin
tune2 tune1 tune0
I0
Figure 5: Switched current sources
II. TDC
The phase of the output oscillator has to beestimated at the sampling instances of the ref-erence clock. This is done by calculating howmuch the phase has changed since the last ris-ing edge. This can be done with a fractionalcounter that counts the integer phase increase.To increase the resolution a TDC is used. Thisis to convert the time between the rising edgesof the two clocks. Normally an array of delayelements is used for this purpose. Because aring oscillator already consists out of such anarray. An attempt will be made to integratethis ring oscillator directly into the TDC. Thiswill allow the use of a short delay line, with stillreasonable resolution.
III. Results
A circuit of both the DCO and the TDC is de-signed in 65 nm CMOS. The digital control logic
2
is modelled in verilog. Together they are sim-ulated in a mixed signal analysis. The phasenoise in the output spectrum is extracted andcompared with the theoretically expected phasenoise. Experiments are done both with andwithout DCO phase noise. Results are shown infigure 6 and 7. The spectrum is quite differentin both cases, DCO phase noise is dominant andstill too large for communications applications.
Figure 6: Output phase noise PSD, without DCOnoise
Figure 7: Output phase noise PSD, with DCOnoise
IV. Conclusion
A design was demonstrated implementing a ringoscillator DCO and a novel TDC implementa-tion. Phase noise analysis corresponds with thetheoretical model. The DCO itself is not suit-able for communication applications, but otherapplication such as clock generation might be
possible. Extra design effort should be put in alow phase noise ring oscillator
References
[1] Robert Bogdan Staszwski and Poras T. Bal-sara, All-Digital frequency synthesizer indeep-submicron CMOS Wiley, New Jersey,1st edition, 2006.
3
CONTENTS ix
Contents
Preamble i
Permission for usage iii
Summary v
Extended abstract vi
Used Abbreviations xiii
1 Introduction 1
1.1 Fractional-N PLL for frequency synthesis . . . . . . . . . . . . . . . . . . . . . . 2
1.2 The ADPLL structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2.1 Digital Controlled Oscillator . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.2.2 Time to digital converter . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
x CONTENTS
2 ADPLL loop analysis 7
2.1 Closed loop response . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.2 Noise behaviour in the loop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.2.1 Clock jitter and phase noise . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.2.2 TDC noise . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.2.3 Tuning resolution phase noise . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.2.4 DCO phase noise . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.2.5 Noise contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.3 Multirate ADPLL analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.3.1 Multirate block diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.3.2 Multirate transfer funtions . . . . . . . . . . . . . . . . . . . . . . . . . . 23
3 Design of a DCO 29
3.1 Digital controlled oscillator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
3.2 LC oscillators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
3.3 Ring oscillator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.3.1 Basic principle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.3.2 Di↵erential ring oscillators . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
3.3.3 Design choices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.4 Tuning of the ring frequency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
3.4.1 Concept of Banks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
3.4.2 Circuit implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
3.4.3 Designing the switches . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
3.5 Output bu↵ering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
3.6 Simulation verifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
3.7 DCO phase noise . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
CONTENTS xi
4 TDC implementation in a ring oscillator 49
4.1 Basic TDC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
4.2 Vernier delay line . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
4.3 Implementation in a ring oscillator . . . . . . . . . . . . . . . . . . . . . . . . . . 52
4.3.1 Coarse fractional phase estimation . . . . . . . . . . . . . . . . . . . . . . 52
4.3.2 Fine fractional phase estimation . . . . . . . . . . . . . . . . . . . . . . . 54
5 Digital implementation 61
5.1 Clock retiming . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
5.2 Phase acquisition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.3 Phase error . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
5.4 Switching state . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
6 Experimental results and conclusion 67
6.1 PLL tracking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
6.2 Phase noise . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
6.2.1 Proportional filter ↵=0.5 . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
6.2.2 Proportional filter ↵=1/256 . . . . . . . . . . . . . . . . . . . . . . . . . . 70
6.2.3 Type II loop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
6.2.4 DCO noise . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
6.3 Conclusions and further work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
Used Abbreviations
ADPLL All-Digital PLL
PLL Phased Locked Loop
DSP Digital Signal Processor
FCW Frequency Control Word
DCO Digital Controlled Oscillator
TDC Time to Digital Converter
PD Phase Detector
PFD Phase Frequency Detector
PWM Pulse Width Modulation
PSD Power Spectral Density
STF Signal Transfer Function
NTF Noise Transfer Function
DAC Digital to Analog Converter
DCW Digital Control Word
PHI Integer Phase word
ZOH Zero Hold Filter
SNR Signal Noise Ratio
PVT Process Voltage Temperature
NTW Normalised Tuning Word
OTW Oscillator Tuning Word
1
Chapter 1
Introduction
The PLL or Phased Locked Loop, is a well known electronic circuit that is since long used
in e.g. frequency synthesis. It is found in most of today’s integrated communication systems.
A frequency synthesiser generates a waveform with an accurately controllable stable output
frequency. While CMOS-technologies get smaller and smaller, an important issue to face is the
integration of digital processing power with the analog front-end on the same chip. This can lead
to di↵erent problems on chip, since fast switching digital electronics inject so called switching
noise in the analog parts of the system. Furthermore, while CMOS scales down the switching
becomes faster and faster, but the analog properties of these transistor get worse, they behave
less and less like standard transistor models describe. Furthermore, supply voltage is being
scaled down, which leaves less headroom for analog signals and deteriorates the SNR. Therefore,
nowadays there is a trend towards more digital based design, replacing as much of the analog
building blocks as possible by digital counterparts.
The classical PLL however is an analog intensive structure, often depending on large capacitances
used in the filters. This has resulted in the development of an All Digital Phased Locked Loop.
The basic concept is to represent all information by digital control words, which can be processed
by digital logic. For example, the analog intensive loop filters found in the classical PLL can be
replaced by digital operations on the control words. A very important contribution to ADPLL
research was performed by a team lead by Staszewski at Texas Instruments [1]. His book can
be considered as an important standard work within this domain. There are however still some
very analog dependant elements in the ADPLL, most notably the oscillator itself. In many
cases an LC-oscillator is used for reasons of better phase noise performance. This is however
2 CHAPTER 1. INTRODUCTION
less integrable in recent CMOS technologies, especially because of the inductor. In this thesis
the possibility will be investigated to use a ring oscillator, which lies closer to a digital system.
This chapter will first give a short overview of the classical PLL used in modern day frequency
synthesisers and will introduce the new structure of the ADPLL. Next chapters will first perform
a full analysis of the ADPLL in the digital domain and look at di↵erent noise sources that are
introduced in this loop. The design of an oscillator and phase detector will be discussed on circuit
level. The ADPLL will then be designed in a combination of both circuit level blocks for analog
parts and behavioural verilog code for the digital signal properties. Finally a time domain phase
noise analysis will be performed to compare the simulation results with the expected analytical
results.
1.1 Fractional-N PLL for frequency synthesis
An example of a frequency synthesiser for communication purposes is presented in figure 1.1.
The digital processor generates a baseband signal based on digital information to be transmitted
over the channel. This baseband signal modulates the carrier via frequency modulation ( or more
complex modulation techniques are possible). For example, bluetooth or GSM use a structure
like this. Because they are wireless application power consumption is an important factor. In
this work, the ADPLL will not be designed for a specific type of communication, but bluetooth
will be used as a reference to make certain design choices such as frequency bands and to compare
the resulted phase noise with the bluetooth specification. Table 1.1 lists some of the bluetooth
specifications.
Table 1.1: Bluetooth specifications
Band 2.402 - 2.480GHz
Channel spacing 1MHz
Accuracy 75 kHz
Rate 1600 hops/s
Phase noise @ 550 kHz -180 dBc/Hz
Phase noise @ 2MHz -101 dBc/Hz
Phase noise @ 3 kHz -111 dBc/Hz
1.1 Fractional-N PLL for frequency synthesis 3
DSPFCW
fref = 50 kHz
fdco = FCW.fref : around 2.4GHzFrequencySynthesizer
Figure 1.1: Basic principle of a frequency synthesiser
PD Loop Filter VCO
Divider
Frequency control
fout (e.g. 2.4GHz)fin(50MHz)
Figure 1.2: N-fractional PLL for frequency synthesis
An often used structure is the so called N-fractional PLL. It is a loop based on a PLL, equipped
with a frequency divider to generate a higher output frequency than the input. Its general
structure is shown in figure 1.2. A normal PLL would try to generate an output frequency
equal to the input frequency. By dividing the output frequency by an integer number N (this
can be easily performed by a digital counter), the loop will force the signals at the phase
detector to have the same frequency, meaning the output frequency will be N times higher. The
PLL is already partially digital since square waves are used and the phase detector is partially
implemented with digital logic.
A standard phase detector is the so called phase frequency detector, or PFD followed by a charge
pump, shown in figure 1.3. It is here, in this phase detector that the two digital clocks, A and
B are converted to an analog voltage representing the phase di↵erence. This conversion of the
PWM signals QA and QB, using the charge pump is getting problematic in modern CMOS
technologies.
Furthermore, dividing frequency with a counter can only be done with an integer factor. This
means the output resolution can only be an integer times the input frequency, resulting in a
frequency resolution of tens of MHz’s. Of course, for communication applications this is not
su�cient. Resolution in the kHz range at least is required. To solve this problem the factor N
4 CHAPTER 1. INTRODUCTION
D Q
D Q
A
B
1
1
rst
QA
QB
Figure 1.3: PSD and charge pump used in many modern PLL’s
is dithered with a high speed clock, higher than the reference signal. The frequency division
is quickly alternated between N1 and N1 + 1. The mean value of N than represents the actual
output frequency. However this inserts spurs into the system because of the periodicity of the
alternation. Higher order ⌃� modulators are used to spread out the noise over a wider frequency
band. Fractional-N PLL’s will not be discussed further in this thesis, since it will focus on the
design of an ADPLL for frequency synthesis.
1.2 The ADPLL structure
While a standard PLL is often analysed in the phase domain, there is no physical signal that
directly represents phase. The phase detector only performs an action on two oscillating signals
that generates a voltage proportional to the phase (after filtering). The basic idea of an All
Digital PLL is to represent both the input reference phase as the output oscillator phase by
an actual digital word. This is then sampled at certain fixed time instances determined by the
stable reference clock of the system. In the digital domain, it is easy to generate the phase by
accumulating the rising edges of the oscillator and just interpret the number of edges as the
phase. This phase is then sampled at the rising edge of the reference clock and compared to
a certain desired input phase. Since both phases are directly represented by a digital word,
calculating the phase di↵erence is just a digital sum, easily implemented and insensitive to
noise. This is one of the great strengths of an ADPLL, once signals are represented in the
digital domain, all subsequent operations can be modelled exactly, exploiting the full power of
1.2 The ADPLL structure 5
⌃ Loop filterFCW�in
���outfn
+ -
clkref clkref
Analog time blocks
clkref
DCO PD
Figure 1.4: General schematic layout of the ADPLL
the digital domain for a normally analog application. Figure 1.4 shows the schematic of the
ADPLL. When compared to a fractional-N PLL, there are some big di↵erences. First of all, the
input is no longer the reference clock. But it is a digital control word represented by a fixed
number of bits. The reference clock fref is not a direct input of the system, but all the digital
blocks are clocked with this signal. The FCW is accumulated at this rate, generating the input
phase. Since phase is in fact an integral (or accumulation) of frequency, phase is also represented
in the digital domain by a fixed number of bits. The same is done with the output frequency.
At the sampling instances of the reference clock a value of the output phase is determined in
the phase detector. This phase detector will consist out of a counter, accumulating the number
of rising edges of the output clock, giving the floor of the phase. There is also a fractional phase
part, by estimating the time between the two clock edges, an estimate of the output phase can
be made. Calculating the phase error is just a subtraction in the digital domain.
1.2.1 Digital Controlled Oscillator
An important building block in every PLL is the oscillator. While in a classical PLL a standard
voltage controlled oscillator is used, converting an analog input voltage to an output frequency,
for an ADPLL is it desirable to have an oscillator that varies its frequency based on a digital
input. Hence a DCO or digital controlled oscillator. A DCO is mostly based on the principle
that the bits of the oscillator tuning word (OTW) switch on or o↵ discrete components such
as capacitors or current sources. In the phase domain the DCO behaves as an accumulator,
accumulating the input frequency word to the output phase. An important work in this thesis
is the design of the DCO. Chapter 3 will discuss this design.
6 CHAPTER 1. INTRODUCTION
t
t
01234567 �out
Vdco
Vhigh
Vlow
Figure 1.5: Relationship between DCO output and output phase
t
fdco
fref
�out,frac(n)
n – 1 nt
Figure 1.6: Principle of fractional phase sampling at reference clock
1.2.2 Time to digital converter
Another crucial building block is responsible for the conversion of the time di↵erences between
the variable oscillator and the reference oscillator to a phase quantity that can be used to calcu-
late the phase error. Figure 1.5 shows the relationship between the time DCO output waveform
and the instantaneous output phase. The phase throughout this thesis will be normalised. This
means instead of considering an increase in phase of 2⇡ per clock period, this will be a phase
increase of 1. This simplifies the notations and is more intuitive in the digital domain, where
a counter indeed increases its value by 1 per clock transition. The integer part can indeed be
determined by a counter and figure 1.6 shows conceptually how the time di↵erence between two
clock transitions has to be converted to a fractional phase part. TDC design is discussed in
chapter 4. The novelty in this work will be that is attempted to integrate the TDC within the
oscillator itself, resulting in a shorter delay line and a faster phase detection.
7
Chapter 2
ADPLL loop analysis
To get a good idea on the dynamics of the ADPLL and especially the influence of di↵erent noise
and quantisation errors on the output, this chapter will start with a general analysis of the closed
loop ADPLL using idealised models for the di↵erent components. Since the ADPLL behaves as
a fully discrete system, digital transfer functions can be used to model every idealized building
block. As standard for every PLL analysis, signals are represented in the phase domain, with
the big advantage that in an ADPLL these signals represent physical digital words. The output
phase is first considered only at instances of the reference frequency fref . In later analysis it
will also be sampled at the variable DCO frequency fdco. Because the entire digital system is
clocked at the reference clock, it is easy and intuitive to only consider phase in the reference clock
domain. This represents a system where all signals are sub-sampled at the reference frequency.
This analysis however only contains spectral information in the reference band. The oscillator
itself will also add phase noise upto higher frequencies and various sub-sample e↵ects of the
output spectrum should be taken into account. This will be done in the last section where a full
multirate analysis will be presented.
2.1 Closed loop response
Figure 1.4 shows the general layout of a ADPLL composed out of its building blocks. All signals
are represented by a digital word except the interaction between the inherently analog DCO and
phase detector (PD). At this point the signal is a time varying clock pulse. The entire analysis
8 CHAPTER 2. ADPLL LOOP ANALYSIS
1
z – 1H(z)
1
z – 1FCW
�in �� �outfn+
-
Figure 2.1: ADPLL model in the phase domain
is done in the phase domain with a normalised DCO phase, as discussed in the previous chapter.
At every clock transition of the reference clock; the input phase is accumulated with the FCW
or frequency command word. This digital input determines the desired relationship between
reference signal and variable output signal since in steady state:
fdco = FCW · fref .
FCW can perfectly be a fractional number, an advantage of the ADPLL structure over a
fractional-N PLL, where the frequency division is in the feedback path can only be integer.
The FCW accumulator can be modelled as a digital integrator with transfer functionz
z – 1.
The loop filter is a custom chosen filter H(z) that outputs the normalised frequency fn or NTW.
This signal is applied to the normalised DCO, normalising the DCO means the output frequency
equals fn · fref . Assuming a perfect phase detector, the output phase at a reference clock transi-
tion equals the previous value added with the number of DCO clock transitions that happened
since the last reference pulse, including the fractional di↵erence. The entire cascade DCO-TDC
can simply be modelled in the digital phase domain as another digital integrator accumulating
the DCO frequency control fNTW, since phase is defined as the integral of the frequency.
�out(n) = �out(n – 1) + NTW(n – 1)
) HDCO =1
z – 1
(2.1)
The equivalent model is represented in figure 2.1. It is fully in the phase domain, sampled at
the reference clock. The actual behaviour of interest for the application of frequency modula-
tion is the transfer function of the input FCW to the output frequency fn. When writing the
accumulator of the frequency control word as G(z) , the DCO-transfer function as HDCO(z), the
z-transform of the FCW as Fi(z) and the z-transform of the normalised frequency fn as Fo(z),
the result is as follows:
Fo(z) =H(z)G(z)
1 + H(z)Hdco(z)Fi(z)
=zH(z)
z – 1 + H(z)Fi(z)
(2.2)
2.1 Closed loop response 9
0.0001 0.001 0.01 0.1 0.5−30
−25
−20
−15
−10
−5
0
f(fr ef)
Magnitude(d
B)
Figure 2.2: Loop magnitude response for ↵ = 0.1
The transfer function from input phase �in to output �out is very similar and will be more
important in following noise analysis, it is given by
HADPLL(z) =H(z)
z – 1 + H(z)(2.3)
When choosing a simple proportional loop filter of the form H(z) = ↵, the closed loop transfer
function becomes:
Fo(z) =↵z
z – 1 + ↵Fi(z) (2.4)
The transfer function (2.4) is a standard first order filter. It is stable for 0 < ↵ < 2 and has a
bandwidth fBW =1 – ↵
2⇡fref for fBW ⌧ fref/2. Figure 2.2 shows the magnitude response of the
ADPLL closed loop response for a value of ↵ = 0.1. In a classical PLL the phase detection adds
a high frequency component to the phase di↵erence that has to be filtered out by the (analog)
loop filter, meaning it is normally not possible to choose a proportional filter. Because of the
digital nature of the phase detection in an ADPLL, there are less stringent demands on the loop
filter and the bandwidth of the loop. Note that there will always be a certain uncertainty on
the filter coe�cient value because a normalised DCO is assumed, meaning the oscillator gain Kp
has to be known. Even with estimation techniques there will always be a certain uncertainty on
this non linear parameter.
10 CHAPTER 2. ADPLL LOOP ANALYSIS
2.2 Noise behaviour in the loop
In every realistic circuit, noise will be added at di↵erent points in the circuit. The great benefit of
digital electronics is of course that the signals can not be corrupted any further by noise, and all
calculations are exact. The resolution is only determined by the number of bits. Subsequently
all the noise sources in the ADPLL come from the analog domain or from the quantisation
of analog signals. A first major contribution of phase noise is the quantisation of the output
phase by the TDC. Figure 1.6 shows how at the reference sampling instances, the DCO output
phase contains a fractional part, determined by the time di↵erence between the rising edge of
the reference clock and the DCO. There will be a minimal amount of time di↵erence that can
be detected based on the architecture of the TDC. More details on the design of a TDC are
presented in chapter 4. For now it is enough to understand how much phase error a certain
minimum time resolution will cause, disused in 2.2.2.
A second source of noise is where the digital signal path is converted again to an analog signal,
the tuning of the DCO. Because of the nature of the oscillator, digitally controlled, its frequency
can only be tuned in discrete time steps, defined as �dco. As discussed further it is possible
to update the frequency tuning word NTW at a higher rate then fref , using a ⌃� modulator
resulting in a smaller e↵ective resolution. A similar concept to dithering in N-fractional PLL’s.
Finally the DCO itself is an oscillator, which is an analog circuit. As any oscillator it adds phase
noise to its output spectrum. Contrary to the other two sources, the DCO-phase noise is not
added in the fref clock domain. This is analog noise and should be sampled at fdco for a full
analysis. For a first analysis however, the noise can be sampled at fref and higher frequency
components can be ignored, section 2.3 will discuss an exacter phase noise analysis in the DCO
clock domain. Figure 2.3 shows a schematic block diagram of the ADPLL with the main noise
sources, all signals are sampled at the reference frequency.
2.2.1 Clock jitter and phase noise
The phase noise of the output spectrum is one of the most important characteristics of the
ADPLL. However small or low-power the ADPLL might be, if the output spectrum is too noisy
for the application, the game is over. It is the phase noise that will be used to compare di↵erent
2.2 Noise behaviour in the loop 11
H(z)1
z – 1
�in �� �outfn
+-
�TDC
�tune �dcos
Figure 2.3: The ADPLL in the phase domain including the main noise sources, all sampled at
the reference frequency
architecture and decide where the design of this thesis stands, and how using a ring oscillator
compares to the use of a commonly used LC-oscillator.
For communication applications the output spectrum of the oscillator is analysed to measure
the phase noise. In analysis of clock signals however, the amount of clock jitter is often used.
It can prove useful to describe a relationship between these two properties of oscillators. First
both properties will be defined. A single frequency carrier waveform is a sinusoidal periodic
function of phase
V(t) = VC cos(!t + �(t)). (2.5)
Here !t + �(t) is the total phase at instance t. For �(t) ⌘ 0, the waveform V(t) represents
a single frequency wave represented by a single Dirac in the frequency domain. However, the
phase will not be a deterministic signal in any realistic situation. Noise sources deform every
waveform which can be modelled as a certain uncertainty in the phase, modelled by the stochastic
signal �(t), the phase error. The power spectrum of V(t) will then not be a single Dirac, but the
power will be spread out over a continuous frequency range. The di↵erence in power spectrum is
represented in figure 2.4. This figure demonstrate the problem of phase noise for communication
applications. A strong oscillator will inject noise in nearby frequency bands, which could possibly
mask weaker oscillators. There is a direct relationship between the properties of the phase
error �(t) and the output power spectral density of the waveform, the measurable quantity by
spectrum analysers [2, p. 35,...]. When �(t) is relatively small (2.5) can be rewritten as
V(t) = VC cos(!0t) + VC�(t) sin(!0t). (2.6)
The spectrum now consists out of a Dirac at the carrier frequency added with the power spectrum
of �(t), upconverted to the carrier frequency due to multiplication in the time-domain. The
12 CHAPTER 2. ADPLL LOOP ANALYSIS
f
PSD
f0 f
PSD
f0
Figure 2.4: Comparison between the power spectra of a single frequency waveform. Both with
(right) and without (left) phase noise
power density is however still infinite at the carrier frequency in this approximation, in reality
this approximation is not valid for frequencies very close to the carrier, but it will be an adequate
relation between phase noise power and the PSD of the waveform for this thesis. The phase
noise power density L(�f) is expressed in dBc (dB of signal power/Hz relative to the carrier
power).
L(�f) =SV(f +�f)
CarrierPower(2.7)
SV(f +�f) represents the power of the signal at (f +�f) in a 1Hz bandwidth. Using this and
the relation between phase noise and the frequency spectrum, it can be shown that
S�(f) =Sv(f0 + f)
Pc.
With Pc the power of the carrier wave. This analysis was now done for a single frequency
waveform, but the result is equivalent for a random shaped periodic function. The frequency
spectrum will not consist out of one dirac, but the power of the carrier is divided over di↵erent
multiples of the base frequency. As long as the noise power is contained within a region ± f02,
di↵erent side bands of the PSD will not overlap.
The previous analysis is focussed on the frequency domain and corresponds with a continuous
time interpretation of the waveform. However in a digital system it is often more convenient to
quantize the entire system at the rising edges of the output square wave clock. In this light the
clock jitter is defined. It describes the process of time error between the expected rising edge of
an ideal clock and the actual clock transitions. Figure 2.5 shows this concept. The ideal clock
has a fixed period T0. Because of phase noise, the set of subsequent time intervals between clock
transitions ,{T1, T2, ..., Tn, ...} ,deviates from the expected period T0. The standard deviation
2.2 Noise behaviour in the loop 13
Ideal clock
jittered clock
��(n)
T0
T1 T2
Figure 2.5: Real clock transitions vs. ideal clock
of this timing error �t is defined as the clock-jitter. The value is directly related to phase noise.
Of course the clock jitter does not contain any spectral information. But, in case of an open
loop oscillator, it can be shown [3] that if the integrated white noise with power spectrum
N0
f2
is dominant, the clock jitter is given by
�2t =N0
f30. (2.8)
With f0 equal to the oscillation frequency.
One last note is about the fact that phase analysis in this thesis is done with normalised phase.
The output �out(t) represents normalised phase while in equation (2.6) however, the function
�(t) describes phase in the classical sense. To interpret the output spectrum of the phase as
oscillator phase noise, it should be multiplied with 2⇡ to compensate the normalisation.
2.2.2 TDC noise
Figure 1.6 shows the fractional part of the output phase at the reference clock transitions.
Because of the finite resolution of the TDC, a quantisation error is injected in the system here.
This minimal time step will be converted to a phase quantisation error. A full integer phase
14 CHAPTER 2. ADPLL LOOP ANALYSIS
step of 1 corresponds to a time interval TDCO, the period of the current DCO-waveform. If
subsequent quantisation errors are uncorrelated, the quantisation can be modelled as white
noise. All power equally spread from 0 to the nyquist frequency, with single-sided spectral
density equal to
L =�2�TDC
fref(2.9)
The TDC time resolution TTDCis divided by the DCO period TDCO to end up with the phase
quantisation error in the phase domain, expressed in normalised phase.
�� =�t
TDCO(2.10)
The phase noise is expected to be uniformly distributed over the interval [–��/2 ��/2]. The
standard deviation of this process if given by:
�2� =�2
�
12(2.11)
In this thesis, the phase quantisation of the TDC will be around1
40corresponding to a time
error of around 10 ps. This leads to a single sided power spectrum, when using a reference
frequency of 50MHz, of
L = 10 log10
(2⇡)2
�2�12fref
!= –110 dBc (2.12)
As noted in figure 2.3 this quantisation phase noise is added at the input of the system, the
transfer function to the output is equal to HADPLL(z). Meaning the e↵ect of TDC phase noise
on the output noise spectrum is
�out,TDC = LTDCHADPLL = L H(z)
z – 1 + H(z)(2.13)
The phase noise is constant and equal to LTDC up to the loop bandwidth. After which it starts
decreasing.
2.2.3 Tuning resolution phase noise
A second source of phase noise is the finite resolution of the oscillator. As figure 2.3 shows, this
noise is also added at the input of the oscillator but after the loop filter. This means the tuning
2.2 Noise behaviour in the loop 15
noise is directly related to TDC quantisation noise, as both can be converted to the input. One
can write
�in = �TDC +�tuneH(z)
(2.14)
In the case of the proportional filter this would mean that the total input noise equals
�TDC + �tune/↵.
Often, to increase the frequency resolution, the oscillator tuning signal is oversampled and
the oscillator is tuned at a higher clock-rate. This technique is similar to the tuning of the
fractional-N PLL. It will lower the quantisation noise but introduces fractional spurs. This
oversampling is often used in combination with ⌃�-modulation to spread out the phase noise
to higher frequencies, where they will be filtered out by the ADPLL low pass characteristics.
Since introducing oversampling means using another clock domain, the analysis will be done in
section 2.3, where the oversampling clock domain will also be included. For now tuning will
be assumed to happen in the reference clock domain without any sort of oversampling or ⌃�
modulation.
A practical tuning step for a DCO with oscillation frequency around 2.4GHz will be around
20 kHz in this thesis. In the case of a reference signal 50MHz the result is
�tune =fref2500
(2.15)
This can also be converted to phase noise finding
Ltune = 10 log10
(2⇡)2
�2�12fref
!= –155( dBc) (2.16)
2.2.4 DCO phase noise
The last source of phase noise it the noise added by the oscillator itself. Contrary to the
previous two noise sources this is not caused by the digital operation of quantisation but it is
a fully analog phenomenon caused by voltage noise in the active devices. While for di↵erent
types of oscillators, principles behind phase noise are very di↵erent, results are similar. In all
cases noise in the output phase spectrum is caused by the integration of noise sources in the
transistors. White noise is the most common noise source and causes a typical 20 dB per decade
slope in the phase noise spectrum. However for short channel transistors 1/f-noise is dominant
16 CHAPTER 2. ADPLL LOOP ANALYSIS
Noise floor
1f2
1f3
f
S�out
Figure 2.6: Di↵erent noise regions in oscillators
for low frequencies, causing a higher slope of 30 dB per decade in the output spectrum after
integration. Finally white noise sources at the output give rise to a minimal noise level on all
frequencies known as the noise floor. Figure 2.6 gives a typical output spectrum for an open
loop oscillator.
In this primary analysis however, only frequencies upto fref are considered, assuming the higher
frequency content will be neglectable. In a multirate analysis, frequencies upto fDCO will be
considered. However technically this is still a subsampling operation that will introduce spectral
folding since the oscillator phase noise is an analog property.However spectral content above the
oscillator frequency can be safely neglected, also since it is so far out of band.
A ring oscillator has typically higher phase noise levels than an LC-oscillator for the same power
consumption. Since this thesis uses a ring oscillator, it might be considered to use the full
freedom of the digital loop filter design to shape the DCO noise contribution in the output
spectrum. Since the DCO noise is added at the output, the transfer function is di↵erent from
the HADPLL transfer function, controlling the e↵ect of TDC and tuning noise.
Hnoise,DCO(z) =�out(z)
�DCO(z)=
z – 1
z – 1 + H(z)(2.17)
This results in a High pass transfer function, suppressing frequencies lower than the loop band-
width. Figure 2.7 shows this high pass characteristic of an ADPLL, for a proportional loop filter
H(z) = 0.1, corresponding to the input transfer function of figure 2.2.
The benefit of the ADPLl here is that there is more freedom in optimising the DCO noise. In
a classical PLL a small bandwidth is needed to suppress the high frequency components in the
phase detector. Now it is possible to design the loop for a higher bandwidth and suppress more
higher frequency phase noise. It might also be a good idea to not use a proportional loop filter
2.2 Noise behaviour in the loop 17
0.0001 0.001 0.01 0.1 0.5−45
−40
−35
−30
−25
−20
−15
−10
−5
0
5
f (fr ef)
Magnitude(d
B)
Figure 2.7: Transfer function for the DCO noise contribution to the output phase noise spectrum
H(z), but to add an integrating part. Since this results in a steeper slope than 40 dB per decade,
resulting in decreasing DCO phase noise for in band frequencies and complete suppression for
1/f noise. The integrated phase noise with its 20 dB per decade slope, is mostly the dominant
source of output phase noise and it will be the only one taken into account for DCO noise.
2.2.5 Noise contributions
Figure 2.8 shows the output noise PSD for di↵erent noise sources. The loop filter is designed
as a simple proportional filter with H(z) = 0.1. The resulting ADPLL transfer function has a
bandwidth of around 1MHz. The following realistic values were chosen for the di↵erent noise
sources
• LTDC(f) = –110 dBc
• Ltune = –155 dBc
• LDCO(f) =N0
f2N0 = 100
In open loop, the output noise of the oscillator has a value of –100 dBc at a 1MHz o↵set. This
would be a realistic noise level for a ring oscillator.
18 CHAPTER 2. ADPLL LOOP ANALYSIS
104
105
106
107
−200
−190
−180
−170
−160
−150
−140
−130
−120
−110
−100
−90
f (Hz)
Magnitude(d
Bc)
TDC noisetuning noiseDCO noiseTotal noise
Figure 2.8: Sampled output phase noise for di↵erent noise sources, proportional loop filter with
bandwidth ⇡ 1MHz( ↵=0.1 )
When optimizing the loop to limit the DCO noise, it is better to choose a higher loop bandwidth.
Furthermore the loop filter H(z) is given an integrating part resulting in the following transfer
function
H(z) = ↵+�
z – 1=
↵z + � – ↵
z – 1. (2.18)
This leads to a closed loop response, using (2.3), of
HADPLL(z) =↵z + (� – ↵)
(z – 1)2 + ↵z + � – ↵=
↵z + (� – ↵)
z2 + (↵ – 2)z + (1 + � – ↵). (2.19)
Which has the following zero and poles
z1 =↵ – �
↵
p1,2 =1
2
✓(2 – ↵)±
q↵2 + 2↵ – 4�
◆.
(2.20)
According to (2.17) the transfer function of the DCO phase noise becomes
Hnoise,DCO(z) =z2 – 2z + 1
z2 + (↵ – 2)z + (1 – ↵+ �). (2.21)
This has a double zero at z = 1, and the following poles,
z1,2 = 1
p1,2 =1
2
✓(2 – ↵)±
q↵2 + 2↵ – 4�
◆.
(2.22)
2.3 Multirate ADPLL analysis 19
A double zero at z = 1 for HnoiseDC0means that for low frequencies, the frequency response will
decrease with 40 dB per decade. The poles are the same for HADPLL and Hnoise,DCO, to avoid
unwanted resonance peaks it is better to choose those poles real. Resulting in the condition that
↵ < –1 –p1 + 4� or ↵ > –1 +
p1 + 4� (2.23)
The chosen values for demonstration purposes for a wide bandwidth PLL are ↵ = 0.8 and
� = 0.1. Figure 2.9 shows the plot of the loop filter H(z) itself. Figure 2.10 shows the two noise
transfer functions. Finally figure 2.11 shows the output noise for the integrating loop filter with
104
105
106
107
−5
0
5
10
15
20
25
30
35
40
f (Hz)
Magnitude
Figure 2.9: Magnitude plot of an integrating
loop filter: H(z) = 0.80.1
z – 1
104
105
106
107
−40
−30
−20
−10
0
10
20
f (Hz)
Magnitude
HADPLL
Hnoise,DCO
Figure 2.10: Magnitude of the noise transfer
function for a type II ADPLL
the previously specified noise values.
2.3 Multirate ADPLL analysis
In the previous section, all noise sources where assumed to be sampled at the reference clock
and the output spectrum only gave spectral information upto fref . While this might be true
for the TDC and tuning noise, which are digital by nature, the DCO noise contains spectral
content at higher frequencies. Also, a PSD for frequencies from 0 to fref is not enough for noise
analysis, information about the spectral content upto the oscillator frequency fDCO is desired.
[4] describes a way to perform a full multirate analysis of the ADPLL using a time-variant
analysis. In this section will be started from the same basic ideas to convert the signals clocked
at reference to the higher DCO frequency. However the analysis itself will be done di↵erently,
trying to give a more intuitive approach, using time-invariant methods.
20 CHAPTER 2. ADPLL LOOP ANALYSIS
104
105
106
107
−180
−170
−160
−150
−140
−130
−120
−110
−100
−90
−80
f (Hz)
Magnitude(d
Bc)
TDC noisetuning noiseDCO noiseTotal noise
Figure 2.11: Sampled output phase noise for di↵erent noise sources,type 2 loop filter, ↵ = 0.8
and � = 0.1
2.3.1 Multirate block diagram
The block diagram in figure 2.12 shows the conversions between the di↵erent clock domains. A
fixed relation N is assumed between the reference frequency and the variable DCO frequency.
This is the case if the ADPLL is locked to a fixed frequency. N can be fractional, but to perform
up and down sampling N is considered integer. Technically, when N is fractional and can be
written as NNND
, upsampling by N should be represented by integer upsampling by NN followed
by downsampling by ND. But this will not be done in this analysis.
The phase error �� is processed in the reference clock domain, and filtered trough the loop
filter H(z). Next this phase error has to be converted to the higher frequency used by the ⌃�
modulator for dithering the DCO. The way this is implemented in the block diagram is by
upsampling to the DCO-frequency using the factor N. Subsequently placing a zero-hold-filter
and sampling back down with a factor M. When no ⌃� modulation is used, M = N and this
conversion block can be omitted. The zero hold filter has the e↵ect that the upsampled signal
remains constant over a length N.. The key is to make sure all blocks work with signals in the
highest clock domain, e.g. the DCO clock. In figure 2.13 the blocks are replaced so that the loop
is now modelled completely at fDCO. All input noise signals are also converted to the higher
frequency. Placing an upsampling block " N from behind a transfer function H(z) to the front,
means this transfer function should be converted to one that operates at fDCO. Meaning the z
that was defined with respect to a frequencyfDCO
Nnow has to be replaced by zN.
2.3 Multirate ADPLL analysis 21
H(z) HN(z)�ref +-
"N
#N
�TDC
#M HSTF
HNTF�tune
"M HM HDCO
�dco
�out++
++
Figure 2.12: Multirate block diagram of the ADPLL
H(zN) HN(z)�in +-
#"N #"M HSTF(zM)
�tune,shaped
HM(z) HDCO(z)
�dco
�out++
Figure 2.13: Multirate block diagram of the ADPLL with blocks replaced for full operation in
DCO-clock domain
The behaviour of all the blocks will now be analysed to find the correct transfer functions. A first
important building block is the down conversion immediately followed by a down conversion.
Upsampling
Upsampling here means the conversion of a signal x(n), with sampling time T, to a signal y(k)
with sampling time T/N. x(n) is a discrete signal with all the energy concentrated at the time
instances nT. Meaning y(k) will have zero amplitude in all other time instances. Resulting in
the following y(k),
y(nN) = x(n)
y(k) = 0 , k 6= nN(2.24)
The power spectrum of y(k) is related to x(n), but not exactly the same. By specifying the
signal to be zero at certain time steps, the total signal power decreases and this power is spread
out over a higher frequency band.
X(f) =+1X
n=0
x(n)e–2⇡jfnTs
Y(f) =+1X
n=0
x(n)e–2⇡jfnTs
(2.25)
22 CHAPTER 2. ADPLL LOOP ANALYSIS
f
SXX
1
2Ts
1
f
SYY
1
2Ts
1
N2
N
2Ts
Figure 2.14: The e↵ect of upsampling on the power spectra with N = 3
The frequency spectra of both signals are equal, but the nyquist frequency of Y(f) is N times
higher than the one of X(f). Meaning the power spectrum of y(k) will be periodic and a factor1
N2 lower in magnitude. If SYY is defined as the power spectrum of y(k) (with sampling time
Ts/N), the following relation with SXXf is found
) SYY(f) =1
N2SXX(f), (2.26)
with power on frequencies from [–N
Ts
N
Ts]. Figure 2.14 demonstrates this e↵ect schematically.
Real signals are assumed so that the frequency spectrum is also symmetric with respect to f = 0.
Downsampling
Downsampling is the exact opposite operation of converting a signal from one clock domain to
another one with a lower sampling rate. If we take x(n) to be a signal with the highest sampling
rate Ts and y(k) a signal with lower sampling rate NTs then their relation is
y(k) = x(kN). (2.27)
Contrary to the case of upsampling, downsampling means the nyquist frequency gets lower and
spectral content may be lost. It is at the downsampling blocks in diagram 2.12 that aliasing may
occur. To analyse the relation between transfer functions Y and X, it is easier to consider the
operation downsampling followed by upsampling. This is actually the important operation in
the block diagram. Figure 2.15 shows the behaviour of this operation. This important down-up
sampling will be denoted by the star operation x⇤(n). The behaviour of the star operator is equal
to the multiplication in the time domain with a Dirac-series �N(n) : {1, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0}
with 1 pulse every N samples. In the frequency domain, this means X⇤(f) equals the convolution
2.3 Multirate ADPLL analysis 23
#" Nx = {x0, x1, x2, x3, x4, x5, x6, ...} x⇤ = {x0, 0, 0, x3, 0, 0, x6, ...}
Figure 2.15: Down-up conversion
of the Dirac-sequence spectrum and X(f). �N(n) is periodic, so it has a discrete time fourier
series �N(l):
�N(l) =N–1X
n=0
�N(n)e(2⇡jl nN) =
1
N. (2.28)
This means that the continuous spectrum �N(f) consists out of a series of Diracs:
�N(f) =1
N
N–1X
l=0
�(f – fsl
N) (2.29)
Finally resulting in the following spectrum for X⇤
X⇤(f) =1
N
N–1X
n=0
X⇣f +
n
Nfs⌘
(2.30)
This can also be represented directly in the z-domain:
X⇤(z) =1
N
N–1X
n=0
X⇣z e2⇡jfTs
nN
⌘(2.31)
The spectrum of the signal y(n), is then
Y(f) =1
N
N–1X
n=0
X
✓f +
fsNn
◆(2.32)
With fs =1
Ts, the highest sampling rate. It is clear that when the spectral content of X is
limited to a bandwidthfsN
(as an anti-aliasing filter would become), there is no aliasing and
Y(f) =X(f)
N.
2.3.2 Multirate transfer funtions
The system in figure 2.13 can further simplified to result in the schematic of figure 2.16 if it is
assumed that the up-down conversion with M of the ⌃� modulator can be approximated by a
factor1
M. However in the continuation, it will be assumed that M = N, and ⌃� modulation
will not be taken into account. The transfer function HLG then equals
24 CHAPTER 2. ADPLL LOOP ANALYSIS
�in+
-#"N
�Q
�outHLG(z)
Figure 2.16: Simplification of the multirate model
HLG = H(zN)HN(z)1
MHSTF(z
M)HM(z)HDCO(z) (2.33)
The zero hold filter has an impulse response where the first N samples equal 1 and all the other
0. Simply using the definition of z-transformation leads to the following transfer function
HN(z) =+1X
n=0
xN(n)z–n
=N–1X
n=0
z–n
=1 – z–N
1 – z–1.
(2.34)
To calculate the transfer function HLG, a system is used without ⌃� modulation by taking
H(z)�ref +-
#N
�TDC
�tune
"N HN HDCO
�dco
�out+
+
++
Figure 2.17: Block diagram of the multirate ADPLL without ⌃�-modulation
M = N. This means the signal shaping function HSTF and noise shaping function HNTF and
HM(z) equal 1. Figure 2.17 shows the diagram for the situation without ⌃�. This simplifies
HLG(z) to
HLG(z) = H(zN)HN(z)HDCO(z). (2.35)
2.3 Multirate ADPLL analysis 25
106
107
108
109
−80
−60
−40
−20
0
20
40
60
80
f (Hz)
Magnitude(d
B)
Figure 2.18: Magnitude bode plot of the multirate open loop transfer function HLG(z)
The transfer functions of these di↵erent blocks, and the used numerical values are given by:
H(zN) = ↵+�
zN – 1
HN(z) =1 – z–N
1 – z–1
HDCO(z) =1/N
z – 1
fref = 50MHz
fDCO = 2.45GHz
=) N = 49
↵ = 0.8
� = 0.1
(2.36)
The normalised DCO transfer function still behaves as an integrator, however since the integra-
tion is now monitored at a rate N times faster than the reference clock, it takes N timesteps for
the output phase to rise with the tuning word, hence a factor 1/N is included in the transfer
function. Finally this results in
HLG(z) =1
N
↵zN + (� – 1)�zN – zN–1
�(z – 1)
. (2.37)
Figure 2.18 shows the magnitude of the transfer function for the previously specified values
from (2.36), clearly showing the spectral folding e↵ects for frequency larger than 50MHz.
The frequency spectra of the input and output noise sources �in and �Q can now be written in
function of the previously defined noise sources. �tune is now referred with respect to the output
26 CHAPTER 2. ADPLL LOOP ANALYSIS
instead of the input in the previous section because of simpler calculations.
�in(z) = [�TDC]"N
�Q(z) = �DCO + [�tune]"NHN(z)HDCO(z)(2.38)
Using the star-operator, and looking at figure 2.16 ,the output phase noise can be written as
�out = (�in – �out)⇤HLG(z) + �Q. (2.39)
Next the star-operation ⇤ is applied on both sides of the equation. The star-operation is clearly
a linear operation, so this results in
�⇤out =
✓(�in – �out)
⇤HLG(z)
◆⇤+ �⇤
Q. (2.40)
To solve this the following property of the star-operator is used:
✓F(z)⇤.G(z)
◆⇤= F(z)⇤.G(z)⇤ (2.41)
This property can be proven by directly applying equation (2.31). Using this on the output
phase equation leads to
�⇤out =
H⇤LG
1 + H⇤LG
�⇤in +
1
1 + H⇤LG
�⇤Q
(2.39) =) �out =HLG
1 + H⇤LG
(�in – �Q)⇤ + �Q
(2.42)
The result of this analysis is shown in figure 2.19. It shows the output phase noise for the
di↵erent noise sources. For frequencies under fref the phase noise approximates the results of
the single rate system very accurately.
2.3 Multirate ADPLL analysis 27
104
105
106
107
108
109
−180
−170
−160
−150
−140
−130
−120
−110
−100
f (Hz)
Magnitude(d
Bc)
TDC noisetuning noiseDCO noiseTotal noise
Figure 2.19: Phase noise in the multirate system
28 CHAPTER 2. ADPLL LOOP ANALYSIS
29
Chapter 3
Design of a DCO
Now that the ADPLL principles are explained and explored,a good circuit implementation is
searched that matches the behaviour of the ideal building blocks as good as possible. The first
building block is the most important one, namely the DCO. Of course without a controllable
oscillator any PLL is pretty useless. In this chapter there will be a quick look at the di↵erent
oscillator types that are used in current day loops. After which the actual circuit implementation
of a ring-oscillator will be discussed in 65 nm CMOS.
3.1 Digital controlled oscillator
As discussed in the previous sections, the tuning is controlled by a digital word. This word could
be either binary weighted or thermometer encoded or whatever format the designer has chosen.
The oscillator has to be designed as a fully analog system, meaning this digital control word
somehow has to be converted to analog quantities. Basically there are two possible ways one
could interpret this. The first way is to use a standard VCO and convert the digital control word
into an analog control signal with a DAC, as shown in figure 3.1. At first glance this might seem
the most intuitive implementation. However, converting the signal to an analog voltage might
require large capacitances, which was one of the reasons in the first place to remove the analog
phase detector in a classical-PLL and choose for a ADPLL. Furthermore, this extra conversion
at this critical point in the loop can only inject more noise and make the system less predictable.
Another approach is to use the digital bits directly to switch on or o↵ discrete components in
30 CHAPTER 3. DESIGN OF A DCO
DACDTW[1:N] VC VCO
Figure 3.1: Using a DAC to generate a control voltage for a standard analog VCO
the DCO itself. Technically this also a DAC, but it is integrated directly in the DCO. Normally
in standard VCO’s, the tuning voltage controls another property, like current or capacitance,
that changes the frequency. For example, in an LC-oscillator the tuning voltage controls the
capacitance of a varactor. In a relaxation oscillator and ring oscillator it can both control the
capacitance or the current. In a DCO, one could try to change one of these properties directly
with the tuning bits, e.g. by using a switch to add extra capacitances or switch o↵ fixed current
sources. This will have better properties than an analog conversion since there are only two
possible states. A popular approach in LC-oscillator still concerns varactors but switches every
varactor between only a low and a high state. This particular approach will be discussed a bit
more further on.
3.2 LC oscillators
Most communications applications nowadays make use of the LC-oscillator. So does design of
Staszwski and many others with similar approaches [1]. A lot of research has been performed
on this topic and very stable and low noise oscillators can be designed. It’s frequency is given
by
f0 =1
2⇡pLC
(3.1)
Assuming the inductance L is rather fixed, especially on chip, a popular tuning method of this
type of oscillator is changing the value of the capacitance C by varactors. A varactor essentially is
a diode of which the value can be changed by applying a di↵erent backwards voltage. A popular
method is using a pmos transistor as varactor.[1, p.-31] In traditional CMOS the capacitance has
a rather linear relation with the applied voltage. In deep submicron CMOS the transition from
maximum to minimal value becomes less gradual. This means this type of varactor becomes
less attractive for standard VCO’s, but remain or even become more useful for digital tuning,
since they behave more like a switch. LC-oscillators might be the preferred choice for for
design on PCB, on chip it poses quite some design challenges. The largest problem being the
implementation of the inductor. Since ADPLL’s are being researched for applications in small
3.3 Ring oscillator 31
VoL C –gm
Figure 3.2: LC oscillator
on-chip systems, large discrete inductors are the last thing one wants. It is possible to integrate
inductors on chip, but these require long conducting paths and take up a lot of chip area.
Furthermore, parasitic couplings cause the inductor to behave less than ideally. This results in
many di�culties ilayouting an inductor on chip. A final concern is the fact that the inductance
does not scale well with new technologies, meaning the surface area becomes relatively larger
and the layout process have to be restarted for every new technology step. Therefore, in this
thesis it was chosen to go for a more digital approach that is better integrable with the digital
control logic in deep submicron.
3.3 Ring oscillator
3.3.1 Basic principle
The basic concept of a ring oscillator is fairly easy to understand. Figure 3.3 shows how an odd
number (N) of inverters is put in series. Clearly this is an unstable system. The number of cells
has to be odd in this case, otherwise a stable state exists. At first it is simply assumed that every
inverter has a certain TD before the output is changed as a reaction on a new input. If this is
the case there is always at least one inverter switching it’s output state. For a correct oscillator
operation it is always desired that there is exactly one inverter switching. When a certain cell
has switched, the next cell will begin switching, this way the active cell travels around the ring.
Every state will thus switch between 0 and 1. When the switching cell has travelled around the
ring 2N times, the state at a certain point will be back in it’s original position. This leads to
the following simple equation for the ring oscillator frequency,
f0 =1
2NTD. (3.2)
For ideal digital inverters the output is a nice square wave switching from 0 to VDD. Ring
oscillators have the benefit of very wide tuning ranges with rather linear response to input,
32 CHAPTER 3. DESIGN OF A DCO
Figure 3.3: Standard single-ended ring oscillator
compared to LC-oscillator. The tuning design should go a bit easier. However in general they
exhibit more phase noise for the same power consumption.
When simply connecting 3 standard inverters back to back one would be lucky to get any
oscillation at all. While the basic principle of ring oscillator is very easy, designing a good
tunable oscillator with low phase noise is quite a challenge. A lot of research has been done and
is still being performed on this topic.
A first issue with this simple structure is the fact that inverters can be in a metastable state
between the high and low level. During switching, the voltage does not drop with a very steep
slope, but there is gradual transition from high to low. For example in the case of a single
ended inverter, the very important switching delay will be a function of the drive current of the
transistors and the capacitive load. While the drive current can be a very complex function of
the input voltage and time, depending on the type of oscillator used, the dependence of TD on
the capacitance should be proportional for all types. Figure 3.4 shows an inverter with an ideal
voltage step VDD to 0 at the input. The pmos is completely in cut-o↵ while the nmos is in
saturation and behaves as a resistor Ron. The output voltage is then given by
Vout = VDD
✓1 – exp
✓–t
RonCload
◆◆. (3.3)
If the output is assumed to have switched when the voltage reaches the threshold voltage VT,p
of a pmos inverter then the delay time becomes,
TD = ln
✓VDD
VDD – VT,p
◆RonCload. (3.4)
However, before Vout reaches VT,p the nmos will already start conducting current. Transistor
are most certainly not perfect switches. Meaning the TD of a second pair will already be
di↵erent. This means, when the input is not a perfect step, the output voltage will behave
di↵erent and will not have an exponential characteristic. The ideal square wave output is thus
far removed from reality. The good thing however is that, since all elements are connected in
a ring, the voltage transitions will converge to a steady state waveform and all delay times of
3.3 Ring oscillator 33
Vin
Vout
VDD
ron
Cload
Figure 3.4: Switching of an ideal inverter
Figure 3.5: Ring oscillators with di↵erantial delay cells
the di↵erent elements will be equal. It is also possible that a certain cell starts switching again
in the opposite direction before the output is even close to it’s final value. This decreases the
e↵ective switching time TD and makes the entire system less predictable. Especially when the
output waveform is exponential is this problematic, because it decreases very slowly in voltage
near the end of transition.
3.3.2 Di↵erential ring oscillators
A good improvement is the use of a di↵erential ring oscillator. These configuration will also
allow the use of an even number of stages, using a wire inversion, as figure 3.5 shows. All signals
are defined di↵erentially in this structure A first possibility to implement this at circuit level is
a fully di↵erential cell, which is basically a di↵erential pair. It is shown in figure 3.6. Only when
the input voltage di↵erence is small will both transistor act in their linear operation area and
will the cell behave as an amplifier with gain gm. When the input voltage di↵erence becomes
larger, one transistor will cut-o↵ while the other one conducts the full current. The slew-rate
will determine the delay time.
34 CHAPTER 3. DESIGN OF A DCO
VDD
Vin+ Vin–
Vout+Vout–
Figure 3.6: Fully di↵erential delay cell
Vin+
Vin– Vout+
Vout–
Figure 3.7: Latch structure concept of a pseudo-di↵erential delay cell
A second implementation is the so called pseudo-di↵erential delay-cell. Signals are still defined
di↵erentially, but the switching threshold of each input line is defined independently with re-
spect to the supply voltage.This while in the fully di↵erential case, switching occurs when the
di↵erential inputs cross each other’s levels and equal 0. Figure 3.7 will clarify this principle.
The design itself is based on a latch structure for master-slave flip-flops. The cross-coupled
inverters result in positive feedback and will ensure a 180� phase di↵erence between the two
output sides. When the input signal at one of the terminals crosses the switching threshold of
the input inverter, the corresponding output voltage will start to change. At some point this
will switch the feedback inverters and force the other output line to the complementary value.
During operation there will be a certain time period when two inverters, feeding the same output
line try to impose a di↵erent output. When this happens there will be short circuit during some
time. Therefore it is important that the strength of every inverter is carefully tuned. Smaller
transistorW
Lresults in higher on resistance and lower currents. The input inverters have to be
the strongest. If the feedback inverters would be a lot stronger than the input cells, the output
would never change, as is the case in latch cells. Using a series of these pseudo-di↵erential cells
in a ring oscillator will ensure that both input lines change their value during the same time
3.3 Ring oscillator 35
Vin+ Vin–Vout+
Vout–
VDD VDD
ItuneVtune
NA+ NA-
PA+ PB+ PA-PB-
Figure 3.8: Circuit implementation of a pseudo-di↵erential delay-cell
period. The full configuration of figure 3.7 however is only a concept, and not the circuit that
was finally implemented. Instead of full inverters for inverters with both n- and pmos transistors
only a pmos transistors is used. This means a low output level on one of the lines can force the
other line to a high level, but not opposite. Figure 3.8 shows the final implementation. Note
that the nmos source is not connected to ground, but to the tuning circuit. This will limit the
available current for the cell and determine the frequency.
3.3.3 Design choices
The delay cell of figure 3.8 is implemented in Cadence, using the 65 nm technology. In the
particular design in this work on oscillator was needed to oscillate at a frequency of 2.4 to
2.5GHz. An even number of cells was chosen, since it might be useful to also have access to the
waveform in quadrature. As a first proof on concept design a low number of cells was chosen,
i.e. N = 4. When taking a center frequency of 2.45GHz, using equation (3.2), the desired delay
time is 51 ps. This is certainly no problem in the 65 nm CMOS technology. The design is of
course symmetrical, positive and negative side are equal. The nmos transistor is chosen to be
relatively stronger than the pmos. This is because the tuning circuit will change the current
36 CHAPTER 3. DESIGN OF A DCO
trough this nmos. All transistor lengths are chosen minimal. To design the cell, transistors are
modelled as behaving like resisors in the one state. However designing the the ring oscillator is
a very iterative process, relying heavily on simulations to fine-tune the transistors. The choice
of design cell and its sizing has a great influence on the design of the delay cell itself. Eventually
minimal length transistor were chosen, this to limit the load capacitances as much as possible.
Table 3.1: Transistor sizing of the delay cell in figure 3.8
Transistor Name WL
NA 6.5
PA 2
PB 3.3
Scaling is based on the fact that the width is inversely proportional with the on resistance. It is
repeated that this is still a very simulation intensive tuning. Waveforms are di�cult to predict.
3.4 Tuning of the ring frequency
As mentioned before the delay of a cell depends on the current trough the transistors and the
load capacitance. So there are two possibilities to tune a cell. In LC oscillators the capacitance
is tuned, which leads to a minimum capacitance di↵erence, dependant on the technology used.
The ring oscillator enables the designer to choose to limit the available current for charging the
load. In this thesis this will be the used strategy. To general idea is as follows: The drive current
of the nmos transistors, required to discharge the load, in figure 3.8 is limited, is done by placing
a tunable current source under the delay cell. When a cell is not switching it uses no current.
There will only be two cells switching at the same time if the design is well performed. So if the
current from the tuning circuit can’t be interrupted, the tuning circuit has to be connected to
all cells together, shown in figure 3.9. Only the current through the nmos will be limited. The
pmos transistors are designed much smaller. The e↵ect can be explained as follows, imagine
that Vin+ in figure 3.8 becomes high and Vin– low. A current I+ will discharge the load at the
negative output trough transistor NA+, while a current I– will start charging the load at the
possitive output. By design of the cell I– < I+. Note that a part of the current I+ will not
discharge the load, but will come directly from the supply voltage trough PB+. If the design
is well performed, the output Vout– will cross the pmos threshold voltage after some time and
activate PB–, to speed up the slope of Vout+. In the configuration the nmos is dominant for
3.4 Tuning of the ring frequency 37
VDD
TuningDCW
Figure 3.9: Layout of the tuning circuit in a di↵erential ring oscillator
the tuning, meaning the falling output edge comes first. It is this falling edge that will force
the rising edge to start rising faster as well. The big advantage of di↵erential ring oscillators is
that because of the symmetry, the voltage waveform will always be the same at every point of
the ring. An easy definition of the delay time can than be the time between zero-crossing at the
input of a cell and output. Where the zero crossing is the point where the positive and negative
signals are equal.
Idealy the shape of the waveform should be linear slopes. As if an ideal current source would
charge and discharge the capacitor. In reality it can not be assumed that the tuning current is
used completely to charge the load, i.e. this current will be divided over two cells when another
cell starts switching while the previous cell has not reached it’s final value. But this unrealistic
assumption can lead to some guidelines for the desired tuning current. Its value can be estimated
based on the used transistor sizes.
TD ⇡ ItuneVDD
2Cload(3.5)
For example, when the load capacitance is 5 fF and the required delay time is 51 ps, the required
current is around 60µA. Again this a very crude result, for this type of design an iterative
process is required. Based on time analysis simulations, transistor sizes and tuning current have
to be optimised. The load capacitance of a delay cell consists out of the input transistors of the
next delay cell, and the input transistor of the bu↵er.
Figure 3.10 shows the basic tuning circuit. The capacitor and resistor model the imperfections
in the current sources, such as the parasitic capacitances and a high but finite output resistance.
38 CHAPTER 3. DESIGN OF A DCO
Ring
Vtune
ItuneC
R
Figure 3.10: The tuning circuit of the ring os-
cillator
�Vtune
�ItuneCR
Rring
Vring
Figure 3.11: Small signal model of the tuning
circuit
For the current source to function, the voltage Vtune must be higher than the saturation voltage
of this current source. This voltage will depend on the sizing of the nmos transistors used and
raises the low-voltage of the output wave. An interesting phenomenon is that this tuning voltage
remains rather constant for changing current values. It could even be interpreted as if the tuning
cell feels the ring oscillator as a diode, having a rather constant forward polarised voltage. With
the help of simulation this value was chosen to be around 0.3 V . This is a good compromise
between current source headroom and voltage swing.
On this constant a varying AC-part is superposed. This AC-part is a result of the fact that the
cells are switching at a frequency 2Nf0, resulting in a AC-term with frequency around 20GHz.
A first order small signal is shown in figure 3.11. The ring oscillator is here replaced by a
small signal resistor. This means raising the current by �I is equivalent with lowering the
tuning voltage by �I ·R. Changing the supply voltage is also one of the techniques to tune the
oscillator, but note that they are in fact completely equivalent. The AC-term Vring also causes
an extra current in the tuning circuit given by
Iring =sC
1 + sCRringVring, (3.6)
ignoring the current source resistance R. Or equivalently, an extra AC-term is introduced on
the voltage �Vtune.
�Vtune =Vring
1 + sRringC(3.7)
3.4.1 Concept of Banks
To tune the oscillator over the complete frequency range with a certain discrete frequency step
a high number of bits might be needed. If someone wants to cover the entire bluetooth band
3.4 Tuning of the ring frequency 39
of 80MHz with a frequency resolution of 1 kHz, this would result in upto 17 binary weighted
elements. Since matching elements on chip is mostly possible upto 8 bits, a solution has to be
found to use elements that are not perfectly matched together. In the work of Staszewski, a
method using di↵erent banks is described. Each bank has a range of 8 bits and is internally
matched. Di↵erent banks however do not have to be matched with respect to each other [1].
The use of three banks is described. Every bank consists out of discrete varactors that have two
possible capacitance values. The first bank is to account for PVT, or variation due to Process,
Supply Voltage or Temperature deviations. It has a very wide tuning range with a crude tuning
step, used at start-up to find the center of the bluetooth band. Because of variations in the
process the transistor properties might not be well defined. Also when temperature changes,
oscillation frequency might change drastically. Once the center of the band is found, the 8 bits
used for tuning the PVT bank are locked and the second bank starts being used. This has a
resolution of about 0.5MHz and a total range of 118MHz. It will be used to find the correct
communications channel. Once this is found the loop switches to the third and final bank,
the tracking bank. Here the actual frequency modulation within the channel is tracked with a
resolution of 23 kHz and range of 1.5MHz.
In this thesis however, since it is the current that is tuned, a slightly di↵erent method will be
attempted. Instead of a separate bank it should be possible to change the value of the fixed
current source I0. This can be done by an analog tuning voltage. This tuning may be very
inaccurate. Once the frequency is in the desired range the voltage can be locked and the current
source is kept constant for the rest of the ADPLL operation. In this work, only a fixed current
source is used, it is assumed to be tuned to the correct value. The margin for error during
PVT range is very large, depending on the designed range of the acquisition bank. Meaning the
actual implementation should not pose many additional demands.
The concept of banks is not completely abandoned in this work, and two separate tuning banks
will be created, each with 8 bit tuning. The first bank, or acquisition bank will be designed for
a resolution of 1MHz and related range of 256MHz. 1MHz tuning will allow the loop to get
very close to desired bluetooth band. A second finer bank is then implemented, likewise with 8
bits resolution. A range of 2MHz will be more than su�cient to cover an entire channel. With
8 bits resolution, a frequency step op 25 kHz is found.
If ⌃�-modulation is used, it should be applied on the smallest current sources, switching them
on and o↵ at a higher rate. For now these 25 kHz will we taken as the lowest frequency step of
this DCO.
40 CHAPTER 3. DESIGN OF A DCO
�Itune
Imin2Imin4Imin
tune2 tune1 tune0
I0
Figure 3.12: Principle of switched current sources for oscillator tuning
3.4.2 Circuit implementation
To design an tunable current source and control it with a digital control word, a fixed current
source I0 will be designed, after which an array of other fixed current sources is designed to be
switched on or o↵ by digital switches controlled by the tuning word. Figure 3.12 demonstrates
this principle. The lowest possible frequency should be the one with the minimal amount of
current, or the frequency that is obtained when only the fixed current source I0 is running and
all other sources are cut-o↵. The application in mind is bluetooth, with a minimal frequency
of 2.4GHz. Therefore a slightly lower frequency will be chosen than 2.4GHz to have some
margin. A stated above the acquisition has a tuning range of 256MHz, so a good value would
be 2.32GHz. Using simulations to optimise the used current source, a value of 102µA is obtained.
This is quite a bit higher than the expected 60µA, indicating the inaccuracy of this formula.
Figure 3.13 shows an a plot of the actual current going trough the nmos transistor of a cell.
It’s value clearly reaches a peak around 110µA, but most of the time the current is lower, as
the tuning current is divided over multiple cells. However when considering the mean current
delivered to a cell from the moment of onset of conduction until the current is 0 again, this value
is closer to the theoretical value. However it is still to far-fetched to use this fact directly as
a design rule. Tuning via simulation stays necessary. Note that this peak is a bit higher than
the 102µA delivered by the tuning circuitry. There will be some current flowing from other
cells due to parasitic capacitances. However this small di↵erence will be ignored. A value of the
minimal current sources still has to be chosen. At first there will be tried to find an analytical
approximation of this value. After this, again simulations will be used to fine-tune the result. If
a current of 102µA results in a frequency of 2.4GHz, and a completely linear relation between
3.4 Tuning of the ring frequency 41
2 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 3−20
0
20
40
60
80
100
120
time (ns)
curren
t(µA)
Figure 3.13: Current through the nmos of a single delay cell
current and frequency is assumed, a value of 42 nAMHz is found. After implementing this the
frequency step is too small. Tuning results in a minimal current source of 100 nA.
The array of 8 binary weighted current sources is than constructed following the principle of
figure 3.12. The current source itself is a single nmos transistor. All 8 current sources have the
same bias voltage, coming from a mutual bias control circuit. But the width of every transistor
is weighted with a binary scaling factor. If, due to process variations, the required current is
much di↵erent from this predefined value of 1MHz, it should still be possible to tune the bias
current source to a better value. This is an advantage over the use of discrete capacitors where
the value is fixed. The circuit implementation of the tuning section is shown in figure 3.14.
The tracking bank is implemented in an equivalent way. For a minimal current source, a first
value of100 nA
40is attempted. This is 1/40 of the minimal current of the acquisition bank. If
the oscillator frequency is locally linear, what it should be, a value of 2 nA should result in
25 kHz resolution. Simulation confirms this. The bias voltage for the tracking current sources
is generated by a separate bias circuit, completely independent
3.4.3 Designing the switches
The switches are themselves implemented with nmos transistors and will show imperfections.
The idea is to interrupt the current path of the current source. When the switch is turned on,
42 CHAPTER 3. DESIGN OF A DCO
�Itune
tune2 tune1 tune0
W = W0W = 2W0W = 4W0
Vbias Vbias VbiasVbias
Imin
W = W0
Figure 3.14: Circuit implementation of the current tuning, including bias
Ron
Vbias
Cpar
Vbias
Cpar
VGS VGS
Figure 3.15: Model of the switch in on (left) and o↵( right) state
the nmos transistor will have a finite on-resistance (figure 3.15 ).This means the voltage VGS
will depend on the value of this resistor and the current trough the source, it’s value is
VGS = Vbias – 2nIminRon (3.8)
If the current gets higher the bias voltage drops which results in a lower e↵ective current trough
the transistor. When turning the switch o↵, the voltage at the source will rise until the current
source cuts o↵. The current will charge the capacitance until it’s charged enough to lower VGS
su�ciently. This capacitance is the parasitic capacitance of both the switch and the current
source, Meaning it’s total value will scale with the larger current sources. A problem arises
however because when the current trough the source decreases, charging the capacitor will go
slower, and a low current will keep going through the source a rather long time. When one
of the larger current sources switches o↵, this remaining current might still be larger than the
smaller current sources. To solve this problem it is needed to charge the capacitor very rapidly
when the switch turns o↵. The implemented solution is shown in figure 3.16. A pmos is added
3.4 Tuning of the ring frequency 43
tune
Vtune
Vtune
Vbias
Figure 3.16: Using a pmos switch to quickly charge the parasitic capacitance
that is connected to the same tuning line. When the nmos turns o↵, this pmos will switch on
and rapidly charge the capacitor. It is best not to charge this capacitor higher than the drain
voltage of the current source ( this is a constant voltage, see previous section). Otherwise a large
current peak in the opposite direction might occur when the switch turns on again. Because
of the binary coding it can still happen that a lot of switches have to change state for only a
small current change. For example switching from to 10000000 to 1111111 can result in spikes
because of a small mismatch in time need to switch on and o↵. The largest current source needs
some more time to turn o↵ while all other sources switch immediately . Resulting in current
spikes. This results in a very short increase in frequency before the frequency decreases to it’s
final value. The phenomenon only happens during frequency decrease
The length of the current source transistor was not chosen minimal. This to improve output
resistance and also because minimal length transistors do not behave well as current source. The
length L = 200 nm. However the width was chosen minimal: W = 120 nm. The nmos switch
was initially taken minimal, but as mentioned above, when a large current has to flow trough
the switch, voltage over the transistor increases and the current source decreases in current.
Therefore the switch is scaled together with the current. Resulting in in W = 16microm for the
largest current. The scaling of the pmos transistor is not that important. It should not add too
much capacitance however. W was chosen to equal 400 nm.
44 CHAPTER 3. DESIGN OF A DCO
WL =3 W
L =3
VDD
WL =17
WL =17
Vin+ Vin–
Vout+Vout–
Figure 3.17: Level shifter output bu↵er
3.5 Output bu↵ering
Because of the tuning circuit the low voltage of a delay cell is raised to 0.3V. However in
subsequent digital blocks, a full switch to 0 is required. This raised voltage VLow is not capable
of turning an nmos transistor completely o↵ and could short-circuit the system. For this reason
a level shifter output bu↵er is designed, the design shown in figure 3.17. It is similar to the used
delay cell, using the cross coupled transistors. But here the input signal is not connected directly
to the nmos. Figure 3.17 shows the desig Furthermore, the frequency is highly dependant on
the load capacitance. However if delay cells are sampled by flipflops this might change the
capacitance, changing the frequency. Therefore the bu↵er has to e↵ectivally shield the ring
oscillator from the rest of the circuit. After the level shifter an array of 3 inverters is placed.
Keeping the load capacitance constant.
3.6 Simulation verifications
Figure 3.18 shows an plot of the waveform in the time domain. The rising edge starts switching
first, but the falling edge will be faster due to the feedback. Voltage swing is between 0.3V and
1.2V. To test the e↵ective tuning of the DCO, the input bits are toggled trough all possible
3.7 DCO phase noise 45
2.2 2.25 2.3 2.35 2.4 2.45 2.5 2.55 2.60
0.2
0.4
0.6
0.8
1
1.2
1.4
time (ns)
Voltage(V
)
Output Cell 4Output Cell 3+
+
-
-
Figure 3.18: Di↵erential output of two sequential delay cells
0 0.5 1 1.5 2 2.5 32350
2400
2450
2500
2550
2600
2650
time (µs)
DCO
freq
uen
cy(M
Hz)
Figure 3.19: Changing frequency of the DCO over the full range by changing the coarse input
states in order. The e↵ective frequency is determined by monitoring the time di↵erence between
subsequent clock edges. Inverting these times leads to the e↵ective output frequency. Figure
3.19 shows the frequency changing over the entire range in steps of 1MHz. The frequency spikes
seen in the graph are glitches in the simulator and have nothing to do with the phenomenon
discussed above. Decreasing frequency steps are not shown in the graph. Figure 3.20 shows a
smaller portion of the range.
3.7 DCO phase noise
Estimating noise source in ring oscillators is not straightforward. A lot of designs rely on
simulation data to estimate the phase noise. Transistors in the ring oscillator can not be modelled
46 CHAPTER 3. DESIGN OF A DCO
0.52 0.54 0.56 0.58 0.6 0.62 0.64
2400
2405
2410
2415
2420
2425
2430
2435
2440
2445
time (µs)
DCO
freq
uen
cy(M
Hz)
Figure 3.20: Detail of the DCO tuning, showing coarse frequency resolution
in their linear range, making it hard to calculate transistor noise sources. [3] describes some
relationships between the parameters and the phase noise. Assuming that most noise sources
are white noise that are integrated to an 1/f2 output spectrum, [3] states that the phase noise
is inversely proportional to the drive current and the supply voltage. Unfortunately, DCO
phase noise is very high. Some other designs in [3] or [10] mention ring oscillators with similar
noise performances. These are however not suitable for communication purposes. With careful
design, accounting for phase noise in every step of the design, it should be possible to achieve
communication specs with a ring oscillator. But at the cost of higher current [3]. The simulated
phase noise is shown in figure 3.21. It has a value of –100 dB at a frequency of 10MHz. And
the expected slope of 20 dBc per decade.
3.7 DCO phase noise 47
105
106
107
108
109
−180
−160
−140
−120
−100
−80
−60
frequency (Hz)
PSD
(dBc)
Figure 3.21: Open loop output phase noise of the DCO
48 CHAPTER 3. DESIGN OF A DCO
49
Chapter 4
TDC implementation in a ring
oscillator
Another very important analog intensive part is the Time To Digital converter or TDC. This
is a part of the phase detector, determining what the phase di↵erence is between the reference
clock and the output variable clock. At every rising edge of the reference clock, the counter
knows how many times the variable clock has ticked, or what the integer part of the variable
phase �V is. Only integer phase detection will however not be enough for adequate phase noise
levels. It is necessary to estimate the the fractional part of the phase di↵erence. This is done by
measuring the time di↵erence between the rising edges of the variable clock and the reference
clock.
4.1 Basic TDC
The goal of the TDC is to measure the time di↵erence Td between the rising edge of signal A
and B from figure 4.1. The result should be outputted in a digital format, hence time to digital
converter. The basic principle is to delay the signal A using a delay cell with delay TD, or the
resolution time of one delay element. Then the signal B is used to sample this delayed version
of A. If the output of the sampler equals 1, it means the delayed signal A has already switched
and the time di↵erence is larger than TD. If the output is 0 however it means the delayed signal
A has not yet switched, while the signal A itself has. This means the time di↵erence between
50 CHAPTER 4. TDC IMPLEMENTATION IN A RING OSCILLATOR
A
B
Td
Figure 4.1: Time di↵erence between two signals
the two signals is less than TD. By using a whole array of N delay cells, a time di↵erence upto
NTD can be calculated, with a resolution Tr of TD. The output of the samplers is a digital
thermometer coded word that can be converted to the required output by the digital logic.
In the case of ADPLL, the necessary time di↵erence is the one between the rising edge of the
variable clock fDCO and the reference clock fref . This method is the one used in Staszewski’s
design [1]. The maximal resolution of TDC depends on the minimal time delay of an inverter in
a certain technology, this is a value around 20 ps. However, the better the resolution becomes,
the larger the array of elements has to be to cover a certain time range. The required number
DQ
C
TD TD TD TD
DQ
C
DQ
C
DQ
C
A A1 A2 A3 A4
Q1 Q2 Q3 Q4
B
Figure 4.2: Layout of a simple TDC delay line
4.2 Vernier delay line 51
DQ
C
TD TD TD TD
DQ
C
DQ
C
DQ
C
TS TS TS TS
A A1 A2 A3 A4
B B1 B2 B3 B4
Q1 Q2 Q3 Q4
Figure 4.3: Layout of a Vernier delay line
N of elements, given a time span of Tmax is given by
N =Tmax
Tr(4.1)
In the case of the ADPLL, the maximum time di↵erence between the variable and reference
clock is one complete clock period of the variable clock. At a minimal bluetooth frequency of
2.4GHz, this equals 416 ps. If delay cells are designed for a resolution of 20 ps this results in a
required array with length 21. Using such a long array has some downsides. The time delay of
an inverter is not perfect. There will be some uncertainty on this value. Using a long array will
accumulate this error towards the end, meaning the later delay cells insert a larger quantisation
error. Then the quantisation error will increase, degrading the performance of the entire system.
Short lines seem better but this means a smaller time di↵erence can be measured for the same
resolution.
4.2 Vernier delay line
A similar technique, used to get a time resolution higher than the resolution time of a single
inverter, is the so called Vernier Delay line, shown in figure 4.3. Instead of sampling the delayed
versions of A with the signal B as clock, the signal B is now itself delayed by an inverter chain
with delay TS [7]. If the pulse at A comes earlier than the transistion at B, the delay TS has to
be smaller than TD. The signal A is delayed with an inverter chain with delay TD. The delayed
signal An, after a delay of n cells, is now sampled with the delayed signal Bn. Figure 4.4 shows
this principle. A Vernier delay line will measure the time it takes for pulse B to catch up with an
earlier pule A. The output of the flipflops is also thermometer coded. For example, in the case
52 CHAPTER 4. TDC IMPLEMENTATION IN A RING OSCILLATOR
of figure 4.4, the clock B has catched up with the signal A after the sixth delay. The ouput bit
format would be :“11111000...”. The resolution of a Vernier delay line can now be calculated.
Call n the number of the element where B has catched up with A, it is the number outputted in
thermometer format by the flipflops. Looking at figure 4.4 this leads to the following equations:
nTD > nTS +Td
(n – 1)TD < (n – 1)TS +Td
=) (n – 1)(TD – TS) < Td < n(TD – TS)
(4.2)
This means the resolution of of the Vernier delay cell equals the delay di↵erence of the two cells
used. The Vernier line su↵ers the same problems as the normal TDC. The higher the resolution
becomes, the longer the needed delay chain becomes to cover a certain range. An additional
problem with a vernier line is however that the total delay of the system increases as well. It
takes a time NTS before all flipflops have there final value. This means the system has to wait
this long before it can process the information. Depending on the application this might not
be acceptable. In practise, when both a high resolution and a long range are required, two
TDC’s might be used. A first one has a long range and a very coarse resolution. A second
TDC will than narrow down the time di↵erence. The second one has a fine resolution and a
range comparable to the resolution of the first TDC [8]. The method used in this work has some
similarities to this concept.
4.3 Implementation in a ring oscillator
4.3.1 Coarse fractional phase estimation
In the case of an ADPLL, a TDC is used to estimate the time di↵erence between the rising
edge of the variable output clock and the reference clock. This di↵erence has to be interpreted
as a phase di↵erence. Because of the use of a ring oscillator, di↵erent phase-flanks are already
available. The oscillator discussed in chapter 3 will be used. By sampling the output of every
delay cell, a phase resolution of 18 can be obtained. The di↵erential delay elements are simplified
to single non-inverting delay cells, the wire inversion is represented by the fact that the input of
the first cell is opposite to the output of the last cell. Since di↵erential delay cells are used, this
is in fact an accurate representation, by simple wire inversion of certain outputs this behaviour
is obtained. At the instance the variable output clock tranistions to 1, the fractional output
4.3 Implementation in a ring oscillator 53
Td
A B
A1 B1
TD TS
A2 B2
A3 B3
A4 B4
A5 B5
B6 A6
t
Figure 4.4: Propagation of signal and clock signal in the Vernier delay line
54 CHAPTER 4. TDC IMPLEMENTATION IN A RING OSCILLATOR
D3 D0 D1 D2 D3
C
DQQ0
C
DQ Q1
C
DQ Q2
C
DQ Q3
fDCO
fref
Figure 4.5: Model for coarse phase detection in the ring
Binary output fractional phase
1111 08
0111 18
0011 28
0001 38
0000 48
1000 58
1100 68
1110 78
Table 4.1: Conversion between sampler output and fractional phase part
phase = 0. This transition means the delay cell with index 3 has just switched from 0 to1. The
outputs of all other cells is also 1. A delay TD later, cell 0 will switch from 1 to 0. When this
has happened the fractional part of the output phase equals 18 . Another delay TD later the
second cell will switch to 0 and so one. By sampling every output of the output DCO clock, the
fractional phase can be determined with a resolution of 18 . The oscillator itself behaves as a delay
line of a TDC structure. Figure 4.6 shows what happens in the time domain during sampling.
Depending on the location of the reference clock rising edge, 8 di↵erent sampler outputs are
possible .
4.3.2 Fine fractional phase estimation
A phase quantisation of 1/8 is still too coarse in many applications. As a solution for finer
tuning, in this thesis a variation of the Vernier delay line will be used, where one of the delay
4.3 Implementation in a ring oscillator 55
t
D3 = fDCO
t
t
t
D0
D1
D2
Rising edge refernce clock
�frac
Figure 4.6: Sampling the ring phase
56 CHAPTER 4. TDC IMPLEMENTATION IN A RING OSCILLATOR
t
D3 = fDCO
t
t
t
D0
D1
D2
Last switched edge
�frac
Nex switching edge
TD
�fine�coarse
Figure 4.7: Di↵erence between the coarse and fine fractional phase part
lines is the oscillator itself. The information from the coarse phase detection provides the digital
control logic with the knowledge which cell is switching at the rising edge of the reference clock.
The situation is presented in figure 4.7. To increase the phase information further, a method
is needed to estimate the time di↵erence between the last switched edge within the ring, and
the reference clock. In the example of figure 4.7, the coarse phase detection has detected that
the last clock transition was a rising edge of D0. Now the fine phase estimator will try to
determine the time di↵erence between this rising edge of D0 and the reference clock. This will
be implemented using a Vernier delay line. The last switched output in the ring represents the
signal A from figure 4.4, the reference clock is the sampling clock or the signal B. This last
clock transition is already connected to a delay array, namely the ring itself, with delay time
TD. There are 4 subsequent cells before the clock transition is back at it’s starting point. In a
first design, a 4 stage Vernier line will be used. Each delay element in the ring is sampled with
the delayed clock. 4 Delay cells are designed with delay time TS, delaying the reference clock.
The output of the Delay line is thermometer coded and has 5 possible outputs. Ranging from
“0000” to “1111”. The timing range of TD between two subsequent transitions in the ring has
to be divided into 5 equal part, meaning a time resolution of 15TD is required. In the previous
section it is shown that the frequency resolution of a Vernier line equals TD – TS so this leads
4.3 Implementation in a ring oscillator 57
to the following requirement for TS:
(TD – TS) =TD
5
=) TS =4
5TD
(4.3)
However, the delay TD is not fixed. For a bluetooth application for example with frequency
between 2.4 and 2.5GHz, the value of TD varies between 50 ps and 52 ps. However if the center
value of 51 ps is used, this leads to
TS = 40.8 ps.
In an ideal case TS should be made tunable together with the ring tuning, but since only a small
tuning range is used relative to the frequency, one could opt to take TS fixed and accept a small
error when fDCO deviates from its center value. Because the array is short, error propagation is
not that much of an issue.
Unfortunately, when the reference clock rising edge occurs, any of the 4 delay cells can be the
the one switching. So any of the 4 cells can have to be the one at the start of the Vernier delay
line. For now, the only solution to e↵ectively take care of this problem is to sample every cell
output with every delay of the clock, schematically shown in figure 4.8. TDCik in the figure
means the k’th bit of the Vernier line output with Di as input signal. The performed algorithm
to determine the actual fractional phase error is then as follows:
• Use the 4 bits Q to determine what cell has just switched latest. This determines the
coarse phase error �coarse as stated in table 4.1. E.g. if the detected output pattern is
“0011”, this means the second cell has just switched form 1 to 0 and the third cell is now
in the process of switching. The coarse fractional phase is2
8.
• If cell i has just switched, select the i’th Vernier line to determine the fine fractional phase.
For example, using the previous case, the second cell has switched, so a multiplexer selects
the second Vernier line.
• Convert the 4 bits from the selected delay line from thermometer coding to the desired
digital format in units of 1/5, using table 4.2. If there is an ambiguity, use the actual bit
level, element i has switched to, to determine the correct fine fractional part. E.g. If the
output of the second delay line is “0000”, the fractional phase might either be0
5or
4
5.
Because the the second delay element switched from 1 to 0, an output of 0000 means the
delayed clock pulse did not overtake the delayed signal after the fourth delay. The fine
fractional part equals4
5.
58 CHAPTER 4. TDC IMPLEMENTATION IN A RING OSCILLATOR
D3 D0 D1 D2 D3
C
DQ
Q0
C
DQ
Q1
C
DQ
Q2
C
DQ
Q3
fDCO
fref
C
DQ
TDC00
C
DQ TDC01
C
DQ TDC02
C
DQ TDC03
TDC13
TDC23
TDC33
C1 C2 C3 C4fref TS TS TS TS
C
DQ
TDC10
C
DQ
C
DQ
C
DQ
C
DQ
TDC20
C
DQ
C
DQ
C
DQ
C
DQ
TDC30
C
DQ
C
DQ
C
DQ
D0
D0
D0
D0
D1
D1
D1
D1
D2
D2
D2
D2
D3
D3
D3
D3
C1 C2 C3 C4
C1 C2 C3 C4
C1 C2 C3 C4
C1 C2 C3 C4
TDC11
TDC21
TDC31
TDC12
TDC22
TDC32
Figure 4.8: Full TDC implementation with Vernier lines
4.3 Implementation in a ring oscillator 59
Binary output fine fractional phase
0000/1111 05
0001/1110 15
0011/1100 25
0111/1000 35
1111/0000 45
Table 4.2: Conversion between Vernier line output and the fine fractional phase part
• Add the coarse and the fine phase to end up with the actual fractional phase. E.g.2
8+
4
40=
14
40
It is clear that two di↵erent Vernier outputs result in the same phase. This is because the phase
represents the time di↵erence between a transition of a delay element in the ring and the rising
edge of the reference clock. This transition can be either positive or negative however. But only
in the two extreme cases is there an ambiguity that can be solved by knowing the transition
direction.
60 CHAPTER 4. TDC IMPLEMENTATION IN A RING OSCILLATOR
61
Chapter 5
Digital implementation
Finally in this chapter the digital control logic is discussed. It converts the information from
the phase detector and the Frequency Control Word to the digital tuning words for the DCO.
In the previous sections, digital words where assumed to represent integer number without any
resolution or finite range. O↵ course in a practical implementation all numbers are represented
by a finite number of bits. E↵ects of possible rounding and saturation/overflow will have to
be investigated. The actual digital design was made in verilog, which allows for mixed signal
analysis together with the analog blocks in Cadence.
5.1 Clock retiming
In [1] and other references (e.g. [9]), the principle of clock retiming is used. This is because the
ADPLL is a multirate system with two clock domains that are not synchronous. This means
care has to be taken to avoid metastability. Metastability occurs when a value is sampled at
the moment it is in an undefined logical area. In this case in can take a very long time for the
output of this block to reach a legal value itself, theoretically infinitely long. Clock retiming
means that the reference clock is resampled so that is coincides with the next rising edge of
the variable clock fDCO, the principle is shown in figure 5.1. However this mechanism is itself
vulnerable for metastability issues. In this work however, it will be attempted to avoid clock
resampling. In other works, the possibility to avoid clock retiming in di↵erent ways has also
been discussed [11]. At instances where metastability may occur special care will have to be
62 CHAPTER 5. DIGITAL IMPLEMENTATION
fDCO
fref
fretime
Figure 5.1: Principle of reference clock retiming
taken to avoid this. When clock retiming is used, an issue that can arise is the fact that the
system is clocked with a non uniform clock. Also according to [11] this can cause spurs in the
output phase noise due to coupling on chip. In thesis, the reference clock will be delayed a
certain fixed time before it triggers the digital logic. This way, input signals coming from the
analog parts of the design should have enough time to reach there final value, especially when it
is made sure that metastability is avoided there too. This delayed clock used in the system will
be denoted as fsys. Anyway, if would appear that metastability is an issue, the clock retiming
can be added to this circuit, all other blocks would remain the same.
5.2 Phase acquisition
The frequency control in the ADPLL is simply done by generating a FCW, or Frequency Control
Word and applying this to the digital block. Generating the reference phase �ref is done by
accumulating FCW. The output frequency is between 2.4 and 2.5GHz. With the reference
frequency of 50MHz, this means FCW can take a value between 48 and 50. An o↵set of 47 is
introduced. This means FCW can take a value between 1 and 3, and less bits are required. 4
bits will be used for the integer part of FCW (2 would su�ce). Of course, the most important
part is the fractional tuning of the output frequency. For this an additional 15 fractional bits
are used. Leading to a possible frequency resolution of
50MHz · 2–15 = 1.5 kHz. (5.1)
Which is higher than the frequency resolution of the used DCO. A standard accumulator accu-
mulates this FCW every fsys, generating the signal �ref . Phase is a continuously rising function.
5.2 Phase acquisition 63
This is not possible with actual digital words. Therefore a limited number of integer bits is
chosen. 8 in this case. This allows the reference phase to have a value between 0 and 255. The
accumulator will act as a modulo operator, when it reaches the maximal value it rolls over to
the minimal. This leads to the following equation for the reference phase:
�ref(n) = mod(�ref(n – 1) + FCW, 219). (5.2)
The variable phase �dco is a bit more complicated because it consists out of a an integer part
and a fractional part. This fractional part is itself composed out of a coarse and a fine part.
Only the integer part is accumulated. This is because the exact number of integer variable clock
transitions is known exactly, only the current fractional part between the current reference flank
and the previous variable transition is needed. The integer phase word PHI is accumulated by
a counter clocked by the variable clock fdco
PHI, internal(k) = PHI, internal(k – 1) + 1. (5.3)
This internal value is not o↵ered to the digital control logic, only when fref triggers will an
output value be generated. If between times k=N and k=N+1 the reference clock becomes high,
the value PHI, internal(N) is exported as value. However this will not happen at the rising edge
of fref , but at the next rising edge of the variable clock. The reference clock will only toggle an
internal variable telling the counter to output its current value and reset the internal counter.
It is here that metastability can occur, but the use of this one internal variable should improve
the situation. When rising edge of fref and fdco coincides it is possible that the internal variable
is in an undefined state at the time the counter tries to decide if it should count the rising DCO
edge, or reset and output its current value. However the problem lies only with this one bit
variable. There will never be a situation when the counter value has to be outputted before all
values have reached a stable state. With good circuit design it should be possible to always force
this internal bit in to a defined state, during coinciding flanks its precise value does not matter.
The counter will either decide the variable clock phase comes first or the other way around.
Figure 5.2 showes a state model of the digital counter. It also mentiones the output state, which
will be used later to solve mismatch between the integer counter and fractional TDC. The delay
between fsys and fref must be high enough to make sure PHI is stable at the moment the digital
logic is triggered by fsys. There might still be a mismatch between the integer phase counter
and the fractional TDC. When the reference clock is very close to a rising edge of the DCO it
is possible that the edge is counted by the counter, while the TDC registers the the clock to
come after the DCO pulse. The counter and TDC function independent of each other so there
is no way to avoid this due to small mismatches. In that case the fractional phase error will
64 CHAPTER 5. DIGITAL IMPLEMENTATION
Start counter at P int = 0
Wait for clock transistion
state int=0, update=0
Rising edge fdco
falling edge fdcoRising edge fref
Is update =1?
yes no
P int++Output=P int
P int=1OutputState=state int
state int=0
Is update =1?
yes
state int=1
update=1
no
Figure 5.2: State model of the integer phase counter
5.3 Phase error 65
be 3940 , while it should be equal to 0. That is why the counter outputs the clock state. This
state was the clock level registered by the counter when the reference clack transitioned. If it
is 0, the last DCO transition was not yet counted, if it is 1, it was. If the clock state equals 1
while the fractional part equals 39, a mismatch is detected and a correction block can change
the fractional phase to 0.
At the rising edge of fsys the integer part of the variable DCO phase is updated:
�dco(n) = �dco(n – 1) + PHI(n) (5.4)
PHI is also an 8 bit integer word.
5.3 Phase error
Once the reference and variable phase are updated at the rate of fsys, a large part of the rest of
the system can be completely combinatorial. The fractional phase error is only bu↵ered at fref ,
no accumulation. The phase error can now be calculated, it is given by
��(n) = �ref(n) – PHI(n) – �frac(n). (5.5)
Up to now the phase noise was expressed as a value representing 1/40 of the phase. This value
represents the upper bound of the possible interval, to minimize the quantisation error, the phase
error should be quantised to the value in the middle of the interval. Or the current value should
be subtracted with 1/80. The fractional phase will also be converted to a fixed point decimal
bit pattern with 15 fractional bits. This will introduce a very small rounding error. Both �ref
and PHI are unsigned cyclic numbers. The phase error however is a signed, fixed number. If
these two unsigned numbers are subtracted, and the result is interpreted as a two complement
signed number, this result is the corresponding phase error. It ranges from –128 to 127. This is
thus the maximal range of the phase error.
Next the phase error is filtered. This might either be just a multiplication with a constant
parameter, resulting in a type I PLL. Or an integrating filter might be used, resulting in a type
II loop. For a good implementation it would be best that the multiplicative factors are a power
of 2. In that case division can be performed by a simple bit shift, saving a lot of hardware
and proccesing power. The filtered phase error is the input for a normalised DCO called the
normalised tuning word (NTW). This means the NTW has to be divided by the DCO gain
66 CHAPTER 5. DIGITAL IMPLEMENTATION
KDCO. In the acquisition mode, this is about 1 MHz, meaning KDCO = 50. Eventually this
scaled NTW, called the OTW, is truncated, leaving only the 8 most significant bits. This is the
coarse tuning word and forms the output of the digital block, closing the loop.
5.4 Switching state
In the acquisition mode, the output frequency will track the desired frequency band. When it
reaches this frequency it will oscillate around it. If the OTW remains equal, or oscillates around
the same value, the ADPLL has to switch to tracking mode. There are di↵erent possibilities
to implement it, called gear shifting [1, p.142]. All modes have in common that the acquisition
tuning word is locked. Only the tracking tuning word will change from now on. One possibility
is the so called zero-phase restart. During mode shifting, this resets the input phase �ref and PHI
and at the same time changes the filter coe�cients. In this work was not chosen to implement
this method, but a method based on Autonomous gear shifting. This does not modify the value
of the accumulated phases. The principle is as follows. The fixed acquisition NTW is converted
back to a (filtered) phase error. The actual phase error is now subtracted by this value. What
remains is the information that was lost when the bits of NTW where punctuated. This new
phase error can now be filtered by a di↵erent loop constant, resulting in the tracking NTW,
that has to be scaled by the KDCO of the oscillator in tracking mode. In this case KDCO in the
tracking mode is50MHz
20 kHz= 2500
67
Chapter 6
Experimental results and conclusion
In this chapter result will be discussed that were obtained by simulating the previously dis-
cussed design. Results will be compared with theoretical phase noise analysis from chapter 2.
This will validate the correctness of the theoretical analysis and show remaining non-idealities.
Simulations will be done for a few di↵erent values of the loop filter. At first simulations will be
done without DCO phase noise to investigate the e↵ect of the loop on TDC quantisation noise.
Finally a simulation included DCO phase noise in performed.
6.1 PLL tracking
When a digital control word FCW is applied to the digital logic, the ADPLL output frequency
has to evolve towards this value. The validate this fact, a proportional loop filter ↵ = 0.5 is first
selected. Figure 6.1 shows the ADPLL output frequency in this case. The FCW is chosen equal
to72090
215= 2.2
Added with the o↵set of 47 the FCW represents a value of 49.2. With the reference frequency of
50MHz, this has to result in a desired output frequency of 2.46GHz. As the figure demonstrates,
the loop quickly tracks the desired frequency due to the high bandwidth. When the output is
close enough to the target frequency, the ADPLL goes into tracking mode. On figure 6.2 a detail
is shown of the output frequency during tracking mode Both figures immediately show the short
strong spurs that might occur during switching. This due to unideal switching in the tuning
circuit.
68 CHAPTER 6. EXPERIMENTAL RESULTS AND CONCLUSION
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 12440
2460
2480
2500
2520
2540
2560
2580
2600
2620
time (µs)
DCO
freq
uen
cy(M
Hz)
Figure 6.1: Instantanious output frequency of the ADPLL
0.5 0.55 0.6 0.65 0.7
2458.5
2459
2459.5
2460
2460.5
2461
2461.5
2462
2462.5
2463
time (µs)
DCO
freq
uen
cy(M
Hz)
Figure 6.2: Detail of the output frequency during tracking mode
6.2 Phase noise 69
2 2.2 2.4 2.6 2.8 3−0.015
−0.01
−0.005
0
0.005
0.01
0.015
0.02
time (µs))
Phase
dev
iation
Figure 6.3: Phase error in the output phase
6.2 Phase noise
To plot the phase noise spectrum, the information of output clock transitions has to be converted
to phase noise information. A module in the simulation test bench registers the time instances
of DCO clock transitions and writes these to a file. During lock, the output should ideally be
constant in frequency and these time instances have to form a linear rising function. The index
of a time instance represents the phase. However when phase noise is present these times will
not be on a straight line. The clock transistions of an ideal oscillator will be estimated by fitting
the best curve through the found times. This line has a slope 1fdco
. When the timestep at index
i is divided by this slope, the actual phase at time i is found. By substracting this value with
the ideal phase at index i (which is i) the phase error is found.
6.2.1 Proportional filter ↵=0.5
For the case of proportional loop filter this output phase error is plotted in figure 6.3. There
appears to be an oscillating behaviour in the output phase error, caused by the TDC resolution.
In analytical calculation, TDC noise was assumed to be completely random, but when the output
frequency is constant there appears to be some correlation between subsequent TDC quantisation
errors. This periodic disturbance should be seen as spurs in the output phase spectrum with
70 CHAPTER 6. EXPERIMENTAL RESULTS AND CONCLUSION
106
107
108
109
−180
−170
−160
−150
−140
−130
−120
−110
−100
−90
−80
−70
frequency (Hz)
PSD
(dBc)
SimulatedAnalytical
Figure 6.4: PSD of the phase noise for a proportional loop ↵ = 0.5
a frequency around 20MHz. This spectrum is shown in figure 6.4. When compared with the
theoretical phase noise, using the analysis from chapter 2, it appears that the simulated curve
is a few dBc above the theoretical result. An explanation could be that the TDC is not perfect
and the quantisation error is higher than the 1/40. The spurs at multiples of 20MHz are also
not modelled by this analysis.
6.2.2 Proportional filter ↵=1/256
An often used proprotional faction is 2–8. It results in a lot of TDC-noise suppression. When
a very good oscillator is used with low phase noise, this is indeed a good method to suppress
the TDC noise and reach very low phase noise levels for high demanding communication system
such as GSM. Figure 6.5 compares the simulation with the theoretical curve. The curves have
the same slope, and TDC phase noise is indeed suppressed a lot. However the di↵erence between
the two values is a lot higher than in the previous case. There also is a phase noise floor of
around –160 dBc. This might be caused by the fact that the time instances of the DCO clock
have a finite resolution in the simulation. This would introduce an output phase quantisation
error that can be modelled as white noise. Because of the high suppression however, spurs
have nearly disappeared. This might indicate that using a very high bandwidth and focus on
suppressing DCO noise might not be the ideal option. High suppression of TDC noise might
also be necessary to suppress spurs.
6.2 Phase noise 71
106
107
108
109
−180
−170
−160
−150
−140
−130
−120
−110
−100
−90
−80
−70
frequency (Hz)
PSD
(dBc)
SimulatedAnalytical
Figure 6.5: PSD of the phase noise for a proportional loop ↵ = 2–8
6.2.3 Type II loop
Finally the e↵ect of an integrating filter is examined in simulation. The used loop filter is
H(z) = 0.8 +0.1
z – 1. (6.1)
The resulting shaping of TDC noise is shown in figure 6.6. The result is similar as in figure 6.4,
and again there is the same o↵set between the two curves. Again spurs are present and not well
suppressed.
6.2.4 DCO noise
The biggest advantage of the type II loop is that it limits DCO phase noise at low frequencies.
As stated before, the actual noise performance of the DCO is disastrous. When extracting the
noise level and use it to model the DCO noise source, it completely dominates other noise sources
as shown in figure 6.7. The used DCO phase noise used in the model is
LDCO(f) =N0
f2
N0 = 60dBc(6.2)
This results in open loop DCO phase noise of –80 dBc at an o↵set 10MHz. The theoretical
total phase noise is now compared with the simulated value in figure 6.7. With the correct
72 CHAPTER 6. EXPERIMENTAL RESULTS AND CONCLUSION
106
107
108
109
−160
−150
−140
−130
−120
−110
−100
−90
−80
−70
frequency (Hz)
PSD
(dBc)
SimulatedAnalytical
Figure 6.6: Phase error PSD
noise level, the e↵ect of the ADPLL loop on output noise is nicely modelled. But as expected,
even with such a high bandwidth, there is a peak of 80 dBc. The oscillator is not suited for
communication purposes. Because of the integrating loop filter, phase noise starts to decrease
for lower frequency o↵sets, this behaviour is also clearly observed in the simulation.
6.3 Conclusions and further work
In this thesis, theoretical multirate models were developed.Experimental verification confirms
there validity, as injected TDC quantisation noise is indeed shaped as expected. DCO phase
noise is also shaped but has a very high value. However the theoretical model is correct and
predicts the DCO noise shaping. The used ring oscillator should be completely redesigned,
optimised for low phase noise, determined by the application. The behaviour of the integrated
TDC within the ring is quite good. However the simulated result is a few dB higher, what could
indicate that the TDC is less accurate than modelled. But indeed timing mismatches would
have this e↵ect. The TDC resolution budget is not yet depleted however. Choosing a ring with
more stages can only bring down the quantisation error.
As a general conclusion, the introduced design behaves as modelled, but more work is needed
in further optimising the analog blocks, before an actual prototype can be developed.
6.3 Conclusions and further work 73
104
105
106
107
108
109
−180
−160
−140
−120
−100
−80
−60
frequency(Hz)
PSD
(dBc)
TDC noisetuning noiseDCO noiseTotal noise
Figure 6.7: Theoretical phase error for the type II loop, with actual DCO phase noise
106
107
108
109
−160
−150
−140
−130
−120
−110
−100
−90
−80
−70
frequency (Hz)
PSD
(dBc)
SimulatedAnalytical
Figure 6.8: Total phase noise, including DCO noise for a type II loop
74 CHAPTER 6. EXPERIMENTAL RESULTS AND CONCLUSION
BIBLIOGRAPHY 75
Bibliography
[1] Robert Bogdan Staszwski and Poras T. Balsara, All-Digital frequency synthesizer in deep-
submicron CMOS Wiley, New Jersey, 1st edition, 2006.
[2] John A. McNeills’c and David S.Ricketts, The designer’s guide to jitter in ring oscillators
Springer, New York, 2009.
[3] Asas A. Abidi, “Phase Noise and Jitter in CMOS Ring Oscillators”, IEEE journal of solid-
state circuits,vol.41, NO.8, p. 1803,1815. August 2006.
[4] Ioannis L. Syllaios and Poras T. Balsara, “Linear Time-Variant Modeling and Analysis”,
IEEE transactions on circuits and systems,vol.59, NO.11, p. 2495,2505. November 2012.
[5] Maarten Lietaert, “Chipontwerp van een gemengd analoog-digitale PLL voor frequentiesyn-
these”[dutch], M.S. thesis. ELIS,UGent,Gent.
2013
[6] Pieter Rombouts, “Geadvanceerd analoog ontwerp[dutch].
Ugent,2013.
[7] G.S. Jovanovic and M.K. Stojcev, “Vernier’s Delay Line Time–to–Digital Converter”, SCI-
ENTIFIC PUBLICATIONS OF THE STATE UNIVERSITY OF NOVI PAZAR, Faculty of
Electronic Engineering, Nis, Serbia, May 2009.
[8] Hongjin Kim, SoYoung Kim and Kang-Yoon Lee, “Low power FSK transmitter using all-
digital PLL for IEEE 802.15.4g application”, Analog Integr Circ Sig Process,vol.47,p. 599,612.
January 2013.
[9] Robert Bogdan Staszewski et al, “Spur-Free Multirate All Digital PLL for Mobile Phones in
65 nm CMOS”, IEEE journal of solid-state circuits,vol.46,p. 2904,2919. December 2011.
76 BIBLIOGRAPHY
[10] Prashanth Muppala, Saiyu Ren and George Yu-Heng Lee, “Design of high-frequency wide-
range all digital phase locked loop in 90 nm CMOS”, Analog Integrated Circ SIg Pro-
cess,vol.75,p.133,145. 2013.
[11] S. Mendel, “A phase-Domain All-Digital Phase-Locked Loop Architecture Without Refer-
ence Clock Retiming”, Circuits and Systems II,vol.56,no.11,p.860,864. November 2009.
[12] Ali Hajimiri and Thomas H. Lee, “A General Theory of Phase Noise in Electrical Oscilla-
tors”, IEEE journal of solid-state circuits,vol.33,no.2,p.179,194. February 1998.
[13] S. Mendel, “State-of-the-Art and Future Directions of High-Performance All-Digital
Frequency Synthesis in Nanometer CMOS”, IEEE transactions on circuits and sys-
tems,vol.58,no.7,p.1497,1510. July 2011.
LIST OF FIGURES 77
List of Figures
1.1 Basic principle of a frequency synthesiser . . . . . . . . . . . . . . . . . . . . . . 3
1.2 N-fractional PLL for frequency synthesis . . . . . . . . . . . . . . . . . . . . . . 3
1.3 PSD and charge pump used in many modern PLL’s . . . . . . . . . . . . . . . . 4
1.4 General schematic layout of the ADPLL . . . . . . . . . . . . . . . . . . . . . . 5
1.5 Relationship between DCO output and output phase . . . . . . . . . . . . . . . 6
1.6 Principle of fractional phase sampling at reference clock . . . . . . . . . . . . . . 6
2.1 ADPLL model in the phase domain . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.2 Loop magnitude response for ↵ = 0.1 . . . . . . . . . . . . . . . . . . . . . . . . 9
2.3 The ADPLL in the phase domain including the main noise sources, all sampled
at the reference frequency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.4 Comparison between the power spectra of a single frequency waveform. Both
with (right) and without (left) phase noise . . . . . . . . . . . . . . . . . . . . . 12
2.5 Real clock transitions vs. ideal clock . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.6 Di↵erent noise regions in oscillators . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.7 Transfer function for the DCO noise contribution to the output phase noise spec-
trum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
78 LIST OF FIGURES
2.8 Sampled output phase noise for di↵erent noise sources, proportional loop filter
with bandwidth ⇡ 1MHz( ↵=0.1 ) . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.9 Magnitude plot of an integrating loop filter: H(z) = 0.80.1
z – 1. . . . . . . . . . . 19
2.10 Magnitude of the noise transfer function for a type II ADPLL . . . . . . . . . . 19
2.11 Sampled output phase noise for di↵erent noise sources,type 2 loop filter, ↵ = 0.8
and � = 0.1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.12 Multirate block diagram of the ADPLL . . . . . . . . . . . . . . . . . . . . . . . 21
2.13 Multirate block diagram of the ADPLL with blocks replaced for full operation in
DCO-clock domain . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.14 The e↵ect of upsampling on the power spectra with N = 3 . . . . . . . . . . . . 22
2.15 Down-up conversion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.16 Simplification of the multirate model . . . . . . . . . . . . . . . . . . . . . . . . 24
2.17 Block diagram of the multirate ADPLL without ⌃�-modulation . . . . . . . . . 24
2.18 Magnitude bode plot of the multirate open loop transfer function HLG(z) . . . . 25
2.19 Phase noise in the multirate system . . . . . . . . . . . . . . . . . . . . . . . . . 27
3.1 Using a DAC to generate a control voltage for a standard analog VCO . . . . . 30
3.2 LC oscillator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.3 Standard single-ended ring oscillator . . . . . . . . . . . . . . . . . . . . . . . . . 32
3.4 Switching of an ideal inverter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
3.5 Ring oscillators with di↵erantial delay cells . . . . . . . . . . . . . . . . . . . . . 33
3.6 Fully di↵erential delay cell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
3.7 Latch structure concept of a pseudo-di↵erential delay cell . . . . . . . . . . . . . 34
LIST OF FIGURES 79
3.8 Circuit implementation of a pseudo-di↵erential delay-cell . . . . . . . . . . . . . 35
3.9 Layout of the tuning circuit in a di↵erential ring oscillator . . . . . . . . . . . . . 37
3.10 The tuning circuit of the ring oscillator . . . . . . . . . . . . . . . . . . . . . . . 38
3.11 Small signal model of the tuning circuit . . . . . . . . . . . . . . . . . . . . . . . 38
3.12 Principle of switched current sources for oscillator tuning . . . . . . . . . . . . . 40
3.13 Current through the nmos of a single delay cell . . . . . . . . . . . . . . . . . . . 41
3.14 Circuit implementation of the current tuning, including bias . . . . . . . . . . . . 42
3.15 Model of the switch in on (left) and o↵( right) state . . . . . . . . . . . . . . . . 42
3.16 Using a pmos switch to quickly charge the parasitic capacitance . . . . . . . . . . 43
3.17 Level shifter output bu↵er . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
3.18 Di↵erential output of two sequential delay cells . . . . . . . . . . . . . . . . . . . 45
3.19 Changing frequency of the DCO over the full range by changing the coarse input 45
3.20 Detail of the DCO tuning, showing coarse frequency resolution . . . . . . . . . . 46
3.21 Open loop output phase noise of the DCO . . . . . . . . . . . . . . . . . . . . . . 47
4.1 Time di↵erence between two signals . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.2 Layout of a simple TDC delay line . . . . . . . . . . . . . . . . . . . . . . . . . . 50
4.3 Layout of a Vernier delay line . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
4.4 Propagation of signal and clock signal in the Vernier delay line . . . . . . . . . . 53
4.5 Model for coarse phase detection in the ring . . . . . . . . . . . . . . . . . . . . . 54
4.6 Sampling the ring phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
4.7 Di↵erence between the coarse and fine fractional phase part . . . . . . . . . . . . 56
80 LIST OF FIGURES
4.8 Full TDC implementation with Vernier lines . . . . . . . . . . . . . . . . . . . . . 58
5.1 Principle of reference clock retiming . . . . . . . . . . . . . . . . . . . . . . . . . 62
5.2 State model of the integer phase counter . . . . . . . . . . . . . . . . . . . . . . . 64
6.1 Instantanious output frequency of the ADPLL . . . . . . . . . . . . . . . . . . . 68
6.2 Detail of the output frequency during tracking mode . . . . . . . . . . . . . . . . 68
6.3 Phase error in the output phase . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
6.4 PSD of the phase noise for a proportional loop ↵ = 0.5 . . . . . . . . . . . . . . . 70
6.5 PSD of the phase noise for a proportional loop ↵ = 2–8 . . . . . . . . . . . . . . . 71
6.6 Phase error PSD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
6.7 Theoretical phase error for the type II loop, with actual DCO phase noise . . . . 73
6.8 Total phase noise, including DCO noise for a type II loop . . . . . . . . . . . . . 73
LIST OF TABLES 81
List of Tables
1.1 Bluetooth specifications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
3.1 Transistor sizing of the delay cell in figure 3.8 . . . . . . . . . . . . . . . . . . . 36
4.1 Conversion between sampler output and fractional phase part . . . . . . . . . . . 54
4.2 Conversion between Vernier line output and the fine fractional phase part . . . . 59