Beruflich Dokumente
Kultur Dokumente
RAY OPTICS
AND OPTICAL
INSTRUMENTS
9.1 INTRODUCTION
Nature has endowed the human eye (retina) with the sensitivity to detect
electromagnetic waves within a small range of the electromagnetic
spectrum. Electromagnetic radiation belonging to this region of the
spectrum (wavelength of about 400 nm to 750 nm) is called light. It is
mainly through light and the sense of vision that we know and interpret
the world around us.
There are two things that we can intuitively mention about light from
common experience. First, that it travels with enormous speed and second,
that it travels in a straight line. It took some time for people to realise that
the speed of light is finite and measurable. Its presently accepted value
in vacuum is c = 2.99792458 108 m s1. For many purposes, it suffices
to take c = 3 108 m s1. The speed of light in vacuum is the highest
speed attainable in nature.
The intuitive notion that light travels in a straight line seems to
contradict what we have learnt in Chapter 8, that light is an
electromagnetic wave of wavelength belonging to the visible part of the
spectrum. How to reconcile the two facts? The answer is that the
wavelength of light is very small compared to the size of ordinary objects
that we encounter commonly (generally of the order of a few cm or larger).
In this situation, as you will learn in Chapter 10, a light wave can be
considered to travel from one point to another, along a straight line joining
Physics
them. The path is called a ray of light, and a bundle of such rays
constitutes a beam of light.
In this chapter, we consider the phenomena of reflection, refraction
and dispersion of light, using the ray picture of light. Using the basic
laws of reflection and refraction, we shall study the image formation by
plane and spherical reflecting and refracting surfaces. We then go on to
describe the construction and working of some important optical
instruments, including the human eye.
PARTICLE
MODEL OF LIGHT
9.2 REFLECTION
310
OF
LIGHT
BY
SPHERICAL MIRRORS
We are familiar with the laws of reflection. The angle of reflection (i.e., the
angle between reflected ray and the normal to the reflecting surface or
the mirror) equals the angle of incidence (angle between incident ray and
the normal). Also that the incident ray, reflected ray and the normal to
the reflecting surface at the point of incidence lie in the same plane
(Fig. 9.1). These laws are valid at each point on any reflecting surface
whether plane or curved. However, we shall restrict our discussion to the
special case of curved surfaces, that is, spherical surfaces. The normal in
311
Physics
The distance between the focus F and the pole P of the mirror is called
the focal length of the mirror, denoted by f. We now show that f = R/2,
where R is the radius of curvature of the mirror. The geometry
of reflection of an incident ray is shown in Fig. 9.4.
Let C be the centre of curvature of the mirror. Consider a
ray parallel to the principal axis striking the mirror at M. Then
CM will be perpendicular to the mirror at M. Let be the angle
of incidence, and MD be the perpendicular from M on the
principal axis. Then,
MCP = and MFP = 2
Now,
MD
MD
and tan 2 =
(9.1)
CD
FD
For small , which is true for paraxial rays, tan ,
tan 2 2. Therefore, Eq. (9.1) gives
tan =
MD
MD
=2
FD
CD
CD
or, FD =
(9.2)
2
Now, for small , the point D is very close to the point P.
Therefore, FD = f and CD = R. Equation (9.2) then gives
f = R/2
(9.3)
BF
FP
BA BF
( PM = AB)
(9.4)
BA
FP
Since APB = APB, the right angled triangles ABP and ABP are
also similar. Therefore,
or
BA B P
B A BP
Comparing Eqs. (9.4) and (9.5), we get
(9.5)
B F B P FP B P
(9.6)
FP
FP
BP
Equation (9.6) is a relation involving magnitude of distances. We now
apply the sign convention. We note that light travels from the object to
the mirror MPN. Hence this is taken as the positive direction. To reach
313
Physics
the object AB, image AB as well as the focus F from the pole P, we have
to travel opposite to the direction of incident light. Hence, all the three
will have negative signs. Thus,
B P = v, FP = f, BP = u
Using these in Eq. (9.6), we get
v f
f
v
u
v f
v
f
u
1 1
1
(9.7)
v u
f
This relation is known as the mirror equation.
The size of the image relative to the size of the object is another
important quantity to consider. We define linear magnification (m) as the
ratio of the height of the image (h) to the height of the object (h):
or
h
(9.8)
h
h and h will be taken positive or negative in accordance with the accepted
sign convention. In triangles ABP and ABP, we have,
m=
BA
BP
BA
BP
With the sign convention, this becomes
h
h
so that
v
u
h
v
(9.9)
h
u
We have derived here the mirror equation, Eq. (9.7), and the
magnification formula, Eq. (9.9), for the case of real, inverted image formed
by a concave mirror. With the proper use of sign convention, these are,
in fact, valid for all the cases of reflection by a spherical mirror (concave
or convex) whether the image formed is real or virtual. Figure 9.6 shows
the ray diagrams for virtual image formed by a concave and convex mirror.
You should verify that Eqs. (9.7) and (9.9) are valid for these cases
as well.
m=
314
FIGURE 9.6 Image formation by (a) a concave mirror with object between
P and F, and (b) a convex mirror.
EXAMPLE 9.1
Solution You may think that the image will now show only half of the
object, but taking the laws of reflection to be true for all points of the
remaining part of the mirror, the image will be that of the whole object.
However, as the area of the reflecting surface has been reduced, the
intensity of the image will be low (in this case, half).
Example 9.2 A mobile phone lies along the principal axis of a concave
mirror, as shown in Fig. 9.7. Show by suitable diagram, the formation
of its image. Explain why the magnification is not uniform. Will the
distortion of image depend on the location of the phone with respect
to the mirror?
EXAMPLE 9.2
FIGURE 9.7
Solution
The ray diagram for the formation of the image of the phone is shown
in Fig. 9.7. The image of the part which is on the plane perpendicular
to principal axis will be on the same plane. It will be of the same size,
i.e., BC = BC. You can yourself realise why the image is distorted.
Example 9.3 An object is placed at (i) 10 cm, (ii) 5 cm in front of a
concave mirror of radius of curvature 15 cm. Find the position, nature,
and magnification of the image in each case.
Solution
The focal length f = 15/2 cm = 7.5 cm
(i) The object distance u = 10 cm. Then Eq. (9.7) gives
1
v
or
1
10
v=
1
7. 5
10 7.5
2 .5
= 30 cm
v
( 30)
3
u
( 10)
The image is magnified, real and inverted.
Also, magnification m =
EXAMPLE 9.3
The image is 30 cm from the mirror on the same side as the object.
315
Physics
(ii) The object distance u = 5 cm. Then from Eq. (9.7),
1
v
1
5
v=
EXAMPLE 9.3
or
1
7.5
5 7 .5
7.5 5
15 cm
v
15
3
u
( 5)
The image is magnified, virtual and erect.
Magnification m =
fu
u
for u = 39 m, v
EXAMPLE 9.4
39 34 1365 1360
5
1
m
40 35
1400
1400 280
Therefore, the average speed of the image when the jogger is between
39 m and 34 m from the mirror, is (1/280) m s1
Similarly, it can be seen that for u = 29 m, 19 m and 9 m, the
speed with which the image appears to move is
1
1
1
m s 1 ,
m s 1 and
m s 1 , respectively.
150
60
10
Although the jogger has been moving with a constant speed, the speed
of his/her image appears to increase substantially as he/she moves
closer to the mirror. This phenomenon can be noticed by any person
sitting in a stationary car or a bus. In case of moving vehicles, a
similar phenomenon could be observed if the vehicle in the rear is
moving closer with a constant speed.
9.3 REFRACTION
316
n 21
(9.10)
1
n 21
(9.11)
Physics
318
EXAMPLE 9.5
Example 9.5 The earth takes 24 h to rotate once about its axis. How
much time does the sun take to shift by 1 when viewed from
the earth?
Solution
Time taken for 360 shift = 24 h
Time taken for 1 shift = 24/360 h = 4 min.
SNELLS
LAW
Consider a rectangular swimming pool PQSR; see figure here. A lifeguard sitting at G
outside the pool notices a child drowning at a point C. The guard wants to reach the
child in the shortest possible time. Let SR be the
side of the pool between G and C. Should he/she
take a straight line path GAC between G and C or
GBC in which the path BC in water would be the
shortest, or some other path GXC? The guard knows
that his/her running speed v1 on ground is higher
than his/her swimming speed v2.
Suppose the guard enters water at X. Let GX =l1
and XC =l 2. Then the time taken to reach from G to
C would be
t
l1
v1
l2
v2
v1
v2
In the case of light v1/v2, the ratio of the velocity of light in vacuum to that in the
medium, is the refractive index n of the medium.
In short, whether it is a wave or a particle or a human being, whenever two mediums
and two velocities are involved, one must follow Snells law if one wants to take the
shortest time.
319
Physics
This is called total internal reflection. When
light gets reflected by a surface, normally
some fraction of it gets transmitted. The
reflected ray, therefore, is always less intense
than the incident ray, howsoever smooth the
reflecting surface may be. In total internal
reflection, on the other hand, no
transmission of light takes place.
The angle of incidence corresponding to
an angle of refraction 90, say AO3N, is
called the critical angle (ic ) for the given pair
of media. We see from Snells law [Eq. (9.10)]
FIGURE 9.12 Refraction and internal reflection
of rays from a point A in the denser medium
that if the relative refractive index is less
(water) incident at different angles at the interface than one then, since the maximum value
with a rarer medium (air).
of sin r is unity, there is an upper limit
to the value of sin i for which the law can be satisfied, that is, i = ic
such that
sin ic = n 21
(9.12)
For values of i larger than ic, Snells law of refraction cannot be
satisfied, and hence no refraction is possible.
The refractive index of denser medium 1 with respect to rarer medium
2 will be n12 = 1/sin ic. Some typical critical angles are listed in Table 9.1.
TABLE 9.1 CRITICAL ANGLE OF SOME TRANSPARENT MEDIA WITH RESPECT TO AIR
Substance medium
Refractive index
Critical angle
Water
1.33
48.75
Crown glass
1.52
41.14
1.62
37.31
Diamond
2.42
24.41
320
All optical phenomena can be demonstrated very easily with the use of a
laser torch or pointer, which is easily available nowadays. Take a glass
beaker with clear water in it. Stir the water a few times with a piece of
soap, so that it becomes a little turbid. Take a laser pointer and shine its
beam through the turbid water. You will find that the path of the beam
inside the water shines brightly.
Shine the beam from below the beaker such that it strikes at the
upper water surface at the other end. Do you find that it undergoes partial
reflection (which is seen as a spot on the table below) and partial refraction
[which comes out in the air and is seen as a spot on the roof; Fig. 9.13(a)]?
Now direct the laser beam from one side of the beaker such that it strikes
the upper surface of water more obliquely [Fig. 9.13(b)]. Adjust the
direction of laser beam until you find the angle for which the refraction
FIGURE 9.14 (a) A tree is seen by an observer at its place when the air above the ground is
at uniform temperature, (b) When the layers of air close to the ground have varying
temperature with hottest layers near the ground, light from a distant tree may
undergo total internal reflection, and the apparent image of the tree may create
321
an illusion to the observer that the tree is near a pool of water.
Physics
(ii) Diamond : Diamonds are known for their
spectacular brilliance. Their brilliance
is mainly due to the total internal
reflection of light inside them. The critical
angle for diamond-air interface ( 24.4)
is very small, therefore once light enters
a diamond, it is very likely to undergo
total internal reflection inside it.
Diamonds found in nature rarely exhibit
the brilliance for which they are known.
It is the technical skill of a diamond
cutter which makes diamonds to
sparkle so brilliantly. By cutting the
diamond suitably, multiple total
internal reflections can be made
to occur.
(iii) Prism : Prisms designed to bend light by
FIGURE 9.15 Prisms designed to bend rays by
90 or by 180 make use of total internal
90 and 180 or to invert image without changing
reflection [Fig. 9.15(a) and (b)]. Such a
its size make use of total internal reflection.
prism is also used to invert images
without changing their size [Fig. 9.15(c)].
In the first two cases, the critical angle ic for the material of the prism
must be less than 45. We see from Table 9.1 that this is true for both
crown glass and dense flint glass.
(iv) Optical fibres: Now-a-days optical fibres are extensively used for
transmitting audio and video signals through long distances. Optical
fibres too make use of the phenomenon of total internal reflection.
Optical fibres are fabricated with high quality composite glass/quartz
fibres. Each fibre consists of a core and cladding. The refractive index
of the material of the core is higher than that of the cladding.
When a signal in the form of light is
directed at one end of the fibre at a suitable
angle, it undergoes repeated total internal
reflections along the length of the fibre and
finally comes out at the other end (Fig. 9.16).
Since light undergoes total internal reflection
at each stage, there is no appreciable loss in
the intensity of the light signal. Optical fibres
FIGURE 9.16 Light undergoes successive total
are fabricated such that light reflected at one
internal reflections as it moves through an
side of inner surface strikes the other at an
optical fibre.
angle larger than the critical angle. Even if the
fibre is bent, light can easily travel along its
length. Thus, an optical fibre can be used to act as an optical pipe.
A bundle of optical fibres can be put to several uses. Optical fibres
are extensively used for transmitting and receiving electrical signals which
are converted to light by suitable transducers. Obviously, optical fibres
can also be used for transmission of optical signals. For example, these
are used as a light pipe to facilitate visual examination of internal organs
322
like esophagus, stomach and intestines. You might have seen a commonly
MN
OM
tan NCM =
MN
MC
tan NIM =
MN
MI
323
Physics
LIGHT SOURCES AND PHOTOMETRY
It is known that a body above absolute zero temperature emits electromagnetic radiation.
The wavelength region in which the body emits the radiation depends on its absolute
temperature. Radiation emitted by a hot body, for example, a tungsten filament lamp
having temperature 2850 K are partly invisible and mostly in infrared (or heat) region.
As the temperature of the body increases radiation emitted by it is in visible region. The
sun with temperature of about 5500 K emits radiation whose energy versus wavelength
graph peaks approximately at 550 nm corresponding to green light and is almost in the
middle of the visible region. The energy versus wavelength distribution graph for a given
body peaks at some wavelength, which is inversely proportional to the absolute
temperature of that body.
The measurement of light as perceived by human eye is called photometry. Photometry
is measurement of a physiological phenomenon, being the stimulus of light as received
by the human eye, transmitted by the optic nerves and analysed by the brain. The main
physical quantities in photometry are (i) the luminous intensity of the source,
(ii) the luminous flux or flow of light from the source, and (iii) illuminance of the surface.
The SI unit of luminous intensity (I ) is candela (cd). The candela is the luminous intensity,
in a given direction, of a source that emits monochromatic radiation of frequency
540 1012 Hz and that has a radiant intensity in that direction of 1/683 watt per steradian.
If a light source emits one candela of luminous intensity into a solid angle of one steradian,
the total luminous flux emitted into that solid angle is one lumen (lm). A standard
100 watt incadescent light bulb emits approximately 1700 lumens.
In photometry, the only parameter, which can be measured directly is illuminance. It
is defined as luminous flux incident per unit area on a surface (lm/m2 or lux ). Most light
meters measure this quantity. The illuminance E, produced by a source of luminous
intensity I, is given by E = I/r2, where r is the normal distance of the surface from the
source. A quantity named luminance (L), is used to characterise the brightness of emitting
or reflecting flat surfaces. Its unit is cd/m2 (sometimes called nit in industry) . A good
LCD computer monitor has a brightness of about 250 nits.
MN
OM
MN
MC
(9.13)
Similarly,
r = NCM NIM
i.e., r =
MN
MC
MN
MI
324
n1i = n 2r
(9.14)
n2
n 2 n1
OM
MI
MC
(9.15)
or v = +100 cm
The image is formed at a distance of 100 cm from the glass surface,
in the direction of incident light.
E XAMPLE 9.6
1.5
1
v 100
n2
BI1
n 2 n1
BC1
(9.17)
n1 n 2 n 1
DI
DC2
(9.18)
* Note that now the refractive index of the medium on the right side of ADC is n1
while on its left it is n 2. Further DI1 is negative as the distance is measured
against the direction of incident light.
325
Physics
For a thin lens, BI 1 = DI 1. Adding
Eqs. (9.17) and (9.18), we get
n1
OB
n1
1
(n 2 n1 )
DI
BC1
1
DC2
(9.19)
(n 2 n1 )
1
BC1
1
DC2
(9.20)
n 21 1
1
R1
1
R2
n 21
n2
n1
(9.21)
n1
DI
n1
f
(9.22)
Again, in the thin lens approximation, B and D are both close to the
optical centre of the lens. Applying the sign convention,
BO = u, DI = +v, we get
1 1
v u
326
1
f
(9.23)
m=
E XAMPLE 9.7
Solution
The refractive index of the liquid must be equal to 1.47 in order to
make the lens disappear. This means n1 = n 2.. This gives 1/f = 0 or
f . The lens in the liquid will act like a plane sheet of glass. No,
the liquid is not water. It could be glycerine.
327
Physics
length bends the incident light more, while converging it
in case of a convex lens and diverging it in case of a
concave lens. The power P of a lens is defined as the
tangent of the angle by which it converges or diverges a
beam of light falling at unit distant from the optical centre
(Fig. 9.20).
h
; if h
f
tan
1 tan
1
f
or
1
f for small
value of . Thus,
FIGURE 9.20 Power of a lens.
P=
1
f
(9.25)
The SI unit for power of a lens is dioptre (D): 1D = 1m1. The power of
a lens of focal length of 1 metre is one dioptre. Power of a lens is positive
for a converging lens and negative for a diverging lens. Thus, when an
optician prescribes a corrective lens of power + 2.5 D, the required lens is
a convex lens of focal length + 40 cm. A lens of power of 4.0 D means a
concave lens of focal length 25 cm.
Example 9.8 (i) If f = 0.5 m for a glass lens, what is the power of the
lens? (ii) The radii of curvature of the faces of a double convex lens
are 10 cm and 15 cm. Its focal length is 12 cm. What is the refractive
index of glass? (iii) A convex lens has 20 cm focal length in air. What
is focal length in water? (Refractive index of air-water = 1.33, refractive
index for air-glass = 1.5.)
Solution
(i) Power = +2 dioptre.
(ii) Here, we have f = +12 cm, R1 = +10 cm, R2 = 15 cm.
Refractive index of air is taken as unity.
We use the lens formula of Eq. (9.22). The sign convention has to
be applied for f, R1 and R 2.
Substituting the values, we have
EXAMPLE 9.8
1
1
1
(n 1)
12
10
15
This gives n = 1.5.
(iii) For a glass lens in air, n 2 = 1.5, n1 = 1, f = +20 cm. Hence, the lens
formula gives
1
1
1
0.5
20
R1 R 2
For the same glass lens in water, n 2 = 1.5, n1 = 1.33. Therefore,
1.33
1
(1.5 1.33)
f
R1
1
R2
(9.26)
(9.27)
1 1 1
v v1 f 2
Adding Eqs. (9.27) and (9.28), we get
(9.28)
1 1
v u
(9.29)
1
f1
1
f2
1
f
so that we get
1
f
1
f1
1
f2
(9.30)
1
f1
1
f2
1
f3
(9.31)
(9.32)
where P is the net power of the lens combination. Note that the sum in
Eq. (9.32) is an algebraic sum of individual powers, so some of the terms
on the right side may be positive (for convex lenses) and some negative
(for concave lenses). Combination of lenses helps to obtain diverging or
converging lenses of desired magnification. It also enhances sharpness
of the image. Since the image formed by the first lens becomes the object
for the second, Eq. (9.25) implies that the total magnification m of the
combination is a product of magnification (m1, m 2, m 3,...) of individual
lenses
m = m1 m 2 m 3 ...
(9.33)
329
Physics
Such a system of combination of lenses is commonly used in designing
lenses for cameras, microscopes, telescopes and other optical instruments.
Example 9.9 Find the position of the image formed by the lens
combination given in the Fig. 9.22.
FIGURE 9.22
1
u1
1
f1
1
1
30 10
or
v1 = 15 cm
The image formed by the first lens serves as the object for the second.
This is at a distance of (15 5) cm = 10 cm to the right of the second
lens. Though the image is real, it serves as a virtual object for the
second lens, which means that the rays appear to come from it for
the second lens.
1
v2
1
10
1
10
EXAMPLE 9.9
or
v2 =
The virtual image is formed at an infinite distance to the left of the
second lens. This acts as an object for the third lens.
1
v3
1
u3
1
f3
or
1
v3
or
v3 = 30 cm
1
30
9.6 REFRACTION
330
THROUGH A
PRISM
Figure 9.23 shows the passage of light through a triangular prism ABC.
The angles of incidence and refraction at the first face AB are i and r1,
while the angle of incidence (from glass to air) at the second face AC is r2
and the angle of refraction or emergence e. The angle between the
emergent ray RS and the direction of the incident ray PQ is called the
angle of deviation, .
that is,
=i+eA
(9.35)
Thus, the angle of deviation depends on the angle of incidence. A plot
between the angle of deviation and angle of incidence is shown in
Fig. 9.24. You can see that, in general, any given value of , except for
i = e, corresponds to two values i and hence of e. This, in fact, is expected
from the symmetry of i and e in Eq. (9.35), i.e., remains the same if i
and e are interchanged. Physically, this is related
to the fact that the path of ray in Fig. 9.23 can be
traced back, resulting in the same angle of
deviation. At the minimum deviation Dm, the
refracted ray inside the prism becomes parallel
to its base. We have
= Dm, i = e which implies r1 = r2.
Equation (9.34) gives
A
2
In the same way, Eq. (9.35) gives
2r = A or r =
(9.36)
Dm = 2i A, or i = (A + Dm)/2
(9.37)
n2
n1
sin[( A Dm )/2]
sin [ A / 2]
(9.38)
n 21
A Dm /2
sin[( A Dm )/2]
sin[ A /2]
A /2
Dm = (n 211)A
It implies that, thin prisms do not deviate light much.
331
Physics
9.7 DISPERSION
BY A
PRISM
It has been known for a long time that when a narrow beam of sunlight,
usually called white light, is incident on a glass prism, the emergent
light is seen to be consisting of several colours. There is actually a
continuous variation of colour, but broadly, the different component
colours that appear in sequence are:
violet, indigo, blue, green, yellow, orange
and red (given by the acronym
VIBGYOR). The red light bends the
least, while the violet light bends the most
(Fig. 9.25).
The phenomenon of splitting of light
into its component colours is known as
dispersion. The pattern of colour
components of light is called the spectrum
of light. The word spectrum is now used
in a much more general sense: we
discussed in Chapter 8 the electroFIGURE 9.25 Dispersion of sunlight or white light
magnetic spectrum over the large range
on passing through a glass prism. The relative
of wavelengths, from -rays to radio
deviation of different colours shown is highly
waves, of which the spectrum of light
exaggerated.
(visible spectrum) is only a small part.
Though the reason for appearance of
spectrum is now common knowledge, it was a matter of much debate in
the history of physics. Does the prism itself create colour in some way or
does it only separate the colours already present in white light?
In a classic experiment known for its simplicity but great significance,
Isaac Newton settled the issue once for all. He put another similar prism,
but in an inverted position, and let the emergent beam from the first
prism fall on the second prism (Fig. 9.26). The resulting emergent beam
was found to be white light. The explanation was clear the first prism
splits the white light into its component colours, while the inverted prism
recombines them to give white light. Thus, white
light itself consists of light of different colours,
which are separated by the prism.
It must be understood here that a ray of light,
as defined mathematically, does not exist. An
actual ray is really a beam of many rays of light.
Each ray splits into component colours when it
enters the glass prism. When those coloured rays
come out on the other side, they again produce a
white beam.
FIGURE 9.26 Schematic diagram of
We now know that colour is associated with
Newtons classic experiment on
wavelength of light. In the visible spectrum, red
dispersion of white light.
light is at the long wavelength end (~700 nm) while
the violet light is at the short wavelength end
(~ 400 nm). Dispersion takes place because the refractive index of medium
332
for different wavelengths (colours) is different. For example, the bending
Wavelength (nm)
Crown glass
Flint glass
Violet
396.9
1.533
1.663
Blue
486.1
1.523
1.639
Yellow
589.3
1.517
1.627
Red
656.3
1.515
1.622
DUE TO
SUNLIGHT
Formation of rainbows
http://www.eo.ucar.edu/rainbows
http://www.atoptics.co.uk/bows.htm
The interplay of light with things around us gives rise to several beautiful
phenomena. The spectacle of colour that we see around us all the time is
possible only due to sunlight. The blue of the sky, white clouds, the redhue at sunrise and sunset, the rainbow, the brilliant colours of some
pearls, shells, and wings of birds, are just a few of the natural wonders
we are used to. We describe some of them here from the point of view
of physics.
333
Physics
FIGURE 9.27 Rainbow: (a) The sun rays incident on a water drop get refracted twice
and reflected internally by a drop; (b) Enlarge view of internal reflection and
refraction of a ray of light inside a drop form primary rainbow; and
(c) secondary rainbow is formed by rays
undergoing internal reflection twice
inside the drop.
334
the inner surface of the water drop and get internally reflected if the angle
between the refracted ray and normal to the drop surface is greater then
the critical angle (48, in this case). The reflected light is refracted again
as it comes out of the drop as shown in the figure. It is found that the
violet light emerges at an angle of 40 related to the incoming sunlight
and red light emerges at an angle of 42. For other colours, angles lie in
between these two values.
335
Physics
examples of optical devices and instruments that are in common use.
Our eye is, of course, one of the most important optical device the nature
has endowed us with. Starting with the eye, we then go on to describe
the principles of working of the microscope and the telescope.
336
Figure 9.29 (a) shows the eye. Light enters the eye through a curved
front surface, the cornea. It passes through the pupil which is the central
hole in the iris. The size of the pupil can change under control of muscles.
The light is further focussed by the eye lens on the retina. The retina is a
film of nerve fibres covering the curved back surface of the eye. The retina
contains rods and cones which sense light intensity and colour,
respectively, and transmit electrical signals via the optic nerve to the brain
which finally processes this information. The shape (curvature) and
therefore the focal length of the lens can be modified somewhat by the
ciliary muscles. For example, when the muscle is relaxed, the focal length
is about 2.5 cm and objects at infinity are in sharp focus on the retina.
When the object is brought closer to the eye, in order to maintain the
same image-lens distance ( 2.5 cm), the focal length of the eye lens
becomes shorter by the action of the ciliary muscles. This property of the
eye is called accommodation. If the object is too close to the eye, the lens
cannot curve enough to focus the image on to the retina, and the image
is blurred. The closest distance for which the lens can focus light on the
retina is called the least distance of distinct vision, or the near point.
The standard value for normal vision is taken as 25 cm. (Often the near
point is given the symbol D.) This distance increases with age, because
of the decreasing effectiveness of the ciliary muscle and the loss of
flexibility of the lens. The near point may be as close as about 7 to 8 cm
in a child ten years of age, and may increase to as much as 200 cm at 60
years of age. Thus, if an elderly person tries to read a book at about 25 cm
from the eye, the image appears blurred. This condition (defect of the eye)
is called presbyopia. It is corrected by using a converging lens for reading.
Thus, our eyes are marvellous organs that have the capability to
interpret incoming electromagnetic waves as images through a complex
process. These are our greatest assets and we must take proper care to
protect them. Imagine the world without a pair of functional eyes. Yet
many amongst us bravely face this challenge by effectively overcoming
their limitations to lead a normal life. They deserve our appreciation for
their courage and conviction.
In spite of all precautions and proactive action, our eyes may develop
some defects due to various reasons. We shall restrict our discussion to
some common optical defects of the eye. For example, the light from a
distant object arriving at the eye-lens may get converged at a point in
front of the retina. This type of defect is called nearsightedness or myopia .
This means that the eye is producing too much convergence in the incident
beam. To compensate this, we interpose a concave lens between the eye
and the object, with the diverging effect desired to get the image focussed
on the retina [Fig. 9.29(b)].
FIGURE 9.29 (a) The structure of the eye; (b) shortsighted or myopic eye and its correction;
(c) farsighted or hypermetropic eye and its correction; and (d) astigmatic eye and its correction.
1
1
1
1
or f
50
25 50
orf = + 50 cm (convex lens).
E XAMPLE 9.10
337
Physics
EXAMPLE 9.11
Example 9.11
(a) The far point of a myopic person is 80 cm in front of the eye. What
is the power of the lens required to enable him to see very distant
objects clearly?
(b) In what way does the corrective lens help the above person? Does
the lens magnify very distant objects? Explain carefully.
(c) The above person prefers to remove his spectacles while reading
a book. Explain why?
Solution
(a) Solving as in the previous example, we find that the person should
use a concave lens of focal length = 80 cm, i.e., of power = 1.25
dioptres.
(b) No. The concave lens, in fact, reduces the size of the object, but
the angle subtended by the distant object at the eye is the same
as the angle subtended by the image (at the far point) at the eye.
The eye is able to see distant objects not because the corrective
lens magnifies the object, but because it brings the object (i.e., it
produces virtual image of the object) at the far point of the eye
which then can be focussed by the eye-lens on the retina.
(c) The myopic person may have a normal near point, i.e., about
25 cm (or even less). In order to read a book with the spectacles,
such a person must keep the book at a distance greater than
25 cm so that the image of the book by the concave lens is produced
not closer than 25 cm. The angular size of the book (or its image)
at the greater distance is evidently less than the angular size
when the book is placed at 25 cm and no spectacles are needed.
Hence, the person prefers to remove the spectacles while reading.
338
EXAMPLE 9.12
v
u
1 1
v f
v
f
D
f
(9.39)
h
o
D
(9.40)
339
Physics
We now find the angle subtended at the eye by the image when the
object is at u. From the relations
h
v
m
h
u
we have the angle subtended by the image
h
v
is at u = f.
tan
h v
v u
h
. The angle subtended by the object, when it
u
h
f
(9.41)
D
f
i
o
(9.42)
This is one less than the magnification when the image is at the near
point, Eq. (9.39), but the viewing is more comfortable and the difference
in magnification is usually small. In subsequent discussions of optical
instruments (microscope and telescope) we shall assume the image to be
at infinity.
A simple microscope has a limited maximum magnification ( 9) for
realistic focal lengths. For much larger magnifications, one uses two
lenses, one compounding the effect of the other. This is known as a
compound microscope. A schematic diagram of
a compound microscope is shown in Fig. 9.31.
The lens nearest the object, called the objective,
forms a real, inverted, magnified image of the
object. This serves as the object for the second
lens, the eyepiece, which functions essentially
like a simple microscope or magnifier, produces
the final image, which is enlarged and virtual.
The first inverted image is thus near (at or
within) the focal plane of the eyepiece, at a
distance appropriate for final image formation
at infinity, or a little closer for image formation
at the near point. Clearly, the final image is
inverted with respect to the original object.
We now obtain the magnification due to a
FIGURE 9.31 Ray diagram for the
compound microscope. The ray diagram of
formation of image by a compound
Fig. 9.31 shows that the (linear) magnification
microscope.
due to the objective, namely h/h, equals
h
L
h
fo
where we have used the result
mO
340
tan
h
fo
h
L
(9.43)
me = (D/fe )
[9.44(b)]
m om e
L
fo
D
fe
20 25
250
1
2
Various other factors such as illumination of the object, contribute to
the quality and visibility of the image. In modern microscopes, multicomponent lenses are used for both the objective and the eyepiece to
improve image quality by minimising various optical aberrations (defects)
in lenses.
m om e
http://astro.nineplanets.org/bigeyes.html
L
D
(9.45)
fo
fe
Clearly, to achieve a large magnification of a small object (hence the
name microscope), the objective and eyepiece should have small focal
lengths. In practice, it is difficult to make the focal length much smaller
than 1 cm. Also large lenses are required to make L large.
For example, with an objective with fo = 1.0 cm, and an eyepiece with
focal length fe = 2.0 cm, and a tube length of 20 cm, the magnification is
m
9.9.3 Telescope
The telescope is used to provide angular magnification of distant objects
(Fig. 9.32). It also has an objective and an eyepiece. But here, the objective
has a large focal length and a much larger aperture than the eyepiece.
Light from a distant object enters the objective and a real image is formed
in the tube at its second focal point. The eyepiece magnifies this image
producing a final inverted image. The magnifying power m is the ratio of
the angle subtended at the eye by the final image to the angle which
the object subtends at the lens or the eye. Hence
fo
h fo
m
.
(9.46)
fe h
fe
In this case, the length of the telescope tube is fo + fe .
341
Physics
Terrestrial telescopes have, in
addition, a pair of inverting lenses to
make the final image erect. Refracting
telescopes can be used both for
terrestrial and astronomical
observations. For example, consider
a telescope whose objective has a focal
length of 100 cm and the eyepiece a
focal length of 1 cm. The magnifying
power of this telescope is
m = 100/1 = 100.
Let us consider a pair of stars of
actual separation 1 (one minute of
arc). The stars appear as though they
FIGURE 9.32 A refracting telescope.
are separated by an angle of 100 1
= 100 =1.67.
The main considerations with an astronomical telescope are its light
gathering power and its resolution or resolving power. The former clearly
depends on the area of the objective. With larger diameters, fainter objects
can be observed. The resolving power, or the ability to observe two objects
distinctly, which are in very nearly the same direction, also depends on
the diameter of the objective. So, the desirable aim in optical telescopes
is to make them with objective of large diameter. The largest lens objective
in use has a diameter of 40 inch (~1.02 m). It is at the Yerkes Observatory
in Wisconsin, USA. Such big lenses tend to be very heavy and therefore,
difficult to make and support by their edges. Further, it is rather difficult
and expensive to make such large sized lenses which form images that
are free from any kind of chromatic aberration and distortions.
For these reasons, modern telescopes use a concave mirror rather
than a lens for the objective. Telescopes with mirror objectives are called
reflecting telescopes. They have several advantages. First, there is no
chromatic aberration in a mirror. Second, if a parabolic reflecting surface
is chosen, spherical aberration is also removed. Mechanical support is
much less of a problem since a mirror weighs much less than a lens of
equivalent optical quality, and can be
supported over its entire back surface, not
just over its rim. One obvious problem with a
reflecting telescope is that the objective mirror
focusses light inside the telescope tube. One
must have an eyepiece and the observer right
there, obstructing some light (depending on
the size of the observer cage). This is what is
done in the very large 200 inch (~5.08 m)
diameters, Mt. Palomar telescope, California.
The viewer sits near the focal point of the
FIGURE 9.33 Schematic diagram of a reflecting mirror, in a small cage. Another solution to
telescope (Cassegrain).
the problem is to deflect the light being
focussed by another mirror. One such
arrangement using a convex secondary mirror to focus the incident light,
342
which now passes through a hole in the objective primary mirror, is shown
SUMMARY
1.
2.
3.
4.
1
v
5.
1
u
where u and v are object and image distances, respectively and f is the
focal length of the mirror. f is (approximately) half the radius of
curvature R. f is negative for concave mirror; f is positive for a convex
mirror.
For a prism of the angle A, of refractive index n 2 placed in a medium
of refractive index n1,
sin A
n2
n1
n 21
6.
1
f
D m /2
sin A /2
n 2 n1 n 2 n 1
v
u
R
Thin lens formula
1
v
1
u
1
f
343
Physics
Lens makers formula
1
f
n2
n1
n1
1
R1
1
R2
1
f
7.
8.
9.
1
f1
1
f2
1
f3
L
fo
D
fe
where fo and fe are the focal lengths of the objective and eyepiece,
respectively, and L is the distance between their focal points.
10. Magnifying power m of a telescope is the ratio of the angle subtended
at the eye by the image to the angle subtended at the eye by the
object.
fo
fe
where f0 and fe are the focal lengths of the objective and eyepiece,
respectively.
POINTS TO PONDER
1.
2.
344
The laws of reflection and refraction are true for all surfaces and
pairs of media at the point of the incidence.
The real image of an object placed between f and 2f from a convex lens
can be seen on a screen placed at the image location. If the screen is
removed, is the image still there? This question puzzles many, because
it is difficult to reconcile ourselves with an image suspended in air
3.
4.
5.
without a screen. But the image does exist. Rays from a given point
on the object are converging to an image point in space and diverging
away. The screen simply diffuses these rays, some of which reach our
eye and we see the image. This can be seen by the images formed in air
during a laser show.
Image formation needs regular reflection/refraction. In principle, all
rays from a given point should reach the same image point. This is
why you do not see your image by an irregular reflecting object, say
the page of a book.
Thick lenses give coloured images due to dispersion. The variety in
colour of objects we see around us is due to the constituent colours of
the light incident on them. A monochromatic light may produce an
entirely different perception about the colours on an object as seen in
white light.
For a simple microscope, the angular size of the object equals the
angular size of the image. Yet it offers magnification because we can
keep the small object much closer to the eye than 25 cm and hence
have it subtend a large angle. The image is at 25 cm which we can see.
Without the microscope, you would need to keep the small object at
25 cm which would subtend a very small angle.
EXERCISES
9.1
9.2
9.3
9.4
FIGURE 9.34
345
Physics
9.5
9.6
9.7
9.8
9.9
9.10
9.11
9.12
9.13
9.14
9.15
346
9.16
9.17
(d) an object placed between the pole and focus of a concave mirror
produces a virtual and enlarged image.
[Note: This exercise helps you deduce algebraically properties of
images that one obtains from explicit ray diagrams.]
A small pin fixed on a table top is viewed from above from a distance
of 50 cm. By what distance would the pin appear to be raised if it is
viewed from the same point through a 15 cm thick glass slab held
parallel to the table? Refractive index of glass = 1.5. Does the answer
depend on the location of the slab?
(a) Figure 9.35 shows a cross-section of a light pipe made of a
glass fibre of refractive index 1.68. The outer covering of the
pipe is made of a material of refractive index 1.44. What is the
range of the angles of the incident rays with the axis of the pipe
for which total reflections inside the pipe take place, as shown
in the figure.
(b) What is the answer if there is no outer covering of the pipe?
FIGURE 9.35
9.18
9.19
9.20
9.21
347
Physics
9.22
9.23
9.24
9.25
9.26
9.27
9.28
9.29
9.30
9.31
348
9.32
9.33
9.34
9.35
9.36
9.37
[Note: Exercises 9.29 to 9.31 will help you clearly understand the
difference between magnification in absolute size and the angular
magnification (or magnifying power) of an instrument.]
Answer the following questions:
(a) The angle subtended at the eye by an object is equal to the
angle subtended at the eye by the virtual image produced by a
magnifying glass. In what sense then does a magnifying glass
provide angular magnification?
(b) In viewing through a magnifying glass, one usually positions
ones eyes very close to the lens. Does angular magnification
change if the eye is moved back?
(c) Magnifying power of a simple microscope is inversely proportional
to the focal length of the lens. What then stops us from using a
convex lens of smaller and smaller focal length and achieving
greater and greater magnifying power?
(d) Why must both the objective and the eyepiece of a compound
microscope have short focal lengths?
(e) When viewing through a compound microscope, our eyes should
be positioned not on the eyepiece but a short distance away
from it for best viewing. Why? How much should be that short
distance between the eye and eyepiece?
An angular magnification (magnifying power) of 30X is desired using
an objective of focal length 1.25 cm and an eyepiece of focal length
5 cm. How will you set up the compound microscope?
A small telescope has an objective lens of focal length 140 cm and
an eyepiece of focal length 5.0 cm. What is the magnifying power of
the telescope for viewing distant objects when
(a) the telescope is in normal adjustment (i.e., when the final image
is at infinity)?
(b) the final image is formed at the least distance of distinct vision
(25 cm)?
(a) For the telescope described in Exercise 9.34 (a), what is the
separation between the objective lens and the eyepiece?
(b) If this telescope is used to view a 100 m tall tower 3 km away,
what is the height of the image of the tower formed by the objective
lens?
(c) What is the height of the final image of the tower if it is formed at
25 cm?
A Cassegrain telescope uses two mirrors as shown in Fig. 9.33. Such
a telescope is built with the mirrors 20 mm apart. If the radius of
curvature of the large mirror is 220 mm and the small mirror is
140 mm, where will the final image of an object at infinity be?
Light incident normally on a plane mirror attached to a galvanometer
coil retraces backwards as shown in Fig. 9.36. A current in the coil
produces a deflection of 3.5o of the mirror. What is the displacement
of the reflected spot of light on a screen placed 1.5 m away?
FIGURE 9.36
349
Physics
9.38
FIGURE 9.37
350
Wave Optics
Chapter Ten
tt
o N
be C
E
re R
pu T
bl
is
he
d
WAVE OPTICS
no
10.1 INTRODUCTION
In 1637 Descartes gave the corpuscular model of light and derived Snells
law. It explained the laws of reflection and refraction of light at an interface.
The corpuscular model predicted that if the ray of light (on refraction)
bends towards the normal then the speed of light would be greater in the
second medium. This corpuscular model of light was further developed
by Isaac Newton in his famous book entitled OPTICKS and because of
the tremendous popularity of this book, the corpuscular model is very
often attributed to Newton.
In 1678, the Dutch physicist Christiaan Huygens put forward the
wave theory of light it is this wave model of light that we will discuss in
this chapter. As we will see, the wave model could satisfactorily explain
the phenomena of reflection and refraction; however, it predicted that on
refraction if the wave bends towards the normal then the speed of light
would be less in the second medium. This is in contradiction to the
prediction made by using the corpuscular model of light. It was much
later confirmed by experiments where it was shown that the speed of
light in water is less than the speed in air confirming the prediction of the
wave model; Foucault carried out this experiment in 1850.
The wave theory was not readily accepted primarily because of
351
Newtons authority and also because light could travel through vacuum
Physics
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
and it was felt that a wave would always require a medium to propagate
from one point to the other. However, when Thomas Young performed
his famous interference experiment in 1801, it was firmly established
that light is indeed a wave phenomenon. The wavelength of visible
light was measured and found to be extremely small; for example, the
wavelength of yellow light is about 0.5 m. Because of the smallness
of the wavelength of visible light (in comparison to the dimensions of
typical mirrors and lenses), light can be assumed to approximately
travel in straight lines. This is the field of geometrical optics, which we
had discussed in the previous chapter. Indeed, the branch of optics in
which one completely neglects the finiteness of the wavelength is called
geometrical optics and a ray is defined as the path of energy
propagation in the limit of wavelength tending to zero.
After the interference experiment of Young in 1801, for the next 40
years or so, many experiments were carried out involving the
interference and diffraction of lightwaves; these experiments could only
be satisfactorily explained by assuming a wave model of light. Thus,
around the middle of the nineteenth century, the wave theory seemed
to be very well established. The only major difficulty was that since it
was thought that a wave required a medium for its propagation, how
could light waves propagate through vacuum. This was explained
when Maxwell put forward his famous electromagnetic theory of light.
Maxwell had developed a set of equations describing the laws of
electricity and magnetism and using these equations he derived what
is known as the wave equation from which he predicted the existence
of electromagnetic waves*. From the wave equation, Maxwell could
calculate the speed of electromagnetic waves in free space and he found
that the theoretical value was very close to the measured value of speed
o f l i g h t . F r o m t h i s , h e p r o p o u n d e d t h a t light must be an
electromagnetic wave. Thus, according to Maxwell, light waves are
associated with changing electric and magnetic fields; changing electric
field produces a time and space varying magnetic field and a changing
magnetic field produces a time and space varying electric field. The
changing electric and magnetic fields result in the propagation of
electromagnetic waves (or light waves) even in vacuum.
In this chapter we will first discuss the original formulation of the
Huygens principle and derive the laws of reflection and refraction. In
Sections 10.4 and 10.5, we will discuss the phenomenon of interference
which is based on the principle of superposition. In Section 10.6 we
will discuss the phenomenon of diffraction which is based on HuygensFresnel principle. Finally in Section 10.7 we will discuss the
phenomenon of polarisation which is based on the fact that the light
waves are transverse electromagnetic waves.
352
Wave Optics
DOES
tt
o N
be C
E
re R
pu T
bl
is
he
d
Light travels in a straight line in Class VI; it does not do so in Class XII and beyond! Surprised,
arent you?
In school, you are shown an experiment in which you take three cardboards with
pinholes in them, place a candle on one side and look from the other side. If the flame of the
candle and the three pinholes are in a straight line, you can see the candle. Even if one of
them is displaced a little, you cannot see the candle. This proves, so your teacher says,
that light travels in a straight line.
In the present book, there are two consecutive chapters, one on ray optics and the other
on wave optics. Ray optics is based on rectilinear propagation of light, and deals with
mirrors, lenses, reflection, refraction, etc. Then you come to the chapter on wave optics,
and you are told that light travels as a wave, that it can bend around objects, it can diffract
and interfere, etc.
In optical region, light has a wavelength of about half a micrometre. If it encounters an
obstacle of about this size, it can bend around it and can be seen on the other side. Thus a
micrometre size obstacle will not be able to stop a light ray. If the obstacle is much larger,
however, light will not be able to bend to that extent, and will not be seen on the other side.
This is a property of a wave in general, and can be seen in sound waves too. The sound
wave of our speech has a wavelength of about 50 cm to 1 m. If it meets an obstacle of the
size of a few metres, it bends around it and reaches points behind the obstacle. But when it
comes across a larger obstacle of a few hundred metres, such as a hillock, most of it is
reflected and is heard as an echo.
Then what about the primary school experiment? What happens there is that when we
move any cardboard, the displacement is of the order of a few millimetres, which is much
larger than the wavelength of light. Hence the candle cannot be seen. If we are able to move
one of the cardboards by a micrometer or less, light will be able to diffract, and the candle
will still be seen.
One could add to the first sentence in this box : It learns how to bend as it grows up!
no
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
Wave Optics
Christiaan Huygens
(1629 1695) Dutch
physicist, astr onomer,
mathematician and the
founder of the wave
theory of light. His book,
T reatise on light, makes
fascinating reading even
today. He brilliantly
explained the double
refraction shown by the
mineral calcite in this
work in addition to
reflection and refraction.
He was the first to
analyse circular and
simple harmonic motion
and designed and built
improved clocks and
telescopes. He discovered
the true geometry of
Saturns rings.
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
BC
v
= 1
A C AC
(10.1)
and
AE v 2
=
(10.2)
AC AC
where i and r are the angles of incidence and refraction, respectively.
sin r =
355
Physics
Thus we obtain
sin i
v
= 1
sin r
v2
(10.3)
356
c
v1
(10.4)
c
n2 = v
2
(10.5)
n1 =
and
(10.6)
http://www.falstad.com/ripple/
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
From the above equation, we get the important result that if r < i (i.e.,
if the ray bends toward the normal), the speed of the light wave in the
second medium (v 2) will be less then the speed of the light wave in the
first medium (v1). This prediction is opposite to the prediction from the
corpuscular model of light and as later experiments showed, the prediction
of the wave theory is correct. Now, if c represents the speed of light in
vacuum, then,
1
BC
v
=
= 1
AE
v2
2
or
v1
v2
(10.7)
The above equation implies that when a wave gets refracted into a
denser medium (v1 > v 2) the wavelength and the speed of propagation
decrease but the frequency (= v/) remains the same.
n2
n1
(10.8)
Thus, if i = ic then sin r = 1 and r = 90. Obviously, for i > ic, there can
not be any refracted wave. The angle ic is known as the critical angle and
for all angles of incidence greater than the critical angle, we will not have
Wave Optics
tt
o N
be C
E
re R
pu T
bl
is
he
d
any refracted wave and the wave will undergo what is known as total
internal reflection. The phenomenon of total internal reflection and its
applications was discussed in Section 9.4.
no
AE = BC = v
If we now consider the triangles EAC and BAC we will find that they
are congruent and therefore, the angles i and r (as shown in Fig. 10.6)
would be equal. This is the law of reflection.
Once we have the laws of reflection and refraction, the behaviour of
prisms, lenses, and mirrors can be understood. These phenomena were
357
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
FIGURE 10.7 Refraction of a plane wave by (a) a thin prism, (b) a convex lens. (c) Reflection of a
plane wave by a concave mirror.
From the above discussion it follows that the total time taken from a
point on the object to the corresponding point on the image is the same
measured along any ray. For example, when a convex lens focusses light
to form a real image, although the ray going through the centre traverses
a shorter path, but because of the slower speed in glass, the time taken
is the same as for rays travelling near the edge of the lens.
no
358
Wave Optics
tt
o N
be C
E
re R
pu T
bl
is
he
d
vradial
c
(10.9)
The formula given above is valid only when the speed of the source is
small compared to that of light. A more accurate formula for the Doppler
effect which is valid even when the speeds are close to that of light, requires
the use of Einsteins special theory of relativity. The Doppler effect for
light is very important in astronomy. It is the basis for the measurements
of the radial velocities of distant galaxies.
Example 10.1 What speed should a galaxy move with respect
to us so that the sodium line at 589.0 nm is observed
at 589.6 nm?
Solution Since = c,
0.6
= + 3.06 105 m s 1
589.0
= 306 km/s
no
Solution
E XAMPLE 10.2
Example 10.2
(a) When monochromatic light is incident on a surface separating
two media, the reflected and refracted light both have the same
frequency as the incident frequency. Explain why?
(b) When light travels from a rarer to a denser medium, the speed
decreases. Does the reduction in speed imply a reduction in the
energy carried by the light wave?
(c) In the wave picture of light, intensity of light is determined by the
square of the amplitude of the wave. What determines the intensity
of light in the photon picture of light.
E XAMPLE 10.1
or, v radial + c
v radial
c
359
Physics
E XAMPLE 10.2
tt
o N
be C
E
re R
pu T
bl
is
he
d
(a)
S1 P = S2 P
(b)
no
360
then, the displacement produced by the source S2 (at the point P) will
also be given by
y2 = a cos t
Wave Optics
interfere constructively and we have what is referred to as constructive
interference. We next consider a point Q [Fig. 10.9(a)]
for which
S2Q S1 Q = 2
tt
o N
be C
E
re R
pu T
bl
is
he
d
The waves emanating from S1 will arrive exactly two cycles earlier
than the waves from S2 and will again be in phase [Fig. 10.9(a)]. Thus, if
the displacement produced by S1 is given by
y1 = a cos t
y2 = a cos (t 4) = a cos t
where we have used the fact that a path difference of 2 corresponds to a
phase difference of 4. The two displacements are once again in phase
and the intensity will again be 4 I0 giving rise to constructive interference.
In the above analysis we have assumed that the distances S1Q and S 2Q
are much greater than d (which represents the distance between S1 and
S2) so that although S1Q and S2Q are not equal, the amplitudes of the
displacement produced by each wave are very nearly the same.
We next consider a point R [Fig. 10.9(b)] for which
FIGURE 10.9
(a) Constructive
The waves emanating from S1 will arrive exactly two and a half cycles
interference at a
later than the waves from S2 [Fig. 10.10(b)]. Thus if the displacement point Q for which the
path difference is 2.
produced by S1 is given by
(b) Destructive
y1 = a cos t
interference at a
point R for which the
then the displacement produced by S2 will be given by
path difference is
y2 = a cos (t + 5) = a cos t
2.5 .
S2R S1 R = 2.5
where we have used the fact that a path difference of 2.5 corresponds to
a phase difference of 5. The two displacements are now out of phase
and the two displacements will cancel out to give zero intensity. This is
referred to as destructive interference.
To summarise: If we have two coherent sources S1 and S2 vibrating
in phase, then for an arbitrary point P whenever the path difference,
S1P ~ S2P = n
(n = 0, 1, 2, 3,...)
(10.10)
no
1
) (n = 0, 1, 2, 3, ...)
(10.11)
2
FIGURE 10.10 Locus
we will have destructive interference and the resultant intensity will be of points for which
zero. Now, for any other arbitrary point G (Fig. 10.10) let the phase S P S P is equal to
1
2
difference between the two displacements be . Thus, if the displacement
zero, , 2 , 3 .
produced by S1 is given by
S1P ~ S2P = (n+
y1 = a cos t
361
Physics
then, the displacement produced by S2 would be
y2 = a cos (t + )
and the resultant displacement will be given by
y = y1 + y2
tt
o N
be C
E
re R
pu T
bl
is
he
d
= a [cos t + cos (t + )]
http://www.colorado.edu/physics/2000/applets/fourier.html
I = 4 I 0 cos2 ( /2)
(10.12)
(10.13)
no
I = 2 I0
362
(10.14)
at all points.
When the phase difference between the two vibrating sources changes
rapidly with time, we say that the two sources are incoherent and when
this happens the intensities just add up. This is indeed what happens
when two separate light sources illuminate a wall.
10.5 INTERFERENCE
EXPERIMENT
OF
LIGHT WAVES
AND
YOUNG S
We will now discuss interference using light waves. If we use two sodium
lamps illuminating two pinholes (Fig. 10.11) we will not observe any
interference fringes. This is because of the fact that the light wave emitted
from an ordinary source (like a sodium lamp) undergoes abrupt phase
Wave Optics
tt
o N
be C
E
re R
pu T
bl
is
he
d
(a)
(b)
no
n = 0, 1, 2 ...
(10.15)
Now,
(S2P ) (S1P ) =
D2 + x +
d
2
D2 + x
d
2
= 2x d
363
Physics
where S1S2 = d and OP = x . Thus
2 xd
S2 P S1P = S P+S P
2
1
(10.16)
tt
o N
be C
E
re R
pu T
bl
is
he
d
Thomas
Young
(1773 1829) English
physicist, physician and
Egyptologist. Young worked
on a wide variety of
scientific problems, ranging
from the structure of the eye
and the mechanism of
vision to the decipherment
of the Rosetta stone. He
revived the wave theory of
light and recognised that
interference phenomena
provide proof of the wave
properties of light.
200.01 cm
Thus if we replace S2P + S1 P by 2 D, the error involved is
about 0.005%. In this approximation, Eq. (10.16)
becomes
S2 P S1P
(10.17)
; n = 0, 1, 2
(10.19)
d
Thus dark and bright bands appear on the screen, as shown in
Fig. 10.13. Such bands are called fringes. Equations (10.18) and (10.19)
show that dark and bright fringes are equally spaced and the distance
between two consecutive bright and dark fringes is given by
x = x n = (n+
= x n+1 x n
no
or =
364
(10.20)
d
which is the expression for the fringe width. Obviously, the central point
O (in Fig. 10.12) will be bright because S1O = S2 O and it will correspond
to n = 0. If we consider the line perpendicular to the plane of the paper
and passing through O [i.e., along the y-axis] then all points on this line
will be equidistant from S1 and S2 and we will have a bright central fringe
which is a straight line as shown in Fig. 10.13. In order to determine the
shape of the interference pattern on the screen we note that a particular
fringe would correspond to the locus of points with a constant value of
S2 P S1 P. Whenever this constant is an integral multiple of , the fringe
will be bright and whenever it is an odd integral multiple of /2 it will be
a dark fringe. Now, the locus of the point P lying in the x-y plane such
that S2P S1P (= ) is a constant, is a hyperbola. Thus the fringe pattern
will strictly be a hyperbola; however, if the distance D is very large compared
to the fringe width, the fringes will be very nearly straight lines as shown
in Fig. 10.13.
tt
o N
be C
E
re R
pu T
bl
is
he
d
Wave Optics
FIGURE 10.13 Computer generated fringe pattern produced by two point source S1 and S2 on the
screen GG (Fig. 10.12); (a) and (b) correspond to d = 0.005 mm and 0.025 mm, respectively (both
figures correspond to D = 5 cm and = 5 105 cm.) (Adopted from OPTICS by A. Ghatak, Tata
McGraw Hill Publishing Co. Ltd., New Delhi, 2000.)
no
* Dennis Gabor received the 1971 Nobel Prize in Physics for discovering the
principles of holography.
365
Physics
interference. Thomas Young had expected it because he believed
in the wave theory of light.
EXAMPLE 10.3
http://vsg.quasihome.com/interfer.html
tt
o N
be C
E
re R
pu T
bl
is
he
d
We should mention here that the fringes are straight lines although
S1 and S 2 are point sources. If we had slits instead of the point sources
(Fig. 10.14), each pair of points would have produced straight line fringes
resulting in straight line fringes with increased intensities.
Example 10.3 Two slits are made one millimetre apart and the screen
is placed one metre away. What is the fringe separation when bluegreen light of wavelength 500 nm is used?
Solution Fringe spacing =
D 1 5 10 7
=
m
d
1 10 3
= 5 104 m = 0.5 mm
366
EXAMPLE 10.4
no
Wave Optics
( In each operation, take all parameters, other than the one specified,
to remain unchanged.)
tt
o N
be C
E
re R
pu T
bl
is
he
d
Solution
(a) Angular separation of the fringes remains constant
(= /d). The actual
separation of the fringes increases in
proportion to the distance of the screen from the plane of the
two slits.
(d) Let s be the size of the source and S its distance from the plane of
the two slits. For interference fringes to be seen, the condition
s/S < /d should be satisfied; otherwise, interference patterns
produced by different parts of the source overlap and no fringes
are seen. Thus, as S decreases (i.e., the source slit is brought
closer), the interference pattern gets less and less sharp, and
when the source is brought too close for this condition to be valid,
the fringes disappear. Till this happens, the fringe separation
remains fixed.
(e) Same as in (d). As the source slit width increases, fringe pattern
gets less and less sharp. When the source slit is so wide that the
condition s/S /d is not satisfied, the interference pattern
disappears.
Thus, the fringe closest on either side of the central white fringe
is red and the farthest will appear blue. After a few fringes, no
clear fringe pattern is seen.
E XAMPLE 10.4
no
10.6 DIFFRACTION
367
Physics
instruments such as telescopes or microscopes is limited due to the
phenomenon of diffraction. Indeed the colours that you see when a CD is
viewed is due to diffraction effects. We will now discuss the phenomenon
of diffraction.
tt
o N
be C
E
re R
pu T
bl
is
he
d
= a sin
no
368
(10.21)
Similarly, if two points M1 and M2 in the slit plane are separated by y, the
path difference M2 P M1P y. We now have to sum up equal, coherent
contributions from a large number of sources, each with a different phase.
This calculation was made by Fresnel using integral calculus, so we omit
it here. The main features of the diffraction pattern can be understood by
simple arguments.
At the central point C on the screen, the angle is zero. All path
differences are zero and hence all the parts of the slit contribute in phase.
This gives maximum intensity at C. Experimental observation shown in
Wave Optics
tt
o N
be C
E
re R
pu T
bl
is
he
d
/a .
(10.22)
=
3
3 2a
(10.23)
no
369
Physics
interesting to note what Richard Feynman* has said in his famous
Feynman Lectures on Physics:
370
http://www.phys.hawaii.edu/~teb/optics/java/slitdiffr/
Example 10.5 In Example 10.3, what should the width of each slit be
to obtain 10 maxima of the double slit pattern within the central
maximum of the single slit pattern?
EXAMPLE 10.5
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
Solution We want a = , =
a = = 0 .2 mm
a
5
Notice that the wavelength of light and distance of the screen do not
enter in the calculation of a.
10
=2
Wave Optics
tt
o N
be C
E
re R
pu T
bl
is
he
d
slit diffraction pattern on the screen. The centre of the central bright fringe
will appear at a point which lies on the straight line SS1 or SS2, as the
case may be.
We now compare and contrast the interference pattern with that seen
for a coherently illuminated single slit (usually called the single slit
diffraction pattern).
(i) The interference pattern has a number of equally spaced bright and
dark bands. The diffraction pattern has a central bright maximum
which is twice as wide as the other maxima. The intensity falls as we
go to successive maxima away from the centre, on either side.
(ii) We calculate the interference pattern by superposing two waves
originating from the two narrow slits. The diffraction pattern is a
superposition of a continuous family of waves originating from each
point on a single slit.
(iii) For a single slit of width a, the first null of the interference pattern
occurs at an angle of /a. At the same angle of /a, we get a maximum
(not a null) for two narrow slits separated by a distance a.
One must understand that both d and a have to be quite small, to be
able to observe good interference and diffraction patterns. For example,
the separation d between the two slits must be of the order of a milimetre
or so. The width a of each slit must be even smaller, of the order of 0.1 or
0.2 mm.
In our discussion of Youngs experiment and the single-slit diffraction,
we have assumed that the screen on which the fringes are formed is at a
large distance. The two or more paths from the slits to the screen were
treated as parallel. This situation also occurs when we place a converging
lens after the slits and place the screen at the focus. Parallel paths from
the slit are combined at a single point on the screen. Note that the lens
does not introduce any extra path differences in a parallel beam. This
arrangement is often used since it gives more intensity than placing the
screen far away. If f is the focal length of the lens, then we can easily work
out the size of the central bright maximum. In terms of angles, the
separation of the central maximum from the first null of the diffraction
pattern is /a . Hence, the size on the screen will be f /a.
no
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
In this experiment, the filament plays the role of the first slit S in
Fig. 10.16. The lens of the eye focuses the pattern on the screen (the
retina of the eye).
With some effort, one can cut a double slit in an aluminium foil with
a blade. The bulb filament can be viewed as before to repeat Youngs
experiment. In daytime, there is another suitable bright source subtending
a small angle at the eye. This is the reflection of the Sun in any shiny
convex surface (e.g., a cycle bell). Do not try direct sunlight it can damage
the eye and will not give fringes anyway as the Sun subtends an angle
of (1/2).
In interference and diffraction, light energy is redistributed. If it
reduces in one region, producing a dark fringe, it increases in another
region, producing a bright fringe. There is no gain or loss of energy,
which is consistent with the principle of conservation of energy.
(10.24)
no
r0
372
Wave Optics
where f is the focal length of the lens and 2a is the diameter of the circular
aperture or the diameter of the lens, whichever is smaller. Typically if
0.5 m, f 20 cm and a 5 cm
we have
tt
o N
be C
E
re R
pu T
bl
is
he
d
r0 1.2 m
Although the size of the spot is very small, it plays an important role
in determining the limit of resolution of optical instruments like a telescope
or a microscope. For the two stars to be just resolved
f r0
0.61 f
a
implying
0.61
(10.25)
a
Thus will be small if the diameter of the objective is large. This
implies that the telescope will have better resolving power if a is large. It
is for this reason that for better resolution, a telescope must have a large
diameter objective.
0.61 6 10 5
127
2.9 10 7 radians
E XAMPLE 10.6
no
FIGURE 10.20 Real image formed by the objective lens of the microscope.
373
Physics
DETERMINE
tt
o N
be C
E
re R
pu T
bl
is
he
d
You can estimate the resolving power of your eye with a simple experiment. Make
black stripes of equal width separated by white stripes; see figure here. All the black
stripes should be of equal width, while the width of the intermediate white stripes should
increase as you go from the left to the right. For example, let all black stripes have a width
of 5 mm. Let the width of the first two white stripes be 0.5 mm each, the next two white
stripes be 1 mm each, the next two 1.5 mm each, etc. Paste this pattern on a wall in a
room or laboratory, at the height of your eye.
Now watch the pattern, preferably with one eye. By moving away or closer to the wall,
find the position where you can just see some two black stripes as separate stripes. All
the black stripes to the left of this stripe would merge into one another and would not be
distinguishable. On the other hand, the black stripes to the right of this would be more
and more clearly visible. Note the width d of the white stripe which separates the two
regions, and measure the distance D of the wall from your eye. Then d/D is the resolution
of your eye.
You have watched specks of dust floating in air in a sunbeam entering through your
window. Find the distance (of a speck) which you can clearly see and distinguish from a
neighbouring speck. Knowing the resolution of your eye and the distance of the speck,
estimate the size of the speck of dust.
1.22
D
(10.27)
Two objects whose images are closer than this distance will not be
resolved, they will be seen as one. The corresponding minimum
separation, dmin, in the object plane is given by
v
no
dmin =
374
1. 22
D
1 .22 v
.
D
m
1 .22 f
D
Now, combining Eqs. (10.26) and (10.28), we get
=
d min =
1.22
2 tan
(10.28)
Wave Optics
1.22
2 sin
(10.29)
tt
o N
be C
E
re R
pu T
bl
is
he
d
If the medium between the object and the objective lens is not air but
a medium of refractive index n, Eq. (10.29) gets modified to
d min =
1.22
2 n sin
(10.30)
no
(10.31)
zF
a2 /
Equation (10.31) shows that for distances much smaller than z F , the
spreading due to diffraction is smaller compared to the size of the beam.
It becomes comparable when the distance is approximately zF . For
distances much greater than z F , the spreading due to diffraction
375
Physics
dominates over that due to ray optics (i.e., the size a of the aperture).
Equation (10.31) also shows that ray optics is valid in the limit of
wavelength tending to zero.
EXAMPLE 10.7
tt
o N
be C
E
re R
pu T
bl
is
he
d
Solution z F =
3 2
( 3 10 )
=
5 10
= 18 m
10.7 POLARISATION
no
Consider holding a long string that is held horizontally, the other end of
which is assumed to be fixed. If we move the end of the string up and
down in a periodic manner, we will generate a wave propagating in the
+x direction (Fig. 10.21). Such a wave could be described by the following
equation
376
Wave Optics
y (x,t ) = a sin (kx t)
(10.32)
tt
o N
be C
E
re R
pu T
bl
is
he
d
(10.34)
no
Light waves are transverse in nature; i.e., the electric field associated
with a propagating light wave is always at right angles to the direction of
propagation of the wave. This can be easily demonstrated using a simple
polaroid. You must have seen thin plastic like sheets, which are called
polaroids. A polaroid consists of long chain molecules aligned in a
particular direction. The electric vectors (associated with the propagating
light wave) along the direction of the aligned molecules get absorbed.
Thus, if an unpolarised light wave is incident on such a polaroid then
the light wave will get linearly polarised with the electric vector oscillating
along a direction perpendicular to the aligned molecules; this direction
is known as the pass-axis of the polaroid.
Thus, if the light from an ordinary source (like a sodium lamp) passes
through a polaroid sheet P1, it is observed that its intensity is reduced by
half. Rotating P1 has no effect on the transmitted beam and transmitted
intensity remains constant. Now, let an identical piece of polaroid P 2 be
placed before P1. As expected, the light from the lamp is reduced in
intensity on passing through P2 alone. But now rotating P1 has a dramatic
effect on the light coming from P2. In one position, the intensity transmitted
377
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
FIGURE 10.22 (a) Passage of light through two polaroids P 2 and P 1. The
transmitted fraction falls from 1 to 0 as the angle between them varies
from 0 to 90. Notice that the light seen through a single polaroid
P 1 does not vary with angle. (b) Behaviour of the electric vector
when light passes through two polaroids. The transmitted
polarisation is the component parallel to the polaroid axis.
The double arrows show the oscillations of the electric vector.
378
EXAMPLE 10.8
no
I = I 0 cos 2 ,
Wave Optics
tt
o N
be C
E
re R
pu T
bl
is
he
d
I = I 0 cos2 cos2
EXAMPLE 10.8
The light from a clear blue portion of the sky shows a rise and fall of
intensity when viewed through a polaroid which is rotated. This is nothing
but sunlight, which has changed its direction (having been scattered) on
encountering the molecules of the earths atmosphere. As Fig. 10.23(a)
shows, the incident sunlight is unpolarised. The dots stand for polarisation
perpendicular to the plane of the figure. The double arrows show
polarisation in the plane of the figure. (There is no phase relation between
these two in unpolarised light). Under the influence of the electric field of
the incident wave the electrons in the molecules acquire components of
motion in both these directions. We have drawn an observer looking at
90 to the direction of the sun. Clearly, charges accelerating parallel to
the double arrows do not radiate energy towards this observer since their
acceleration has no transverse component. The radiation scattered by
the molecule is therefore represented by dots. It is polarised
perpendicular to the plane of the figure. This explains the polarisation of
scattered light from the sky.
no
FIGURE 10.23 (a) Polarisation of the blue scattered light from the sky.
The incident sunlight is unpolarised (dots and arrows). A typical
molecule is shown. It scatters light by 90 polarised normal to
the plane of the paper (dots only). (b) Polarisation of light
reflected from a transparent medium at the Brewster angle
(reflected ray perpendicular to refracted ray).
379
Physics
A
tt
o N
be C
E
re R
pu T
bl
is
he
d
When light is incident on an interface of two media, it is observed that some part of it
gets reflected and some part gets transmitted. Consider a related question: Is it possible
that under some conditions a monochromatic beam of light incident on a surface
(which is normally reflective) gets completely transmitted with no reflection? To your
surprise, the answer is yes.
Let us try a simple experiment and check what happens. Arrange a laser, a good
polariser, a prism and screen as shown in the figure here.
Let the light emitted by the laser source pass through the polariser and be incident
on the surface of the prism at the Brewsters angle of incidence iB. Now rotate the
polariser carefully and you will observe that for a specific alignment of the polariser, the
light incident on the prism is completely transmitted and no light is reflected from the
surface of the prism. The reflected spot will completely vanish.
no
380
sin i B
sin i B
=
sin r
sin ( /2 i B )
Wave Optics
=
sin i B
= tan i B
cos iB
(10.36)
tt
o N
be C
E
re R
pu T
bl
is
he
d
EXAMPLE 10.9
no
SUMMARY
1.
2.
381
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
3.
is the same measured along any ray. This principle leads to the well
known laws of reflection and refraction.
The principle of superposition of waves applies whenever two or more
sources of light illuminate the same point. When we consider the
intensity of light due to these sources at the given point, there is an
interference term in addition to the sum of the individual intensities.
But this term is important only if it has a non-zero average, which
occurs only if the sources have the same frequency and a stable
phase difference.
Youngs double slit of separation d gives equally spaced fringes of
angular separation /d. The source, mid-point of the slits, and central
bright fringe lie in a straight line. An extended source will destroy
the fringes if it subtends angle more than /d at the slits.
A single slit of width a gives a diffraction pattern with a central
2
, etc.,
maximum. The intensity falls to zero at angles of ,
a
a
with successively weaker secondary maxima in between. Diffraction
limits the angular resolution of a telescope to /D where D is the
diameter. Two stars closer than this give strongly overlapping images.
Similarly, a microscope objective subtending angle 2 at the focus,
in a medium of refractive index n, will just separate two objects spaced
at a distance /(2n sin ), which is the resolution limit of a
microscope. Diffraction determines the limitations of the concept of
light rays. A beam of width a travels a distance a2 /, called the Fresnel
distance, before it starts to spread out due to diffraction.
Natural light, e.g., from the sun is unpolarised. This means the electric
vector takes all possible directions in the transverse plane, rapidly
and randomly, during a measurement. A polaroid transmits only one
component (parallel to a special axis). The resulting light is called
linearly polarised or plane polarised. When this kind of light is viewed
through a second polaroid whose axis turns through 2, two maxima
and minima of intensity are seen. Polarised light can also be produced
by reflection at a special angle (called the Brewster angle) and by
scattering through /2 in the earths atmosphere.
4.
5.
6.
POINTS TO PONDER
1.
no
2.
3.
4.
5.
382
Waves from a point source spread out in all directions, while light was
seen to travel along narrow rays. It required the insight and experiment
of Huygens, Young and Fresnel to understand how a wave theory could
explain all aspects of the behaviour of light.
The crucial new feature of waves is interference of amplitudes from different
sources which can be both constructive and destructive, as shown in
Youngs experiment.
Even a wave falling on single slit should be regarded as a large number of
sources which interefere constructively in the forward direction ( = 0),
and destructively in other directions.
Diffraction phenomena define the limits of ray optics. The limit of the
ability of microscopes and telescopes to distinguish very close objects is
set by the wavelength of light.
Most interference and diffraction effects exist even for longitudinal waves
like sound in air. But polarisation phenomena are special to transverse
waves like light waves.
Wave Optics
EXERCISES
Monochromatic light of wavelength 589 nm is incident from air on a
water surface. What are the wavelength, frequency and speed of
(a) reflected, and (b) refracted light? Refractive index of water is
1.33.
tt
o N
be C
E
re R
pu T
bl
is
he
d
10.1
10.2
(b) Light emerging out of a convex lens when a point source is placed
at its focus.
(c) The portion of the wavefront of light from a distant star intercepted
by the Earth.
10.3
(a) The refractive index of glass is 1.5. What is the speed of light in
glass? (Speed of light in vacuum is 3.0 108 m s1)
10.4
10.5
10.6
(b) What is the least distance from the central maximum where the
bright fringes due to both the wavelengths coincide?
no
10.7
10.8
10.9
10.10 Estimate the distance for which ray optics is good approximation
for an aperture of 4 mm and wavelength 400 nm.
383
Physics
ADDITIONAL EXERCISES
tt
o N
be C
E
re R
pu T
bl
is
he
d
10.11 The 6563 H line emitted by hydrogen in a star is found to be redshifted by 15 . Estimate the speed with which the star is receding
from the Earth.
10.12 Explain how Corpuscular theory predicts the speed of light in a
medium, say, water, to be greater than the speed of light in vacuum.
Is the prediction confirmed by experimental determination of the
speed of light in wate r? If not, which alternative picture of light is
consistent with experiment?
10.13 You have lear nt in the text how Huygens principle leads to the laws
of reflection and refraction. Use the same principle to deduce directly
that a point object placed in front of a plane mirror produces a
virtual image whose distance from the mirror is equal to the object
distance fr om the mirror.
10.14 Let us list some of the factors, which could possibly influence the
speed of wave propagation:
(i) nature of the source.
(ii) direction of propagation.
10.15 For sound waves, the Doppler formula for frequency shift differs
slightly between the two situations: (i) source at rest; observer
moving, and (ii) source moving; observer at rest. The exact Doppler
formulas for the case of light waves in vacuum are, however, strictly
identical for these situations. Explain why this should be so. Would
you expect the formulas to be strictly identical for the two situations
in case of light travelling in a medium ?
no
384
Wave Optics
tt
o N
be C
E
re R
pu T
bl
is
he
d
10.18
10.19
10.20
no
10.21
385
Physics
Chapter Eleven
tt
o N
be C
E
re R
pu T
bl
is
he
d
DUAL NATURE OF
RADIATION AND
MATTER
no
11.1 I NTRODUCTION
386
tt
o N
be C
E
re R
pu T
bl
is
he
d
particles. They were found to travel with speeds ranging from about 0.1
to 0.2 times the speed of light (3 108 m/s). The presently accepted value
of e/m is 1.76 1011 C/kg. Further, the value of e/m was found to be
independent of the nature of the material/metal used as the cathode
(emitter), or the gas introduced in the discharge tube. This observation
suggested the universality of the cathode ray particles.
Around the same time, in 1887, it was found that certain metals, when
irradiated by ultraviolet light, emitted negatively charged particles having
small speeds. Also, certain metals when heated to a high temperature were
found to emit negatively charged particles. The value of e/m of these particles
was found to be the same as that for cathode ray particles. These
observations thus established that all these particles, although produced
under different conditions, were identical in nature. J. J. Thomson, in 1897,
named these particles as electrons, and suggested that they were
fundamental, universal constituents of matter. For his epoch-making
discovery of electron, through his theoretical and experimental
investigations on conduction of electricity by gasses, he was awarded the
Nobel Prize in Physics in 1906. In 1913, the American physicist R. A.
Millikan (1868-1953) performed the pioneering oil-drop experiment for
the precise measurement of the charge on an electron. He found that the
charge on an oil-droplet was always an integral multiple of an elementary
charge, 1.602 10 19 C. Millikans experiment established that electric
charge is quantised. From the values of charge (e ) and specific charge
(e/m ), the mass (m) of the electron could be determined.
no
We know that metals have free electrons (negatively charged particles) that
are responsible for their conductivity. However, the free electrons cannot
normally escape out of the metal surface. If an electron attempts to come
out of the metal, the metal surface acquires a positive charge and pulls the
electron back to the metal. The free electron is thus held inside the metal
surface by the attractive forces of the ions. Consequently, the electron can
come out of the metal surface only if it has got sufficient energy to overcome
the attractive pull. A certain minimum amount of energy is required to be
given to an electron to pull it out from the surface of the metal. This
minimum energy required by an electron to escape from the metal surface
is called the work function of the metal. It is generally denoted by 0 and
measured in eV (electron volt). One electron volt is the energy gained by an
electron when it has been accelerated by a potential difference of 1 volt, so
that 1 eV = 1.602 10 19 J.
This unit of energy is commonly used in atomic and nuclear physics.
The work function ( 0 ) depends on the properties of the metal and the
nature of its surface. The values of work function of some metals are
given in Table 11.1. These values are approximate as they are very
sensitive to surface impurities.
Note from Table 11.1 that the work function of platinum is the highest
(0 = 5.65 eV ) while it is the lowest ( 0 = 2.14 eV) for caesium.
The minimum energy required for the electron emission from the metal
surface can be supplied to the free electrons by any one of the following
physical processes:
387
Physics
TABLE 11.1 WORK FUNCTIONS OF
SOME METALS
Work function
(eV)
Metal
Work function
(eV)
Cs
2.14
Al
4.28
2.30
Hg
4.49
Na
2.75
Cu
4.65
Ca
3.20
Ag
4.70
Mo
4.17
Ni
5.15
Pb
4.25
Pt
5.65
tt
o N
be C
E
re R
pu T
bl
is
he
d
Metal
no
388
OF
http://www.kcvs.ca/site/projects/physics.html
tt
o N
be C
E
re R
pu T
bl
is
he
d
P HOTOELECTRIC
no
Figure 11.1 depicts a schematic view of the arrangement used for the
experimental study of the photoelectric effect. It consists of an evacuated
glass/quartz tube having a photosensitive plate C and another metal
plate A. Monochromatic light from the source S of sufficiently short
wavelength passes through the window W and falls on the photosensitive
plate C (emitter). A transparent quartz window is sealed on to the glass
tube, which permits ultraviolet radiation to pass through it and irradiate
the photosensitive plate C. The electrons are emitted by the plate C and
are collected by the plate A (collector), by the electric field created by the
battery. The battery maintains the potential difference between the plates
C and A, that can be varied. The polarity of the plates C and A can be
reversed by a commutator. Thus, the plate A can be maintained at a desired
positive or negative potential with respect to emitter C. When the collector
plate A is positive with respect to the emitter plate C, the electrons are
389
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
390
We first keep the plate A at some positive accelerating potential with respect
to the plate C and illuminate the plate C with light of fixed frequency
and fixed intensity I1. We next vary the positive potential of plate A gradually
and measure the resulting photocurrent each time. It is found that the
photoelectric current increases with increase in accelerating (positive)
potential. At some stage, for a certain positive potential of plate A, all the
emitted electrons are collected by the plate A and the photoelectric current
becomes maximum or saturates. If we increase the accelerating potential
of plate A further, the photocurrent does not increase. This maximum
value of the photoelectric current is called saturation current. Saturation
current corresponds to the case when all the photoelectrons emitted by
the emitter plate C reach the collector plate A.
We now apply a negative (retarding) potential to the plate A with respect
to the plate C and make it increasingly negative gradually. When the
tt
o N
be C
E
re R
pu T
bl
is
he
d
(11.1)
We can now repeat this experiment with incident radiation of the same
frequency but of higher intensity I2 and I3 (I3 > I2 > I 1). We note that the
saturation currents are now found to be at higher values. This shows
that more electrons are being emitted per second, proportional to the
intensity of incident radiation. But the stopping potential remains the
same as that for the incident radiation of intensity I1, as shown graphically
in Fig. 11.3. Thus, for a given frequency of the incident radiation, the
stopping potential is independent of its intensity. In other words, the
maximum kinetic energy of photoelectrons depends on the light source
and the emitter plate material, but is independent of intensity of incident
radiation.
no
391
Physics
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
tt
o N
be C
E
re R
pu T
bl
is
he
d
AND
WAVE T HEORY
no
The wave nature of light was well established by the end of the nineteenth
century. The phenomena of interference, diffraction and polarisation were
explained in a natural and satisfactory way by the wave picture of light.
According to this picture, light is an electromagnetic wave consisting of
electric and magnetic fields with continuous distribution of energy over
the region of space over which the wave is extended. Let us now see if this
wave picture of light can explain the observations on photoelectric
emission given in the previous section.
According to the wave picture of light, the free electrons at the surface
of the metal (over which the beam of radiation falls) absorb the radiant
energy continuously. The greater the intensity of radiation, the greater are
the amplitude of electric and magnetic fields. Consequently, the greater
the intensity, the greater should be the energy absorbed by each electron.
In this picture, the maximum kinetic energy of the photoelectrons on the
surface is then expected to increase with increase in intensity. Also, no
matter what the frequency of radiation is, a sufficiently intense beam of
radiation (over sufficient time) should be able to impart enough energy to
the electrons, so that they exceed the minimum energy needed to escape
from the metal surface . A threshold frequency, therefore, should not exist.
These expectations of the wave theory directly contradict observations (i),
(ii) and (iii) given at the end of sub-section 11.4.3.
Further, we should note that in the wave picture, the absorption of
energy by electron takes place continuously over the entire
wavefront of the radiation. Since a large number of electrons absorb energy,
the energy absorbed per electron per unit time turns out to be small.
Explicit calculations estimate that it can take hours or more for a single
electron to pick up sufficient energy to overcome the work function and
come out of the metal. This conclusion is again in striking contrast to
observation (iv) that the photoelectric emission is instantaneous. In short,
the wave picture is unable to explain the most basic features of
photoelectric emission.
393
Physics
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
394
0 =
(11.3)
h
Equation (11.3) shows that the greater the work
function 0, the higher the minimum or threshold
frequency 0 needed to emit photoelectrons. Thus,
there exists a threshold frequency0 (= 0 /h) for the
metal surface, below which no photoelectric emission
is possible, no matter how intense the incident
radiation may be or how long it falls on the surface.
In this picture, intensity of radiation as noted above,
is proportional to the number of energy quanta per
unit area per unit time. The greater the number of
energy quanta available, the greater is the number of
electrons absorbing the energy quanta and greater,
therefore, is the number of electrons coming out of
the metal (for > 0 ). This explains why, for > 0 ,
photoelectric current is proportional to intensity.
tt
o N
be C
E
re R
pu T
bl
is
he
d
e V 0 = h 0; for 0
0
(11.4)
e
e
This is an important result. It predicts that the V0 versus curve is a
straight line with slope = (h/e), independent of the nature of the material.
During 1906-1916, Millikan performed a series of experiments on
photoelectric effect, aimed at disproving Einsteins photoelectric equation.
He measured the slope of the straight line obtained for sodium, similar to
that shown in Fig. 11.5. Using the known value of e, he determined the
value of Plancks constant h . This value was close to the value of Plancks
contant (= 6.626 1034J s) determined in an entirely different context.
In this way, in 1916, Millikan proved the validity of Einsteins photoelectric
equation, instead of disproving it.
The successful explanation of photoelectric effect using the hypothesis
of light quanta and the experimental determination of values of h and 0 ,
in agreement with values obtained from other experiments, led to the
acceptance of Einsteins picture of photoelectric effect. Millikan verified
photoelectric equation with great precision, for a number of alkali metals
over a wide range of radiation frequencies.
or V0 =
OF
no
Photoelectric effect thus gave evidence to the strange fact that light in
interaction with matter behaved as if it was made of quanta or packets of
energy, each of energy h .
Is the light quantum of energy to be associated with a particle? Einstein
arrived at the important result, that the light quantum can also be
associated with momentum (h /c). A definite value of energy as well as
momentum is a strong sign that the light quantum can be associated
with a particle. This particle was later named photon. The particle-like
behaviour of light was further confirmed, in 1924, by the experiment of
A.H. Compton (1892-1962) on scattering of X-rays from electrons. In
1921, Einstein was awarded the Nobel Prize in Physics for his contribution
to theoretical physics and the photoelectric effect. In 1923, Millikan was
awarded the Nobel Prize in physics for his work on the elementary
charge of electricity and on the photoelectric effect.
We can summarise the photon picture of electromagnetic radiation
as follows:
395
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
EXAMPLE 11.1
Solution
(a) Each photon has an energy
34
14
E = h = ( 6.63 10 J s) (6.0 10 Hz)
19
= 3.98 10 J
(b) If N is the number of photons emitted by the source per second,
the power P transmitted in the beam equals N times the energy
per photon E, so that P = N E. Then
N=
P 2.0 103 W
=
19
E 3.98 10 J
15
= 5.0 10
Example 11.2 The work function of caesium is 2.14 eV. Find (a) the
threshold frequency for caesium, and (b) the wavelength of the incident
light if the photocurrent is brought to zero by a stopping potential of
0.60 V.
396
0 = 0 =
h
EXAMPLE 11.2
no
Solution
(a) For the cut-off or threshold frequency, the energy h 0 of the incident
radiation must be equal to work function 0, so that
2.14eV
6.63 10 34 J s
2.14 1.6 10 19 J
= 5.16 1014 Hz
6.63 10 34 J s
hc
= hc/(eV0 + 0 )
=
19.89 1026 J m
(2.74 eV)
19.89 1026 J m
= 454 nm
2.74 1.6 10 19 J
EXAMPLE 11.2
tt
o N
be C
E
re R
pu T
bl
is
he
d
or,
1.989 10 25 J m
no
EXAMPLE 11.3
397
Physics
11.8 W AVE NATURE
OF
MATTER
tt
o N
be C
E
re R
pu T
bl
is
he
d
h
h
=
p
mv
(11.5)
where m is the mass of the particle and v its speed. Equation (11.5) is
known as the de Broglie relation and the wavelength of the matter
wave is called de Broglie wavelength. The dual aspect of matter is evident
in the de Broglie relation. On the left hand side of Eq. (11.5), is the
attribute of a wave while on the right hand side the momentum p is a
typical attribute of a particle. Plancks constant h relates the two
attributes.
Equation (11.5) for a material particle is basically a hypothesis whose
validity can be tested only by experiment. However, it is interesting to see
that it is satisfied also by a photon. For a photon, as we have seen,
p = h /c
(11.6)
no
Therefore,
398
h c
= =
(11.7)
p
That is, the de Broglie wavelength of a photon given by Eq. (11.5) equals
the wavelength of electromagnetic radiation of which the photon is a
quantum of energy and momentum.
Clearly, from Eq. (11.5 ), is smaller for a heavier particle ( large m ) or
more energetic particle (large v). For example, the de Broglie wavelength
of a ball of mass 0.12 kg moving with a speed of 20 m s 1 is easily
calculated:
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
p = m v = 0.12 kg 20 m s1 = 2.40 kg m s1
6.63 10 34 J s
h
=
= 2.76 1034 m
2.40 kg m s 1
p
399
tt
o N
be C
E
re R
pu T
bl
is
he
d
Physics
K =e V
Now, K =
no
400
p=
(11.8)
1
p2
m v2 =
, so that
2
2m
2m K =
2 m eV
(11.9)
h
h
h
=
=
p
2mK
2 m eV
(11.10)
nm
(11.11)
V
where V is the magnitude of accelerating potential in
volts. For a 120 V accelerating potential, Eq. (11.11) gives
= 0.112 nm. This wavelength is of the same order as
the spacing between the atomic planes in crystals. This
suggests that matter waves associated with an electron could be verified
by crystal diffraction experiments analogous to X-ray diffraction. We
describe the experimental verification of the de Broglie hypothesis in the
next section. In 1929, de Broglie was awarded the Nobel Prize in Physics
for his discovery of the wave nature of electrons.
The matterwave picture elegantly incorporated the Heisenbergs
uncertainty principle. According to the principle, it is not possible to
measure both the position and momentum of an electron (or any other
particle) at the same time exactly. There is always some uncertainty ( x )
in the specification of position and some uncertainty (p ) in the
specification of momentum. The product of x and p is of the order of *
(with = h/2), i.e.,
x p
(11.12)
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
Solution
(a) For the electron:
31
6
Mass m = 9.1110 kg, speed v = 5.410 m/s. Then, momentum
31
6
p = m v = 9.1110 (kg) 5.4 10 (m/s)
24
p = 4.92 10 kg m/s
de Broglie wavelength, = h/p
6.63 1034 J s
24
4. 92 10 kg m/s
= 0.135 nm
=
EXAMPLE 11.4
401
6. 63 1034 J s
4. 50 kg m/s
34
= 1.47 10 m
The de Broglie wavelength of electron is comparable with X-ray
19
wavelengths. However, for the ball it is about 10
times the size of
the proton, quite beyond experimental measurement.
tt
o N
be C
E
re R
pu T
bl
is
he
d
EXAMPLE 11.4
Physics
EXAMPLE 11.5
Solution
For a particle, de Broglie wavelength, = h/p
2
Kinetic energy, K = p /2m
Then, = h / 2mK
For the same kinetic energy K, the de Broglie wavelength associated
with the particle is inversely proportional to the square root of their
masses. A proton
1
1
an -particle 2
Hence, particle has the shortest de Broglie wavelength.
P ROBABILITY
402
EXAMPLE 11.6
no
It is worth pausing here to reflect on just what a matter wave associated with a particle,
say, an electron, means. Actually, a truly satisfactory physical understanding of the
dual nature of matter and radiation has not emerged so far. The great founders of
quantum mechanics (Niels Bohr, Albert Einstein, and many others) struggled with this
and related concepts for long. Still the deep physical interpretation of quantum
mechanics continues to be an area of active research. Despite this, the concept of
matter wave has been mathematically introduced in modern quantum mechanics with
great success. An important milestone in this connection was when Max Born (18821970) suggested a probability interpretation to the matter wave amplitude. According
to this, the intensity (square of the amplitude) of the matter wave at a point determines
the probability density of the particle at that point. Probability density means probability
per unit volume. Thus, if A is the amplitude of the wave at a point, |A| 2 V is the
probability of the particle being found in a small volume V around that point. Thus,
if the intensity of matter wave is large in a certain region, there is a greater probability
of the particle being found there than where the intensity is small.
h h
=
p mv
ve
v
EXAMPLE 11.6
tt
o N
be C
E
re R
pu T
bl
is
he
d
Mass, m = h/v
For an electron, mass m e = h/e ve
Now, we have v/ve = 3 and
/ e = 1.813 10 4
1.227
nm = 0.123 nm
100
The de Broglie wavelength associated with an electron in this case is of
the order of X-ray wavelengths.
=
11.9 DAVISSON
AND
EXAMPLE 11.7
= h /p =
GERMER EXPERIMENT
no
403
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
http://www.nobelprize.org/nobel_prizes/physics/laureates/1986/presentation-speech.html
Physics
404
= h /p =
1 .227
54
1 .227
V
nm
nm = 0.167 nm
tt
o N
be C
E
re R
pu T
bl
is
he
d
1.
2.
3.
4.
5.
6.
The classical wave theory could not explain the main features of
photoelectric effect. Its picture of continuous absorption of energy from
radiation could not explain the independence of Kmax on intensity, the
existence of o and the instantaneous nature of the process. Einstein
explained these features on the basis of photon picture of light.
According to this, light is composed of discrete packets of energy called
quanta or photons. Each photon carries an energy E (= h ) and
momentum p (= h/), which depend on the frequency ( ) of incident
light and not on its intensity. Photoelectric emission from the metal
surface occurs due to absorption of a photon by an electron.
7.
no
1
m v2max = V0 e = h 0 = h ( 0 )
2
8.
405
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
Physical
Quantity
Symbol
Dimensions
Unit
Remarks
Plancks
constant
[ML2 T 1 ]
Js
E = h
Stopping
potential
V0
[ML 2 T 3A 1]
e V 0= Kmax
Work
function
[ML2 T 2 ]
J ; eV
Kmax = E 0
Threshold
frequency
[T 1]
Hz
0 = 0 /h
de Broglie
wavelength
[L]
= h/p
POINTS TO PONDER
1.
no
2.
406
3.
Free electrons in a metal are free in the sense that they move inside the
metal in a constant potential (This is only an approximation). They are
not free to move out of the metal. They need additional energy to get
out of the metal.
Free electrons in a metal do not all have the same energy. Like molecules
in a gas jar, the electrons have a certain energy distribution at a given
temperature. This distribution is different from the usual Maxwells
distribution that you have learnt in the study of kinetic theory of gases.
You will lear n about it in later courses, but the difference has to do
with the fact that electrons obey Paulis exclusion principle.
Because of the energy distribution of free electrons in a metal, the
energy required by an electron to come out of the metal is different for
different electrons. Electrons with higher energy require less additional
energy to come out of the metal than those with lower energies. Work
function is the least energy required by an electron to come out of the
metal.
tt
o N
be C
E
re R
pu T
bl
is
he
d
5.
Observations on photoelectric ef fect imply that in the event of matterlight interaction, absorption of energy takes place in discrete units of h .
This is not quite the same as saying that light consists of particles,
each of energy h .
Observations on the stopping potential (its independence of intensity
and dependence on frequency) are the crucial discriminator between
the wave-picture and photon-picture of photoelectric effect.
6.
h
has physical
p
significance; its phase velocity vp has no physical significance. However,
the group velocity of the matter wave is physically meaningful and
equals the velocity of the particle.
The wavelength of a matter wave given by =
EXERCISES
11.1
11.2
Find the
(a) maximum frequency, and
(b) minimum wavelength of X-rays produced by 30 kV electrons.
The work function of caesium metal is 2.14 eV. When light of
frequency 6 1014Hz is incident on the metal surface, photoemission
of electrons occurs. What is the
(a) maximum kinetic energy of the emitted electrons,
(b) Stopping potential, and
(c) maximum speed of the emitted photoelectrons?
11.3
11.4
(c) How fast does a hydrogen atom have to travel in order to have
the same momentum as that of the photon?
The energy flux of sunlight reaching the surface of the earth is
3
2
1.388 10 W/m . How many photons (nearly) per square metre are
incident on the Earth per second ? Assume that the photons in the
sunlight have an average wavelength of 550 nm.
no
11.5
11.6
11.7
407
Physics
with the sodium light? (b) At what rate are the photons delivered to
the sphere?
The threshold frequency for a certain metal is 3.3 10 14 Hz. If light
of frequency 8.2 1014 Hz is incident on the metal, predict the cutoff voltage for the photoelectric emission.
11.9 The work function for a certain metal is 4.2 eV. Will this metal give
photoelectric emission for incident radiation of wavelength 330 nm?
11.10 Light of frequency 7.21 10 14 Hz is incident on a metal surface.
5
Electrons with a maximum speed of 6.0 10 m/s are ejected from
the surface. What is the threshold frequency for photoemission of
electrons?
tt
o N
be C
E
re R
pu T
bl
is
he
d
11.8
no
11.16 An
(a)
(b)
(c)
408
11.17 (a) For what kinetic energy of a neutron will the associated de Broglie
wavelength be 1.40 10 10 m?
(b) Also find the de Broglie wavelength of a neutron, in thermal
equilibrium with matter, having an average kinetic energy of
(3/2) k T at 300 K.
11.18 Show that the wavelength of electromagnetic radiation is equal to
the de Broglie wavelength of its quantum (photon).
11.19 What is the de Broglie wavelength of a nitrogen molecule in air at
300 K ? Assume that the molecule is moving with the root-meansquare speed of molecules at this temperature. (Atomic mass of
nitrogen = 14.0076 u)
ADDITIONAL EXERCISES
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
11.20 (a) Estimate the speed with which electrons emitted from a heated
emitter of an evacuated tube impinge on the collector maintained
at a potential difference of 500 V with respect to the emitter.
Ignore the small initial speeds of the electrons. The
specific charge of the electron, i.e., its e/m is given to be
11
1
1.76 10 C kg .
(b) Use the same formula you employ in (a) to obtain electron speed
for an collector potential of 10 MV. Do you see what is wrong ? In
what way is the formula to be modified ?
11.21 (a) A monoenergetic electron beam with electron speed of
5.20 106 m s1 is subject to a magnetic field of 1.30 104 T
normal to the beam velocity. What is the radius of the circle traced
by the beam, given e/m for electron equals 1.76 1011C kg1.
(b) Is the formula you employ in (a) valid for calculating radius of
the path of a 20 MeV electron beam? If not, in what way is it
modified ?
[Note: Exercises 11.20(b) and 11.21(b) take you to relativistic
mechanics which is beyond the scope of this book. They have been
inserted here simply to emphasise the point that the formulas you
use in part (a) of the exercises are not valid at very high speeds or
energies. See answers at the end to know what very high speed or
energy means.]
11.22 An electron gun with its collector at a potential of 100 V fires out
electrons in a spherical bulb containing hydrogen gas at low
pressure (102 mm of Hg). A magnetic field of 2.83 10 4 T curves
the path of the electrons in a circular orbit of radius 12.0 cm. (The
path can be viewed because the gas ions in the path focus the beam
by attracting electrons, and emitting light by electron capture; this
method is known as the fine beam tube method.) Determine
e/m from the data.
11.23 (a) An X-ray tube produces a continuous spectrum of radiation with
its short wavelength end at 0.45 . What is the maximum energy
of a photon in the radiation?
(b) From your answer to (a), guess what order of accelerating voltage
(for electrons) is required in such a tube ?
11.24 In an accelerator experiment on high-energy collisions of electrons
with positrons, a certain event is interpreted as annihilation of an
electron-positron pair of total energy 10.2 BeV into two -rays of
equal energy. What is the wavelength associated with each -ray?
9
(1BeV = 10 eV)
11.25 Estimating the following two numbers should be interesting. The
first number will tell you why radio engineers do not need to worry
much about photons! The second number tells you why our eye can
never count photons, even in barely detectable light.
(a) The number of photons emitted per second by a Medium wave
transmitter of 10 kW power, emitting radiowaves of wavelength
500 m.
(b) The number of photons entering the pupil of our eye per second
corresponding to the minimum intensity of white light that we
409
Physics
humans can perceive (1010 W m 2). Take the area of the pupil
to be about 0.4 cm2, and the average frequency of white light to
be about 6 1014 Hz.
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
410
Na: 2.75 eV; K: 2.30 eV; Mo: 4.17 eV; Ni: 5.15 eV. Which of these
metals will not give photoelectric emission for a radiation of
wavelength 3300 from a He-Cd laser placed 1 m away from the
photocell? What happens if the laser is brought nearer and placed
50 cm away ?
11.30 Light of intensity 105 W m2 falls on a sodium photo-cell of surface
2
area 2 cm . Assuming that the top 5 layers of sodium absorb the
incident energy, estimate time required for photoelectric emission
in the wave-picture of radiation. The work function for the metal is
given to be about 2 eV. What is the implication of your answer?
11.31 Crystal diffraction experiments can be performed using X-rays, or
electrons accelerated through appropriate voltage. Which probe has
greater energy? (For quantitative comparison, take the wavelength
of the pr obe equal to 1 , which is of the order of inter -atomic spacing
31
in the lattice) (m e=9.11 10
kg).
11.32 (a) Obtain the de Broglie wavelength of a neutron of kinetic energy
150 eV. As you have seen in Exer cise 11.31, an electron beam of
this ener gy is suitable for crystal diffraction experiments. Would
a neutron beam of the same energy be equally suitable ? Explain.
27
(m n = 1.675 10 kg)
tt
o N
be C
E
re R
pu T
bl
is
he
d
11.33
11.34
11.35
11.36
no
[Note: Exercises 11.35 and 11.36 reveal that while the wave-packets
associated with gaseous molecules under ordinary conditions are
non-overlapping, the electron wave-packets in a metal strongly
overlap with one another. This suggests that whereas molecules in
an ordinary gas can be distinguished apart, electrons in a metal
cannot be distintguished apart from one another. This
indistinguishibility has many fundamental implications which you
will explore in more advanced Physics courses.]
11.37 Answer the following questions:
(a) Quarks inside protons and neutrons are thought to carry
fractional charges [(+2/3)e ; (1/3)e]. Why do they not show up
in Millikans oil-drop experiment ?
(b) What is so special about the combination e/m ? Why do we not
simply talk of e and m separately?
(c) Why should gases be insulators at ordinary pressures and start
conducting at very low pressures ?
(d) Every metal has a definite work function. Why do all
photoelectrons not come out with the same energy if incident
radiation is monochromatic? Why is there an energy distribution
of photoelectrons ?
(e) The energy and momentum of an electron are related to the
frequency and wavelength of the associated matter wave by the
relations:
E = h , p =
411
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
APPENDIX
no
What is light? This question has haunted mankind for a long time. But systematic experiments were done by
scientists since the dawn of the scientific and industrial era, about four centuries ago. Around the same time,
theoretical models about what light is made of were developed. While building a model in any branch of
science, it is essential to see that it is able to explain all the experimental observations existing at that time.
It is therefore appropriate to summarize some observations about light that were known in the seventeenth
century.
The properties of light known at that time included (a) rectilinear propagation of light, (b) reflection from
plane and curved surfaces, (c) refraction at the boundary of two media, (d) dispersion into various colours, (e)
high speed. Appropriate laws were formulated for the first four phenomena. For example, Snell formulated his
laws of refraction in 1621. Several scientists right from the days of Galileo had tried to measure the speed of
light. But they had not been able to do so. They had only concluded that it was higher than the limit of their
measurement.
Two models of light were also proposed in the seventeenth century. Descartes, in early decades of seventeenth
century, proposed that light consists of particles, while Huygens, around 1650-60, proposed that light consists
of waves. Descartes proposal was merely a philosophical model, devoid of any experiments or scientific
ar guments. Newton soon after, ar ound 1660-70, extended Descartes particle model, known as corpuscular
theory , built it up as a scientific theory, and explained various known properties with it. These models, light
as waves and as particles, in a sense, are quite opposite of each other. But both models could explain all the
known properties of light. There was nothing to choose between them.
The history of the development of these models over the next few centuries is interesting. Bartholinus, in
1669, discovered double refraction of light in some crystals, and Huygens, in 1678, was quick to explain it on
the basis of his wave theory of light. In spite of this, for over one hundred years, Newtons particle model was
firmly believed and preferred over the wave model. This was partly because of its simplicity and partly because
of Newtons influence on contemporary physics.
Then in 1801, Young performed his double-slit experiment and observed interference fringes. This
phenomenon could be explained only by wave theory. It was realized that diffraction was also another
phenomenon which could be explained only by wave theory. In fact, it was a natural consequence of Huygens
idea of secondary wavelets emanating from every point in the path of light. These experiments could not be
explained by assuming that light consists of particles. Another phenomenon of polarisation was discovered
around 1810, and this too could be naturally explained by the wave theory. Thus wave theory of Huygens
came to the forefront and Newtons particle theory went into the background. This situation again continued
for almost a century.
Better experiments were performed in the nineteenth century to determine the speed of light. With more
accurate experiments, a value of 3108 m/s for speed of light in vacuum was arrived at. Around 1860, Maxwell
proposed his equations of electromagnetism and it was realized that all electromagnetic phenomena known at
that time could be explained by Maxwells four equations. Soon Maxwell showed that electric and magnetic
fields could propagate through empty space (vacuum) in the form of electromagnetic waves. He calculated the
speed of these waves and arrived at a theoretical value of 2.998108 m/s. The close agreement of this value
with the experimental value suggested that light consists of electromagnetic waves. In 1887 Hertz demonstrated
the generation and detection of such waves. This established the wave theory of light on a firm footing. We
might say that while eighteenth century belonged to the particle model, the nineteenth century belonged to
the wave model of light.
Vast amounts of experiments were done during the period 1850-1900 on heat and related phenomena, an
altogether different area of physics. Theories and models like kinetic theory and thermodynamics were developed
which quite successfully explained the various phenomena, except one.
412
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
Every body at any temperature emits radiation of all wavelengths. It also absorbs radiation falling on it.
A body which absorbs all the radiation falling on it is called a black body. It is an ideal concept in physics, like
concepts of a point mass or uniform motion. A graph of the intensity of radiation emitted by a body versus
wavelength is called the black body spectrum. No theory in those days could explain the complete black body
spectrum!
In 1900, Planck hit upon a novel idea. If we assume, he said, that radiation is emitted in packets of energy
instead of continuously as in a wave, then we can explain the black body spectrum. Planck himself regarded
these quanta, or packets, as a property of emission and absorption, rather than that of light. He derived a
formula which agreed with the entire spectrum. This was a confusing mixture of wave and particle pictures
radiation is emitted as a particle, it travels as a wave, and is again absorbed as a particle! Moreover, this put
physicists in a dilemma. Should we again accept the particle picture of light just to explain one phenomenon?
Then what happens to the phenomena of interference and diffraction which cannot be explained by the
particle model?
But soon in 1905, Einstein explained the photoelectric effect by assuming the particle picture of light.
In 1907, Debye explained the low temperature specific heats of solids by using the particle picture for lattice
vibrations in a crystalline solid. Both these phenomena belonging to widely diverse areas of physics could be
explained only by the particle model and not by the wave model. In 1923, Comptons x-ray scattering experiments
from atoms also went in favour of the particle pictur e. This increased the dilemma further.
Thus by 1923, physicists faced with the following situation. (a) There were some phenomena like rectilinear
propagation, reflection, refraction, which could be explained by either particle model or by wave model. (b)
There were some phenomena such as diffraction and interference which could be explained only by the wave
model but not by the particle model. (c) There were some phenomena such as black body radiation, photoelectric
effect, and Compton scattering which could be explained only by the particle model but not by the wave model.
Somebody in those days aptly remarked that light behaves as a particle on Mondays, Wednesdays and Fridays,
and as a wave on Tuesdays, Thursdays and Saturdays, and we dont talk of light on Sundays!
In 1924, de Broglie proposed his theory of wave-particle duality in which he said that not only photons
of light but also particles of matter such as electrons and atoms possess a dual character, sometimes
behaving like a particle and sometimes as a wave. He gave a formula connecting their mass, velocity, momentum
(particle characteristics), with their wavelength and frequency (wave characteristics)! In 1927 Thomson, and
Davisson and Germer, in separate experiments, showed that electrons did behave like waves with a wavelength
which agreed with that given by de Broglies formula. Their experiment was on diffraction of electrons through
crystalline solids, in which the regular arrangement of atoms acted like a grating. Very soon, diffraction
experiments with other particles such as neutrons and protons were performed and these too confirmed with
de Broglies formula. This confirmed wave-particle duality as an established principle of physics. Here was a
principle, physicists thought, which explained all the phenomena mentioned above not only for light but also
for the so-called particles.
But there was no basic theoretical foundation for wave-particle duality. De Broglies proposal was
merely a qualitative argument based on symmetry of nature. Wave-particle duality was at best a principle, not
an outcome of a sound fundamental theory. It is true that all experiments whatever agreed with de Broglie
formula. But physics does not work that way. On the one hand, it needs experimental confirmation, while on
the other hand, it also needs sound theoretical basis for the models proposed. This was developed over the
next two decades. Dirac developed his theory of radiation in about 1928, and Heisenberg and Pauli gave it a
firm footing by 1930. T omonaga, Schwinger, and Feynman, in late 1940s, produced further refinements and
cleared the theory of inconsistencies which were noticed. All these theories mainly put wave-particle duality
on a theoretical footing.
Although the story continues, it grows more and more complex and beyond the scope of this note. But
we have here the essential structure of what happened, and let us be satisfied with it at the moment. Now it
is regarded as a natural consequence of present theories of physics that electromagnetic radiation as well as
particles of matter exhibit both wave and particle properties in different experiments, and sometimes even in
the different parts of the same experiment.
413
Physics
Chapter Twelve
ATOMS
12.1 INTRODUCTION
414
Atoms
415
Physics
416
Atoms
be provided if the greater part of the
mass of the atom and its positive charge
were concentrated tightly at its centre.
Then the incoming -particle could get
very close to the positive charge without
penetrating it, and such a close
encounter would result in a large
deflection. This agreement supported
the hypothesis of the nuclear atom. This
is why Rutherford is credited with the
discovery of the nucleus.
In Rutherfords nuclear model of
the atom, the entire positive charge and
most of the mass of the atom are
concentrated in the nucleus with the
electrons some distance away. The
electrons would be moving in orbits
FIGURE 12.3 Experimental data points (shown by
about the nucleus just as the planets
dots) on scattering of -particles by a thin foil at
do around the sun. Rutherfords
different angles obtained by Geiger and Marsden
experiments suggested the size of
using the setup shown in Figs. 12.1 and
the nucleus to be about 1015 m to
12.2. Rutherfords nuclear model predicts the solid
1014 m. From kinetic theory, the size
curve which is seen to be in good agreement with
experiment.
of an atom was known to be 1010 m,
about 10,000 to 100,000 times larger
than the size of the nucleus (see Chapter 11, Section 11.6 in Class XI
Physics textbook). Thus, the electrons would seem to be at a distance
from the nucleus of about 10,000 to 100,000 times the size of the nucleus
itself. Thus, most of an atom is empty space. With the atom being largely
empty space, it is easy to see why most -particles go right through a
thin metal foil. However, when -particle happens to come near a nucleus,
the intense electric field there scatters it through a large angle. The atomic
electrons, being so light, do not appreciably affect the -particles.
The scattering data shown in Fig. 12.3 can be analysed by employing
Rutherfords nuclear model of the atom. As the gold foil is very thin, it
can be assumed that -particles will suffer not more than one scattering
during their passage through it. Therefore, computation of the trajectory
of an alpha-particle scattered by a single nucleus is enough. Alphaparticles are nuclei of helium atoms and, therefore, carry two units, 2e,
of positive charge and have the mass of the helium atom. The charge of
the gold nucleus is Ze, where Z is the atomic number of the atom; for
gold Z = 79. Since the nucleus of gold is about 50 times heavier than an
-particle, it is reasonable to assume that it remains stationary
throughout the scattering process. Under these assumptions, the
trajectory of an alpha-particle can be computed employing Newtons
second law of motion and the Coulombs law for electrostatic
force of repulsion between the alpha-particle and the positively
417
charged nucleus.
Physics
The magnitude of this force is
1 (2e )( Ze )
F
(12.1)
4 0
r2
where r is the distance between the -particle and the nucleus. The force
is directed along the line joining the -particle and the nucleus. The
magnitude and direction of the force on an -particle continuously
changes as it approaches the nucleus and recedes away from it.
418
EXAMPLE 12.1
Atoms
Example 12.2 In a Geiger-Marsden experiment, what is the distance
of closest approach to the nucleus of a 7.7 MeV -particle before it
comes momentarily to rest and reverses its direction?
(2e )( Ze )
d
2Ze 2
4 0d
2 Ze 2
4 0K
EXAMPLE 12.2
The radius of gold nucleus is, therefore, less than 3.0 1014 m. This
is not in very good agreement with the observed result as the actual
radius of gold nucleus is 6 fm. The cause of discrepancy is that the
distance of closest approach is considerably larger than the sum of
the radii of the gold nucleus and the -particle. Thus, the -particle
reverses its motion without ever actually touching the gold nucleus.
http://www-outreach.phy.cam.ac.uk/camphy/nucleus/nucleus6_1.htm
Solution The key idea here is that throughout the scattering process,
the total mechanical energy of the system consisting of an -particle
and a gold nucleus is conserved. The systems initial mechanical
energy is Ei, before the particle and nucleus interact, and it is equal
to its mechanical energy Ef when the -particle momentarily stops.
The initial energy Ei is just the kinetic energy K of the incoming
- particle. The final energy Ef is just the electric potential energy U
of the system. The potential energy U can be calculated from
Eq. (12.1).
Let d be the centre-to-centre distance between the -particle and
the gold nucleus when the -particle is at its stopping point. Then
we can write the conservation of energy Ei = Ef as
mv 2
r
1
4
e2
r2
(12.2)
419
Physics
Thus the relation between the orbit radius and the electron
velocity is
e2
4 0mv 2
(12.3)
1
mv 2
2
e2
8
and U
e2
4
e2
8
e2
r
e2
8
(12.4)
The total energy of the electron is negative. This implies the fact that
the electron is bound to the nucleus. If E were positive, an electron will
not follow a closed orbit around the nucleus.
Example 12.3 It is found experimentally that 13.6 eV energy is
required to separate a hydrogen atom into a proton and an electron.
Compute the orbital radius and the velocity of the electron in a
hydrogen atom.
Solution Total energy of the electron in hydrogen atom is 13.6 eV =
13.6 1.6 1019 J = 2.2 1018 J. Thus from Eq. (12.4), we have
e2
8
2.2 10
0r
18
EXAMPLE 12.3
e2
8
19
C)2
= 5.3 1011 m.
The velocity of the revolving electron can be computed from Eq. (12.3)
with m = 9.1 1031 kg,
e
4
0mr
420
Atoms
consists of bright lines on a
dark
background.
The
spectrum emitted by atomic
hydrogen is shown in
Fig. 12.5. Study of emission
line spectra of a material can
therefore serve as a type of
fingerprint for identification
of the gas. When white light
passes through a gas and we
analyse the transmitted light
using a spectrometer we find
some dark lines in the
FIGURE 12.5 Emission lines in the spectrum of hydrogen.
spectrum. These dark lines
correspond precisely to those wavelengths which were found in the
emission line spectrum of the gas. This is called the absorption spectrum
of the material of the gas.
1
1
(12.5)
2
2 n2
where is the wavelength, R is a constant called the Rydberg constant,
and n may have integral values 3, 4, 5, etc. The value of R is 1.097 107 m1.
This equation is also called Balmer formula.
Taking n = 3 in Eq. (12.5), one obtains the wavelength of the H line:
1
1.097
107
1
22
= 1.522 106 m1
i.e., = 656.3 nm
1
32
m 1
421
Physics
For n = 4, one obtains the wavelength of H line, etc. For n = , one obtains
the limit of the series, at = 364.6 nm. This is the shortest wavelength in
the Balmer series. Beyond this limit, no further distinct lines appear,
instead only a faint continuous spectrum is seen.
Other series of spectra for hydrogen were subsequently discovered.
These are known, after their discoverers, as Lyman, Paschen, Brackett,
and Pfund series. These are represented by the formulae:
Lyman series:
1
1 1
12 n 2
Paschen series:
1
1
1
2
3 n2
Brackett series:
1
1
1
42 n 2
Pfund series:
1
n = 2,3,4...
(12.6)
n = 4,5,6...
(12.7)
n = 5,6,7...
(12.8)
1
1
n = 6,7,8...
(12.9)
2
5 n2
The Lyman series is in the ultraviolet, and the Paschen and Brackett
series are in the infrared region.
The Balmer formula Eq. (12.5) may be written in terms of frequency
of the light, recalling that
c =
or
c
Thus, Eq. (12.5) becomes
1
1
(12.10)
2
2 n2
There are only a few elements (hydrogen, singly ionised helium, and
doubly ionised lithium) whose spectra can be represented by simple
formula like Eqs. (12.5) (12.9).
Equations (12.5) (12.9) are useful as they give the wavelengths that
hydrogen atoms radiate or absorb. However, these results are empirical
and do not give any reasoning why only certain frequencies are observed
in the hydrogen spectrum.
= Rc
422
OF THE
HYDROGEN ATOM
The model of the atom proposed by Rutherford assumes that the atom,
consisting of a central nucleus and revolving electron is stable much like
sun-planet system which the model imitates. However, there are some
fundamental differences between the two situations. While the planetary
system is held by gravitational force, the nucleus-electron system being
charged objects, interact by Coulombs Law of force. We know that an
Atoms
2.2 106 m s
2
5.3 10
11
6.6 10 Hz.
According to the classical electromagnetic theory we know that the
frequency of the electromagnetic waves emitted by the revolving
electrons is equal to the frequency of its revolution around the nucleus.
Thus the initial frequency of the light emitted is 6.6 1015 Hz.
15
EXAMPLE 12.4
v
2 r
423
Physics
It was Niels Bohr (1885 1962) who made certain modifications in
this model by adding the ideas of the newly developing quantum
hypothesis. Niels Bohr studied in Rutherfords laboratory for several
months in 1912 and he was convinced about the validity of Rutherford
nuclear model. Faced with the dilemma as discussed above, Bohr, in
1913, concluded that in spite of the success of electromagnetic theory in
explaining large-scale phenomena, it could not be applied to the processes
at the atomic scale. It became clear that a fairly radical departure from
the established principles of classical mechanics and electromagnetism
would be needed to understand the structure of atoms and the relation
of atomic structure to atomic spectra. Bohr combined classical and early
quantum concepts and gave his theory in the form of three postulates.
These are :
(i) Bohrs first postulate was that an electron in an atom could revolve
in certain stable orbits without the emission of radiant energy,
contrary to the predictions of electromagnetic theory. According to
this postulate, each atom has certain definite stable states in which it
can exist, and each possible state has definite total energy. These are
called the stationary states of the atom.
(ii) Bohrs second postulate defines these stable orbits. This postulate
states that the electron revolves around the nucleus only in those
orbits for which the angular momentum is some integral multiple of
h/2 where h is the Plancks constant (= 6.6 1034 J s). Thus the
angular momentum (L) of the orbiting electron is quantised. That is
L = nh/2
(12.11)
(iii) Bohrs third postulate incorporated into atomic theory the early
quantum concepts that had been developed by Planck and Einstein.
It states that an electron might make a transition from one of its
specified non-radiating orbits to another of lower energy. When it
does so, a photon is emitted having energy equal to the energy
difference between the initial and final states. The frequency of the
emitted photon is then given by
(12.12)
h = Ei Ef
where Ei and Ef are the energies of the initial and final states and Ei > Ef .
For a hydrogen atom, Eq. (12.4) gives the expression to determine
the energies of different energy states. But then this equation requires
the radius r of the electron orbit. To calculate r, Bohrs second postulate
about the angular momentum of the electronthe quantisation
condition is used. The angular momentum L is given by
L = mvr
Bohrs second postulate of quantisation [Eq. (12.11)] says that the
allowed values of angular momentum are integral multiples of h/2.
Ln = mvnrn =
424
nh
2
(12.13)
Atoms
1, 2, 3 ..., according to the values of n, which is called the principal
quantum number of the orbit.
From Eq. (12.3), the relation between vn and rn is
e
vn
4 0mrn
Combining it with Eq. (12.13), we get the following expressions for vn
and rn,
vn
1 e2
1
n4 0 h 2
(12.14)
and
rn
n2
m
h
2
4 0
e2
(12.15)
Eq. (12.14) depicts that the orbital speed in the nth orbit falls by a factor
of n. Using Eq. (12.15), the size of the innermost orbit (n = 1) can be
obtained as
r1
h2 0
me 2
This is called the Bohr radius, represented by the symbol a0. Thus,
h2 0
(12.16)
me 2
Substitution of values of h, m, 0 and e gives a 0 = 5.29 1011 m. From
Eq. (12.15), it can also be seen that the radii of the orbits increase as n2.
The total energy of the electron in the stationary states of the hydrogen
atom can be obtained by substituting the value of orbital radius in Eq.
(12.4) as
a0
En
or En
e2
8 0
m
n2
2
h
e2
4 0
me 4
8n 2 02h 2
(12.17)
13.6
eV
(12.19)
n2
The negative sign of the total energy of an electron moving in an orbit
means that the electron is bound with the nucleus. Energy will thus be
required to remove the electron from the hydrogen atom to a distance
infinitely far away from its nucleus (or proton in hydrogen atom).
En
425
Physics
The derivation of Eqs. (12.17) (12.19) involves the assumption that
the electronic orbits are circular, though orbits under inverse square
force are, in general elliptical. (Planets move in elliptical orbits under the
inverse square gravitational force of the sun.) However, it was shown by
the German physicist Arnold Sommerfeld (1868 1951) that, when the
restriction of circular orbit is relaxed, these equations continue to hold
even for elliptic orbits.
ORBIT
426
EXAMPLE 12.5
We are introduced to the Bohr Model of atom one time or the other in the course of
physics. This model has its place in the history of quantum mechanics and particularly
in explaining the structure of an atom. It has become a milestone since Bohr introduced
the revolutionary idea of definite energy orbits for the electrons, contrary to the classical
picture requiring an accelerating particle to radiate. Bohr also introduced the idea of
quantisation of angular momentum of electrons moving in definite orbits. Thus it was a
semi-classical picture of the structure of atom.
Now with the development of quantum mechanics, we have a better understanding
of the structure of atom. Solutions of the Schrdinger wave equation assign a wave-like
description to the electrons bound in an atom due to attractive forces of the protons.
An orbit of the electron in the Bohr model is the circular path of motion of an electron
around the nucleus. But according to quantum mechanics, we cannot associate a definite
path with the motion of the electrons in an atom. We can only talk about the probability
of finding an electron in a certain region of space around the nucleus. This probability
can be inferred from the one-electron wave function called the orbital. This function
depends only on the coordinates of the electron.
It is therefore essential that we understand the subtle differences that exist in the two
models:
Bohr model is valid for only one-electron atoms/ions; an energy value, assigned to
each orbit, depends on the principal quantum number n in this model. We know
that energy associated with a stationary state of an electron depends on n only, for
one-electron atoms/ions. For a multi-electron atom/ion, this is not true.
The solution of the Schrdinger wave equation, obtained for hydrogen-like atoms/
ions, called the wave function, gives information about the probability of finding an
electron in various regions around the nucleus. This orbital has no resemblance
whatsoever with the orbit defined for an electron in the Bohr model.
Atoms
EXAMPLE 12.5
* An electron can have any total energy above E = 0 eV. In such situations the
electron is free. Thus there is a continuum of energy states above E = 0 eV, as
shown in Fig. 12.8.
427
Physics
Fig. 12.8. The principal quantum number n labels the stationary
states in the ascending order of energy. In this diagram, the highest
energy state corresponds to n = in Eq, (12.19) and has an energy
of 0 eV. This is the energy of the atom when the electron is
completely removed (r = ) from the nucleus and is at rest. Observe how
the energies of the excited states come closer and closer together as n
increases.
FRANCK HERTZ
EXPERIMENT
The existence of discrete energy levels in an atom was directly verified in 1914 by James
Franck and Gustav Hertz. They studied the spectrum of mercury vapour when electrons
having different kinetic energies passed through the vapour. The electron energy was
varied by subjecting the electrons to electric fields of varying strength. The electrons
collide with the mercury atoms and can transfer energy to the mercury atoms. This can
only happen when the energy of the electron is higher than the energy difference between
an energy level of Hg occupied by an electron and a higher unoccupied level (see Figure).
For instance, the difference between an occupied energy level of Hg and a higher
unoccupied level is 4.9 eV. If an electron of having an energy of 4.9 eV or more passes
through mercury, an electron in mercury atom can absorb energy from the bombarding
electron and get excited to the higher level [Fig (a)]. The colliding electrons kinetic energy
would reduce by this amount.
The excited electron would subsequently fall back to the ground state by emission of
radiation [Fig. (b)]. The wavelength of emitted radiation is:
hc
E
6.625 10 34 3 108
= 253 nm
4.9 1.6 10 19
By direct measurement, Franck and Hertz found that the emission spectrum of
mercury has a line corresponding to this wavelength. For this experimental verification
of Bohrs basic ideas of discrete energy levels in atoms and the process of photon emission,
Frank and Hertz were awarded the Nobel prize in 1925.
428
OF THE
HYDROGEN ATOM
Atoms
hif = Eni Enf
(12.20)
me 4
1
8 02h 2 n f2
or if =
me 4
1
2 3
8 0h n f2
1
n i2
1
n i2
(12.21)
(12.22)
me 4
8 2h 3c
(12.23)
429
Physics
the same energy needed by the electron in a lower energy state to make
transitions to a higher energy state, the process is called absorption.
Thus if photons with a continuous range of frequencies pass through a
rarefied gas and then are analysed with a spectrometer, a series of dark
spectral absorption lines appear in the continuous spectrum. The dark
lines indicate the frequencies that have been absorbed by the atoms of
the gas.
The explanation of the hydrogen atom spectrum provided by Bohrs
model was a brilliant achievement, which greatly stimulated progress
towards the modern quantum theory. In 1922, Bohr was awarded Nobel
Prize in Physics.
Example 12.6 Using the Rydberg formula, calculate the wavelengths
of the first four spectral lines in the Lyman series of the hydrogen
spectrum.
Solution The Rydberg formula is
hc/if =
me 4
1
8 2h 2 n f2
1
n i2
The wavelengths of the first four lines in the Lyman series correspond
to transitions from ni = 2,3,4,5 to nf = 1. We know that
me 4
= 13.6 eV = 21.76 1019 J
8 2h 2
Therefore,
hc
i1
EXAMPLE 12.6
21.76
10
19
1
1
1
n i2
6.625 10 34 3 108 n i2
m =
21.76 10 19 (n i2 1)
0.9134 n i2
10
(n i2 1)
= 913.4 ni2/(ni2 1)
Substituting ni = 2,3,4,5, we get 21 = 1218 , 31 = 1028 , 41 = 974.3 ,
and 51 = 951.4 .
430
Of all the postulates, Bohr made in his model of the atom, perhaps the
most puzzling is his second postulate. It states that the angular
momentum of the electron orbiting around the nucleus is quantised (that
is, Ln = nh/2; n = 1, 2, 3 ). Why should the angular momentum have
only those values that are integral multiples of h/2 ? The French physicist
Louis de Broglie explained this puzzle in 1923, ten years after Bohr
proposed his model.
We studied, in Chapter 11, about the de Broglies hypothesis that
material particles, such as electrons, also have a wave nature. C. J. Davisson
and L. H. Germer later experimentally verified the wave nature of electrons
Atoms
in 1927. Louis de Broglie argued that the electron in its
circular orbit, as proposed by Bohr, must be seen as a particle
wave. In analogy to waves travelling on a string, particle waves
too can lead to standing waves under resonant conditions.
From Chapter 15 of Class XI Physics textbook, we know that
when a string is plucked, a vast number of wavelengths are
excited. However only those wavelengths survive which have
nodes at the ends and form the standing wave in the string. It
means that in a string, standing waves are formed when the
total distance travelled by a wave down the string and back is
one wavelength, two wavelengths, or any integral number of
wavelengths. Waves with other wavelengths interfere with
themselves upon reflection and their amplitudes quickly drop
to zero. For an electron moving in nth circular orbit of radius
rn, the total distance is the circumference of the orbit, 2rn.
Thus
n = 1, 2, 3...
(12.24)
2 rn = n,
Figure 12.10 illustrates a standing particle wave on a
circular orbit for n = 4, i.e., 2rn = 4, where is the de Broglie
wavelength of the electron moving in nth orbit. From Chapter
11, we have = h/p, where p is the magnitude of the electrons
momentum. If the speed of the electron is much less than the speed of
light, the momentum is mvn. Thus, = h/mvn. From Eq. (12.24), we have
2 rn = n h/mvn
or m vn rn = nh/2
This is the quantum condition proposed by Bohr for the angular
momentum of the electron [Eq. (12.13)]. In Section 12.5, we saw that
this equation is the basis of explaining the discrete orbits and energy
levels in hydrogen atom. Thus de Broglie hypothesis provided an
explanation for Bohrs second postulate for the quantisation of angular
momentum of the orbiting electron. The quantised electron orbits and
energy states are due to the wave nature of the electron and only resonant
standing waves can persist.
Bohrs model, involving classical trajectory picture (planet-like electron
orbiting the nucleus), correctly predicts the gross features of the
hydrogenic atoms*, in particular, the frequencies of the radiation emitted
or selectively absorbed. This model however has many limitations.
Some are:
(i) The Bohr model is applicable to hydrogenic atoms. It cannot be
extended even to mere two electron atoms such as helium. The analysis
of atoms with more than one electron was attempted on the lines of
Bohrs model for hydrogenic atoms but did not meet with any success.
Difficulty lies in the fact that each electron interacts not only with the
positively charged nucleus but also with all other electrons.
* Hydrogenic atoms are the atoms consisting of a nucleus with positive charge
+Ze and a single electron, where Z is the proton number. Examples are hydrogen
atom, singly ionised helium, doubly ionised lithium, and so forth. In these
atoms more complex electron-electron interactions are nonexistent.
431
Physics
The formulation of Bohr model involves electrical force between
positively charged nucleus and electron. It does not include the
electrical forces between electrons which necessarily appear in
multi-electron atoms.
(ii) While the Bohrs model correctly predicts the frequencies of the light
emitted by hydrogenic atoms, the model is unable to explain the
relative intensities of the frequencies in the spectrum. In emission
spectrum of hydrogen, some of the visible frequencies have weak
intensity, others strong. Why? Experimental observations depict that
some transitions are more favoured than others. Bohrs model is
unable to account for the intensity variations.
Bohrs model presents an elegant picture of an atom and cannot be
generalised to complex atoms. For complex atoms we have to use a new
and radical theory based on Quantum Mechanics, which provides a more
complete picture of the atomic structure.
LASER
LIGHT
Imagine a crowded market place or a railway platform with people entering a gate and
going towards all directions. Their footsteps are random and there is no phase correlation
between them. On the other hand, think of a large number of soldiers in a regulated march.
Their footsteps are very well correlated. See figure here.
This is similar to the difference between light emitted by
an ordinary source like a candle or a bulb and that emitted
by a laser. The acronym LASER stands for Light Amplification
by Stimulated Emission of Radiation. Since its development
in 1960, it has entered into all areas of science and technology.
It has found applications in physics, chemistry, biology,
medicine, surgery, engineering, etc. There are low power
lasers, with a power of 0.5 mW, called pencil lasers, which
serve as pointers. There are also lasers of different power,
suitable for delicate surgery of eye or glands in the stomach.
Finally, there are lasers which can cut or weld steel.
Light is emitted from a source in the form of packets of
waves. Light coming out from an ordinary source contains a mixture of many wavelengths.
There is also no phase relation between the various waves. Therefore, such light, even if it is
passed through an aperture, spreads very fast and the beam size increases rapidly with
distance. In the case of laser light, the wavelength of each packet is almost the same. Also
the average length of the packet of waves is much larger. This means that there is better
phase correlation over a longer duration of time. This results in reducing the divergence of
a laser beam substantially.
If there are N atoms in a source, each emitting light with intensity I, then the total
intensity produced by an ordinary source is proportional to NI, whereas in a laser source,
it is proportional to N2I. Considering that N is very large, we see that the light from a laser
can be much stronger than that from an ordinary source.
When astronauts of the Apollo missions visited the moon, they placed a mirror on its
surface, facing the earth. Then scientists on the earth sent a strong laser beam, which was
reflected by the mirror on the moon and received back on the earth. The size of the reflected
laser beam and the time taken for the round trip were measured. This allowed a very
accurate determination of (a) the extremely small divergence of a laser beam and (b) the
distance of the moon from the earth.
432
Atoms
SUMMARY
1.
2.
3.
4.
5.
6.
Rc
Balmer series:
Rc
1
n2
Paschen series:
Rc
Brackett series:
Rc
Pfund series:
7.
1
12
Rc
; n = 2, 3, 4,...
22
n2
32
n2
42
n2
52
n2
; n = 3, 4, 5,...
; n = 4, 5, 6,...
; n = 5, 6, 7,...
; n = 6, 7, 8,...
433
Physics
8.
rn
n2
m
h
2
4 0
e2
En
me 4
8n 2 02h 2
= 13.6 eV/n2
The n = 1 state is called ground state. In hydrogen atom the ground
state energy is 13.6 eV. Higher values of n correspond to excited
states (n > 1). Atoms are excited to these higher states by collisions
with other atoms or electrons or by absorption of a photon of right
frequency.
9. de Broglies hypothesis that electrons have a wavelength = h/mv gave
an explanation for Bohrs quantised orbits by bringing in the waveparticle duality. The orbits correspond to circular standing waves in
which the circumference of the orbit equals a whole number of
wavelengths.
10. Bohrs model is applicable only to hydrogenic (single electron) atoms.
It cannot be extended to even two electron atoms such as helium.
This model is also unable to explain for the relative intensities of the
frequencies emitted even by hydrogenic atoms.
POINTS TO PONDER
1.
2.
3.
4.
5.
434
Atoms
6.
7.
EXERCISES
12.1
12.2
Choose the correct alternative from the clues given at the end of
the each statement:
(a) The size of the atom in Thomsons model is .......... the atomic
size in Rutherfords model. (much greater than/no different
from/much less than.)
(b) In the ground state of .......... electrons are in stable equilibrium,
while in .......... electrons always experience a net force.
(Thomsons model/ Rutherfords model.)
(c) A classical atom based on .......... is doomed to collapse.
(Thomsons model/ Rutherfords model.)
(d) An atom has a nearly continuous mass distribution in a ..........
but has a highly non-uniform mass distribution in ..........
(Thomsons model/ Rutherfords model.)
(e) The positively charged part of the atom possesses most of the
mass in .......... (Rutherfords model/both the models.)
Suppose you are given a chance to repeat the alpha-particle
scattering experiment using a thin sheet of solid hydrogen in place
of the gold foil. (Hydrogen is a solid at temperatures below 14 K.)
What results do you expect?
435
Physics
12.3
ADDITIONAL EXERCISES
436
12.11 Answer the following questions, which help you understand the
difference between Thomsons model and Rutherfords model better.
(a) Is the average angle of deflection of -particles by a thin gold foil
predicted by Thomsons model much less, about the same, or
much greater than that predicted by Rutherfords model?
(b) Is the probability of backward scattering (i.e., scattering of
-particles at angles greater than 90) predicted by Thomsons
model much less, about the same, or much greater than that
predicted by Rutherfords model?
(c) Keeping other factors fixed, it is found experimentally that for
small thickness t, the number of -particles scattered at
moderate angles is proportional to t. What clue does this linear
dependence on t provide?
(d) In which model is it completely wrong to ignore multiple
scattering for the calculation of average angle of scattering of
-particles by a thin foil?
12.12 The gravitational attraction between electron and proton in a
hydrogen atom is weaker than the coulomb attraction by a factor of
about 1040. An alternative way of looking at this fact is to estimate
the radius of the first Bohr orbit of a hydrogen atom if the electron
and proton were bound by gravitational attraction. You will find the
answer interesting.
12.13 Obtain an expression for the frequency of radiation emitted when a
hydrogen atom de-excites from level n to level (n1). For large n,
show that this frequency equals the classical frequency of revolution
of the electron in the orbit.
Atoms
12.14 Classically, an electron can be in any orbit around the nucleus of
an atom. Then what determines the typical atomic size? Why is an
atom not, say, thousand times bigger than its typical size? The
question had greatly puzzled Bohr before he arrived at his famous
model of the atom that you have learnt in the text. To simulate what
he might well have done before his discovery, let us play as follows
with the basic constants of nature and see if we can get a quantity
with the dimensions of length that is roughly equal to the known
size of an atom (~ 1010m).
(a) Construct a quantity with the dimensions of length from the
fundamental constants e, me, and c. Determine its numerical
value.
(b) You will find that the length obtained in (a) is many orders of
magnitude smaller than the atomic dimensions. Further, it
involves c. But energies of atoms are mostly in non-relativistic
domain where c is not expected to play any role. This is what
may have suggested Bohr to discard c and look for something
else to get the right atomic size. Now, the Plancks constant h
had already made its appearance elsewhere. Bohrs great insight
lay in recognising that h, me, and e will yield the right atomic
size. Construct a quantity with the dimension of length from h,
me, and e and confirm that its numerical value has indeed the
correct order of magnitude.
12.15 The total energy of an electron in the first excited state of the
hydrogen atom is about 3.4 eV.
(a) What is the kinetic energy of the electron in this state?
(b) What is the potential energy of the electron in this state?
(c) Which of the answers above would change if the choice of the
zero of potential energy is changed?
12.16 If Bohrs quantisation postulate (angular momentum = nh/2) is a
basic law of nature, it should be equally valid for the case of planetary
motion also. Why then do we never speak of quantisation of orbits
of planets around the sun?
12.17 Obtain the first Bohrs radius and the ground state energy of a
muonic hydrogen atom [i.e., an atom in which a negatively charged
muon () of mass about 207me orbits around a proton].
437
Physics
Chapter Thirteen
tt
o N
be C
E
re R
pu T
bl
is
he
d
NUCLEI
no
13.1 I NTRODUCTION
438
In the previous chapter, we have learnt that in every atom, the positive
charge and mass are densely concentrated at the centre of the atom
forming its nucleus. The overall dimensions of a nucleus are much smaller
than those of an atom. Experiments on scattering of -particles
demonstrated that the radius of a nucleus was smaller than the radius
of an atom by a factor of about 104 . This means the volume of a nucleus
is about 10 12 times the volume of the atom. In other words, an atom is
almost empty. If an atom is enlarged to the size of a classroom, the nucleus
would be of the size of pinhead. Nevertheless, the nucleus contains most
(more than 99.9%) of the mass of an atom.
Does the nucleus have a structure, just as the atom does? If so, what
are the constituents of the nucleus? How are these held together? In this
chapter, we shall look for answers to such questions. We shall discuss
various properties of nuclei such as their size, mass and stability, and
also associated nuclear phenomena such as radioactivity, fission and fusion.
AND
COMPOSITION OF NUCLEUS
Nuclei
different mass unit is used for expressing atomic masses. This unit is the
atomic mass unit (u), defined as 1/12th of the mass of the carbon (12C)
atom. According to this definition
mass of one
12
12
C atom
tt
o N
be C
E
re R
pu T
bl
is
he
d
1u =
1.992647 10 26 kg
12
= 1.660539 10 27 kg
(13.1)
m p = 1.00727 u = 1.67262 10 27 kg
(13.2)
no
439
Physics
atomic electrons is thus (Ze), and since the atom is neutral, the charge
of the nucleus is (+Ze). The number of protons in the nucleus of the atom
is, therefore, exactly Z, the atomic number.
Discovery of Neutron
tt
o N
be C
E
re R
pu T
bl
is
he
d
Since the nuclei of deuterium and tritium are isotopes of hydrogen, they
must contain only one proton each. But the masses of the nuclei of
hydrogen, deuterium and tritium are in the ratio of 1:2:3. Therefore, the
nuclei of deuterium and tritium must contain, in addition to a proton,
some neutral matter. The amount of neutral matter present in the nuclei
of these isotopes, expressed in units of mass of a proton, is approximately
equal to one and two, respectively. This fact indicates that the nuclei of
atoms contain, in addition to protons, neutral matter in multiples of a
basic unit. This hypothesis was verified in 1932 by James Chadwick
who observed emission of neutral radiation when beryllium nuclei were
bombarded with alpha-particles. ( -particles are helium nuclei, to be
discussed in a later section). It was found that this neutral radiation
could knock out protons from light nuclei such as those of helium, carbon
and nitrogen. The only neutral radiation known at that time was photons
(electromagnetic radiation). Application of the principles of conservation
of energy and momentum showed that if the neutral radiation consisted
of photons, the energy of photons would have to be much higher than is
available from the bombardment of beryllium nuclei with -particles.
The clue to this puzzle, which Chadwick satisfactorily solved, was to
assume that the neutral radiation consists of a new type of neutral
particles called neutrons. From conservation of energy and momentum,
he was able to determine the mass of new particle as very nearly the
same as mass of proton.
The mass of a neutron is now known to a high degree of accuracy. It is
m n = 1.00866 u = 1.67491027 kg
(13.3)
no
Chadwick was awarded the 1935 Nobel Prize in Physics for his
discovery of the neutron.
A free neutron, unlike a free proton, is unstable. It decays into a
proton, an electron and a antineutrino (another elementary particle), and
has a mean life of about 1000s. It is, however, stable inside the nucleus.
The composition of a nucleus can now be described using the following
terms and symbols:
440
[13.4(a)]
[13.4(b)]
A - mass number = Z + N
= total number of protons and neutrons [13.4(c)]
One also uses the term nucleon for a proton or a neutron. Thus the
number of nucleons in an atom is its mass number A.
Nuclear species or nuclides are shown by the notation AZ X where X is
the chemical symbol of the species. For example, the nucleus of gold is
denoted by 197
. It contains 197 nucleons, of which 79 are protons
79 Au
and the rest118 are neutrons.
Nuclei
tt
o N
be C
E
re R
pu T
bl
is
he
d
13.3 SIZE
OF THE
N UCLEUS
no
R = R 0 A 1/3
(13.5)
15
441
Physics
Solution
mFe = 55.85,
26
u = 9.27 10
kg
mass
9.27 10 26
1
=
15 3
volume
(4 / 3)(1.2 10 ) 56
= 2.29 1017 kg m3
The density of matter in neutron stars (an astrophysical object) is
comparable to this density. This shows that matter in these objects
has been compressed to such an extent that they resemble a big nucleus.
tt
o N
be C
E
re R
pu T
bl
is
he
d
EXAMPLE 13.1
Example 13.1 Given the mass of iron nucleus as 55.85u and A=56,
find the nuclear density?
Nuclear density =
13.4 MASS-ENERGY
AND
(13.6)
no
EXAMPLE 13.2
442
Solution
Energy, E = 10
8 2
( 3 10 ) J
E = 103 9 1016 = 9 10 13 J
Nuclei
the nuclear mass M is found to be always less than this. For example, let
us consider 168 O ; a nucleus which has 8 neutrons and 8 protons. We
have
Mass of 8 neutrons = 8 1.00866 u
tt
o N
be C
E
re R
pu T
bl
is
he
d
(13.7)
(13.8)
Example 13.3 Find the energy equivalent of one atomic mass unit,
first in Joules and then in MeV. Using this, express the mass defect
2
of 168 O in MeV/c .
Solution
1u = 1.6605 1027 kg
2
To convert it into energy units, we multiply it by c and find that
27
8 2
energy equivalent = 1.6605 10
(2.9979 10 ) kg m2/s2
10
= 1.4924 10
J
=
1.4924 1010
eV
19
1.602 10
no
16
8
127.5 MeV/c .
EXAMPLE 13.3
= 0.9315 10 9 eV
= 931.5 MeV
or, 1u = 931.5 MeV/ c 2
For 186 O , M = 0.13691 u = 0.13691931.5 MeV/c 2
2
= 127.5 MeV/c
443
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
in the process. The energy Eb is called the binding energy of the nucleus.
If we separate a nucleus into its nucleons, we would have to supply a
total energy equal to Eb, to those particles. Although we cannot tear
apart a nucleus in this way, the nuclear binding energy is still a convenient
measure of how well a nucleus is held together. A more useful measure
of the binding between the constituents of the nucleus is the binding
energy per nucleon, Ebn, which is the ratio of the binding energy Eb of a
nucleus to the number of the nucleons, A, in that nucleus:
no
E bn = E b / A
(13.9)
We can think of binding energy per nucleon as the average energy
per nucleon needed to separate a nucleus into its individual nucleons.
Figure 13.1 is a plot of the
binding energy per nucleon Ebn
versus the mass number A for a
large number of nuclei. We notice
the following main features of
the plot:
(i) the binding energy per
nucleon, Ebn , is practically
constant, i.e. practically
independent of the atomic
number for nuclei of middle
mass number ( 30 < A < 170).
The curve has a maximum of
about 8.75 MeV for A = 56
and has a value of 7.6 MeV
FIGURE 13.1 The binding energy per nucleon
for A = 238.
as a function of mass number.
(ii) E bn is lower for both light
nuclei ( A<30) and heavy
nuclei (A>170).
We can draw some conclusions from these two observations:
(i) The force is attractive and sufficiently strong to produce a binding
energy of a few MeV per nucleon.
(ii) The constancy of the binding energy in the range 30 < A < 170 is a
consequence of the fact that the nuclear force is short-ranged. Consider
a particular nucleon inside a sufficiently large nucleus. It will be under
the influence of only some of its neighbours, which come within the
range of the nuclear force. If any other nucleon is at a distance more
than the range of the nuclear force from the particular nucleon it will
have no influence on the binding energy of the nucleon under
consideration. If a nucleon can have a maximum of p neighbours
within the range of nuclear force, its binding energy would be
proportional to p. Let the binding energy of the nucleus be pk, where
k is a constant having the dimensions of energy. If we increase A by
adding nucleons they will not change the binding energy of a nucleon
inside. Since most of the nucleons in a large nucleus reside inside it
and not on the surface, the change in binding energy per nucleon
would be small. The binding energy per nucleon is a constant and is
444
approximately equal to pk. The property that a given nucleon
Nuclei
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
The force that determines the motion of atomic electrons is the familiar
Coulomb force. In Section 13.4, we have seen that for average mass
nuclei the binding energy per nucleon is approximately 8 MeV, which is
much larger than the binding energy in atoms. Therefore, to bind a
nucleus together there must be a strong attractive force of a totally
different kind. It must be strong enough to overcome the repulsion
between the (positively charged) protons and to bind both protons and
neutrons into the tiny nuclear volume. We have already seen
that the constancy of binding energy per nucleon can be
understood in terms of its short-range. Many features of the
nuclear binding force are summarised below. These are
obtained from a variety of experiments carried out during 1930
to 1950.
(i) The nuclear force is much stronger than the Coulomb force
acting between charges or the gravitational forces between
masses. The nuclear binding force has to dominate over
the Coulomb repulsive force between protons inside the
nucleus. This happens only because the nuclear force is
much stronger than the coulomb force. The gravitational
force is much weaker than even Coulomb force.
(ii) The nuclear force between two nucleons falls rapidly to FIGURE 13.2 Potential energy
of a pair of nucleons as a
zero as their distance is more than a few femtometres. This
function of their separation.
leads to saturation of forces in a medium or a large-sized
For a separation greater
nucleus, which is the reason for the constancy of the
than r0, the force is attractive
binding energy per nucleon.
and for separations less
than r0, the force is
A rough plot of the potential energy between two nucleons
strongly repulsive.
as a function of distance is shown in the Fig. 13.2. The
potential energy is a minimum at a distance r0 of about
0.8 fm. This means that the force is attractive for distances larger
than 0.8 fm and repulsive if they are separated by distances less
445
than 0.8 fm.
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
13.6 R ADIOACTIVITY
(ii) -decay in which electrons or positrons (particles with the same mass
as electrons, but with a charge exactly opposite to that of electron)
are emitted;
(iii) -decay in which high energy (hundreds of keV or more) photons are
emitted.
Each of these decay will be considered in subsequent sub-sections.
no
N
N
t
446
or, N/ t = N,
(13.10)
where is called the radioactive decay constant or disintegration constant.
The change in the number of nuclei in the sample* is dN = N in
time t. Thus the rate of change of N is (in the limit t 0)
dN
= N
dt
* N is the number of nuclei that decay, and hence is always positive. dN is the
change in N, which may have either sign. Here it is negative, because out of
original N nuclei, N have decayed, leaving (NN) nuclei.
Nuclei
dN
= dt
N
Now, integrating both sides of the above equation,we get,
or,
t
dN
= dt
N
N0
t0
tt
o N
be C
E
re R
pu T
bl
is
he
d
(13.11)
or, ln N ln N 0 = (t t0 )
(13.12)
N
= t
N0
(13.13)
which gives
N(t) = N0 e t
(13.14)
Note, for example, the light bulbs follow no such exponential decay law.
If we test 1000 bulbs for their life (time span before they burn out or
fuse), we expect that they will decay (that is, burn out) at more or less
the same time. The decay of radionuclides follows quite a different law,
the law of radioactive decay represented by Eq. (13.14).
The total decay rate R of a sample is the number of nuclei
disintegrating per unit time. Suppose in a time interval dt, the decay
count measured is N. Then dN = N.
The positive quantity R is then defined as
R=
dN
dt
or, R = R 0 e t
(13.15)
no
(13.16)
447
Physics
1 becquerel is simply equal to 1 disintegration or decay per second.
There is also another unit named curie that is widely used and is related
to the SI unit as:
1 curie = 1 Ci = 3.7 1010 decays per second
tt
o N
be C
E
re R
pu T
bl
is
he
d
= 3.7 1010 Bq
Different radionuclides differ greatly in their rate of
decay. A common way to characterize this feature is
through the notion of half-life. Half-life of a radionuclide
(denoted by T1/2 ) is the time it takes for a sample that has
initially, say N 0 radionuclei to reduce to N0 /2. Putting
N = N0/2 and t = T 1/2 in Eq. (13.14), we get
ln 2
T1/2 =
0.693
(13.17)
N 0 te t d t
= te t d t
N0
0
One can show by performing this integral that
= 1/
448
ln 2
= ln 2
(13.18)
EXAMPLE 13.4
no
T1/2 =
Nuclei
One k mol of any isotope contains Avogadros number of atoms, and
so 1g of 238
contains
92 U
EXAMPLE 13.4
tt
o N
be C
E
re R
pu T
bl
is
he
d
1
kmol 6.025 1026 atoms/kmol
238 103
20
= 25.3 10 atoms.
The decay rate R is
R = N
0.693
0.693 25.3 1020 1
N =
s
T1/ 2
1.42 1017
= 1.23 104 s1
4
= 1.23 10 Bq
Solution
By definition of half-life, half of the initial sample will remain
undecayed after 12.5 y. In the next 12.5 y, one-half of these nuclei
would have decayed. Hence, one fourth of the sample of the initial
pure tritium will remain undecayed.
EXAMPLE 13.5
234
90
Th + 42 He
(-decay)
238
92
U to
(13.19)
A 4
Z 2
Y + 42 He
(13.20)
no
(13.21)
Q is also the net kinetic energy gained in the process or, if the initial
nucleus X is at rest, the kinetic energy of the products. Clearly, Q> 0 for
exothermic processes such as -decay.
449
Physics
Example 13.6 We are given the following atomic masses:
23 8
4
= 238.05079 u
= 4.00260 u
92 U
2 He
23 4
1
= 234.04363 u 1 H = 1.00783 u
9 0 Th
237
91
tt
o N
be C
E
re R
pu T
bl
is
he
d
Pa = 237.05121 u
Here the symbol Pa is for the element protactinium (Z = 91).
(a) Calculate the energy released during the alpha decay of 238
U.
92
(b) Show that 238
U
can
not
spontaneously
emit
a
proton.
92
Solution
(a) The alpha decay of 238
is given by Eq. (13.20). The energy released
92 U
in this process is given by
Q = (M U M Th MHe) c 2
Substituting the atomic masses as given in the data, we find
Q = (238.05079 234.04363 4.00260)u c 2
= (0.00456 u) c 2
= (0.00456 u) (931.5 MeV/u)
= 4.25 MeV.
(b) If 238
U spontaneously emits a proton, the decay process would be
92
23 8
92
23 7
91
1
Pa + 1 H
EXAMPLE 13.6
32
P 16
S + e +
(13.22)
no
22
11
450
22
Na 10
Ne + e + +
(13.23)
The decays are governed by the Eqs. (13.14) and (13.15), so that one
can never predict which nucleus will undergo decay, but one can
characterize the decay by a half-life T1/2 . For example, T1/2 for the decays
above is respectively 14.3 d and 2.6y. The emission of electron in decay
is accompanied by the emission of an antineutrino ( ); in + decay, instead,
a neutrino () is generated. Neutrinos are neutral particles with very small
(possiblly, even zero) mass compared to electrons. They have only weak
interaction with other particles. They are, therefore, very difficult to detect,
since they can penetrate large quantity of matter (even earth) without any
interaction.
Nuclei
In both and + decay, the mass number A remains unchanged. In
decay, the atomic number Z of the nucleus goes up by 1, while in +
decay Z goes down by 1. The basic nuclear process underlying decay
is the conversion of neutron to proton
n p + e +
(13.24)
tt
o N
be C
E
re R
pu T
bl
is
he
d
(13.25)
Note that while a free neutron decays to proton, the decay of proton to
neutron [Eq. (13.25)] is possible only inside the nucleus, since proton
has smaller mass than neutron.
Like an atom, a nucleus also has discrete energy levels - the ground
state and excited states. The scale of energy is, however, very different.
Atomic energy level spacings are of the order of eV, while the difference in
nuclear energy levels is of the order of MeV. When a
nucleus in an excited state spontaneously decays
to its ground state (or to a lower energy state), a
photon is emitted with energy equal to the difference
in the two energy levels of the nucleus. This is the
so-called gamma decay. The energy (MeV)
corresponds to radiation of extremely short
wavelength, shorter than the hard X-ray region.
Typically, a gamma ray is emitted when a or
decay results in a daughter nucleus in an excited
state. This then returns to the ground state by a
single photon transition or successive transitions
involving more than one photon. A familiar example
FIGURE 13.4 -decay of 60
Ni nucleus
28
is the successive emmission of gamma rays of
followed by emission of two rays
energies 1.17 MeV and 1.33 MeV from the
from deexcitation of the daughter
deexcitation of 60
nuclei formed from decay
28 Ni
nucleus 60
Ni .
28
60
of 27 Co .
no
The curve of binding energy per nucleon Ebn, given in Fig. 13.1, has
a long flat middle region between A = 30 and A = 170. In this region
the binding energy per nucleon is nearly constant (8.0 MeV). For
the lighter nuclei region, A < 30, and for the heavier nuclei region,
A > 170, the binding energy per nucleon is less than 8.0 MeV, as we
have noted earlier. Now, the greater the binding energy, the less is the
total mass of a bound system, such as a nucleus. Consequently, if nuclei
with less total binding energy transform to nuclei with greater binding
energy, there will be a net energy release. This is what happens when a
heavy nucleus decays into two or more intermediate mass fragments
(fission) or when light nuclei fuse into a havier nucleus (fusion.)
Exothermic chemical reactions underlie conventional energy sources
such as coal or petroleum. Here the energies involved are in the range of
451
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
electron volts. On the other hand, in a nuclear reaction, the energy release
is of the order of MeV. Thus for the same quantity of matter, nuclear
sources produce a million times more energy than a chemical source.
Fission of 1 kg of uranium, for example, generates 10 14 J of energy;
compare it with burning of 1 kg of coal that gives 10 7 J.
13.7.1 Fission
235
92
U bombarded with a
236
144
89
1
n +235
92 U 92 U 56 Ba + 36 Kr + 3 0 n
(13.26)
236
133
n + 235
92 U 92 U 51Sb +
Or, as another example,
1
0
140
n + 235
92 U 54 Xe +
94
38
99
41
Nb + 4 10 n
Sr + 2 10n
(13.27)
(13.28)
no
452
Nuclei
I NDIAS ATOMIC
ENERGY PROGRAMME
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
The atomic energy programme in India was launched around the time of independence
under the leadership of Homi J. Bhabha (1909-1966). An early historic achievement
was the design and construction of the first nuclear reactor in India (named Apsara)
which went critical on August 4, 1956. It used enriched uranium as fuel and water as
moderator. Following this was another notable landmark: the construction of CIRUS
(Canada India Research U.S.) reactor in 1960. This 40 MW reactor used natural uranium
as fuel and heavy water as moderator. Apsara and CIRUS spurred research in a wide
range of areas of basic and applied nuclear science. An important milestone in the first
two decades of the programme was the indigenous design and construction of the
plutonium plant at Trombay, which ushered in the technology of fuel reprocessing
(separating useful fissile and fertile nuclear materials from the spent fuel of a reactor) in
India. Research reactors that have been subsequently commissioned include ZERLINA,
PURNIMA (I, II and III), DHRUVA and KAMINI. KAMINI is the countrys first large research
reactor that uses U-233 as fuel. As the name suggests, the primary objective of a research
reactor is not generation of power but to provide a facility for research on different aspects
of nuclear science and technology. Research reactors are also an excellent source for
production of a variety of radioactive isotopes that find application in diverse fields:
industry, medicine and agriculture.
The main objectives of the Indian Atomic Energy programme are to provide safe and
reliable electric power for the countrys social and economic progress and to be selfreliant in all aspects of nuclear technology. Exploration of atomic minerals in India
undertaken since the early fifties has indicated that India has limited reserves of uranium,
but fairly abundant reserves of thorium. Accordingly, our country has adopted a threestage strategy of nuclear power generation. The first stage involves the use of natural
uranium as a fuel, with heavy water as moderator. The Plutonium-239 obtained from
reprocessing of the discharged fuel from the reactors then serves as a fuel for the second
stage the fast breeder reactors. They are so called because they use fast neutrons for
sustaining the chain reaction (hence no moderator is needed) and, besides generating
power, also breed more fissile species (plutonium) than they consume. The third stage,
most significant in the long term, involves using fast breeder reactors to produce fissile
Uranium-233 from Thorium-232 and to build power reactors based on them.
India is currently well into the second stage of the programme and considerable
work has also been done on the third the thorium utilisation stage. The country
has mastered the complex technologies of mineral exploration and mining, fuel
fabrication, heavy water production, reactor design, construction and operation, fuel
reprocessing, etc. Pressurised Heavy Water Reactors (PHWRs) built at different sites in
the country mark the accomplishment of the first stage of the programme. India is now
more than self-sufficient in heavy water production. Elaborate safety measures both in
the design and operation of reactors, as also adhering to stringent standards of
radiological protection are the hallmark of the Indian Atomic Energy Programme.
produced, in some 3, etc. The extra neutrons in turn can initiate fission
processes, producing still more neutrons, and so on. This leads to the
possibility of a chain reaction, as was first suggested by Enrico Fermi. If
the chain reaction is controlled suitably, we can get a steady energy
453
Physics
http://www.npcil.nic.in/main/AllProjectOperationDisplay.aspx
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
239
U + n 239
92 U 93 Np +e +
239
93
Np 239
94 Pu+ e +
(13.29)
454
fission. It contains the fuel elements in suitably fabricated form. The fuel
may be say enriched uranium (i.e., one that has greater abundance of
235
than naturally occurring uranium). The core contains a moderator
92 U
to slow down the neutrons. The core is surrounded by a reflector to reduce
leakage. The energy (heat) released in fission is continuously removed by
a suitable coolant. A containment vessel prevents the escape of radioactive
fission products. The whole assembly is shielded to check harmful
radiation from coming out. The reactor can be shut down by means of
rods (made of, for example, cadmium) that have high absorption of
neutrons. The coolant transfers heat to a working fluid which in turn
may produce stream. The steam drives turbines and generates electricity.
Like any power reactor, nuclear reactors generate considerable waste
products. But nuclear wastes need special care for treatment since they
are radioactive and hazardous. Elaborate safety measures, both for reactor
operation as well as handling and reprocessing the spent fuel, are
required. These safety measures are a distinguishing feature of the Indian
Atomic Energy programme. An appropriate plan is being evolved to study
the possibility of converting radioactive waste into less active and shortlived material.
http://esa21.kennesaw.edu/activities/nukeenergy/nuke.htm
tt
o N
be C
E
re R
pu T
bl
is
he
d
Nuclei
no
When two light nuclei fuse to form a larger nucleus, energy is released,
since the larger nucleus is more tightly bound, as seen from the binding
energy curve in Fig.13.1. Some examples of such energy liberating nuclear
fusion reactions are :
1
1
[13.29(a)]
2
1
[13.29(b)]
2
1
[13.29(c)]
455
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
+
H + 11H 21H + e + + 0.42 MeV
e + + e + + 1.02 MeV
(i)
(ii)
2
1
3
2
(iii)
(13.30)
no
For the fourth reaction to occur, the first three reactions must occur
twice, in which case two light helium nuclei unite to form ordinary helium
nucleus. If we consider the combination 2(i) + 2(ii) + 2(iii) +(iv), the net
effect is
456
(13.31)
4
2
He atom with a
Helium is not the only element that can be synthesized in the interior
of a star. As the hydrogen in the core gets depleted and becomes helium,
the core starts to cool. The star begins to collapse under its own gravity
Nuclei
tt
o N
be C
E
re R
pu T
bl
is
he
d
NUCLEAR
HOLOCAUST
In a single uranium fission about 0.9235 MeV (200 MeV) of energy is liberated. If
each nucleus of about 50 kg of 235U undergoes fission the amount of energy involved is
about 4 1015J. This energy is equivalent to about 20,000 tons of TNT, enough for a
superexplosion. Uncontrolled release of large nuclear energy is called an atomic explosion.
On August 6, 1945 an atomic device was used in warfare for the first time. The US
dropped an atom bomb on Hiroshima, Japan. The explosion was equivalent to 20,000
tons of TNT. Instantly the radioactive products devastated 10 sq km of the city which
had 3,43,000 inhabitants. Of this number 66,000 were killed and 69,000 were injured;
more than 67% of the citys structures were destroyed.
High temperature conditions for fusion reactions can be created by exploding a fission
bomb. Super-explosions equivalent to 10 megatons of explosive power of TNT were tested
in 1954. Such bombs which involve fusion of isotopes of hydrogen, deuterium and tritium
are called hydrogen bombs. It is estimated that a nuclear arsenal sufficient to destroy
every form of life on this planet several times over is in position to be triggered by the
press of a button. Such a nuclear holocaust will not only destroy the life that exists now
but its radioactive fallout will make this planet unfit for life for all times. Scenarios based
on theoretical calculations predict a long nuclear winter, as the radioactive waste will
hang like a cloud in the earths atmosphere and will absorb the suns radiation.
no
457
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
458
EXAMPLE 13.7
no
Nuclei
SUMMARY
An atom has a nucleus. The nucleus is positively charged. The radius
of the nucleus is smaller than the radius of an atom by a factor of
104. More than 99.9% mass of the atom is concentrated in the nucleus.
tt
o N
be C
E
re R
pu T
bl
is
he
d
1.
2.
3.
4.
A
Z
X , where X is the
Nuclides with the same atomic number Z, but different neutron number
N are called isotopes. Nuclides with the same A are isobars and those
with the same N are isotones.
Most elements are mixtures of two or more isotopes. The atomic mass
of an element is a weighted average of the masses of its isotopes. The
masses are the relative abundances of the isotopes.
5.
6.
7.
M = (Z mp + (A Z )mn ) M
Using Einsteins mass energy relation, we express this mass difference
in terms of energy as
Eb = M c2
no
8.
9.
459
Physics
-rays are electrons. -rays are electromagnetic radiation of wavelengths
shorter than X-rays;
tt
o N
be C
E
re R
pu T
bl
is
he
d
T1/2 =
ln2
= ln 2
12. Energy is released when less tightly bound nuclei are transmuted into
more tightly bound nuclei. In fission, a heavy nucleus like
into two smaller fragments, e.g.,
23 5
92
1
0
13 3
51
U+ n
Sb +
99
41
235
92
U breaks
1
Nb + 4 0 n
13. The fact that more neutrons are produced in fission than are consumed
gives the possibility of a chain reaction with each neutron that is
produced triggering another fission. The chain reaction is uncontrolled
and rapid in a nuclear bomb explosion. It is controlled and steady in
a nuclear reactor. In a reactor, the value of the neutron multiplication
factor k is maintained at 1.
14. In fusion, lighter nuclei combine to form a larger nucleus. Fusion of
hydrogen nuclei into helium nuclei is the source of energy of all stars
including our sun.
Physical Quantity
Symbol
Dimensions
Units
Remarks
[M]
[T 1 ]
s1
T 1/2
[T]
Mean life
[T]
[ T1 ]
Bq
Disintegration or
decay constant
no
Half-life
POINTS TO PONDER
1.
2.
460
Nuclei
This is because electron scattering senses the charge distribution of
the nucleus, whereas alpha and similar particles sense the nuclear
matter.
After Einstein showed the equivalence of mass and energy, E = mc 2 ,
we cannot any longer speak of separate laws of conservation of mass
and conservation of energy, but we have to speak of a unified law of
conservation of mass and energy. The most convincing evidence that
this principle operates in nature comes from nuclear physics. It is
central to our understanding of nuclear energy and harnessing it as a
source of power. Using the principle, Q of a nuclear process (decay or
reaction) can be expressed also in terms of initial and final masses.
tt
o N
be C
E
re R
pu T
bl
is
he
d
3.
4.
The nature of the binding energy (per nucleon) curve shows that
exothermic nuclear reactions are possible, when two light nuclei fuse
or when a heavy nucleus undergoes fission into nuclei with intermediate
mass.
5.
For fusion, the light nuclei must have sufficient initial energy to
overcome the coulomb potential barrier. That is why fusion requires
very high temperatures.
6.
Although the binding energy (per nucleon) curve is smooth and slowly
varying, it shows peaks at nuclides like 4 He, 16O etc. This is considered
as evidence of atom-like shell structure in nuclei.
7.
8.
no
461
Physics
EXERCISES
tt
o N
be C
E
re R
pu T
bl
is
he
d
You may find the following data useful in solving the exercises:
e = 1.61019C
N = 6.02310 23 per mole
9
2
2
23 0 1
1/(4 0) = 9 10 N m /C
k = 1.38110 J K
1 MeV = 1.61013J
7
1 year = 3.15410 s
1 u = 931.5 MeV/c 2
mH = 1.007825 u
mn = 1.008665 u
m( He ) = 4.002603 u
m e = 0.000548 u
4
2
13.1
13.2
20
21
22
The three stable isotopes of neon: 10
Ne, 10
Ne and 10
Ne have
respective abundances of 90.51%, 0.27% and 9.22%. The atomic
masses of the three isotopes are 19.99 u, 20.99 u and 21.99 u,
respectively. Obtain the average atomic mass of neon.
13.3
13.4
13.5
13.6
22 6
88
(iii) -decay of
+
(v) -decay of
Ra
32
15
11
6
(ii) -decay of
no
Bi in units of
Pu
210
83
97
43
(vi) -decay of
120
54
242
94
(iv) -decay of
462
209
83
N ,
13.8
Fe and
14
7
m ( 56
Fe ) = 55.934939 u
m ( 209
Bi ) = 208.980388 u
26
83
A given coin has a mass of 3.0 g. Calculate the nuclear energy that
would be required to separate all the neutrons and protons from
each other. For simplicity assume that the coin is entirely made of
63
atoms (of mass 62.92960 u).
29 Cu
(i) -decay of
13.7
56
26
Bi
Tc
Xe
A radioactive isotope has a half-life of T years. How long will it take
the activity to reduce to a) 3.125%, b) 1% of its original value?
The normal activity of living carbon-containing matter is found to
be about 15 decays per minute for every gram of carbon. This activity
arises from the small proportion of radioactive 146 C present with the
stable carbon isotope 126 C . When the organism is dead, its interaction
with the atmosphere (which maintains the above equilibrium activity)
ceases and its activity begins to drop. From the known half-life (5730
years) of 146 C , and the measured activity, the age of the specimen
can be approximately estimated. This is the principle of 146 C dating
Nuclei
used in archaeology. Suppose a specimen from Mohenjodaro gives
an activity of 9 decays per minute per gram of carbon. Estimate the
approximate age of the Indus-Valley civilisation.
13.9
tt
o N
be C
E
re R
pu T
bl
is
he
d
m ( 222
Rn ) = 222.01750 u,
86
m ( 218 46 Po ) = 216.00189 u.
11
C decays according to
23
23
m ( 10
Ne ) = 22.994466 u
23
m ( 11
Na ) = 22.089770 u.
12
6
20
C+126 C 10
Ne+24 He
no
m ( 12 H ) = 2.014102 u
m ( 13 H ) = 3.016049 u
m ( 126 C ) = 12.000000 u
m ( 1200 Ne ) = 19.992439 u
463
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
H+ 1 H 2 He +n+3.27 MeV
13.20 Calculate the height of the potential barrier for a head on collision
of two deuterons. (Hint: The height of the potential barrier is given
by the Coulomb repulsion between the two deuterons when they
just touch each other. Assume that they can be taken as hard
spheres of radius 2.0 fm.)
1/3
13.21 From the relation R = R 0A , where R 0 is a constant and A is the
mass number of a nucleus, show that the nuclear matter density is
nearly constant (i.e. independent of A).
+
13.22 For the (positron) emission from a nucleus, there is another
competing process known as electron capture (electron from an inner
orbit, say, the Kshell, is captured by the nucleus and a neutrino is
emitted).
+
e +
A
Z
A
Z 1
Y+
ADDITIONAL EXERCISES
no
m ( 26
) = 25.986895 u
13 Al
464
m ( 27
13 Al ) = 26.981541 u
13.25 A source contains two phosphorous radio nuclides 32
(T 1/2 = 14.3d)
15 P
and 33
(T
=
25.3d).
Initially,
10%
of
the
decays
come
from 33
.
15 P
1/2
15 P
How long one must wait until 90% do so?
13.26 Under certain circumstances, a nucleus can decay by emitting a
particle more massive than an -particle. Consider the following
decay processes:
22 3
88
Ra
2 09
82
Pb +
14
6
Nuclei
22 3
88
Ra
2 19
86
Rn + 2 He
Calculate the Q-values for these decays and determine that both
are energetically allowed.
tt
o N
be C
E
re R
pu T
bl
is
he
d
13.27 Consider the fission of 239 28 U by fast neutrons. In one fission event,
no neutrons are emitted and the final end products, after the beta
99
decay of the primary fragments, are 140
58 Ce and 44 Ru . Calculate Q
for this fission process. The relevant atomic and particle masses
are
m( 238
) =238.05079 u
92 U
m( 140
58 Ce ) =139.90543 u
m( 99
) = 98.90594 u
44 Ru
H + 1 H 2 He + n
(a) Calculate the energy released in MeV in this reaction from the
data:
m( 21 H )=2.014102 u
m( 31 H ) =3.016049 u
no
FIGURE13.6
465
Physics
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
466
Chapter Fourteen
tt
o N
be C
E
re R
pu T
bl
is
he
d
SEMICONDUCTOR
ELECTRONICS:
MATERIALS, DEVICES
AND SIMPLE CIRCUITS
no
14.1 INTRODUCTION
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
and flow of charge carriers in the semiconductor devices are within the
solid itself, while in the earlier vacuum tubes/valves, the mobile electrons
were obtained from a heated cathode and they were made to flow in an
evacuated space or vacuum. No external heating or large evacuated space
is required by the semiconductor devices. They are small in size, consume
low power, operate at low voltages and have long life and high reliability.
Even the Cathode Ray Tubes (CRT) used in television and computer
monitors which work on the principle of vacuum tubes are being replaced
by Liquid Crystal Display (LCD) monitors with supporting solid state
electronics. Much before the full implications of the semiconductor devices
was formally understood, a naturally occurring crystal of galena (Lead
sulphide, PbS) with a metal point contact attached to it was used as
detector of radio waves.
In the following sections, we will introduce the basic concepts of
semiconductor physics and discuss some semiconductor devices like
junction diodes (a 2-electrode device) and bipolar junction transistor (a
3-electrode device). A few circuits illustrating their applications will also
be described.
no
468
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
469
Physics
completely occupied by the 4N valence electrons at temperature of absolute
zero is the valence band. The other band consisting of 4N energy states,
called the conduction band, is completely empty at absolute zero.
THEORY OF SOLIDS
tt
o N
be C
E
re R
pu T
bl
is
he
d
B AND
no
470
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
471
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
Case II: In this case, as shown in Fig. 14.2(b), a large band gap Eg exists
(Eg > 3 eV). There are no electrons in the conduction band, and therefore
no electrical conduction is possible. Note that the energy gap is so large
that electrons cannot be excited from the valence band to the conduction
band by thermal excitation. This is the case of insulators.
Case III: This situation is shown in Fig. 14.2(c). Here a finite but small
band gap (E g < 3 eV) exists. Because of the small band gap, at room
temperature some electrons from valence band can acquire enough
energy to cross the energy gap and enter the conduction band. These
electrons (though small in numbers) can move in the conduction band.
Hence, the resistance of semiconductors is not as high as that of the
insulators.
In this section we have made a broad classification of metals,
conductors and semiconductors. In the section which follows you will
learn the conduction process in semiconductors.
no
We shall take the most common case of Ge and Si whose lattice structure
is shown in Fig. 14.3. These structures are called the diamond-like
structures. Each atom is surrounded by four nearest neighbours. We
know that Si and Ge have four valence electrons. In its crystalline
structure, every Si or Ge atom tends to share one of its four valence
electrons with each of its four nearest neighbour atoms, and also to take
share of one electron from each such neighbour. These shared electron
pairs are referred to as forming a covalent bond or simply a valence
bond. The two shared electrons can be assumed to shuttle back-andforth between the associated atoms holding them together strongly.
Figure 14.4 schematically shows the 2-dimensional representation of Si
or Ge structure shown in Fig. 14.3 which overemphasises the covalent
bond. It shows an idealised picture in which no bonds are broken (all
bonds are intact). Such a situation arises at low
temperatures. As the temperature increases, more
thermal energy becomes available to these electrons
and some of these electrons may breakaway
(becoming free electrons contributing to conduction).
The thermal energy effectively ionises only a few atoms
in the crystalline lattice and creates a vacancy in the
bond as shown in Fig. 14.5(a). The neighbourhood,
from which the free electron (with charge q ) has come
out leaves a vacancy with an effective charge (+q ). This
vacancy with the effective positive electronic charge is
called a hole. The hole behaves as an apparent free
particle with effective positive charge.
In intrinsic semiconductors, the number of free
FIGURE 14.3 Three-dimensional diaelectrons,
ne is equal to the number of holes, n h. That is
mond-like crystal structure for Carbon,
n
=
n
= ni
(14.1)
Silicon or Germanium with
e
h
where
n
is
called
intrinsic
carrier
concentration.
respective lattice spacing a equal
i
Semiconductors posses the unique property in
to 3.56, 5.43 and 5.66 .
472
which, apart from electrons, the holes also move.
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
I = Ie + Ih
(14.2)
It may be noted that apart from the process of generation of conduction
electrons and holes, a simultaneous process of recombination occurs in
which the electrons recombine with the holes. At equilibrium, the rate of
generation is equal to the rate of recombination of charge carriers. The
recombination occurs due to an electron colliding with a hole.
(a)
(b)
FIGURE 14.5 (a) Schematic model of generation of hole at site 1 and conduction electron
due to thermal energy at moderate temperatures. (b) Simplified representation of
possible thermal motion of a hole. The electron from the lower left hand covalent bond
(site 2) goes to the earlier hole site1, leaving a hole at its site indicating an
473
apparent movement of the hole from site 1 to site 2.
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
An intrinsic semiconductor
will behave like an insulator at
T = 0 K as shown in Fig. 14.6(a).
It is the thermal energy at
higher temperatures (T > 0K),
which excites some electrons
from the valence band to the
conduction band. These
thermally excited electrons at
T > 0 K, partially occupy the
conduction band. Therefore,
the energy-band diagram of an
intrinsic semiconductor will be
as shown in Fig. 14.6(b). Here,
some electrons are shown in
the conduction band. These
have come from the valence
band leaving equal number of
holes there.
EXAMPLE 14.1
no
474
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
Physics
semiconductor doped with pentavalent impurity, electrons
become the majority carriers and holes the minority carriers.
These semiconductors are, therefore, known as n-type
semiconductors. For n-type semiconductors, we have,
ne >> nh
(14.3)
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
476
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
tt
o N
be C
E
re R
pu T
bl
is
he
d
(14.5)
FIGURE 14.9 Energy bands of (a) n-type semiconductor at T > 0K, (b) p-type
semiconductor at T > 0K.
EXAMPLE 14.2
no
477
Physics
14.5 p-n JUNCTION
http://hyperphysics.phy-astr.gsu.edu/hbase/solids/pnjun.html
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
tt
o N
be C
E
re R
pu T
bl
is
he
d
Solution No! Any slab, howsoever flat, will have roughness much
larger than the inter-atomic crystal spacing (~2 to 3 ) and hence
continuous contact at the atomic level will not be possible. The junction
will behave as a discontinuity for the flowing charge carriers.
EXAMPLE 14.3
no
479
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
480
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
bias, the current first increases very slowly, almost negligibly, till the
voltage across the diode crosses a certain value. After the characteristic
voltage, the diode current increases significantly (exponentially), even for
a very small increase in the diode bias voltage. This voltage is called the
threshold voltage or cut-in voltage (~0.2V for germanium diode and
~0.7 V for silicon diode).
For the diode in reverse bias, the current is very small (~A) and almost
remains constant with change in bias. It is called reverse saturation
current. However, for special cases, at very high reverse bias (break down
voltage), the current suddenly increases. This special action of the diode
is discussed later in Section 14.8. The general purpose diode are not
used beyond the reverse saturation current region.
The above discussion shows that the p-n junction diode primerly
allows the flow of current only in one direction (forward bias). The forward
bias resistance is low as compared to the reverse bias resistance. This
property is used for rectification of ac voltages as discussed in the next
section. For diodes, we define a quantity called dynamic resistance as
the ratio of small change in voltage V to a small change in current I:
rd =
V
I
(14.6)
482
FIGURE 14.17
EXAMPLE 14.4
no
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
From the V-I characteristic of a junction diode we see that it allows current
to pass only when it is forward biased. So if an alternating voltage is
applied across a diode the current flows only in that part of the cycle
when the diode is forward biased. This property
is used to rectify alternating voltages and the
circuit used for this purpose is called a rectifier.
If an alternating voltage is applied across a
diode in series with a load, a pulsating voltage will
appear across the load only during the half cycles
of the ac input during which the diode is forward
biased. Such rectifier circuit, as shown in
Fig. 14.18, is called a half-wave rectifier. The
secondary of a transformer supplies the desired
ac voltage across terminals A and B. When the
voltage at A is positive, the diode is forward biased
and it conducts. When A is negative, the diode is
reverse-biased and it does not conduct. The reverse
saturation current of a diode is negligible and can
be considered equal to zero for practical purposes.
(The reverse breakdown voltage of the diode must
be sufficiently higher than the peak ac voltage at
the secondary of the transformer to protect the
diode from reverse breakdown.)
Therefore, in the positive half-cycle of ac there
FIGURE 14.18 (a) Half-wave rectifier
is a current through the load resistor R L and we
circuit, (b) Input ac voltage and output
get an output voltage, as shown in Fig. 14.18(b),
voltage waveforms from the rectifier circuit.
whereas there is no current in the negative halfcycle. In the next positive half-cycle, again we get
the output voltage. Thus, the output voltage, though still varying, is
restricted to only one direction and is said to be rectified. Since the
rectified output of this circuit is only for half of the input ac wave it is
called as half-wave rectifier.
The circuit using two diodes, shown in Fig. 14.19(a), gives output
rectified voltage corresponding to both the positive as well as negative
half of the ac cycle. Hence, it is known as full-wave rectifier. Here the
p-side of the two diodes are connected to the ends of the secondary of the
transformer. The n-side of the diodes are connected together and the
output is taken between this common point of diodes and the midpoint
of the secondary of the transformer. So for a full-wave rectifier the
secondary of the transformer is provided with a centre tapping and so it
is called centre-tap transformer. As can be seen from Fig.14.19(c) the
voltage rectified by each diode is only half the total secondary voltage.
Each diode rectifies only for half the cycle, but the two do so for alternate
cycles. Thus, the output between their common terminals and the centretap of the transformer becomes a full-wave rectifier output. (Note that
there is another circuit of full wave rectifier which does not need a centre483
tap transformer but needs four diodes.) Suppose the input voltage to A
Physics
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
tt
o N
be C
E
re R
pu T
bl
is
he
d
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
FIGURE 14.20 (a) A full-wave rectifier with capacitor filter, (b) Input and output
voltage of rectifier in (a).
In the section, we shall discuss some devices which are basically junction
diodes but are developed for different applications.
no
Physics
Zener diode as a voltage regulator
tt
o N
be C
E
re R
pu T
bl
is
he
d
We know that when the ac input voltage of a rectifier fluctuates, its rectified
output also fluctuates. To get a constant dc voltage from the dc
unregulated output of a rectifier, we use a Zener diode. The circuit diagram
of a voltage regulator using a Zener diode is shown in Fig. 14.22.
The unregulated dc voltage (filtered output of a
rectifier) is connected to the Zener diode through a series
resistance R s such that the Zener diode is reverse biased.
If the input voltage increases, the current through R s
and Zener diode also increases. This increases the
voltage drop across R s without any change in the
voltage across the Zener diode. This is because in the
breakdown region, Zener voltage remains constant even
though the current through the Zener diode changes.
Similarly, if the input voltage decreases, the current
through Rs and Zener diode also decreases. The voltage
drop across Rs decreases without any change in the
FIGURE 14.22 Zener diode as DC
voltage across the Zener diode. Thus any increase/
voltage regulator
decrease in the input voltage results in, increase/
decrease of the voltage drop across Rs without any
change in voltage across the Zener diode. Thus the Zener diode acts as a
voltage regulator. We have to select the Zener diode according to the
required output voltage and accordingly the series resistance R s.
EXAMPLE 14.5
no
486
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
(i) Photodiode
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
p = p + p
Here n and p are the electron and hole concentrations* at any
particular illumination and n and p are carriers concentration when
there is no illumination. Remember n = p and n >> p. Hence, the
* Note that, to create an e-h pair, we spend some ener gy (photoexcitation, thermal
excitation, etc.). Therefore when an electron and hole recombine the energy is
released in the form of light (radiative recombination) or heat (non-radiative
recombination). It depends on semiconductor and the method of fabrication of
the p-n junction. For the fabrication of LEDs, semiconductors like GaAs, GaAsGaP are used in which radiative recombination dominates.
EXAMPLE 14.6
487
tt
o N
be C
E
re R
pu T
bl
is
he
d
EXAMPLE 14.6
Physics
no
488
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
(iii) Solar cell
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
Physics
Example 14.7 Why are Si and GaAs are preferred materials for
solar cells?
tt
o N
be C
E
re R
pu T
bl
is
he
d
EXAMPLE 14.7
FIGURE 14.26
no
490
The credit of inventing the transistor in the year 1947 goes to J. Bardeen
and W.H. Brattain of Bell Telephone Laboratories, U.S.A. That transistor
was a point-contact transistor. The first junction transistor consisting of
two back-to-back p-n junctions was invented by William Schockley
in 1951.
As long as only the junction transistor was known, it was known
simply as transistor. But over the years new types of transistors were
invented and to differentiate it from the new ones it is now called the
Bipolar Junction Transistor (BJT). Even now, often the word transistor
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
is used to mean BJT when there is no confusion. Since our study is
limited to only BJT, we shall use the word transistor for BJT without
any ambiguity.
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
FIGURE 14.27
(a) Schematic
representations of a
n-p-n transistor and
p-n-p transistor, and
(b) Symbols for n-p-n
and p-n-p transistors.
491
Physics
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
IE = IC + I B
(14.7)
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
493
Physics
no
tt
o N
be C
E
re R
pu T
bl
is
he
d
494
ri =
VBE
I B
(14.8)
VCE
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
(ii) Output resistance (ro ): This is defined as the ratio of change in
collector-emitter voltage (VCE ) to the change in collector current (IC)
at a constant base current IB.
VCE
IC
(14.9)
IB
tt
o N
be C
E
re R
pu T
bl
is
he
d
ro =
The output characteristics show that initially for very small values of
VCE , IC increases almost linearly. This happens because the base-collector
junction is not reverse biased and the transistor is not in active state. In
fact, the transistor is in the saturation state and the current is controlled
by the supply voltage VCC (=VCE) in this part of the characteristic. When
VCE is more than that required to reverse bias the base-collector junction,
IC increases very little with VCE. The reciprocal of the slope of the linear
part of the output characteristic gives the values of ro. The output
resistance of the transistor is mainly controlled by the bias of the basecollector junction. The high magnitude of the output resistance (of the
order of 100 k) is due to the reverse-biased state of this diode. This
also explains why the resistance at the initial part of the characteristic,
when the transistor is in saturation state, is very low.
(iii) Current amplification factor ( ): This is defined as the ratio of
the change in collector current to the change in base current at a
constant collector-emitter voltage (V CE) when the transistor is in
active state.
ac =
I C
I B
(14.10)
VCE
This is also known as small signal current gain and its value is very
large.
If we simply find the ratio of IC and IB we get what is called dc of the
transistor. Hence,
IC
(14.11)
IB
Since I C increases with IB almost linearly and I C = 0 when IB = 0, the values
of both dc and ac are nearly equal. So, for most calculations dc can be
used. Both ac and dc vary with VCE and IB (or I C) slightly.
dc =
ac =
IC
I B
, dc =
V CE
IC
IB
For determining ac and dc at the stated values of VCE and IC one can
proceed as follows. Consider any two characteristics for two values
of IB which lie above and below the given value of I C . Here IC = 4.0 mA.
(Choose characteristics for IB = 30 and 20 A.) At VCE = 10 V we read
the two values of IC from the graph. Then
EXAMPLE 14.8
no
Solution
495
Physics
Therefore, ac = 1.5 mA/ 10 A = 150
For determining dc, either estimate the value of IB corresponding to
I C = 4.0 mA at V CE = 10 V or calculate the two values of dc for the two
characteristics chosen and find their mean.
Therefore, for I C = 4.5 mA and IB = 30 A,
dc = 4.5 mA/ 30 A = 150
and for IC = 3.0 mA and IB = 20 A
dc =3.0 mA / 20 A = 150
Hence, dc =(150 + 150) /2 = 150
tt
o N
be C
E
re R
pu T
bl
is
he
d
EXAMPLE 14.8
no
496
When V i becomes greater than 0.6 V the transistor is in active state with
some current IC in the output path and the output Vo decrease as the
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
tt
o N
be C
E
re R
pu T
bl
is
he
d
For using the transistor as an amplifier we will use the active region of
the Vo versus V i curve. The slope of the linear part of the curve represents
the rate of change of the output with the input. It is negative because the
output is VCC ICR C and not ICR C. That is why as input voltage of the CE
amplifier increases its output voltage decreases and the output is said to
be out of phase with the input. If we consider Vo and Vi as small
changes in the output and input voltages then Vo /Vi is called the small
signal voltage gain AV of the amplifier.
If the VBB voltage has a fixed value corresponding to the mid point of
the active region, the circuit will behave as a CE amplifier with voltage
gain Vo / Vi . We can express the voltage gain AV in terms of the resistors
in the circuit and the current gain of the transistor as follows.
We have, Vo = VCC ICRC
Therefore, V o = 0 RC IC
no
Vi = RB I B + V BE
But VBE is negligibly small in comparison to I BRB in this circuit.
So, the voltage gain of this CE amplifier (Fig. 14.32) is given by
AV = RC IC / R B IB
= ac(R C /RB )
(14.14)
where ac is equal to I C/I B from Eq. (14.10). Thus the linear
portion of the active region of the transistor can be exploited for the use
in amplifiers. Transistor as an amplifier (CE configuration) is discussed
in detail in the next section.
497
Physics
14.9.4 Transistor as an Amplifier (CE-Configuration)
tt
o N
be C
E
re R
pu T
bl
is
he
d
(14.15)
(14.16)
V BE + vi = VBE + I B RB + IB (R B + r i)
no
v i = IB (R B + ri )
498
= r IB
The change in I B causes a change in I c. We define a parameter ac,
which is similar to the dc defined in Eq. (14.11), as
I
i
ac = c = c
(14.17)
I B ib
which is also known as the ac current gain A i. Usually ac is close to dc
in the linear region of the output characteristics.
The change in I c due to a change in IB causes a change in VCE and the
voltage drop across the resistor RL because VCC is fixed.
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
These changes can be given by Eq. (14.15) as
VCC = V CE + R L IC = 0
or V CE = R L IC
The change in VCE is the output voltage v 0. From Eq. (14.10), we get
tt
o N
be C
E
re R
pu T
bl
is
he
d
v0 = V CE = ac RL IB
The voltage gain of the amplifier is
Av =
v0
VCE
=
vi
r I B
ac RL
(14.18)
r
The negative sign represents that output voltage is opposite with phase
with the input voltage.
From the discussion of the transistor characteristics you have seen
that there is a current gain ac in the CE configuration. Here we have also
seen the voltage gain Av . Therefore the power gain Ap can be expressed
as the product of the current gain and voltage gain. Mathematically
Ap = ac A v
(14.19)
Since ac and A v are greater than 1, we get ac power gain. However it
should be realised that transistor is not a power generating device. The
energy for the higher ac power at the output is supplied by the battery.
Example 14.9 In Fig. 14.31(a), the VBB supply can be varied from 0V
to 5.0 V. The Si transistor has dc = 250 and RB = 100 k, RC = 1 K,
VCC = 5.0V. Assume that when the transistor is saturated, V CE = 0V
and VBE = 0.8V. Calculate (a) the minimum base current, for which
the transistor will reach saturation. Hence, (b) determine V1 when
the transistor is switched on. (c) find the ranges of V1 for which the
transistor is switched off and switched on.
no
Note that the transistor is in active state when I B varies from 0.0mA
to 20mA. In this range, IC = IB is valid. In the saturation range,
IC IB.
EXAMPLE 14.9
Solution
Given at saturation VCE = 0V, VBE = 0.8V
VCE = VCC ICR C
IC = VCC/RC = 5.0V/1.0k = 5.0 mA
Therefore IB = IC/ = 5.0 mA/250 = 20 A
The input voltage at which the transistor will go into saturation is
given by
VIH = V BB = IBR B +V BE
= 20A 100 k + 0.8V = 2.8V
The value of input voltage below which the transistor remains cutoff
is given by
VIL = 0.6V, V IH = 2.8V
Between 0.0V and 0.6V, the transistor will be in the switched off
state. Between 2.8V and 5.0V, it will be in switched on state.
499
Physics
EXAMPLE 14.10
tt
o N
be C
E
re R
pu T
bl
is
he
d
no
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
tt
o N
be C
E
re R
pu T
bl
is
he
d
apply proper bias for the first time. Obviously, a surge of collector current
flows in the transistor. This current flows through the coil T2 where
terminals are numbered 3 and 4 [Fig. 14.33(b)]. This current does not
reach full amplitude instantaneously but increases from X to Y, as shown
in Fig. [14.33(c)(i)]. The inductive coupling between coil T2 and coil T 1
now causes a current to flow in the emitter circuit (note that this actually
is the feedback from input to output). As a result of this positive feedback,
this current (in T1; emitter current) also increases from X to Y [Fig.
14.33(c)(ii)]. The current in T2 (collector current) connected in the collector
circuit acquires the value Y when the transistor becomes saturated. This
means that maximum collector current is flowing and can increase no
further. Since there is no further change in collector current, the magnetic
field around T2 ceases to grow. As soon as the field becomes static, there
will be no further feedback from T2 to T1. Without continued feedback,
the emitter current begins to fall. Consequently, collector current decreases
from Y towards Z [Fig. 14.33(c)(i)]. However, a decrease of collector current
causes the magnetic field to decay around the coil T2 . Thus, T1 is now
seeing a decaying field in T2 (opposite from what it saw when the field was
growing at the initial start operation). This causes a further decrease in
the emitter current till it reaches Z when the transistor is cut-off. This
means that both IE and IC cease to flow. Therefore, the transistor has
reverted back to its original state (when the power was first switched on).
The whole process now repeats itself. That is, the transistor is driven to
saturation, then to cut-off, and then back to saturation. The time for
change from saturation to cut-off and back is determined by the constants
of the tank circuit or tuned circuit (inductance L of coil T2 and C connected
in parallel to it). The resonance frequency ( ) of this tuned circuit
determines the frequency at which the oscillator will oscillate.
(14.20)
2 LC
In the circuit of Fig. 14.33(b), the tank or tuned circuit is connected
in the collector side. Hence, it is known as tuned collector oscillator. If the
tuned circuit is on the base side, it will be known as tuned base oscillator.
There are many other types of tank circuits (say RC) or feedback circuits
giving different types of oscillators like Colpitts oscillator, Hartley
oscillator, RC-oscillator.
AND
LOGIC GATES
no
501
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
Input
Output
no
(b)
FIGURE 14.35
(a) Logic symbol,
(b) Truth table of
NOT gate.
502
This is the most basic gate, with one input and one output. It produces
a 1 output if the input is 0 and vice-versa. That is, it produces an
inverted version of the input at its output. This is why it is also known
as an inverter. The commonly used symbol together with the truth
table for this gate is given in Fig. 14.35.
(ii) OR Gate
An OR gate has two or more inputs with one output. The logic symbol
and truth table are shown in Fig. 14.36. The output Y is 1 when either
input A or input B or both are 1s, that is, if any of the input is high, the
output is high.
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
Input
Output
tt
o N
be C
E
re R
pu T
bl
is
he
d
(b)
Apart from carrying out the above mathematical logic operation, this
gate can be used for modifying the pulse waveform as explained in the
following example.
Example 14.11 Justify the output waveform (Y) of the OR gate for
the following inputs A and B given in Fig. 14.37.
Solution Note the following:
At t < t1;
A = 0, B = 0;
Hence Y = 0
For t 1 to t2; A = 1, B = 0;
Hence Y = 1
For t 2 to t3; A = 1, B = 1;
Hence Y = 1
For t 3 to t4; A = 0, B = 1;
Hence Y = 1
For t 4 to t5; A = 0, B = 0;
Hence Y = 0
For t 5 to t6; A = 1, B = 0;
Hence Y = 1
For t > t6; A = 0, B = 1;
Hence Y = 1
Therefore the waveform Y will be as shown in the Fig. 14.37.
no
FIGURE 14.38 (a) Logic symbol, (b) Truth table of AND gate.
EXAMPLE 14.11
FIGURE 14.37
Input
Output
(b)
503
Physics
Example 14.12 Take A and B input waveforms similar to that in
Example 14.11. Sketch the output waveform obtained from AND gate.
Hence Y = 0
Hence Y = 0
Hence Y = 1
Hence Y = 0
Hence Y = 0
Hence Y = 0
Hence Y = 0
can be drawn
EXAMPLE 14.12
tt
o N
be C
E
re R
pu T
bl
is
he
d
Solution
For t t 1;
A = 0, B = 0;
For t1 to t2;
A = 1, B = 0;
For t2 to t3;
A = 1, B = 1;
For t3 to t4;
A = 0, B = 1;
For t4 to t5;
A = 0, B = 0;
For t5 to t6;
A = 1, B = 0;
For t > t 6;
A = 0, B = 1;
Based on the above, the output waveform for AND gate
as given below.
FIGURE 14.39
This is an AND gate followed by a NOT gate. If inputs A and B are both
1, the output Y is not 1. The gate gets its name from this NOT AND
behaviour. Figure 14.40 shows the symbol and truth table of NAND gate.
NAND gates are also called Universal Gates since by using these
gates you can realise other basic gates like OR, AND and NOT (Exercises
14.16 and 14.17).
Input
Output
A
(b)
504
EXAMPLE 14.13
no
FIGURE 14.40 (a) Logic symbol, (b) Truth table of NAND gate.
Example 14.13 Sketch the output Y from a NAND gate having inputs
A and B given below:
Solution
For t < t 1;
For t 1 to t2;
For t 2 to t3;
For t 3 to t4;
A
A
A
A
=
=
=
=
1,
0,
0,
1,
B
B
B
B
=
=
=
=
1;
0;
1;
0;
Hence
Hence
Hence
Hence
Y
Y
Y
Y
=
=
=
=
0
1
1
1
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
A = 1, B = 1;
A = 0, B = 0;
A = 0, B = 1;
Hence Y = 0
Hence Y = 1
Hence Y = 1
EXAMPLE 14.13
tt
o N
be C
E
re R
pu T
bl
is
he
d
For t4 to t5;
For t5 to t6;
For t > t 6;
FIGURE 14.41
(b)
FIGURE 14.42 (a) Logic symbol, (b) Truth table of NOR gate.
NOR gates are considered as universal gates because you can obtain
all the gates like AND, OR, NOT by using only NOR gates (Exercises 14.18
and 14.19).
no
505
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
FASTER
no
The Integrated Chip (IC) is at the heart of all computer systems. In fact ICs are found in
almost all electrical devices like cars, televisions, CD players, cell phones etc. The
miniaturisation that made the modern personal computer possible could never have
happened without the IC. ICs are electronic devices that contain many transistors, resistors,
capacitors, connecting wires all in one package. You must have heard of the
microprocessor. The microprocessor is an IC that processes all information in a computer,
like keeping track of what keys are pressed, running programmes, games etc. The IC was
first invented by Jack Kilky at Texas Instruments in 1958 and he was awarded Nobel Prize
for this in 2000. ICs are produced on a piece of semiconductor crystal (or chip) by a process
called photolithography. Thus, the entire Information Technology (IT) industry hinges on
semiconductors. Over the years, the complexity of ICs has increased while the size of its
features continued to shrink. In the past five decades, a dramatic miniaturisation in
computer technology has made modern day computers faster and smaller. In the 1970s,
Gordon Moore, co-founder of INTEL, pointed out that the memory capacity of a chip (IC)
approximately doubled every one and a half years. This is popularly known as Moores
law. The number of transistors per chip has risen exponentially and each year computers
are becoming more powerful, yet cheaper than the year before. It is intimated from current
trends that the computers available in 2020 will operate at 40 GHz (40,000 MHz) and
would be much smaller, more efficient and less expensive than present day computers.
The explosive growth in the semiconductor industry and computer technology is best
expressed by a famous quote from Gordon Moore: If the auto industry advanced as rapidly
as the semiconductor industry, a Rolls Royce would get half a million miles per gallon, and
it would be cheaper to throw it away than to park it.
506
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
SUMMARY
Semiconductors are the basic materials used in the present solid state
electronic devices like diode, transistor, ICs, etc.
2.
tt
o N
be C
E
re R
pu T
bl
is
he
d
1.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
no
13.
14.
15.
507
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
16. Zener diode is one such special purpose diode. In reverse bias, after a
certain voltage, the current suddenly increases (breakdown voltage) in
a Zener diode. This property has been used to obtain voltage regulation.
17. p-n junctions have also been used to obtain many photonic or
optoelectronic devices where one of the participating entity is photon:
(a) Photodiodes in which photon excitation results in a change of reverse
saturation current which helps us to measure light intensity; (b) Solar
cells which convert photon energy into electricity; (c) Light Emitting
Diode and Diode Laser in which electron excitation by a bias voltage
results in the generation of light.
18. Transistor is an n-p-n or p-n-p junction device. The central block
(thin and lightly doped) is called Base while the other electrodes are
Emitter and Collectors. The emitter -base junction is forwar d biased
while collector -base junction is r everse biased.
19. The transistors can be connected in such a manner that either C or E
or B is common to both the input and output. This gives the three
configurations in which a transistor is used: Common Emitter (CE),
Common Collector (CC) and Common Base (CB). The plot between IC
and VCE for fixed IB is called output characteristics while the plot between
IB and VBE with fixed VCE is called input characteristics. The important
transistor parameters for CE-configuration are:
V BE
I B V
CE
V CE
output resistance, ro =
IC I
input resistance, ri =
IC
IB
VCE
vo
R
= C , where RC and RB are respectively
vi
RB
no
508
POINTS TO PONDER
1.
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
In elemental semiconductors (Si or Ge), the n-type or p-type
semiconductors are obtained by introducing dopants as defects. In
compound semiconductors, the change in relative stoichiometric ratio
can also change the type of semiconductor. For example, in ideal GaAs
the ratio of Ga:As is 1:1 but in Ga-rich or As-rich GaAs it could
respectively be Ga1.1 As0.9 or Ga 0.9 As1.1. In general, the presence of
defects control the properties of semiconductors in many ways.
In transistors, the base region is both narrow and lightly doped,
otherwise the electrons or holes coming from the input side (say, emitter
in CE-configuration) will not be able to r each the collector.
tt
o N
be C
E
re R
pu T
bl
is
he
d
2.
3.
4.
5.
EXERCISES
14.1
14.2
no
14.3
14.4
509
Physics
When a forward bias is applied to a p-n junction, it
(a) raises the potential barrier.
(b) reduces the majority carrier current to zero.
(c) lowers the potential barrier.
(d) None of the above.
14.6 For transistor action, which of the following statements are correct:
(a) Base, emitter and collector regions should have similar size and
doping concentrations.
(b) The base region must be very thin and lightly doped.
(c) The emitter junction is forward biased and collector junction is
reverse biased.
(d) Both the emitter junction as well as the collector junction are
forward biased.
14.7 For a transistor amplifier, the voltage gain
(a) remains constant for all frequencies.
(b) is high at high and low frequencies and constant in the middle
frequency range.
(c) is low at high and low frequencies and constant at mid
frequencies.
(d) None of the above.
14.8 In half-wave rectification, what is the output frequency if the input
frequency is 50 Hz. What is the output frequency of a full-wave rectifier
for the same input frequency.
14.9 For a CE-transistor amplifier, the audio signal voltage across the
collected resistance of 2 k is 2 V. Suppose the current amplification
factor of the transistor is 100, find the input signal voltage and base
current, if the base r esistance is 1 k.
14.10 Two amplifiers are connected one after the other in series (cascaded).
The first amplifier has a voltage gain of 10 and the second has a
voltage gain of 20. If the input signal is 0.01 volt, calculate the output
ac signal.
14.11 A p-n photodiode is fabricated from a semiconductor with band gap
of 2.8 eV. Can it detect a wavelength of 6000 nm?
tt
o N
be C
E
re R
pu T
bl
is
he
d
14.5
no
ADDITIONAL EXERCISES
510
Eg
2k B T
where n0 is a constant.
Semiconductor Electronics:
Materials, Devices and
Simple Circuits
14.14 In a p-n junction diode, the current I can be expressed as
I = I 0 exp
eV
1
2k B T
tt
o N
be C
E
re R
pu T
bl
is
he
d
FIGURE 14.44
14.16 Write the truth table for a NAND gate connected as given in
Fig. 14.45.
FIGURE 14.45
Hence identify the exact logic operation carried out by this circuit.
no
14.17 You are given two circuits as shown in Fig. 14.46, which consist
of NAND gates. Identify the logic operation carried out by the two
circuits.
FIGURE 14.46
14.18 Write the truth table for circuit given in Fig. 14.47 below consisting
of NOR gates and identify the logic operation (OR, AND, NOT) which
this circuit is performing.
511
Physics
tt
o N
be C
E
re R
pu T
bl
is
he
d
FIGURE 14.47
14.19 Write the truth table for the cir cuits given in Fig. 14.48 consisting of
NOR gates only. Identify the logic operations (OR, AND, NOT) performed
by the two circuits.
no
FIGURE 14.48
512
408
PHYSICS
no N
C
tt E
o R
be T
re
pu
is
bl
CHAPTER 9
he
d
SUPPLEMENTARY MATERIAL
W=
0
l
YAl
YA l 2
dl =
2
L
L
2
1
l
W = Y AL
2
L
=
1
Youngs modulus strain2
2
volume of the wire
1
stress strain volume of the
2
wire
(A1)
SUPPLEMENTARY MATERIAL
vc = Re /( d).
CHAPTER 11
11.9.4 Blackbody Radiation
no N
C
tt E
o R
be T
re
pu
he
d
is
CRITICAL VELOCITY
bl
CHAPTER 10
409
H = AT 4
(A2)
410
PHYSICS
(A4)
no N
C
tt E
o R
be T
re
pu
he
d
(A3)
is
H = AeT 4
bl
SUPPLEMENTARY MATERIAL
411
no N
C
tt E
o R
be T
re
pu
bl
is
he
d
NOTES
he
d
is
bl
no N
C
tt E
o R
be T
re
pu
NOTES
CHAPTER SIX
WORK, ENERGY
AND
POWER
6.1 INTRODUCTION
6.1 Introduction
6.2 Notions of work and kinetic
energy : The work-energy
theorem
6.3 Work
6.12 Collisions
Summary
Points to ponder
Exer cises
Additional exercises
Appendix 6.1
2015-16(20/01/2015)
A dot B) is defined as
A. B = A B cos
115
(6.1a)
A = Ax $i + Ay $j + Az k$
A. ( B) = (A .B)
$
B = B x $i + By $j + Bz k
their scalar product is
A .B
$i $j = $j k$ = k
$ $i = 0
Given two vectors
Ay j
. B i
Az k
x
B yj
B z k
= Ax Bx + Ay B y + Az B z
(6.1b)
From the definition of scalar product and
(Eq. 6.1b) we have :
( i ) A A Ax Ax A y Ay A z Az
A 2 = A 2x + Ay2 + A 2z
(6.1c)
2
.
since A A = |A ||A| cos 0 = A .
(ii)
A .B = 0, if A and B are perpendicular.
Or,
Hence F.d
Now F.F
= F 2 Fx2 Fy2
= 9 + 16 + 25
= 50 unit
and d.d
= d 2 = d2x
Ax i
d2y
Fz2
d2z
= 25 + 16 + 9
= 50 unit
cos
16
16
=
= 0 .3 2 ,
50 5 0 50
= cos1 0.32
Fig. 6.1 (a) The scalar product of two vectors A and B is a scalar : A. B = A B cos . (b) B cos is the projection
of B onto A. (c) A cos is the projection of A onto B.
2015-16(20/01/2015)
116
PHYSICS
1
mu 2
2
mas
Fs
(6.2a)
K
1
10-3 50 50
2
= 1.25 J
where we have assumed that the drop is initially
at rest.
Assuming that g is a constant with a value
10 m/s2 , the work done by the gravitational force
is,
Wg = mgh
= 10 -3 10 103
= 10.0 J
(b) From the work-energy theorem
K
W g Wr
where W r is the work done by the resistive force
on the raindrop. Thus
Wr = K W g
= 1.25 10
= 8.75 J
is negative.
6.3 WORK
(6.3)
Kf Ki = W
1
m v2
2
2015-16(20/01/2015)
117
m v. v
mv 2
(6.5)
2015-16(20/01/2015)
118
PHYSICS
F (x )x
W =
2
mv f
vf =
(6.6)
xi
lim xf
lim
F (x ) x
x 0
xi
xf
F x dx
(6.7)
xi
= 100 J
2 100 J
0.05 kg
= 63.2 m s1
Fig. 6.3(a)
2015-16(20/01/2015)
119
d 1
m v2
dt 2
dv
v
dt
= F v (from Newtons Second Law)
m
dx
dt
Thus
dK = Fdx
Integrating from the initial position (x i ) to final
position ( x f ), we have
Answer
Kf
xf
dK
Fdx
Ki
xi
1
(100 + 50) 10
2
or
Kf
F dx
Ki
(6.8a)
xi
(6.8b)
2015-16(20/01/2015)
120
PHYSICS
k
for 0.1 < x < 2.01 m
x
= 0 for x < 0.1m and x > 2.01 m
where k = 0.5 J. What is the final kinetic
energy and speed vf of the block as it
crosses this patch ?
Fr =
Kf
Ki
0.1
1
mv 2i
2
1
mv 2
2 i
k
dx
x
k ln x
2.01
0.1
d
V(h)
mg
dh
The negative sign indicates that the
gravitational force is downward. When released,
the ball comes down with an increasing speed.
Just before it hits the ground, its speed is given
by the kinematic relation,
v2 = 2gh
This equation can be written as
F
k ln 2.01/0.1
= 2 0.5 ln (20.1)
= 2 1.5 = 0.5 J
v f = 2K f /m = 1 m s1
1
m v2 = m g h
2
which shows that the gravitational potential
energy of the object at height h, when the object
is released, manifests itself as kinetic energy of
the object on reaching the ground.
Physically, the notion of potential energy is
applicable only to the class of forces where work
done against the force gets stored up as energy.
When external constraints are removed, it
manifests itself as kinetic energy. Mathematically,
(for simplicity, in one dimension) the potential
2015-16(20/01/2015)
121
dV
dx
F x
This implies that
xf
Vf
F(x)dx
xi
dV
Vi
Vf
Vi
2015-16(20/01/2015)
122
PHYSICS
EH = mgH
Eh
E0
1
mvh2
2
= (1/2) mvf2
mgh
(6.11 b)
(6.11 c)
E=
2 gH
1
2
mv02
T A mg =
mv02
L
(6.12)
[Newtons Second Law]
1
E = mvc2 + 2mgL
2
mg =
mvc2
L
(6.13)
(6.14)
or,
5
m 2
mgL
v
2
2 0
v0
5 gL
gL
At B, the energy is
1
mv 2B mgL
2
Equating this to the energy at A and employing
E
Fig. 6.6
mgL
1
mv 02
2
5
m gL
2
2015-16(20/01/2015)
123
v B = 3gL
W =+
k x 2m
2
(6.16)
xm
Fs dx
Ws
k x 2m
2
kx dx
0
(6.15)
2015-16(20/01/2015)
124
PHYSICS
k x dx
Ws
xi
k x 2i
k x 2f
(6.17)
k x 2i
2
k x 2i
2
=0
(6.18)
The work done by the spring force in a cyclic
process is zero. We have explicitly demonstrated
that the spring force (i) is position dependent
only as first stated by Hooke, (Fs = kx); (ii)
does work which only depends on the initial and
final positions, e.g. Eq. (6.17). Thus, the spring
force is a conservative force.
We define the potential energy V(x) of the spring
to be zero when block and spring system is in the
equilibrium position. For an extension (or
compression) x the above analysis suggests that
kx 2
(6.19)
2
You may easily verify that dV/dx = k x, the
spring force. If the block of mass m in Fig. 6.7 is
extended to xm and released from rest, then its
total mechanical energy at any arbitrary point x,
where x lies between xm and + xm, will be given by
V(x) =
1
1
2 1
k xm
= k x2 + m v2
2
2
2
where we have invoked the conservation of
mechanical energy. This suggests that the speed
and the kinetic energy will be maximum at the
equilibrium position, x = 0, i.e.,
xi
Ws
1
3
10 5 5
2
K = 1.25 104 J
1
1
2
m v 2m = k x m
2
2
where vm is the maximum speed.
k
x
m m
Note that k/m has the dimensions of [T-2] and
our equation is dimensionally correct. The
kinetic energy gets converted to potential energy
or
vm =
2015-16(20/01/2015)
125
= 1.25 104 J
We obtain
xm = 2.00 m
We note that we have idealised the situation.
The spring is considered to be massless. The
surface has been considered to possess
negligible friction.
1
K = Kf K i = 0 m v 2
2
The work done by the net force is
1
kx m2
2
Equating we have
W
1
m v2
2
m g x m
1
2
k xm
2
m g x m
m v2
2 m g x m
mg
m 2 g2
0
m k v2
1/2
k
where we take the positive square root since x m
is positive. Putting in numerical values we
obtain
x m = 1.35 m
which, as expected, is less than the result in
Example 6.8.
If the two forces on the body consist of a
conservative force F c and a non-conservative
force Fnc , the conservation of mechanical energy
formula will have to be modified. By the WE
theorem
(F c+ F nc ) x = K
But
Fc x = V
Hence,
(K + V) = Fnc x
E = F nc x
where E is the total mechanical energy. Over
the path this assumes the form
Ef Ei = W nc
Where W nc is the total work done by the
non-conservative forces over the path. Note that
2015-16(20/01/2015)
126
PHYSICS
2015-16(20/01/2015)
127
235 ,
92 U
is split by a neutron
21
1 .6 10
19
J
J/eV
~ 0 .006 2 eV
2015-16(20/01/2015)
128
PHYSICS
Pav =
dW
dt
F.
(6.21)
dr
dt
= F.v
(6.22)
2015-16(20/01/2015)
129
6.12 COLLISIONS
In physics we study motion (change in position).
At the same time, we try to discover physical
quantities, which do not change in a physical
process. The laws of momentum and energy
conservation are typical examples. In this
section we shall apply these laws to a commonly
encountered phenomena, namely collisions.
Several games such as billiards, marbles or
carrom involve collisions.We shall study the
collision of two masses in an idealised form.
Consider two masses m1 and m2 . The particle
m1 is moving with speed v 1i , the subscript i
implying initial. We can cosider m2 to be at rest.
No loss of generality is involved in making such
a selection. In this situation the mass m1
collides with the stationary mass m2 and this
is depicted in Fig. 6.10.
1 = 2 = 0
m1v 1i = (m1+m2 )vf (momentum conservation)
vf
m1
v
m1 m 2 1i
(6.23)
1
m v2
2 1 1i
1
m 1v12i
2
1
(m
2 1
m 2 )v 2f
1
m 12
v12i
2 m1 m 2
1
m1
m1v12i 1
2
m1 m 2
2015-16(20/01/2015)
130
PHYSICS
1 m1m2
v2
2 m1 m 2 1i
v1 f
2
1 1i
m v
2
1 1f
m v
2
2 2f
or,
v1i )
m1v1 f (v2 f
v12i
v1 f )
(v1i
v 1f )(v1i
v1 f )
v2 f
v1 f
Hence,
v1i
v 1f = 0
v 2f = v 1i
v1 f )
v12f
v 2 f (v1i
m1v1i (v 2 f
(6.27)
2m1v1i
(6.28)
m1 m 2
Thus, the unknowns {v 1f, v2f} are obtained in
terms of the knowns {m1 , m2 , v1i}. Special cases
of our analysis are interesting.
(6.25)
m v
m2)
v
m 2 1i
and v 2 f
(6.24)
(m1
m1
(6.26)
2015-16(20/01/2015)
131
K1i
K1 f
1
m1
m1
2
m1
m2
m2
v12i
K1i
m1
m1
m2
m2
f2 = 1 f1 (elastic collision)
4m1m2
m1
m2
1
m1v1 f 2
2
1
m 2 v2 f 2
2
(6.31)
(6.30)
1
m 1v1 i 2
2
K1 f
(6.29)
v1i = v1f + v 2f
or
v 1i 2
v1 f
v1 f 2
v2 f
v2 f 2
v1 f
v2 f
2v1 f .v 2 f
2015-16(20/01/2015)
132
PHYSICS
v1 f 2
v2 f 2
2v1 f v 2 f cos
37
(6.32)
v1 f 2
v2 f 2
(6.33)
1 + 37 = 90
Thus, 1 = 53
This proves the following result : when two equal
masses undergo a glancing elastic collision with
one of them at rest, after the collision, they will
move at right angles to each other.
SUMMARY
1.
2.
3.
The work-energy theorem states that the change in kinetic energy of a body is the work
done by the net force on the body.
K f - Ki = Wnet
A force is conservative if (i) work done by it on an object is path independent and
depends only on the end points {xi , xj }, or (ii) the work done by the force is zero for an
arbitrary closed path taken by the object such that it returns to its initial position.
For a conservative force in one dimension, we may define a potential energy function V(x)
such that
F (x ) =
dV ( x )
dx
xf
or
Vi V f =
F (x ) d x
xi
4.
5.
6.
The principle of conservation of mechanical energy states that the total mechanical
energy of a body remains constant if the only forces that act on the body are conservative.
The gravitational potential energy of a particle of mass m at a height x about the earths
surface is
V(x) = m g x
where the variation of g with height is ignored.
The elastic potential energy of a spring of force constant k and extension x is
V (x ) =
7.
1
k x2
2
The scalar or dot product of two vectors A and B is written as A.B and is a scalar
quantity given by : A. B = AB cos , where is the angle between A and B. It can be
positive, negative or zero depending upon the value of . The scalar product of two
vectors can be interpreted as the product of magnitude of one vector and component
of the other vector along the first vector. For unit vectors :
i i = j j = k
k
= 1 and i j = j k
=k
i = 0
Scalar products obey the commutative and the distributive laws.
2015-16(20/01/2015)
133
POINTS TO PONDER
1.
2.
3.
The phrase calculate the work done is incomplete. We should refer (or imply
clearly by context) to the work done by a specific force or a group of forces on a
given body over a certain displacement.
Work done is a scalar quantity. It can be positive or negative unlike mass and
kinetic energy which are positive scalar quantities. The work done by the friction
or viscous force on a moving body is negative.
For two bodies, the sum of the mutual forces exerted between them is zero from
Newtons Third Law,
F12 + F21 = 0
But the sum of the work done by the two forces need not always cancel, i.e.
W 12 + W21 0
4.
5.
6.
7.
8.
9.
2015-16(20/01/2015)
134
PHYSICS
EXERCISES
6.1 The sign of work done by a force on a body is important to understand. State carefully
if the following quantities are positive or negative:
(a) work done by a man in lifting a bucket out of a well by means of a rope tied to the
bucket.
(b) work done by gravitational force in the above case,
(c) work done by friction on a body
sliding down an inclined plane,
(d) work done by an applied force on
a body moving on a rough
horizontal plane with uniform
velocity,
(e) work done by the resistive force of
air on a vibrating pendulum in
bringing it to rest.
6.2 A body of mass 2 kg initially at rest
moves under the action of an applied
horizontal force of 7 N on a table with
coefficient of kinetic friction = 0.1.
Compute the
(a) work done by the applied force in
10 s,
(b) work done by friction in 10 s,
(c) work done by the net force on the
body in 10 s,
(d) change in kinetic energy of the
body in 10 s,
and interpret your results.
6.3 Given in Fig. 6.11 are examples of some
potential energy functions in one
dimension. The total energy of the
particle is indicated by a cross on the
ordinate axis. In each case, specify the
regions, if any, in which the particle
cannot be found for the given energy.
Also, indicate the minimum total
energy the particle must have in each
case. Think of simple physical contexts
for which these potential energy shapes
are relevant.
Fig. 6.11
2015-16(20/01/2015)
135
Fig. 6.12
2015-16(20/01/2015)
136
PHYSICS
(c) What are the answers to (a) and (b) for an inelastic collision ?
(d) If the potential energy of two billiard balls depends only on the separation distance
between their centres, is the collision elastic or inelastic ? (Note, we are talking
here of potential energy corresponding to the force during collision, not gravitational
potential energy).
6.9 A body is initially at rest. It undergoes one-dimensional motion with constant
acceleration. The power delivered to it at time t is proportional to
(i) t1/2
(ii) t
(iii) t3/2
(iv) t2
6.10 A body is moving unidirectionally under the influence of a source of constant power.
Its displacement in time t is proportional to
(i) t1/2
(ii) t
(iii) t3/2
(iv) t2
6.11 A body constrained to move along the z-axis of a coordinate system is subject to a
constant force F given by
N
F = i + 2 j + 3 k
6.12
6.13
6.14
6.15
6.16
where i, j, k are unit vectors along the x-, y- and z-axis of the system respectively.
What is the work done by this force in moving the body a distance of 4 m along the
z-axis ?
An electron and a proton are detected in a cosmic ray experiment, the first with kinetic
ener gy 10 keV, and the second with 100 keV. Which is faster, the electron or the
proton ? Obtain the ratio of their speeds. (electron mass = 9.11 10 -31 kg, proton mass
= 1.67 1027 kg, 1 eV = 1.60 1019 J).
A rain drop of radius 2 mm falls from a height of 500 m above the ground. It falls with
decreasing acceleration (due to viscous resistance of the air) until at half its original
height, it attains its maximum (terminal) speed, and moves with uniform speed
ther eafter. What is the work done by the gravitational force on the dr op in the first
and second half of its journey ? What is the work done by the resistive force in the
entire journey if its speed on reaching the ground is 10 m s1 ?
A molecule in a gas container hits a horizontal wall with speed 200 m s1 and angle 30
with the normal, and rebounds with the same speed. Is momentum conserved in the
collision ? Is the collision elastic or inelastic ?
A pump on the ground floor of a building can pump up water to fill a tank of volume 30 m 3
in 15 min. If the tank is 40 m above the ground, and the efficiency of the pump is 30%,
how much electric power is consumed by the pump ?
Two identical ball bearings in contact with each other and resting on a frictionless
table are hit head-on by another ball bearing of the same mass moving initially with a
speed V. If the collision is elastic, which of the following (Fig. 6.14) is a possible result
after collision ?
Fig. 6.14
2015-16(20/01/2015)
137
Fig. 6.16
2015-16(20/01/2015)
138
PHYSICS
6.26 A 1 kg block situated on a rough incline is connected to a spring of spring constant 100
N m1 as shown in Fig. 6.17. The block is released from rest with the spring in the
unstretched position. The block moves 10 cm down the incline before coming to rest.
Find the coefficient of friction between the block and the incline. Assume that the
spring has a negligible mass and the pulley is frictionless.
Fig. 6.17
6.27 A bolt of mass 0.3 kg falls from the ceiling of an elevator moving down with an uniform
speed of 7 m s1. It hits the floor of the elevator (length of the elevator = 3 m) and does
not rebound. What is the heat produced by the impact ? Would your answer be dif ferent
if the elevator were stationary ?
6.28 A trolley of mass 200 kg moves with a uniform speed of 36 km/h on a frictionless track.
A child of mass 20 kg runs on the trolley from one end to the other (10 m away) with a
speed of 4 m s1 relative to the trolley in a direction opposite to the its motion, and
jumps out of the trolley. What is the final speed of the trolley ? How much has the
trolley moved from the time the child begins to run ?
6.29 Which of the following potential energy curves in Fig. 6.18 cannot possibly describe the
elastic collision of two billiard balls ? Here r is the distance between centres of the balls.
Fig. 6.18
6.30 Consider the decay of a free neutron at rest : n g p + e
2015-16(20/01/2015)
139
Show that the two-body decay of this type must necessarily give an electron of fixed
energy and, therefore, cannot account for the observed continuous energy distribution
in the -decay of a neutron or a nucleus (Fig. 6.19).
Fig. 6.19
[Note: The simple result of this exercise was one among the several arguments advanced by W.
Pauli to predict the existence of a third particle in the decay products of -decay. This
particle is known as neutrino. We now know that it is a particle of intrinsic spin (like
e, p or n), but is neutral, and either massless or having an extremely small mass
(compared to the mass of electron) and which interacts very weakly with matter. The
correct decay process of neutron is : n g p + e + ]
2015-16(20/01/2015)
140
PHYSICS
Fig. 6.20 An illustration of a single stride in walking. While the first leg is maximally off the round, the second leg
is on the ground and vice-versa
The work done by one leg in each stride is m l v 20 by the work-energy theorem. Here ml is the mass of the leg.
Note m l v 2
0 /2 energy is expended by one set of leg muscles to bring the foot from rest to speed v0 while an
additional m l v 20 /2 is expended by a complementary set of leg muscles to bring the foot to rest from speed v0.
Hence work done by both legs in one stride is (study Fig. 6.20 carefully)
W s =2ml v 20
(6.34)
Assuming ml = 10 kg and slow running of a nine-minute mile which translates to 3 m s-1 in SI units, we obtain
W s = 180 J / stride
If we take a stride to be 2 m long, the person covers 1.5 strides per second at his speed of 3 m s-1 . Thus the
power expended
J
stride
1.5
stride
second
= 270 W
P =180
We must bear in mind that this is a lower estimate since several avenues of power loss (e.g. swinging of hands,
air resistance etc.) have been ignored. The interesting point is that we did not worry about the forces involved.
The forces, mainly friction and those exerted on the leg by the muscles of the rest of the body, are hard to
estimate. Static friction does no work and we bypassed the impossible task of estimating the work done by the
muscles by taking recourse to the work-ener gy theorem. We can also see the advantage of a wheel. The wheel
permits smooth locomotion without the continual starting and stopping in mammalian locomotion.
2015-16(20/01/2015)
CHAPTER ELEVEN
OF
MATTER
he
d
THERMAL PROPERTIES
11.1 INTRODUCTION
temperature
is
no N
C
tt E
o R
be T
re
pu
absolute temperature
Thermal expansion
Specific heat capacity
Calorimetry
Change of state
Heat transfer
11.10 Newtons law of cooling
bl
11.1 Introduction
11.2 Temperature and heat
11.3 Measurement of
11.5
11.6
11.7
11.8
11.9
Summary
Points to ponder
Exercises
275
is
t F 32 t C
=
180
100
no N
C
tt E
o R
be T
re
pu
bl
he
d
(11.1)
276
he
d
no N
C
tt E
o R
be T
re
pu
PV
= R
T
or PV = RT
(11.2)
where, is the number of moles in the sample
of gas and R is called universal gas constant:
R = 8.31 J mol1 K1
In Eq. 11.2, we have learnt that the pressure
and volume are directly proportional to
temperature : PV T. This relationship allows a
gas to be used to measure temperature in a
constant volume gas thermometer. Holding the
volume of a gas constant, it gives P T. Thus,
with a constant-volume gas thermometer,
temperature is read in terms of pressure. A plot
of pressure versus temperature gives a straight
line in this case, as shown in Fig. 11.2.
However, measurements on real gases deviate
from the values predicted by the ideal gas law
at low temperature. But the relationship is linear
over a large temperature range, and it looks as
though the pressure might reach zero with
decreasing temperature if the gas continued to
be a gas. The absolute minimum temperature
for an ideal gas, therefore, inferred by
extrapolating the straight line to the axis, as in
Fig. 11.3. This temperature is found to be
273.15 C and is designated as absolute zero.
Absolute zero is the foundation of the Kelvin
temperature scale or absolute scale temperature
is
bl
PHYSICS
277
l (105 K1)
Aluminium
Brass
Iron
Copper
Silver
Gold
Glass (pyrex)
Lead
2.5
1.8
1.2
1.7
1.9
1.4
0.32
0.29
he
d
Materials
V
, of a substance for temperature
V
change T and define the coefficient of volume
in volume,
as
bl
expansion,
is
no N
C
tt E
o R
be T
re
pu
V 1
V =
(11.5)
V T
Here V is also a characteristic of the
substance but is not strictly a constant. It
depends in general on temperature (Fig 11.6). It
is seen that V becomes constant only at a high
temperature.
al T
2a l T
3a l T
l
= 1 T
l
(11.4)
278
PHYSICS
v ( K1)
Aluminium
Brass
Iron
Paraffin
Glass (ordinary)
Glass (pyrex)
Hard rubber
Invar
Mercurry
Water
Alcohol (ethyl)
7 10 5
6 10 5
3.55 10 5
58.8 10 5
2.5 10 5
1 10 5
2.4 10 4
2 10 6
18.2 10 5
20.7 10 5
110 10 5
is
Materials
he
d
V T
=
V
T
no N
C
tt E
o R
be T
re
pu
bl
Table 11.2
Temperature (C)
(a)
Fig. 11.7
1
for ideal gas
(11.6)
T
At 0 C, v = 3.7 103 K1, which is much
larger than that for solids and liquids.
Equation (11.6) shows the temperature
dependence of v; it decreases with increasing
temperature. For a gas at room temperature and
constant pressure v is about 3300 106 K1, as
i.e. v =
Temperature (C)
(b)
279
Al = a (b)
b
b
a
A2 = b (a)
Fig. 11.8
(11.8)
(11.9)
no N
C
tt E
o R
be T
re
pu
is
v = 3l
A3 = (a) (b)
bl
3V l
= 3V l T
l
which gives
V =
Answer
he
d
F
l
= Ysteel = 2.4
10 7 N m 2 , which
A
l
corresponds to an external force of
l
= 2.4 107 40 104 j 105N.
l
F = AYsteel
l
strain is
= l(steel) T = 1.2 105 10=1.2 104.
l
Youngs modulus of steel is Y (steel) = 2 1011 N m2.
Therefore, the thermal stress developed is
A 1
2l
A T
Answer
Given,
T1 = 27 C
LT1 = 5.231 m
LT2 = 5.243 m
So,
280
PHYSICS
S 1 Q
=
(11.11)
m m T
The specific heat capacity is the property of
the substance which determines the change in
the temperature of the substance (undergoing
no phase change) when a given quantity of heat
is absorbed (or rejected) by it. It is defined as
the amount of heat per unit mass absorbed or
rejected by the substance to change its
temperature by one unit. It depends on the
nature of the substance and its temperature.
The SI unit of specific heat capacity is J kg1 K1.
If the amount of substance is specified in
terms of moles , instead of mass m in kg, we
can define heat capacity per mole of the
substance by
is
he
d
s=
no N
C
tt E
o R
be T
re
pu
bl
Q
(11.10)
T
where Q is the amount of heat supplied to
the substance to change its temperature from T
to T + T.
You have observed that if equal amount of
heat is added to equal masses of different
substances, the resulting temperature changes
will not be the same. It implies that every
substance has a unique value for the amount of
S=
S 1 Q
=
(11.12)
T
where C is known as molar specific heat
capacity of the substance. Like S, C also
depends on the nature of the substance and its
temperature. The SI unit of molar specific heat
capacity is J mol1 K1.
However, in connection with specific heat
capacity of gases, additional conditions may be
needed to define C. In this case, heat transfer
can be achieved by keeping either pressure or
volume constant. If the gas is held under
constant pressure during the heat transfer, then
it is called the molar specific heat capacity at
constant pressure and is denoted by Cp. On
the other hand, if the volume of the gas is
maintained during the heat transfer, then the
corresponding molar specific heat capacity is
called molar specific heat capacity at constant
volume and is denoted by Cv. For details see
Chapter 12. Table 11.3 lists measured specific
heat capacity of some substances at atmospheric
pressure and ordinary temperature while Table
11.4 lists molar specific heat capacities of some
gases. From Table 11.3 you can note that water
C=
Ice
Glass
Iron
Kerosene
Edible oil
Mercury
no N
C
tt E
o R
be T
re
pu
2060
840
450
2118
1965
140
he
d
Aluminium
Carbon
Copper
Lead
Silver
Tungesten
Water
Substance
is
Substance
bl
Table 11.3
281
Gas
Cp (J mol1K1)
Cv(J mol1K1)
He
20.8
12.5
H2
28.8
20.4
N2
29.1
20.8
O2
29.4
21.1
CO2
37.0
28.5
11.7 CALORIMETRY
is
bl
no N
C
tt E
o R
be T
re
pu
PHYSICS
he
d
282
283
Triple Point
he
d
is
bl
Fig. 11.10
no N
C
tt E
o R
be T
re
pu
The temperature of a substance remains constant during its change of state (phase change).
A graph between the temperature T and the Pressure P of the substance is called a phase
diagram or P T diagram. The following figure shows the phase diagram of water and CO2.
Such a phase diagram divides the P T plane into a solid-region, the vapour-region and the
liquid-region. The regions are separated by the curves such as sublimation curve (BO), fusion
curve (AO) and vaporisation curve (CO). The points on sublimation curve represent states
in which solid and vapour phases coexist. The point on the sublimation curve BO represent
states in which the solid and vapour phases co-exist. Points on the fusion curve AO represent
states in which solid and liquid phase coexist. Points on the vapourisation curve CO represent
states in which the liquid and vapour phases coexist. The temperature and pressure at which
the fusion curve, the vaporisation curve and the sublimation curve meet and all the three
phases of a substance coexist is called the triple point of the substance. For example the
triple point of water is represented by the temperature 273.16 K and pressure 6.11103 Pa.
(a)
(b)
Pressure-temperature phase diagrams for (a) water and (b) CO2 (not to the scale).
284
PHYSICS
he
d
is
no N
C
tt E
o R
be T
re
pu
bl
Temperatures of the change of state and latent heats for various substances at
1 atm pressure
Lf
(105J kg1)
114
1063
328
39
210
219
0
1.0
0.645
0.25
0.12
0.26
0.14
3.33
78
2660
1744
357
196
183
100
Lv
(105J kg1)
8.5
15.8
8.67
2.7
2.0
2.1
22.6
no N
C
tt E
o R
be T
re
pu
Boiling
Point (C)
he
d
Ethyl alcohol
Gold
Lead
Mercury
Nitrogen
Oxygen
Water
Melting
Point (C)
is
Substance
bl
Table 11.5
285
Answer
Heat lost by water = msw (fi)w
= (0.30 kg ) (4186 J kg1 K1) (50.0 C 6.7 C)
= 54376.14 J
Heat required to melt ice = m2Lf = (0.15 kg) Lf
Heat required to raise temperature of ice
water to final temperature = mIsw (fi)I
= (0.15 kg) (4186 J kg1 K 1) (6.7 C 0 C)
= 4206.93 J
Heat lost = heat gained
54376.14 J = (0.15 kg ) Lf + 4206.93 J
Lf = 3.34105 J kg1.
W
286
PHYSICS
he
d
Heating by conduction, convection and
radiation.
11.9.1 Conduction
no N
C
tt E
o R
be T
re
pu
Now,
Fig. 11.13
is
Answer We have
Mass of the ice, m = 3 kg
specific heat capacity of ice, sice
= 2100 J kg1 K1
specific heat capacity of water, swater
= 4186 J kg1 K1
latent heat of fusion of ice, Lf ice
= 3.35 105 J kg1
latent heat of steam, Lsteam
= 2.256 106 J kg1
bl
Q1
Q2
Q3
Q4
So,
287
Materials
Thermal conductivity
(J s1 m1 K1 )
Silver
Copper
Aluminium
Brass
Steel
Lead
Mercury
bl
H = KA
406
385
205
109
50.2
34.7
8.3
is
Metals
he
d
no N
C
tt E
o R
be T
re
pu
Non-metals
Insulating brick
Concrete
Body fat
Felt
Glass
Ice
Glass wool
Wood
Water
0.15
0.8
0.20
0.04
0.8
1.6
0.04
0.12
0.8
Gases
Air
Argon
Hydrogen
0.024
0.016
0.14
288
PHYSICS
K1 A1 300
K 2 T2
K1
K2
L2
15
385 T
10
Answer
he
d
K1T1
T0 =
K1 A T1 T0
K 2 A(T0 T2 )
L
K 2 A 2 ( T0 T2 )
L2
H=
K 2 A2 T 0
L1
no N
C
tt E
o R
be T
re
pu
L1
K 1 A1 T1 T0
is
bl
Fig. 11.15
A T1 T0
K1 K 2
K1 K 2
L
A T1 T2
L
1
K1
1
K2
K A T1 T2
H
K
2L
2 K1 K 2
K1 K 2
K1T1
(i) T0
K1
K 2 T2
K2
79 W m 1K 1 373 K
1
79 W m K
109 W m 1K 1 273 K
109 W m 1K 1
= 315 K
2 K1 K 2
(ii) K = K
K2
1
Fig 11.16
2 (79 W m 1 K 1 ) (109 W m 1 K 1 )
79 W m 1 K 1 +109 W m 1 K 1
= 91.6 W m1 K1
= 916.1 W
11.9.2 Convection
no N
C
tt E
o R
be T
re
pu
he
d
2L
is
K A T1 T2
bl
(iii) H
289
Fig. 11.17
Convection cycles.
290
he
d
is
no N
C
tt E
o R
be T
re
pu
bl
11.9.3 Radiation
PHYSICS
291
bl
is
he
d
no N
C
tt E
o R
be T
re
pu
(11.15)
dt
where k is a positive constant depending upon
the area and nature of the surface of the body.
Suppose a body of mass m and specific heat
capacity s is at temperature T2. Let T1 be the
temperature of the surroundings. If the
temperature falls by a small amount dT2 in time
dt, then the amount of heat lost is
dQ = ms dT2
Rate of loss of heat is given by
dQ
dT
= ms 2
(11.16)
dt
dt
From Eqs. (11.15) and (11.16) we have
dT2
= k (T2 T1 )
dt
dT2
k
dt = K dt
=
T2 T1
ms
where K = k/m s
On integrating,
log e (T2 T1) = K t + c
or T2 = T1 + C e Kt; where C = e c
m s
(11.17)
(11.18)
(11.19)
K T
292
PHYSICS
8 C
= K ( 70 C)
2 min
2 C
= K (50 C)
Time
he
d
= 42 s
SUMMARY
Heat is a form of energy that flows between a body and its surrounding medium by
virtue of temperature difference between them. The degree of hotness of the body is
quantitatively represented by temperature.
2.
3.
The Celsius temperature (tC) and the Farenheit temperare (tF)are related by
bl
is
1.
tF = (9/5) tC + 32
The ideal gas equation connecting pressure (P), volume (V) and absolute temperature
(T) is :
no N
C
tt E
o R
be T
re
pu
4.
PV = RT
5.
In the absolute temperature scale, the zero of the scale is the absolute zero of temperature
the temperature where every substance in nature has the least possible molecular
activity. The Kelvin absolute temperature scale (T ) has the same unit size as the Celsius
scale (Tc ), but differs in the origin :
TC
6.
= T 273.15
The coefficient of linear expansion (l ) and volume expansion (v ) are defined by the
relations :
V
V
where l and V denote the change in length l and volume V for a change of temperature
T. The relation between them is :
v = 3 l
7.
s=
1 Q
m T
293
where m is the mass of the substance and Q is the heat required to change its
temperature by T. The molar specific heat capacity of a substance is defined by
1 Q
T
9.
The three modes of heat transfer are conduction, convection and radiation.
he
d
8.
T T
H=K A
bl
is
no N
C
tt E
o R
be T
re
pu
11. Newtons Law of Cooling says that the rate of cooling of a body is proportional to the
excess temperature of the body over the surroundings :
dQ
= k (T2 T1 )
dt
294
PHYSICS
POINTS TO PONDER
1. The relation connecting Kelvin temperature (T ) and the Celsius temperature tc
T = tc + 273.15
and the assignment T = 273.16 K for the triple point of water are exact relations (by
choice). With this choice, the Celsius temperature of the melting point of water and
boiling point of water (both at 1 atm pressure) are very close to, but not exactly equal
to 0 C and 100 C respectively. In the original Celsius scale, these latter fixed points
were exactly at 0 C and 100 C (by choice), but now the triple point of water is the
preferred choice for the fixed point, because it has a unique temperature.
A liquid in equilibrium with vapour has the same pressure and temperature throughout
the system; the two phases in equilibrium differ in their molar volume (i.e. density).
This is true for a system with any number of phases in equilibrium.
3.
Heat transfer always involves temperature difference between two systems or two parts
of the same system. Any energy transfer that does not involve temperature difference
in some way is not heat.
4.
Convection involves flow of matter within a fluid due to unequal temperatures of its
parts. A hot bar placed under a running tap loses heat by conduction between the
surface of the bar and water and not by convection within water.
EXERCISES
bl
is
he
d
2.
no N
C
tt E
o R
be T
re
pu
11.1 The triple points of neon and carbon dioxide are 24.57 K and 216.55 K respectively.
Express these temperatures on the Celsius and Fahrenheit scales.
11.2 Two absolute scales A and B have triple points of water defined to be 200 A and
350 B. What is the relation between TA and TB ?
11.3 The electrical resistance in ohms of a certain thermometer varies with temperature
according to the approximate law :
R = Ro [1 + (T To )]
The resistance is 101.6 at the triple-point of water 273.16 K, and 165.5 at the
normal melting point of lead (600.5 K). What is the temperature when the resistance
is 123.4 ?
(b)
There were two fixed points in the original Celsius scale as mentioned above
which were assigned the number 0 C and 100 C respectively. On the absolute
scale, one of the fixed points is the triple-point of water, which on the Kelvin
absolute scale is assigned the number 273.16 K. What is the other fixed point
on this (Kelvin) scale ?
(c)
(d)
295
11.5 Two ideal gas thermometers A and B use oxygen and hydrogen respectively. The
following observations are made :
Temperature
Pressure
thermometer A
Pressure
thermometer B
Triple-point of water
1.250
105 Pa
0.200
105 Pa
1.797
105 Pa
0.287
105 Pa
(b)
What do you think is the reason behind the slight difference in answers of
thermometers A and B ? (The thermometers are not faulty). What further
procedure is needed in the experiment to reduce the discrepancy between the
two readings ?
he
d
(a)
11.7
no N
C
tt E
o R
be T
re
pu
bl
is
11.6
11.8
A hole is drilled in a copper sheet. The diameter of the hole is 4.24 cm at 27.0 C.
What is the change in the diameter of the hole when the sheet is heated to 227 C?
Coefficient of linear expansion of copper = 1.70 105 K1.
11.9
A brass wire 1.8 m long at 27 C is held taut with little tension between two rigid
supports. If the wire is cooled to a temperature of 39 C, what is the tension
developed in the wire, if its diameter is 2.0 mm ? Co-efficient of linear expansion
of brass = 2.0 105 K1; Youngs modulus of brass = 0.91 1011 Pa.
11.10
A brass rod of length 50 cm and diameter 3.0 mm is joined to a steel rod of the
same length and diameter. What is the change in length of the combined rod at
250 C, if the original lengths are at 40.0 C? Is there a thermal stress developed
at the junction ? The ends of the rod are free to expand (Co-efficient of linear
expansion of brass = 2.0 105 K1, steel = 1.2 105 K1 ).
11.11
The coefficient of volume expansion of glycerin is 49 105 K1. What is the fractional
change in its density for a 30 C rise in temperature ?
11.12
11.13
11.14
296
PHYSICS
heat of the metal. If heat losses to the surroundings are not negligible, is your
answer greater or smaller than the actual value for specific heat of the metal ?
Given below are observations on molar specific heats at room temperature of some
common gases.
Gas
Hydrogen
Nitrogen
Oxygen
Nitric oxide
Carbon monoxide
Chlorine
he
d
11.15
Answer the following questions based on the P-T phase diagram of carbon dioxide:
At what temperature and pressure can the solid, liquid and vapour phases of
CO2 co-exist in equilibrium ?
(b)
What is the effect of decrease of pressure on the fusion and boiling point of
CO2 ?
(c)
What are the critical temperature and pressure for CO 2 ? What is their
significance ?
(d)
Is CO2 solid, liquid or gas at (a) 70 C under 1 atm, (b) 60 C under 10 atm,
(c) 15 C under 56 atm ?
bl
(a)
no N
C
tt E
o R
be T
re
pu
11.16
is
The measured molar specific heats of these gases are markedly different from
those for monatomic gases. Typically, molar specific heat of a monatomic gas is
2.92 cal/mol K. Explain this difference. What can you infer from the somewhat
larger (than the rest) value for chlorine ?
11.17
What happens when CO2 at 4 atm pressure is cooled from room temperature
at constant pressure ?
(c)
(d)
11.18
11.19
A thermacole icebox is a cheap and efficient method for storing small quantities
of cooked food in summer in particular. A cubical icebox of side 30 cm has a
thickness of 5.0 cm. If 4.0 kg of ice is put in the box, estimate the amount of ice
remaining after 6 h. The outside temperature is 45 C, and co-efficient of thermal
conductivity of thermacole is 0.01 J s1 m1 K1. [Heat of fusion of water = 335 103
J kg1]
11.20
A brass boiler has a base area of 0.15 m2 and thickness 1.0 cm. It boils water at the
rate of 6.0 kg/min when placed on a gas stove. Estimate the temperature of the part
297
of the flame in contact with the boiler. Thermal conductivity of brass = 109 J s1 m1
K1 ; Heat of vaporisation of water = 2256 103 J kg1.
Explain why :
(a) a body with large reflectivity is a poor emitter
(b) a brass tumbler feels much colder than a wooden tray on a chilly day
(c) an optical pyrometer (for measuring high temperatures) calibrated for an ideal
black body radiation gives too low a value for the temperature of a red hot
iron piece in the open, but gives a correct value for the temperature when the
same piece is in the furnace
(d) the earth without its atmosphere would be inhospitably cold
(e) heating systems based on circulation of steam are more efficient in warming
a building than those based on circulation of hot water
11.22
no N
C
tt E
o R
be T
re
pu
bl
is
he
d
11.21
CHAPTER TWELVE
he
d
THERMODYNAMICS
is
no N
C
tt E
o R
be T
re
pu
12.4
Introduction
Thermal equilibrium
Zeroth law of
Thermodynamics
Heat, internal energy and
work
First law of
thermodynamics
Specific heat capacity
Thermodynamic state
variables and equation of
state
Thermodynamic processes
Heat engines
Refrigerators and heat
pumps
Second law of
thermodynamics
Reversible and irreversible
processes
Carnot engine
bl
12.1
12.2
12.3
12.1 INTRODUCTION
In previous chapter we have studied thermal properties of
matter. In this chapter we shall study laws that govern
thermal energy. We shall study the processes where work is
converted into heat and vice versa. In winter, when we rub
our palms together, we feel warmer; here work done in rubbing
produces the heat. Conversely, in a steam engine, the heat
of the steam is used to do useful work in moving the pistons,
which in turn rotate the wheels of the train.
In physics, we need to define the notions of heat,
temperature, work, etc. more carefully. Historically, it took a
long time to arrive at the proper concept of heat. Before the
modern picture, heat was regarded as a fine invisible fluid
filling in the pores of a substance. On contact between a hot
body and a cold body, the fluid (called caloric) flowed from
the colder to the hotter body ! This is similar to what happens
when a horizontal pipe connects two tanks containing water
up to different heights. The flow continues until the levels of
water in the two tanks are the same. Likewise, in the caloric
picture of heat, heat flows until the caloric levels (i.e., the
temperatures) equalise.
In time, the picture of heat as a fluid was discarded in
favour of the modern concept of heat as a form of energy. An
important experiment in this connection was due to Benjamin
Thomson (also known as Count Rumford) in 1798. He
observed that boring of a brass cannon generated a lot of
heat, indeed enough to boil water. More significantly, the
amount of heat produced depended on the work done (by the
horses employed for turning the drill) but not on the
sharpness of the drill. In the caloric picture, a sharper drill
would scoop out more heat fluid from the pores; but this
was not observed. A most natural explanation of the
observations was that heat was a form of energy and the
experiment demonstrated conversion of energy from one form
to anotherfrom work to heat.
12.5
12.6
12.7
12.8
12.9
12.10
12.11
12.12
12.13
Summary
Points to ponder
Exercises
THERMODYNAMICS
bl
is
he
d
no N
C
tt E
o R
be T
re
pu
299
Thermodynamics may also involve other variables that are not so obvious to our senses e.g. entropy, enthalpy,
etc., and they are all macroscopic variables.
**
Both the variables need not change. It depends on the constraints. For instance, if the gases are in containers
of fixed volume, only the pressures of the gases would change to achieve thermal equilibrium.
300
PHYSICS
he
d
no N
C
tt E
o R
be T
re
pu
is
bl
(a)
(a)
(b)
(b)
THERMODYNAMICS
bl
is
he
d
no N
C
tt E
o R
be T
re
pu
301
302
he
d
is
PHYSICS
bl
Q W = U
(12.2)
no N
C
tt E
o R
be T
re
pu
Q = W
THERMODYNAMICS
303
no N
C
tt E
o R
be T
re
pu
he
d
(12.3)
(12.6)
is
Q = U + P V
1 Q
T
bl
Q
T
(12.4)
S
m
1
m
Q
T
U = 3 kBT
NA = 3 RT
Q
T
Table 12.1
U
T
= 3R
(12.7)
(12.5)
304
PHYSICS
(12.8)
Q = U + P V
he
d
Cp Cv = R
Q
T
U
T
U
T
(12.9)
Q
T
no N
C
tt E
o R
be T
re
pu
Cp
bl
is
U
T
V
T
(12.10)
which gives
P
V
T
(12.11)
THERMODYNAMICS
he
d
305
is
bl
no N
C
tt E
o R
be T
re
pu
As emphasised earlier, Q is not a state variable. However, Q is clearly proportional to the total mass of
system and hence is extensive.
306
PHYSICS
he
d
bl
is
Table 12.2
no N
C
tt E
o R
be T
re
pu
Fig. 12.7
W =
P dV
V1
V2
RT
dV
V
V1
RT In
V2
V1
(12.12)
THERMODYNAMICS
307
Adiabatic process
P dV
V1
constant
V1 V
P 1 V1 = P 2 V2
(12.14)
+1
V2
V1
constant 1
1
(1 )
V 1 V 1
2
P2V2
1
1
V2
P2V2
P1V1
is
W =
(12.15)
V1
P1V1
R(T1 T2 )
bl
dV
he
d
V2
= constant
(12.16)
no N
C
tt E
o R
be T
re
pu
V2
W=
Isochoric process
(12.17)
Fig. 12.8
308
PHYSICS
W = Q1 Q2
Fig. 12.9
(12.19)
Q
=1 2
Q1
he
d
i.e.,
(12.20)
no N
C
tt E
o R
be T
re
pu
is
bl
THERMODYNAMICS
309
Pioneers of Thermodynamics
he
d
no N
C
tt E
o R
be T
re
pu
bl
is
(12.21)
i.e.,
Q2
Q1 Q2
(12.22)
310
bl
is
he
d
no N
C
tt E
o R
be T
re
pu
PHYSICS
Kelvin-Planck statement
No process is possible whose sole result is the
absorption of heat from a reservoir and the
complete conversion of the heat into work.
Clausius statement
THERMODYNAMICS
he
d
is
no N
C
tt E
o R
be T
re
pu
bl
311
312
PHYSICS
(b)
(c)
(12.23)
R T1
3
T2
(12.24)
i.e.
V3
Q2
(12.25)
(12.29)
T1
T2 V4
i.e.
V1
V4
T2
T1 V1
1/
(d)
T1
T2
-1
(12.26)
W4 1
V
2
3
= RT1 ln V RT2 ln V (12.27)
1
4
Q1
T2
T1
Q2
Q1
In V3
V4
(12.28)
V2
In V
1
T2 V3
(12.30)
T1
(12.31)
T1
(Carnot engine)
(12.32)
no N
C
tt E
o R
be T
re
pu
1)
bl
W3
1/(
V
RT2 ln 3
V4
T2
he
d
W1 2 = Q1 = R T1
V2
ln V
1
V2
is
THERMODYNAMICS
313
he
d
Q1 T1
=
Q2 T2
no N
C
tt E
o R
be T
re
pu
W
SUMMARY
1.
2.
3.
(12.33)
bl
is
The zeroth law of thermodynamics states that two systems in thermal equilibrium with a
third system are in thermal equilibrium with each other. The Zeroth Law leads to the
concept of temperature.
Internal energy of a system is the sum of kinetic energies and potential energies of the
molecular constituents of the system. It does not include the over-all kinetic energy of
the system. Heat and work are two modes of energy transfer to the system. Heat is the
energy transfer arising due to temperature difference between the system and the
surroundings. Work is energy transfer brought about by other means, such as moving
the piston of a cylinder containing the gas, by raising or lowering some weight connected
to it.
The first law of thermodynamics is the general law of conservation of energy applied to
any system in which energy transfer from or to the surroundings (through heat and
work) is taken into account. It states that
Q = U + W
where Q is the heat supplied to the system, W is the work done by the system and U
is the change in internal energy of the system.
314
PHYSICS
4.
1 Q
m T
s=
where m is the mass of the substance and Q is the heat required to change its
temperature by T. The molar specific heat capacity of a substance is defined by
1 Q
T
he
d
where is the number of moles of the substance. For a solid, the law of equipartition
of energy gives
C = 3R
which generally agrees with experiment at ordinary temperatures.
Calorie is the old unit of heat. 1 calorie is the amount of heat required to raise the
temperature of 1 g of water from 14.5 C to 15.5 C. 1 cal = 4.186 J.
For an ideal gas, the molar specific heat capacities at constant pressure and volume
satisfy the relation
Cp Cv = R
is
5.
no N
C
tt E
o R
be T
re
pu
6.
bl
7.
A quasi-static process is an infinitely slow process such that the system remains in
thermal and mechanical equilibrium with the surroundings throughout. In a
quasi-static process, the pressure and temperature of the environment can differ from
those of the system only infinitesimally.
8.
9.
RT
V2
ln V
1
= constant
Cp
where
Cv
Work done by an ideal gas in an adiabatic change of state from (P1, V1, T1) to (P2, V2, T2)
is
W
R T1
T2
10. Heat engine is a device in which a system undergoes a cyclic process resulting in
conversion of heat into work. If Q1 is the heat absorbed from the source, Q2 is the heat
released to the sink, and the work output in one cycle is W, the efficiency of the engine
is:
W
Q1
Q2
Q1
THERMODYNAMICS
315
11. In a refrigerator or a heat pump, the system extracts heat Q2 from the cold reservoir and
releases Q1 amount of heat to the hot reservoir, with work W done on the system. The
co-efficient of performance of a refrigerator is given by
Q2
Q2
=
W
Q1 Q2
12. The second law of thermodynamics disallows some processes consistent with the First
Law of Thermodynamics. It states
he
d
Kelvin-Planck statement
No process is possible whose sole result is the absorption of heat from a reservoir and
complete conversion of the heat into work.
Clausius statement
No process is possible whose sole result is the transfer of heat from a colder object to a
hotter object.
is
Put simply, the Second Law implies that no heat engine can have efficiency equal to
1 or no refrigerator can have co-efficient of performance equal to infinity.
bl
13. A process is reversible if it can be reversed such that both the system and the surroundings
return to their original states, with no other change anywhere else in the universe.
Spontaneous processes of nature are irreversible. The idealised reversible process is a
quasi-static process with no dissipative factors such as friction, viscosity, etc.
no N
C
tt E
o R
be T
re
pu
14. Carnot engine is a reversible engine operating between two temperatures T1 (source) and
T2 (sink). The Carnot cycle consists of two isothermal processes connected by two
adiabatic processes. The efficiency of a Carnot engine is given by
=1
T2
T1
(Carnot engine)
No engine operating between two temperatures can have efficiency greater than that of
the Carnot engine.
Quantity
Symbol
Dimensions
Unit
Remark
Co-efficienty of volume
expansion
[K1]
K1
v = 3 1
[ML2 T2]
Q is not a state
variable
Specific heat
[L2 T2 K1]
J kg1 K1
Thermal Conductivity
[MLT3 K1]
J s1 K1
H = KA
dt
dx
316
PHYSICS
POINTS TO PONDER
3.
4.
5.
he
d
2.
Temperature of a body is related to its average internal energy, not to the kinetic energy
of motion of its centre of mass. A bullet fired from a gun is not at a higher temperature
because of its high speed.
Equilibrium in thermodynamics refers to the situation when macroscopic variables
describing the thermodynamic state of a system do not depend on time. Equilibrium of
a system in mechanics means the net external force and torque on the system are zero.
In a state of thermodynamic equilibrium, the microscopic constituents of a system are
not in equilibrium (in the sense of mechanics).
Heat capacity, in general, depends on the process the system goes through when heat is
supplied.
In isothermal quasi-static processes, heat is absorbed or given out by the system even
though at every stage the gas has the same temperature as that of the surrounding
reservoir. This is possible because of the infinitesimal difference in temperature between
the system and the reservoir.
EXERCISES
is
1.
A geyser heats water flowing at the rate of 3.0 litres per minute from 27 C to 77 C.
If the geyser operates on a gas burner, what is the rate of consumption of the fuel if
its heat of combustion is 4.0 104 J/g ?
12.2
no N
C
tt E
o R
be T
re
pu
bl
12.1
12.3
Explain why
(a) Two bodies at different temperatures T1 and T2 if brought in thermal contact do
not necessarily settle to the mean temperature (T1 + T2 )/2.
(b) The coolant in a chemical or a nuclear plant (i.e., the liquid used to prevent
the different parts of a plant from getting too hot) should have high specific
heat.
(c) Air pressure in a car tyre increases during driving.
(d) The climate of a harbour town is more temperate than that of a town in a desert
at the same latitude.
12.4
12.5
12.6
Two cylinders A and B of equal capacity are connected to each other via a stopcock.
A contains a gas at standard temperature and pressure. B is completely evacuated.
The entire system is thermally insulated. The stopcock is suddenly opened. Answer
the following :
(a) What is the final pressure of the gas in A and B ?
(b) What is the change in internal energy of the gas ?
(c) What is the change in the temperature of the gas ?
(d) Do the intermediate states of the system (before settling to the final equilibrium
state) lie on its P-V-T surface ?
THERMODYNAMICS
12.7
12.8
A steam engine delivers 5.4108J of work per minute and services 3.6 109J of heat
per minute from its boiler. What is the efficiency of the engine? How much heat is
wasted per minute?
An electric heater supplies heat to a system at a rate of 100W. If system performs
work at a rate of 75 joules per second. At what rate is the internal energy increasing?
A thermodynamic system is taken from an original state to an intermediate state by
the linear process shown in Fig. (12.13)
bl
is
he
d
12.9
317
no N
C
tt E
o R
be T
re
pu
Fig. 12.13
Its volume is then reduced to the original value from E to F by an isobaric process.
Calculate the total work done by the gas from D to E to F
12.10 A refrigerator is to maintain eatables kept inside at 90C. If room temperature is 360C,
calculate the coefficient of performance.
CHAPTER THIRTEEN
he
d
KINETIC THEORY
13.1 INTRODUCTION
is
bl
Introduction
Molecular nature of matter
Behaviour of gases
Kinetic theory of an ideal gas
Law of equipartition of energy
Specific heat capacity
Mean free path
no N
C
tt E
o R
be T
re
pu
13.1
13.2
13.3
13.4
13.5
13.6
13.7
Summary
Points to ponder
Exercises
Additional exercises
KINETIC THEORY
319
no N
C
tt E
o R
be T
re
pu
bl
is
he
d
Though John Dalton is credited with the introduction of atomic viewpoint in modern science, scholars in
ancient India and Greece conjectured long before the existence of atoms and molecules. In the Vaiseshika
school of thought in India founded by Kanada (Sixth century B.C.) the atomic picture was developed in
considerable detail. Atoms were thought to be eternal, indivisible, infinitesimal and ultimate parts of matter.
It was argued that if matter could be subdivided without an end, there would be no difference between a
mustard seed and the Meru mountain. The four kinds of atoms (Paramanu Sanskrit word for the
smallest particle) postulated were Bhoomi (Earth), Ap (water), Tejas (fire) and Vayu (air) that have characteristic
mass and other attributes, were propounded. Akasa (space) was thought to have no atomic structure and
was continuous and inert. Atoms combine to form different molecules (e.g. two atoms combine to form a
diatomic molecule dvyanuka, three atoms form a tryanuka or a triatomic molecule), their properties depending
upon the nature and ratio of the constituent atoms. The size of the atoms was also estimated, by conjecture
or by methods that are not known to us. The estimates vary. In Lalitavistara, a famous biography of the
Buddha written mainly in the second century B.C., the estimate is close to the modern estimate of atomic
size, of the order of 10 10 m.
In ancient Greece, Democritus (Fourth century B.C.) is best known for his atomic hypothesis. The
word atom means indivisible in Greek. According to him, atoms differ from each other physically, in
shape, size and other properties and this resulted in the different properties of the substances formed
by their combination. The atoms of water were smooth and round and unable to hook on to each
other, which is why liquid /water flows easily. The atoms of earth were rough and jagged, so they held
together to form hard substances. The atoms of fire were thorny which is why it caused painful burns.
These fascinating ideas, despite their ingenuity, could not evolve much further, perhaps because they
were intuitive conjectures and speculations not tested and modified by quantitative experiments - the
hallmark of modern science.
320
PHYSICS
P2V2
= constant = kB
N 2 T2
he
d
P1V1
As
N1T1
PV = KT
(13.1)
(13.2)
is
no N
C
tt E
o R
be T
re
pu
bl
(13.3)
PV = RT
where is the number of moles and R = NA
kB is a universal constant. The temperature T is
absolute temperature. Choosing kelvin scale for
KINETIC THEORY
321
M
M0
N
NA
(13.4)
is
no N
C
tt E
o R
be T
re
pu
bl
pV
J mol
1 1
K
he
d
P (atm)
(13.6)
i.e. P
RT
V
= P1 + P2 +
RT
V
(13.7)
...
(13.8)
(13.9)
322
he
d
is
Example 13.3
What is the average
distance between atoms
(interatomic
distance) in water? Use the data given in
Examples 13.1 and 13.2.
Answer : A given mass of water in vapour state
has 1.67103 times the volume of the same mass
of water in liquid state (Ex. 13.1). This is also
the increase in the amount of volume available
for each molecule of water. When volume
increases by 103 times the radius increases by
V1/3 or 10 times, i.e., 10 2 = 20 . So the
average distance is 2 20 = 40 .
no N
C
tt E
o R
be T
re
pu
bl
PHYSICS
Example 13.4 A vessel contains two nonreactive gases : neon (monatomic) and
oxygen (diatomic). The ratio of their partial
pressures is 3:2. Estimate the ratio of (i)
number of molecules and (ii) mass density
of neon and oxygen in the vessel. Atomic
mass of Ne = 20.2 u, molecular mass of O2
= 32.0 u.
KINETIC THEORY
323
3
2
m1
m2
20.2
32.0
0.947
1
2
M1
M2
(vx, vy, vz ) hits the planar wall parallel to yzplane of area A (= l 2). Since the collision is elastic,
the molecule rebounds with the same velocity;
its y and z components of velocity do not change
in the collision but the x-component reverses
sign. That is, the velocity after collision is
(-vx, vy, vz ) . The change in momentum of the
molecule is : mvx (mvx) = 2mvx . By the
principle of conservation of momentum, the
momentum imparted to the wall in the collision
= 2mvx .
To calculate the force (and pressure) on the
wall, we need to calculate momentum imparted
to the wall per unit time. In a small time interval
t, a molecule with x-component of velocity vx
will hit the wall if it is within the distance vx t
from the wall. That is, all molecules within the
volume Avx t only can hit the wall in time t.
But, on the average, half of these are moving
towards the wall and the other half away from
the wall. Thus the number of molecules with
velocity (vx, vy, vz ) hitting the wall in time t is
A vx t n where n is the number of molecules
per unit volume. The total momentum
transferred to the wall by these molecules in
time t is :
Q = (2mvx) ( n A vx t )
(13.10)
The force on the wall is the rate of momentum
transfer Q/t and pressure is force per unit
area :
P = Q /(A t) = n m vx2
(3.11)
Actually, all molecules in a gas do not have
the same velocity; there is a distribution in
velocities. The above equation therefore, stands
for pressure due to the group of molecules with
speed vx in the x-direction and n stands for the
number density of that group of molecules. The
no N
C
tt E
o R
be T
re
pu
Fig. 13.4
is
m1 / V
m2 /V
bl
he
d
324
PHYSICS
(13.13)
no N
C
tt E
o R
be T
re
pu
(13.14)
P = (1/3) n m v 2
Some remarks on this derivation. First,
though we choose the container to be a cube,
the shape of the vessel really is immaterial. For
a vessel of arbitrary shape, we can always choose
a small infinitesimal (planar) area and carry
through the steps above. Notice that both A and
t do not appear in the final result. By Pascals
law, given in Ch. 10, pressure in one portion of
he
d
is
2
v x2 = vy = v z2
bl
Ludwig
Boltzmann
(1844 1906)
born in
Vienna, Austria, worked on the kinetic theory of gases
independently of Maxwell. A firm advocate of atomism, that is
basic to kinetic theory, Boltzmann provided a statistical
interpretation of the Second Law of thermodynamics and the
concept of entropy. He is regarded as one of the founders of classical
statistical mechanics. The proportionality constant connecting
energy and temperature in kinetic theory is known as Boltzmanns
constant in his honour.
KINETIC THEORY
325
NA
28
6.02 1026
4.65 10 26 kg.
he
d
v 2 = 3 kB T / m = (516)2 m2s-2
The square root of v 2 is known as root mean
square (rms) speed and is denoted by vrms,
( We can also write v 2 as < v2 >.)
vrms = 516 m s-1
The speed is of the order of the speed of sound
in air. It follows from Eq. (13.19) that at the same
temperature, lighter molecules have greater rms
speed.
is
no N
C
tt E
o R
be T
re
pu
M N2
bl
PV = (2/3) N x m v 2
(13.15b)
where N (= nV ) is the number of molecules in
the sample.
The quantity in the bracket is the average
translational kinetic energy of the molecules in
the gas. Since the internal energy E of an ideal
gas is purely kinetic*,
E = N (1/2) m v 2
(13.16)
m1 v12 = m2 v 22 = (3/2) kB T
so that
P = (n1 + n2 + ) kB T
(13.21)
2
vrms
Ar
2
rms Cl
Cl
Cl
Ar
Ar
70.9
=1.77
39.9
Ar
vrms
Cl
= 1.33
* E denotes the translational part of the internal energy U that may include energies due to other degrees of
freedom also. See section 13.5.
326
PHYSICS
no N
C
tt E
o R
be T
re
pu
bl
is
he
d
In a given mass of gas, the velocities of all molecules are not the same, even when bulk
parameters like pressure, volume and temperature are fixed. Collisions change the direction
and the speed of molecules. However in a state of equilibrium, the distribution of speeds is
constant or fixed.
Distributions are very important and useful when dealing with systems containing large
number of objects. As an example consider the ages of different persons in a city. It is not
feasible to deal with the age of each individual. We can divide the people into groups: children
up to age 20 years, adults between ages of 20 and 60, old people above 60. If we want more
detailed information we can choose smaller intervals, 0-1, 1-2,..., 99-100 of age groups. When
the size of the interval becomes smaller, say half year, the number of persons in the interval
will also reduce, roughly half the original number in the one year interval. The number of
persons dN(x) in the age interval x and x+dx is proportional to dx or dN(x) = nx dx. We have
used nx to denote the number of persons at the value of x.
In a similar way the molecular speed distribution gives the number of molecules between
2
the speeds v and v+ dv. dN(v) = 4p N a3ebv v2 dv = nvdv. This is called Maxwell distribution.
The plot of nv against v is shown in the figure. The fraction of the molecules with speeds v and
v+dv is equal to the area of the strip shown. The average of any quantity like v2 is defined by
the integral <v2> = (1/N ) v2 dN(v) = (3kB T/m) which agrees with the result derived from
more elementary considerations.
Hence difference
KINETIC THEORY
he
d
327
is
1
1
1
mv x2
mvy2
mv z2
(13.22)
2
2
2
For a gas in thermal equilibrium at
temperature T the average value of energy
bl
> is
no N
C
tt E
o R
be T
re
pu
denoted by <
1
mv x2
2
1
mvy2
2
1
mv z2
2
3
kBT
2
(13.23)
1
mv x2
2
1
1
kBT ,
mvy2
2
2
1
kBT ,
2
1
1
mv z2
kBT
(13.24)
2
2
A molecule free to move in space needs three
coordinates to specify its location. If it is
constrained to move in a plane it needs two;and
if constrained to move along a line, it needs just
one coordinate to locate it. This can also be
expressed in another way. We say that it has
one degree of freedom for motion in a line, two
for motion in a plane and three for motion in
space. Motion of a body as a whole from one
point to another is called translation. Thus, a
molecule free to move in space has three
translational degrees of freedom. Each
translational degree of freedom contributes a
term that contains square of some variable of
motion, e.g.,
mv x2 and similar terms in
vy and vz. In, Eq. (13.24) we see that in thermal
equilibrium, the average of each such term is
kBT .
328
PHYSICS
1
mv x2
2
1
mv y2
2
1
mv z2
2
1
I1
2
2
1
1
I2
2
2
2
(13.25)
no N
C
tt E
o R
be T
re
pu
bl
is
(13.26)
t
r
v
where k is the force constant of the oscillator
and y the vibrational co-ordinate.
Once again the vibrational energy terms in
Eq. (13.26) contain squared terms of vibrational
variables of motion y and dy/dt .
At this point, notice an important feature in
Eq.(13.26). While each translational and
rotational degree of freedom has contributed only
one squared term in Eq.(13.26), one vibrational
mode contributes two squared terms : kinetic
and potential energies.
Each quadratic term occurring in the
expression for energy is a mode of absorption of
energy by the molecule. We have seen that in
thermal equilibrium at absolute temperature T,
for each translational mode of motion, the
average energy is kBT. A most elegant principle
of classical statistical mechanics (first proved
by Maxwell) states that this is so for each mode
of energy: translational, rotational and
vibrational. That is, in equilibrium, the total
energy is equally distributed in all possible
energy modes, with each mode having an average
energy equal to
kBT. This is known as the
law of equipartition of energy. Accordingly,
each translational and rotational degree of
freedom of a molecule contributes kBT to the
energy while each vibrational frequency
contributes 2
kBT = kBT , since a vibrational
mode has both kinetic and potential energy
modes.
The proof of the law of equipartition of energy
is beyond the scope of this book. Here we shall
apply the law to predict the specific heats of
gases theoretically. Later we shall also discuss
briefly, the application to specific heat of solids.
he
d
1
dy
m
2
dt
1 2
ky
2
* Rotation along the line joining the atoms has very small moment of inertia and does not come into play for
quantum mechanical reasons. See end of section 13.6.
KINETIC THEORY
3
RT
2
(13.27)
dU
3
= RT
dT
2
(13.28)
(13.30)
Cp
5
3
Cv
(13.31)
f
f
(13.36)
no N
C
tt E
o R
be T
re
pu
i.e. Cv = (3 + f ) R, Cp = (4 + f ) R,
he
d
NA
is
3
kBT
2
bl
329
5
5
kBT
NA
RT
(13.32)
2
2
The molar specific heats are then given by
U
Cv (rigid diatomic) =
(rigid diatomic) =
R, Cp =
(13.33)
(13.34)
5
If the diatomic molecule is not rigid but has
in addition a vibrational mode
5
U = k B T + k B T
2
Cv
7
R, C p
2
7
RT
N A =
2
9
R,
2
9
R
7
(13.35)
Cv
Cp - Cv
Cp
(J mol- K- )
(J mol- K- )
(J mol- 1 K- 1)
Monatomic
12.5
20.8
8.31
1.67
Diatomic
20.8
29.1
8.31
1.40
Triatomic
24.93
33.24
8.31
1.33
Table13.2
330
PHYSICS
no N
C
tt E
o R
be T
re
pu
he
d
ordinary
is
at
bl
verified
experimentally
temperatures.
Q
T
U
T
3R
(13.37)
KINETIC THEORY
331
Seeing is Believing
no N
C
tt E
o R
be T
re
pu
bl
is
he
d
Can one see atoms rushing about. Almost but not quite. One can see pollen grains of a flower being
pushed around by molecules of water. The size of the grain is ~ 10-5 m. In 1827, a Scottish botanist
Robert Brown, while examining, under a microscope, pollen grains of a flower suspended in water
noticed that they continuously moved about in a zigzag, random fashion.
Kinetic theory provides a simple explanation of the phenomenon. Any object suspended in water is
continuously bombarded from all sides by the water molecules. Since the motion of molecules is random,
the number of molecules hitting the object in any direction is about the same as the number hitting in
the opposite direction. The small difference between these molecular hits is negligible compared to the
total number of hits for an object of ordinary size, and we do not notice any movement of the object.
When the object is sufficiently small but still visible under a microscope, the difference in molecular
hits from different directions is not altogether negligible, i.e. the impulses and the torques given to the
suspended object through continuous bombardment by the molecules of the medium (water or some
other fluid) do not exactly sum to zero. There is a net impulse and torque in this or that direction. The
suspended object thus, moves about in a zigzag manner and tumbles about randomly. This motion
called now Brownian motion is a visible proof of molecular activity. In the last 50 years or so molecules
have been seen by scanning tunneling and other special microscopes.
In 1987 Ahmed Zewail, an Egyptian scientist working in USA was able to observe not only the
molecules but also their detailed interactions. He did this by illuminating them with flashes of laser
light for very short durations, of the order of tens of femtoseconds and photographing them. ( 1 femtosecond = 10-15 s ). One could study even the formation and breaking of chemical bonds. That is really
seeing !
1/
2 n d2
(13.40)
n=
22.4 10 3
= 2.7 10 25 m -3.
Taking, d = 2 1010 m,
= 6.1 1010 s
and l = 2.9 107 m 1500d
(13.41)
332
PHYSICS
bl
SUMMARY
The ideal gas equation connecting pressure (P ), volume (V ) and absolute temperature
(T ) is
= kB NT
PV = RT
where is the number of moles and N is the number of molecules. R and kB are universal
constants.
no N
C
tt E
o R
be T
re
pu
1.
273
2 1025 m 3
373
he
d
25
So n 2.7 10
is
2.
kB =
R
NA
= 1.38 1023 J K1
Real gases satisfy the ideal gas equation only approximately, more so at low pressures
and high temperatures.
Kinetic theory of an ideal gas gives the relation
1
n m v2
3
where n is number density of molecules, m the mass of the molecule and v 2 is the
mean of squared speed. Combined with the ideal gas equation it yields a kinetic
interpretation of temperature.
1
m v2
2
3.
v2
1/ 2
3k B T
m
This tells us that the temperature of a gas is a measure of the average kinetic energy
of a molecule, independent of the nature of the gas or molecule. In a mixture of gases at
a fixed temperature the heavier molecule has the lower average speed.
The translational kinetic energy
E=
PV =
4.
3
k B T , vrms
2
3
2
kB NT.
KINETIC THEORY
333
6.
The mean free path l is the average distance covered by a molecule between two
successive collisions :
1
2n
d2
is
POINTS TO PONDER
he
d
5.
Pressure of a fluid is not only exerted on the wall. Pressure exists everywhere in a fluid.
Any layer of gas inside the volume of a container is in equilibrium because the pressure
is the same on both sides of the layer.
2.
no N
C
tt E
o R
be T
re
pu
bl
1.
3.
The law of equipartition of energy is stated thus: the energy for each degree of freedom
in thermal equilibrium is k T. Each quadratic term in the total energy expression of
B
a molecule is to be counted as a degree of freedom. Thus, each vibrational mode gives
2 (not 1) degrees of freedom (kinetic and potential energy modes), corresponding to the
energy 2
k T = k T.
B
4.
5.
Molecules of air in a room do not all fall and settle on the ground (due to gravity)
because of their high speeds and incessant collisions. In equilibrium, there is a very
slight increase in density at lower heights (like in the atmosphere). The effect is small
since the potential energy (mgh) for ordinary heights is much less than the average
kinetic energy
mv2 of the molecules.
2
< v > is not always equal to ( < v >)2. The average of a squared quantity is not necessarily
the square of the average. Can you find examples for this statement.
EXERCISES
13.1
Estimate the fraction of molecular volume to the actual volume occupied by oxygen
gas at STP. Take the diameter of an oxygen molecule to be 3 .
13.2
Molar volume is the volume occupied by 1 mol of any (ideal) gas at standard
temperature and pressure (STP : 1 atmospheric pressure, 0 C). Show that it is 22.4
litres.
13.3
Figure 13.8 shows plot of PV/T versus P for 1.00103 kg of oxygen gas at two
different temperatures.
334
PHYSICS
T1
T2
Fig. 13.8
he
d
PV (J K1)
T
bl
is
An oxygen cylinder of volume 30 litres has an initial gauge pressure of 15 atm and
a temperature of 27 C. After some oxygen is withdrawn from the cylinder, the gauge
pressure drops to 11 atm and its temperature drops to 17 C. Estimate the mass of
oxygen taken out of the cylinder (R = 8.31 J mol1 K1, molecular mass of O2 = 32 u).
13.5
An air bubble of volume 1.0 cm3 rises from the bottom of a lake 40 m deep at a
temperature of 12 C. To what volume does it grow when it reaches the surface,
which is at a temperature of 35 C ?
13.6
Estimate the total number of air molecules (inclusive of oxygen, nitrogen, water
vapour and other constituents) in a room of capacity 25.0 m3 at a temperature of
27 C and 1 atm pressure.
13.7
Estimate the average thermal energy of a helium atom at (i) room temperature
(27 C), (ii) the temperature on the surface of the Sun (6000 K), (iii) the temperature
of 10 million kelvin (the typical core temperature in the case of a star).
13.8
Three vessels of equal capacity have gases at the same temperature and pressure.
The first vessel contains neon (monatomic), the second contains chlorine (diatomic),
and the third contains uranium hexafluoride (polyatomic). Do the vessels contain
equal number of respective molecules ? Is the root mean square speed of molecules
the same in the three cases? If not, in which case is vrms the largest ?
13.9
At what temperature is the root mean square speed of an atom in an argon gas
cylinder equal to the rms speed of a helium gas atom at 20 C ? (atomic mass of Ar
= 39.9 u, of He = 4.0 u).
no N
C
tt E
o R
be T
re
pu
13.4
13.10 Estimate the mean free path and collision frequency of a nitrogen molecule in a
cylinder containing nitrogen at 2.0 atm and temperature 17 0C. Take the radius of a
nitrogen molecule to be roughly 1.0 . Compare the collision time with the time the
molecule moves freely between two successive collisions (Molecular mass of N2 =
28.0 u).
KINETIC THEORY
335
Additional Exer
cises
Exercises
13.11 A metre long narrow bore held horizontally (and closed at one end) contains a 76 cm
long mercury thread, which traps a 15 cm column of air. What happens if the tube
is held vertically with the open end at the bottom ?
he
d
13.12 From a certain apparatus, the diffusion rate of hydrogen has an average value of
28.7 cm3 s1. The diffusion of another gas under the same conditions is measured to
have an average rate of 7.2 cm3 s1. Identify the gas.
[Hint : Use Grahams law of diffusion: R1/R2 = ( M2 /M1 )1/2, where R1, R2 are diffusion
rates of gases 1 and 2, and M1 and M2 their respective molecular masses. The law is
a simple consequence of kinetic theory.]
is
13.13 A gas in equilibrium has uniform density and pressure throughout its volume. This
is strictly true only if there are no external influences. A gas column under gravity,
for example, does not have uniform density (and pressure). As you might expect, its
density decreases with height. The precise dependence is given by the so-called law
of atmospheres
bl
where n2, n1 refer to number density at heights h2 and h1 respectively. Use this
relation to derive the equation for sedimentation equilibrium of a suspension in a
liquid column:
n2 = n1 exp [ -mg NA ( - P) (h2 h1)/ ( RT)]
no N
C
tt E
o R
be T
re
pu
where is the density of the suspended particle, and that of surrounding medium.
[NA is Avogadros number, and R the universal gas constant.] [Hint : Use Archimedes
principle to find the apparent weight of the suspended particle.]
13.14 Given below are densities of some solids and liquids. Give rough estimates of the
size of their atoms :
Substance
Carbon (diamond)
Gold
Nitrogen (liquid)
Lithium
Fluorine (liquid)
12.01
197.00
14.01
6.94
19.00
2.22
19.32
1.00
0.53
1.14
[Hint : Assume the atoms to be tightly packed in a solid or liquid phase, and use
the known value of Avogadros number. You should, however, not take the actual
numbers you obtain for various atomic sizes too literally. Because of the crudeness
of the tight packing approximation, the results only indicate that atomic sizes are in
the range of a few ].
Chapter Two
ELECTROSTATIC
POTENTIAL AND
CAPACITANCE
2.1 INTRODUCTION
In Chapters 6 and 8 (Class XI), the notion of potential energy was
introduced. When an external force does work in taking a body from a
point to another against a force like spring force or gravitational force,
that work gets stored as potential energy of the body. When the external
force is removed, the body moves, gaining kinetic energy and losing
an equal amount of potential energy. The sum of kinetic and
potential energies is thus conserved. Forces of this kind are called
conservative forces. Spring force and gravitational force are examples of
conservative forces.
Coulomb force between two (stationary) charges is also a conservative
force. This is not surprising, since both have inverse-square dependence
on distance and differ mainly in the proportionality constants the
masses in the gravitational law are replaced by charges in Coulombs
law. Thus, like the potential energy of a mass in a gravitational
field, we can define electrostatic potential energy of a charge in an
electrostatic field.
Consider an electrostatic field E due to some charge configuration.
First, for simplicity, consider the field E due to a charge Q placed at the
origin. Now, imagine that we bring a test charge q from a point R to a
point P against the repulsive force on it due to the charge Q. With reference
Physics
to Fig. 2.1, this will happen if Q and q are both positive
or both negative. For definiteness, let us take Q, q > 0.
Two remarks may be made here. First, we assume
that the test charge q is so small that it does not disturb
the original configuration, namely the charge Q at the
origin (or else, we keep Q fixed at the origin by some
unspecified force). Second, in bringing the charge q from
FIGURE 2.1 A test charge q (> 0) is
R to P, we apply an external force Fext just enough to
moved from the point R to the
counter the repulsive electric force FE (i.e, Fext= FE).
point P against the repulsive
force on it by the charge Q (> 0)
This means there is no net force on or acceleration of
placed at the origin.
the charge q when it is brought from R to P, i.e., it is
brought with infinitesimally slow constant speed. In
this situation, work done by the external force is the negative of the work
done by the electric force, and gets fully stored in the form of potential
energy of the charge q. If the external force is removed on reaching P, the
electric force will take the charge away from Q the stored energy (potential
energy) at P is used to provide kinetic energy to the charge q in such a
way that the sum of the kinetic and potential energies is conserved.
Thus, work done by external forces in moving a charge q from R to P is
P
WRP =
e xt
id r
FE idr
(2.1)
This work done is against electrostatic repulsive force and gets stored
as potential energy.
At every point in electric field, a particle with charge q possesses a
certain electrostatic potential energy, this work done increases its potential
energy by an amount equal to potential energy difference between points
R and P.
Thus, potential energy difference
U = U P U R = WRP
(2.2)
52
Electrostatic Potential
and Capacitance
(ii) Equation (2.2) defines potential energy difference in terms
of the physically meaningful quantity work. Clearly,
potential energy so defined is undetermined to within an
additive constant.What this means is that the actual value
of potential energy is not physically significant; it is only
the difference of potential energy that is significant. We can
always add an arbitrary constant to potential energy at
every point, since this will not change the potential energy
difference:
(U P + ) (U R + ) = U P U R
W P = U P U = U P
(2.3)
(2.4)
53
Physics
In other words, the electrostatic potential (V )
at any point in a region with electrostatic field is
the work done in bringing a unit positive
charge (without acceleration) from infinity to
that point.
The qualifying remarks made earlier regarding
potential energy also apply to the definition of
potential. To obtain the work done per unit test
charge, we should take an infinitesimal test charge
q, obtain the work done W in bringing it from
infinity to the point and determine the ratio
W/ q. Also, the external force at every point of
the path is to be equal and opposite to the
electrostatic force on the test charge at that point.
2.3 POTENTIAL
DUE TO A
P OINT CHARGE
Consider a point charge Q at the origin (Fig. 2.3). For definiteness, take Q
to be positive. We wish to determine the potential at any point P with
position vector r from the origin. For that we must
calculate the work done in bringing a unit positive
test charge from infinity to the point P. For Q > 0,
the work done against the repulsive force on the
test charge is positive. Since work done is
independent of the path, we choose a convenient
path along the radial direction from infinity to
the point P.
At some intermediate point P on the path, the
electrostatic
force on a unit positive charge is
FIGURE 2.3 Work done in bringing a unit
positive test charge from infinity to the
Q 1
r
(2.5)
point P, against the repulsive force of
4 0r '2
W =
Q
4 0r '2
(2.6)
Q
Q
2 dr =
4
r
'
4
0r
0
W =
Q
4 0r
(2.7)
54
V (r ) =
Q
4 0r
(2.8)
Electrostatic Potential
and Capacitance
Equation (2.8) is true for any
sign of the charge Q, though we
considered Q > 0 in its derivation.
For Q < 0, V < 0, i.e., work done (by
the external force) per unit positive
test charge in bringing it from
infinity to the point is negative. This
is equivalent to saying that work
done by the electrostatic force in
bringing the unit positive charge
form infinity to the point P is
positive. [This is as it should be,
since for Q < 0, the force on a unit
positive test charge is attractive, so
that the electrostatic force and the
FIGURE 2.4 Variation of potential V with r [in units of
displacement (from infinity to P) are
(Q/ 40) m-1] (blue curve) and field with r [in units
in the same direction.] Finally, we
-2
of (Q/4 0) m ] (black curve) for a point charge Q.
note that Eq. (2.8) is consistent with
the choice that potential at infinity
be zero.
Figure (2.4) shows how the electrostatic potential ( 1/r ) and the
electrostatic field ( 1/r 2 ) varies with r.
Example 2.1
(a) Calculate the potential at a point P due to a charge of 4 107C
located 9 cm away.
9
(b) Hence obtain the work done in bringing a charge of 2 10 C
from infinity to the point P. Does the answer depend on the path
along which the charge is brought?
Solution
(a)
= 4 104 V
9
2.4 POTENTIAL
DUE TO AN
EXAMPLE 2.1
(b) W = qV = 2 10 C 4 10 V
= 8 105 J
No, work done will be path independent. Any arbitrary infinitesimal
path can be resolved into two perpendicular displacements: One along
r and another perpendicular to r. The work done corr esponding to
the later will be zero.
ELECTRIC DIPOLE
55
Physics
the field falls off, at large distance, not as
1/r 2 (typical of field due to a single charge)
but as 1/r 3. We, now, determine the electric
potential due to a dipole and contrast it
with the potential due to a single charge.
As before, we take the origin at the
centre of the dipole. Now we know that the
electric field obeys the superposition
principle. Since potential is related to the
work done by the field, electrostatic
potential also follows the superposition
principle. Thus, the potential due to the
dipole is the sum of potentials due to the
charges q and q
V =
1 q q
4 0 r1 r2
(2.9)
Now, by geometry,
(2.10)
We take r much greater than a ( r >> a ) and retain terms only upto
the first order in a/r
2a cos
r 2 1
(2.11)
Similarly,
2a cos
r22 r 2 1 +
(2.12)
Using the Binomial theorem and retaining terms upto the first order
in a/r ; we obtain,
1 1
2a cos
1
r1 r
r
1 1
2a cos
1 +
r2 r
r
1/2
1
a
1 + cos
r
r
[2.13(a)]
1
a
1 cos
r
r
[2.13(b)]
1/2
56
q 2 acos p cos
=
4 0
r2
4 0r 2
(2.14)
Electrostatic Potential
and Capacitance
where r is the unit vector along the position vector OP.
The electric potential of a dipole is then given by
V =
1 pi r
4 0 r 2 ;
(r >> a)
(2.15)
1 p
4 0 r 2
(2.16)
2.5 POTENTIAL
DUE TO A
SYSTEM
OF
CHARGES
1 q1
4 0 r1P
1 q2
1 q3
V3 =
,
4 0 r2P
4 0 r3P
57
Physics
=
1 q1 q2
q
+
+ ...... + n
4 0 r1P r2 P
rnP
(2.18)
1 q
4 0 r
[2.19(a)]
(r R )
where q is the total charge on the shell and R its radius. The electric field
inside the shell is zero. This implies (Section 2.6) that potential is constant
inside the shell (as no work is done in moving a charge inside the shell),
and, therefore, equals its value at the surface, which is
V =
1 q
4 0 R
[2.19(b)]
8
FIGURE 2.7
Let P be the required point on the x-axis where the potential is zero.
If x is the x-coordinate of P, obviously x must be positive. (Ther e is no
possibility of potentials due to the two charges adding up to zero for
x < 0.) If x lies between O and A, we have
3 10 8
2 10 8
=0
2
(15 x) 10 2
4 0 x 10
1
58
EXAMPLE 2.2
= 0
x 15 x
which gives x = 9 cm.
If x lies on the extended line OA, the required condition is
3
2
=0
x x 15
Electrostatic Potential
and Capacitance
EXAMPLE 2.2
which gives
x = 45 cm
Thus, electric potential is zero at 9 cm and 45 cm away from the
positive charge on the side of the negative charge. Note that the
formula for potential used in the calculation required choosing
potential to be zero at infinity.
Example 2.3 Figures 2.8 (a) and (b) show the field lines of a positive
and negative point charge respectively.
(a) As V
(b)
(c)
(e)
EXAMPLE 2.3
(d)
http://video.mit.edu/watch/4-electrostatic-potential-elctric-energy-ev-conservative-fieldequipotential-sufaces-12584/
FIGURE 2.8
59
Physics
2.6 EQUIPOTENTIAL S URFACES
An equipotential surface is a surface with a constant value of potential
at all points on the surface. For a single charge q, the potential is given
by Eq. (2.8):
1 q
4 o r
This shows that V is a constant if r is constant . Thus, equipotential
surfaces of a single point charge are concentric spherical surfaces centred
at the charge.
Now the electric field lines for a single charge q are radial lines starting
from or ending at the charge, depending on whether q is positive or negative.
Clearly, the electric field at every point is normal to the equipotential surface
passing through that point. This is true in general: for any charge
configuration, equipotential surface through a point is normal to the
electric field at that point. The proof of this statement is simple.
If the field were not normal to the equipotential surface, it would
have non-zero component along the surface. To move a unit test charge
against the direction of the component of the field, work would have to
be done. But this is in contradiction to the definition of an equipotential
surface: there is no potential difference between any two points on the
surface and no work is required to move a test charge on the surface.
The electric field must, therefore, be normal to the equipotential surface
at every point. Equipotential surfaces offer an alternative visual picture
in addition to the picture of electric field lines around a charge
configuration.
V=
For a uniform electric field E, say, along the x -axis, the equipotential
surfaces are planes normal to the x -axis, i.e., planes parallel to the y-z
plane (Fig. 2.10). Equipotential surfaces for (a) a dipole and (b) two
identical positive charges are shown in Fig. 2.11.
60
Electrostatic Potential
and Capacitance
2.6.1 Relation between field and potential
Consider two closely spaced equipotential surfaces A and B (Fig. 2.12)
with potential values V and V + V, where V is the change in V in the
direction of the electric field E. Let P be a point on the
surface B. l is the perpendicular distance of the
surface A from P. Imagine that a unit positive charge
is moved along this perpendicular from the surface B
to surface A against the electric field. The work done
in this process is |E| l.
This work equals the potential difference
VAVB.
Thus,
|E| l = V (V + V)= V
i.e., |E|=
V
l
(2.20)
E =
V
V
= +
l
l
(2.21)
OF A
S YSTEM
OF
CHARGES
Consider first the simple case of two charges q1and q2 with position vector
r1 and r2 relative to some origin. Let us calculate the work done
(externally) in building up this configuration. This means that we consider
the charges q1 and q2 initially at infinity and determine the work done by
an external agency to bring the charges to the given locations. Suppose,
first the charge q1 is brought from infinity to the point r1 . There is no
external field against which work needs to be done, so work done in
bringing q1 from infinity to r1 is zero. This charge produces a potential in
space given by
1 q1
V1 =
4 0 r1P
where r1P is the distance of a point P in space from the location of q1.
From the definition of potential, work done in bringing charge q2 from
infinity to the point r2 is q2 times the potential at r2 due to q1 :
work done on q2 =
1 q1q2
4 0 r12
61
Physics
where r12 is the distance between points 1 and 2.
Since electrostatic force is conservative, this work gets
stored in the form of potential energy of the system. Thus,
the potential energy of a system of two charges q1 and q2 is
FIGURE 2.13 Potential energy of a
system of charges q 1 and q2 is
directly proportional to the product
of charges and inversely to the
distance between them.
U =
1 q1q2
4 0 r12
(2.22)
1 q1q2
4 0 r12
(2.23)
1 q1q3 q2q 3
+
(2.25)
4 0 r13
r23
The total work done in assembling the charges
at the given locations is obtained by adding the work
done in different steps [Eq. (2.23) and Eq. (2.25)],
q 3V1,2 (r3 ) =
62
Electrostatic Potential
and Capacitance
is characteristic of the present state of configuration, and not the way
the state is achieved.
Example 2.4 Four charges are arranged at the corners of a square
ABCD of side d, as shown in Fig. 2.15.(a) Find the work required to
put together this arrangement. (b) A charge q0 is brought to the centre
E of the square, the four charges being held fixed at its corners. How
much extra work is needed to do this?
FIGURE 2.15
Solution
(a) Since the work done depends on the final arrangement of the
charges, and not on how they ar e put together, we calculate work
needed for one way of putting the charges at A, B, C and D. Suppose,
first the charge +q is brought to A, and then the charges q, +q, and
q are brought to B, C and D, respectively. The total work needed can
be calculated in steps:
(i) Work needed to bring charge + q to A when no charge is present
elsewhere: this is zero.
(ii) Work needed to bring q to B when + q is at A. This is given by
(charge at B) (electrostatic potential at B due to charge +q at A)
q
q2
= q
=
4 d
4 d
0
+q
q
= +q
+
4 0 d 2 4 0 d
q 2
1
1
40 d
2
(iv) Work needed to bring q to D when +q at A,q at B, and +q at C.
This is given by (charge at D) (potential at D due to charges at A,
B and C)
+q
q
q
= q
+
+
4
d
4
d
2
4
0 d
0
0
=
q 2
40 d
1
2
EXAMPLE 2.4
63
Physics
Add the work done in steps (i), (ii), (iii) and (iv). The total work
required is
=
EXAMPLE 2.4
q 2
(0) + (1) +
4 0d
q 2
4 2
4 0d
1
1
1
+ 2
2
2
The work done depends only on the arrangement of the charges, and
not how they are assembled. By definition, this is the total
electrostatic energy of the charges.
(Students may try calculating same work/energy by taking charges
in any other order they desire and convince themselves that the energy
will remain the same.)
(b) The extra work necessary to bring a charge q0 to the point E when
the four charges are at A, B, C and D is q 0 (electrostatic potential at
E due to the charges at A, B, C and D). The electrostatic potential at
E is clearly zero since potential due to A and C is cancelled by that
due to B and D. Hence no work is required to bring any charge to
point E.
IN AN
EXTERNAL FIELD
64
In Section 2.7, the source of the electric field was specified the charges
and their locations - and the potential energy of the system of those charges
was determined. In this section, we ask a related but a distinct question.
What is the potential energy of a charge q in a given field? This question
was, in fact, the starting point that led us to the notion of the electrostatic
potential (Sections 2.1 and 2.2). But here we address this question again
to clarify in what way it is different from the discussion in Section 2.7.
The main difference is that we are now concerned with the potential
energy of a charge (or charges) in an external field. The external field E is
not produced by the given charge(s) whose potential energy we wish to
calculate. E is produced by sources external to the given charge(s).The
external sources may be known, but often they are unknown or
unspecified; what is specified is the electric field E or the electrostatic
potential V due to the external sources. We assume that the charge q
does not significantly affect the sources producing the external field. This
is true if q is very small, or the external sources are held fixed by other
unspecified forces. Even if q is finite, its influence on the external sources
may still be ignored in the situation when very strong sources far away
at infinity produce a finite field E in the region of interest. Note again that
we are interested in determining the potential energy of a given charge q
(and later, a system of charges) in the external field; we are not interested
in the potential energy of the sources producing the external electric field.
The external electric field E and the corresponding external potential
V may vary from point to point. By definition, V at a point P is the work
done in bringing a unit positive charge from infinity to the point P.
Electrostatic Potential
and Capacitance
(We continue to take potential at infinity to be zero.) Thus, work done in
bringing a charge q from infinity to the point P in the external field is qV.
This work is stored in the form of potential energy of q. If the point P has
position vector r relative to some origin, we can write:
Potential energy of q at r in an external field
= qV(r)
(2.27)
q1q2
4 o r12
(2.28)
Thus,
Potential energy of the system
= the total work done in assembling the configuration
= q1V ( r1 ) + q2V (r2 ) +
q1q2
4 0r12
(2.29)
EXAMPLE 2.5
Example 2.5
(a) Determine the electrostatic potential energy of a system consisting
of two charges 7 C and 2 C (and with no external field) placed
at (9 cm, 0, 0) and (9 cm, 0, 0) respectively.
(b) How much work is required to separate the two charges infinitely
away from each other?
65
Physics
(c) Suppose that the same system of charges is now placed in an
external electric field E = A (1/r 2); A = 9 105 C m2. What would
the electrostatic energy of the configuration be?
Solution
(a) U =
1 q1q2
7 ( 2) 10 12
= 9 109
= 0.7 J.
4 0 r
0.18
E XAMPLE 2.5
q1V ( r1 ) + q2 V (r2 ) = A
q1V ( r1 ) + q2 V (r2 ) +
q1q 2
7 C
2 C
= A
+A
0.7 J
4 0r12
0.09 m
0.09 m
= 70 20 0.7 = 49.3 J
W =
e xt ( ) d = pE sin d
0
= pE ( cos 0 cos 1 )
(2.31)
This work is stored as the potential energy of the system. We can then
associate potential energy U( ) with an inclination of the dipole. Similar
to other potential energies, there is a freedom in choosing the angle where
the potential energy U is taken to be zero. A natural choice is to take
0 = / 2. (n explanation for it is provided towards the end of discussion.)
We can then write,
66
(2.32)
Electrostatic Potential
and Capacitance
This expression can alternately be understood also from Eq. (2.29).
We apply Eq. (2.29) to the present system of two charges +q and q. The
potential energy expression then reads
U ( ) = q [V ( r1 ) V ( r2 )]
q2
4 0 2a
(2.33)
q2
q2
= pi E
4 0 2a
4 0 2a
(2.34)
2.9 ELECTROSTATICS
OF
EXAMPLE 2.6
CONDUCTORS
67
Physics
the situation in this case is more involved the movement of the charge
carriers is affected both by the external electric field as also by the
so-called chemical forces (see Chapter 3). We shall restrict our discussion
to metallic solid conductors. Let us note important results regarding
electrostatics of conductors.
68
This follows from results 1 and 2 above. Since E = 0 inside the conductor
and has no tangential component on the surface, no work is done in
moving a small test charge within the conductor and on its surface. That
is, there is no potential difference between any two points inside or on
the surface of the conductor. Hence, the result. If the conductor is charged,
Electrostatic Potential
and Capacitance
electric field normal to the surface exists; this means potential will be
different for the surface and a point just outside the surface.
In a system of conductors of arbitrary size, shape and
charge configuration, each conductor is characterised by a constant
value of potential, but this constant may differ from one conductor to
the other.
n
0
(2.35)
E=
S
0
(2.36)
6. Electrostatic shielding
Consider a conductor with a cavity, with no charges inside the cavity. A
remarkable result is that the electric field inside the cavity is zero, whatever
be the size and shape of the cavity and whatever be the charge on the
conductor and the external fields in which it might be placed. We have
proved a simple case of this result already: the electric field inside a charged
spherical shell is zero. The proof of the result for the shell makes use of
the spherical symmetry of the shell (see Chapter 1). But the vanishing of
electric field in the (charge-free) cavity of a conductor is, as mentioned
above, a very general result. A related result is that even if the conductor
69
Physics
70
EXAMPLE 2.7
Example 2.7
(a) A comb run through ones dry hair attracts small bits of paper.
Why?
What happens if the hair is wet or if it is a rainy day? (Remember,
a paper does not conduct electricity.)
(b) Or dinary rubber is an insulator. But special rubber tyr es of
aircraft are made slightly conducting. Why is this necessary?
(c) Vehicles carrying inflammable materials usually have metallic
ropes touching the ground during motion. Why?
(d) A bird perches on a bare high power line, and nothing happens
to the bird. A man standing on the ground touches the same line
and gets a fatal shock. Why?
Solution
(a) This is because the comb gets charged by friction. The molecules
in the paper gets polarised by the charged comb, resulting in a
net force of attraction. If the hair is wet, or if it is rainy day, friction
between hair and the comb reduces. The comb does not get
charged and thus it will not attract small bits of paper.
Electrostatic Potential
and Capacitance
2.10 DIELECTRICS
AND
E XAMPLE 2.7
POLARISATION
Physics
In an external electric field, the
positive and negative charges of a nonpolar molecule are displaced in opposite
directions. The displacement stops when
the external force on the constituent
charges of the molecule is balanced by
the restoring force (due to internal fields
in the molecule). The non-polar molecule
thus develops an induced dipole moment.
The dielectric is said to be polarised by
the external field. We consider only the
simple situation when the induced dipole
moment is in the direction of the field and
is proportional to the field strength.
(Substances for which this assumption
is true are called linear isotropic
dielectrics.) The induced dipole moments
of different molecules add up giving a net
dipole moment of the dielectric in the
presence of the external field.
A dielectric with polar molecules also
develops a net dipole moment in an
external field, but for a different reason.
FIGURE 2.22 A dielectric develops a net dipole
In the absence of any external field, the
moment in an external electric field. (a) Non-polar
different permanent dipoles are oriented
molecules, (b) Polar molecules.
randomly due to thermal agitation; so
the total dipole moment is zero. When
an external field is applied, the individual dipole moments tend to align
with the field. When summed over all the molecules, there is then a net
dipole moment in the direction of the external field, i.e., the dielectric is
polarised. The extent of polarisation depends on the relative strength of
two mutually opposite factors: the dipole potential energy in the external
field tending to align the dipoles with the field and thermal energy tending
to disrupt the alignment. There may be, in addition, the induced dipole
moment effect as for non-polar molecules, but generally the alignment
effect is more important for polar molecules.
Thus in either case, whether polar or non-polar, a dielectric develops
a net dipole moment in the presence of an external field. The dipole
moment per unit volume is called polarisation and is denoted by P. For
linear isotropic dielectrics,
P = e E
72
(2.37)
Electrostatic Potential
and Capacitance
every volume element v of the slab has a dipole moment
P v in the direction of the field. The volume element v is
macroscopically small but contains a very large number of
molecular dipoles. Anywhere inside the dielectric, the
volume element v has no net charge (though it has net
dipole moment). This is, because, the positive charge of one
dipole sits close to the negative charge of the adjacent dipole.
However, at the surfaces of the dielectric normal to the
electric field, there is evidently a net charge density. As seen
in Fig 2.23, the positive ends of the dipoles remain
unneutralised at the right surface and the negative ends at
the left surface. The unbalanced charges are the induced
charges due to the external field.
Thus the polarised dielectric is equivalent to two charged
surfaces with induced surface charge densities, say p
and p. Clearly, the field produced by these surface charges
opposes the external field. The total field in the dielectric
is, thereby, reduced from the case when no dielectric is
present. We should note that the surface charge density
p arises from bound (not free charges) in the dielectric.
2.11 CAPACITORS
AND
CAPACITANCE
Physics
geometrical configuration (shape, size, separation) of the system of two
conductors. [As we shall see later, it also depends on the nature of the
insulator (dielectric) separating the two conductors.] The SI unit of
capacitance is 1 farad (=1 coulomb volt-1) or 1 F = 1 C V 1. A capacitor
with fixed capacitance is symbolically shown as --| |--, while the one with
variable capacitance is shown as
.
Equation (2.38) shows that for large C, V is small for a given Q. This
means a capacitor with large capacitance can hold large amount of charge
Q at a relatively small V. This is of practical importance. High potential
difference implies strong electric field around the conductors. A strong
electric field can ionise the surrounding air and accelerate the charges so
produced to the oppositely charged plates, thereby neutralising the charge
on the capacitor plates, at least partly. In other words, the charge of the
capacitor leaks away due to the reduction in insulating power of the
intervening medium.
The maximum electric field that a dielectric medium can withstand
without break-down (of its insulating property) is called its dielectric
strength; for air it is about 3 106 Vm1. For a separation between
conductors of the order of 1 cm or so, this field corresponds to a potential
difference of 3 104 V between the conductors. Thus, for a capacitor to
store a large amount of charge without leaking, its capacitance should
be high enough so that the potential difference and hence the electric
field do not exceed the break-down limits. Put differently, there is a limit
to the amount of charge that can be stored on a given capacitor without
significant leaking. In practice, a farad is a very big unit; the most common
units are its sub-multiples 1 F = 106 F, 1 nF = 109 F, 1 pF = 1012 F,
etc. Besides its use in storing charge, a capacitor is a key element of most
ac circuits with important functions, as described in Chapter 7.
74
E=
20
20
=0
(2.39)
Electrostatic Potential
and Capacitance
Outer region II (region below the plate 2),
E=
=0
20 20
(2.40)
In the inner region between the plates 1 and 2, the electric fields due
to the two charged plates add up, giving
E=
Q
+
=
=
20 20 0 0 A
(2.41)
1 Qd
(2.42)
0 A
8.85 10 12 C2 N 1m 2 1 m2
= 8.85 10 9 F
(2.44)
10 3 m
(You can check that if 1F= 1C V 1 = 1C (NC1m)1 = 1 C2 N1m1.)
This shows that 1F is too big a unit in practice, as remarked earlier.
Another way of seeing the bigness of 1F is to calculate the area of the
plates needed to have C = 1F for a separation of, say 1 cm:
C =
Cd
1F 10 2 m
= 10 9 m 2
0
8.85 10 12 C2N 1m 2
which is a plate about 30 km in length and breadth!
A=
2.13 EFFECT
OF
DIELECTRIC
ON
V = Ed =
http://micro.magnet.fsu.edu/electromag/java/capacitance/
The direction of electric field is from the positive to the negative plate.
Thus, the electric field is localised between the two plates and is
uniform throughout. For plates with finite area, this will not be true near
the outer boundaries of the plates. The field lines bend outward at the
edges an effect called fringing of the field. By the same token, will not
be strictly uniform on the entire plate. [E and are related by Eq. (2.35).]
However, for d2 << A, these effects can be ignored in the regions sufficiently
far from the edges, and the field there is given by Eq. (2.41). Now for
uniform electric field, potential difference is simply the electric field times
the distance between the plates, that is,
(2.45)
CAPACITANCE
75
Physics
and the potential difference V0 is
V0 = E0d
The capacitance C 0 in this case is
C0 =
Q
A
= 0
V0
d
(2.46)
P
0
(2.47)
P
d
0
(2.48)
P =
(2.49)
K
where K is a constant characteristic of the dielectric. Clearly, K > 1. We
then have
V =
d
Qd
=
0 K A 0 K
(2.50)
K =
(2.53)
76
C
C0
(2.54)
Electrostatic Potential
and Capacitance
Eq. (2.54) for the case of a parallel plate capacitor, it holds good for any
type of capacitor and can, in fact, be viewed in general as a definition of
the dielectric constant of a substance.
ELECTRIC
DISPLACEMENT
We have introduced the notion of dielectric constant and arrived at Eq. (2.54), without
giving the explicit relation between the induced charge density p and the polarisation P.
We take without proof the result that
P = P in
where n is a unit vector along the outward normal to the surface. Above equation is
general, true for any shape of the dielectric. For the slab in Fig. 2.23, P is along n at the
at the left surface. Thus at the right surface, induced
right surface and opposite to n
charge density is positive and at the left surface, it is negative, as guessed already in our
qualitative discussion before. Putting the equation for electric field in vector form
E in =
Pi n
0
or ( 0 E + P) i n
=
The quantity 0 E + P is called the electric displacement and is denoted by D. It is a
vector quantity. Thus,
D = 0 E + P, D i n
= ,
The significance of D is this : in vacuum, E is related to the free charge density .
When a dielectric medium is present, the corresponding role is taken up by D. For a
dielectric medium, it is D not E that is directly related to free charge density , as seen in
above equation. Since P is in the same direction as E, all the three vectors P, E and D are
parallel.
The ratio of the magnitudes of D and E is
D
0
=
= 0K
E P
Thus,
D = 0 K E
and P = D 0E = 0 (K 1)E
This gives for the electric susceptibility e defined in Eq. (2.37)
e = 0 (K1)
Solution Let E 0 = V0/d be the electric field between the plates when
there is no dielectric and the potential difference is V0. If the dielectric
is now inserted, the electric field in the dielectric will be E = E0/K.
The potential difference will then be
EXAMPLE 2.8
77
E XAMPLE 2.8
Physics
1
E 3
V = E0 ( d ) + 0 ( d )
4
K 4
1
3
K +3
= E 0 d( +
) = V0
4 4K
4K
The potential difference decreases by the factor (K + 3)/K while the
free charge Q0 on the plates remains unchanged. The capacitance
thus increases
Q
4 K Q0
4K
C= 0 =
=
C0
V
K + 3 V0
K +3
2.14 COMBINATION
OF
CAPACITORS
Q
Q
V = V1 + V2 = C + C
1
2
(2.55)
V
1
1
i.e., Q = C + C ,
1
2
(2.56)
78
1
1
1
=
+
C C1 C 2
(2.58)
Electrostatic Potential
and Capacitance
The proof clearly goes through for any number of
capacitors arranged in a similar way. Equation (2.55),
for n capacitors arranged in series, generalises to
Q
Q
Q
V = V1 + V2 + ... + Vn =
+
+ ... +
(2.59)
C1 C 2
Cn
Following the same steps as for the case of two
capacitors, we get the general formula for effective
capacitance of a series combination of n capacitors:
1
1
1
1
1
=
+
+
+ ... +
(2.60)
C C1 C 2 C 3
Cn
E XAMPLE 2.9
FIGURE 2.29
79
Physics
Solution
(a) In the given network, C1, C2 and C3 are connected in series. The
effective capacitance C of these three capacitors is given by
1
1
1
1
=
+
+
C C1 C2 C3
For C1 = C2 = C3 = 10 F, C = (10/3) F. The network has C and C4
connected in parallel. Thus, the equivalent capacitance C of the
network is
E XAMPLE 2.9
10
C = C + C4 =
+ 10 F =13.3F
3
(b) Clearly, from the figure, the charge on each of the capacitors, C1,
C2 and C3 is the same, say Q. Let the charge on C4 be Q . Now, since
the potential difference across AB is Q/C1, across BC is Q/C2, across
CD is Q/C3 , we have
Q
Q
Q
+
+
= 500 V
C1 C2 C3
Also, Q /C4 = 500 V.
IN A
CAPACITOR
80
W = V Q =
Q
Q
C
(2.68)
Electrostatic Potential
and Capacitance
Since Q can be made as small as we like, Eq. (2.68) can be written as
1
[(Q + Q )2 Q 2 ]
(2.69)
2C
Equations (2.68) and (2.69) are identical because the term of second
order in Q , i.e., Q 2 /2C, is negligible, since Q is arbitrarily small. The
total work done (W) is the sum of the small work ( W ) over the very large
number of steps involved in building the charge Q from zero to Q.
W =
W =
=
=
1
[(Q + Q )2 Q 2 ]
2C
(2.70)
1
[{ Q 2 0} + {(2 Q )2 Q 2 } +{(3 Q )2 (2 Q )2 } + ...
2C
+ {Q2 (Q Q)2 }]
(2.71)
1
Q
[Q 2 0] =
(2.72)
2C
2C
The same result can be obtained directly from Eq. (2.68) by integration
=
Q
1 Q 2
W =
Q' =
C
C 2
0
=
0
Q2
2C
1 Q 2 ( A )2
d
=
0 A
2 C
2
(2.74)
(2.75)
(2.76)
81
Physics
Note that Ad is the volume of the region between the plates (where
electric field alone exists). If we define energy density as energy stored
per unit volume of space, Eq (2.76) shows that
Energy density of electric field,
u =(1/2)0 E 2
(2.77)
Though we derived Eq. (2.77) for the case of a parallel plate capacitor,
the result on energy density of an electric field is, in fact, very general and
holds true for electric field due to any configuration of charges.
Example 2.10 (a) A 900 pF capacitor is charged by 100 V battery
[Fig. 2.31(a)]. How much electrostatic energy is stored by the capacitor?
(b) The capacitor is disconnected from the battery and connected to
another 900 pF capacitor [Fig. 2.31(b)]. What is the electrostatic energy
stored by the system?
FIGURE 2.31
Solution
(a) The charge on the capacitor is
12
8
Q = CV = 900 10 F 100 V = 9 10 C
The energy stored by the capacitor is
2
= (1/2) CV = (1/2) QV
= (1/2) 9 108C 100 V = 4.5 10 6 J
(b) In the steady situation, the two capacitors have their positive
plates at the same potential, and their negative plates at the
same potential. Let the common potential difference be V. The
charge on each capacitor is then Q = CV . By charge conservation,
Q = Q/2. This implies V = V/2. The total energy of the system is
1
1
Q ' V ' = QV = 2.25 106 J
2
4
Thus in going from (a) to (b), though no charge is lost; the final
energy is only half the initial energy. Where has the remaining
energy gone?
There is a transient period before the system settles to the
situation (b). During this period, a transient current flows from
the first capacitor to the second. Energy is lost during this time
in the form of heat and electromagnetic radiation.
82
EXAMPLE 2.10
= 2
Electrostatic Potential
and Capacitance
1 Q
4 0 R
(2.78)
1 q
4 0 r at surface of small sphere
1 q
4 0 R at large shell of radius R.
(2.79)
http://www.physics.gla.ac.uk/~kskeldon/PubSci/exhibits/E10/
This is a machine that can build up high voltages of the order of a few
million volts. The resulting large electric fields are used to accelerate
charged particles (electrons, protons, ions) to high energies needed for
experiments to probe the small scale structure of matter. The principle
underlying the machine is as follows.
Suppose we have a large spherical conducting shell of radius R, on
which we place a charge Q. This charge spreads itself uniformly all over
the sphere. As we have seen in Section 1.14, the field outside the sphere
is just that of a point charge Q at the centre; while the field inside the
sphere vanishes. So the potential outside is that of a point charge; and
inside it is constant, namely the value at the radius R. We thus have:
Potential inside conducting spherical shell of radius R carrying charge Q
= constant
Taking both charges q and Q into account we have for the total
potential V and the potential difference the values
V (R ) =
V (r ) =
1 Q q
+
4 0 R R
1 Q q
+
4 0 R r
V (r ) V ( R ) =
q
4 0
1 1
r R
(2.80)
83
Physics
will immediately flow onto the matter, even
though the charge Q may be quite large. The
natural tendency is for positive charge to
move from higher to lower potential. Thus,
provided we are somehow able to introduce
the small charged sphere into the larger one,
we can in this way keep piling up larger and
larger amount of charge on the latter. The
potential (Eq. 2.78) at the outer sphere would
also keep rising, at least until we reach the
breakdown field of air.
This is the principle of the van de Graaff
generator. It is a machine capable of building
up potential difference of a few million volts,
and fields close to the breakdown field of air
which is about 3 106 V/m. A schematic
diagram of the van de Graaff generator is given
in Fig. 2.33. A large spherical conducting
FIGURE 2.33 Principle of construction
shell (of few metres radius) is supported at a
of Van de Graaf f generator.
height several meters above the ground on
an insulating column. A long narrow endless
belt insulating material, like rubber or silk, is wound around two pulleys
one at ground level, one at the centre of the shell. This belt is kept
continuously moving by a motor driving the lower pulley. It continuously
carries positive charge, sprayed on to it by a brush at ground level, to the
top. There it transfers its positive charge to another conducting brush
connected to the large shell. Thus positive charge is transferred to the
shell, where it spreads out uniformly on the outer surface. In this way,
voltage differences of as much as 6 or 8 million volts (with respect to
ground) can be built up.
SUMMARY
84
1.
2.
Potential at a point is the work done per unit charge (by an external
agency) in bringing a charge from infinity to that point. Potential at a
point is arbitrary to within an additive constant, since it is the potential
difference between two points which is physically significant. If potential
at infinity is chosen to be zero; potential at a point with position vector
r due to a point charge Q placed at the origin is given is given by
1 Q
V (r ) =
4 o r
3.
Electrostatic Potential
and Capacitance
The result is true also for a dipole (with charges q and q separated by
2a) for r >> a .
4.
V =
1 q1 q 2
q
( +
+ ... + n )
4 0 r1P r2P
rn P
6.
U =
1 q1 q2
4 0 r12
E=
9.
n where n
is the unit vector along the outward normal to the
0
A
d
where A is the area of each plate and d the separation between them.
10. If the medium between the plates of a capacitor is filled with an insulating
substance (dielectric), the electric field due to the charged plates induces
a net dipole moment in the dielectric. This effect, called polarisation,
gives rise to a field in the opposite direction. The net electric field inside
the dielectric and hence the potential difference between the plates is
thus reduced. Consequently, the capacitance C increases from its value
C 0 when there is no medium (vacuum),
C = KC 0
where K is the dielectric constant of the insulating substance.
85
Physics
11. For capacitors in the series combination, the total capacitance C is given by
1
1
1
1
=
+
+
+ ...
C C1 C2 C 3
In the parallel combination, the total capacitance C is:
C = C 1 + C2 + C3 + ...
where C1 , C 2, C 3 ... are individual capacitances.
12. The energy U stored in a capacitor of capacitance C, with charge Q and
voltage V is
U =
1
1
1 Q2
QV = CV 2 =
2
2
2 C
The electric energy density (energy per unit volume) in a region with
2
electric field is (1/2)0 E .
13. A Van de Graaff generator consists of a large spherical conducting shell
(a few metre in diameter). By means of a moving belt and suitable brushes,
charge is continuously transferred to the shell and potential difference
of the order of several million volts is built up, which can be used for
accelerating charged particles.
Physical quantity
Potential
Symbol
or V
Dimensions
Unit
[M1 L 2 T3 A1]
Remark
Potential difference is
physically significant
Capacitance
Polarisation
Dielectric constant
[M1 L 2 T4 A 2]
2
[L
AT]
F
C m-2
[Dimensionless]
POINTS TO PONDER
1.
2.
3.
86
n
0 outside. Electric potential is, however
Electrostatic Potential
and Capacitance
5.
6.
7.
EXERCISES
2.1
2.2
2.3
2.4
2.5
2.6
2.7
2.8
87
Physics
2.9
2.10
2.11
ADDITIONAL EXERCISES
2.12
2.13
2.14
2.15
2.16
0
is a unit vector normal to the surface at a point and
where n
is the surface charge density at that point. (The direction of
is from side 1 to side 2.) Hence show that just outside a
n
/ 0.
conductor, the electric field is n
(b) Show that the tangential component of electrostatic field is
continuous from one side of a charged surface to another. [Hint:
For (a), use Gausss law. For, (b) use the fact that work done by
electrostatic field on a closed loop is zero.]
A long charged cylinder of linear charged density is surrounded
by a hollow co-axial conducting cylinder. What is the electric field in
the space between the two cylinders?
In a hydrogen atom, the electron and proton are bound at a distance
of about 0.53 :
(E2 E1 )i n
=
2.17
88
2.18
Electrostatic Potential
and Capacitance
(a)
2.19
2.20
2.21
2.22
FIGURE 2.34
2.23
2.24
89
Physics
2.25
FIGURE 2.35
2.26
2.27
2.28
2.29
90
FIGURE 2.36
Electrostatic Potential
and Capacitance
2.30
2.31
2.32
2.33
2.34
91
Physics
(c)
(d)
2.35
2.36
2.37
92