A course of pure mathematics, by G.H. Hardy.

About this Item

Title
A course of pure mathematics, by G.H. Hardy.
Author
Hardy, G. H. (Godfrey Harold), 1877-1947.
Publication
Cambridge,: The University Press,
1908.
Rights/Permissions

The University of Michigan Library provides access to these materials for educational and research purposes. These materials are in the public domain in the United States. If you have questions about the collection, please contact Historical Mathematics Digital Collection Help at umhistmath-help@umich.edu. If you have concerns about the inclusion of an item in this collection, please contact Library Information Technology at libraryit-info@umich.edu.

DPLA Rights Statement: No Copyright - United States

Subject terms
Calculus
Functions
Calculus
Functions
Cite this Item
"A course of pure mathematics, by G.H. Hardy." In the digital collection University of Michigan Historical Math Collection. https://name.umdl.umich.edu/ACM1516.0001.001. University of Michigan Library Digital Collections. Accessed May 17, 2024.

Pages

Page [unnumbered]

Start of sub OutputBib BIBLIOGRAPHIC RECORD TARGET Graduate Library University of Michigan Preservation Office Storage Number: ACM1516 UL FMT B RT a BL m T/C DT 07/18/88 R/DT 07/18/88 CC STAT mm E/L 1 010:: a 91001050 035/1:: a (RLIN)MIUG86-B25706 035/2:: a (CaOTULAS)160431095 040:: I a RPB I c RPB d CPaHP I d MiU 050/1:0: a QA303 I b.H24 100:1: a Hardy, G. H. I q (Godfrey Harold), I d 1877-1947. 245:02: 1 a A course of pure mathematics, | c by G.H. Hardy. 260:: a Cambridge, I b The University Press, I c 1908. 300/1:: I a xv, 428 p. b diagrs. Ic 23 cm. 650/1: 0: I a Calculus 650/2: 0: I a Functions 998:: I cDPJ Is 9124 Scanned by Imagenes Digitales Nogales, AZ On behalf of Preservation Division The University of Michigan Libraries Date work Began: Camera Operator:

Page [unnumbered]

CAMBRIDGE UNIVERSITY PRESS WAREHOUSE, C. F. CLAY, MANAGER. kontron: FETTER LANE, E.C. btinburgj: 100, PRINCES STREET. 3Lcipfig: F. A. BROCKHAUS. Berlin: A. ASHER AND CO. ~eOb Awory: G. P. PUTNAM'S SONS. 3aombau anr calctttta: MACMILLAN AND Co., LTD. [All Rights reserved]

Page [unnumbered]

A COURSE K) OF PURE MATHEMATICS BY G. H. HARDY, M.A. FELLOW AND LECTURER OF TRINITY COLLEGE, CAMBRIDGE CAMBRIDGE: At the University Press I908

Page [unnumbered]

(ambribge: PRINTED BY JOHN CLAY, M.A. AT THE UNIVERSITY PRESS.

Page V

PREFACE. THIS book has been designed primarily for the use of first year students at the Universities whose abilities reach or approach something like what is usually described as 'scholarship standard'. I hope that it may be useful to other classes of readers, but it is this class whose wants I have considered first. It is in any case a book for mathematicians: I have nowhere made any attempt to meet the needs of students of engineering or indeed any class of students whose interests are not primarily mathematical. A considerable space is occupied with the discussion and application of the fundamental ideas of the Infinitesimal Calculus, Differential and Integral. But the general range of the book is a good deal wider than is usual in English treatises on the Calculus. There is at present hardly room for a new Calculus of an orthodox pattern. It is indeed not many years since there was urgent need of such a book, but the want has been met by the excellent treatises of Professors Gibson, Lamb, and Osgood, to all of which, I need hardly say, I am greatly indebted. And so I have included in this volume a good deal of matter that would find a place in any Traite d'Analyse, though in English books it is usually separated from the Calculus and classed as 'Higher Algebra' or 'Trigonometry'. In the first chapter I have discussed in some detail the various classes of numbers included in the arithmetical continuum. I have not attempted to include any account of any purely arithmetical theory of irrational number, since I believe all such theories to be entirely unsuitable for elementary teaching. My aim in this chapter is a more modest one: I take the 'linear continuum' for granted and assume the existence of a definite number corresponding to each of its points; and all that I attempt to do is to

Page VI

vi PREFACE analyse and distinguish the various classes of numbers whose existence these assumptions involve. Chapters II and III probably do not present many points of novelty. The account given in Chapter II of the most important classes of functions of x is more systematic and illustrated with much greater detail than is usual in English books. I have included, mainly for the sake of completeness, a certain amount of the elements of coordinate geometry of two and three dimensions: but I have, here and throughout the book, kept geometry in a strictly subordinate position and used it merely for purposes of illustration. I have also avoided any wealth of detail in connexion with the purely formal consequences of De Moivre's Theorem, and have devoted the space thus saved to the inclusion of a good deal of matter concerning vector analysis, bilinear transformation, and so on, which seemed to me likely to be more interesting and more useful as a preparation for Chapter X. I have endeavoured to make Chapter IV one of the principal features of the book. The notion of a limit is one that has always presented grave difficulties to mathematical students even of great ability. It has been my good fortune during the last eight or nine years to have a share in the teaching of a good many of the ablest candidates for the Mathematical Tripos; and it is very rarely indeed that I have encountered a pupil who could face the simplest problem involving the ideas of infinity, limit, or continuity, with a vestige of the confidence with which he would deal with questions of a different character and of far greater intrinsic difficulty. I have indeed in an examination asked a dozen candidates, including several future Senior Wranglers, to sum the series 1 + + 2+..., and not received a single answer that was not practically worthless-and this from men quite capable of solving difficult problems connected with the curvature and torsion of twisted curves. I cannot believe that this is due solely to the nature of the subject. There are difficulties in these ideas, no doubt: but they are not so great as many other difficulties inherent in mathematics that every young mathematician completely overcomes. The fault is not that of the subject or of the student, but of the text-book and the teacher. It is not enough for the latter, if he wishes to drive sound ideas on these points well into the mind of his pupils,

Page VII

PREFACE vii to be careful and exact himself. He must be prepared not merely to tell the truth, but to tell it elaborately and ostentatiously. He must drill his pupils in 'infinity' and 'continuity', with an abundance of written exercises and examples, as he drills them at present in poles and polars or symmetric functions or the consequences of De Moivre's theorem. Then and only then he may hope that accurate thought in connexion with these matters will become an integral part of their ordinary mathematical habit of mind. It is this conviction that has led me to devote so much space to the most elementary ideas of all connected with limits, to be purposely diffuse about fundamental points, to illustrate them by so elaborate a system of examples, and to write a chapter of fifty pages without advancing beyond the ordinary geometrical series. It is not necessary for me to say much about the general plan of the next four chapters. The two chapters on the Calculus are no doubt more difficult than the rest of the book. I have perhaps been inconsistent in the standards that I have adopted: but I have been influenced by the feeling that I shall have few readers who will not already have acquired some familiarity with the technique of the Calculus from other sources. I felt this particularly when I was writing the sections on integration. I also felt that the student is apt to carry away from the books in general use the quite mistaken impression that all methods of integration are essentially of a tentative and haphazard character. I have therefore deliberately given an account of the theory more systematic and general than would be suitable for a normal first course in the Calculus. Chapters IX and X are devoted to the theory of the logarithm and exponential, starting from the definition of the logarithm as an integral. It was the desire to write an elementary account of this theory that originally led me to begin the book, and I have generally decided my choice of what was to be included in the earlier chapters by a consideration of what theorems would be wanted in the last two. I regard the book as being really elementary. There are plenty of hard examples (mainly at the ends of the chapters): to these I have added, wherever space permitted, an outline of the solution. But I have done my best to avoid the inclusion of anything that involves really difficult ideas. For instance, I make no

Page VIII

viii ~ * Vlll PREFACE use of the 'principle of convergence': uniform convergence, double series, infinite products, are never alluded to: and I prove no general theorems whatever concerning the inversion of limit-operationsd2f d~f I never even define and df In the last two chapters dxdy dydx' I have occasion once or twice to integrate a power-series, but I have confined myself to the very simplest cases and given a special discussion in each instance. Anyone who has read this book will be in a position to read with profit Mr Bromwich's Infinite Series, where a full and adequate discussion of all these points will be found. It will be found that certain classes of theorems and examples that are prominent in many English books are here conspicuous by their absence. I may refer particularly to the standard theorems concerning the expression of the trigonometrical functions as infinite products or series of partial fractions, and to that familiar type of example the gist of which lies in the 'picking out of coefficients' from some combination of infinite series. The proofs of these results depend upon general theorems that seemed to me intrinsically too difficult to be included in a book professing to be at the same time rigorous and elementary: and I am on the whole of opinion that, if any proposition is too difficult to be proved properly, its statement and application had better be postponed. I am well aware that there is much to be said on the opposite side. A very plausible case can be made out for the habitual exercise of the student in the application of results whose proof is too difficult for his full comprehension. But I have found that I cannot myself write a book on those lines: nor am I fully convinced that such exercise is either necessary or desirable. After all there are plenty of theorems which are not too difficult to prove: and, if anyone believes that a sufficient variety of analytical training cannot be based upon them, I hope that my collections of Miscellaneous Examples may do something to convince him. I may say that it is only in these collections that examples of the character of 'problems' will be found. The sets of examples inside each chapter consist either of perfectly straightforward applications of the preceding 'book-work ', or of summaries of parts of the theory for which there was no room in the text. They include many important theorems, some indeed to which reference is frequently

Page IX

PREFACE ix made later in the book. No one can be more convinced than I am of the value of 'examples' designed merely to train the student's memory and powers of manipulation: but I see no reason why all examples should necessarily be trivial. I trust, however, that readers will not find it irritating to be referred back from the middle of a section in large type to an example in an earlier chapter. My decision as to whether a result should appear in the text or in the examples has always been based upon the relation that it bears to the general theorems in connexion with which it is first proved rather than upon the amount of use that is made of it later on. I have throughout laid particular stress upon points that do not seem to me to be emphasized sufficiently in the text-books in general use, and passed rapidly over others that are of equal importance but stand in no such danger of neglect. Here again I have been influenced by the consideration that this book is likely to be used in conjunction with others rather than as a first text-book in any particular subject. There are two respects in which I have diverged from the usually accepted notation and that seem of sufficient importance to be noticed here. I have entirely rejected the index notation for inverse functions (cos- x, cosh-1 x) in favour of the usual Continental notation (arc cos x, arg cosh x or arg ch x). And I have followed Mr Leathem and Mr Bromwich in always writing lim, lim, lim Z X xo a-x:; a and not lim, lim, lim. This last change seems to me one of %-=o x =co X=C= considerable importance, especially when 'oo' is the 'limiting value'. I believe that to write 'n = oo, x = o' (as if anything ever were 'equal to infinity'), however convenient it may be at a later stage, is in the early stages of mathematical training to go out of one's way to encourage incoherence and confusion of thought concerning the fundamental ideas of analysis. The word 'quantity' occurs occasionally in the earlier chapters. It should be in each case altered to 'number'. Unfortunately I arrived at the decision never to use the term 'quantity' only after the earlier sheets had been passed for press. The books to which I am most indebted (besides the treatises on the Calculus already mentioned) are Mr Bromwich's Infinite

Page X

X PREFACE Series and M. J. Tannery's Lemons d'Algebre et d'Analyse. I must also acknowledge my obligations to a number of friends who have been kind enough to assist me in the preparation of the book. Mr Bromwich has read the whole of it (except Chapter III) either in manuscript or in proof, and a good deal of it twice; and I am indebted to him for corrections and suggestions on almost every page. Mr Berry read Chapters I, II, III, IX and X in manuscript, Professor J. E. Wright Chapters I, II, and III, and Dr Whitehead Chapters I and IV, and all gave me much valuable advice. In particular the earlier part of Chapter IV has been practically rewritten in consequence of Dr Whitehead's suggestions. I have also changed a good deal of Chapter VI in consequence of suggestions received from Dr Askwith. My thanks are also due to Messrs H. W. Turnbull and E. H. Neville, of Trinity College, who have between them read all the proofs and verified the examples: to the latter I am additionally indebted for the figures that appear in the Miscellaneous Examples to Chapter X. Finally I must express my gratitude to the readers and officials of the University Press for their close attention and unfailing courtesy. G. H. HARDY. TRINITY COLLEGE, CAMBRIDGE, September 1908.

Page XI - Table of Contents

CONTENTS. CHAPTER I. REAL VARIABLES. SECT. PAGE 1. Rational numbers......... 1 2. Irrational numbers........ 4 3, 4. Quadratic surds......... 7 5, 6. Irrational numbers in general...11 7, 8. The continuum and the continuous real variable... 15 Miscellaneous examples. 19 CHAPTER II. FUNCTIONS OF REAL VARIABLES. 9. The idea of a function...... 25 10. The graphical representation of functions. Coordinate geometry of two dimensions...... 28 11. The equation of a straight line...... 30 12. Polar coordinates.... 34 13. Polynomials......... 35 14, 15. Rational functions...... 38 16, 17. Algebraical functions....... 41 18, 19. Transcendental functions....... 44 20. Graphical solution of equations..... 50 21. Functions of two variables and their graphical representation 51 22. The equation of a plane..... 52 23. Curves in a plane.... 53 24. Loci in space........ 54 Miscellaneous examples...... 60

Page XII - Table of Contents

xii CONTENTS CHAPTER III. COMPLEX NUMBERS. SECT. PAGE 25-29. Equivalence, addition, and multiplication of displacements 66 30-33. Complex numbers. 75 34. The quadratic equation with real coefficients.. 78 35. The Argand diagram..... 81 36. De Moivre's theorem....... 82 37. Theorems concerning rational functions of complex numbers 84 38, 39. Formulae for cos n and sin nO...... 95 40-42. Roots of complex numbers..... 98 Miscellaneous examples. 101 CHAPTER IV. LIMITS OF FUNCTIONS OF A POSITIVE INTEGRAL VARIABLE. 43. Functions of a positive integral variable.... 108 44. Functional interpolation....... 109 45. Finite and infinite classes....... 110 46-50. Properties possessed by a function of n for large values of n. 111 51-54. Definitions of a limit, etc........ 117 55. Oscillating functions....... 123 56-63. General theorems concerning limits.....127 64, 65. Steadily increasing or decreasing functions.. 135 66. The limit of xn........ 138 67. The limit of (+ )....... 140 68. The limit of n(/x-l)....... 141 69, 70. Infinite series......... 142 71. The geometrical series. Decimals..... 145 72. The representation of functions of a continuous variable by means of limits...... 149 73, 74. Limits of complex functions and series of complex terms. 151 75, 76. Applications to Xn and to the geometrical series.. 153 Miscellaneous examples..... 154 CHAPTER V. LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE: CONTINUOUS AND DISCONTINUOUS FUNCTIONS. 77-80. Limits as x +coo or x-c..... 159 81-83. Limits as x a......... 162 84-87. Continuous functions of a real variable.... 171

Page XIII - Table of Contents

CONTENTS xiii SECT. PAGE 88. Inverse functions........ 178 89. The range of values of a continuous function... 180 90. Continuous functions of several variables... 182 Miscellaneous examples...... 183 CHAPTER VI. DERIVATIVES AND INTEGRALS. 91-93. Derivatives or differential coefficients.... 185 94, 95. Rules for differentiation. 192 96. The notation of the differential calculus..194 97. Differentiation of polynomials. 196 98. Differentiation of rational functions.. 198 99. Differentiation of algebraical functions. 200 100. Differentiation of transcendental functions... 200 101. Repeated differentiation....... 202 102. Some general theorems concerning derived functions. 205 103-105. Maxima and minima........ 206 106, 107. The Mean Value theorem...... 214 108, 109. Differentiation of a function of a function... 216 110-112. Integration.......... 218 113. Integration of polynomials...... 222 114. Integration of rational functions..... 222 115-120. Integration of algebraical functions.... 226 121. Integration by parts........ 230 122-126. Integration of transcendental functions... 233 127, 128. Areas and lengths of plane curves..... 236 Miscellaneous examples. 242 CHAPTER VII. ADDITIONAL THEOREMS IN THE DIFFERENTIAL AND INTEGRAL CALCULUS. 129. Taylor's theorem........ 251 130. Taylor's series........ 255 131. Application of Taylor's theorem to maxima and minima. 256 132. Application of Taylor's theorem to the calculation of limits 257 133. The contact of plane curves...... 259 134-136. Differentiation of functions of several variables.. 262 137-140. Areas of plane curves....... 271 141. Lengths of plane curves....... 276 142-144. The definite integral....... 277 145. Integration by parts and by substitution.. 284 146, 147. Proof of Taylor's theorem by integration by parts. 287 Miscellaneous examples 289

Page XIV - Table of Contents

xiv CONTENTS CHAPTER VIII. THE CONVERGENCE OF INFINITE SERIES AND INFINITE INTEGRALS. SECT. PAGE 148-150. Convergence and divergence of series of positive terms. 296 151. Cauchy's and d'Alembert's tests..... 297 152. Dirichlet's theorem...... 300 153. Multiplication of series of positive terms... 301 154, 155. Further tests for convergence and divergence... 302 156. Pringsheim's theorem.......304 157. Maclaurin's integral test...... 305 158. The series En-s........ 307 159. Cauchy's condensation test...... 308 160-162. Infinite integrals of the first kind.....309 163, 164. Infinite integrals of the second kind....316 165. Series of positive and negative terms.... 322 166, 167. Absolutely convergent series...... 323 168, 169. Conditionally convergent series..... 324 170. Alternating series........ 326 171. Abel's and Dirichlet's tests of convergence... 328 172. Series of complex terms...... 330 173-176. Power series......... 331 177. Multiplication of absolutely convergent series... 334 2Miscellaneous examples...... 335 CHAPTER IX. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS OF A REAL VARIABLE. 178, 179. The definition of log x when x is positive.. 341 180. The functional equation satisfied by logx...344 181-184. The manner in which log x tends to infinity. Scales of infinity.... 344 185. The number e........ 347 186-188. The exponential function e...... 348 189. The general power ax...... 350 190, 191. The representation of ex and log x as limits...351 192. Common logarithms..... 353 193. Logarithmic tests of convergence.....357 194. The exponential series.......360 195. The logarithmic series.......363 196. The series for the inverse tangent.....364 197. The binomial series....... 365 198. Alternative method of development of the theory..367 Miscellaneous examples. 36

Page XV - Table of Contents

CONTENTS xv CHAPTER X. THE GENERAL THEORY OF THE LOGARITHMIC, EXPONENTIAL, AND CIRCULAR FUNCTIONS. SECT. PAGE 199, 200. Functions of a complex variable.....376 201, 202. Curvilinear integrals.... 377 203. Definition of log......380 204. The values of log z.......381 205. The logarithm of a real negative number... 385 206-208. The exponential function. 386 209, 210. The general power a....... 388 211-214. The trigonometrical and hyperbolic functions of a complex variable......... 394 215. The logarithmic and inverse trigonometrical functions. 398 216, 217. The power series for expz, cosz, and sin... 399 218, 219. The logarithmic series....... 402 220. The general form of the binomial theorem... 405 Miscellaneous examples..... 407 APPENDIX I. The proof that every equation has a root.. 415 APPENDIX II. A note on double limit problems.... 420 INDEX.... 423

Page 1

CHAPTER I. REAL VARIABLES. 1. The aggregate of rational numbers and their representation on a straight line. On a straight line L, produced indefinitely in both directions, we take a segment AoAl of any length. We call Ao the origin, or the point 0, and A, the point 1. We now mark off a series of points... A_ -_ r A_n A-... A, A_ A,..., A_,... along L, so that... = A- Am1 A =...= A_,Ao = AoA =.., each segment being measured from left to right along L. B C A An1A mA_ A, A, A A, An tl FIG. 1. Then (A,A Then fAoA = n...........................(1), if n is any positive integer. We will now agree that length is to be regarded as a magnitude capable of sign, positive if the length is measured in one direction along L (e.g. from B to C) and negative if measured in the other (from C to B), so that CB=-BC. We take the positive direction for the measurement of length to be from left to right. AA_n A_nA, Then _ n AoAl AoA1 H. A. 1

Page 2

2 REAL VARIABLES [I Hence the equation (1) is true for all integral values of n, positive or negative. For the sake of uniformity we adopt the convention that (1) is also true when n = 0, in which case it reads AoAo =0. AoA1 That is to say, we agree to regard BB, which is not, properly speaking, a segment at all, as a segment of no length. Now let us take any positive proper fraction in its lowest terms, for example p/q, where p and q are positive integers without any common factor, and p< q. We divide AoAl into q equal parts by points of division which it is natural to denote by Ao, A1/q, A2/q,..., Ap/q,..., A(q-l)/q, A1. It is evident that AoApq _ P........................... ( 2). AoA1 q We thus obtain points on the line L corresponding to all such proper fractions p/q. Any improper fraction may be expressed in the form n + (p/q), where n is a positive integer and p/q a proper fraction. If we take a point An+(pIq) such that AnAn+(pq) = AoAp/q, it is evident that AAn+(plq) = n +p and if we thus find points An+(ppq) A()AI q corresponding to all possible positive values of n, p, and q, we shall have a point Af corresponding to all possible positive integral or fractional values of f, and such that AoAf.(3 A-o = - f........................... (3). Finally, if -f is a negative fraction, proper or improper, we take A_f so that Af Ao = AoAf, or AoA_f AoAf AoA, AoA1 Thus we are able to determine a point A, corresponding to any integral or fractional value of r, positive or negative, and such that AoA = r.....................(4). A0Ar()

Page 3

1] REAL VARIABLES 3 If we take, as is natural, the length AoA, as our unit of length, so that AoA, = 1, the equation (4) becomes AA,= r...........................(5). DEFINITIONS. Any fraction r=p/q, where p and q are positive or negative integers, is called a rational number. The points Ar of the line L, which correspond to the rational numbers r in the manner explained above, are called the rational points of the line. We can suppose (i) that p and q have no common factor, as if they have a common factor we can divide each of them by it, and (ii) that q is positive, since p(-q)=(-p)/q, (-p)/(-q) = p/. The notion of a rational number obviously includes as a particular case that of an integer, since any integer may be expressed as a fraction whose denominator is unity. Examples I. 1. If r and s are rational numbers, r +s, r-s, rs, and r/s are rational numbers, unless in the last case s=0 (when r/s is of course meaningless). 2. If P and Q are rational points, and PQ is divided into any number of equal parts, each of the points of division is a rational point. 3. If X, m, and n are positive rational numbers, X (m2- n2), 2Xmn, and (n,2 + n2) are positive rational numbers. Hence show how to determine any number of right-angled triangles the lengths of all of whose sides are rational. 4. Any terminated decimal represents a rational number whose denominator contains no factors other than 2 or 5. Conversely, any such rational number can be expressed, and in one way only, as a terminated decimal. [The general theory of decimals will be considered in Chap. IV.] 5. The positive rational numbers may be arranged in the form of a simple series as follows: 221 24 2 1 7 1, 2, 15 U5 -6 1 2.3 4... Show that p/q is the [~ (p + q -1) (p + q -2) + q]th term of the series. [In this series every rational number is repeated indefinitely. Thus 1 occurs as }, 1-, |,.... We can of course avoid this by omitting every number which has already occurred in a simpler form, but then the problem of determining the precise position of p/q becomes more complicated.] 1-2

Page 4

4 REAL VARIABLES [I 2. Irrational numbers. If the reader will mark off on the line all the points corresponding to the rational numbers whose denominators are 1, 2, 3,... in succession, he will readily convince himself that he can cover the line with rational points as closely as he likes. We can state this more precisely as follows: if we take any segment BC on L, we can find as many rational points as we please on BC. Suppose, for example, that BC falls within the segment A1A2. It is evident that if we choose a positive integer k so that k. BC > A A........................... (1) and divide A1A2 into k equal parts, at least one of the points of division (say P) must fall inside BC, without coinciding with either B or (. For if this were not so BC would be entirely included in one of the k parts into which A1A2 has been divided, which contradicts the supposition (1). Thus at least one rational point P lies between B and C. But then we can find another -such point Q between B and P, another between B and Q, and so on indefinitely; i.e., as we asserted above, we can find as many as we please. We may express this by saying that BC includes infinitely many rational points. From these considerations the reader might be tempted to infer that these rational points account for all the points of the line, i.e. that every point on the line is a rational point. And it is certainly the case that if we imagine the line as being made up solely of the rational points, all other points (if any such there be) being imagined to be eliminated, the figure which remained would possess most of the properties which common sense attributes to the straight line, and would, to put the matter roughly, look and behave very much like a line. There is, however, good reason for supposing that there are on the line points which are not rational points. Let us look at the matter for a moment with the eye of common sense, and consider some of the properties which we may reasonably expect a straight line to possess if it is to satisfy the idea which we have formed of it in elementary geometry. The straight line must be composed of points, and any segment! of it by all the points which lie between its end points. With

Page 5

2] REAL VARIABLES 5 any such segment must be associated a certain entity called its length, which must be a quantity capable of numerical measurement in terms of any standard or unit length, and these lengths must be capable of combination with one another according to the ordinary rules of algebra by means of addition or multiplication. Again, it must be possible to construct a line whose length is the sum or product of any two given lengths. If the length PQ, along a given line, is a, and the length QR, along the same straight line, is b, the length PR must be a+ b. Moreover, if the lengths OP, OQ, along one straight line, are 1 and a, and the length OR along another straight line is b, and if we determine the length OS by Euclid's construction (Euc. VI. 12) for a fourth proportional to the lines OP, OQ, OR, this length must be ab, the algebraical fourth proportional to 1, a, b. And it is hardly necessary to remark that the sums and products thus defined must obey the ordinary laws of algebra, such as a+b=b+a, a+(b+c)=(a+b)+c, ab=ba, and so on. The lengths of our lines must also obey a number of obvious laws concerning inequalities as well as equalities. Thus if A, B, C are three points lying along L from left to right, we must have AB < AG, and so on. Finally it must be possible, on our fundamental line L, to find a point P such that AoP is equal to any segment whatever taken along L or along any other straight line. Now it is very easy, by means of various elementary geometrical constructions, to construct a length x such that x2= 2. For example, we may construct an isosceles right-angled triangle A 1 B L 2 M N FIG. 2. ABC such that AB=AC=1. Then if BC=x, x2=2. Or we may determine the length x by means of Euclid's construction

Page 6

6 REAL VARIABLES [I (Euc. VI. 13) for a mean proportional to 1 and 2, as indicated in the figure. It follows that there must be a point P on L such that AoP = x, 2 = 2. But it is easy to see that there is no rational number such that its square is 2. In fact we may go further and say that there is no rational number whose square is m/n, where m/n is any positive fraction in its lowest terms, unless m and n are both perfect squares. For suppose, if possible, that p2_ qp n p having no factor in common with q, and m no factor in common with n. Then np2 = mq2. Every factor of q2 must divide unp2, and as p and q have no common factor, every factor of q2 must divide n. Hence In = Xq2, where X is an integer. But this involves m = Xp2: and as m and n have no common factor, X must be unity. Thus m = p2, n = q2, as was to be proved. We are thus led to believe in the existence of a point P, not one of the rational points already constructed, and such that AP=, 2=2; and (as the reader will remember from elementary algebra) we write x= V/2. And if Q is the point such that QAo = AoP, we write AoQ = - V2. The following alternative proof that /2 cannot be rational is interesting. Suppose, if possible, that piq is a positive fraction, in its lowest terms, such that (p/q)2=2 or p2 =22. It is easy to see that then we must have (2q-p)2=2(p-q)2, and so (2q-p)/(p-q) is another fraction having the same property. But clearly q<p<2q, and so p-q<q. Hence we obtain another fraction equal to pqi and having a smaller denominator, which contradicts the assumption that p/q is in its lowest terms. DEFINITION. Any point P on the line L which is not a rational point is called an irrational point. The length A,P is called an irrational number. Examples II. 1. Show from first principles, without assuming the general theorem proved above, that ^/2 is not a rational number.

Page 7

2, 3] REAL VARIABLES 7 2. Give a similar proof for /2. 3. Prove generally that, if p/q is a rational fraction in its lowest terms, /f(p/q) cannot be rational unless p and q are both perfect cubes. 4. The square root of an integer must be either integral or irrational (i.e. it cannot be a rational fraction). [For suppose, if possible, /Jn=p/q, where p, q are positive integers without a common factor. Then nq2=p2. Hence p2 divides n, as p and q have no common factor; i.e. n=X=p2, where X is an integer, and so Xq2=1, which shows that X=1, q=1, n=p2, and so j/n=p.] 5. A more general proposition, due to Gauss, is the following: if xn+pPl X-n Jr p2x" - + n. 2+...+ p=0 is any algebraical equation with integral coeiffcients, it cannot have a rational but not integral root. [For suppose that the equation has a root a/b, where a and b are integers without a common factor, and b is positive. Writing a/b for x, and multiplying by bn-l, we obtain a3n -ab == an -'+p2an-2 b+... +p bna fraction in its lowest terms equal to an integer, which is absurd. Thus b=1, and the root is a. It is evident that a must be a divisor ofpn.] 6. Show that if pn=l and p +p +...+pn -1, the equation cannot have a rational root. 7. Find the rational roots (if any) of 4 - 4x3 - 82+ 13x + 10 =0. [The roots can only be integral, and so +1, ~2, ~ 5, ~10 are the only possibilities: whether these are roots can be determined by trial. It is clear that we can in this way determine the rational roots of any such equation.] 3. Quadratic surds. If a is any rational number, the two numbers + /a are either rational or irrational, and (as appears from what precedes) generally the latter. Numbers of this kind, when irrational, are called pure quadratic surds. A number a + /b, the sum of a rational number and a pure quadratic surd, is sometimes called a mixed quadratic surd. The only kind of irrationals for whose existence the geometrical arguments of the preceding section have given us any warrant are these quadratic surds, pure and mixed, and the

Page 8

8 REAL VARIABLES [I more complicated irrationals which may be expressed in a form involving the repeated extraction of square roots, such as 2 +,/+2 +2 2 + /2 + ~12. It is easy to construct geometrically a line whose length is equal to any number of this form, as the reader will easily see for himself. That only irrationals of these kinds can be constructed by Euclidean methods (i.e. by geometrical constructions with ruler and compasses) is a point the proof of which must be deferred for the present*. This particular property of quadratic surds naturally makes them peculiarly interesting. Examples III. 1. Give geometrical constructions for /2, %2+,/2, V2+/2+,/2. 2. The quadratic equation aX2 +2bx+c-O has two real roots if b2-ac>0. Suppose a, b, c rational. Nothing is lost by taking all three to be integers, for we can multiply the equation by the L.C.M. of their denominators. The reader will remember that the roots are {-b+ /(b2-ac)}/a. It is easy to construct these lengths geometrically, first constructing ^/(b2-ac). A much more elegant, though less straightforward, construction is the following. Draw a circle of unit radius, a diameter PQ, and the tangents at the ends of the diameters. Q' Y Q X FIG. 3. * See Chap. II, Misc. Exs. 41. t I.e. there are two values of x for which ax2 + 2bx + c =. If b2- ac <0 there are no such values of x. The reader will remember that in books on elementary algebra the equation is said to have two 'imaginary' roots. The meaning to be attached to this statement will be explained in Chap. III. When b2 =ac the equation has only one root. For the sake of uniformity it is generally said in this case to have 'two equal' roots, but this is a mere convention.

Page 9

REAL VARIABLES 9 Take PP'= - 2a/b and QQ'= - c/2b, having regard to sign*. Join P'Q', cutting the circle in M and N. Draw PM and PN, cutting QQ' in X and Y. Then QX and QY are the roots of the equation with their proper signst. The proof is simple and we leave it as an exercise to the reader. Another, perhaps even simpler, construction is the following. Take a line AB of unit length. Draw BC= -2b/a perpendicular to AB, and CD=c/a perpendicular to BC and in the same direction as BA. On AD as diameter describe a circle cutting BC in X and Y. Then BX and BY are the roots. 3. If ac is positive PP' and QQ' will be drawn in the same direction. Verify that if b2<ac P'Q' will not meet the circle, while if b2=ac it will be a tangent. Verify also that if b2=ac the circle in the second construction will touch BC. 4. Some theorems concerning quadratic surds. We shall assume that the reader is familiar with the ordinary rules for the manipulation of quadratic surds; such, e.g., as are expressed by the equations /(pq) = /Vp. vq, (p2q)=p Vq. He will find it a useful exercise at this stage to supply proofs of these equations. Similar and dissimilar surds. Two pure quadratic surds are said to be similar if they can be expressed as rational multiples of the same surd, and otherwise dissimilar. Thus V8 = 2 V2, V/2 = 2 V/2, and so /8, V2/5 are similar surds. On the other hand, if M and N are integers which have no common factor, and neither of which is a perfect square, /M and \/N are dissimilar surds. For suppose, if possible,,\/M a VN =O r / a where all the letters denote integers. Then /JMN is evidently rational, and therefore (Ex. II. 4) * The figure is drawn to suit the case in which b and c have the same and a the opposite sign. The reader should draw figures for other cases. t I have taken this construction from Klein's Lecons sur certaines questions de Geometrie Elem'entaire (French translation by J. Griess, Paris, 1896).

Page 10

10 REAL VARIABLES [I integral. Thus MN= P2 where P is an integer. Let a, b, c,.. be the prime factors of P, so that MN = a2b2c2.... Then MN is divisible by a2, and therefore either (1) M is divisible by a2, or (2) N is divisible by a2, or (3) M and N are both divisible by a. The last case may be ruled out, since M and N have no common factor. This argument may be applied to each of the factors a2, b2, 2,.... Ultimately we see that M must be divisible by some of these factors and N by the rest. Thus M = XP,2, N = XP2, where P2 denotes the product of some of the factors a2, b2, c2,... and P22 the product of the rest. Since M and N have no common factor we must have X = 1, = P12, N = P22; i.e. M and N are both perfect squares, which is contrary to our hypotheses. THEOREM. If A, B, C, D are rational and A + /B = C + \D, then either (i) A = C, B = D or (ii) B and D are both squares of rational numbers. If A is not equal to C, let A = C + x. Then, /B = x + V/D, or B = 2 + D +2x D; i.e. /) = (B - D - x2)/2x, which is rational, and therefore D is the square of a rational number. In this case \/B = C- A + V/D is also rational. On the other hand, if A = C it is obvious that B = D. Corollaries. (i) If A + V/B = C + /D, then A - B = C- ID (unless \/B and \/D are both rational). (ii) The equation \/B = C + /-D is impossible unless C= 0, B = D, or both V/B and /D are rational. Examples IV. 1. Prove ab initio that ^/2 and V3 are not similar surds. 2. Prove that Lx and V/(1/x) are similar surds (unless both are rational). 3. If a and b are positive and rational /a + Vb cannot be rational unless >/a and V/b are rational. The same is true of a - ^/b, unless a= b.

Page 11

4, 5] REAL VARIABLES 11 4. If J/A +/B=JC+ JD, then either (a) A = C and B=D, or (b) A =D and B=C, or (c) /A, V/B, /C, /D are all rational or all similar surds. [Square the given equation and apply the theorem above.] 5. A quadratic surd cannot be the sum of two dissimilar quadratic surds. 6. Neither (a + /b)3 nor (a- V/b)3 can be rational unless /b is rational. 7. Prove that if x=p+ /q, where p and q are rational, xw, where m is any integer, can be expressed in the form P+Q/ q, where P and Q are rational. For example, (p + q)2=ps + + 2p Vq, (p+ q)3=p3+3p + (3p2+q) Vq. Deduce that any polynomial in x with rational coefficients (i.e. any expression of the form a xn+ al,Xn-l1 +...+ an where ao,... a, are rational numbers), can be expressed in the form P+ Q Jq. 8. Express l/(p+/Vq) in the same form. We obtain 1 = p - - q p +Vq p2- q p_ q 9. Deduce from Exs. 7 and 8 that any expression of the form G (x)/H (x), where G (x) and H (x) are polynomials in x with rational coefficients, can be expressed in the form P + Q Vq, where P and Q are rational. 10. If a + Vb, where b is not a perfect square, is the root of an algebraical equation with rational coefficients, then a-,/b is another root of the same equation. 11. If p, q, and p2 - q are positive we can express /p+ V/q in the form V/x +/y, where =- {p+VJ( 2-)}, y= 2 {p- V(p2- _)}. 12. Determine the conditions that it may be possible to express Vp/+ Vq, where p and q are rational, in the form ^/x + /y, where x and y are rational. 13. If a2- b is positive, the necessary and sufficient conditions that V(a+ Vb) + V(a - Vb) should be rational are that a2 - b and ~ (a + /a2 - b) should both be squares of rational numbers. 5. Irrational numbers in general. The arguments which led us to believe in the existence of quadratic surds, and corresponding points on the line L, were based on considerations of elementary geometry. There is, however, another way of looking at the matter which is even more instructive and important, as it leads us to consider classes of irrational numbers far more general than quadratic surds.

Page 12

12 REAL VARIABLES [I Consider the equation x2 = 2. We have already seen that there is no rational number x which satisfies this equation. The square of any rational number is either less than or greater than 2. We can therefore divide the rational numbers into two classes, those whose squares are less than 2, and those whose squares are greater than 2. We call these two classes the class T, or the lower class, and the class U, or the upper class. It is obvious that every member of U is greater than all the members of T. Moreover, we can find a member of the class T whose square, though less than 2, differs from 2 by as little as we please. In fact, if we carry out the ordinary arithmetical process for the extraction of the square root of 2 we obtain a series of rational numbers, viz. i, 1-4, 1-41, 1-414, 1-4142,... whose squares 1, 1-96, 1-9881, 1'999396, 1-99996164,... are all less than 2, but approach nearer and nearer to it; and by taking a sufficient number: of the figures given by the process, we can obtain as close an approximation as we want. Similarly we can find a member of the class U whose square, though greater than 2, differs from 2 by as little as we please. It is sufficient to increase the last figure, in the series of approximations given above, by unity: we obtain 2, 1-5, 142, 1-415, 14143,.... Or again, we can find a member of T and a member of U which differ from one another by as little as we please. This follows at once from the fact that every rational number belongs to one class or the other. A formal proof may be supplied as follows. Take any member x of T and any member y of U. Let i be any positive integer, and consider the numbers 1 2.. n-1, x +n(y-x), x+(y-x)...... -, y. Each of these is rational and belongs either to T or to U. Let x + (r/n) (y- x) be the first which belongs to T. Then x+ {(r+ )/n} (y- x) belongs to U. Thus we have found a member of T and a member of U which differ by (y- x)n. And by taking a large enough value of n we can make this difference as small as we like. We add a formal proof that an x can be found in T and a y in U such that x2<2 and y2>2, but both squares differ from 2 by as little as we please.

Page 13

5] REAL VARIABLES 13 Suppose we want each difference to be less than E (where e may be, say, '01 or ~0001 or '00001). We can, in virtue of what precedes, choose x and y so that y-x<le We may obviously suppose both x and y less than 2, since x2<2 and y is nearly equal to x. Then 2 -2= (y - x) (y+x) < 4 (y- x) < E, and since x2 < 2 and y2 >2 it follows d fortiori that 2 - x2 and y2 - 2 are each less than e. We have thus divided all the positive rational points on L into two classes T and U such that (i) the class U lies entirely to the right of the class T, (ii) we can find a pair of points, one in T and one in U, whose distance from one another is as small as we please. And our common-sense notion of the attributes of a straight line demands the existence of a number x and a corresponding point P such that P divides the class T from the class U. T T TTT UU U U Ao FIG. 4. But (1) this number x cannot be rational. For if it were, P would belong either to the class T or the class U, let us say the former. Then x2 < 2, or x2 = 2 - 8, say, where 8 is some positive number. But we can find a member of the class T whose square is as near to 2 as we like; and therefore we can find such a member of T whose square is greater than 2 - 8, i.e. greater than x2. That is to say, we can find a member of T which lies to the right of P: which is absurd. Hence P cannot belong to T. Similarly it cannot belong to U. Again (2) x2 cannot be either less than or greater than 2. If it were less than 2 we could, as above, find members of T to the right of P. This hypothesis is therefore untenable, and x2 is not less than 2. Similarly it is not greater than 2.

Page 14

14 REAL VARIABLES [I Hence there is a point P and a number x such that AoP =x, x2=2. This number x we denote by /2. Examples V. 1. Find the difference between 2 and the squares of the decimals given in ~ 5 as approximations to ^/2. 2. Find the differences between 2 and the squares of 1 3 1 17 41 99 1 21 5) 12' 29' 70' 3. Show that if m/n is a good approximation to V/2, then (m+2n)/(m+n) is a better one, and that the errors in the two cases are in opposite directions. Apply this result to continue the series of approximations in the last example. 4. If x and y are approximations to ^/2, by defect and by excess respectively, and 2 - x2 < c, y2 - 2 < e, then y - x < e. 5. The equation x = 4 is satisfied by x= 2. Examine how far the argument of the preceding sections applies to this equation (writing 4 for 2 throughout). [We define the classes T, U as above. But in this case they do not include all rationals. The rational number 2 is an exception, since 22 is neither less than or greater than 4. And as before we are led to suppose the existence of a dividing point. But we cannot, of course, prove that this is not a rational point. It is, in fact, the point x= 2.] 6. But the preceding argument may be applied to equations other than x2= 2, almost word for word; for example to x2-=N, where N is any integer which is not a perfect square, or to x3= 3, x3= 7, X423, or, as we shall see later on, to x =3 +8. We are thus led to believe in the existence of points P on L such that x = AoP satisfies equations such as these, even when these lengths cannot be constructed by means of elementary geometrical methods. The reader will no doubt remember that in treatises on elementary algebra the root of such an equation as q = n is denoted by /n or nl/q, and that a meaning is attached to such symbols as nPIq, n-Plq by means of the equations nP= /nP, nPIq n-P/q= 1.

Page 15

5-7] REAL VARIABLES 15 And he will remember how, in virtue of these definitions, the 'laws of indices' such as r x nl =,+s, (r)s = ns are extended to cover the case in which r and s are any rational numbers whatever. 7. The continuum. The aggregate of points contained in a straight line L is called a linear continuum. It contains (1) the rational points, (2) the irrational points for whose existence we have the evidence summarized in the preceding sections, and the corresponding negative irrational points on the left of Ao, (3) all other points of the line, if any such there be. To each of these points corresponds a length measured from A,, and capable of numerical measurement in terms of our unit-length AoA,. The measures of these lengths are the real numbers, positive or negative, integral, rational or irrational. The aggregate of all these numbers is called an arithmetical continuum. All the numbers contained in this arithmetical continuum may be operated with according to the ordinary rules of elementary algebra. The substance of the preceding sections is not intended as a complete or rigorous analysis of the nature of either the linear or the arithmetical continuum. Such an analysis would be altogether beyond the scope of this book. What has been said is intended simply to remind the reader of some of the ideas on the subject which he no doubt already possesses, and to attempt to make them, and some of the obvious consequences which are involved in them, more explicitly present to his mind. In order to show the incompleteness of the analysis of the numbers of the arithmetic continuum which has been given, we need only consider a few examples. (i) Let us consider a more complicated surd expression, such as z =- (4 +4/15) + (4 - Vl15). Our argument for supposing that the expression for z has a meaning, and that a point P exists on the line such that Ao P=z, might be as follows. We first show, as above, that there is a point P1 such that if y=AoP1, y2=15, and we can then determine points corresponding to the numbers 4+^/15, 4- ^/15. Now consider the equation in zl 13 = 4 /15.

Page 16

16 REAL VARIABLES The right-hand side of this equation is not rational: but exactly the same reasoning which leads us to suppose that the line contains a point x for which x3=2 (or any other rational number) also leads us to the conclusion that it contains a point zl for which z13=4++/15. Thus we find a point P' such that zl = AoP'= /(4 -./15). Similarly we find a point P" such that Z2= AoP" - /(4-^ 15), and taking AoP=AoP'+AoP" we have finally AoP = z =(4+ /15) + /(4 - /15). Now it is easy to verify that 3= 3z + 8. And we might have given a direct proof of the existence of a unique number z such that z3=3z+8. It is easy to see that there cannot be two such numbers. For if,3= 3z1 +8 and 23= 32 + 8, we find on subtracting and dividing by Z1 -Z2 that z12+zxz + z22=3. But if z1 and Z2 are positive 13> 8, z223>8 and therefore z1 > 2, 2> 2, Z12 + I2 + 22> 12, and so the equation just found is impossible. And it is easy to see that neither zl nor Z2 can be negative. For if z1 is negative and equal to -, ~ is positive and _3-3~+8=0, or 3-C2=8/(. Hence 3- 2>0, and so C<2. But then 8/> 4 and cannot be equal to 3- 2, which is less than 3. Hence there is at most one z such that z3=3z+8. And it cannot be rational. For any rational root of this equation must be integral and a factor of 8 (Ex. II. 5), and it is easy to verify that no one of +1, +2, +4, + 8, is a root. Thus z3=3z+8 has at most one root and that root is not rational. We can now define the positive rational numbers x into two classes T, U according as x3 < 3x+8, or X3> 3x +8. It is easy to see that if x3 > 3x+8 and y is any number greater than x, then also y3>3y+8. For suppose if possible y3 _ 3 + 8. Then since x3> 3x + 8 we obtain on subtractingy3 - x3< 3 (y - x), or y2+xy+x2<3, which is impossible, since y is positive and x>2 (since 3> 8). Similarly we can show that if x< 3x + 8 and y < x then also y3< 3y+ 8. Thus we have separated the rational numbers into two classes similar to the classes T, U of ~ 5. And we conclude, as there, that there is a number z which is greater than any number of T, and less than any number of U, and which satisfies the equation z3= 3z+8. The reader who knows how to solve cubic equations by Cardan's method will be able to obtain directly from the equation the expression z= /(4 +15)+ V(4 -15).

Page 17

REAL VARIABLES 17 (ii) The direct argument applied above to the equation = 3x + 8 could be applied (though the application would be a little more difficult) to the equation x= x + 16, and would lead us to the conclusion that a unique positive number exists which satisfies this equation. In this case, however, it is not possible to obtain a simple explicit expression for x composed of any combination of surds. It can in fact be proved (though the proof is difficult) that it is generally impossible to find such an expression for the root of an equation of higher degree than 4. Thus, besides irrational numbers which can be expressed as pure or mixed quadratic or other surds, or combinations of such surds, there are others which cannot be so expressed. It is only in very special cases that such expressions can be found. (iii) But even when we have added to our list of irrational numbers roots of equations (such as x5 = x + 16) which cannot be explicitly expressed as surds, we have not exhausted the different kinds of irrational numbers contained in the continuum. Let us draw a circle whose diameter is equal to AoA1, i.e. to unity. It is natural to suppose that the circumference of such a circle has a length capable of numerical measurement as much as the diagonal of a square described on AoA,. This length is usually denoted by vr. And it has been shown (though the proof is unfortunately long and difficult) that this number 7r is not the root of any algebraical equation with integral coefficients, so that we cannot have, for example, any such equation as '72 = n, 7r3 = n, 7r5 = 7r + n, where n is an integer. If we take a point P such that AoP = 7, we have found a point which is not rational nor yet belongs to any of the classes of irrationals which we have so far considered. And this number 7r is no isolated or exceptional case. Any number of other examples can be constructed. In fact it is only special classes of irrational numbers which are roots of equations of this kind, just as it is only a still smaller class which can be expressed by means of surds. H. A. 2

Page 18

18 REAL VARIABLES [I Examples VI. 1. Show that x-= /(5 + 2 ^6) + /(5 -2 J/6) satisfies the equation x3=3x+ 10, and apply to this equation arguments similar to those used in ~ 7 (i). And, more generally, x-= / {n+ /(n2 - 1)}+ /{n -(n2 1)} satisfies 3 = 3x + 2n. Consider this equation similarly, n being any positive integer. 2. Consider the equation x2-4x+3=0. It is easy to see that x= and x=3 are roots of this equation. If we divide the rational numbers into two classes T and U according as x2- 4x+3 0 O, we see that T contains all rational numbers between 1 and 3 and U all less than 1 or greater than 3. In this case we are not led to any irrational number, the numbers which divide the classes being 1 and 3. But if we consider instead the equation 2- 4x+ 1 =0, (of which the roots are 2 ~+ /3), we again have two points of division, in this case each irrational: and we might argue directly from the equation to the existence of two such numbers by dividing up the rational numbers into classes T, U as above. 8. The continuous real variable. The 'real numbers' may be regarded from two points of view. We may think of them as an aggregate, the 'arithmetical continuum' defined in the preceding section, or individually. And when we think of them individually, we may think either of a particular specified number (such as 1, - 1, 1/2, or 7r) or we may think of any number, an unspecified number, the number x. This last is our point of view when we make such assertions as 'x is a number, 'x is the measure of a length,' 'x may be rational or irrational.' The x which occurs in propositions such as these is called the continuous real variable: and the individual numbers are called the values of the variable. A 'variable,' however, need not necessarily be continuous. Instead of considering the aggregate of all real numbers, we might consider some partial aggregate contained in the former aggregate, such as the aggregate of rational numbers, or the aggregate of positive integers. Let us take the last case. Then in statements about any positive integer, or an unspecified positive

Page 19

REAL VARIABLES 19 integer, such as 'n is either odd or even,' n is called the variable, a positive integral variable, and the individual positive integers are its values. In fact, this x and n are only examples of variables, the variable whose 'field of variation' is formed by all the real numbers, and that whose field is formed by the positive integers. These are the most important examples, but we have often to consider other cases. In the theory of decimals, for instance, we may denote by x any figure in the expression of any number as a decimal. Then x is a variable, but a variable which has only ten different values, viz. 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. The reader should think of other examples of variables with different fields of variation. He will find interesting examples in ordinary life. For instance-policeman x, the driver of cab x, star x in Herschel's catalogue, the year x, the xth day of the week. MISCELLANEOUS EXAMPLES ON CHAPTER I. 1. If a, b, c,... k and A, B, C,... K are two sets of numbers, and all of the first set are positive, then aA +bB+...+kK a+b+...+k lies between the algebraically least and greatest of A, B,..., K. 2. What are the conditions that ax+by+cz=0, (1) for all values of x, y, z; (2) for all values of x, y, z subject to ax+ 3y+yz=0; (3) for all values of x, y, z subject to both ax+/3y +yz=0 and Ax +By+ Cz=O 3. Any positive rational number can be expressed in one and only one way in the form al+ a2 c+ 3 + + ak al + 1. 2. + 3 + + 1.2.3... k' where al, a2,..., ak are integers, and 0 - al, 0 - aa2<2, 0 a 3,... 0 _ ak< k. 47 11280 7 1611+3 3 6-268+3 [For example - = 7! = 7! 7!+ 6! 3 3 5-63+3 3 3 3 4-13+1 7!t+6!+ 5! 7!+ 6! 5! 4! 3 3 3 1 1 0 = + + + + -+ 2, by continuing the same process. It is evident that k is at most equal to the largest prime factor of the denominator of the number given.] 2-2

Page 20

20 MISCELLANEOUS EXAMPLES ON CHAPTER I 4. Any positive rational number can be expressed in one and one way only as a simple continued fraction 1 1 1 al + -I a2 +a3+... +an, where al, a2,... are positive integers, of which the first only may be zero. [Accounts of the theory of such continued fractions will be found in textbooks of algebra.] 5. Find the rational roots (if any) of 9x3 - 6x2 + 15x - 10=0. [Put 3x=y and apply the method of Ex. II. 7 to the resulting equation iy.] 6. A line AB is divided at C in aurea sectione (Euc. II. 11)-i.e. so that AB. AC=BC2. Show that the ratio AC/AB is irrational. [A direct geometrical proof will be found in Bromwich's Infinite Series, ~ 143, p. 363.] 7. A is irrational. In what circumstances can aA +, where a, b, c, d cA+d' are rational, be rational? 8. Express,/p, ^/q in the form ax+(b/x), where a, b are rational, and x ==p+/4q. 9. If N/p,,/q are dissimilar surds, and a+b Jp+c /q+d./(pq)=O, where a, b, c, d are rational, then a=0, b=O, c=0, d=0. [Express ^/p in the form M+ NJ/q, where M and N are rational, and apply the theorem of ~ 4.] 10. Show that if a /2 + b,/3+ c 5 /5=, where a, b, c are rational numbers, then a=O, b=0, c=0. 11. Any polynomial in 4/p and /q, with rational coefficients, (i.e. any sum of a finite number of terms of the form A (/p)m (4/q)n, where m and n are integers, and A rational) can be expressed in the form a+rb s/p+c Vq+d /pq, where a, b, c, d are rational. 12. Express a+b N/p+ c q 12. Express+ eb /p+, /q where a, b, etc. are rational, in the form d +-e dp+.qf Jq ' A +B J,/+ C Jq+ D,/pq, where A, B, C, D are rational. [Evidently a+bJ^p+c/q J (a+b ^Jp+c4/q) (d+eJp-fVJq) a+3 ^,/p+y/q+a^tpq d+e/p+f J/q (d+e p)2-f 2q E+(/p where a,,, etc. are rational numbers which can easily be found. The required reduction may now be easily completed by multiplication of numerator and denominator by E- ^/p.]

Page 21

MISCELLANEOUS EXAMPLES ON CHAPTER I 21 For example, prove that 1 1 1 6. 1+,/2+3= 2 4 4 13. If a, b, x, y are rational numbers such that (ay - bx)2+ 4 (a-x) (b - y) =0, prove that either (i) = a, y = b, or (ii) 1 - ab and 1 - xy are squares of rational numbers. (Math. Tripos, 1903.) [If we write a-x=, b-y=t-, we obtain a2r2 + b22 +(4- 2ab). =0. Solving this equation for the ratio /lr we find that $/r (which we know to be rational) involves the quantity J {(2 - ab)2 - a2b2}= 2 J(1 - ab). Hence 1 - ab must be the square of a rational quantity. The only alternative is = 7 = 0. But the equation given may also be written in the form X2q2+y22 +(4- 2xy) = 0. Hence we deduce the same conclusion for ^/(1 - xy).] 14. If all the values of x and y given by ax2+2hxy+by2 =1, a'x+ 2h'xy + by2= 1, (where a, h, b, a', A', b' are rational) are rational, then (h - h')2 - (a- a') (b - b (ab' (ab - a'b)2+ 4 (ah' - a'h) (bh' - b'h), are both squares of rational quantities. (Math. Tripos, 1899.) 15. Show that ~/2 and,/3 are cubic functions of ^/2 +/3, with rational coefficients, and that ~/2-,/6+3 is the ratio of two linear functions of V2+V/3. (Math. Tripos, 1905.) 16. The expression Va+2mn a- -n2 + V/a - 2m N/a - m2 is equal to 2m if 2m2>a>m2, and to 2,/(a- m2) if a>2m2. 17. Show that any polynomial in,/2, with rational coefficients, can be expressed in the form a+b,/2+c /4, where a, b, c are rational. More generally, any polynomial in /p, with rational coefficients, can be expressed in the form ao+alx + a2x 2 +... + aCw_1 XM-'1, where a0, al,... are rational and x-=p. For any such polynomial is of the form

Page 22

22 MISCELLANEOUS EXAMPLES ON CHAPTER I where the b's are rational. If k _ m-1 this is already of the form required. If k>m -1, let xr be any power of x higher than the (m - l)th. Then r=-=m +s, where X is an integer and 0(s_ m-1: and Xr=X J+s =p s. Hence we can get rid of all powers of x higher than the (m- l)th. 18. Express (4/2-1) in the form a+b 4/2+c/ 4, where a, b, c are rational. 19. Express (12 -1)/(/2 +1) in the same form. [Multiply numerator and denominator by 4/4 - /2 + 1.] 20. If a + b /2+c 4 4=0, where a, b, c are rational, then a = O, b = 0, c = 0. [Let y=/12. Then y3=2 and cy+by+ a =0. Hence 2cy2+2by+ay3=O or ay 2+2cy+2b=0. Multiplying these two quadratic equations by a and c and subtracting we obtain (ab - 22) y + a2 - 2bc = O, or y = - (a2 - 2bc)/(ab - 2c2), a rational number, which is impossible. The only alternative is that ab - 2c2 =0, a2 - 2b = 0. Hence ab=2c2, a4=4b2c2. If ab-#0 we can divide the second equation by the first, which gives a3=2b3: and this is impossible, since 4/2 cannot be equal to the rational quantity a/b. Hence ab= 0, c=0, and it follows from the original equation that a, b, and c are all zero. As a corollary, if a + b /2 + c /4= d + e 2 +f /4, then a =d, b =e, c=f. It may be proved, more generally, that if ao+ alp lm +... + am lp(- m 0 p not being a perfect rnth power, then ao = al =...= a,,- = 0; but the proof is by no means so simple.] 21. Prove the theorem of ~ 4 by the method employed in Ex. 20. 22. If A + /B = C+ JD, then either A = C, B = D, or B and D are both cubes of rational quantities. [Assume A = C+x, cube, and apply the result of Ex. 20.] 23. If /A+4 B + /C=0, then either one of A, B, C is zero, and the other two equal and opposite, or 4/A, 4/B,,/C are rational multiples of the same surd I'X. 24. Find rational numbers a, 3 such that 4/(7+5 V2)=a+3,/2.

Page 23

MISCELLANEOUS EXAMPLES ON CHAPTER I 23 25. If (a - b3) b >, a+ 3b /9b3+a /(a3b + a —3b 3b __/__ I ___( -6 9b3+a /a-b3 is rational. [Each of the quantities under a cube root can be expressed in the form {+/3 /(a- b33' where a and i are rational.] 26. If a= VA, any polynomial in a is the root of an equation of degree n, with rational coefficients. [We can express the polynomial (x say) in the form X= 1 + mla+... +la(i - 1)/, where l1, ml,... are rational. Similarly 2 = 12 2a +. + rl a ~ 1)/.................................... xn= n+m,, a+... +-na (n- l)/ Hence Llx LSX2 +-... + L xn = A, where A is the determinant 11 ml -... 12 2... 72 In Mn... rn and L1, L2,... the minors of 11, 12....] 27. Apply this process to x=p + /q. [The result is 2 -2px+ (p2- q) = 0.] 28. Deduce from the result of the last example that if p+d/q=r+^s, either p=r, q=s or q, s are squares of rational quantities. [It is easy to see that if p + /q is not rational we must have z2 - 2px + (p2 - q) - 2rx + (r2 - s).] 29. Show that y= a + bp1 +cp23 satisfies the equation y3 - 3ay2 + 3y (a2 - bep) - a3 - b3p - c3p2 + 3abcp = 0. 30. Algebraical numbers. We have seen that some irrational numbers (such as,/2) are roots of equations of the type aOxn + -alxn-1 +... + an = 0, where ao, al,..., an are integers. Such irrational numbers are called algebraical numbers: all other irrational numbers, such as 7r (~ 7), are called transcendental numbers. Show that if x is an algebraical number so are kx, where k is any rational number, and xm/l, where m and n are any integers.

Page 24

24 MISCELLANEOUS EXAMPLES ON CHAPTER I 31. If x and y are algebraical numbers, so are x +y, x - y, xy and x/y. [We have equations aoxm+alx —l+....+fam=O, b0yn+ byn-l +... + b,, =0, where the a's and b's are integers. Write x+y=z, yz- x in the second, and eliminate x. We thus get an equation of similar form oZP +Cl z-l +... +cp=0, satisfied by z. Similarly for the other cases.] 32. If aox+aln-l+.. +an=O where ao, a1,..., an are any algebraical numbers, then x is an algebraical number. [We have n+ 1 equations of the type ao, r amr + al, r arm-1 +... + amr, r = 0 (rO0, 1,..., n), in which the coefficients a0,,., al,,.,... are integers. Eliminate a0, al,..., a, between these and the original equation for x.] 33. Apply this process to the equation x2 - 2x /2 +^/3 =0. [The result is x - 16x6 +584 - 48x2 + 9=- 0.] 34. Find equations, with rational coefficients, satisfied by /2 + J/3, 1 +^J2 +^ 3, /3, + /(2 /2)4 + (2 - /2), J3 -J22'/( ^{4(3+^12}+^{^ (3-2}, - /2+ /3, 1+4/2+2/3, /Y3+-/2 35. If x3= +1, then x3 = a,,x + b + c/, where an+1= a+bn, bn +=an+bn+ cn, C,+l=an+ C,. 36. If 6+x5 - 2x4- x3+X2+1 =0 and y-x4 -x2+x-1, then y satisfies a quadratic equation with rational coefficients. (Math. Tripos, 1903.) [It will be found that y2y+ 1 =0.]

Page 25

CHAPTER II. FUNCTIONS OF REAL VARIABLES. 9. The idea of a function. Suppose that x and y are two continuous real variables, which we may suppose to be represented geometrically by distances AoP = x, BoQ = y measured from fixed points Ao, Bo along two straight lines L, M. And Ao P Bo y Q M FIG. 5. let us suppose that the positions of the points P and Q are not independent, but connected by a relation which we can imagine to be expressed as a relation between x and y: so that, when P and x are known, Q and y are also known. We might, for example, suppose that y = x, or y = 2, or ~x, or x2+ 1. In all of these cases the value of x determines that of y. Or again, we might suppose that the relation between x and y is given, not by means of an explicit formula for y in terms of x, but by means of a geometrical construction which enables us to determine Q when P is known. In these circumstances y is said to be a function of x. This notion of functional dependence of one variable upon another is perhaps the most important in the whole range of higher mathematics. In order to enable the reader to be certain that he understands it clearly we shall, in this chapter, illustrate it by means of a large number of examples.

Page 26

26 FUNCTIONS OF REAL VARIABLES [nI But before we proceed to do this, we must point out that the simple examples of functions mentioned above possess three characteristics which are by no means involved in the general idea of a function, viz.: (1) y is determined for every value of x; (2) to each value of x for which y is given corresponds one and only one value of y; (3) the relation between x and y is expressed by means of an analytical formula. It is indeed the case that these particular characteristics are possessed by many of the most important functions. But the consideration of the following examples will make it clear that they are by no means essential to a function. All that is essential is that there should be some relation between x and y such that to some values of x at any rate correspond values of y. Examples VII. 1. Let y=x or 2x or ~x or x2+1. Nothing further need be said at present about cases such as these. 2. Let y =0 whatever be the value of x. Then y is a function of x, for we can give x any value, and the corresponding value of y (viz. 0) is known. In this case the functional relation makes the same value of y correspond to all values of x. The same would be true were y equal to 1 or - 2 or /2 instead of 0. Such a function of x is called a constant. 3. Let y2=x. Then if x is positive this equation defines two values of y corresponding to each value of x, viz. + Jx. If x=0, y=0. Hence to the particular value 0 of x corresponds one and only one value of y. But if x is negative there is no value of y which satisfies the equation. That is to say, the function y is not defined for negative values of x. This function therefore possesses the characteristic (3), but not (1) or (2). 4. Consider a volume of gas maintained at a constant temperature and contained in a cylinder closed by a sliding piston*. Let A be the area of the cross section of the piston and W its weight. The gas, held in a state of compression by the piston, exerts a certain pressure Po per unit of area on the piston, which balances the weight W, so that W=Apo. Let vo he the volume of the gas when the system is thus in equilibrium. If additional weight is placed upon the piston the latter is forced downwards. The volume (v) of the gas diminishes; the pressure (p) which it exerts * I borrow this instructive example from Prof. H. S. Carslaw's Introduction to the Calculus.

Page 27

FUNCTIONS OF REAL VARIABLES 27 upon unit area of the piston increases. Boyle's experimental law asserts that the product of p and v is very nearly constant, a correspondence which, if exact, would be represented by an equation of the type pv =a.......................................(i), where a is a number which can be determined approximately by experiment. Boyle's law, however, only gives a reasonable approximation to the facts provided the gas is not compressed too much. When v is decreased and p increased beyond a certain point the relation between them is no longer expressed with tolerable exactness by the equation (i). It is known that a much better approximation to the true relation can then be found by means of what is known as 'van der Waals' law,' expressed by the equation ( +) (V - y............................ (ii), where a, 2, y are numbers which can also be determined approximately by experiment. Of course the two equations, even taken together, do not give anything like a complete account of the relation between p and v. This relation is no doubt in reality much more complicated, and its form changes, as v varies, from a form nearly equivalent to (i) to a form nearly equivalent to (ii). But, from a mathematical point of view, there is nothing to prevent us from contemplating an ideal state of things in which, for all values of v above a certain limit, V say, (i) would be exactly true, and (ii) exactly true for all values of v less than V. And then we might regard the two equations as together defining p as a function of v. It is an example of a function which for some values of v is defined by one formula and for other values of v is defined by another. This function possesses the characteristic (2): to any value of v only one value of p corresponds: but it does not possess (1). For p is at any rate not defined as a function of v for negative values of v; a negative volume means nothing, and so negative values of v do not present themselves for consideration at all. 5. Suppose that a perfectly elastic ball is dropped (without rotation) from a height ~gr2 on to a fixed horizontal plane, and rebounds continually. The ordinary formulae of elementary dynamics, with which the reader is probably familiar, show that h= I gt2 if 0 < t, h= g (2r-t)2 if r e t 3T, and generally h=g (2n - t)2 if (2n-1)rT t (2n + 1)r, h being the depth of the ball, at time t, below its original position. Obviously h is a function of t which is only defined for positive values of t. The reader should construct other examples of functions which occur in physical problems.

Page 28

28 FUNCTIONS OF REAL VARIABLES [II 6. Suppose that y is defined as being the largest prime factor of x. This is an instance of a definition which only applies to a particular class of values of x;, viz., integral values. 'The largest prime factor of 1J3 or of ^/2 or of 7r' means nothing, and so our defining relation fails to define for such values of x as these. Thus this function does not possess the characteristic (1). It does possess (2), but not (3), as there is no simple formula which expresses y in terms of x. 7. Let y be defined as the denominator of x when x is expressed in its lowest terms. This is an example of a function which is defined if and only if x is rational. Thus = 7 if x= -11/7: but y is not defined for x=^/2, 'the denominator of ^/2' being a meaningless expression. 8. Let y be defined as the height in inches of policeman C.x, in the lMetropolitan Police, at 5.30 p.m. on 8 Aug. 1907. Then y is defined for a certain number of integral values of x, viz., 1, 2,..., N, where N is the total number of policemen in division C at that particular moment of time. 10. The graphical representation of functions. Coordinate geometry of two dimensions. Suppose that the variable y is a function of the variable x. It will generally be open to us also to regard x as a function of y, in virtue of the functional relation between x and y. But for the present we shall look at this relation from the first point of view. We shall then call x the independent variable and y the dependent variable; and, when the particular form of the functional relation is not specified, we shall express it by the general form of equation y=f( () (or F (x), b (x), * (x),... as the case may be). The nature of particular functions may, in very many cases, be illustrated and made easily intelligible as follows: draw two lines OX, OY at right angles to one another and produced indefinitely in both directions. We can represent values of x and y by distances measured from 0 along the lines OX, OY respectively, regard being paid, of course, to sign, and the positive directions of measurement being those indicated by arrows in Fig. 6. Let a be any value of x for which y is defined and has (let us suppose) the single value b. Take OA = a, OB = b, and complete the rectangle OAPB. Imagine the point P marked on the diagram. This marking of the point P may be regarded as showing that the value of y for x = a is b.

Page 29

10] FUNCTIONS OF REAL VARIABLES 29 If to the value a of x corresponded several values of y (say b, b', b") we should have, instead of the single point P, a number of points P, P', P". B P' b O A 'X B! --- Pp" FIG. 6. We shall call P the point (a, b); a and b the coordinates of P referred to the axes OX, OY; a the abscissa, b the ordinate of P; OX and OY the axis of x and the axis of y, or together the axes of coordinates, and 0 the origin of coordinates, or simply the origin. Examples VIII. 1. Let P be the point (a, b), Q the point (a, /). Complete the parallelogram OPRQ. Show that R is the point (a+-a, b+-3). 2. The middle point of PQ is the point (a+a), ~ (b+/). 3. More generally, the line which divides PQ in the ratio i: X is the point (Xa+,a)/(X+ut), (Xb+,u)/(X+/p). These expressions give, if the ratio /: X is properly chosen, the coordinates of any point on the line PQ. 4. The centre of mass of equal particles at the points (al, bl), (a2, b2),... (an, b%) is the point (a + a2+... +a n)/n, (bl+b2+... + b)/. 5. Change of axes. Draw through 0 lines OX', O Y' making angles 8 with OX, OY (Fig. 7). Draw PA', PB' perpendicular to OX', OY'. It is clear that P is determined if OA' and OB' are given just as much as if OA and OB are given. Let OA=x, OB=y, OA'=x', OB'=y'. Then x' and y' are the coordinates of P referred to the new axes OX', OY'. Prove that x'=x cos 0 +y sin 0, y'= -x sin +y cos 0, and express x and y in terms of x' and y'.

Page 30

30 FUNCTIONS OF REAL VARIABLES [II 6. In Ex. 5, the origin was left unchanged, but the new axes were inclined to the old ones. We might have taken new axes parallel to the old Y Yt\ 0A' 0 A X FIG. 7. ones, but passing through a new origin 0'. Let the coordinates of O' referred to the old axes be a, 3. Express x' and y' in terms of x and y, and conversely. 7. A new origin O' is taken, and new axes O'X', O'Y' inclined at any angle to the old ones. Show, by means of the results of Exs. 5 and 6, that x' and y' may be expressed in terms of x and y by formulae of the type x'=ax+by-+c, y'-dx+ey+f, where a, b,... are numbers independent of x and y. 11. The equation of a straight line. Let us now suppose that for all values a of x for which y is defined, the value b (or values b, b', b",...) of y, and the corresponding point P (or points P, P', P",...) have been determined. We call the aggregate of all these points the graph of the function y. To take a very simple example, suppose that y is defined as a function of x by the equation ax + by + c =.......................(1), where a, b, c are any fixed numbers. Then y is a function of x which possesses all the characteristics (1), (2), (3) of ~ 9. It is easy to show that the graph of y is a straight line. First suppose a = 0. Then y has the constant value - c/b, and the graph is obviously a straight line parallel to OX. Next suppose a different from zero, and suppose that (x1, y,) (x2, y2) are any two points on the graph, so that ax + by + c =, a +by+ c = 0............(2).

Page 31

11] FUNCTIONS OF REAL VARIABLES 31 The coordinates of any point P on the line joining (x,, y,) and (x2, y2) may (Ex. VIII. 3) be expressed in the form t = (Xx\ + $Ux2)/(X + ),? = (\y + A, y2)/(X + 4). But if we multiply the two equations (2) by X and p respectively, add the results, and divide by X + p, we obtain af+b + c = 0, which shows that P lies on the graph. Hence the graph includes all the points of the line. And it cannot include any other points. For the line is not parallel to OX, since if it were y would be constant for all points on it, which is not the case. Hence there is one point on the line for which y has any value we like to assign. And so, if the graph contained a point (x', y') which did not lie on the line, there would be two values of x given by the equation ax + by + c = 0 when y had the value y': and this is obviously untrue. Thus the graph includes all the points of the line and no others. We shall sometimes use another mode of expression. We shall say that when x and y vary in such a way that equation (1) is always true, the locus of the point (x, y) is a straight line, and we shall call (1) the equation of the locus, and say that the equation represents the locus. This use of the terms 'locus,' 'equation of the locus' is quite general, and may be applied whenever the graph of y is, in the ordinary sense of the word, a curve*, and the relation between x and y is capable of being represented by an analytical formula. The preceding work does not apply when b =0. The equation then reduces to x = - c/a, so that the distance of P from 0 Y is constant-i.e. P lies on a line parallel to OY. In this case y does not occur in the equation at all, and so the latter cannot be regarded as defining y as a function of x. But it may be regarded as defining x as a function of y, viz. the constant - c/a. The equation ax + by + c = 0 is the general equation of the first degree, for ax + by + c is the most general polynomial in x and y which does not involve any terms of degree higher than the first * ' Curve' of course includes straight line as a particular case. Some examples in which the 'graph' is not, in the ordinary sense of the word, a curve, will be found in Exs. XVI.

Page 32

32 FUNCTIONS OF REAL VARIABLES [II in x and y. Hence the general equation of the first degree represents a straight line. It is equally easy to prove the converse proposition, the equation of any straight line is of the first degree. After the discussion which precedes we may leave this as an exercise for the reader. Examples IX. 1. The angles which the line ax+ by+c=O makes with OX are arc tan (- a/b) and rr -arc tan (- a/b), where arc tan X denotes the numerically least angle whose tangent is X. 2. If P is a point on the line, and x', y' are defined as in Ex. VIII. (5), show that Ax' +BBy' + C=O, where A=acos +bsinO, B=bcos 0-asin. We call this equation the equation of the line referred to the new axes OX', OY'-it is the relation which connects the new coordinates x', y'. It will be observed that this equation also is of the first degree, as it obviously should be, since the proof that the equation of a straight line is of the first degree in no way depends upon what particular axes are chosen. 3. The coordinates of the point of intersection of ax+by +c=0 and a'x +b'y +c'=O are be' - b'c ac' - a'c ab'- a'b ' ab'- -a'b unless a/b = a'/b', in which case the lines are parallel. 4. The tangents of the angles between the lines in Ex. 3 are ~ (ab' - a'b)/(aa' + bb'), and the lines are perpendicular if aa' + bb' = 0. 5. The length of the perpendicular from ($,,) on to ax+ by+c=0 is at + br +c s/(a2+b2) ' the perpendicular being regarded as positive or negative according to the side of the line on which the point lies. [Positive when it is on the same side as 0, if c>: negative in the same circumstances if c<0.] 6. The equation (ax + by + c) + X (ax + fy + y) = 0 represents a line through the intersection of ax+by+c=0 and ax+/3y+y= 0, and, by proper choice of X, may be made to represent any such line. Discuss the particular case in which a/a=3/b. 7. Hence show how to find the equation of a line through the intersection of two given lines and parallel or perpendicular to a third. 8. The equation of the circle whose centre is (a, b) and radius r is (x - a)2 +(y - b)2=r2. Conversely, any equation of this form represents a circle.

Page 33

FUNCTIONS OF REAL VARIABLES 33 9. The most general equation of the second degree in x and y, in which there is no term in xy, and x2 and y2 have equal coefficients, viz. a (X +y/2)+ 2gx+2fy + c=0 represents a circle iff2+g2>ac. Discuss the cases in whichf2+-g2_ac. 10. Verify that the characteristic form of the equation of a circle (Ex. 9) is not altered by change of axes. 11. The general equation of a circle which passes through the points of intersection of two intersecting circles (x- a )2 +(y - b)2 r2 (- a)2+(-y 3)2 =p2 is (- a)2 + (y b)2-.2 + {(- a)2 + (y- _)2- p2} = 0. 12. If X -1 this last equation is of the first degree only, and represents the common chord of the two circles. 13. The two circles x2+y2+2dx+2ey+k2=0, x2+y2+28x+2Ey +K 2=0, will represent a pair of intersecting circles if d2+e2-k2>0, 82 +e2- K2>0 and 4 (d2+e2-k2) (82+E2 - K2)>(2da+2eE-k2 - K2)2. 14. Show that the two circles in Ex. 13 will cut at right angles if 2d + 2ee =k2+ 2. 15. The area of the triangle formed by the points (xl, Yl), (x2, Y2), (X3, Y3) is XI i y/1 1, X2 2 1 X y3 1 taken positively. Hence deduce the result of ~ 11. Examples X. 1. A point moves (a) so that its distance from a given line is constant, (b) so that its distances from two given lines are equal. Show that in each case the locus of the point is two straight lines (a) by geometrical reasoning, (3) by means of the results of ~ 11 and Ex. IX. 5. 2. The distances of a variable point P from a number of lines are p, p', p",..., and P moves so that ap- bp'+cp"+... =0 where a, b, c,... are constants. Show that the locus of P consists of a number of straight lines. 3. A, B are fixed points, and P a variable point which moves so that (a) X. AP2 +. BP2=const., (b) AP/BP=const. Show that the locus of P is in either case a circle. H. A. 3

Page 34

34 FUNCTIONS OF REAL VARIABLES [II 4. A line of fixed length moves with its ends upon OX and OY. Find the equation of the locus of the point which divides the line in the ratio,u: X. [Let the line be AB, meeting OX, OY in A, B, and let OA=a, OB=b. The coordinates of the point P in question (Ex. VIII. 3) are Xa/(X+p) and /b/(X +p). Also a2+b2=const. =c2, say. Thus if x=Xa/(X +), y=pb/(k +p), X2 2 c2 we deduce + y2 If X= =, i.e. if P is the middle point of AB, this is the equation of a circle.] 5. A line of constant length moves with its ends on a fixed circle. Prove that the locus of the point which divides the line in a fixed ratio is a concentric circle. 12. Polar coordinates. In what precedes we have determined the position of P by the lengths of OM= x, MP = y. If OP=r and MOP = 0, 0 being an angle between 0 and 27r (measured in the positive direction), it is evident that N x = r cos 8, y = r sin, r r = /(2 + y2), cos:sin: 1:: x:y:r, Y and that the position of P is equally well determined by a knowledge of r o M and 0. We call r and 0 the polar co- FIG. 8. ordinates of P. The former, it should be observed, is essentially positive. If P moves on a curve there will be some relation between r and 0, say r =f(0) or 0 =F(r). This we call the polar equation of the locus. The polar equation may be deduced from the (x, y) equation (or vice versa) by the formulae above. It should be observed that (x, y) and (r, 0) are only two out of an infinite variety of 'systems of coordinates' which may be used to fix the position of P. Examples XI. 1. The polar equation of a straight line is of the form r cos (0- a) =p, where p and a are constants. 2. The equation r = 2a cos 0 represents a circle passing through the origin. So do r = 2a sin 0 or r = X cos 0 +/ sin 0. Find the radius of each circle. 3. The general equation of a circle is of the form r2 + c2 - 2rc cos ( - a) = a2, where a, c, and a are constants.

Page 35

12, 13] FUNCTIONS OF REAL VARIABLES 35 13. Further examples of functions and their graphical representation. In all of Exs. IX. and X. we were concerned with two very simple functions of x, viz. the functions y defined by the equations ax +y 3y + y= 0 or (x - a) + (y - b)2 = r2. Only in Ex. X. 4 did we meet for a moment a slightly more general type of functional relation. The examples which follow will give the reader a better notion of the infinite variety of possible types of functions. A. Polynomials. The meaning of the term polynomial in x was explained in Ch. I. It denotes a function of the form aom + a, xm-l +... + am where a0, a1,... ac are constants. The simplest polynomials are the simple powers y=X X2, X,...= X... The graph of the function xm is of two distinct types, according as m is even or odd. First let m = 2. Then three points on the graph are (0,0), (1, 1), (-1,1). Any number of additional points on the graph may be found by assigning other special values to x: thus the values x=, 2, 3, -1, -2, 3 give y=X, 4, 9,, 4, 9. y-~,, 9, 4,. (0,0) FIG. 9. 3-2

Page 36

36 FUNCTIONS OF REAL VARIABLES [II If the reader will plot off a fair number of points on the graph he will be led to conjecture that the form of the graph is something like that shown in Fig. 9. If he draws a curve through the special points which he has proved to lie on the graph and then tests its accuracy by giving x new values, and calculating the corresponding values of y, he will find that they lie as near to the curve as it is reasonable to expect, when the inevitable inaccuracies of drawing are considered. There is, however, one fundamental question which we cannot answer adequately at present. The reader has no doubt some notions as to what is meant by a continuous curve, a curve without breaks or jumps-such a curve, in fact, as is roughly represented in Fig. 9. The question is whether the graph of the function y =2 is in fact such a curve. This cannot be proved by merely constructing any number of isolated points on the curve, although the more such points we construct the more probable it will appear. This question cannot be discussed properly until Ch. IV. In that chapter we shall consider in detail what our common sense idea of continuity really means, and how we can prove that such graphs as the one now considered, and others which we shall consider later on in this chapter, are really continuous curves. For the present the reader may be content to draw his curves as common sense dictates. It is easy to see that the curve y= 2 is everywhere convex to the axis of x. Let Po, PI (Fig. 9) be the points (X0, X02), (xl, xl2). Then YNP _ 812 - Xo2 tan NPoP1 = P —o - xo+l, and, if PO is kept fixed, this increases as xl increases-i.e. the slope of PoP1 becomes steeper and steeper. The curve y = x4 is similar to y = x2 in general appearance, but flatter near 0, and steeper beyond the points A, A' (Fig. 10). And y = x"', where m is even and greater than 4, is still more so. And as m gets larger and larger the flatness and steepness grow more and more pronounced, until the curve is practically indistinguishable from the thick broken line in the figure. The reader should next consider the curves given by y =m, when m is odd. The fundamental difference between the two

Page 37

FUNCTIONS OF REAL VARIABLES 37 cases is that whereas when m is even (-x)m=xm, so that the curve is symmetrical about OY, when m is odd (- x)m=- m, so that y is negative when x is negative. Fig. 11 shows the curves Y="r M 0 N FIG. 10. FIG. 11. y= x, y=x3, and the form to which y = x' approximates for larger odd values of m. It is now easy to see how (theoretically at any rate) the graph of any polynomial may be constructed. In the first place, from the graph of y = xm we can at once derive that of Cx1, where C is a constant, by multiplying the ordinate of every point of the curve by C. And if we know the graphs of f(x) and F(x) we can find that of f(x) + F(x) by taking the ordinate of every point to be the sum of the ordinates of the corresponding points on the two original curves. FIG. 12.

Page 38

38 FUNCTIONS OF REAL VARIABLES [II Fig. 12 shows the graph of y= 2x2- 3, constructed in this way. The thin line is y=2X2, the dotted line y =- x3. In order to prevent the figure becoming of an awkward size, the scale for measurements along the axis of y has been taken to be one-quarter of that for measurements along the axis of x. This is often convenient: of course any ratio of the scales may be chosen. The drawing of graphs of polynomials is however so much facilitated by the use of more advanced methods, which will be explained later on, that we shall not pursue the subject further here. Examples XII. 1. Trace the curves y=7x4, y=3x5, y=xl~. [The reader should draw the curves carefully, choosing the scales of measurement along OX and OY so as to get a convenient figure: but all three curves should be drawn in one figure. The reader will then realise how rapidly the higher powers of x increase, as x gets larger and larger, and will see that, in such a polynomial as x10 + 3x5 + 7x, (or even x10 + 30x5+ 700x4) it is the first term which is of really preponderant importance when x is fairly large. Thus even when x=4, xl0>1,000,000, while 30x5<35,000 and 700x4<180,000; while if vx=10 the preponderance of the first term is still more marked.] 2. Compare the relative magnitudes of x12, 1,000,000x6 1,000,000,000000000000 when = 1, 10, 100, etc. [The reader should make up a number of examples of this type for himself. This idea of the relative rate of growth of different functions of x is one with which we shall often be concerned in the following chapters.] 3. Draw the graph of a2 + 2bx + c. [Here y - {(ac - b2)/a}=a{x+(b/a)}2. If we take new axes parallel to the old and passing through the point - b/a, (ac-b2)/a, the new equation is y'=ax'2. The reader should consider a few different cases in which a, b, c have numerical values, sometimes positive and sometimes negative.] 4. Trace the curves y=X3-3x+l, y=x2(x- 1), y=x(x-1)2. 14. B. Rational Functions. The class of functions which ranks next to that of polynomials in simplicity and importance is that of rational functions. In Ch. I. we defined a rational function as the quotient of one polynomial by another: thus if P (x), Q (x) are polynomials we may denote the general rational function by R (x) = P (x)

Page 39

13, 14] FUNCTIONS OF REAL VARIABLES 39 In the particular case when Q (x) reduces to unity or any other constant (i.e. does not involve x), R (x) reduces to a polynomial: thus the class of rational functions includes that of polynomials as a sub-class. The following points concerning the definition should be noticed. (1) We usually suppose that P(x) and Q (x) have no common factor x + a or xP + axP -1 + bxP - 2 +... + k, all such factors being removed by division. (2) It should however be observed that this removal of common factors does as a rule change the function. Consider for example the function x/x, which is a rational function. On removing the common factor x we obtain 1/1 = 1. But the original function is not always equal to 1: it is equal to 1 only so long as x+O. If x=0 it takes the form 0/0, which is meaningless. Thus the function x/x is equal to 1 if x=i0 and is undefined when x=0. It therefore differs from the function 1 which is always equal to 1. (3) Such a function as (+1 1 )/1 1) may be reduced, by the ordinary rules of algebra, to the form x2 (x-2) (X- 1)2 (x+ i) which is a rational function of the standard form. But here again it must be noticed that the reduction is not always legitimate. In order to calculate the value of a function for a given value of x we must substitute the value for x in the function in the form in which it is given. In the case of this function the values x= -1, 1, 0, 2 all lead to a meaningless expression, and so the function is not defined for these values. The same is true of the reduced form, so far as the values + 1 are concerned. But x= 0 or 2 gives the value 0. Thus once more the two functions are not strictly equivalent. (4) But, as appears from the particular example considered under (3), even when the function has been reduced to a rational function of the standard form there will generally be a certain number of values of x for which it is not defined. These are the values of x (if any) for which the denominator vanishes. Thus (X2-7)/(x2- 3+2) is not defined when x=1 or 2. (5) Generally we agree, in dealing with expressions such as those considered in (2) and (3), to disregard the exceptional values of x for which such processes of simplification as were used there are illegitimate, and to reduce our function to the standard form of rational function. The reader will easily verify that (on this understanding) the sum, product, or quotient of two rational functions may themselves be reduced to rational functions of the standard type. And generally a rational function of a rational function is itself a rational function: i.e. if in z=P(y)/Q(y), where P and Q are

Page 40

40 FUNCTIONS OF REAL VARIABLES [II polynomials, we substitute y=Pi (x)/Q1 (x), we obtain on simplification an equation of the form z=P2(x)/Q2Q(x). (6) It is in no way presupposed in the definition of a rational function that the constants which occur as coefficients should be rational numbers. The word rational has reference solely to the way in which the variable x appears in the function. Thus x2+x+,/3 x 4/2- or is a rational function. The use of the word rational arises as follows. The rational function P (x)/Q (x) may be generated from x by a definite number of operations upon x, including only multiplication of x by itself or a constant, addition of terms thus obtained, and division of one function, obtained by such multiplications and additions, by another. In so far as the variable x is concerned, this procedure is very much like that by which all rational numbers can be obtained from unity, a procedure exemplified in the equation 5 1+1+1+1+1 3 1+1+1 Again, any function which can be deduced from x by the elementary operations mentioned above, using at each stage of the process functions which have already been obtained from x in the same way, can be reduced to the standard type of rational function. The most general kind of function which can be obtained in this way is sufficiently illustrated by the example x 2+7 \ 2 2w+1 o b r t t s t o r 1 which can obviously be reduced to the standard type of rational function. 15. The drawing of graphs of rational functions, even more than that of polynomials, is immensely facilitated by the use of Y X \IG 13. FI. 14 FIG. 13. FIG. 14.

Page 41

14-16] FUNCTIONS OF REAL VARIABLES 41 methods depending upon the differential calculus. We shall therefore content ourselves at present with a very few examples. Examples XIII. 1. Draw the graphs of y = l/, y= 1/x2, y = 1x3,.... [The figure shows the graphs of the first two curves. It should be observed that, since 1/0, 1/02,... are meaningless expressions, these functions are not defined for x=0.] 2. Trace y=x+(l/x), x-(1/x), x2+(1/x2), x2-(1/x2) and ax+(b/x), taking various values, positive and negative, for a and b. 3. Trace x+1 (x+1 2 x2+1 1 1 + =- _I. k-' X.2- 1' (X- 1)2' (1+l-() -(X1)2' 4. Trace y= 1/(x-a) (x- b), 1/( - a) ( - b) (x-c), where a<O<b<c. 5. Sketch the general form assumed by the curves y= l/.xn as m becomes larger and larger, considering separately the cases in which mn is odd or even. 16. C. Explicit Algebraical Functions. The next important class of functions is that of explicit algebraical functions. These are functions which can be generated from x by a definite number of operations such as those used in generating rational functions, together with a definite number of operations of root extraction. Thus (1 +) + _) \/X + -(x + v/x), X2 ~- + x + /3 2) are explicit algebraical functions, and so is Wx/n (i.e. Vxm) where m and n are any integers. Functions such as these differ fundamentally from rational functions in two respects. In the first place, a rational function is always defined for all values of x with a certain number of isolated exceptions. But such a function as /x is undefined for a whole range of values of x (i.e. all negative values). Secondly, the function, when x has a value for which it is defined, has generally several values. Thus, if x >, \/x has two values, of opposite signs.

Page 42

42 FUNCTIONS OF REAL VARIABLES [II Examples XIV. 1. ^{(x-a)(b- x)}, where a<b, is defined only for a<_ x__ b. If a< G<b it has two values: if x=a or b only one, viz. 0. 2. Consider similarly /(x-a) (x-b)(c-x) (a<b<c), X (X2-a2), /( - a)2(b- ) (a<b), 1+-+ - /1 - % f +,,/}. v'l4-X4\l- x 3. Trace y.=- x, x, I/x2, (1 + /x)/(1- /x). 4. Trace y=^(a2 - x2), y=b /{1 - (x2/a2)}. 17. D. Implicit Algebraical Functions. It is easy to verify that if V(1 + x)- (1 - x) Y /(1 + x) + ~/(1 - x) then (+y =(1 + ) 1i -y) (1 - )2 or if y = V/ + \/(x + /Vx) then 4 -(4y2 + 4y+ 1) =0. Each of these equations is of the form ym + _ Rly-l +... + R=...............(1), where RI, R2,..., Rm are rational functions of x: and the reader will easily verify that, if y is any one of the functions considered in the last set of examples, y satisfies an equation of this form. It is naturally suggested that the same is true of any explicit algebraic function. And this is in fact true, and indeed not difficult to prove, though we shall not delay to write out a formal proof here. An example should make clear to the reader the lines on which such a proof would proceed. Let x + / {X + a/x} + /(1 + x) Y =x - Vx + V {x + a/x} -:(1 + x)' Then we have the equations x+t u + v +w Y=x-u+v-w' u-2 = x, v2 = x + u, W3 = 1 + x, and we have only to eliminate u, v, w between these equations in order to obtain an equation of the form desired.

Page 43

FUNCTIONS OF REAL VARIABLES 43 We are therefore led to give the following definition: a function y =f(x) will be said to be an algebraical function of x if it is the root of an equation such as (1), i.e. the root of an equation of the mth degree in y, whose coefficients are rational functions of x. This class of functions includes all the explicit algebraical functions considered in ~ 16. But it also includes other functions which cannot be expressed as explicit algebraical functions. For it is known that such an equation as (1) cannot as a rule be solved explicitly for y in terms of x, when m is greater than 4, though such a solution is always possible if m = 1, 2, 3, or 4 and in special cases for higher values of m. The definition of an algebraical function should be compared with that of an algebraical number given in the last chapter (Misc. Exs. 30). Examples XV. 1. If m =1, y is a rational function. 2. If m=2 the equation is y2+R1y+R2=O, so that Y= {-RI -(R,2 -4R2)}. This function is defined for all values of x for which Ri2> 4R2. It has two values if R12>4R2 and one if R12=4R2. If n =3 or 4 we can use the methods explained in treatises on Algebra for the solution of cubic and biquadratic equations. But as a rule the process is complicated and the results inconvenient in form, and we can generally study the properties of the function better by means of the original equation. 3. Consider the functions defined by the equations y2-2_.y_2=0, y2-2y+x2=O, y 4-2y2+x2=0, in each case obtaining y as an explicit function of x, and stating for what values of x it is defined. 4. Find algebraical equations, with coefficients rational in x, satisfied by each of the functions JX+J(l/x), /x+/(1/x), /X+/(l/X), (/l+X)+/(1-X), N4{x+Vx}, J[ +V/{x+VX}]. 5. Consider the equation y4 = 2. [Here y2= +x. If x is positive y= ~ /x: if negative y= +~/(- x). Thus the function has two values for all values of x save x= 0, when it has the one value 0.]

Page 44

44 FUNCTIONS OF REAL VARIABLES [II 6. An algebraical function of an algebraical function of x is itself an algebraical function of x. [For we have Y" + R1 () -+... + R,z (Z) = 0, where zn+S1 (x) zn-I +... +Sn (x) =0. Eliminating z we find an equation of the form yP+ T, (x) y -l+... + T(x) = 0. Here all the capital letters denote rational functions.] 7. An example should perhaps be given of an algebraical function which cannot be expressed in an explicit algebraical form. Such an example is the function y defined by the equation y5 -y- x=0. But a proof that we cannot find an explicit algebraical expression for y in terms of x is difficult, and cannot be attempted here. 18. Transcendental Functions. All functions of x which are not rational or even algebraical are called transcendental functions. This class of functions, being defined in so purely negative a manner, naturally includes an infinite variety of whole kinds of functions of varying degrees of simplicity and importance. Among these we can at present distinguish two kinds which are particularly interesting. E. The direct and inverse trigonometrical or circular functions. These are the sine and cosine functions of elementary trigonometry, and their inverses, and the functions derived from them. We may assume that the reader is familiar with their most important properties. Examples XVI. 1. Draw the graphs of sin x, cos x, and a cos x+ b sin x. [Since a cos x + b sin x= f cos (x- a), where == /(a2 + b2), and a is an angle whose cosine and sine are a/^/(a2+b2) and b/V/(a2+b2), the graphs of these three functions are similar in character.] 2. Draw the graphs of cos2 x, sin2x, a cos2 x + b sin2 x. 3. Suppose the graphs of f(x) and F(x) drawn. Then the graph of f (x) cos2 x + F (x) sin2 x is a wavy curve which oscillates between the curves y=f(x), y=F(x). Draw the graph when f (x), F (x) are any pair of the functions l/x2, l/x, 1, x,.x2, ax+b, x+(l/x). 4. Discuss in the same manner the form of the graph of f (x) cos x + F (x) sin x.

Page 45

FUNCTIONS OF REAL VARIABLES 45 5. Draw the graphs of x + sin, x2 + sin, (x) + sin., x Sin x, x2 sin x, (sin x)/x. 6. Draw the graph of sin (l/x). [If y=sin (l/x), y=O when x= l/zm7r, where m is any integer. Similarly y=l when x=l/(2m+) Tr and y -1 when x=l/(2m- )7r. The curve is entirely comprised between the lines y= + 1. It oscillates up and down, the rapidity of the oscillations becoming greater and greater as x approaches 0. For x=0 the function is undefined. When x is large y is small. The negative half of the curve is an inversion of the positive half (Fig. 15).] FIG. 15. 7. Draw the graph of x sin (l/x). [This curve is comprised between the lines y= + x just as the last curve was comprised between the lines y= ~ 1 (Fig. 16).] FIG. 16. 8. Draw the graphs of x2 sin (l/x), (l/x) sin (l/s), sin2 (l/x), {x sin (l/x)}2, a cos2 (1/x)+b sin2 (l/x), sin x+sin (l/x), sin x sin (1/x). 9. Draw the graphs of cos x2, sin x2, a cos x + b sin x2. 10. Draw the graphs of arc cos x and arc sin x.

Page 46

46 FUNCTIONS OF REAL VARIABLES [II [If y = arc cos x, x = cos y. This enables us to draw the graph of x, considered as a function of y, and the same curve shows y as a function of x. It is clear that y is only defined for -l x <1, and is infinitely many valued for these values of x. As the reader no doubt remembers, there is, when - l<x< 1, a value of y between 0 and 7r, say a, and the other values of y are given by the formula 2n7r +a, where n is any integer, positive or negative.] 11. Draw the graphs of tan x, cot x, sec x, cosec x, tan2 x, cot2 x, sec2 a, cosec2 x. 12. Draw the graphs of arc tan x, arc cot x, arc sec x, arc cosec x. Give formulae expressing all the values of each of these functions in terms of any particular value. 13. Draw the graphs of tan (1/x), cot (1/x), see (1/x), cosec (l/x). 14. Show that sin x and cos x are not rational functions of x. [It is easy to see that no function which, like the sine or cosine, has a period, can possibly be a rational function. For suppose that f (x)=P (x)/Q (x), where P and Q are polynomials, and f (x)=f(x + 2r), each of these equations holding for all values of x. Let f(0)=k. Then the equation P(x) -kQ ()=0 is satisfied by an infinite number of values of x, viz. x =0, Srr, 47r, etc., and so it is an identity. Thus f(x)=k for all values of x, i.e. f(x) is a mere constant.] 15. Show, more generally, that no function with a period can be an algebraical function of x. [Let the equation which defines the algebraical function be ym + Rym- 1 +R........................ (1), where R1,... are rational functions of x. This may be put in the form P0ym+Pl Y"-1 l+... + pA=0O, where P0, P1,... are polynomials in x. Arguing as above we see that Pok" + Pi km- +...+ P,? = 0 is an identity. Hence y=k satisfies the equation (1) for all values of x, and one set of values of our algebraical function reduces to a constant. Now divide (1) by y-k and repeat the argument m times. Our final conclusion is that our algebraical function has, for any value of x, the same m values k, k',...; i.e. it is composed of m mere constants.] 16. The inverse sine and inverse cosine are not rational or algebraical functions. [This follows from the fact that for any value of x between -1 and +1, arc sin x and arc cos x have infinitely many values.]

Page 47

FUNCTIONS OF REAL VARIABLES 47 19. F. Other classes of transcendental functions. Next in importance to the trigonometrical functions come the exponential and logarithmic functions, which will be discussed in Chh. IX. and X. But these functions are beyond our range at present. And most of the other classes of transcendental functions whose properties have been studied, such as the elliptic functions, Bessel's and Legendre's functions, Gamma-functions, and so forth, lie altogether beyond the range of this book. There are however some elementary types of functions which, though of much less.importance theoretically than the rational, algebraical, or trigonometrical functions, are particularly instructive as illustrations of the possible varieties of the functional relation. Examples XVII. 1. Let y=[x], where [x] denotes the algebraically greatest integer contained in x. The graph is shown in Fig. 17 (a). 2. y=x-[x]. (Fig. 17(b).) 3. y=/{x -[x]}. (Fig. 17(c).) 4. y=[+x]+^{x-[x]}. (Fig. 17 (d).),.....,~~~~~~F 0 1 2 IFIG. 17 a. '/ ` 0 1 2 FIG. 17 b. 0 1 2 FIG. 17 c. FIG. 17 d.

Page 48

48 FUNCTIONS OF REAL VARIABLES [II 5. y (-[x])M, [XI + (X - [.])2. 6. y=[,/x], [x2], J^-[/ ], -[x2], [1-X2]. 7. Let y be defined as the largest prime factor of x (cf. Exs. VII. 6). Then y is defined only for integral values of x. When x= + 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13,... y= 1,2,3,2,5,3,7,2,3, 5, 11, 3, 13,... The graph consists of a number of isolated points. 8. Let y be the denominator of x (Exs. VII. 7). In this case y is defined only for rational values of x. We can mark off as many points on the graph as we please, but the result is not in any ordinary sense of the word a curve, and there are no points corresponding to any irrational values of x. Draw the straight line joining the points (N- 1, N), (N, N). Show that the number of points of the locus which lie on this line is equal to the number of numbers less than and prime to N. 9. Let y=0 when x is an integer, y=x when x is not an integer. The graph is derived from the straight line y=x by taking out the points...(-1, -1), (0,0), (1, 1), (2, 2),... and adding the points (-.1, 0), (0, 0), (1, 0),... on the axis of x. The reader may possibly regard this as an unreasonable function. Why, he may ask, if y is equal to x for all values of x save integral values, should it not be equal to x for integral values too? The answer is simply, why should it? The function y does in point of fact answer to the definition of a function: there is a relation between x and y such that when x is known y is known. We are perfectly at liberty to take this relation to be what we please, however arbitrary and apparently futile. This functiony is, of course, a quite different function from that one which is always equal to x, whatever value, integral or otherwise, x may have. Let us take an apparently still more arbitrary example. 10. Let y=0 when xv=-22, y2=1 when x=-1, y=sini' if - iv x:!, and y2= x2 if 1<x 2, except that y = - 1 when x = 1. And for x=3 let y have all values between -1 and +41. Finally, suppose that y is not defined at all except for the various values just enumerated. The graph is shown in Fig. 18. It consists of the curved arc L, the line P, the lines M and N, from which however the middle points and the ends nearest the axis of x must be taken out, and the four isolated points A, B, C, D. We notice further that y has infinitely many values for x=3, two for x= -1 and l<x<l and 1 <x 2, one for = -2<, - x and x =1, and none for any other value of x. This example is given merely to illustrate possibilities; it is not suggested that such functions as these are likely to be of any practical importance.

Page 49

FUNCTIONS OF REAL VARIABLES 49 The reader should however not be too ready to assume that even from the practical point of view it is only what is obvious and straightforward which is (-1,1) *B M A P (-2.5,0) L C (-1,-1) eD "lr *1 FIG. 18. important. If he turns back to Exs. VII. 4, 5, for instance, he will see examples of functions suggested by physical considerations and defined by different formulae for different ranges of values of x. 11. Let y = when x is rational, but = 0 when x is irrational. The graph consists of two series of points arranged upon the lines y =1 and y = 0. To the eye it is not distinguishable from two continuous straight lines, but in reality an infinite number of points are missing from each line. 12. Let y=x when x is irrational and y=-/{(1 +p2)/(l +q2)} when x is a rational fraction p/q.. ' e. ~./ FIG. 19. H. A. 4

Page 50

50 FUNCTIONS OF REAL VARIABLES [TI The irrational values of x contribute to the graph a curve in reality discontinuous, but apparently not to be distinguished from the straight line y=x. Now consider the rational values of x. First let x be positive. Then /{(l +p2)/(l+q2)} cannot be equal to p/q unless p=q, i.e. x=l. Thus all the points which correspond to rational values of x lie off the line, except the one point (1, 1). Again, if p<q, ^/{(l+p2)/(l+q2)}>p/q; if p>q, 4{(] +p2)/(l+q2)}<p/q. Thus the points lie above the line y=x if 0<x<l, below if x>1. If p and q are large /{(l +p2)/(l +q2)} is nearly equal to p/q. Near any value of x we can find any number of rational fractions with large numerators and denominators. Hence the graph contains a large number of points which crowd round the line y=x. Its general appearance (for positive values of x) is that of a line surrounded by a swarm of isolated points which gets denser and denser as the points approach the line. The part of the graph which corresponds to negative values of x consists of the rest of the discontinuous line together with the reflections of all these isolated points in the axis of y. Thus to the left of the axis of y the swarm of points is not round y=x but round y =-x, which is not itself part of the graph. See Fig. 19. 20. Graphical solution of Equations containing a single unknown quantity. Many equations can be expressed in the form f(x)= ( )........................... (1), where f(x) and + (x) are functions whose graphs are easy to draw. And it is obvious that if the curves =f(x), Y = () intersect in a point P whose abscissa is:, then: is a root of the equation (1). Examples XVIII. 1. The quadratic equation ax2+2bx+c=O. This may be solved graphically in a variety of ways. For instance we may draw the graphs of y=ax+2b, y= -c/x, whose intersections give the roots, if any. Or we may take y=2, y= -(2bx +c)/a. But the simplest method is probably to draw the circle a (x2+y2)+2bx+c=O, whose centre is (-b/a, 0) and radius {^(b2- ac)}/a. The abscissae of its intersections with the axis of x are the roots of the equation. 2. Solve by any of these methods x2+2x-3=0, x2-7x +4=O, 3x2+2x-2=0.

Page 51

20, 21] FUNCTIONS OF REAL VARIABLES 51 3. The equation xm+ax+b=0. This may be solved by constructing the curves y =xn, y= - ax- b. 4. Verify the following table for the number of real roots (if any) of xan c at + b =0:, e v fn b positive, two or none, '(a) m even [b negative, two; (b) n oddf{ a positive, one, a negative, three or one. Construct numerical examples to illustrate all possible cases. 5. Show that the equation tan x = a+ b has always an infinite number of real roots. 6. Determine the number of real roots of sinx=x, sinx=x/3, sinx=x/8, sinx =xt/120. 7. Show that if a is small and positive (e.g. a= 01) the equation x-a-= r sin2 x has three real roots. Consider also the case in which a is small and negative. Explain how the number of roots varies as a varies. 21. Functions of two variables and their graphical representation. In ~ 9 we considered two variables connected by a relation. We may similarly consider three variables (x, y, and z) connected by a relation such that when the values of x and y are both given, the value or values of z are known. In this case we call z a function of the two variables x and y; x and y the independent variables, z the dependent variable; and we express this dependence of z upon x and y by writing z=f(x, Y) The remarks of ~ 9 may all be applied, mutatis mutandis, to this more complicated case. The method of representing such functions of two variables graphically is exactly the same in principle as in the case of functions of a single variable. We must take three axes OX, Y, OZ in space of three dimensions, each axis being perpendicular to the other two. The point (a, b, c) is the point whose distances from the planes YOZ, ZOX, XOY, measured parallel to OX, OY, OZ, are a, b, and c. Regard must of course be paid to sign, lengths measured in the directions OX, OY, OZ being regarded as positive. The definitions of coordinates, axes, origin are the same as before. 4-2

Page 52

52 FUNCTIONS OF REAL VARIABLES [II Now let z =f(x, y). As x and y vary the point (x, y, z) will move in space. The aggregate of all these points is called the locus of the point (x, y, z) or the graph of the function z=f(x, y). When the relation between x, y, and z which defines z can be expressed in an analytical formula this formula is called the equation of the locus. 22. Equation of a plane. It may be shown without difficulty that the coordinates of the point which divides PQ in a given ratio p: X are Xa + pa kb + L34 Xc + p/y \+f X+t X' -X + where (a, b, c) are the coordinates of P and (a, /, y) those of Q (cf. Ex. VIII. 3). From this we can at once deduce the following important theorem: the general equation of the first degree represents a plane. For let ax + by + cz + d = 0 be the equation; and let (x1, y1, z,), (x2, y2, z2) be two points P, Q on the graph of the function z (or the locus represented by the equation). Then ax1 + by, + cz, + d = 0, ax2 + by2 + cz2 + d = 0, and so, multiplying by X and p, adding, and dividing by X + pu, Xa +x 7t + b Xy1 + PY2 + X1 + Z2 d a +- X+b ~+c-+ d=0. Thus the locus is such that if P and Q lie upon it, the point R which divides PQ in any ratio lies upon it. That is to say every point of the line PQ lies in the locus. The locus therefore satisfies Euclid's definition of a plane. Conversely the equation of any plane is of the first degree. For let (xi, y1, z2), (x2, y2, z2), (x3, Y3, z3) be any three points on the plane. We can choose a, b, c, d so that ax, + by1 + cz, + d = 0, ax2 + by2 + cz2 + d = 0, ax3 + bys + cz3 + d = 0. We can therefore determine a locus represented by an equation of the type ax + by + cz + d = 0,

Page 53

21-23] FUNCTIONS OF REAL VARIABLES 53 which passes through the three points. But we have already seen that this locus is a plane; and it can obviously only be the original plane. The equation of that plane is therefore of the first degree. Examples XIX. I. Prove that if (al, bl, cl), (a2, b2, c2), (a3, b3, C3) are three points on a plane, the point whose coordinates are X1+ 2+va3, etc. X+/I+v lies on the plane; and that it may, by choosing the ratios X: /: v appropriately, be made to coincide with any point in the plane. 2. Hence, by an argument similar to that of ~ 11, deduce that the general equation of the first degree represents a plane, and conversely. 3. The equation of the sphere whose centre is (a, b, c) and radius r is (X - a)2 + (y - b)2 + (Z - c)2 =r2 Conversely, this equation always represents a sphere. 4. Establish results for planes and spheres corresponding to those of Exs. IX. 3-9, 11-14. 23. Curves in a plane. We have hitherto used the notation y = f( )..........................(1) to express functional dependence of y upon x. It is evident that this notation is most appropriate in the case in which y is expressed explicitly by means of some formula involving x alone, as when for example y = 2, sin a, a cos2 + bsin2 x. We have however very often to deal with functional relations which cannot be or are most conveniently not expressed in this form. If, for example, y5-y-x=O or x' +y5-ay=O0 it is known to be impossible to express y explicitly as a simple function of x. If x2 + y2 + 2gx + 2fy + c = 0 y can indeed be so expressed, viz. by the formula y =-f~+ ~/f2- - 2gx-c; but the functional dependence of y upon x is better and more simply expressed by the original equation. It will be observed that in these two cases the functional relation is fully expressed by equating a function of the two variables x and y to zero, i.e. by means of an equation f(, y)= 0...........................(2).

Page 54

54 FUNCTIONS OF REAL VARIABLES [II We shall adopt this equation as the standard method of expressing the functional relation. It includes the equation (1) as a special case, since y -f(x) is a special form of a function of x and y. We can then speak of the locus of the point (x, y) subject to f(x, y) = 0, the graph of the function y defined by f(x, y) 0, the curve or locus f(x, y)= 0, and the equation of this curve or locus. There is another method of representing curves which is often useful. Suppose that x and y are both functions of a third variable t, which is to be regarded as essentially auxiliary and devoid of any particular geometrical significance. We may write x=f(t), y =F(t)................ (3). If t has any arbitrary value assigned to it, the value (or values) of x and of y are known. Each pair of such values defines a point (x, y). If we construct all the points which thus correspond to all the different values of t we obtain the graph of the locus defined by the equations (3). Suppose for example x=acost, y=asint. Let t vary from 0 to 27r. Then it is easy to see that the point (x, y) describes the circle whose centre is the origin and radius is a. If t varies beyond these limits (x, y) describes the circle over and over again. We can in this case at once obtain a direct relation between x and y by squaring and adding: we find that x2 + y2 = a2, t being now eliminated. Examples XX. 1. The points of intersection of the two curves whose equations are f (, y)= 0, p (x, y/)=0 are given by solving this pair of simultaneous equations. 2. Trace the curves (x+y)2=l, xy=l, x-2-y2=1. 3. The curve f(x, y)+X ( (x, y)=0 represents a curve passing through the points of intersection of f=0, j=0. 4. What loci are represented by (a) x=at+b, y=ct+d, (3) x/a=-2t/(l+t2), y/a=((l -)(+t2), when t varies through all real values? 24. Loci in space. In space of three dimensions there are two fundamentally different kinds of loci, of which the simplest examples are the plane and the straight line.

Page 55

23, 24] FUNCTIONS OF REAL VARIABLES 55 A particle which moves along a straight line has only one degree of freedom. Its direction of motion is fixed; if its velocity is given (velocity being regarded as a quantity capable of sign) its mode of motion is completely determined. Or again the position of a point on a line can be completely fixed by one measurement of position, e.g. by its distance from a fixed point on the line. If we take the line as our fundamental line L of Ch. I., the position of any of its points is determined by a single coordinate x. A particle which moves in a plane, on the other hand, has two degrees of freedom. In order to determine its mode of motion completely we require a knowledge of its component velocities in two different directions. Or again the position of a point on a plane requires the determination of two coordinates in order to fix it. Now let us look at these loci from the point of view of their equations. The plane is represented by a single equation ax + 1/y + yz + 8 = O. Two of the three coordinates y and z may be chosen arbitrarily, and the third is then fixed. The straight line on the other hand is the intersection of two planes. Let these two planes be ax+by +cz+d=O, ax+/y+7z y-+=0......... (1). Then if one of the three coordinates is chosen arbitrarily, both of the others and the position of the point are fixed. We can of course draw any nzumber of planes through the line. Hence it might appear that the coordinates of the point on the line are subject to more than two relations. And so in fact they are, but the relations are not all independent. Any other plane through the line could be expressed in the form ax+by +-cz+d+ (ax + y+^yz+)=O and any equation of this type is a mere consequence of the equations (1). The locus represented by a single equation z =f(, y) is called a surface. It may or may not (in the obvious simple cases it will) satisfy our common-sense notion of what a surface should be. The considerations of ~ 21 may evidently be generalised so as to give definitions of a function f(x, y, z) of three variables (or of functions of any number of variables). And as in ~ 23 we

Page 56

56 FUNCTIONS OF REAL VARIABLES agreed to adopt f(x, y)= 0 as the standard form of the equation of a plane curve, so now we shall agree to adopt f(x,, z)= 0, as the standard form of equation of a surface. The locus represented by two equations of the form z =f(x, y) or f(x, y, z)= 0 is called a curve. Thus a straight line may be represented by two equations of the type ax + /y + yz + 8 = 0. A circle in space may be regarded as the intersection of a sphere and a plane; it may therefore be represented by two equations of the forms (x - a)2(y- b)2 + (z- c)2 = r2, ax +/3 + = 0. Examples XXI. 1. What is represented by three equations of the type f(x,, z)=0? [Three equations in three variables are capable of solution (practically or theoretically). The solution consists of a finite or infinite number of isolated sets of values (x, y, z). The three equations therefore represent a number of isolated points. Or we may regard the question thus. Two of the equations determine a curve, which meets the surface represented by the third equation in a number of points.] 2. Three linear equations represent a single point. 3. The equations of a curve differ from the equation of a surface in that their mode of expression is not unique, since either may be transformed by means of the other. Thus the curve y=l, x2 +y2+z2=2, (a circle) may also be represented by y = 1, x2 + 2 1. 4. What are the equations of a plane curvef (x, y)-=0 in the plane XOY, when regarded as a curve in space? I[f(x, y)=0, ==0.] 5. Cylinders. What is the meaning of a single equation f(x, y )=0, considered as a locus in space of three dimensions? [All points on the surface satisfy f(x, y) =0 whatever be the value of z. The curve f(x, y)=0, z=0 is the curve in which the locus cuts the plane XOY. Draw the plane z=a, cutting ZOX, YOZ in OX', 0 Y', and take OX', OY' as axes of coordinates in this plane (Fig. 20). Obviously x'=x, y'=y and so f(', y') = 0. The curves in which the two planes z =0, z =a cut the locus are therefore repetitions of the same plane curve: if one curve were moved a distance a parallel to the axis of z it would coincide with the other. The locus is the surface formed by drawing lines parallel to OZ through all points of the plane curve f (x, y)= 0, z=0. Such a surface is called a cylinder.] 6. Interpret the equations: (a) y = mvx + c, (b) y = mx + c, z = a, (c) 2+ya2=1, (d) x2+y2=l, z=a, as loci in three-dimensional space.

Page 57

24] FUNCTIONS OF REAL VARIABLES 57 7. Graphical representation of a surface on a plane. Contour Maps. It might seem to be impossible to adequately represent a surface by a drawing on a plane; and so indeed it is: but a very fair notion of the nature of the surface may often be obtained as follows. Let the equation of the surface be z=f((x, y). FIG. 20. If we give z a particular value a, we have an equation a=f (x, y), which we may regard as determining a plane curve on the paper. We trace this curve and mark it (a). Actually the curve (a) is the projection on the plane XOY of the section of the surface by the plane z=a (Fig. 20). We do this for all values of a (practically, of course, for a selection of values of a). We obtain some such figure as is shown in Fig. 21. It will at once suggest a contoured Ordnance Survey map: and in fact this is the principle on which such maps are constructed. The contour line 1000 is the projection on the plane of the sea level of the section of the surface of the land by the plane parallel to the plane of the sea level and 1000 ft. above it *. 2000,.___^^1~000 FIG. 21. may assume here that the effects of the earth's curvature may be * We neglected.

Page 58

58 FUNCTIONS OF REAL VARIABLES [II 8. Draw a series of contour lines to illustrate the form of the surface 2z= 3xy. 9. Right circular cones. Take the origin of coordinates at the vertex of the cone and the axis of z along the axis of the cone (Figs. 22, 23). Let a be the semi-vertical angle of the cone, P any point on it, C and P' its projections z 0 0 \p FIG. 22. FIG. 23. on the axis OZ and the plane XOY. Then if x, y, z are the coordinates of P, we have x2 +/2 = OP' 2= P2 =0 C2tan2 a =2tan2a. The equation of the cone (which must be regarded as extending both ways from its vertex) is therefore x2 +y2 - 2 tan2 a = 0. 10. Surfaces of revolution in general. We notice that the cone of Ex. 9 cuts ZOX in the lines x= + z tan a, which may be combined in the equation x2 z2 tan2 a. That is to say, the equation of the surface generated by the revolution of the curve y=0, x2= z2tan2a round the axis of z is derived from the second of these equations by changing x2 into x2 +y2. Show generally that the equation of the surface generated by the revolution of the curve y=0, x=f(z), round the axis of z, is /(X2 +2y2)=f (z), or +2 +y2{f (Z)2. Verify in the case of (1) the line y=O, x= 1; (2) the circle y=0, xS2+2= 1. 11. Cones in general. A surface formed by straight lines passing through a fixed point is called a cone: the point is called the vertex. A particular case is given by the right circular cone of Ex. 9. Show that the equation of a cone whose vertex is 0 is of the form and that any equation of this form represents a cone. [If (x, y, z) lies on the cone, so must (Xx, Xy, Xz), for any value of X.]

Page 59

24] FUNCTIONS OF REAL VARIABLES 59 12. Ruled surfaces. Cylinders and cones are special cases of surfaces composed of straight lines. Such surfaces are called ruled surfaces. The two equations = az+- b 1 y —az+ --..............................(1) y=cz+CJ () represent the intersection of two planes, i.e. a straight line. Now suppose that a, b, c, d instead of being fixed, are functions of an auxiliary variable t. For any particular value of t the equations (1) give a line. As t varies this line moves, and generates a surface, whose equation may be found by eliminating t between the two equations (1). For instance, in Fig. 23 the line OL is inclined at a fixed angle a to OZ. PP' is perpendicular to the plane XO Y and XOP'= t. The equations of the line are x=z tan a cos t y=z tan a sin t f As t varies the line turns round OZ and generates the cone x2 +y=2 2 tan2 a. Another simple example of a ruled surface may be constructed as follows. Take two sections of a right circular cylinder perpendicular to the axis and at a distance I apart (Fig. 24 a). We can imagine the surface of the cylinder to be made up of a number of thin parallel rigid rods of length 1, such as PQ, the ends of the rods being fastened to two circular rods of radius a. Now let us take a third circular rod of the same radius and place it round the surface of the cylinder at a distance h from one of the first two rods (Fig. 24 a). Unfasten the end Q of the rod PQ and turn PQ about P until Q can be fastened to the third circular rod in the position Q'. The angle qOQ' -a in the figure is evidently given by 12- h2 = qQ'2 =(2a sin a)2. Let all the other rods of which the cylinder was composed be treated in the same way. We obtain a ruled surface whose form is indicated in Fig. 24 b. It is entirely built up of straight lines; but the surface is curved everywhere, and is in general shape not unlike certain forms of table-napkin rings (Fig. 24c). P FIG. 24 a. FIG. 24 b. FIG. 24 c.

Page 60

60 MISCELLANEOUS EXAMPLES ON CHAPTER II MISCELLANEOUS EXAMPLES ON CHAPTER II. 1. If y=f(x)=(ax + b)/(cx- a), show that x=f(y). 2. If f(x)=f/(-x) for all values of x, f(x) is called an even function. If f(x)== — f( -) it is called an odd function. Show that any function of x, defined for all values of x, is the sum of an even and an odd function of x. [Use the identity f(x)= {f (x) +f ( -x)}+ {f (x) -f( -x)}.] x-22 has a rational val e. 3. Find all the values of x for which y= has a rational value. 4. Draw the graphs of the functions 3 sin X+4 cos, sin ( 2 sin x). l(Math. Trip. 1896.) 5. Draw the graphs of the functions sinx(aCOS2x+bs sin X (a cos2 x + b sin2 x) 6. Draw graphs of the functions (i) arc cos (22 - 1) - 2 arc cos x, (ii) arc tan -a - arc tan a - arc tan x, 1 - ax where the symbols arc cos a, arc tan a denote, for any value of a, the least positive (or zero) angle, whose cosine or tangent is a. 7. Verify the following method of constructing the graph of f { (x)) by means of the line y-x and the graphs of f(x) and q (x): take OA =x along OX, draw AB parallel to OY to meet y= (x) in B, BC parallel to OX to meet y=x in C, CD parallel to 0 Y to meet y=f (x) in D, and DP parallel to OX to meet AB in P: then P is a point on the graph required. 8. Show that the roots of x3 +px+ q-= are the abscissae of the points of intersection (other than the origin) of the parabola y=x2 and the circle X2+y2 + (p- 1)y+ qx=0. 9. The roots of x4+nzx3+px2+qx+r =O are the abscissae of the points of intersection of the parabola x2=y - nx, and the circle x2+y2+(-n2 -~ pni+In +q) x +(p- - n2)y+r=0. 10. Discuss the graphical solution of the equation A'n + ax + bx + c = O by means of the curves y =xm, y= -ax2-bx-c. Draw up a table of the various possible numbers of real roots.

Page 61

MISCELLANEOUS EXAMPLES ON CHAPTER II 61 11. Show that the equation 2x = (2n + 1)r (1 - cos x) where n is a positive integer, has 2n+3 real roots and no more, roughly indicating their localities. (2lath. Trip. 1896.) 12. Discuss the number and value of the real roots of the equations (1) cot + - r-=0, (2) x2+sin2x =, (3) tan x=2x/(1+ x2), (4) sin x-x+ x3 =0, (5) (1-cosx)tana-x+sinx=0. 13. Determine a polynomial of the 5th degree which has, for x= - 1, -, 0,, 1 the values 3, 7, 2, 0, 4. 14. The polynomial of the second degree which assumes, when x=a, b, c, the values a, /, y, is (x- b) (- ) (x- ) ( a) (- a) (x- b) (a-b)(a-c ) -f (b-c)(b-aa){- (c-a) (c-b)' Give a similar formula for the polynomial of the (n-l)-th degree which assumes, when x =al, a2,... an, the values al, a2,... an. 15. If x is a rational function of y, and y is a rational function of x, show that Axy + Bx + Cy + D = 0. 16. If y is a rational function of x, with rational coefficients, then y has a rational value for all rational values of x. 17. If y is an algebraical function of x, x is an algebraical function of y. 18. Verify that for values of x between 0 and 1 the equation X2 coS ~T x --- -- 2 ~(x- 1) V (2-x) is approximately true. [Take x=0, 1, ),, 2,, 1, and use tables. For which of these values is the formula exact?] 19. The equation aoXn+nalxn-ly +n (-n - 1) a2X2 y?=0 a0 +na1x1y+ 1. 2 a2x-y+...+ay represents n straight lines through the origin. 20. Show that the line Ax+By-+ C=0 and the two lines represented by (Ax +By)2 - 3 (Ay -Bx)2 = 0 form the sides of an equilateral triangle. (Math. Trip. 1906.) 21. The equation of the circle described on the line joining (x, y) and (x', y') as diameter is (x- x') (x- x")+(y -y') (y - y") = 0. 22. The general equation of a circle through (x', y') and (x", y") may be expressed in either of the forms (1) (X - x') (y-y") -(x-x") (y-y) = (x- x') (x - x") +(y-y') (y-y")} tan a,

Page 62

62 MISCELLANEOUS EXAMPLES ON CHAPTER II (2) (x- ') (x- x" +,,,) + (-y)(y - A" - YI)=o. Here a is the angle contained in one of the segments of the circle. Express X in ternis of a. 23. The general equation of a circle cutting x2+y2+2x+c=0 orthogonally, for all values of X, is x2+y2+2ty-c=O. Sketch the two sets of circles. 24. The general equation of all circles cutting at right angles the two circles x2+y2-2alx-2bly+c=0, x2+y -2a2x - 2b2y+ c2=0 is 2 i +Xk x y 1 =0. C1 a, bbl al b1 1 c2 a2 b2 a2 b2 1 (Mfath. Trip. 1906.) 25. Show that the intersection of the two circular cylinders x2+y2-=1, x2+z2= 1, consists of two plane curves. Give a sketch of the cylinders and their line of intersection. 26. Sections of a right circular cone by a plane. Show that the cone x2+y2 =z2 tan2 a and the plane z x tan 0+c intersect in a curve whose projection on the plane XOY is x2+y2= (x tan 0+c)2 tan2 a. Taking axes O'~, O'q in the plane of section, O' being on OZ and 0',1 parallel to OY, show that the equation of the curve of section is 62 cos2 0 + iq2 = ($ sin 0 + C)2 tan2 a..................... (1). Show that this curve consists of a single closed branch, a single infinite branch, or two infinite branches, according as 0 c r- a, and that in any case it is symmetrical about 0'6. 27. Show that the equation (1) of the last example may be expressed in the formn (f -y)2+=2 e2 (- _ )2 c sin a sin 0 where e=sin 0 sec a, = i- + cos c sin a 1 +sin a cos 0 and K ' sin 0 sin a + cos unless 0= r - a, in which case e=i, =-c cos a, K=-~cseca(l+sin2 a). 28. Deduce that the section is a curve such that the distance of any point on the curve from a fixed point (y, 0) is e times its distance from a fixed line -K=)0, i.e. that the curve is a conic, having the focus and directrix property which is usually adopted as the definition of a conic in books on Conic Sections. The conic is an ellipse, parabola, or hyperbola, according as e 1; and, except in the special cases when e= 1 or e=O, has two foci (y, 0) and two corresponding directrices.

Page 63

MISCELLANEOUS EXAMPLES ON CHAPTER II 63 29. Let A, A' be the vertices of the conic, i.e. the points where the conic cuts the axis of symmetry 0'$, X, S' the foci, and K, K' the points where the directrices cut 0'~. Show that A, A' are given by -= + c sin a/cos (a ~ 0), that A lies between S and K and A' between S' and K', and that the length of the 'major axis' AA' is 2c sin a cos a cos O/(cos2 0 - sin2 a). 30. Show further that if we take axes parallel to 0'$, 0'7q through C, the middle point of AA', the equation of the curve becomes of the form (/a2) + (y2/b2) =, where a= AA' and b=a /(1-e2), or (X/a2) - (y2/b2)= 1 where a=2AA' and b=a /(e2-1), according as e l. Sketch the forms of the curves. 31. In the case when e=l, show that the one point A where the curve cuts 0'~ is given by == - c sec a, and that by taking axes through this point we can reduce the equation of the curve to the form y2= 4ax, where a=- c sin a tan a. [For an account of the simplest properties of the conic sections, deducible from the equations (x2/a2) ~ (y2/b2) = 1 or y2 = 4ax, we must refer to treatises dealing specially with this subject.] 32. Show that the most general equation of the second degree, viz. ax2 + 2hxy + by2 +2gx + 2fy + c =0 represents a conic. [It is this property which accounts for the importance of the conic sections.] 33. Show that the equation represents an ellipse, parabola, or hyperbola, according as h2 ab. 34. The equation ax2+2 hxy + by2 + 2 (gx +fy) (Ix+ my) + c (Ix + my)2 = 0 represents the two lines joining the origin to the points in which the line Ix+my = 1 cuts the conic ax2+2hxy + by2+2gx+2fy+c=0. 35. Show directly that if the cylinder x2 +y2= 1 is cut by a plane neither parallel nor perpendicular to its axis, the intersection is a curve possessing the focus and directrix property of a conic. 36. The curve f (x, y) > (x, y) + /F (x, y) D (x, y) = O passes through all points of intersection of f=0 and F==0, off=0 and =0, of 0=0 and F=0, and of 0=0 and <D=0. 37. If X,.=arx+bry+cr, the equation XL1LL3+IL2L4=O is the general equation of a conic circumscribing the quadrangle formed by the four lines ]1=0, L2=0, L3=0, 14=0 taken in order.

Page 64

64 MISCELLANEOUS EXAMPLES ON CHAPTER II 38. Determine the equation of the surface generated when the circle y=0, (x-a)2+z2=1 rotates round the axis of z. Sketch the form of the surface for different values of a. 39. What is the form of the graph of the functions z=[x]+[y], z=x+y- []- [y]? 40. What is the form of the graph of the functions z=sin x+siny, z= sin x sin, z = sin xy, z= sin (X2 +y2)? 41. Geometrical Constructions for irrational numbers. In Chapter I. we indicated one or two simple geometrical constructions for a length equal to ^J2, starting from a given unit length. We also showed how to construct the roots of any quadratic equation ax2+2bx+c=O, it being supposed that we can construct lines whose lengths are equal to any of the ratios of the quantities a, b, c. All these constructions were what may be called Euclidean constructions; they depended on the ruler and compass only. It is fairly obvious that any irrational expression, however complicated, can be constructed by means of these methods, provided it only contains square roots. Thus 4/( //17+3/11 /17 - 3 V/1 / \/ \17-3 /ll- /V 17+3 Vll/J is a case in point. This contains a fourth root, but this is of course the square root of a square root. We should begin by constructing ^11V, e.g. as the mean between 1 and 11: then 17+3 /11, and so on. Or these two mixed surds might be constructed directly as the roots of x2- 34x+ 190 =0. Conversely, only irrationals of this kind can be constructed by Euclidean methods. Starting from a unit length we can construct any rational length. And hence we can construct the line ax + by+ c =, or the circle (x- a)2 + (- )2 = r2, (or x2+y2+2gx+2fy+d=0) provided the constants which occur in these equations are rational. Now in any Euclidean construction, each new point introduced into the figure is determined as the intersection of two lines or circles, or a line and a circle. But if the coefficients are rational, such a pair of equations as ax+by+c=0, x2+y2+2gx+2fy+d=O give, on solution, values of x and y of the form m+ —n a/p, where m, n, p are rational: for if we substitute for x in terms of y in the second equation we obtain a quadratic in y with rational coefficients. Hence the coordinates of all points obtained by means of lines and circles with rational coefficients are expressible by rational numbers and quadratic surds. And so the same is true of the distance J{(x-x2)2+ (y -y2)2} between any two points so obtained. With the irrational distances thus constructed we may proceed to construct

Page 65

MISCELLANEOUS EXAMPLES ON CHAPTER II 65 a number of lines and circles whose coefficients may now themselves involve quadratic surds. It is evident, however, that by the use of such lines and circles we can still only construct lengths expressible by square roots only, though our surd expressions may now be of a more complicated form. And it is clear that this remains true however far we may go. Hence Euclidean methods will construct any surd expression involving square roots, and no others. In particular they will not construct /2, i.e. they will not solve the problem of the duplication of the cube, which was one of the famous problems of antiquity. 42. Approximate quadrature of the circle. Let 0 be the centre of a circle of radius R. On the tangent at A take AP=-I-R and AQ=-lAR, in the same direction. On AO take AN== OP and draw NM parallel to OQ and cutting AP in it. Show that Al= J/146. R, and that to take A as being equal to the circumference of the circle would lead to a value of rr correct to five places of decimals. If R is the earth's radius, the error in supposing AM to be its circumference is less than 11 yards. 43. Show that the only lengths which can be constructed with the ruiler only, starting from a given unit length, are rational lengths. 44. Constructions for 2/2. 0 is the vertex and S the focus of the parabola y = 4x, and P is one of its points of intersection with the parabola x2=2y. Show that OP meets the latus rectum of the first parabola in a point Q such that SQ=/2. 45. Take a circle of unit diameter, a diameter OA and the tangent at A. Draw a chord OBC cutting the circle at B and the tangent at C. On this line take OfM=BC. Taking 0 as origin and OA as axis of x, show that the locus of If is the curve (X2 +y2) X -y2 =0 (the Cissoid of Diodes). Sketch the curve. Take along the axis of y a length OD=2. Let AD cut the curve in P and OP cut the tangent at A in Q. Show that AQ= /2. H. A. 5

Page 66

CHAPTER III. COMPLEX NUMBERS. 25. Displacements along a line and in a plane. The 'real number' x, with which we have been concerned in the two preceding chapters, may be regarded from a considerable number of different points of view. It may be regarded as a pure number, destitute of geometrical significance, or a geometrical significance may be attached to it in at least three different ways. It may be regarded as the measure of a length, viz. the length AoP along the line L of Chap. I. It may be regarded as the mark of a point, viz. the point P whose distance from A0 is x. Or it may be regarded as the measure of a displacement or change of position on the line L. It is on this last point of view that we shall now concentrate our attention. Let a small particle be placed at P on the line L and then displaced to Q. We shall call the displacement or change of position which is needed to transfer the particle from P to Q the displacement PQ. To completely specify a displacement three things are needed, its magnitude, its sense (forwards or backwards along the line), and what may be called its point of application, i.e. the original position P of the particle. But, when we are thinking merely of the change of position produced by the displacement, it is natural to disregard the point of application and to consider all displacements as equivalent whose lengths and senses are the same. Then the displacement is completely specified by the length PQ = x, the sense of the displacement being fixed by the sign of x. We may therefore, without ambiguity, speak of the displacement [x], and we may write PQ = [x].

Page 67

COMPLEX NUMBERS 67 We use the square bracket to distinguish the displacement [x] from the length or number x*. If the coordinate of P is a, that of Q will be a x; the displacement [x] therefore transfers a particle from the point x to the point a + x. We come now to consider displacements in a plane. We may define the displacement PQ as before. But now more data are required in order to specify it completely. We require to know: (i) the magnitude of the displacement, i.e. the length of the straight line PQ; (ii) the direction of the displacement, which is determined by the angle which PQ makes with some fixed line in the plane; (iii) the sense of the displacement; and (iv) its point of application. Of these requirements we may disregard the fourth, if we consider two displacements as equivalent if they are the same in magnitude, direction, and sense. In other words, if PQ and RS are equal and parallel, and the sense of motion from P to Q is the same as that of motion from R to S, we regard the displacements PQ and RS as equivalent, and write PQ= -RS. Now let us take any pair of coordinate axes in the plane Y Q s R A 0 X B FIG. 25. (such as OX, OYin Fig. 25). Draw a line OA equal and parallel * Strictly speaking we ought, by some similar difference of notation, to distinguish the actual length x from the number x which measures it. The reader will perhaps be inclined to consider such distinctions futile and pedantic. But increasing experience of mathematics will reveal to him the great importance of distinguishing clearly between things which, however intimately connected, are not the same. If cricket were a mathematical science it would be very important to distinguish between the motion of the batsman between the wickets, the run which he scores, and the mark which is put down in the score-book. 5-2

Page 68

68 COMPLEX NUMBERS to PQ, the sense of motion from 0 to A being the same as that from P to Q. Then PQ and OA are equivalent displacements. Let x and y be the coordinates of A. Then it is evident that OA is completely specified if x and y are given. We call OA the displacement [x, y] and write OA = PQ = RS = [x, y]. 26. Equivalence of displacements. Multiplication of displacements by numbers. If ~ and 7 are the coordinates of P, ~' and r/' those of Q, it is evident that ~= '-f, yS-'-v. The displacement from (~, V) to (a', r/) is therefore [:'-, v'- 0]. It is evident that two displacements [x, y], [x', y'] are equivalent if, and only if, x = x', y = y'. Thus [x, y] = [x', y'] if x= x', y = y'........................ The reverse displacement QP would be [ - ', - I'], and it is natural to agree that If-t', v- '] = -'-, v'- ], QP = -PQ, these equations being really definitions of the meaning of the symbols - ['-, ' - a ], -PQ. Having thus agreed that - [, y] = [- x, - ] it is natural to agree further that x, y = [ x, ay].....................(2) where a is any real number, positive or negative. Thus (Fig. 25) if OB =- OA, OB = OA 2 [, y = [-, - y]. The equations (1) and (2) define the first two important ideas connected with displacements, viz. equivalence of displacements,, and multiplication of displacements by numbers.

Page 69

COMPLEX NUMBERS 69 27. Addition of displacements. We have not yet given any definition which enables us to attach any meaning to the expressions PQ + P'Q', [, y + [x', y']. Common sense at once suggests that we should define the sum of two displacements as the displacement which is the result Q \\Q E) ' C FIro. 26. of the successive application of the two given displacements. In other words, it suggests that if QQ, be drawn equal and parallel to P'Q', so that the result of successive displacements PQ, P'Q' on a particle at P is to transfer it first to Q and then to Q1, we should define the sum of PQ and P'Q' as being PQ1. Or, if we draw OB equal and parallel to P'Q', and complete the parallelogram OACB, PQ + P'Q' = OA + OB = OC = PQ. Let us consider the consequences of adopting this definition. If the coordinates of B are x', y', those of the middle point of AB are I (x + x'), (y + y'), and those of C are x + x', y + y'. Hence [x, Y] + [x, ] = x +, y +y']............ (3), which may be regarded as the symbolic definition of addition of displacements. We observe that [', y'] + [x, y] = [x + X, y + y] = [x + a', y + y'] = [w, y] + [I, y'].

Page 70

70 COMPLEX NUMBERS [III In other words, addition of displacements obeys the commutative law expressed in ordinary algebra by the equation a + b = b + a. Looked at geometrically, this expresses the obvious fact that if we move from P first through a distance PQ2 equal and parallel to P'Q', and then through a distance equal and parallel to PQ, we shall arrive at the same point Q1 as before. Again, since [x, y] + [x, y] = [2x, 2y] = 2 [x, y], our definition of addition agrees with that previously adopted for multiplication by a number. In particular [w, y] = [x, 0] + [0, y].....................(4). Here [x, 0] denotes a displacement through a distance x in a direction parallel to OX. It is in fact what we previously denoted by [x], when we were considering only displacements along a line. We call [y, 0] and [0, y] the components of [x, y], and [x, y] their resultant.x When we have once defined addition of two displacements there is no further difficulty in the way of defining addition of any number. Thus (by definition) [, y] + [I, y] + [", y"] = ([E, y] + [', y]) + [", '] = [x + X, y + y'] + [", y] = [X + X + X", y + y' + y"]. We define subtraction of displacements by the equation [, y] - [ ', y'] = [x, y] + (- [', y'])............(5), which is the same thing as [x, y] + [-', - y'] or as [x - ', y - y']. In particular [I, y] - [, ] = [0, ]. The displacement [0, 0] leaves the particle where it was; it is the zero displacement, and we agree to write [0, 0] = 0. Examples XXII. 1. Addition of displacements, and multiplication of displacements by numbers, obey all the ordinary laws of algebra, expressed by the equations, (i) a [p3x, 3Y] = 1 [ax, ay] = [a3x, a3y], (ii) ([x, y]+[', y'])+ [x", y"] =[, A] + ([x', y'] +[x", /y"), (iii) [x, y] +[, y'][, y'] += [, ]+ [, (iv) (a + ) [x, y] = a [, y]+/3 [x, y], (v) a{[x, y]+[X', y ]} = a, [ ]+a [', y'].

Page 71

27] COMPLEX NUMBERS 71 [We have already proved (iii). The remaining equations follow with equal ease from the definitions. The reader should in each case consider the geometrical significance of the equation, as we did above in the case of (iii).] 2. If 1X is the middle point of PQ, 01f1- (OP+ 0Q). More generally if J1 divides PQ in the ratio,: X 0 X OP-+ OQ.. \+L X+ 1 3. If G is the centre of mass of equal particles at P1, P,..., P,, 0o=(OPi + oP +... + OPn)/n. 4. If P, Q, R are collinear points in the plane, it is possible to find real numbers a, 3, y, not all zero, and such that a. OP+3. OQ+y. OR=O; and conversely. [This is really only another way of stating Ex. 2.] 5. If AB and AD are two displacements not in the same straight line, and a. AB+,3. AD=y. AB+8. AD, then a=y and 3=8. [Take ABi=a. AB, AD =-3. AD. Complete the parallelogram AB1P1D1. Then AP,=a. AB+3. AD. It is evident that AP1 can only be expressed in this form in one way, whence the theorem follows.] 6. ABCD is a parallelogram. Through Q, a point inside the parallelogram, RQS and TQU are drawn parallel to the sides. Show that D UC R U, TS intersect on A C. [Let the ratios A T: AB, AR: AD Q be denoted by a, 3. Then AT=a.AB, AR==3. AD, AU=a. AB+AD, AS=AB+3.AD. Let RU meet AC in P. Then, A T B since R, U, P are collinear FIG. 27. X + +p AP= AR+ ' AU, X+pL XA+/ where A/X is the ratio in which P divides R U. That is to say AP= a ~i - +l _ AB~ ~ AD. X~M X~,t But since P lies on AC, AP is a numerical multiple of AC; say AP=k. AC=k.AB+k. AD. Hence (Ex. 5) at =,3X + = = (X + /) k, from which we deduce k=a3/(a+/0- 1).

Page 72

72 COMPLEX NUMBERS [III The symmetry of this result shows that a similar argument would also give ~AP' = AC, if P' is the point where TS meets AC. Hence P and P' are the same point.] 7. ABCD is a parallelogram, and M the middle point of AB. Show that DM trisects and is trisected by A C*. 28. Multiplication of displacements. So far we have made no attempt to attach any meaning whatever to the notion of the product of two displacements. The only kind of multiplication which we have considered is that in which a displacement is multiplied by a mere?miber. The expression [, y] x [x, y'] so far means nothing, and we are at liberty to define it to mean anything we like. It is, however, fairly clear that if any definition of such a product is to be of.any use, the product of two displacements nmst itself be a displacement. We might, for example, define it as being equal to [x +, y + y]; in other words, we might agree that the product of two displacements was to be always equal to their sum. But there would be two serious objections to such a definition. In the first place our definition would be futile. We should only be introducing a new method of expressing something which we can perfectly well express without it. In the second place our definition would be inconvenient and misleading for the following reasons. If a is a real number, we have already defined a [x, y] as [ax, ay]. Now, as we saw in ~ 25, the real number a may itself from one point of view be regarded as a displacement, viz. the displacement [a] along the axis OX, or, in our later notation, the displacement [a, 0]. It is therefore, if not absolutely necessary, at any rate most desirable, that our definition should be such that [a, O] [x, y] = [ax, cy], and the suggested definition does not give this result. A more reasonable definition might appear to be [, y] [x, y'] = [xx', yy']. * The two preceding examples are taken from Willard Gibbs' Vector Analysis.

Page 73

28, 29] COMPLEX NUMBERS 73 But this would give [a, 0] [x, y] = [ax, 0], and so this also would be open to the second objection. In fact, it is by no means obvious what is the best meaning to attach to the product [x, y] [x', y']. All that is clear is (1) that, if our definition is to be of any use, this product must itself be a displacement whose coordinates depend on x and y, or in other words that we must have [x, y] [x', y']= [X, Y], where X and Y are functions of x, y, x', and y'; (2) that the definition must be such as to agree with the equation [., 0] [x', y] = [xx', xy], and (3) that the definition must obey the ordinary commutative, distributive, and associative laws of multiplication, so that x, y] [x', y'] = [x', y'] [x, y], ([x, y] + [x', y']) [X", y'] = [x, y] [x", y"] + [x)' y'] [x", y"], [x, Y] ([x', y] + [x", y']) = [x, ] [ ', y'] + [x, y] [X", y'], and [x, y] ([x', y'] [x", y]) = ([x, y] [x', y') [x", y"]. 29. The right definition to take is suggested as follows. We know that, if OAB, OCD are two similar triangles, the angles corresponding in the order in which they are written, then OB/OA = OD/OC, D 0 FIG. 28.

Page 74

74 COMPLEX NUMBERS [III or OB. OC = OA. OD. This suggests that we should try to define multiplication and division of displacements in such a way that OB/OA = OD/OC, OB. OC= OA. OD. Now let OB = [x, y], OC = [x', y'], OD = [X, Y], and suppose that A is the point (1, 0), so that OA = [1, 0]. Then OA. OD = [1, 0] [X, Y] = [X, Y], and so [x, ] [x', y'] = [X, Y]. The product OB. 00 is therefore to be defined as OD, D being obtained by constructing on OC a triangle similar to OAB. In order to free this definition from ambiguity, it should be observed that on OC we can describe two such triangles, OGD and OCD'. We choose that for which the angle COD is equal to AOB in sign as well as in magnitude. We say that th t two triangles are then similar in the same sense. If the polar coordinates of B and C are (p, 0) and (a, 0), so that =p cos0, y =psin0, x' a- cos, y' = osin b, the polar coordinates of D are evidently pa and 0 + b. Hence X = pa cos(0g + p) = xx'- yy', Y = pa sin (0 + 4) = xy' + y'. The required definition is therefore [x, y] x', y'] = [xx' - yy', xy' + yx']........... (6). We observe (1) that if y =O, X = xx', Y=xy', as we desired; (2) that the right-hand side is not altered if we interchange x and x', and y and y', so that [x, Y] [c', y'] = [x', y] [x, y]; and (3) that {[*, y] + [E', y']} [Ax, y"] = [x + x, y y'] [+ " y ] = [(x + c') ' - (y + y') y", (x + x') y" + (y + y') "] = [$x. - yy", yy" + yX"] + [C' _- y, x y + y/ ] = [c, y] [V", y"] + [x', y] [", y"].

Page 75

29, 30] COMPLEX NUMBERS 75 Similarly we can verify that all the equations at the end of ~ 28 are satisfied. Thus the definition (6) fulfils all the requirements which we made of it in ~ 28. Example. Show directly from the geometrical definition given above that multiplication of displacements obeys the commutative and distributive laws. [Take the commutative law for example. The product OB. OC is OD (Fig. 28), COD being similar to AOB. To construct the product OC. OB we should have to construct on OB a triangle BOD1 simnilar to AOC; and so what we want to prove is that D and D1 coincide, or that BOD is similar to A OC. This is an easy piece of elementary geometry.] 30. Complex numbers. Just as to a displacement [x] along OX correspond a point (x) and a real number x, so to a displacement [x, y] in the plane correspond a point (x, y) and a pair of real numbers x, y. We shall find it convenient to denote this pair of real numbers x, y by the symbol x + yi. The reason for the choice of this notation will appear later. For the present the reader must regard x + yi as simply another way of writing [x, y]. The expression x + yi is called a complex number. We proceed next to define equivalence, addition, and zmultiplication of complex numbers. To every complex number corresponds a displacement. Two complex numbers are equivalent if the corresponding displacements are equivalent. The sum or product of two complex numbers is the complex number which corresponds to the sum or product of the two corresponding displacements. Thus x+yi=x'+y'i if x=x', y=y'.............(1), (x + yi) + (x'+ y'i) = (x + x') + (y + y') i.........(2), (x + yi) (x' + y'i) = xx'- yy' + (y' + yx') i.........(3). In particular, if a is any real number, a (x + yi) = ax + ayi. The complex numbers of the particular form x + Oi may be regarded as equivalent to the corresponding real numbers x; thus x + Oi = x, and in particular 0 + Oi = 0.

Page 76

76 COMPLEX NUMBERS [III Positive integral powers and polynomials of complex numbers are then defined as in ordinary algebra. Thus, by putting x = x', y = y' in (3), we obtain (x + yi)2 = (X + yi) (X + yi) = x2- _ 2 + 2xyi, (x + yi)2 + 2 (x + yi) + 3 = x2 - y" + 2x + 3 + (2xy + 2y) i. The reader will easily verify for himself that addition and multiplication of complex numbers obey the ordinary laws of algebra, expressed by the equations x + yi + (x' + y'i) = (' + y'i) + (x + yi), {(x + yi) + (' + y')} + (" + y"i) = (X + yi) + {(x' + y'i) + (" + y"i)}, (x + yi) (x + y'i) = (x' + y'i) (x + yi), (x + yi) {(x' + y'i) + (x" + y"i)} = ( + yi) (' + y'i) + (x + yi) (x" + y"i), {(x y + y) + (x+ y'i) (x" +y"i) = (x + yi) (x" + y"i) + (' + y'i) (x" + y"i), (x + yi + ) { ( i + ) ( + y"i)} = {(X + yi) (x' + y'i)} (x" + yi), the proofs of these equations being practically the same as those of the corresponding equations for the corresponding displacements. Subtraction and division of complex numbers are defined as in ordinary algebra. Thus we may define ( + yi) - (x'+ yi) as (x + yi) + {- (x + y'i)} = x + yi + (- x' - yi) = (x - ) + ( - y') i; or again, as the number f + ri such that (c' + y'i) + (f + )i) = x + yi, which leads to the same result. And (x + yi)/(x' + y'i) is defined as being the complex number t + irq such that (x/ + yi) (I + ni)= x + yi, or x - y' + (x' + y'l) i = x + yi, or x' -yr = x, 'l + y' = y...............(4). Solving these equations for ~ and q, we obtain cxx' + yy' y' - xy' x'2 + y'2, x '2 + y' This solution fails if x' and y' are both zero, i.e. if x'+ y'i = 0. Thus subtraction is always possible; division is always possible unless the divisor is zero.

Page 77

30-32] COMPLEX NUMBERS 77 Examples. (1) From a geometrical point of view the problem of the division of the displacement OB by OC is that of finding D so that the triangles B COB, AOD are similar, and this is evidently possible (and the solution unique) unless C coincides with 0, or OC=. (2) The numbers x-t+yi, x-yi are said to be coljugate. Verify that (x + yi) (x - yi) = x+2 y/ / so that the product of two conjugate numbers is real, and that O A x+yi (x+yi) (x' -y'i) A x' + 'i (x'i) (x' -y'i) FIG. 29 xx' +yy' + i (x'y - xy') X,2 _+Y/2 31. One most important property of real numbers is that known as the factor theorem, which asserts that the product of two numbers cannot be zero unless one of the two is itself zero. To prove that this is also true of complex numbers we put x = 0, y = 0 in the equations (4) of the preceding section. Then x' -y' =O, x'l+y'- = 0. These equations give 0 = 0, X = 0, i.e. +4 i =0, unless x' = 0 and y' = 0, or x' + y'i = 0. Thus x + yi cannot vanish unless either x' + y'i or: + ai vanishes. 32. The equation i = - 1. We agreed to use, instead of x + Oi, the simpler notation x. Similarly, instead of 0 + yi, we shall use yi. The particular complex number li we shall denote simply by i. It is the number which corresponds to a unit displacement along OY. Also i = ii = (O+ 1i) (0 + 1i) = (0. 0 - 1. 1)+ (0.1 + 1. )i= - 1. Similarly (-)2=- 1. Thus the complex numbers + i satisfy the equation 2 = - 1. Now the reader will- easily satisfy himself that the upshot of the rules for addition and multiplication of complex numbers is this, that we operate with complex numbers in exactly the same

Page 78

78 COMPLEX NUMBERS [III way as with real numbers, treating the symbol i as itself a number, but replacing the product ii = i2 by - whenever it occurs. Thus, for example, (x + yi) (x' + y'i) = xx' + xy'i + yx'i + yy'i2 = (xx' - yy) + (xy' + yx') i. 33. The geometrical interpretation of multiplication by i. Since (x + iy) i = - y + ix, it follows that if x + iy corresponds to OP, and OQ is drawn equal to OP, and so that POQ is a positive right angle, then (x + iy) i corresponds to OQ. In other words, multiplication of a complex number by i turns the corresponding displacement through a right angle. We might, had we so chosen, have started from this point of view. We might have regarded x as a length measured along OX, and xi as the same length measured along 0 Y, and regarded i as a symbol of operation equivalent to turning the length x through a right angle round 0. We should then naturally have been led to regard xi2 = xii as denoting the result of twice turning x through a right angle. The result of this is to bring it into a position again lying along OX but pointing in the opposite direction, so that we should have been led to the equation xi2 = - X. Then, denoting li (a unit length along OY) simply by i, we should have found i2=-1, and the rule for multiplication of complex numbers would have followed immediately. 34. The equations ax +l = 0, ax2 + 2bx + c= 0. There is no real number x such that x2 + 1 =0; this is expressed by saying that the equation has no real roots. But, as we have just seen, the two complex numbers + i satisfy this equation. We express this by saying that the equation has the two complex roots + i. Since i satisfies x2=- 1, it is sometimes written in the form V(-1). Complex numbers are sometimes called imaginary, to distinguish them from real numbers. The expression is by no means a happily chosen one, but it is firmly established and

Page 79

32-34] COMPLEX NUMBERS 79 has to be accepted. It cannot, however, be too strongly impressed upon the reader that in reality an 'imaginary number' is neither 'imaginary' nor 'a number' at all. The 'real' numbers would be better described as 'common' or 'ordinary' numbers; they are the numbers of arithmetic. A 'complex' or 'imaginary number' is really not a number at all, but, as should be clear from the following discussion, a pair of numbers (x, y), united symbolically, for purposes purely of convenience, in the form x + yi. And such a pair of numbers is no less 'real' than any ordinary number such' as 2, or than the paper on which this is printed, or than the Solar System. In reality i=O+li stands for the pair of numbers (0, 1), and may be represented geometrically by a point or by the displacement [0, 1]. And when we say that i is a root of the equation 2 + 1 = 0, what we mean is simply that we have defined a method of combining such pairs of numbers (or displacements) which we call 'multiplication,' and which, when we so combine (0, 1) with itself, gives the result (- 1, 0). Now let us consider the more general equation ax 2+ 2bx + c = 0, where a, b, c are real numbers. If b2 > ac, the ordinary method of solution gives two real roots {-b + V(b2 - ac)}/a. If b2 < ac, the equation has no real roots. It may be written in the form {x + (b/a)}2 = - (ac - b2)/a2, an equation which is evidently satisfied if we substitute for x + (b/a) the complex number + i V(ac - b2)/a. We express this by saying that the equation has the two complex roots {-b + i V(ac - b2)}/a. If we agree as a matter of convention to say that when b2 = ac (in which case the equation is satisfied by one value of x only, viz. -b/a), the equation has two equal roots, we can say that a quadratic equation with real coefficients has two roots in all

Page 80

80 COMPLEX NUMBERS cases, two distinct real roots, two equal real roots, or two distinct complex roots. The question is naturally suggested whether a quadratic equation may not, when complex roots are once admitted, have more than two roots. It is easy to see that this is not possible. In fact, its impossibility may be proved by precisely the same chain of reasoning as is used in elementary algebra to prove that an equation of the nth degree cannot have more than n real roots. Let z = x + yi, and let f(z) denote any polynomial in zx, with real or complex coefficients. Then we prove in succession: (1) that the remainder, when j (z) is divided by z - a (a being any real or complex number), is f(a); (2) if a is a root of the equation f (z) = 0, then f(z) is divisible by z-a; (3) if f(z) is of the nth degree, and f(z) = 0 has the n roots a1, a.2,..., an, then f(z) = A (z - a) (z - a,)... (z - a,,), where A is a constant (real or complex), in fact the coefficient of zn in f(z). From the last result it follows at once that f(z) cannot have more than n roots. We conclude that a quadratic equation with real coefficients has exactly two roots. We shall see later on that a similar theorem is true for an equation of any degree and with either real or complex coefficients: an equation of the nth degree has exactly n roots. The only point in the proof which presents any difficulty is the first, viz. the proof that any equation must have at least one root. This we must postpone for the present. We may, however, at once call attention to one very interesting result of this theorem. In the theory of number we start from the positive integers, and from the ideas of addition and multiplication, and the converse operations of subtraction and division. We find that these operations are not always possible unless we admit new kinds of numbers. We can only attach a meaning to 3- 7 if we admit negative numbers, or to f if we admit rational fractions. When we extend our list of arithmetical operations so as to include root extraction and the solution of equations, we find * A polynomial in z=x+yi is of course defined in exactly the same way as a polynomial in x, i.e. as an expression of the form aoz + ailz-I +... + an.

Page 81

34, 35] COMPLEX NUMBERS 81 that some of them, such as the extraction of the square root of a number which (like 2) is not a perfect square, are not possible unless we widen our conception of a number, and admit the irrational numbers of Chap. I. Others, such as the extraction of the square root of -1, are not possible unless we go still further, and admit the complex numbers of this chapter. And it would not be unnatural to suppose that, when we come to consider equations of higher degree, some might prove to be insoluble even by the aid of complex numbers, and that thus we might be led to the considerations of higher and higher types of, so to say, hyper-complex numbers. The fact that any algebraical equation whatever can be solved by means of ordinary complex numbers shows that this is not the case. The application of any of the ordinary algebraical operations to complex numbers will yield only complex numbers. In technical language 'the field of the complex numbers is closed for algebraical operations.' Before we pass on to other matters, let us add that all theorems of elementary algebra which are proved merely by the application of the rules of addition and multiplication are true whether the numbers which occur in them are real or complex, since the rules referred to apply to complex as well as real numbers. For example, we know that if a and /3 are the roots of ax2 + 2bx + c = 0, then a + =- -(2b/a), a/3 = (c/a). Similarly, if a, 3, y are the roots of ax3 + 3bx2 + 3cx + d = 0, then a+ + y = -(3b/a), y + ya + a/3 = (3c/a), a3y=- (dl/a). All such theorems as these are true whether a, b,... a, 3,... are real or complex. 35. The Argand diagram. Let P be the point (x, y) in Fig. 30, r, 0 its polar coordinates, so that x=rcos0, y=rsin0, r =/(x2+y2), cos:sinO:l::x:y:r. We shall denote the complex number x +yi by z, and we 6 H. A.

Page 82

82 COMPLEX NUMBERS [III shall call z the complex variable. We shall call P the point z, or the point corresponding Y to z; and z the argument of P. We shall call x the real part, y the imaginary part, r P the modulus, and 0 the amplitude of z, and we shall write x=R(2), y=I(z), y = z, 0 = am z. It should be observed that r o X x is essentially positive (except FIc. 30. when z = 0). When y = 0 we shall say that z is real, when x=0 that z is purely imaginary. Two numbers x + yi, x - yi which differ only in the signs of their imaginary parts, we shall call conjugate. It will be observed that the sum (2x) of two conjugate numbers and their product (x2 + y2) are both real, that their moduli (/x2 + y2) are equal, and that their product is equal to the square of the modulus of either. The roots of a quadratic with real coefficients, for example, are conjugate, when not real. It must be observed that 0 or am z is a many-valued function of z, having an infinity of values differing by multiples of 27r. Any one of its values is an angle by turning through which about 0 a line originally lying along OX will come to lie along OP. We shall denote that one of-these values which lies between - r and + 7r as the principal value of the amplitude of z. This definition is unambiguous except when one of the values is wr, in which case - r is also a value. In this case we must make some special provision as to which value is to be regarded as the principal value. In general, when we speak of the amplitude of z we shall, unless the contrary is stated, mean the principal value of the amplitude. Complex numbers were first studied from a geometrical point of view by Wessel, Gauss and Argand, and the figure is usually known as the Argand diagram. 36. De Moivre's Theorem. The following statements follow immediately from the definitions of addition and multiplication.

Page 83

3.5, 36] COMPLEX NUMBERS 83 (1) The real (or imaginary) part of the sum of two complex numbers is the sum of their real (or imaginary) parts. (2) The modulus of the product of two complex numbers is the product of their moduli. (3) One value of the amplitude of the product of two complex numbers is the sum of their amplitudes. It should be observed that it is not true that the principal value of am (zz') is the sum of the principal values of am z and am '. For example, if z=z'= - I + i, the principal values of the amplitudes of z and z' are each w r. But zz' - 2i, and the principal value of am (zz') is - 1Tr and not I 7r. The two last theorems may be expressed in the equation r (cos 0 + i sin 0) x p (cos +- i sin /) = rp {cos (0 + )) + i sin (0 + j)}, which may be proved at once by multiplying out and using the ordinary trigonometrical formulae for cos (0 + b) and sin (0 + b). More generally r1 (cos 0, + i sin 0,) x r2 (cos 0d + i sin 02) x... x rn (cos 0, + i sin On) -rr2... r {cos ( + 02 +... + 0n)+i sin (O+ 0 2+... + 0)}. A particularly interesting case is that in which r= r2 =... = r = 1, 0 = 0 =... =0 = 0. We then obtain the equation (cos 0 + i sin 0)n = cos nO + i sin nO, where n is any positive integer: a result known as De Moivre's Theoren '. Again, if z = r (cos 0 + i sin 0), 1/z = (cos 0 - i sin 0)/r. Thus the modulus of the reciprocal of z is the reciprocal of the modulus of z, and the amplitude of the reciprocal is the amplitude of z with its sign changed. Hence we deduce from (2) and (3): (4) The modulus of the quotient of two complex numbers is the quotient of their moduli. (5) One value of the amplitude of the quotient of two complex numbers is the difference of their amplitudes. * It will sometimes be convenient, for the sake of brevity, to denote cos 0 + i sin 0 by Cis 0: in this notation, suggested by Profs. Harkness and Morley, De Moivre's theorem is expressed by the equation (Cis 0)"= Cis nO. 6-2

Page 84

84 COMPLEX NUMBERS [III Again (cos 6 + i sin l)-n = (cos 0 - i sin 0)n = {cos (- 0) + i sin (- 0)}' = cos (- nO) + i sin (- n0). Hence De ioivre's Theorem holds for all integral values of n, positive or negative. A large number of important applications of this theorem will be given later on in this chapter (~~ 38 et seq.). To the theorems (1)-(5) we may add the following theorem, which is also of very great importance. (6) The modulus of the sum of any number of complex quantities is not greater than the sum of their moduli. p.v) Q P U p,'t Up,, FIG. 31. Let OP, OP',... be the displacements corresponding to the various complex quantities. Draw PQ equal and parallel to OP', QR equal and parallel to OP", and so on. Finally we reach a point U, such that OU = OP OP' + O +.... The length OU is the modulus of the sum of the complex quantities, whereas the sum of their moduli is the total length of the broken line OPQR... U. The truth of the theorem is now obvious (see also Ex. XXIII. 1). 37. We add some theorems concerning rational functions of complex numbers. A rational function of the complex variable z is defined exactly as is a rational function of a real variable x, viz. as the quotient of two polynomials in z.

Page 85

36, 37] COMPLEX NUMBERS 85 THEOREM 1. If R (x + yi) is a rational function of x + yi, it can be reduced to the form X + Yi, where X and Y are rational functions of x and y with real coefficients. In the first place it is evident that any polynomial P (x + yi) can be reduced, in virtue of the definitions of addition and multiplication, to the form A + Bi, where A and B are polynomials in x and y with real coefficients. Similarly Q (x + yi) can be reduced to the form C +Di. Hence R (x + yi) = P (x + yi)/Q (x + yi) can be expressed in the form (A + B) + i)( Di) = (A + Bi) (C - Di)l(C +Di) (C - Di) AC+BD BC-AD. C2~+ D2 2+ C~+D2 which proves the theorem. THEOREM 2. If R (x + yi) = X + i, R denoting a rational function as before, but with real coefficients, then R(x - yi) = X -Yi. In the first place this is easily verified for a power (x + yi)n by actual expansion. It follows by addition that the theorem is true for any polynomial with real coefficients. Hence, in the notation used above, A-Bi AC+BD BC-AD. R( - yi)= C- = C2+D2 - C2+D2 l, the reduction being the same as before except that the sign of i is changed throughout. It is evident that results similar to those of Theorems 1 and 2 hold for functions of any number of complex variables. THEOREM 3. The roots of an equation aozn + alzn-1 +... + an = 0, whose coefficients are real, may, in so far as they are not themselves real, be arranged in conjugate pairs. For it follows from Theorem 2 that if x + yi is a root, so is - yi. A particular case of this theorem is the result (~ 34) that the roots of a quadratic equation with real coefficients are either real or conjugate. This theorem is sometimes stated as follows-in an equation

Page 86

86 COMPLEX NUMBERS [III with real coefficients complex roots occur in conjugate pairs. It should be compared with the result of Exs. IV. 10, which may be stated as follows-in an equation with rational coefficients irrational roots occur in conjugate pairs*. Examples XXIII. 1. Prove theorem (6) of ~ 36 directly from the definitions and without the aid of geometrical considerations. [First, to prove that z +z' I z I +I z' I is to prove that -+-)- (- < {V/(x +y2) ( + y/(2 +2)}2. The theorem is then easily extended to the general case.] 2. The one and only case in which IzI+I I'+...= I+-'+..., is that in which the numbers z, z',... have all the same amplitude. Prove this both geometrically and analytically. 3. The modulus of the sum of any number of complex numbers is not less than the sum of their real (or imaginary) parts. 4. If the sum and product of two complex numbers are both real the two numbers must either be real or conjugate. 5. If a+bV/2+(c+d^.2)i=A+B^V2+(C+DV2)i, where a, b, c, d, A, B, C, D are real rational numbers, then a=A, b=B, c=C, d=D. 6. Express the following numbers in the form A+ Bi, where A and B are real numbers: (1 +i)2, (1 i)2, (3- 2i)/(2 + 3i), (X +pi)/(X - i), (i)'2 2(1 Y)2 (1+i)3_ (1)- 3 (A+i2 )Q-( 2 l-i) ' \i+~i/ '. '- \l-i} l+ i/ ' \x —^'/}- \x +~'1 ' X and p denoting any real numbers. 7. Express the following functions of z=x++yi in the form X+ Yi, where X and Y are real functions of x and y: z2, z3, zn, 1/z, z+(/z), (l — z)/(- z), (a + 3z)/(y + 4z), a, 3, y, denoting real numbers. 8. Find the moduli of the numbers and functions in the two preceding examples. 9. The two lines joining the points z=a, z=b and z=c, z=d will be perpendicular if am ( — = + t 2 r; i.e. if (a- b)/(c - d) is purely imaginary. What is the condition that the lines should be parallel? 10. The three angular points of a triangle are given by z=a, Z=/3, z=y, where a, 3, y are complex quantities. Establish the following propositions: (i) The centre of gravity is given by z= (a + y). (ii) The circum-centre is given by I z- a =I -3 = I = z - y. * The numbers a + /b, a -,/b, where a, b are rational, are sometimes said to be 'conjugate.'

Page 87

37] COMPLEX NUMBERS 87 (iii) The three perpendiculars from the angular points on the opposite sides meet in a point given by Z-a z- z-7 V- -ya \a [If A, B, C are the vertices, and P any point z, the condition that AP should be perpendicular to BC (Ex. 9) is that (z - a)/(3 - y) should be purely imaginary, or that R (z - a) R (, - y) + I (z- a) I (, - y) = 0. This equation and the two similar equations, obtained by permuting a,,3, y cyclically, are satisfied by the same value of z, as appears from the fact that the sum of the three left-hand sides is zero (so that the third equation is a consequence of the first two). This proves the theorem.] (iv) There is a point P inside the triangle such that CBP= ACP= BAP=co. A Iso cot co = cot A + cot B + cot C. [From the equations* = CBP= am (z- 3) - am (y- d), cot am (z - 3)= {R (z - I)/I(z- 3)}, etc., we deduce cot c {I (z- 3) R(y ) - R (( -R(z ) I (y - )} =R (z- ) R (y-) + I(z -3) I(y- ).........(1). This equation, and the two similar equations obtained by permuting a, /3, y cyclically, suffice to determine cot co and the real and imaginary parts of z. If we add the three equations z disappears and we are left with cot o {17 () R (y)- R (/) I()} =- {R () (y-3+() I(-/)}............ (2) the sign of summation referring to the three terms produced by cyclical interchange of a, /, y. Now cot A cot {am (y - a) - am ( - a)} R (y-a) ( - a)+I(y- a) I(3-a) I(y-a) R (- a) - R (y-a) I(/ - a)' and a little reduction shows that the denominator of this fraction is equal to the coefficient of cot co in equation (2), with its sign changed. Hence we can deduce that cot co = cot A + cot B + cot C.] 11. The two triangles whose vertices are the points a, b, c and x, y, z respectively will be similar if 1 1 =0. a b c [The condition required is that ABIA C = XY/ XZ (large letters denoting the points whose arguments are the corresponding small letters), or (b -a)/(c - a)=(y - x)(z - x), which is the same as-.the given condition.] * We suppose that as we go round the triangle in the direction ABC we leave it on our left.

Page 88

88 COMPLEX NUMBERS [III 12. Deduce from the last example that if the points x, y, z are collinear we can find real numbers a, 3, y such that a+q-+y=0 and ax+f3y+yz=O, and conversely (cf. Exs. XXII. 4). [Use the fact that in this case the triangle formed by x, y, z is similar to a certain line-triangle on the axis OX, and apply the result of the last example.] 13. The general equation of the first degree with complex coefficients. The equation az+b = O has the one solution z = - (b/a), unless a = O. If we put a=a+ia', b=/3+i', z=x+iy, and equate real and imaginary parts, we obtain two equations to determine the two real quantities x and y. The equation will have a real root if y=0, which gives ax+j3=O, a'x+3'=O, and the condition that these equations should be consistent is a' - a'3 = 0. 14. The general quadratic equation with complex coefficients. This equation is (a+ iA) 2+2 (b+iB)z+(c+iC)=0. Unless a and A are both zero we can divide through by a +iA. Hence we may consider z+ 2 (b+iB)z+(c+iC)=O........................(1), as the standard form of our equation. Putting z=x+iy and equating real and imaginary parts we obtain a pair of simultaneous equations for x and y, viz. X2-y2+2 (bx-By)+c=0, 2xy+2(by+Bx)+C=0. If we put x+b=$, y+B=r7, b2-B2 - C=h, 2bB-C=k, these equations become 2_ - 2 = h, 2? = k. Squaring and adding we obtain 2 + = (h2 +k2), = +~ {IV(/ + k2)+t}, =~ + J~ {/(12+k2)-h}. We must choose the. signs so that rjq has the sign of k: i.e. if k is positive we must take like signs, if k is negative unlike signs. Conditions for equal roots. The two roots can only be equal if both the square roots above vanish, i.e. if h =0, = 0, or if c=b2-B2, C=2bB. These conditions are equivalent to the single condition c +iC=(b +iB)2 which obviously expresses the fact that the left-hand side of (1) is a perfect square. Condition for a real root. If x2 + 2 (b + iB) x + (c +i) =, where x is real, then x2+2bx+c=0, 2Bx+C=O. Eliminating x we find that the required condition is C2- 4bBC+4cB2=O. Condition for a purely imaginary root. This is easily found to be C2 - 4bBC- 4b2c=0. Conditions for a pair of conjugate complex roots. Since the sum and the product of two conjugate complex quantities are both real, b+iB and c +iC must both be real, i.e. B=0, C=0. Thus the equation (1) can have a pair of conjugate complex roots only if its coefficients are real. The reader should

Page 89

37] COMPLEX NUMBERS 89 verify this conclusion by means of the explicit expressions of the roots. Moreover, even in this case, if b2 _c the roots will be real. Hence for a pair of conjugate roots we must have B=0, C=O, b2<c. 15. The Cubic equation. Consider the cubic equation z3+3Hz+ G=0, where G and H are complex quantities, it being given that the equation has (a) a real root, (b) a purely imaginary root, (c) a pair of conjugate roots. If H=X + ip, G= p +ia, we arrive at the following conclusions. (a) A real root. If 4=0 O the real root is - o-/3/x, and o- + 27Xp2S - 27p3p = 0. On the other hand, if /=0, we must also have o-=0, and the coefficients of the equation are real. In this case there may be three real roots. (b) A purely imaginary root. If [4 0 the purely imaginary root is (p/3p) i, and p3 -27X/u2p - 273o- =0. If /i=0, then also p=0, and the root is iy, where y is given by the equation y3 - 3Xy - =0, which has real coefficients. In this case there may be three purely imaginary roots. (c) A pair of conjugate roots. Let these be xtyi. Then since the sum of the three roots is zero the third root must be - 2x. From the relations between the coefficients and the roots of an equation we deduce 2 _ 32 = 3H, 2x (x2 +y2)= G. Hence G and H must both be real. In each case we can either find a root (in which case the equation can be reduced to a quadratic by dividing by a known factor) or we can reduce the solution of the equation to the solution of a cubic equation with real coefficients. 16. The cubic equation x3+alx2+a2x+a3=O, where al=Al+iA',... has a pair of conjugate imaginary roots. Prove that provided A3' 4=0 the remaining root is - Al'a3/A3', and two identical relations hold between A1, Al', A2,.... Examine the case in which A3'=0. 17. Prove that if z3+ 3Hz+ G=0 has two imaginary roots, the equation 8a3+6a- G= O has one real root which is the real part a of the imaginary roots of the original equation; and show that a has the same sign as G. 18. An equation of any order with complex coefficients will in general have no real roots, nor pairs of conjugate complex roots. How many conditions must be satisfied by the coefficients in order that the equation should have (a) a real root, (b) a pair of conjugate roots 2 19. Coaxal circles. In Fig. 32, let a, b, z be the arguments of A, B, P. z-b Then amz- =APB, z-a the principal value of the amplitude being taken, and APB being a positive angle less than 7r. If the two circles shown in the figure are equal, and

Page 90

90 COMPLEX NUMBERS [III z', zl, zl' are the arguments of P', P1, P1', and APB=O, it is easy to see that z'- b zl-b_ am, a = -, P, z'-a Z1 / \ zl - b and am 2 — r= / O. zlf -a The locus defined by the equation z-b am — =. p, z- a where 0 is constant, is the arc APB. By writing wr-, -0, -7r + 0 for 0 we obtain A the other three arcs shown. The system of equations obtained by supposing that 0 is a parameter, varying from -rr to +7r, represents the system of circles which can be drawn througlh the points A, B. It should however be observed that each circle has to be divided into two parts to which correspond different values of 0. FIG. 32. 20. Now let us consider the equation I (z - b)/(z- a) I = X.............................. (1), where X is a constant. Take the point K on BA produced so that KPA=KBP. Then the triangles KPA, HBP are similar, and so AP/BP= KP/IB = KA/IKP= X. Hence KA/KB=X2, and therefore K is a fixed point for all positions of P which satisfy the equation (1). Also KP2=KA. KB=const. Hence the locus of P is a circle whose centre is K. For different values of X the equation (1) therefore represents a system of circles. Every circle of this system cuts at right angles every circle of the system of Ex. 19. For the equation KP2 =KA. KB shows that KP is a tangent to the circle APB. The system of Ex. 19 is a system of coaxal circles of the common point kind. The system of Ex. 20 is called a system of coaxal circles of the limiting point kind; if X is very small the circle is a very small circle containing B in its interior, if X is very large a very small circle containing A in its interior: it is from this fact that the name is derived. 21. Bilinear Transformations. Consider the equation = Z + a.......................................(1), where z=x+iy and Z=X+iY are two complex variables which we may suppose to be represented in two planes xoy, XOY. To every value of z

Page 91

37] COMPLEX NUMBERS 91 corresponds one of Z, and conversely. If a=a+i3, then x=X+a, y= Y+3, and to the point (x, y) corresponds the point (X, Y). If (x, y) describes a curve of any kind in its plane, (X, Y) describes a curve in its plane. Thus to any figure in one plane corresponds a figure in the other. A passage of this kind from a figure in the plane xoy to a figure in the plane XOY by means of a relation such as (1) between z and Z is called a transformation. In this particular case the relation between corresponding figures is very easily defined. The (X, Y) figure is the same in size, shape, and orientation as the (x, y) figure, but is shifted a distance a to the left, and a distance / downwards. Such a transformation is called a translation. Now consider the equation z= pZ....................................... (2), where p is real. This gives x=pX, y =pY. The two figures are similar and similarly situated about their respective origins, but the scale of the (X, Y) figure is (l/p) times that of the (x, y) figure. Such a transformation is called a magnification. Finally consider the equation z= (cos + i sin ) Z..............................(3). It is clear that I z =Z, am z=am Z+ q, and that the. two figures differ only in that the (X, Y) figure is the (x, y) figure turned about the origin through an angle ) in the negative direction. Such a transformation is called a rotation. The general linear transformation z= aZ + b....................................(4) is a combination of the three transformations (1), (2), (3). For if a =p and am a= ) we can replace (4) by the three equations z=z'+b, z'=pZ', Z'=-(cos ++isin q) Z. Thus the general linear transformation is equivalent to the combination of a translation, a magnification, and a rotation. Next let us consider the transformation z=1/Z..................................... (5). If Z =R and am Z=e, then z = 1/R and amz= -, and to pass from the (x, y) figure to the (X, Y) figure we invert the former with respect to o, with unit radius of inversion, and then construct the image of the new figure in the axis ox (i.e. the symmetrical figure on the other side of ox): We thus obtain a figure in the (x, y) plane, similar in every respect to the (X, Y) figure. Finally consider the transformation z=(aZ+b)/(cZ+d)..............................(6). This is equivalent to the combination of the transformations z=(a/c) + (b-ad) (z'/c), z'=/Z', Z' = cZ+d,

Page 92

92 COMPLEX NUMBERS [III i.e. to a certain combination of transformations of the types already considered. The transformation z = (aZ+ b)/(cZ+ d) is called the general bilinear transformation. Solving for Z we obtain Z= (dz - b)/(cz- a). This is the most general type of transformation for which one and only one value of z corresponds to each value of Z, and conversely. 22. The general bilinear transformation transforms circles into circles. This may be proved in a variety of ways. We may assume the well known theorem in pure geometry, that inversion transforms circles into circles (which may of course in particular cases be straight lines). Or we may take the equation (a+ iHa') (+ iY) + (3 + i3') Y(7+iy') (X+iY)+ (+f i')' assume that x and y satisfy the equation of a circle, calculate x and y in terms of X and Y, and so find the relation between X and Y by straightforward algebra. Or finally we may use the results of Exs. 19 and 20. This is the best and simplest method. If, e.g., the (x, y) circle is I (z-)/(-p) = X, and we substitute for z in terms of Z, we obtain I (Z- a')/(Z- p') I =', whereb - crd b-c pd a-pci where r'= — p — -.. =- X. a -ac '- a - pc a- ac 23. Consider the transformations z=1/Z, z=(1+Z)/(1-Z), and draw the (X, Y) curves which correspond to (1) circles whose centre is the origin, (2) straight lines through the origin, in the (x, y) plane. 24. The condition that the transformation z=(aZ+-b)/(cZ+d) should make the circle x2+y2=l correspond to a straight line in the (X, Y) plane, is a l=l c 1. 25. Cross ratios. The cross ratio (zlz2, Z3Z4) is defined to be (1 - Z3) (Z2- 4) (Z1 - Z4) (Z2- Z3) If the four points zl, Z2, Z3, z4 are on the same line, this agrees with the definition adopted in elementary geometry. There are 24 cross ratios which can be formed from Z, z2, z3, Z4 by permuting the suffixes. These consist of six groups of four equal cross ratios. If one ratio is X the six distinct cross ratios are X, 1 -, /, 1/, (1 -), 1 - (1/), k/(k-1). The four points are said to be harmonic or harmonically related if any one of these is equal to - 1. In this case the six ratios are - 1, - 1, ~, ~, 2, 2. If any cross ratio is real all are real and the four points lie on a circle. For in this case am (1 -- 3) (2 — 4) 0 (or r), (Z1 - Z4) (Z2 - Z3)

Page 93

COMPLEX NUMBERS 93 so that am {( /( 4)} f( - 4)and amn {(2 - 3)/(z2 - Z4)} are either equal or differ by 7r (cf. Ex. 19). If (Z1z2, 3z4) = - 1, we have the two equations Z1 - Z3 Z2 - Z3 am + — = + am -, Z1 - 4.- 2-Z4 The four points Al, A2, A3, A4 (Fig. 33) lie on a circle, A1 and A2 being separated by A3 and A4. Also A1A3/A1A4=A2A3/A2A4. Let 0 be the middle point of A3A4. The equation I - Z3 = _ 2 - 3 3 Z1 -4 l Z2 - 4 (Z1 -23) (Z2 -4) (Zl-Z4) (Z2- Z3) may be put in the form (Z1 + Z2) (3 +z4) = 2 (21Z2 + Z3z4), or, what is the same thing, {Z1- 2 (Z3 + 24)} (2 - 2 (23 + 4)}2 )-33 {FIG 33. {-2 (Z3- z4)}. But this is equivalent to OA,. OA2= OA32= OA42. Hence OA1 and OA2 make equal angles with A3A4, and OA1. OA2=OA3==OA49. It will be observed that the relation between the pairs Al, A2 and A3, A4 is symmetrical. Hence if 0' is the middle point of A1A2, O'A3 and O'A4 are equally inclined to A1A2, and O'A3. O'A4=0'A12=-O'A22. 26. If the points Al, A2 are given by az2+2bz+c=0, and the points A3, A4 by a'Z2+2b'z+c'=O, and 0 is the middle point of A3A4, and ac'+a't —2bb'=0, then OA1, OA2 are equally inclined to A3A4 and OA1. OA2= OA32=OA42. (MlVath. Trip. 1901.) [The pairs Al, A2 and A3, A4 are harmonically related.] 27. The condition that four points should lie on a circle. A sufficient condition is that one (and therefore all) of the cross ratios should be real (Ex. 25); this condition is also necessary. Another form of the condition is that it should be possible to choose real quantities a, /, y such that 1 1 1 =0. a 13 y 2124 + 22 3 22Z4+Z3 21 Z3 4 + Z122 To prove this we observe that the transformation Z= 1/(z- z4) is equivalent to an inversion with respect to the point z4, coupled with a certain reflexion (Ex. 21). If Zl, Z2, Z3 lie on a circle through z4 the corresponding points Z==l/(zl-z4), Z2-=1/(Z2-Z4), Z3=1/(Z3-24) lie on a straight line. Hence (Ex. 12) we can find real quantities a', 3', y' such that a'+/3'+ y'=0 and a'/(zl-z4) +3'(z2-z4) + y/(z3-Z4)=0, and it is easy to prove that this is equivalent to the given condition.

Page 94

94 COMPLEX NUMBERS [III 28. Prove the following analogue of De Moivre's Theorem for real quantities: —if 0i, )2, 03,... is a series of positive acute angles such that tan obm i+ 1 =+ tan, se se tan then tan Om + n= tan 0,, sec bn + sec 0,, tan 0, sec On +, = sec 0, sec q,, + tan M,, tan 0,,, and tan 0/ + see ) = (tan 01 + see 01)c. [Use the method of mathematical induction.] 29. The transformation z=Zm". In this case r=R=n, and 0 and me differ by a multiple of 27r. If Z describes a circle round the origin, z describes a circle round the origin m times. The whole (x, y) plane corresponds to any one of m sectors in the (X, Y) plane, each of angle 27r/m. To each point in the (x, y) plane correspond im points in the (X, Y) plane. 30. Complex functions of a real variable. If f(t), q (t) are two real functions of a real variable t defined for a certain range of values of t, we call Z= f(t) + i (t).................................(1) a complex function of t. We can represent it graphically by drawing the curve x=f(t), y=P (t); the equation of the curve may be obtained by eliminating t between these equations. If z is a polynomial in t, or rational function of t, with complex coefficients, we can express it in the form (1) and so determine the curve represented by the function. (i) Let z=a+(b-a) t, where a and b are complex numbers. If a = a + ia', b = +-i3', then x=a+(3-a) t, y=a'+ (3'-a')t. The curve is the straight line joining the points z= a and z=b. The segment between the points corresponds to the range of values of t from 0 to 1. Find the values of t which correspond to the two produced segments of the line. (ii) If z=c+p(1 +it)(l -it)}, the curve is the circle of centre c and radius p. As t varies through all real values z describes the circle once. (iii) In general the equation z=(a +bt)/(c+dt) represents a circle. This can be proved by calculating x and y and eliminating: but this process is rather cumbrous. A simpler method is obtained by using the result of Ex. 22. Let z=(a+bZ)/(c+dZ), Z=t. As t varies Z describes a straight line, viz. the axis of X. Hence z describes a circle. (iv) The equation z= a + 2bt + ct2 represents a parabola generally, a straight line if b/c is real.

Page 95

COMPLEX NUMBERS 95 (v) The equation z= (a + 2bt+ ct2)/(a + 2t+ tyt2), where a, /3, y are real, represents a conic section. [Eliminate t from x = (A + 2Bt+ Ct2)/(a + 2St+ t2), z=(A' + 2B't+ C' t)/(a+2t+yt2) (where a=A + iA', b = B iB', c=C+iC').] 38. Formulae for sin nO and cos no. De Moivre's Theorem enables us to express sin n and cos n0, where n is a positive integer, in terms of sin 0 and cos 0. For from the formula cos nO + i sin nO = (cos 0 + i sin 0)n (where n is a positive integer) we deduce cos n0 = R [(cos 0 + i sin 9O)I] =(cos 0) {1- (2 tan20 + () tan4 0-...}, sin n0 = I [(cos 0 + i sin 0)2*] = (cos 0) {() tn -( t ann 0 + t.. n, where () is the general binomial coefficient n (n- 1) ( n-2)...(n -r+ 1) 1.2.3...r (sometimes written nC.). Then cos nO n t2n\ )4A (Cos S~b - (2 t+ 0) 24-................(1 (c s i - 3J................(2), (Cos o) = () t-( )t3+......... where t = tan 0. By division From (1) and (2) we can deduce further formulae expressing From (1) and (2) we can deduce further formulae expressing sin nO cos nO, sin 0 in terms of cos 0 only. For cos'2 0 tan2' 0 = sin2r 0 cos-2r' 0 = cos-2r 0 (1 - cos2 )r;

Page 96

96 COMPLEX NUMBERS [III and on substituting in (1), after multiplying up by cos" 0, we see that cos nO = an cosn 0 + a,_2 COSn-2 0... where an, an_2,... are constants, and the last term is a constant or a multiple of cos 0, according as n is even or odd. Similarly, from (2), we deduce sin 6 = bn_ cos-l 0 + bn3 cos-3S 0 +.... sin 7 To determine the actual values of the coefficients generally and directly, and by means of really elementary methods, is a matter of some little difficulty. The formulae are 2 cos nO = (2 cos )n i- (2 cos 0)'-2 + n -) (2 cos O)n2!.n (-r- ).. ( 2r + ) (2 os 0) + (), - + Hr!...+(2cos)11-4... (4), sin 6n = (cos) ~ (2 os )0)1-5 ssin ( 1) 2! -.. + ( (-2 co - i...) (......(c. That these formulae are correct is easily verified by induction. For sin nO sin (n + 1) 0 s /in n0\. sin 0verd o sin 0at e show th and if we assume that the formulae hold for n = 1, 2,... k (and they are easily verified for n = 1, 2, 3), we can at once show that they hold for n = k+1. We leave this as an exercise for the reader. 39. When tan nO is given we can regard the equation (3), which we may write for brevity in the form tan no = f(t), as an equation of the nth degree in t, one of whose roots is t = tan 6. Similarly, one of the roots of tan ( + )} =f (t) is t = tan {0 + (k7r/n)} (kl being any integer). But since tan (nO + kIr) = tan nO,

Page 97

38, 39] COMPLEX NUMBERS 97 the two equations are the same. Hence tan {O + (ckr/n)} is a root of (3) for all values of k. As we give k all integral values, this expression assumes n and only n distinct values, viz. tan0, tan(0 + ), tan ( +.... tan + (0 \ n/ \ n n These are therefore the roots of (3), considered as an equation in t. It follows that any symmetric function of these quantities can be expressed in terms of the coefficients of (3), i.e. in terms of tan nO. The equations (1), (2), (4), (5) can of course be considered from the same point of view. Some illustrations will be found in the examples which follow. Examples XXIV. 1. The equation (3) may be written in the form ( () t2-2 +() t- -tan) t-... -) tn-3... =0 cot ' 3" where tan nO or cot nO is to be chosen according as n is odd or even. 2. Show that sec2 + sec( + r)+...+sec2 +(- 1)l is equal to n2 sec2 nO or n2 cosec2 nO according as n is odd or even. (Math. Trip. 1900.) [The expression given is n + t,.2, where t, is a root of the equation in Ex. 1.] 3. Prove that sec2 r + sec4 + sec4 3 + sec4 4=1120. 9 9 9 9 4. If n is odd tn- () t-2+ () t4 -... t (t2- tan2 ) (t -tan2 2)...(ttan2 ) where r= (n - ). State and prove the corresponding formula when n is even. 5. The roots of the equation 2 cos nO =xn-.n 2+ 3)n-4 - are 1! 2! 2cosO, cos(0+),.. 22cos{2 2 +2 (- 1) 7}. 6. The roots of 64x3- 112x2+56x-7=0 are sin2, sin22, sin24. 27r 4rr rr Deduce that sin 2r + sin -7 - sin T = ~ ^/7 7. Show that 4 cos2 (r/7) is a root of x3 - 52+6x- 1=0, and find the other roots. (Math. Trip. 1898.) H. A. 7

Page 98

98 COMPLEX NUMBERS 8. Factor-formulae for cos nO and (sin nO)/(sin 0)*. Show that 2 cos n0 is equal to 2 (cos2 - cos2 2 ) (cos2 - cos2...... {cos2 - cos2 ( 2)} COS or to 211 (cos2 0- cos2 ) (cos2 - cos22...... { - cos2 2n ' according as n is odd or even. 9. Show that sin sill..sin -=2 2 2n 2n 2n where r is equal to n - 2 or to n- 1 according as n is odd or even. [Put = 0 in Ex. 8: consider the sign carefully, when taking the square root of each side.] 10. Show that (sin nO)/(sin 0) is equal to \ /) 2w)\ { (n-2)irr 2 -1 cos2 - cos2 7-) (cos2 -cos ).... cos2 0-cos2 2) —} cos 0, \-n nj\ Co \ 2n or to 2'-1 (cos2 - cos2 ) (cos2 - cos2 c..... cos2 0 - cos 2 2 according as n is even or odd. 11. Show from Ex. 10, and equation (2) of ~ 38, that if n is odd 2'-1 (cos2 0-COS (COS2 -osS2... {c2 - COS ( - -1) rr 3! and deduce, by putting 0 = 0, that 7r. 27r (_ _ _ sin sin... sin (-1) = 2 (n ) n. n n" 2n Obtain the corresponding formula for the case in which n is even. 40. Roots of complex numbers. We have not, up to the present, attributed any meaning to symbols such as c/a, am/n, when a is a complex number, and m and n integers. It is, however, natural to adopt the definitions which are given in elementary algebra for real values of a. Thus we define /a or al/n, where n is a positive integer, as any number a which satisfies the equation zn= a; and a/n, where m is an integer, as (a/ln)m. These definitions do not prejudge the question as to whether there are or are not more than one (or any) roots of the equation. 41. Solution of the equation z2 = a. Let a= p (cos 4 + i sin,), where p is positive and b is an angle such that - r < 7r. * The results of Exs. 8-11 are of considerable importance in Higher Trigonometry.

Page 99

40, 41] COMPLEX NUMBERS 99 Then if z = r (cos 0 + i sin 0), the equation takes the form rn (cos nO + i sin nO) = p (cos + + i sin ), so that r = p, cos nO = cos 4, sin n = sin.........(1). The only possible value of r is /p, the ordinary arithmetical nth-root of p; and in order that the last two equations should be satisfied it is necessary and sufficient that n0 = ) + 2k7r, where k is an integer, or o = ( + 2k7r)/n. If k = pn + q, where p and q are integers, and 0 - q < n, the value of 0 is 2pr + (4 + 2q7r)/n, and in this the value of p is a matter of indifference. Hence the equation n= a = p (cos 4 + i sin 4) has n roots and n only, given by z = r (cos 0 + i sin 0), where r=y/p, 0=(+2q7r)/n (q=0, 1, 2,... n - 1). That these n roots are in reality all distinct is easily seen by plotting them on the Argand Diagram. The / /P figure (Fig. 34) shows the four / fourth roots of (1 6) (cos 55~ + i sin 55~). The particular root 4p {cos (0/n) + sin (4/n)} is called the principal value of ya. FIG. 34. The case in which a = 1, p = 1, 4 = 0 is of particular interest. The n roots of the equation z = 1 are cos (2gr/n) + i sin (2q7r/n), (q = 0, 1,... n - 1). These quantities are called the nth roots of unity; the principal value is unity itself. If we write con for cos (27r/n) + i sin (27r/n) we see that the n roots of unity are 1, Wn, On2,... c9 -- 1. Examples XXV. 1. The two square roots of 1 are 1, -1; the three cube roots are 1, ( 3), (-^3), (-l- i3); the four fr ourth roots are 1, i, -1, - i; and the five fifth roots are 1, {^I5-1+i\1104+2^5}, {{-15-1+zi/10-2^5}, ~ { - 5 - i -z/10 - 2 /J5}, {J5 - 1 - i /10 +2^s5}.

Page 100

100 COMPLEX NUMBERS [III 2. Prove that 1 ++, Oi,2 -... + o)nn-1 = ~ 3. Prove that (x+yo+3+zc032) (X+yo32 +zco3) =2+y2+z -yz - zx - xy. 4. The nth roots of a are 4/a, o,ta, on, 2 g/a,...o ~o-1 /a, where /a denotes the principal nth root. 5. It follows from Exs. XXIII. 14 that the roots of z2=a+ i3 are + 4 {,/(a2 + 32)+ a}] +~ i [ {,/(a2+1)- }], like or unlike signs being chosen according as i is positive or negative. Show that this result agrees with the result of ~ 41. 6. Show that (x2m" - a2")/(x2 - a2) is equal to (x2 - ax cos + a2 (x2 - 2ax cos -+a2)... (x- -ax cos ( - +a2 [The factors of x2i - a2m are (x-a), (x - a2), (x-aC22m),... (x - aco-). The factor x-ao2fm is x+a. The factors (x - awms), (x -aan-s) taken together give a factor x2 - 2ax cos - a2.] m 7. Resolve x2m + 1 - a2 + 1, x2 + a2', and 2m + 1+ a2m + l into factors in a similar way. 8. Show that X2 - 2xnan cos 0 + a2n is equal to (x- 2xa cos- + a2) ( - 2xa cos 6+ 2 a+... n n... (-2xacos 2 (- 1) r [Use the formula x2n 2x an cos 0 + a2n= {n - an (cos 0 - i sin 0)} {x - an (cos 0 - isin 0)}, and split up each of the last two expressions into n factors.] 9. The problem of finding the accurate value of co, in a numerical form involving only square roots, as in the formula 0)3 = (-1+ i/3), is the algebraical equivalent of the geometrical problem of inscribing in a circle a regular polygon of n sides by Euclidean methods, i.e. by ruler and compasses. We saw in fact in Chapters I. and II. that irrationals involving square roots could always be so constructed, and are the only irrationals which can be so constructed. Euclid gives constructions for n=3, 4, 5, 6, 8, 10, 12, and 15. It is evident that the construction is possible for any value of n which can be found from these by multiplication by any power of 2. There are other special values of n for which such constructions are possible, the most interesting being n= 17. Approximate constructions for regular polygons of any number of sides will be found in books of practical geometry.

Page 101

42] COMPLEX NUMBERS 101 42. The general form of De Moivre's Theorem. It follows from the results of the last section that, if q is a positive integer, one of the values of (cos 0 + i sin 0)l1q is cos (0/q) + i sin (/q). Raising each of these expressions to the power p (where p is any integer positive or negative), we obtain the theorem that one of the values of (cos 0 + i sin O)Plq is cos (p0/q) + i sin (pO/q), or if a is any rational quantity, one of the values of (cos 0 + i sin 0)a is cos a0 + i sin a0. This is a generalised form of De Moivre's theorem stated in ~ 36. MISCELLANEOUS EXAMPLES ON CHAPTER III. 1. The condition that a triangle (xyz) should be equilateral is that x2 +y2 + z2 _ yz - zx - xy = 0. [Let XYZ be the triangle. The displacement ZX is YZ turned through an angle T7r in the positive or negative direction: or, as Cis = 03, Cis (- 7r)= 1/C3=o32, we have x-z=(z-y) w3 or x-z=(z-y) W32. Hence X +yCo3 + C32=0 or x+yC32+zc)3=0. The result follows from Ex. XXV. 3.] 2. If XYZ, X'Y'Z' are two triangles, and YZ. ''ZX'. 2 X' = X. ' Y', then both triangles are equilateral. [From the equations (Y - ) (y -Z')=(z-x) (' -')=(-y) (' -y')= K2 say, we deduce 1/(y'-z')=O, or:x'2-y'z'=O. Now apply the last example.] 3. On the sides of a triangle ABC similar triangles BCX, CA Y, ABZ are described. Show that the centres of gravity of ABC, XYZ are coincident. [We have (x-c)/(b-c)=(y-a)/(ca))=(z- b)/(a-b)=X, say. Express (x+ y+z) in terms of a, b, c.] 4. If X, Y, Z are points on the sides of the triangle ABC, such that BX/XC= CY/ YA = AZZB = r, and if ABC, XYZ are similar, then either r= 1 or both triangles are equilateral. 5. Deduce Ptolemy's Theorem concerning cyclic quadrilaterals from the fact that the cross ratios of four concyclic points are real. [Start from the identity (2 - x3) (X1 - x4) + (X3- 1) (X2 - V4) + (1 - X2) (X3 - X) = 0.]

Page 102

102 MISCELLANEOUS EXAMPLES ON CHAPTER III 6. If z2+z'2=1, the points z, z' are ends of conjugate diameters of an ellipse whose foci are the points 1, -1. [If CP, CD are conjugate semidiameters of an ellipse and S, H its foci, then CD is parallel to the exterior bisector of the angle SPH, and SP. HP= CD2.] 7. Prove that I a+b 12+ a}-b 2=2{l a 2+l b 12}. [This is the analytical equivalent of the geometrical theorem that, if M is the middle point of PQ, OP2 + Q2=20M2+2MP2.] 8. Deduce from Ex. 7 that Ia+,/(a2-b2)+ I1 a-^/(a2-b2) = I a+b + I a-b. [If a + /(a2-b2)= zl, a -(a2-b2) =z2, we have ZI112+1212=1 Jziz212+llz,-z212=2 la2+2 a2-b21, andso (Izxl+lz2l)2=2 {lal2+la2-b2-l+lbl2}=ia+bl2+la-bl2+2la2-b21. Another way of stating the result is: if z1 and Z2 are the roots of az2+ '2z +y=0, then l z1 + l Z1 = (l/ ) {(I- 3+ lay I)+ (I- | - /-ay )}.] 9. If x4+4a1x3+6a2x2+4a3x+a4=0 is an equation with real coefficients and has two real and two complex roots, concyclic in the Argand diagram, then a32 + a2 a4+ a23 - a2a4 - 2al a2a3=0. 10. The four roots of ax4+ 4a1x3+ 6a2x2+ 4a3x+ a4 = Owill be harmonically related if aoa32+ a2 a4+a23 - a a2a4- 2ala2a3= O. [Express Z23,14 Z31,24 Z12,34, where Z23,14= (z1 - z2) (Z3 - Z4) + (Z1 - Z3) (Z2 - 4) and zl, Z2, Z3, Z4 are the roots of the equation, in terms of the coefficients.] 11. Imaginary points and straight lines. Let ax+by+ c=O be an equation with complex coefficients (which of course may be real in special cases). If we give x any particular real or complex value, we can find the corresponding value of y. The aggregate of pairs of real or complex values of x and y which satisfy the equation is called an imaginary straight line; the pairs of values are called imaginary points, and are said to lie on the line. The values of x and y are called the coordinates of the point (x, y). When x and y are real, the point is called a real point: when a, b, c are all real (or can be made all real by division by a factor), the line is called a real line. The points x=a+i3, y=-y+i8 and x=a-i/3, y=y-i3 are said to be conjugate; and so are the lines (A +iA')x+(B+iB')y+C+iC'=0, (A-iA')x+(B-iB')y+C-iC'=O. Verify the following assertions:-every real line contains infinitely many pairs of conjugate imaginary points; every imaginary line contains one and only one real point; an imaginary line cannot contain a pair of conjugate imaginary points:-and find the conditions (a) that the line joining two given imaginary points should be real, and (b) that the point of intersection of two imaginary lines should be real.

Page 103

MISCELLANEOUS EXAMPLES ON CHAPTER III 103 12. Sum the series cosa+cos(a+b)+cos(a+2b)+..., sin a+sin (a+b)+sin (a+2b)+... to n terms. [Sum the geometrical series (cos a + i sin a) {1 + (cos b + i sin b) + (cos b + i sin b)2+...} to n terms, and equate real and imaginary parts.] 13. Also sum 1 + cosa+( )cos 2a+..., +() sina+(2 sin2a+... to n + 1 terms. 14. Sum the series cosa+ cos(a +) +...+xn- cos {a+ (n- 1) 3}. (Math. Trip. 1905.) 15. Find the modulus and the amplitude of 1+cos +isin0, 1+cos0-isinO, 1-cos-+isinO, 1-cos - i sin,. 16. Find the square roots of the numbers in the preceding example. 17. Prove that /1 +sinO+icosO0 _ n nii (]+ sin 0-iecos = cos (~n7r - n0)-+i sin (nrr-nO). \1. + sin - i cos - 0 - 18. Prove the'identities (x+-y +z) (x +o03y+ 32) (x+c 32y+3) + =3O) = y3+ X3+z 3xyz, (X + / + 2) (X + 05 + C054Z) (X +q 52-y+ C53) (X+ os3y + co522) (X + q)54 + o sZ) = X5 + y + Z5 - 5X3yz +- 5xy2z2. 19. Solve the equations x3-3ax+(a3+ 1) = and X5- 5ax3+5a2 +(a5+ 1) =0. 20. If f ()=ao+alx+...+akxk, then {f (X) + f (Cox) +... + f (con)- Ix)}/n= ao + anX n + a2,X2n +... + axn XA, o being any root of xn=1 (except x= 1), and Xn the greatest multiple of n contained in X. Find a similar formula for a,+ az+,x +n a x ~,+2 2n +.. 21. If (1 +x)n=po+plx+p2 2+... (n being a positive integer), prove that pO-P2+.p4-... =2 cos Tn4r, 1-p3 +P-... -= 2n sin n7r. 22. Sum the series X X2 X3 X1/3 2!n-2! 5!n-5! 8!w-8!- "n-!' n being a multiple of 3. (Math. Trip. 1899.) 23. Show how to deduce the formulae given in ~ 38 from the addition theorems for cos x and sin x, using no complex quantities. [See Hobson's Trigonometry, Ch. VII.]

Page 104

104 MISCELLANEOUS EXAMPLES ON CHAPTER III 24. Prove that, when m is odd, 32 cosec2 (rr/mr)=rn2 -1, the summation extending to r=1, 2,..., - 1. The corresponding sum, extended only to values of r less than m and prime to it, is denoted by S. Show that, if m is the product of unequal odd primes a, b, c,... k, then 3S=(a2 - 1) (b2- 1) (C2- 1)... (k2 - 1). (Math. Trip. 1902.) 25. Prove that, if rn is odd, M —l tan4 {(2r+ 1) r/m} =m (m- 1) (22+M- 3). r=O (Math. Trip. 1903.) 26. If a,,= a + b cos{O+(2rrr/n)}, for r=l, 2,..., n, show that 3n (n- 2) Mal ala2= 3n2 Eala2a3+(n- 1) (n- 2) (2al)3, if n > 3, and find the corresponding equation when n=3. (Math. Trip. 1906.) 27. The roots of 1-n- n(2-1) S2+(n-1)(n -2)s3+ +(l)in( + 1) X = 2! 3! rr 5rr (4n- 3) 7r are tan -, tan, tan 4 4n 4n 4n [The equation is ( - xi) +i (1 +i)n= 0.] 28. If a= 7r/4n, show (cf. Ex. 27) that cot a, -cot 3a, cot 5a,..., (-)n- cot (2n- 1) a are the roots of t -(n - 1) n-+ (n- (- 2) n-3-. xn-nxnZl Xn-1x_2+n —3_0. 2! 3! Deduce that cot a cosec2 a - cot 3a cosec2 3a +... to n terms is equal to 2n3. (Math. Trip. 1901.) 29. There are in general two points unaltered by the transformation z=(aZ+ b)/(cZ+d). If these points are a, 3, the transformation can be put in the form (z - a)/(z -3) =k (Z - a)/(Z - 3). In particular, reduce the transformation z=(1+Z)/(1 -Z) to this form. Divide the Z-plane into 8 regions by means of the axes and the unit circle. Find the region in the z-plane which corresponds to each of these regions. 30. If z=Z2-1, then as z describes the circle I Z=K, the two corresponding positions of Z each describe the Cassinian oval pip2=K, where P1, P2 are the distances from the points +1. Trace the ovals for different values of K. 31. If t is a complex number such that It = 1, then as t varies, the point x=(at + b)/(t - c) describes a circle, unless Icl =1, when it describes a straight line.

Page 105

MISCELLANEOUS EXAMPLES ON CHAPTER III 105 32. If t varies as in the last example, the point x=- {at+ (b/t)) in general describes an ellipse whose foci are given by x2=ab, and whose axes are I[a + bI and Ia -Ib1. But if Ia]- b, x describes the finite straight line joining the points +._,(ab). 33. If z = 2Z+ Z2, the circle I Z =1 corresponds to a cardioid in the plane of z. 34. Discuss the transformation z ={Z+(1/Z)), showing in particular that to the circles X2 + Y2 = a2 correspond the confocal ellipses x2 2 + = 1 a+4 a — 35. If (z+1)2=4/Z, the unit circle in the z-plane corresponds to the parabola R cos2 =l in the Z-plane, and the inside of the circle to the outside of the parabola. 36. Show that, by means of the transformation z={(Z-ic)/(Z+ic)}2, the upper half of the z-plane may be made to correspond to the interior of a certain semicircle in the Z-plane. 37. Consider the relation az2 + 2hzZ + bZ2 + 2gz + 2fZ + c = 0. Show that there are two values of Z for which the corresponding values of z are equal, and vice versa. We call these the branch points in the Z and z-planes respectively. Show that, if z describes an ellipse whose foci are the branch points, so does Z. [We can, without loss of generality, take the given relation in the form z2 + 2zZ cos o + Z2 = 1 -the reader should satisfy himself that this is the case. The branch points in either plane are + cosec o. An ellipse of the form specified is given by \ z + cosec co + z - cosec co [= const. This is equivalent (Ex. 8) to I +,/(Z2 - cosec2 o) +- /J(2 - cosec2 w) I = const. Express this in terms of Z.] 38. If z= aZ"+ bZn, where mn, n are positive integers and a, b real, show that as Z describes the unit circle, z describes a hypo- or epi-cycloid. 39. Prove that sin (2n+ 1i) si 1 (1- sin2 {r7 (2n+ 1)} (Math. Trip. 1907.) (2n + 1) sin - 1- sinS {r-r/(?.n4-1) 40. By putting 0 = 7r in the last example, prove that 7r 27r o2r 1 cot 2n1 cot 2n +... cot 2n f 1 2n, -t 1 2n +1 /(2n + 1)

Page 106

106 MISCELLANEOUS EXAMPLES ON CHAPTER III 41. Prove that sin n=2n-1sin o sin (+ -)... sin {0+(n-) ). [Put x=a= 1 in Ex. XXV. 8, and change 0 into 20.] 7r 27r 72r 1 42. Prove that cos - cos -.. cos5 = 18 15 15 15 128 7r 2rr (n - 1) rr 43. Prove that tan - tan -.. tann = 1 2n 2n 2n 44. Prove that (1+x) -(1- x)n A ( 2+ tan2 )(x2+tan2... (x2 +tan2 _), 2x \ 9/ n" where A=1, r= (n-l) if n is odd, and A=n, r=~n-1 if n is even. 45. If/ (x + tan2 2 — +) is expressed in the form /r=l\ 2n+1 Ar2 ( + tan2 2n+ )' n being a positive integer, show that (- 7l1-1 2 r7rr 1/ 7 A (1)- 2 sin2 r cos23 r- 3 r 2n +1 2n + 1 2n +-' (Mfath. Trip. 1905.) [Apply the ordinary rule for partial fractions: it will be found that 1r7r rrr c kTr A, =(- )r- 1 2 sin2 2n cos2-3 1 I cot2 2n+1 ~ 2n+ k=1 2n+1 ' and Ex. 40 can be used to obtain the given result.] 46. Show that n-1 (2r- 1) 7r ((2r+l)Tr } 2 sin (2+ cosec )- a = =ncos(n- 1) a sena. r=o 2n 2n (Math. Trip. 1907.) [The right-hand side is Xn-l 1 -(n-1) 2n - 1 + xn 2 + x - Xn X2n +-1 where x= cos a + i sin a =Cis a. The roots of x2+ 1 =0 are Cis (2r + ) (=, 1,..., 2n-1). 2n Split up the right-hand side into partial fractions of the form A///{ -Cis (21+l ) rr} It will be found that A,.= -i sin (2r+1) Cis ( 1) To get the result 2n 2n in the form given we must associate the terms in pairs (r, n+r) where r=O, 1,..., n-1.]

Page 107

MISCELLANEOUS EXAMPLES ON CHAPTER III 107 47. Show that, if m and n are positive integers, and m _ t, then xim-1/(l +xn)=(1/n)2O?,n —nl(X - ), where o is a root of x+l=0: and hence show that, if n is even, 1 2 4 — 1 (2r +1) }/{(221+ 1) r - - x cos 1 - 2x cos +x 1 + xn o ' J ( n and find the corresponding formula when a is odd. 48. Express xm- 1/(1 - x"), where m and a are positive integers, and an _ a, in partial fractions, and obtain the formulae for 1/(1-xn) corresponding to those of Ex. 47. 49. Show that Xn - a"; COS 2%0 -1 xl -acos (d + 2rr) 2a - 2x" aran Cos nO + a2n Inxll-1 - 2r'r -2x, w ta Tco v.c=s2 2xa cos0 + +a2 50. If Pi, p2,... p are the distances of a point P, in the plane of a regular polygon, from the vertices, prove that ~ 1 n r2n - a2n i pV2 C-a2 _.21 - 2r.an COS nO + a2n ' where 0 is the centre and a the radius of the circumcircle of the polygon, r the length OP, and 0 the angle between OP and the radius from 0 to any vertex of the polygon. 51. If AA2... An), B1B2... Bin are concentric regular polygons, mz and a being prime to one another, prove that r" 'n 1 inn b2mn - a2inn r=1 s=1 (A,.B,)S b2 - a2 b2inn - 2btbnn am n cosm 0 + a2mn' where a and b are the radii of the circumcircles of the polygons, and 0 the angle between any two radii drawn one to a vertex of each polygon. (iath. T-rip. 1903.) 52. If p and q are integers, and q prime to p, and k is an odd positive integer less than 2p, and O=q r/p, show that p-1 cos k (a+nO) P-1 sin k (a + nO) 2 =p cotpa, 2 n=o sin (a+nO) n=O sin (a + nO) PX A - 1 1 P —1 tnx [We have p-i t= X x-1 X —1 1X — where t=cos28+isin 20, 1X:5p. In this equation write ~ (k+ 1) for X and cos 2a - i sin 2a for x.] 2 I/TIj1II llU~1 U-( 31 UILI~

Page 108

CHAPTER IV. LIMITS OF FUNCTIONS OF A POSITIVE INTEGRAL VARIABLE. 43. Functions of a positive integral variable. In Chapter II. we discussed the notion of a function of a real variable x, and illustrated the discussion by a large number of examples of such functions. And the reader will remember that there was one important particular with regard to which the functions which we took as illustrations differed very widely. Some were defined for all values of x, some for rational values only, some for integral values only, and so on. Consider, for example, the following functions: (i) y=x, (ii) y=_/x, (iii) y=the denominator of x, (iv) y=the square root of the product of the numerator and the denominator of x, (v) y=the largest prime factor of x, (vi) y=the product of /x and the largest prime factor of x, (vii) y=the xth prime number, (viii) y=the height measured in inches of convict x in Dartmoor prison. Then the aggregates of values of x for which these functions are defined or, as we may say, the fields of definition of the functions, consist of (i) all values of x, (ii) all positive values of x, (iii) all rational values of x, (iv) all positive rational values of x, (v) all integral values of x, (vi), (vii) all positive integral values of x, (viii) a certain number of positive integral values of x, viz., 1, 2,..., V, where V is the total number of convicts at Dartmoor at the present moment of time*. Now let us consider a function, such as (vii) above, which is defined for all positive integral values of x and no others. This function may be regarded from two slightly different points of * In the last case N depends on the time, and convict x, where x has a definite value, is a different individual at different moments of time. Thus if we take different moments of time into consideration we have a simple example of a function y=F(x, t) of two variables, defined for a certain range of values of t, viz. from the time of the establishment of Dartmoor prison to the time of its abandonment, and for a certain number of positive integral values of x, this number varying with t.

Page 109

43, 44] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 109 view. We may consider it, as has so far been our custom, as a function of the real variable x defined for some only of the values of x, viz. positive integral values, and say that for all other values of x the definition fails. Or we may, as in Chap. I. ~ 8, leave values of x other than positive integral values entirely out of account, and regard our function as a function of the positive integral variable n, whose values are the positive integers 1, 2, 3, 4,.... In this case we may write y=b (n) and regard y now as a function of n defined for all values of n. It is obvious that any function of x defined for all values of x gives rise to a function of n defined for all values of n. Thus from the function y = x2 we deduce the function y = n2 by merely omitting from consideration all values of x other than positive integers, and the corresponding values of y. On the other hand from any function of n we can deduce any number of functions of x by merely assigning values to y, corresponding to values of x other than positive integral values, in any way we please. 44. Interpolation. The problem of determining a function of x which shall assume, for all positive integral values of x, values agreeing with those of a given function of n, is of extreme importance in higher mathematics. It is called the problem of functional interpolation. Were the problem however merely that of finding some function of x to fulfil the condition stated it would of course present no difficulty whatever. We could, as explained above, simply fill in the missing values as we pleased: we might indeed simply regard the given values of the function of n as all the values of the function of x and say that the definition of the latter function failed for all other values of x. But such purely theoretical solutions are obviously not what is usually wanted. What is usually wanted is some formula involving x (of as simple a kind as possible) which assumes the given values for x= 1, 2,.... In some cases, especially when the function of n is itself defined by a formula, there is an obvious solution. If for example y= ((n), where c (n) is a function of n which would have a meaning even were n not a positive integer (e.g. n, n2, (n- l)/(n+l)), we naturally take our function of x to be y= q (x). But even in this very simple case it is easy to write down other almost equally obvious solutions of the problem. For example y (x) )+ sin xir, assumes the value q (n) for x=n, since sin nrr=O.

Page 110

110 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV In other cases 5 (n) may be defined by a formula, such as (-1)n, which ceases to define for some values of x at any rate (as here in the case of fractional values of x with even denominators, or irrational values). But it may be possible to transform the formula in such a way that it does define for all values of x. In this case, for example, (- i)"=cos nr, if n is an integer, and the problem of interpolation is solved by the function -COS XrT. In other cases + (x) may be defined for some values of x other than positive integers, but not for all. Thus from y=nw" we are led to y=xx. 'This expression has a meaning for some only of the remaining values of x. If for simplicity we confine ourselves to positive values of x, xx has a meaning for all rational values of x, since (p/g)/lq-) =/(p/q)P, according to the definition of fractional indices adopted in elementary algebra. But when x is irrational xx has (so far as we are in a position to say at the present moment) no meaning at all. Thus in this case the problem of interpolation at once leads us to consider the question of extending our definitions in such a way that xx shall have a meaning even when x is irrational. We shall see later on how the desired extension may be effected. Again consider the case in which y=1.2... n =n! In this case there is no obvious formula in x which reduces to n! for x=n,.as x! means nothing for values of x other than the positive integers. This is a case in which attempts to solve the problem of interpolation have led to important advances in mathematics. For mathematicians have succeeded in discovering a function (the Gamma-function) which possesses the desired property and many other interesting and important properties besides. 45. Finite and infinite classes. Before we proceed further it is necessary to make a few remarks about certain ideas of an abstract and logical nature which are of constant occurrence in Pure Mathematics. In the first place, the reader is probably familiar with the -notion of a class. It is unnecessary to discuss here any logical difficulties which may be involved in the notion of a 'class': roughly speaking we may say that a class is the aggregate or collection of all the entities or objects which possess a certain property, simple or complex. Thus we have the class of British subjects, or red-headed Germans, or. positive integers, or real numbers.

Page 111

44-46] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 111 Moreover, the reader has probably an idea of what is meant by a finite or infinite class. Thus the class of British subjects is a finite class: the aggregate of all British subjects, past, present, ~and future, has a certain definite number n, though of course we cannot tell at present the actual value of n. The class of present British subjects, on the other hand, has a number n which could be ascertained by counting, were the methods of the census effective enough. On the other hand the class of positive integers is not finite but infinite. This may be expressed more precisely as follows. If n is any positive integer (e.g. 1000, 1,000,000 or any number we like to think of), there are more than n positive integers. 'Thus if the number we think of is 1,000,000, there are obviously at least 1,000,001 positive integers. Similarly the class of real numbers, or of points on a line, is infinite. It is convenient to,express this by saying that there are an infinite number of positive integers, or real numbers, or points on a line. But the reader must be careful always to remember that by saying this we mean simply that the class in question is not a class with a,definite number of members, such as 1000 or 1,000,000. 46. Properties possessed by a function of n for large values of n. We may now return to the 'functions of n' which we were discussing in ~~ 43, 44. They have many points of difference from the functions of x which we discussed in Chap. II. But there is one fundamental characteristic which the two classes of functions have in common-the values of the variable for which they are defined form an infinite class. It is this fact which forms the basis of all the considerations which follow and which, as we shall see in the next chapter, apply, mutatis mutandis, to functions of x as well. Suppose that + (n) is any function of n, and that P is any property which c (n) may or may not have, such as that of being a positive integer or of being greater than 1. Consider, for each *of the values n = 1, 2, 3,..., whether p (n) has the property P or.not. Then there are three possibilities(a) b (n) may have the property P for all values of n, or for.all values of n except a definite number N of such values:

Page 112

112 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV (b) cb (n) may have the property for no values of n, or only for a definite number N of such values: (c) neither (a) nor (b) may be true. If (b) is true the values of n for which (b (n) has the property form a finite class. If (a) is true the values of n for which b (n) has not the property form a finite class. In the third case neither class is finite. Let us consider some particular cases. (1) Let q( (n)=n, and let P be the property of being a positive integer. Then q5 (n) has the property P for all values of n. If on the other hand P denotes the property of being a positive integer greater than or equal to 1000, ) (n) has the property for all values of n except a definite number of values of n, viz. 1, 2, 3,..., 999. In either of these cases (a) is true. (2) If 4 (n)= n, and P is the property of being less than 1000, (b) is true. (3) If 4 (n)=n, and P is the property of being odd, (c) is true. For 4 (n) is odd if n is odd and even if n is even, and either the odd or the even values of n form an infinite class. Example. Consider, in each of the following cases, whether (a), (b), or (c) is true: (i) ) (n))=n, P being the property of being a perfect square, (ii) 4 (n)=the nth prime number, P being the property of being odd, (iii) ) (n)=the nth prime number, P being the property of being even, (iv) ( (n)=the nth prime number, P being the property q (n)>n, (v) 4 (n)== - (- l) (l/n), P being the property 4 (n)<l, (vi) ) (n) 1-(- 1) (1/n), P being the property () ()<2, (vii) (n)= 1000 {1 + (- l)_}/n, P being the property q) (n)< 1, (viii) q5 (n)= l/n, P being the property 4 (n)< 001, (ix) q) (n) =(- 1)"/n, P being the property ) (n) I < 001, (x) (n) = 10000/n, or (- 1)n 10000/n, P being either of the properties ) (n)< 001 or (n) I< -001, (xi) ) (n)== ( - 1)/(n + 1), P being the property 1 - (n)< '0001. 47. Let us now suppose that 4 (n) and P are such that the assertion (a) is true, i.e. that b (n) has the property P, if not for all values of n, at any rate for all values of n except a definite number N of such values. We may denote these exceptional values by There is of course no reason why these N values should be the

Page 113

46,47] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 113 first N values 1, 2,..., N, though, as the preceding examples show, this is frequently the case in practice. But whether this is so or not we know that (n) has the property P if n > nN. Thus the nth prime is odd if n > 2 (n = 2 being the only exception to the statement), and 1/n < 001 if n > 1000 (the first 1000 values of n being the exceptions), and 1000{ + (- 1)n/n<, if n > 2000, the exceptional values being 2, 4, 6,..., 2000. That is to say, in each of these cases the property is possessed for all values of n from a definite value onwards. We shall frequently express this by saying that b (n) has the property for large, or very large, or all sufficiently large values of n. Thus when we say that b (n) has the property P (which will as a rule be a property expressed by some relation of inequality) for large values of n, what we mean is that we can determine some definite number, n0 say, such that q (it) has the property for all values of n greater than or equal to no. This number no, in the examples considered above, may be taken to be any number greater than n,, the greatest of the exceptional numbers: it is most natural to take it to be nN + 1. Thus we may say that 'all large primes are odd,' or that '1/n is less than '001 for large values of n.' And the reader must make himself familiar with the use of the word large in statements of this kind. Large is in fact a word which, standing by itself; has no more absolute meaning in mathematics than in the language of common sense. It is a truism that in common life a number which is large in one connection is small in another; 6 goals is a large score in a football match, but 6 runs is not a large score in a cricket match; and 300 runs is a large score, but ~300 is not a large income-and so of course in mathematics large generally means large enough, and what is large enough for one purpose may not be large enough for another. We know now what- is meant by the assertion '~ (n) has the property P for large values of n.' It is with assertions of this kind that we shall be concerned throughout this chapter. Given a function q (n), are there any properties of which such an assertion is true? H. A. 8

Page 114

114 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV 48. The phrase 'n tends to infinity.' There is a somewhat different way of looking at the matter which it is natural to adopt. Suppose that n assumes successively the values 1, 2, 3,.... The word 'successively' naturally suggests succession in time, and we may suppose n, if we like, to assume these values at successive moments of time (e.g. at the beginnings of successive seconds). Then as the seconds pass n gets larger and larger and there is no limit to the extent of its increase. However large a number we may think of (e.g. 969372855) a time will come when n has become larger than this number. It is convenient to have a short phrase to express this unending growth of n, and we shall say that n tends to infinity, or n o- o, this last symbol being usually employed as an abbreviation for 'infinity.' The phrase 'tends to' like the word 'successively' naturally suggests the idea of change in time, and it is convenient to think of the'variation of n as accomplished in time in the manner described above. This however is a mere matter of convenience. The variable n is a purely logical entity which has in itself nothing to do with time. The reader cannot too strongly impress upon himself that when we say that n 'tends to oo' we mean simply that n is supposed to assume a series of values which increase continually and without limit. There is no number 'infinity': such an equation as n= oo is as it stands absolutely meaningless: n cannot be equal to o, because 'equal to oo' means nothing. So far in fact the symbol oo means nothing at all except in the one phrase 'tends to oo,' the meaning of which we have explained above. Later on we shall learn how to attach a meaning to other phrases involving the symbol oo, but the reader will always have to bear in mind (1) that oo by itself means nothing, although phrases containing it sometimes mean something, (2) that in every case in which a phrase containing the symbol oo means something it will do so simply because we have previously attached a meaning to it by means of a special definition.

Page 115

48, 49] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 115 Now it is clear that if 0 (n) has the property P for large values of n, and if n 'tends to oc,' in the sense which we have just explained, n will ultimately assume values large enough to ensure that 0 (n) has the property P. And so another way of putting the question 'what properties has 4 (n) for sufficiently large values of n?' is 'how does + (n) behave as n tends to o?' 49. The behaviour of a function of n as n tends to infinity. We shall now proceed, in the light of the remarks made in the preceding sections, to consider the meaning of some kinds of statements which are perpetually occurring in higher mathematics. Let us consider for example, the two following statements-(a) 1/n is small for large values of n, (b) 1 -(1/n) is nearly equal to 1 for large values of n,-neither of which, we imagine, anyone will be inclined to dispute. Yet, obvious as they may seem, there is a good deal in them which will repay the reader's attention. Let us take (a) first, as being slightly the simpler. We have already considered the statement '1/n is less than '01 for large values of n.' This, we saw, means that the inequality 1/n < '01 is true for all values of n greater than some definite value, in fact greater than 100. Similarly it is true that '1/n is less than '0001 for large values of n': in fact l/n< 0001 if n > 10000. And instead of -01 or -0001 we might take -000001 or ~00000001, or indeed any' positive number we like. It is obviously convenient to have some way of expressing the fact that any such statement as '1/n is less than '01 for large values of n' is true, when we substitute for '01 some smaller number, such as '0001 or '000001 or any other number we care to choose. And clearly we can do this by saying that 'however small 8 may be (provided of course it is positive) 1/n < 8 for sufficiently large values of n.' That this is true is obvious. For 1/n < 8 if n > 1/8; so that our 'sufficiently large' values of n need only all be greater than 1/8. The assertion is however a complex one, in that it really stands for the whole class of assertions which we obtain by giving to 8 special values such as '01. And of course the smaller is 8 and the larger 1/8 the larger must the least of the 'sufficiently large' values of n be, values which are sufficiently large when 8 has one value being inadequate when it has another. 8-2

Page 116

116 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV The last statement italicised is what is really meant by the statement (a), that 1/n is small when n is large. Similarly for (b), which really means "if (n) = - (1/n), then the statement ' 1- (n) < 3 for sufficiently large values of n' is true whatever positive value (such as 01 or '0001) we attribute to 8." That the statement (b) is true is obvious from the fact that 1- b (n) = 1/n. There is another way in which it is common to state the facts expressed by the assertions (a) and (b). This is suggested at once by ~ 48. Instead of saying 'l/n is small for large values of n' we say 'l/n tends to 0 as n tends to co.' Similarly we say that ' 1- (1/n) tends to 1 as n tends to oo': and these statements are to be regarded as precisely equivalent to (a) and (b). Thus the statements. '1/n is small when n is large,' '1/n tends to 0 as n tends to o,' are equivalent to one another and to the more formal statement 'if 8 is any positive number, however small, l/n< for sufficiently large values of n,' or to the still more formal statement 'if 8 is any positive number, however small, we can find a number n0 such that 1/n < 8 for all values of n greater than or equal to n,.' The reader should imagine himself confronted by an opponent who questions the truth of the statement. He would name a series of numbers growing smaller and smaller. He might begin with '001. The reader would reply that 1/n<'001 as soon as n>1000. The opponent would be bound to admit this, but would try again with some smaller number, such as '0000001. The reader would reply that 1/n < '0000001 as soon as n > 10000000: and so on. In this simple case it is evident that the reader would always have the better of the argument. We shall now introduce yet another way of expressing this property of the function 1/n. We shall say that 'the limit of 1/n as n tends to oo is 0,' a statement which we may express symbolically in the form lim (l/n) = 0, or simply lim (1/n) =O. We shall also sometimes use the notation (n — 0o) (n 0o )

Page 117

49-51] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 117 or simply l/n - 0, which may be read 'l/n tends to 0 as n tends to oo.' In the same way we shall write lim {1 - (1/n)} = 1, limr {1 -(1/n)} =1, 1 - (1/n)- 1, (n- ). o) (n-i oo ) or 1-(l/n) 1. 50. Now let us consider a different example: let (n) =n2. Then ' n2 is large when n is large.' This statement is equivalent to the more formal statements 'if G is any positive number, however large, n2> G for sufficiently large values of n,' 'we can find a number no such that n2 > G for all values of n greater than or equal ton?.' And it is natural in this case to say that 'n2 tends to co as n tends to oo,' or' n2 tends to oo with n' and to write n2, (n - o) or simply n2 - o. Finally consider the function ( (n)= - n. In this case (n) is large, but negative, when n is large, and we naturally say that '- n2 tends to - oo as n tends to o ' and write - n2 - - o. And the use of the symbol - o in this sense suggests that it will sometimes be convenient to write n2 -- + or for n2 - oc and generally to use + oo instead of oo, in order to secure greater uniformity of notation. But we must once more repeat that in all these statements the symbols oc, + oo, - oo mean nothing whatever in themselves, and only acquire a meaning when they occur in certain special connections in virtue of the explanations which we have just given. 51. Definition of a limit. After the discussion which precedes the reader should be in a position to appreciate the general notion of a limit. Roughly we may say that ( (n) tends to a limit I as n tends to oo if ( (n) is nearly equal to I when n is large. But although the meaning of this statement should be clear enough after the preceding explanations, it is not, as it stands, precise enough to serve as a strict mathematical definition.

Page 118

118 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV It is, in fact, equivalent to a whole class of statements of the type 'for sufficiently large values of n, 4 (n) differs from I by less than 8.' This statement has to be true for 3 = '01 or '0001 or any positive number; and for any such value of 8 it has to be true for any value of n after a certain definite value no, though, the smaller 8, the larger (as a rule) will be this value 0,. We accordingly frame the following formal definition: DEFINITION I. The function 4 (n) is said to tend to the limit I as n tends to Go, if, however small be the positive number 8, ) (n) differs from I by less than 8 for suficiently large values of n; or if, however small be the positive number 8, we can determine a value no corresponding to 8 and such that 4 (n) differs from 1 by less than 8 for all values of n greater than or equal to no. It is usual to denote the difference b (n)n) 1, taken positively, by I q (n)-11. It is equal to 4 (n)-l or to I- b (n), whichever is positive, and agrees with the definition of the modulus of ) (n) 1, as given in Ch. III, though at present we are only considering real values, positive or negative. With this notation the definition may be stated more shortly as follows: 'if, given any positive number, 8, however small, we can find 0o so that I q (n) - I < 8 for n >n o, then we say that ( (n) tends to the limit I as n tends to oo, and write lim 4 (n) = 1.' Sometimes we may omit the 'n oo'; and sometimes it is convenient, for brevity, to write ( (n) -> 1. It should be observed that no is a function of 8. Thus if k (x) =l/n, = 0, and the condition reduces to 1/n < 8 (n > no), which is satisfied if no= 1 + [1/a] (the integer larger by one than the greatest integer contained in 1/8). There is one and only one case in which the same no will do for all values of 8. If, from a certain value NV of n onwards q (n) is constant, say equal to C, it is evident that / (n) - 0=0 for n >N, so that the inequality | ) (n)- C < is satisfied for n N and all positive values of 8. And if \ (n)-1I < for n _ N and all positive values of 8 it is evident that ( (n)=l for n i N, so that q5 (n) is constant for all such values of n. 52. The definition of a limit may be illustrated geometrically as follows. The graph of 4 (n) consists of a number of points corresponding to the values n=l, 2, 3,...

Page 119

51-53] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 119 Draw the line y =, and the parallel lines y = I -, y = I + 8 at distance 8 from it. Then lim 4 (n) = 1, 'A -.oo Y —.. ---.. —..-..._..._....^....._... -...._........... —......-........- y=1-+ 0 --- 1 --- —------------------------------------------------------------------ Y= 0 1 N I N FIG. 35. if, when once these lines have been drawn, no matter how close they may be together, we can always draw a line x = N (as in the figure) in such a way that the point of the graph on this line, and all points to the right of it, lie between them. We shall find this geometrical way of looking at our definition particularly useful when we come to deal with functions defined for all values of a real variable and not merely for positive integral values. 53. So much for functions of n which tend to a limit as n tends to oo. We must now frame corresponding definitions for functions which, like the functions n2 or - n2, tend to positive or negative infinity. The reader should by now find no difficulty in appreciating the point of DEFINITION II. The function b (n) is said to tend to + o (positive infinity) with n, if, when any number G, however large, is assigned, we can determine no so that b (n) > G for n > no; or if, however large G may be, 0 (n) > G for sufficiently large values of n. Another, less precise, form of statement is 'if we can make b (n) as large as we please by sufficiently increasing n.' This is open to the objection that it obscures a fundamental point, viz. that ) (n) must be greater than G for all values of n 1 no, and not merely for some such values. But there is no harm in using this form of expression if we are clear what it means.

Page 120

120 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV When 0 (n) tends to + oc we write (n) -+ c. (n — oo) We may leave it to the reader to frame the corresponding definition for functions which tend to negative infinity. 54. Some points concerning the definitions. The reader should carefully observe the following points. (1) We may obviously alter the values of + (n) for any finite number of values of n, in any way we please, without in the least affecting the behaviour of ( (n) as n tends to oo. For example 1/n tends to 0 as n tends to oo. We may deduce any number of new functions from 1/n by altering a finite number of its values. For instance we may consider the function p (n) which is equal to 3 for n = 1, 2, 7, 9, 106, 107, 108, 237 and equal to 1/n for all other values of n. For this function, just as for the original function l/n, lim b (Z) = 0. Similarly, for the function f (n) which is equal to 3 if n= 1, 2, 7, 9, 106, 107, 108, 237, and to n2 otherwise, it is true that b (n) — + o. (2) On the other hand we cannot as a rule alter an infinite number of the values of J (n) without fundamentally affecting its behaviour as n tends to c. If for example we altered the function 1/n by changing its value to 1 whenever n is a multiple of 100 it would no longer be true that lim f (n) = 0. So long as a finite number of values only were affected we could always choose the number no of the definition so as to be greater than the greatest of the values of n for which p (n) was altered. In the examples above, for instance, we could always take no > 237, and indeed we should be compelled to do so as soon as our imaginary opponent of ~ 49 had assigned a value of 8 as small as 3 (in the first example) or a value of G as great as 3 (in the second). But now however large no may be there will be greater values of n for which f (n) has been altered. (3) In applying the test of Definition I. it is of course absolutely essential that we should have! ((n) -1 < not merely for n = no but for n no, i.e. for no and for all larger values of n. In the last example, given 8 we can obviously choose no so that k (n) < 8 for n = n0: we have only to choose a sufficiently large value of n which is not a multiple of 100. But when ino is thus

Page 121

53, 54] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 121 chosen it is not true that ( (n) < 8 for n _ n0o: all the multiples of 100 which are greater than no are exceptions to this statement. (4) If p (n) is always greater than I we can replace i (n) - 1 by ( (n) - 1. Thus the test whether 1/n tends to the limit 0 as n tends to oc is simply whether 1/n < 8 for n -no. If however (n)=(- 1)/n, I is again 0, but (in)- 1 is sometimes positive and sometimes negative. In such a case we must state the condition in the form (in) - 1 < 8, in this particular case in the form I (n) < 8. (5) The limit I mcay itself be one of the actual values of (n). Thus if (n)= 0 for all values of n it is obvious that lim b (n)= 0. Again if we had (in (2) and (3) above) altered the value of the function (when n is a multiple of 100) to 0 instead of to 1 we should have obtained a function 9 (n) = 0 (in a multiple of 100), () (n) = 1/n (otherwise). The limit of this function as n tends to oo is still obviously 0. This limit is itself the value of the function for an infinite number of values of n, viz. all multiples of 100. On the other hand the limit itself need not (and in gIeneral will not) be the value of the function for any value of n. This is sufficiently obvious in the case of b (n) = /n. The limit is 0; but the function is never zero for any value of n. The reader cannot impress these facts too strongly on his mind. A limit is not a value of the function: it is something quite distinct from these values, though defined by its relations to them. The limits may possibly be equal to some of the values of the function-whether this be so or not has absolutely nothing to do with the notion of the limit: it is, so to say, a mere accident. For the functions 4 (n)=0, 1, the limit is equal to all the values of > (n): for (n) = 1/n, (-_l)n/?, 1 + (3/n,), 1 + {(- 1)/n it is not equal to any value.of ~ (n): for (n) = (sin ln7r)/n, 1 + {(sin 1n7r)/n} (whose limits as n tends to oo are easily seen to be 0 and 1, since sin ~nwr is never numerically greater than 1) the limit is equal to the value which ( (n) assumes for all even values of n, but the

Page 122

122 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV values assumed for odd values of n are all different from the limit and from one another. (6) A function may be always numerically very large when n is very large without tending either to + oc or to - oo. A sufficient illustration of this is given by,b(n) = (- 1)ln or (- 1)n 2. A function can only tend to + -o or to - o if, after a certain value of n, it maintains a constant sign. Examples XXVI. Consider the behaviour of the following functions of x as n tends to co: 1. < (n)=nk, where k is a positive or negative integer or rational fraction. If k is positive nk tends to + o with n. If k is negative lim nk=O. If k=0, nk= 1 for all values of n (by the definition of no). Hence lim nk=1. The reader will find it instructive, even in an almost obvious case like this, to write down a formal proof that the conditions of our definitions are satisfied. Take for instance the case of k > 0. Let G be any assigned number, however large. We wish to choose ln so that nk> G for n n0o. We have in fact only to take for n0 any number greater than t/G. If e.g. k=4, n4> 10000 for n 11, nA4>100000000 for n>101, and so on. From a geometrical point of view the matter may be stated as follows. If k > 0 the graph of y; = x is of the general form of A in Fig. 36; if k< 0 of the form of B; if k=0 it is a line C parallel to the axis of x. At present we are only concerned with the series of points marked on these curves. /A C B 01 i 2 3 FIG. 36. 2. q (n) =the nth prime number. If there were only a finite number of primes p (n) would be defined only for a finite number of values of n. There are however, as was first shown by Euclid, infinitely many primes. Euclid's proof is as follows. If there are only a finite number of primes let them be 1, 2, 3, 5, 7, 11,... V. Consider the number 1+(1. 2.3. 5.7.11... N). This number is evidently not divisible by any of 2, 3, 5,.. N, since the remainder when it is divided by any of these numbers is 1. It is therefore

Page 123

54, 55] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 123 not divisible by any prime save 1, and is therefore itself prime, which is contrary to our hypothesis. Again it is obvious that for all values of n (save n=l, 2, 3) 4 (n)> n. Hence n (n)- + oo. 3. ) (n)=the number of primes less than n. Here again 4 (n) —+o. 4. q (n) =[an], where a is any positive number. Here O((n)=o (On <l/a), 5(n~)=1 (l/an <2/a), and so on; and 5 (n) — +o o. 5. If )(n)=O1000000/n, limo(n)=O: if + (n) n/1000000, + (n)-.+oo. These conclusions are in no way affected by the fact that at first 4 (n) is much larger than + (n) (being, in fact, larger until n= 1000). 6. 4(n)=l/{n-(-l)},n- (-1)", n{l -(- l)n}. The first function tends to 0, the second to + oo, the third does not tend either to a limit or to + o. 7. 4) (n) =(sin n)/n, where 0 is any real number. Since Isin n l 1, I )( (n) I< l/n, and lim q (n)=O. 8. (n) = (sin nO)//n, (sin n)/n2, (a cos2 n+bsin2 n)/n, where a and b are any real numbers. 9. 4 (n) = sin n Or. If 0 is integral 4) (n) = 0 for all values of n, and therefore lim 4 (n) = 0. Next let 0 be rational, e.g. 0 =p/q, where p and q are positive integers. Let n=aq+b where a is the quotient and b the remainder when n is divided by q. Then sin (npTr/q) = (- l)ap sin (bpir/q). Suppose, for example, p even; then as n increases from 0 to q- 1, )b (n) takes the values 0, sin (pr/q), sin (2pqr/q),... sin {(q- l)prrlq}. When n increases from q to 2q -1 these values are repeated; and so also as n goes from 2q to 3q - 1, 3q to 4q- 1, and so on. Thus the values of 4 (n) form a perpetual cyclic repetition of a finite series of different values. It is evident that when this is the case 4q (n) cannot tend to a limit or to + oo or to - oo as n tends to infinity. The case in which 0 is irrational is a little more difficult. It is discussed in the next set of examples. 55. Oscillating Functions. DEFINITION. When, ( (n) does not tend to a limit, nor to + oo, nor to - o, as n tends to oo, we say that 4 (n) oscillates as n tends to oo. A function 4 (n) certainly oscillates if its values form, as in the case considered in the last example above, a continual repetition of a cycle of values. But of course it may oscillate without possessing this peculiarity. Oscillation is, according to its definition, a purely negative quality-a function oscillates when it does not do certain other things.

Page 124

124 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV The simplest example of an oscillatory function is given by (n) = (- 1)>, which is equal to + 1 when n is even and to - 1 when n is odd. In this case the values recur cyclically. But consider ) ( = ( 1)+ (l/n), the values of which are -1 + 1, 1+(1/2), -1 + (1/), 1 + (1/4), - 1 + (1/5)... When n is large every value is nearly equal to + 1 or- 1, and obviously c) (n) does not tend to a limit or to + oo or to - oo, and therefore it oscillates: but the values do not recur. It is to be observed that in this case every value of b (n) is numerically less than or equal to 3/2. Similarly + (n) = (- 1n 100 + (1000/n) oscillates. When n is large enough every value is nearly equal to 100 or - 100. The numerically greatest value is 900 (for n = 1). But now consider b (n) = (- 1)n n, the values of which are - 1, 2, - 3, 4, - 5,.... This function oscillates, for it does not tend to a limit, nor to + oo, nor to - o. But in this case we cannot assign any limit beyond which the numerical value of the terms does not rise. The distinction between these two examples suggests a further definition. DEFINITION. If b (n) oscillates as n tends to oo it will be said to oscillate finitely or infinitely according as it is or is not possible to assign a number K such that all the values of ( (n) are numerically less than K, i.e. I b (n) I < K for all values of n. These definitions, as well as those of ~ 54, are further illustrated in the following examples. Examples XXVII. Consider the behaviour as n tends to oo of the following functions: 1. (-1)n, 5+3(- )n, (1000000/n) +(- 1)", 1000000 (-1)+(i/n). 2. (-1) n, 1000000+(-1)nn. 3. 100000 —n, (-l1)(1000000 -n). 4. n {1 + (- 1)'}. In this case the values of q (in) are 0, 4, 0, 8, 0, 12, 0, 16,.... The odd terms are all zero and the even terms tend to +o o: (n) oscillates infinitely.

Page 125

55] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 125 5. n2 +(- l)n n. The second term oscillates infinitely, but the first is very much larger than the second when n is large. In fact ( (n) n2 - n and n2 = (n - _)2 - _ is greater than any assigned value G if n > + /{G + }. Thus 5 (n)-+ o. 6. n3+(-1) 2, n'2{l+(-l)}, (_l) 22+n. 7. 11+3(-l)n, (ll/n)+3(- 1)n, lln+3(-1)", ll+{3(-l)n/n}, 11 + 3 (-1l)n, {ll + 3 (-)n/n, {11 +3 (- 1)n, (l/n) + 3 (- 1)n, 11 I, + {3 (- )n/nj. 8. sin nOwr. We have already seen (Exs. XXVI. 9) that when 0 is rational ( (n) oscillates finitely-unless 0 is an integer, when ( (n) =0, ( (zn)-0. The case in which 0 is irrational is a little more difficult. But it is not difficult to see that ( (n) still oscillates finitely. We can without loss of generality suppose 0<0<1. In the first place I (n) <1. Hence p (n) must oscillate finitely or tend to a limit. We shall consider whether the second alternative is really possible. Let us suppose then that lim sinl n7r =. Then however small e is we can choose no so that sin nrr lies between l-e and 1+e for all values of n greater than or equal to no. Hence sin (n+ 1) 0r - sin n7r is numerically less than 2e for all such values of n, and so I sin 2 0O cos (n + ) O ] < E. Hence cos (n + ~) w -= cos n0r cos ~ 0w - sin n0r sin 2 0r must be numerically less than e/ I sin 2 Or i. Similarly cos (n - ~) 7rr =cos nowr cos 0wtr +sin n^0r sin 0rr must be numerically less than E/ I sin 0r I; and so each of cos nOir cos 0r, sin nOrr sin 07r must be numerically less than E/ sin 0 r |. That is to say, if n is large cos nOrr cos 0rr is very small, and this can only be the case if cos nOrr is very small. Similarly sin nOrr must be very small (so that I must be zero). But it is impossible that cos n0rr and sin nO7r can both be very small, as the sum of their squares is unity. Thus the hypothesis that sin nOrr tends to a limit I is impossible, and therefore sin nOrr oscillates as n tends to oo. The reader should consider with particular care the argument 'cos noir cos Orr is very small, and this can only be the case if cos nOr is very small.' Why, he may ask, should it not be the other factor cos ~7rr which is ' very small'? The answer is to be found, of course, in the meaning of the phrase 'very small' as used in this connection. When we say ' ( (n) is very small' for large values of n, we mean that we can choose no so that ( (n) is numerically smaller than any assigned number, if n is sufficiently large. Such an assertion is palpably absurd when made of a fixed number such as cos ~0rr, which is not zero. 9. sin nOr +(l/n)sinin nOr + (1000000/n), sin nOqr + 1, sin nOir + n, ( - 1) sin n07r, sin nor + ( - 1)n, sin nOrr + {( - 1)n/n}, sin nrr + (- 1)" n. 10. a cos nOr + b sin n7r, sin2 n0r, cos2 n07r, a os2 nOrr + b sin2 nOr. 11. a+bn+( - 1) (c+dn) +e cos nO7r+f sin nOrr.

Page 126

126 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 12. n sin nOwr. If n is integral, q (n) =0, 4 (n) — 0. If n is rational but not integral, or irrational, 4 (n) oscillates infinitely. 13. n (acos2 nO7r+bsin2 nrr). In this case 4)(n)-*+oo if a and b are both positive, - - oo if both are negative. Consider the special cases in which a=0, b>0, or a>0, b=0, or a=0, b=0. If a and b have opposite signs 4 (n) generally oscillates infinitely. Consider any exceptional cases. 14. sin (n20r). If 0 is integral, ( (n) —0. Otherwise ( (n) oscillates finitely, as may be shown by arguments similar to though more complex than those used in Exs. XXVI. 9 and XXVII. 8*. 15. sin (n! Or). If 0 has any rational value p/q, n! 0 is certainly integral for all values of n greater than or equal to q. Hence 4 (n) — O 0. The case in which 0 is irrational cannot be settled without the aid of considerations of a much more difficult character. 16. cos (in! 0r), a cos2 (n! 0r) + b sin2 (n! 0ir), where 0 is rational. 17. an-[bn], (-l)n(an-[b~n]). 18. [,/n], (-l)n[/n], Jn-[/n]. 19. The smallest prime factor of n. When n is a prime, 4 (n)=n. When n is even, 4) (n) = 2. Thus 4) (n) oscillates infinitely. 20. The largest prime factor of n. 21. The number of days in the year n A.D. Examples XXVIII. 1. If q) (n) —+oo and + (n) _ f (n) for all values of n, then 4, (n) - + oo. 2. If 4 (n) - 0, and |, (n) I c q| (n) l for all values of n, then + (n) - 0. 3. If lim | (n) 1=0, then lim. ) (n)=0. 4. If 4) (n) tends to a limit or oscillates finitely, and I (n) \ I 4 (n) [ for n no, then +(n) tends to a limit or oscillates finitely. 5. If q(n)- -+oo, or -oo, or oscillates infinitely, and I|(n)l_> I 0(n) for n >ino, then )(n)- +oo or -oo or oscillates infinitely. 6. ' If q) (n) oscillates and, however great be no we can find values of n greater than no and for which + (n) is either greater than or less than q) (n), then + (n) oscillates.' Is this true? If not give an example to the contrary [q (n) =(- 1)n, (n) = 0]. 7. If (n)- l as n- oo, then also 4(n+p)- l, p being any fixed integer. [This follows at once from the definition. Similarly we see that if q) (n) tends to + oo or -oo or oscillates so also does q (n +p).] 8. The same conclusions hold (except in the case of oscillation) if p varies with n but is always numerically less than a fixed positive integer N; or if p varies with n in any way, so long as it is always positive. 9. Determine the least value of no for which it is true that (a) n2+n>1000 (nno), (b) n2+n>1000000 (n —no). * See Bromwich's Infinite Series, p. 485.

Page 127

56] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 127 10. Determine the least value of no for which it is true that (a) n+(-l)">1000 (n2no), (b) n+(-1)n>1000000 (nno). 11. Determine the least value of no for which it is true that (a) n2+n>G (n-no), (b) n+(-1)f>G (nno), G being any positive number. [(a) no=[~+~/(G+4)]: (b) no=l+[G] or 2+[G], according as [G] is odd or even: i.e. no=l +[G]+ {1 +(- )[]}] 12. Determine the least value of no such that (a) n/(n2+l)<'0001. (b) (1/n)+{( -l)/n2}<'000001, for n no. [Let us take the latter case. In the first place (1/n) + {( - i)n/n2}_ (n + 1)/n2, and it is easy to see that the least value of no such that (n + 1)/n2< 000001, for nno0, is 1000002. But the inequality given is satisfied by n=1000001, and this is the value of no required.] 56. Some general theorems with regard to limits. A. The behaviour of the sum of two functions whose behaviour is known. THEOREM I. If b (n) and * (n) tend to limits a, b, then + (n) + f (n) tends to the limit a + b. This is almost obvious. The argument which the reader will at once form in his mind is roughly this: 'when n is large qb (n) is nearly equal to a and r (n) to b and therefore their sum is nearly equal to a + b.' It is well to state the argument quite formally, however. Let 8 be any assigned positive number (e.g. '001, '0000001,...). We require to show that a number n0 can be found such that I 0(n)+ n(n()-a-b I <......... 1.....(1), for n -z0. Now by a proposition proved in Chapter III. (more generally indeed than we need here) the modulus of the sum of two numbers is less than or equal to the sum of their moduli. Thus (n) + f (n)-a- b a - b (n) - a + (n)- b. It follows that the desired condition will certainly be satisfied if no can be so chosen that ( l(n)-a + (n ) - b\< 8............... (2), for n _ no. But this is certainly the case. For since lim b (n)= a we can, by the definition of a limit, find n1 so that I q (n) - a < 8',

Page 128

128 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV for tn _ nl, and this however small may be S'. Nothing prevents our taking 8'= ~8, so that ]| ()- a I< 1, for n ' n1. Similarly we can find n2 so that \ 5(n)-b < ~8 for n _ n2. Now take no to be the greater of the two numbers n,, n2. Then if n n, \ ( (n) - a I < -8 and | f (n) - b I < 8, and therefore (2) is satisfied and the theorem is proved. The argument may be concisely stated thus: since lim p (n)= a and lim 4 (n) = b we can choose n1, 9t2 SO that I <(n) -a|< (nz nl), I (n)-b < _ (n - n), and then, if n is greater than either nl or n2, (n) +- (n) - - b i I (n)-a +f (n)- b I <, and therefore linm (n) +4' (n)} = a +b. Even when stated thus the argument may possibly appear to the reader to be merely a piece of useless pedantry, or an attempt to manufacture difficulties out of what is really obvious. We do not assert that such an opinion is, in this case, entirely groundless. The result really is very obvious: nor would any mathematician think it worth while as a rule to state arguments for what is so obvious at such length. But the reader must remember that the theorem, obvious though it may be, is one of the most fundamental and important theorems in all mathematics. It is one which every mathematician uses, consciously or unconsciously, twenty times a day. The proof of such a theorem must be made absolutely clear, explicit, and rigorous: no room must be left for any possible misapprehension or confusion. And this is not all. The great majority of theorems concerning limits are, as the reader will discover before long, far from being so simple and so obvious as this one. In this case the result obviously indicated by common sense was true. In more difficult cases common sense as often indicates an untrue result as a true one: sometimes it fails to give any indication at all. In such cases vague general arguments are worse than useless: they lead to mistakes not only gross in themselves but entirely confusing in their consequences. And unless the reader is prepared to take the trouble to try and understand the way in which rigorous methods apply to simple and obvious cases, where their application is easy, he will find that when he comes to difficult questions, which cannot be settled without them, he has not the capacity to use them. 57. Results subsidiary to Theorem I. The reader should have no difficulty in verifying the following subsidiary results. 1. If 0 (n) tends to a limit, but 'k (n) tends to + oo or to - oo or oscillates finitely or infinitely, then b (n) + q (n) behaves like * (n).

Page 129

56, 57] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 129 2. If (n) -- + o, cad f (n) - + o or oscillates finitely, then + (n) -- +oo. In this statement we may obviously change + oo into - oo throughout. 3. But if 4 (n) ( -- oo and # (n)- oo, then (n) + f(n) may either tend to a limit or to + oo or to - co or may oscillate either finitely or infinitely. These five possibilities are illustrated in order by (i) q (n)=n, + (n) = -n, (ii) 1 (n)=n2, (n)= -n, (iii) qb(n)=n, (n)=-n2, (iv) k (n)=n+( -1)", A(n)= -n, (v) 0(n)=n2+(-1)nn n, 4,(n)= -n2. The reader should construct additional examples of each case. 4. If ) (n) - oC and f ('n) oscillates infinitely, ) (n) + ~ (n) may tend to + oo or oscillate infinitely, but cannot tend to a limit, or to - oo, or oscillate finitely. For + (n) = {p (n) + + (n)} - f (n); and, if ( (n) + (n) behaved in any of the three last ways, it would follow, from the previous results, that 4 (n) - - o, which is not the case. As examples of the two cases which are possible, consider (i) (n)=n2, (n)=(-l1)n, (ii) 0(n)=n, (n)=(-l)nn2. Here again the signs of +oo and - oo may be permuted throughout. 5. If Q (n) and r (n) both oscillate finitely, 4 (n) + 4 (n) must tend to a limit or oscillate finitely. As examples take (i) q (n)= (n)= (-l)5, (ii) = (n)=cos n7r, + (n)=sin nrr. 6. If ( (n) oscillates finitely, and ' (n) infinitely, then ( (n) + ~ (n) oscillates infinitely. For f (n) is in absolute value always less than a certain constant, say G. On the other hand + (n), since it oscillates infinitely must assume values numerically greater than any assignable number (e.g. 10G, 100G,...). Hence ) (n)+4 (n) must assume values numerically greater than any assignable number (e.g. 9G, 99G,...). Hence q (n) + + (n) must either tend to + 4o or - oo or oscillate infinitely. But if it tended to + oo, for instance, (rin) = {k (n) + (n)} - ~ (n) would also tend to + o, by the preceding results. Thus qf (n) + r (n) cannot tend to + o, nor, for similar reasons, to - o: hence it oscillates infinitely. 7. If both 0 (n) and 4, (n) oscillate infinitely, ) (n) + * (n) may tend to a limit, or to + oo, or to - oc, or-oscillate either finitely or infinitely. Suppose, for instance, that q (n)=(- l)n, while, (n) is in turn each of the functions (-1)n+ln, {l+(-1)f+1}n, -{l+(-1)n}n, (-1l)+1'(n+l), (-l) n. We thus obtain examples of all five possibilities. 9 H. A.

Page 130

130 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV This exhausts all the possibilities which are really distinct. The results may be conveniently summarised in the following tabular form, in which 1 stands for 'tends to a limit,' 2 for 'tends to + o,' 3 for 'tends to - oo,' 4 for 'oscillates finitely,' and 5 for 'oscillates infinitely.' s (2)+ '(n~*) p (n*)+~ ()+ 1 1 1 1 2 2 1 3 3 1 4 4 1 5 5 2 2 2 2 3 1, 2, 3, 4, or.5 2 4 2 2 5 2 or 5 4 4 lor4 4 5 5 5 5 1, 2, 3, 4, or 5 Before passing on to consider the product of two functions we may point out that the result of Theorem I. may be immediately extended to the sum of three or more functions which tend to limits as n - oo. 58. B. The behaviour of the product of two functions whose behaviour is known. We can now prove a similar set of theorems concerning the product of two functions. The principal result is the following. THEOREM II. If lim (n) = a and lim r, (n) = b, then lim b (n) # (n) = ab. Let 0 (n) = a + 01b (n, 4 (n) = b + f1 (n), so that lim b 0 (n)= 0 and lim * (n) =0. Then ( (n) * (n) = ab + afr (n) + (n) (n) 1 (n). Hence the numerical value of the difference (n)n) (n) - ab is certainly not greater than the sum of the numerical values of a1 (n), boi (n), 01 (n) f1 (n). From this it is obvious that lim {9I (n) * (n) - ab} = 0, which proves the theorem.

Page 131

57-59] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 131 The following is a strictly formal proof. We have I ((n) 4/ () -ab[ [ afl(n) I + I bo,(n) I + I (n)l Il l+(n) i. Choose no so that for n - no% I||(n)<~8/l|b|, | i<(n)l<~8/ial. Then I | (n) +(n) -ab 1<8+ I + {ga/( al I b)}, which is certainly less than 8, if 8 < 3 l a i b 1. That is to say we can choose no so that - (n) + (nX) - ab < 8 (n _no), and so the theorem follows. The reader should study the details of this proof attentively; it is an elementary specimen of a type of proof perpetually occurring in higher analysis. We need hardly point out that this theorem, like Theorem I., may be immediately extended to the product of any number of functions of n. 59. Results subsidiary to Theorem II. There is of course a series of theorems concerning products analogous to those stated in ~ 57 for sums. It will be convenient to present the results in tabular form. We must distinguish now six different ways in which + (n) may behave as n tends to oo. It may (1) tend to a limit other than zero, (2) tend to zero, (3) tend to + o0, (3') tend to - oo, (4) oscillate finitely, (5) oscillate infinitely. We need not, as a rule, take account separately of (3) and (3'), as the results for one may be deduced from those for the other by a change of sign. Case 0 (n) * (n) ~ (n) ' (n) 1 1 1 1 2 1 2 2 3 1 3 3or 3' 4 1 4 4 5 1 5 5 6 2 2 2 7 2 3 any way 8 2 4 2 9 2 5 any way 10 3 3 3 11 3 4 3, 3', or 5 12 3 5 3, 3', or 5 13 4 4 1, 2, or 4 14 4 5 any way 15 5 5 any way We leave the verification of this table as an exercise to the reader. The 9-2

Page 132

132 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV more difficult cases 7, examples. 7 9, 11, 13, 14, 15 may be illustrated by the following 9 (n) 1/n 1/n2 1/n - 1/nl (-1)-/n (- 1)n/n k(n) n n n2 n2 n n2 { (n) J (n) 1 2 3 3' 4 5 qp(n) (-l) /n (-l )"/n (-1),/n 1/n 1/n (n) / (n)O, (n) (- )nn 1 (-1)nn 2 (-1)nn+2 3 ( - 1)n + 1in2 3' (-1)"n 4 (-1)n2 5 11 13 ( (n), (n) ~ (n)q(i)) (n) (n) ()(n) n 2+(- 1)n 3 (-1) (-) 1 n -2-(-1) 3' 1(-l)n 1-(-l)n 2 n (- 1)n 5 cos nv sin n7r 4 14 (n) (n) (n) + (n) {1-(-1)n}-[{1+( 1)n}/n] {1-(-1)n}-n{1 +(-1)"} 1 1+(-l)n {1-(- 1)}n 2 (-1)n (-1) n 3 (-1)n (-I)n+l? 3' 1+(-1) 1+{1-(-1l)n}n 4 cos Anrr n sin ~nT 5 15 -/ (n) n {1 -(-l)n} -[{1 +( —1)n}/n] {1+(-1)n}n (-1)"n (- 1)"n {1 +( - 1)n} n + [{1 (-1 )-}/nn] n COS -?7Tr + (n) [{1-(-1)n/}/n,]-n {1 + (-1)n} {1-( -1)'}n (-l)nn {-(- 1)"nn n sin Inir 3 3' 4 5 As an illustration of how to verify these examples we may take the first example under Case 14. Since 1- (- l)n=0 or 2 according as n is even or odd, while l+(-1)-=0 or 2 according as n is odd or even, the values of p (n) are 2, -2/2, 2, -2/4, 2, -2/6,..., and so b (n) oscillates finitely; while the values of + (n) are 2, -2x2, 2, -2x4, 2, -2x6,..., and + (n) oscillates infinitely. But d (n) + (n)=4 for all values of n.

Page 133

59-61] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 133 60. A particular case of Theorem II which is important is that in which r (n) is constant. The theorem then asserts simply that lim kf (n) = ka if lim b (n) = a. To this we may join the subsidiary theorem that if ((n) -- + oo, then aO (n) -- + 4o or - o, according as a is positive or negative, unless a=0, when of course ao(n)==0 for all values of n and lim ac (n) = 0. And if + (n) oscillates finitely or infinitely so does ab (n), unless a= 0. 61. C. The behaviour of the difference or quotient of two functions whose behaviour is known. There is, of course, a similar set of theorems for the difference of two given functions: but they are such obvious corollaries from what precedes that it would be waste of time to state them at length. In order to deal with the quotient * (n)/ (n), we begin with the following theorem. THEOREM III. If lim r (n)= a, and a is not zero, then limn {1/4 (n)} = 1/a. Let ( (n) = a + 01 (n), so that lim,b (n) = 0. Then I t{1/ (n)}-(l/a) = | 0, (n) I/{ a I Ia + 41 (n)l}, and it is plain, since lim 0l (n) = 0, that we can choose n0 so that for n ' no this is smaller than any assigned number 8. The theorems subsidiary to this may again be stated concisely by means of a table. Case '/ (n) 1/lp (n) 1 1 1 2 2 3, 3', or 5 3 3 2 4 4 4 or 5 5 5 2, 4, or 5 The three more complicated cases may be illustrated by the following examples: the number indicates the behaviour of 1/p (n).

Page 134

134 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV Case 2. q(n)= /n 3 b (n)= - l/n 3' 5 Case 4. (n)=(- 1) 4 q (n)= {1 +( -1)L} +[{1 - (- i)l}/] 5 Case 5. 5 (n)=(- )n2 2 +(n)={1+(-1)f}+ {1-(- i),},I 4 () (n)= { +(- 1)n} +[{1 -(( - 1)n}/] 5 It will not be necessary now to attempt to state an exhaustive series of theorems for the quotient Jf (n)/ (n), such as may be deduced from the results above and those of ~ 59. The principal theorem is THEOREM IV. If lim + (n) = a and lim - (n) = b, and a = 0, then lim {f (n)/+ (n)} = b/a. This requires no proof, being an immediate consequence of Theorems II and III. The reader will however find it very instructive to draw up, at any rate partially, a table for the quotient similar to those we have given for the product and reciprocal, and to illustrate some of the possible cases with examples. 62. THEOREM V. If R {, (n), (n), X (n),... is any rational function of b (n), r (n), X (n), etc., i.e. any function of the form P { (n), (n), x (n),... }/Q {I (n), (n), X (),..., where P and Q denote polynomials in b (n), f (n), X (n),...: and if lim q (n) = a, lim * (n) = b, lim X (n) =c,..., and Q (a, b, c,...) 0; then lim R {I (n), # (n), x (n),... = R (a, b, c,...). For P is a sum of a finite number of terms of the type A {f (n)}P { (n)}k..., where A is a constant and p, q positive integers. This term, by Theorem II (or rather by its obvious extension to the product of any number of functions) tends to the limit AaPb..., and so P tends to the limit P (a, b, c,...), by the similar extension of Theorem I. Similarly for Q: and the result then follows from Theorem IV.

Page 135

61-64] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 135 63. The preceding general theorem may be applied to the following very important particular problem: what is the behaviour of the most general rational function of n, viz. S (n) _ anP + an-l1 +... + ap bonq + bnq- +... + bq ' as n tends to oo. In order to apply the theorem we transform S(n) by writing it in the form nP- (+ a + +... +/( o + b +.. + ) n nP nnl The term in brackets is of the form R {4 (n)}, where b (n) = 1/n, and so, by Theorem V, it tends, as n tends to co, to the limit R (0) = aO/bo. Now, if p < q, lim n -q = 0; if p = q, np-q = 1 and lim np-q = 1; if p > q, nP - + o. Hence, by Theorem II, lim S (n)= 0 (p < q), lim S (n) = ao/bo (p = q), S(n)-+-co (p>q, a,/bo positive), S(n) —co (p>q, aQ/bo negative). Examples XXIX. 1. Determine the behaviour, as n-coc, of each of the following functions of n, and of their sums, differences, products and quotients, taken in pairs: +{(- l)"/n}, (- 1)'+(l/n), 1+(- l)n, (- 1)+n, n +{(- 1)/n}, (-)nn +(1)/n, (-.) n (1 +n), (-l){l +(l/n)}, (- l){n+(1/n)}. 2. Do the same for the functions cos2 -nr + (sin2 n7r)/n, cos2nr +n sin2 ~nr, n cos2 -nn7r + (sin2 1nr)/n. 3. Which (if any) of the functions l/(cos2 Inr +n sin2 n7-),\ l/{n (cos2 nr + n sin2 nMr)}, (n cos2 2n + sin2 2r)/{R (cos2 in-4 +n sin2 nr)} tend to a limit as n oc? 4. Denoting by S(n) the general rational function of n, considered in ~ 63, show that. in all cases lim {S (n+ 1)IS ()} = 1, lim [S {n +(l/)}/S(n)]= 1. 64. Functions of n which increase steadily with n. A special but particularly important class of functions of n is formed of those whose variation as n tends to oo is always in the same direction, that is to say those which always increase (or always decrease) as n increases. Since -b (n) always increases if (n) always decreases, it is not necessary to consider the two kinds of functions separately; for theorems proved for one kind can at once be extended to the other.

Page 136

136 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV DEFINITION. The function b (n) will be said to increase steadily with nif ~ (n + 1) _ q (n) for all values of n. It is to be observed that we do not exclude the case in which (b (n) has the same value for several values of n; all we exclude is possible decrease. Thus the function (n) = 2~n + (- 1)i, whose values for n = 0, 1. 2, 3, 4,... are 1, 1, 5, 5, 9, 9,... is said to increase steadily with n. Our definition would indeed include even functions which, from some value of n, remain constant; thus 4 (n) = 1 steadily increases according to our definition. However, as these functions are extremely special ones, and as there can be no doubt as to their behaviour as n tends to -c, this apparent incongruity in the definition is not a serious defect. There is one exceedingly important theorem concerning functions of this class. THEOREM. If 0*(n) steadily increases with n, then either (i) ( (n) tends to a limit as n tends to oo, or (ii) 0 (n) - + o. That is to say, while there are in general five alternatives as to the behaviour of a function, there are two only for this special kind of function. The proof is very simple. Imagine the various values of b (n) represented by points along the line L of Chap. I. Each point lies to the right of the preceding point (or coincides with it). Let Pn be the point corresponding to + (n). Let Q be any other point whatever on the line. Then either (1) there are values of n such that P, lies to the right of Q (or coincides with it), or (2) there are no such values. In the first case we say that Q is a point which is reached for some value of n, in the second case that it is a point which is not reached. Every point is a reached point or a not reached point. If any point Q is reached, so obviously are all points to the left of Q. There are two alternatives: (1) every point may be reached. Then it is clear that if G is any number, however large, it will correspond to a point Q, and, for sufficiently large values of n, Pn

Page 137

64] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 137 will lie to the right of Q, and so will Pn+,.... In other words k (n) > G for all values of n from a certain value. That is < (n) -- + o. Or (2) not every point may be reached. Then we can divide L into two segments, L1, L2, of which the first includes all reached points, the second all not reached points. The only doubt is as to whether the point R which divides the two segments is reached or not. If R is reached, then, since no point Pn can lie to the right of R (as in that case other points in L2 would be reached), all the points P, must coincide with R from some value of n, the first for which R is reached. Thus if OR = I, we have b (n) = 1 from a certain value of n onwards; so that, of course, lim b (n)= 1. On the other hand, if R is not reached all points to the left of R, however close to R, are reached. Thus we can choose n0 so that b (n) is as nearly equal to I as we please when n = n. Since, as n increases beyond no, b (n) approaches even more nearly to the value 1, it is clear that lim 0 (n) = 1. The theorem is thus proved. For example, if 0 (n) = 3 - (1/n), I = 3: the point R (OR-= 3) is not reached. From a common-sense point of view the theorem may be stated thus. Let the point P move along the line L in a series of jumps, its motion always being from left to right. Then either P will pass over the whole line, or its position will gradually approximate to a definite position R on the line L. The theorem is almost intuitive: the proof which precedes is merely a careful analysis of the process of argument implied in but suppressed by our intuition of its truth. COR. 1. If + (n) increases steadily with n it will tend to a limit or to + o according as it is or is not possible to find a fixed number G such that k (n) < G. We shall find this corollary exceedingly useful later on. CoR. 2. If ( (n) increases steadily with n and b (n) < G for all values of n, b (n) tends to a limit and this limit is less than or equal to G.

Page 138

138 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV It should be noticed that the limit may be equal to G: if e.g. ( (n)< 3-(1/n), every value of ( (n) is less than 3, but the limit is equal to 3. The reader should write out for himself the corresponding theorems and corollaries for the case in which 0 (n) decreases as n increases. 65. The great importance of these theorems lies in the fact that they give us (what we have so far been without) a means of deciding (in a great many cases) whether a given function of n does or does not tend to a limit as n -- o, without requiring us to be able to guess or otherwise infer beforehand what the limit is. If we know what the limit must be (if there is one) we can use the test I +(n) - 1l < e (n — no) as for example in the case of b (n) = l/n, where it is obvious that the limit can only be zero. But suppose we have to determine whether 1(n)=(1 + tends to a limit. In this case it is not obvious what the limit, if there is one, will be: and it is evident that the test above, which involves I, cannot, at any rate directly, be used to decide whether I exists or not. Of course the test can sometimes be used indirectly, to prove that I cannot exist by means of a reductio ad absurdum. If e.g. 4 (n)=(-l)", it is clear that I would have to be equal to 1 and also equal to - 1, which is obviously impossible. 66. The limit of xal as n tends to oo. Let us apply some of the preceding results to the particularly important case in which ( (n) = xn. First, suppose x positive. Then since b (n + 1) = xf (n), b (n) increases with n if x > 1, decreases as n increases if x< 1. If x = 1, b (n)= 1, lim b (n) = 1, so that this special case need not detain us. Thus, if x> I, x must tend either to a limit (which must obviously be greater than 1), or to + o. Suppose it tends to a limit 1. Then (Ex. XXVIII. 7) lim b (n + 1) = linm (n) -=; but lim b (n + 1) = lim xs (n) = x lim b (n) = xl,

Page 139

64-66] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 139 and therefore 1 = xl: and as x and 1 are both greater than 1, this is impossible. Hence q -X j+_ co (x > 1). Ex. The reader may give an alternative proof, showing by the binomial theorem that, if x=l + (8>0), x> l+n8, and so that X- +o. On the other hand, if x< 1, xn is a decreasing function and must therefore tend to a limit or to - oo. Since x. is positive the second alternative may be ignored. Thus lim x == 1, say, and as above 1= xl, so that I must be zero. Hence lim xn = 0 (0 < <1). In the special cases of x=0, 1, we clearly have lim x"= 0, lim xn = 1 respectively. Ex. Prove as in the preceding example that, if 0< <1, (l/x)n tends to + oo, and deduce that x'1 tends to 0. We have finally to consider the case in which x is negative. If -1 < x < 0 and x =-y, we have lim yr = O by what precedes and therefore lim x = 0. If x = -1 it is obvious that x'? oscillates, taking the values -1, 1 alternatively. Finally if x < - 1, y > 1, yn tends to + oo and therefore xn takes values, both positive and negative, numerically greater than any assigned number. Hence,x' oscillates infinitely. To sum up: (n) = x21- + 0o, (x > 1), lim i (n) = 1, (x = 1), lim (n,)= 0, (-1 <x < 1), b ({n) oscillates finitely, (x = - 1), (b (i) oscillates infinitely, (x <- 1). Examples XXX. 1. If (n) is positive and ()(n+l)>KIp(n), where K>1, for all values of n, then q (n) — + oo. [For qb()> IK (n- )>K2 (an- 2)... >K'-1 K (1) from which the conclusion follows at once.] 2. The same result is true if the conditions above stated are satisfied only for n i_ no. 3. If d (n) is positive and ( (n + 1) < IK (n), where 0< K<1, then lim q(n)=0. This result also is true if the conditions are satisfied only for n ' no.

Page 140

140 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV 4. If I1 (n+l) <K] +(n) for nn0, where 0<K<1, then lim (n-)=0. 5. If q (n) is positive and lim{4 (n+l)}/{( (nr)}=l >1, then +(n)- +oo. [For we can determine no so that {((n+l)}/{f((n)}>K>l, for nno: we may, e.g., take K half-way between 1 and 1. Now apply Ex. 1.] 6. If limr {qb(n+ l)}/{f (n)} =, where I is numerically less than unity, then lim o(n)=O. [This follows from Ex. 4 as Ex. 5 follows from Ex. 1.] 7. Determine the behaviour, as n-emo, of 0(n))=nrxn, where r is any positive integer. [Here {O (n + 1)}/{( (n)} = {(n+ 1)/n}")x - x as n-coo. If x is positive and greater than 1, q (n)~ —+ o. If x is positive and O<x<l, (n) —O. If x is negative and equal to -y, q(n)=(-l)'nn'y', and it is easy to see that (n) — >0 (-1<x<0) and +(n) oscillates infinitely (x< -1). Finally if x==l, (n)=n', and q (n)- + o; and if x=0, q (n)=0 for all values of n.] 8. Discuss n-"x"' in the same way. [The results are the same, except that when x=1 or - 1, ((n)->0.] 9. Draw up a table to show how nkxn behaves as n —oo, for all real values of x, and all (positive and negative) integral values of k. [The reader will observe that the value of k is immaterial except in the special cases when x=1 or - 1. In other words, it is the factor xn which is the most important factor: the second factor only asserts itself in the special cases when, owing to the fact that x= + 1, the first factor loses all or most of its importance. The fact is that since lim {(n + l)/n}k= for all values of k, positive or negative, the limit of the ratio j (n + 1)/lp(n) depends only upon x.] 10. Prove that if x is positive,x - 1, as n -~ oo. [Suppose, e.g., x>l. Then x, ^', x/,... is a decreasing sequence, and /x > 1 for all values of n. Thus G/x- Il, where 1 1. But if 1> 1 we could find values of n, as large as we please, for which / x>l or x>ln: and as l-3-+ o as n —oo this is impossible.] 11. fn-/I. [For n+4/(n+l)<< n if (n+l)n<nl+l1 or {l+(l/n)} <n, which is certainly satisfied if n? 3 (see ~ 67 for a proof). Thus 2/n decreases as n increases from 3 onwards, and as it is always greater than unity it tends to a limit which is greater than or equal to unity. But if fn —l (1>1), n >1, which is certainly untrue for sufficiently large values of i, since 1l,/n-+ c with i (Exs. 7, 8).] 12. /(n!) --- +o. [However large G may be, n!>Gl" if n is large enough. For if u. = G"/n!,,, + 1/un = G/n + 1, which tends to zero as n — co, so that u, does the same (Ex. 6).] 67. The limit of 1+ -. A more difficult case which can be settled by the help of ~ 64 is given by b (n)= {1 + (1/n)}n.

Page 141

67, 68] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 141 We shall prove first that (+ > 1-..................(1), (' n-\ i.e. that /n + 1 I>./.(-1)2 1.................. Let (n + 1)/(-l) a, n } so that a> 1. Then the inequality (2) may be written in the form an-l> n n+1 i) n-1 n or (an - 1)/n > (aln-l_ 1)/(n- 1)...............(3), or, dividing by the positive factor a - 1, (an-1 + a"-2 +... + 1)/n > (an-2 + a~t-3+... + 1)/(n - 1)...(4). Multiplying up and subtracting we obtain (n-1) a',-l1- a-2 - a-3-... - 1 >.........(5), and this inequality is evidently true, since a > 1. Thus the inequality (1) is established. Hence, by the theorem of ~ 65, {l+(l/n)}n tends to a limit, or to + o, as n- oo. But l\ l 1 n(n - 1) I n(n- 1)... (n-n+ 1) 1 1+ =l]+n.-+- - -..+ /n/ n 1.2 n2 1.. -, by the binomial theorem; and so 1 1 1 n 1.2 1.2.3 1.2.3...n 1 1 1 <1 + 1 + ++... + < 3. 2 22 29.-1 Thus {1 + (l/n)}1 cannot tend to + oc, and so lim 1+- =e, where e is a number such that 2 < e _ 3. We shall have a great deal to do with this number e later on. 68. The limit of n (/3 - 1). We proved above that if a> (an - I)/n > (an - 1 - )/(n - 1). Let a"(n-1)=/. Then a3>1, and the inequality may be written in the form (n - 1) (n-,pt/- 1 )>n (,/ - 1).

Page 142

142 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV Thus, if b (n)=n (f3-l), ) (n) decreases steadily as n increases. Also q5 (n) is obviously always positive. Hence 5 (n) tends to a limit I as n- oo, and I _0. Moreover (a) >1 - (1/13) for all values of n. For the inequality n(/23-l )>l-(l/13) becomes, if we put y?" for 3, ny -(y-l)>yn- 1, or ny> yn ly-1.n-2.. Jr 1, which is obviously true, since y>1. Hence lim n.(;/2 - 1) =f (0), o00 where f (,) is afunction of f, and f (3) 1- (1//3) for all values of 3>1. Next suppose /3<1, and let 3==l/y; then n (//3- 1)= -n(/y- )/I/y. As n-eoo, n (-gy-l) —f(y), by what precedes. Also (Ex. XXX. 10) 7- 1. Hence if 3=(l/y)<l, n (/3 - l) -/(y). Finally, if 3=1, n (/3 - 1)=0 for all values of n. Thus we arrive at the result: the limit lim n (//3- 1).defines a function of 3 for all positive values of 3. This function f (/) possesses.the properties f(1/03)= -f(3), f(1)=0, and is positive or negative according as 3> 1. Later on we shall be able to identify this function as the Napierian logarithm of 3. Example. Prove that f(a13)=f (a) +f(3). [Use the equations f(a3P)=limn n (/a13- 1)=lim {n (a- 1) V/1 +n (/3 - 1)).] 69. Infinite Series. Suppose that u(n) is any function of n defined for all values of n. If we add up the values of u(v) for v = 1, 2,... n we obtain another function of n, viz. s (n)= u (1) + u (2) +... + u (a), also defined for all values of n. It is generally most convenient to alter our notation slightly and write this equation in the form Sn ==1 + u2+... + Un, or, more shortly, sit = E uV. v=1 Now suppose that Sn tends to a limit s when n tends to oo, i.e. that n lim S UV =- s. n-00 v=1

Page 143

68, 69] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 143 This equation is usually written in one of the forms V-y S= s, 'It +?2 + UL +... s, v=1 the dots denoting the indefinite continuance of the series of u's. The meaning of the above equations, expressed roughly, is that by adding more and more of the u's together we get nearer and nearer to the limit s. More precisely, if any small positive number e is chosen, we can choose no so that the sum of the first no or any greater number of terms lies between s-e and s+ e; or in symbols ' - e < Sn < S + e, if n _ no. In these circumstances we shall call the series Ul + U2 +..., a convergent infinite series, and we shall call s the sum of the series, or the sum of all the terms of the series. Thus to say that the series uo + ul, +... converges and has the sum s, or converges to the sum s or simply converges to s, is merely another way of stating that the sum sn = u + + +... + un of the first n terms tends to the limit s as n - oo, and the consideration of such infinite series introduces no new ideas beyond those with which the early part of this chapter should already have made the reader familiar. In fact the sum s, is merely a function f (n), such as we have been considering, expressed in a particular form. And any function ( (n) may be expressed in this form, by writing (n= (o) + ( 1) +[ )- (0)] +... + [0 (n)- 0(i- 1)]. It is sometimes convenient to say that + (n) converges to the limit i, say, as n - o. The use of the phrase 'converges' instead of 'tends to' is of course suggested by the phraseology usually employed in speaking of infinite series. If sn - + o- or to - oo we shall say that the series tu + ul +... is divergent or, diverges to + oo, or - oo, as the case may be. These phrases too may be applied to any function p (n)-e.g. if (n) -- + oo we may say that p (n) diverges to + o. If s, does not tend to a limit or to + oo or to - oo it oscillates finitely or infinitely: in this case we say that the series u0 + ul +... oscillates finitely or infinitely.

Page 144

144 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV 70. General theorems concerning infinite series. When we are dealing with infinite series we shall constantly have occasion to use the following general theorems. (1) If i +2+ +... is convergent, and has the sum s, then a + + u2 +... is convergent and has the sum a +s. Similarly a + b c +... + + 2+... is convergent and has the sum a+b+c+... +k+s. (2) If 1 U +... is convergent and has the sum s, then Um+- + Um+2 +... is convergent and has the sum S -t1-8- U -2... -'Itm. (3) If any series considered in (1) or (2) diverges or oscillates so do the others. (4) If U + 2+... is convergent and has the sum s, then au1 + au2 +... is convergent and has the sum as. (5) If the first series considered in (4) diverges or oscillates so does the second, unless a = 0. (6) If + u, +... and + v2 +... are both convergent the series (uin + vi) + (u2 + v2) +.. is convergent and its sum is the sum of the first two series. All these theorems are almost obvious and may be proved at once from the definitions or by applying the results of ~~ 56-60 to the sum sn = i + ~u2 +... + 'n. (7) If u1 + 2 +... is convergent, then lim u,, = 0. For Un,=Sn-sni, and so and s,-i have the same limit s. Hence lim u, = s - s = 0. The reader may be tempted to think that the converse of the theorem is true and that if lim m.=0 the series Etu, must be convergent. That this is not the case is easily seen from an example. Let the series be i1+~+,+k+... so that u= lln. The sum of the first four terms is t fur term is te sm of te n+r. The sum of the next four terms is I+I+ +I>4=; the sum of the next eight terms >s8-, and so on. The sum of the first 4+4+8+ 16+...+2n=2n+l terms is greater than 2+ + - + +... + =- (n + 3), and this increases beyond all limit with n: hence the series diverges to + o.

Page 145

70, 71] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 145 (8) If itu + u + + 3 +... is convergent, so is any series formed by grouping the terms in brackets in any way to form new single terms, as e.g. in (u, + u2 + iu3) + ua + (u5 + u6) +..., and the sums of the two series are the same. Here again the converse is not true. Thus, e.g. 1-1 +1 -1 +... oscillates, while (1 -1)+(1-1)+... or 0+0+0+... converges to O. (9) If evenr term u, is positive (or zero) the series Eun must either converge or diverge to + oo. If it converges its sum must be positive (unless all the terms are zero, when of course its sum is zero). For s, is an increasing function of n, according to the definition of ~ 64, and we can apply the results of that section to s,. (10) If every term u, is positive (or zero) the necessary and sufficient condition that the series Cz, should be convergent is that it should be possible to find a number G such that the sum of any number of terms is less than G, and if G can be so found the sum of the series is not greater than G. This also follows at once from ~ 64. It is perhaps hardly necessary to point out that the theorem is not true if the condition that every n,, is positive is not fulfilled. For example 1- 1+1- +... obviously oscillates, Sn being alternately equal to + 1 and to 0. (11) If uA + ur +..., v1 + v2+... are two series of positive (or zero) terms, and the second series is convergent, and if un - vn for all values of n, then the first series is also convergent, and its sum is less than or equal to that of the second. For, if vI v+ +... t, vI + v2 +... + v,: t, for all values of n, and so u1 + u2 +... + un _ t; which proves the theorem. Conversely, if Su, is divergent, and vn - U,, then Zvn is divergent. 71. The infinite geometrical series. We shall now consider the 'geometrical' series, whose general term is u,, = rn-1. In this case s,,= 1 + r + r2 +... + r-l = ( I-r)/(l-r), except in the special case in which r= 1, when s,, = 1 + +... + 1 = n. In the last case s, - + oo. In the general case Sn will tend to a H. A. 10

Page 146

146 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV limit if and only if r" does so. Referring to the results of ~ 66 we see that the series 1 + r + r2 +... is convergent and has the sum 1/(1 - ) if and only if - 1 < <1. If r _ 1, s1 - n, and so s,, - +; i.e. the series diverges to +oo. If r=- 1, s, =1 or 0 as n is odd or even: i.e. s,7 oscillates finitely. If r< -1, s,, oscillates infinitely. Thus, to sum up, the series l +r + r2 +... diverges to + o if r _ 1, converges to 1/(1 - r) if - 1 < r < 1, oscillates finitely if r =- 1, and oscillates infinitely if r < - 1. Examples XXXI. 1. Recurring decimals. The commonest example of an infinite geometric series is given by an ordinary recurring decimal. Consider for example the decimal '217i3. This stands, according to the ordinary rules of arithmetic, for 2 1 7 1 3 1 3 217 13 + 1 2687 10 102 103 104 105 106 + 107 ' 1000 5 12375 The reader should consider where and how any of the general theorems of ~ 70 have been used in this reduction. 2. Show that in general. al 2...am al.... -.an-a... a, a a2... arn a, a*a...2 99...900... O0 the denominator containing n 9's and rn 0's. 3. Show that a pure recurring decimal is always equal to a proper fraction whose denominator does not contain 2 or 5 as a factor. 4. A decimal with m non-recurring and n recurring decimal figures is equal to a proper fraction whose denominator is divisible by 21' or 51"m but by no higher power of either. [For the decimal is converted into the sum of an integer and a pure recurring decimal by multiplication by 10b, but not by multiplication by any lower power of 10.] 5. The converses of Exs. 3, 4 are also true, but their proof depends on Fermat's Theorem in the Theory of Numbers. If r=plq, and q is prime to 10, it is known that we can find n so that 10' -1 is divisible by q. Hence r may be expressed in the form P/(10 - 1) or in the form P P io- + -t+. i.e. as. a pure recurring decimal with n figures. But if q=2a5gQ, where Q is prime to 10, and n is the greater of a and 3, 10onr has a denominator prime to 10, and is therefore expressible as the sum of an integer and a pure recurring decimal. But this is not true of 10'r, for any value of Mu less than m; hence the decimal for r has exactly m non-recurring figures.

Page 147

71] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 147 6. To the results of Exs. 2-5 we must add that of Ex. I. 4. Finally, if we observe that 9 9 9.9= + i- +o +...=, 10 102 103 we see that every terminating decimal can also be expressed as a mixed recurring decimal whose recurring part is composed entirely of 9's. For example, -217 = 2169. Thus every proper fraction can be expressed as a recurring decimal, and conversely. 7. Decimals in general. The expression of irrational numbers as non-recurring decimals. Any decimal, whether recurring or not, corresponds to a definite number between 0 and 1. For the decimal 'ala2a3a4... stands for the series 1 +a2 + a3 10 102 103 " Since all the digits a,. are positive the sum s, of the first n terms of this series increases with n: also it is certainly less than -9 or 1. Hence s, tends to a limit between 0 and 1. Moreover no two decimals can correspond to the same number (except in the special case noticed in Ex. 6). For suppose that 'ala2a3..., 'blb2b3... are two decimals which agree as far as the figures a,-l, br-1, while a.>b.. Then a, _ br, +1 >b,.. + br+... (unless br.+1, br+2,... are all 9's), and so 'al a2...a a, + 1....a >blb2. -.. b b + I...b It follows that the expression of a rational fraction as a recurring decimal (Exs. 2 —6) is unique. It also follows that every decimal which does not recur represents some irrational number between 0 and 1. Conversely, any such number can be expressed as such a decimal. For it must lie in one of the intervals 0, 1 1/10 1/10, 2/10;...; 9/10; 1. If it lies in r/10, (r +1)/10 the first figure is r: by subdividing this interval into 10 parts we can determine the second figure; and so on. Thus we see that the decimal 1-414..., obtained by the ordinary process for the extraction of,/2, cannot recur. 8. The decimals -1010010001000010... and -2020020002000020..., in which the number of zeros between two l's or 2's increases by one at each stage, represent irrational numbers. 9. The decimal -11101010001010..., in which the nth figure is 1 if n is prime, and zero otherwise, represents an irrational number. [Since the number of primes is infinite the decimal does not terminate. Nor can it recur: for if it did we could determine iz and p so that m 2, m +p, m + 2p, m + 3p,... are all prime numbers; and this is absurd, since the series includes m2 + mip.]* * All the results of Exs. XXXI. may be extended, with suitable modifications, to decimals in any scale of notation. For a fuller discussion see Bromwich, Infinite Series, Appendix I. 10-2

Page 148

148 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV Examples XXXII. 1. If - <r<1, the series r. + n +l... is convergent and its sum is 1/(l-r)-1-r-... -.n-l (~ 70, (2)). 2. The series rm+-rm+l+... is convergent and its sum is rIn/( - r) (~ 70, (4)). Verify that the results of Exs. 1 and 2 are in agreement. 3. Prove that the series 1 + 2r + 2.2 +... is convergent, and that its sum is (l+r)/(l-r), (a) by writing it in the form -1+2(1+r-+r2+...), (3) by writing it in the form 1+2 (.+ r2 +...), (y) by adding the two series 1+r9+r2 +..., r+r2+.... In each case mention which of the theorems of ~ 70 are used in your proof. 4. Prove that the arithmetic series a+(a+b) +(a+2b)+... is always divergent, unless both a and b are zero. Show that if b =0 it diverges to + o or to - co according to the sign of b, while if b =0 it diverges to + oo or - co according to the sign of a. 5. What is the sum of the series (1 - r) + (r- r2)+(r2 _ -3)+... when the series is convergent? [The series converges only if - 1 <r 1. Its sum is 1, except when r= 1, when its sum is 0.] ~,2 r2 6. Sum the series.r2+ + -2 2)2 +.... [The series is always convergent. Its sum is 1 +r2, except when r9=0, when its sum is 0.] 7. If we assume that I +r+r2+... is convergent we can prove that its sum is 1/(1 -r) by means of ~ 70, (1) and (4). For if 1 r+r+92+...=s, s=1 +r( + r2+...)= 1 +s. 8. Sum the series r+ + ( +r2+ + r (1l+ r)2 when it is convergent. [The series is convergent if - < 1/(1 +r)< 1, i.e. if r< -2 or if r>0, and its sum is 1 +r. It is also convergent for r'=0, when its sum is 0.] 9. Answer the same question for the series r r r r + (- 1-r+ (1 ) — r) --- + 1+r + (r2 r r) r +- 2 + -+.... 10. Consider the convergence of (l +r) + (r2+r3) +..., (1 +.'2) + (l a+r2) 3+S'4 +4+)+..., 1-2r+r2+r93-2r4+ 5+..., (1 -2r+r2)+(1.3- 2r4+.5)+..., and find their sums when they are convergent. 11. If an is positive and not greater than 1, the series ao+ alr+a2r2 +... is convergent for 0_r<l, and the sum of the series is not greater than 1/(i-r).

Page 149

72] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 149 12. If in addition the series ao+al+a2+... is convergent, the series a0o+ar+a2r2+... is convergent for Or- l1, and its sum is not greater than the lesser of ao+al+<a2+... and 1/(1 - ). 1 1 13. -The series 1+ 1- 1 3 ' is convergent. [For 1/(1. 2...n)<1/2-1.] 14. The series 1 1 1 1 1.2 1.2.3.4+' 1+l.2.3+1.2.3.4.5'" are convergent. 15. The general harmonic series 1 I 1 a a+b +a+ 2b where a and b are positive, diverges to + o. [For un= l/(a +-b)> l/{n (a+ b)}. Now compare with 1 + (1/2)+(1/3) +....] 16. Show that the series ('1o- 1) + (u1 - 2) + ( — U3) +... is convergent if and only if un tends to a limit as n >oo. 17. If Ul +u2 3+ -... is divergent, so is any series formed by grouping the terms in brackets in any way to form new single terms. 18. Any series, formed by taking a selection of the terms of a convergent series of positive terms, is itself convergent. 72. The representation of functions of a continuous real variable by means of limits. In the preceding sections we have frequently been concerned with limits such as lim 0 (>), and series such as zu (x) + 2 (x) +... = lim {i (x) + 2 (x+).+ U* ()}, in which the function of n whose limit we are seeking involves, besides n, another variable x. In such cases the limit is of course a function of x. Thus in ~ 69 we came across the function f(x) = lim n (Xyx- 1): ni -- 0c and the sum of the geometrical series 1 + x + x2 +... is a function of x, viz. the function which is equal to 1/(1 - ) if- 1 < x < 1 and is undefined for all other values of x. Many of the apparently 'arbitrary' or 'unnatural' functions considered in Ch. II are capable of a simple representation of this kind, as will appear from the following examples.

Page 150

150 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV Examples XXXIII. 1.,, (x)=x. Here n does not appear at all in the expression of n,, (x), and q (x)= lim nf, (x) =x for all values of x. 2.,,(x)=x/n. Here ((x)=lim S,,(x)=0 for all values of x. 3. Q,(x)=nx. If x>0, 02(x)-+oo; if x<0, b(x) —O —: only for x=0 has q, (x)a limit (viz. 0) as n-co. Thus q (x)=0 when x=0 and is not defined for any other value of x. 4. f,,(x)= l/nx, nx/(nzx+l). 5. fn(x)=t=x. Here q )(x)=0, (-1<x<l); O(x)=l, (x=l); and (x) is not defined for any other value of x. 6., (x)=x" (1-x). Here + (x) differs from the q (x) of Ex. 5 in that it is defined and has the value 0 for x= 1. 7. qP (x) = x/n. Here q (x) differs from the p (x) of Ex. 6 in that it is defined and has the value 0 for x= -1 as well as + 1. 8.,,(x/)=x~/(x'+l). [b(x)=0, (-1<.<1); b(x)=2, (=l); P(x)=l, (x< - 1 or x>l); and q (x) is not defined for x= - 1.] 9. 07(z)=:x"/(xn- 1), 1/(xn+l.), 1/(xn-l), 1/(x'+x-n), 1/(x5-_-n). 10. 0.(x)=(x7-l)/(xl 1+), (nxa' —l)/(nx$7+l), (xl-n)/(x1x+n). [In the first case 4 (x)=l if Ix >1, q (x)= -1 if!x <1, 4 (x)=0 if x=1 and o (x) is not defined for x= -1. The second and third functions differ from the first in that they are defined both for x=l and x= - 1: the second has the value 1 and the third the value -1 for both these values of x.] 11. Construct an example in which O(x)=l, (Ixl>l); q((x)=-1, (Ixl<l); and q(x)=0, (x=+1). 12. )\(SX) {(X2-1) l/(21+ 1 )}2, n/(xn + n ) 13. 0,(x)={xf(x) +g(z)}/(xc"+l1). [Here ()=)/f(x), (xl >l); 6(x>)= g(x), (xlI<l); ((x)-={f(x)+g(x)}, (x=l); and (x) is undefined for x= -1.] 14.,, (x)=(2/r) arc tan (nx). [q((x)=l, (x>0); q((x)=0, (x=0); (x)= - 1, (x<0). This function is important in the Theory of Numbers, and is usually denoted by sgn x.] 15. /p (x)=(l/n) sin nxTr. [q (x)=0 for all values of x.] 16. (, (x)=sinnxrr. [q (X)=0 when x is an integer, and is otherwise undefined.] 17. *, (x)=(1/n1) cos nxrr, cs nXr, a cos2 nxrr + b sin2 nxir. 18. If O (x) = sin (n! x), ) (x)=0 for all rational values of x (Exs. XXVI. 9, XXVII. 8). The consideration of irrational values presents greater difficulties. 19. <q () = (cos2 XTr). [q (x)=0 except when. is integral, when ( )=.] 20.,b (X) = (sin2 x 1r)' (cos xrr), (sin xTr).

Page 151

73] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 151 21.,H ()=(acos2 T + b sin2x r)". [Here (x) =0 if a cos2x,r+bsin2 xr <1, (x)=1 if a cos2xr + b sin2 Xr =1, and 0 (x) is otherwise undefined. For what values of x these respective conditions are satisfied depends on the values of a and b. Thus if a and b are both numerically less than unity, ( (x)=0 for all values of x. Consider, e.g., the cases a=b=l; a=b=-; a=b=2; a=1, b=2; a=2, b=l; a=2, b=~.] 22. If NV 1752, the number of days in the year NV A.D. is lim {365 + (cos2 Tr)aT - (cos2 I Nr)" + (cos2 4 A,r)n}. 73. Limits of Complex functions and series of Complex terms. In this chapter we have, up to the present, concerned ourselves only with real functions of n and series all of whose terms are real. There is however no difficulty in extending our ideas and definitions to the case in which the functions or the terms of the series are complex. Suppose that b (n) is complex and equal to R (n) + iS(n), where R (n), S (n) are real functions of n. Then if, as n oo, R (n) and S(n) converge respectively to limits r and s, we shall say that ( (n) converges to the limit r + is, and write lim ( (n)= r + is. Similarly if u,, is complex and equal to vi + iwz we shall say that the series u1 + u2 + u3 +... is convergent and has the sum r + is, if the series VI + V2 + V +..., w*t + w2 + uW +... are convergent and have the sums r, s respectively. To say that u + l2+ + u... is convergent and has the sum r + is is of course the same as to say that the sum Su = U1 +... + ~- + u vn = ( + v2 +... + Vn) + i (w1 + w2 +.. + W. ) converges to the limit r + is as n -o. In the case of real functions and series we also gave definitions of divergence and oscillation (finite or infinite). But in the case of complex functions and series there are so many possibilitiese.g. R(n) may tend to + o and S(n) oscillate-that this is hardly worth while. When it is necessary to make further distinctions of this kind, we shall make them by stating the way in which the real or imaginary parts behave when taken separately.

Page 152

152 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE [IV 74. The reader will find no difficulty in proving such theorems as the following, which are obvious extensions of theorems already proved for real functions and series. (1) If lim c (n)= r +is, then lim c(n+p)=r+is, for any fixed value of p. (2) If ui + zI2 +... is convergent and has the sum r + is, then a+b+ c+... + k- +l + u +... is convergent and has the sum a + b + c +... + k + r + is, and Up+ + 2Up+ +... is convergent and has the sum r + is - z - uz -... - P. (3) Iflimn(n)=a and lim (2)=b, then lim {l(n)+r(n)} =a+b. (4) If lim (0n) = a, lim k4 (n) = ka. (5) If lim ( (n) = a and lim * (n) = b, then lim b (n) r (n) = ab. (6) If u1 + u2 +... converges to the sum a, and v1 + v2 +... to the sum b, then (ul + v1) + (u2 + v2) +... converges to the sum a + b. (7) If u1 + i2 +... converges to the sum a, kzil + k +... converges to the sum ka. (8) If ti + 2/2 + u3 +... is convergent, then lim un = 0. (9) If Ui + u2 + u3 +... is convergent, so is any series formed by grouping the terms in brackets, and the sums of the two series are the same. As an example, let us prove theorem (5). Let (n) =R (n) + iS (i), () =R'(n) + iS'(n), a = r + is, b=' +is'. Then R(n)-r, S(n)-zs, R'(n)-r', S'(n)s'. But = () (n)= RR'- SS' + i (RS' + R'S) and RR'- SS'-rr'- ss', RS'+R'S-^s' + 's, so that ( (n) () -rr' - ss' + i (rs' + r's), i.e. p(n) + (n) -- (r+ is) (r' + is')= ab. The following theorems are of a somewhat different character. (10) In order that (n) = R(n) + iS(n) should converge to zero as n - oo it is necessary and sufficient that (n) = R i= ()2+S ()}2 should converge to zero. If R (n) and S(n) both converge to zero it is plain that,/(R2+,S2) does so. The converse follows from the fact that the numerical value of R or S cannot be greater than %/(R2+S2).

Page 153

74-76] LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE 153 (11) More generally, in order that c (n) should converge to a limit I it is necessary and sufficient that I \ (n)- l should converge to zero. For qp (n) - converges to zero, and we can apply (10). 75. The limit of xn as -- oo, x being any complex number. Let us consider the important case in which ((n)= x=. This problem has already been discussed for real values of x in ~ 66. If xn - l, xnf+- 1, by (1) above. But since x+1 = zx. xI, wn+i - xl, by (4) above; and therefore I = xl, which is only possible if (a) 1=0 or (b) x=1. If x=1, limx=1. Apart from this special case the limit, if it exists, can only be zero. Now if x = r (cos 0 + i sin 0), where r is positive, we know that xn = rn (cos nO + i sin nO), so that Ix a -=rn. Thus xn tends to zero if and only if r < 1; and it follows from (10) of the last paragraph that lim xn = 0, if and only if r < 1. In no other case does x11 converge to a limit, except when x = 1 and xS -a 1. 76. The geometric series 1 + + x2 +..., when x is complex. Since s= = 1 + x+,+... + X2-1 = (1 -.n)/(1 - x), unless n = 1, when the value of s,, is Tn, it follows that the series 1 + x +x +... is convergent if and only if r = x < 1. And its sum when convergent is 1/(1 - x). Thus if x = r (cos 0 + i sin 0) = r Cis 0, and r < 1, 1 + x +... 1/(1 -r Cis 0), or 1 + r Cis 0 r Cis 2...=1/(1-rCis0) = (1 - r cos 0 + ir sin O)/(1 - 2r cos 0 + r2), or, separating the real and imaginary parts, 1 + r cos 0 + r2 cos 20 +... = (1 -r cos )/(1 - 2r cos 0 + r2), r sin + r2 sin 20+... = sin 0/(1 - 2r cos 0 + r2), provided r < l. If we change 0 into 0 + -ir we see that these results hold also for negative values of r numerically less than 1. Thus they hold for - 1 < < 1.

Page 154

154 LIMITS OF FUNCTIONS OF AN INTEGRAL VARIABLE Examples XXXIV. 1. Prove directly that (n) = r' cosnO converges to 0 if r<l and to 1 if r=1, 0=0. Prove further that if r=1, 0t0 it oscillates finitely, if r>l, 0=0 it diverges to + o and if r>1, 0=+0 it oscillates infinitely. 2. Establish a similar series of results for q (n) =?r sin nO. 3. Prove (as for the case of a real x in Ex. XXXII. 7) that if l+Sx+x2... converges its sum can only be 1/(l-x). 4. Prove that xm1 + Xl + 1 +... ='/(l - X), x m_ x1 +e +1. /(1 + x), x"n + 2x'l +1 + 2x,, + 2 + =m (1 + x) = (1 - x), Xm__ 2~Xn + 1 2.t72 + 2__... =- Xt (I - x)/(1 + A), if and only if Ix< 1. Which of the theorems of ~ 74 do you use? 5. Let (in the notation of Chap. III, ~~ 25 et seg.) PoP,=1, PIP2=x, P2P3=x2... where x=rCisO. Plot the points Po, P1, P2,..., and show how the figure obtained indicates the result of ~ 76. Prove that, if r<l, the point P,,, where n is large, is very near to the point (1 -' cos 0)/(1 - 2r cos +r- 2), ' sin 0/(1 - 2r cos 0 + r2). 6. Prove that, if -1 <r<l, 1 + 2r cos 0 + 2r2 cos 20 +... =(1 - r2)/(1-2r cos 0 + r2). 7. The series 1+ (+ {./( 1 + ()} + {(1 +X)}2+... converges to the sum 1/(1 - - +x if jx/(1l+x) <. Show thatthis is equivalent to the assertion that x has a real part greater than -. 8. Determine similarly the regions of values of x for which the series, obtained by writing x for r in Ex. XXXII. 9, are convergent, and find their sums when they are convergent. MISCELLANEOUS EXAMPLES ON CHAPTER IV. 1. The function j (n) takes for n= 0, 1, 2,... the values 1, 0, 0, 0, 1, 0, 0, 0, 1,.... Express 6 (n) in terms of n by a formula which does not involve trigonometrical functions. [ 4 (n)= 4 {1 + ( - 1) in ( - i)"}.] 2. If k (n) steadily increases, and + (n) steadily decreases, as n tends to co, and if, (n)> ( (n) for all values of n, then both ( (n) and 4+ (n) tend to limits, and linm (n)_lim, (n). [This is an intermediate corollary from ~ 64.] 3. Prove that if ()(1+ (), a i-(,) then 9 (en+ 1)> ( ()X) and q (n + 1)<q, (<). [The first inequality has already been proved in ~ 67; the second may be proved similarly.]

Page 155

MISCELLANEOUS EXAMPLES ON CHAPTER IV 155 4. Prove also that + (n)>+ (n) for all values of n: and deduce (by means of the preceding examples) that both ( ({6) and + (n) tend to limits as n tends to oo *..If T(m)\ m(m- 1)...(m-n+l) 5. If -, =, - m not being a positive integer, and - <x< 1, then in,=( )xn'-0 as n- co. [For t,, + 1/t = {((m - n)/( + 1)} X-> - x. Now apply Ex. XXX. 6.] 6. The arithmetic mean of the products of all distinct pairs of positive integers, whose sum is n, is denoted by Sn. Show that lim (S/n2)=l/6. (Aath. Trip. 1903.) 7. If x1i={x+(A/x)}, x2=i{x1+(A/x1)}, and so on, x and A being positive, prove that lim x,,='/A. Prove first that xn -A -= /-A)2 ] 8. If ( x (a) is a positive integer for all values of n, and tends to o with a, then x(n)->0 or + o according as 0<x< or x> 1. Discuss the behaviour of x(1n), as n —oo, for other values of x. 9t. If an increases (decreases) steadily as n increases, the same is true of (al + a2 +... + a)/n. 10. If xl =- (k + x,), and k and x1 are positive, the sequence xl, x2, x3,... is an increasing or decreasing sequence according as x1 is less than or greater than a, the positive root of the equation x2=x+k; and in either case x -a as n->ow. 11. If x+. =k/(l+n), and k and xl are positive, the sequence xl, x2, X3... is an increasing or decreasing sequence, according as xl is less than or greater than a, the positive root of the equation x2+x=k; and in either case X,W-a as n- - oo. 12. Suppose that f(x) is a positive and increasing function of x such that the equation x=f(x) has just one positive root a. Show, graphically. or otherwise, that if x1>0 and x,, + =f (x) then the sequence xl, 2,... has the limit a as nie-. Discuss the case in which tne equation x=f(x) has several positive roots. 13. If xl, x2 are positive and x, + 1 = (x + x-1) the sequences xl, X3, Xs,... and x2, xv4, Xa,... are one a decreasing and one an increasing sequence, and their common limit is I (a + 2a2). 14. Draw a graph of the function y defined by the equation x2n 'sin 1~rX +.v2 /=lim - ') ---c- o x2n + 1 (Math. Trip. 1901.) * A proof that lim {.- (n) - (n)} =0, and that therefore each function tends to the limit e, will be found in Chrystal's Algebra, vol. ii, p. 78. We shall however prove this in Ch. IX by a different method. t Exs. 9-13 are taken from Bromwich's Ifinite Series.

Page 156

156 MISCELLANEOUS EXAMPLES ON CHAPTER IV 15. The function y =lim ->~, 1 +n sin2 rx is equal to 0 except when x is an integer, and then equal to 1. The function = lim (x) + n (x)sin2 7r n-?o 1 + n sin2 7rX is equal to q (x) unless x is an integer, and then equal to + (x). 16. Show that the graph of the function y= lim ---------- + — Wn-*-oo,l -2 -is composed of parts of the graphs of ( (x) and +(x), together with (as a rule) one isolated point. Is y defined for (a) x= 1, (b) x= - 1, (c) x=0? 17. Prove that the function y, which is equal to 0 when x is rational and to 1 when x is irrational, may be represented in the form y= lim sgn {sin2 (i! 7rx)} where, as in Ex. XXXIII. 14, sgn z=- lim (2/rr) arc tan (nz). [If x is rational, sin2 (me! rx), and therefore sgn {sin2 (mn! rrx)} is equal to zero from a certain value of m onwards: if x is irrational, sin2(mn! rrx) is always positive, and so sgn {sin2 (l! 7rx)} is always equal to 1.] Prove that y may also be represented in the form 1 - lim [lim (cos (iz! 7rx) 2]. 18. Sum the series 1 C 1 1 i(v+)' (+)...(+) [Since I 1 (f 1 1 ) (.+ 1)...(v+k) =k (v +l)...(+k-l) 1) (v+l)(v+2)...(v+k ) ' 00 1 1 we find v (v+l).(v +]k) k -- 2... (n+l)(it+ 2)... (n+ ) and so i v(v+l)...(v+k) k(k!)-' L L X +; 19. If Ixj<la, -l -- l (,+-+ +... X-a a a a2 L L a+ a while if Ix\>\a, >n,- L l + + X- a. X.X 20. Expansion of (Ax+-B)l(ax2+2bx+c) in powers of x. Let a, / be the roots of ax2 + 2bx + c = 0, so that ax2 + 2bx + ca (x - a) (x - 3). It is easy to verify that (unless a==3) Ax+B _ (Aa+B A,3+B\ a.c + 2bx+c- a (a- 3) \ x-a x-3 j

Page 157

MISCELLANEOUS EXAMPLES ON CHAPTER IV 157 We shall suppose A, B, a, b, c all real. Then there are two cases, according as b2 ' ac. (1) If b2>ac, the roots a,!3 are real and distinct. If IxI is less than either f a I or I/ I we can expand l/(x - a) and 1/(x - 3) in ascending powers of x (Ex. 19). If Ix\ is greater than either la or 1/31 we must expand in descending powers of x; while if Ix I lies between a a and | 1 one fraction must be expanded in ascending and one in descending powers of x. The reader should write down the actual results. If IxI is equal to a j or Il3 no such expansion is possible. (2) If b2<ac the roots are conjugate complex numbers (Chap. III, ~ 34) and we can write a=p Cis O,;=p Cis(- ), where p2=a=c/a, pcos == (a+I)= -b/a, so that cos = ---J(b2/ac), sin = /{ - (b2/ac)}. If Ixl<p each fraction may be expanded in ascending powers of x. The coefficient of x' will be found to be {Ap sin nof + B sin (n + 1) )}/ap" + sin o. If Ixl>p we obtain a similar expansion in descending powers, while if Ixl=p no such expansion is possible. 21. Show that, if lx]<1, 1 +2x+3x2+...+(n + ) xn+... =1/(i -x)2. The sum to n terms is (1- - )2- x. 22. Expand L/( - a)2 in powers of x, ascending or descending according as Ixl<lal or Ixl>la!. 23. Show that if b2 =ac and I ax I < I b (Ax + B)/(ax2 + 2bx + c)= p xn, where pn —{(-a)/b +2}{(n+l1) aB- nbA}, and find the corresponding expansion, in descending powers of x, which holds when j ax I > b. 24. Verify the result of Ex. 20 in the case of the fraction 1/(1 +x2). [We have 1/(1 +X2)=x 2sin { (n+ 1) r}= 1- x2+4 -....] 25. Prove that, if x <1, 1/(1 + x+x2) =(2/%/3), Xn sin { (n+ 1) rr}. 0 26. Expand (l+x)/(l+x2), (1+x2)/(1+X3) and (l+x+A2)/(l+x4) in ascending powers of x. For what values of x do your results hold? 27. If a/(a + +cx2)=l1 + -p1x+p2x2+... then 1+9 i 9...-a+cx a2 a- ex a2 - (b2 - 2ac) + c22 ' (Math. Trip. 1900.)

Page 158

158 MISCELLANEOUS EXAMPLES ON CHAPTER IV 28. If lim s,,=l, then X —~-oo lim (sl+s2+... +sl)/n=l. [Let s,= l+t,,. Then we have to prove that (tl+t2+... +tn,)/n tends to zero if t, does so. We divide the numbers tl, t2,... t, into two sets tl, t2,..., tp and t,+ 1, t,+2,..., tl. Here we suppose that p is a function of n which tends to o with n, but more slowly than n, so that p- oc but p/nz-O: e.g. we might suppose p to be the integral part of,/l. Let e be any positive number. However small e may be, we can choose no so that tp +, tp+2,..., t are all numerically less than E, when n __no, and so |(tp ++ t+2+... +tn)/nI|<~E (n-p)/2.t<~. But, if A is the greatest of the nmoduli of all the numbers t1, t2,..., we have also (tl +t2+... + tl)/n <pA /n, and, if no is large enough, this will also be less than 'E when n2no, since p/n-O as n-oo. Thus, if no is large enough, l( t +2+... +t)/-z I l (- t1 2.. + t.P)l I + I (t... + t... )l < ', when n no: which proves the theorem. The reader, if he desires to become expert in dealing with questions about limits, should study the argument here given with great care. It is very often necessary, in proving the limit of some given expression to be zero, to split it into two parts which have to be proved to have the limit zero in slightly different ways. When this is the case the proof is never very easy. The point of the proof is this: we have to prove that (t+ t2 +... +t,)/n is small when n is large, the t's being small when their suffixes are large. We split up the terms in the bracket into two groups. The terms in the first group are not all small, but their number is small compared with n. The number in the second group is not small compared with n, but the terms are all small, and their number at any rate less than n, so that their sum is small compared with n. Hence each of the parts into which (tl+t +..*+t n)/ has been divided is small when n is large.] 29. If (n) - (- (n 1)- I as n-oo then also Q (n)/n -,l. [If we put < (n)=s1+s2+... +s,, we have (n)- p(n -l)= s,, and the theorem reduces to that proved in the last example.] 30. If s,,= 1 -( - 1)"}, so that s, is equal to 1 or 0 according as n is odd or even, then (sl1 + 2+... +sn)/n-~ as n-oo. [This example proves that the converse of 28 is not true: for s,, oscillates.as n-aoo.] 31. Let c,, sn denote the sums of the first n terms of the series +cos 9+cos2O+..., sin +sin 28+.... Prove that lim (C1+C2+... + C,)/n=O, lim (s +s2+... + s)/n= cot 28.

Page 159

CHAPTER V. LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE. CONTINUOUS AND DISCONTINUOUS FUNCTIONS. 77. Limits as x tends to oo. We shall now return to functions of a continuous real variable. We shall denote the typical such function by c (x). We suppose x to assume successively all values corresponding to points on our fundamental straight line L, starting from some definite point on the line and progressing always to the right. This variation of x, like the corresponding variation of n (Chap. IV, ~ 48), is often conveniently thought of as taking place in time. In these circumstances we say that x tends to oo, and write x —oo. The only difference between the 'tending of n to o ' discussed in the last chapter, and this 'tending of x to cc,' is that x varies through all values as it tends to cc, i.e. that the point P which corresponds to x coincides in turn with every point of L to the right of its initial position, whereas n tends to cc by a series of jumps. We can express this distinction by saying that x tends continuously to oo. As we explained at the beginning of the last chapter, there is a very close correspondence between functions of x and functions of n. Every function of n may be regarded as a selection from the values of a function of x. In the last chapter we discussed the peculiarities which may characterise the behaviour of a function b (in) as n tends to occ. Now we are concerned with the same problem for a function f (x): and the definitions and theorems to which we are led are practically repetitions of those of the last chapter. Thus corresponding to Def. 1 of ~ 51 we have: DEFINITION 1. The function 0,(x) is said to tend to the limit 1 as x tends to oc if, when any positive number e, however small, is

Page 160

160 LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE [V assigned, a value X can be chosen such that, for all values of x equal to or greater than X, p (x) differs from 1 by less than e, i.e. if I () -1\ < (_X ). When this is the case we may write lim b (x) = l, 4 (x)-l, artU-0 (-X oo) or, when there is no risk of ambiguity, simply linm (x) = 1, or (x)- l. Similarly we have: DEFINITION 2. The function q (x) is said to tend to + x with x if when any number G, however large, is assigned, we can choose X so that (x) > G (> GX). We then write (x)- + Co. Similarly we define ( (x) — o. Finally we have: DEFINITION 3. If the conditions of none of the two preceding definitions are satisfied b (x) is said to oscillate as x tends to oo. If, for all values of x, \I (x) ] is less than some constant K, q (x) is said to oscillate finitely; otherwise infinitely. The reader will remember that in the last chapter we considered very carefully various less formal ways of expressing the facts represented by the equations /(n)->l, b(n)- + o. Similar modes of expression may of course be used in the present case. Thus we may say that ( (x) is small or nearly equal to 1 or large when n is large, using the words 'small,' 'nearly,' 'large' in a sense precisely similar to that in which they were used in Ch. IV. Examples XXXV. 1. Consider the behaviour of the following functions as x-oO: (1/X), 1+(1/X), X2, x[k, -[C], [x-[], []r/{-[]}. The first four functions correspond exactly to functions of n fully discussed in Ch. IV. The graphs of the last three were constructed in Oh. II. (Exs. XVII.), and the reader will see at once that [x] — + oo, x - [x] oscillates finitely, and [x] +/{x - [x-]}-. + o. One simple remark may be inserted here. The function d (x)=x-[x] oscillates (between 0 and 1) as is obvious from the form of its graph. It is equal to zero whenever x is an integer, so that the function ( (n) derived from it is always zero and so tends to the limit zero. The same is true of ( () )=sin xr, ( (n) = sin nr= 0.

Page 161

77-79] LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE 161 In such cases as these, it is evident that q((x)- l or +(x)- >+o or - o involves the corresponding property for ( (n), but that the converse is hy no means true. 2. Consider in the same way the functions: cos xT, tan xr, (cosx7r)/x, (tan rx)/x, (1/x) cos xr, xcosx7r, x2cos xr, x COS2 r, (XcosS 7r)2, a cos2 xr + b sin2 Xr, (acos2 x + b sin2 Xr)/x, illustrating your remarks by means of the graphs of the functions*. 3. Give a geometrical explanation of Def. 1, analogous to the geometrical explanation of Ch. IV, ~ 52. 4. If +(Sx)-l, ( (x)cosxTr and O(x)sinxir oscillate finitely. If () (X)-3 + so (or - co ) they oscillate infinitely. The graph of either function is a wavy curve oscillating between the curves y = (x), y - (x). 5. Discuss the behaviour, as x —o, of the function y==f (x) cos2 x7r - F (x) sin2 X7r. The graph of y is a curve oscillating between the curves y =f(x), y=F(x). Consider in particular the cases (i) f(x)= + (1x), F(x)= 1-(1/x); (ii) f(x) = a+(a/.x), F(x)=b+(3/x), where a b; (iii) f(x)=l, F(x)=x; (iv) f(x)=-x, F(x)=x; (v) f(x)=sinx7r, F(x)=cosxr; (vi) f(x) =cos4 X7r +3 sin4 x7r, F (x)= 3 cos4 X7r + sin14 Xrr. 78. Limits as x tends to - oo. The reader will have no difficulty in finding for himself definitions of the meaning of the assertions 'x tends to - oc' (x --- -o ) and lim 4 (x) = 1, c (x) - oo (or -oo ). X - I 0 (2 i!- -- 0) In fact if x=-y and (x) = (-y) = (y), then x tends to - oo as y tends to o, and the question of the behaviour of (x) as x tends to - o is the same as that of the behaviour of 4r (y) as y tends to o. 79. Theorems corresponding to those of Ch. IV, ~5 56-63. The theorems concerning the sums, products, and quotients of functions, proved in Ch. IV, are all true (with the obvious verbal alterations which the reader will have no difficulty in supplying) for functions of the continuous variable x. Not only the enunciations but the proofs remain substantially the same. Ex. Draw up a table, with examples of each case, similar to the table on p. 130 in Ch. IV, i.e. to illustrate the different possibilities with regard to the behaviour of qp (x) +- (x) when the behaviour of (p (x) and (x) is^nown. The other tables of Ch. IV suggest similar examples. * The reader has probably already drawn graphs of some of the functions considered in Exs. 2, 4, 5, while engaged on Ch. II and in particular Exs. xvi. H. A. 11

Page 162

162 LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE [V 80. Steadily increasing or decreasing functions. The definition which corresponds to that of ~ 64 is as follows: the tfunction +((x) will be said to increase steadily with x if (x)) _ _(xo) whenever x, > x0. In many cases, of course, this condition is only satisfied from a definite value x==X onwards, i.e. when x, > xo-X. The theorem which follows requires no alteration but that of n into x: and the proof is also practically the same. The reader should consider whether or no the following functions increase steadily with x (or at any rate increase steadily from a certain value of x onwards): x2 -x, x+ sin x, x +2sin x, 2+ 2 sin x, [x], [x] + sin x, [x]+ ^,{x- [x]}. All these functions tend to + o with x. Ex. Show that if q (x) steadily increases (or decreases) as x —, then the behaviour of q (x) as x -oo is the same as that of q (n) as n-. 81. Limits as x tends to 0. Let c (x) be such a function of x that lim > (x) = I, and let y = 1/x. Then X-a00 +() = (l/y) = (y) say. As x tends to, y tends to the limit 0, and 4 (y) tends to the limit 1. Let us now dismiss x and consider fr (y) simply as a function of y. We are for the moment concerned only with those values of y which correspond to large positive values of x, that is to say with small positive values of y. And J (y) has the property that by making y sufficiently small we can make (y) differ by as little as we please from 1. To put the matter more precisely, the statement expressed by lim 4 (x) = meant that, when any positive number e, however small, was assigned, we could choose X so that i | (x)- 1 < e for all values of x greater than or equal to X. But this is the same thing as saying that we can choose n = 1/X so that I r (y) - < e for all positive values of y less than or equal to q. We are thus led to the following definitions. A. If when any positive number e, however small, is assigned we can choose 9' so that I (Y) - < -, for 0 < y 7, we say that b (y) tends to the limit 1 when y tends to 0 by positive values, and we write lim 0 (y) = 1. y/ ---+

Page 163

80-82] LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE 163 B. If when any number G, however large, is assigned we can choose 7 so that > (y)> G for 0< y:71, we say that + (y) tends to + 4c as y tends to 0 by positive values, and we write (y) -+ c. (y —t0) We define in a similar way the meaning of ' (y) tends to the limit I as y tends to 0 by negative values, or lim J (y)= l.' We y —o have in fact only to alter 0<y r to -- <y<O in A. The reader will find it a useful exercise to write out formal definitions of the statements expressed by ' (y)- +, ( _, (Y)y -- _ /. (y- - 0) (y2-+0) J — 0) If lim b (y) = l and lim r (y) =, we write simply (y-,+o) (y- - o) lim b (y) = 1. (y-This case is so important that it is worth while to give a) This case is so important that it is worth while to give a formal definition. If when any positive number e, however small, is assigned we can choose v so that, for all values of y different from zero but numerically less than or equal to 'q, 0 (y) differs from I by less than e, we say that (y) tends to the limit I as y tends to 0, and write lim q (y)= 1. So also if 4 (y) + oc and ( (y)- + oc we write 0(y)-+ oo. (y-+o) (y2- -) 0) - Similarly we define the statement ( (y)- - c. Finally, if b((y) (y-*o0) does not tend to a limit, or to + oo, or to - o, as y —0, we say that b(y) oscillates as y-0O, finitely or infinitely as the case may be. The preceding definitions have been stated in terms of a variable denoted by y: what letter is used is of course immaterial, and we may suppose x written instead of y throughout them. 82. Limits as x tends to a. Suppose that lim b (y) = and write y = x - a, 0 (y) = (x - a) =, (x). 11-2

Page 164

164 LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE [V As y-O, x-P-a and r (x)-l, and we are naturally led to write lim, (x ( x) = l, (x)x —a x —a or simply lim J (x) = I or r (x) —l, and to say that r (x) tends to the limit I as x tends to a. The meaning of this equation may be directly and formally defined as follows: if, given e, we can always determine r so that I (X)- 1 <e for all values of x such that 0 <I x-a \ -, then lim r (x)= 1. X —3-a In other words, given any positive number e we can find another Vq such that if x is different from a, but its difference from a is less than V, < (x) will differ from I by less than e. By restricting ourselves to values of x greater than a, i.e. by replacing 0 < x- a i-r by a < x a~ + /, we define ' (x) tends to I when x approaches a from the right'; which we may write as lim = 1. (x-ta+0O) Similarly we define lim (x) = 1. x-B-a- Thus lim f (x) = 1 is equivalent to the two assertions x- a+O x-Ja - 0 And we can give similar definitions referring to the cases in which +((x) —+ oo (or - oo ) as x- -a through values greater (or less) than a; but it is probably unnecessary to dwell further on these definitions, since they are exactly similar to those stated above in the special case when a = 0, and since we can always discuss the behaviour of + (x) as x-a by putting x=y+ a and supposing that y-0O. Examples XXXVI. 1. If ~ (X) —b, (+ (x)-c, x-ba x-Aca then q (x) +_#(x)- b +c, (x) (x)-bc, and (xu)/ (x)-b/c, unless in the last case c= O. [We saw in ~ 79 that the theorems of Ch. IV, ~~ 56 et seq. held also for functions of x when x-boo (or -oo). By putting x=l/y we may extend

Page 165

82] LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE 165 them to functions of y, when y —0, and by putting y =x+a to functions of x, when x —a. The reader should however try to prove them directly from the formal definition given above. Thus, in order to obtain a strict direct proof of the first result he need only take the proof of Theorem I. in Ch. IV. and write throughout x for n, a for oo and 0<\x-a lf instead of nzn0o.] 2. If mn is a positive integer x' —O as x —0O. 3. If m is a negative integer xm-+oo as x-.+0, while x'-$-oo or + oo as x- - 0, according as m is odd or even. If mn=0, x'=1 and xn-al. 4. lim (a + bx + cx2 +... + kxm)=- a. 5. lim {(a+bx+... +kxm)/(a+Bx+... +cx')}=a/a unless a=O. If a=0 x-0O the function tends to + c or - co, as x- + 0, according as a, 3 have like or unlike signs; the case is reversed if x — -0. 6. lim rn = =a", if mn is any positive or negative integer. x-H-a [If n>0, put x=y+a and apply Ex. 4. When m<O the result follows from the theorem concerning 1/+ (x).- There is one exceptional case, viz. when a= 0 and m is negative. It follows at once that if P(x) is any polynomial, lim P (x)=P(a).] 7. lim R (x)=R (a), if R denotes any rational function and a is not one x-,a of the roots of its denominator. 8. Prove that if x and a are positive and unequal, and m is any rational number greater than 1, m'r- 1(x - a)> xG~ - a7n>nam-1 (x - a); while if 0<qn<1 signs of the inequalities must be reversed. [Suppose first that a= 1 and let mn =p/q. It follows from the inequality (3) of ~ 67 that, if $ is any number greater than unity, 6P- 1 ~ (p/q)( - l ) according as p > q, and it is easy to see, by similar reasoning, that the result remains true if 0<<<1, though both sides of the inequality are then negative. Writing xl/q for 6 and m for p/q we obtain - 1 m (x- 1)................................ (1), according as m < 1. If now we replace x by l/x, and multiply by -_m, we obtain mx 1(x - 1) -............................. (2). From (1) and (2) the result follows in the case of a= 1. The proof may now be completed by writing x/a for x.] 9. Show that the inequality stated in Ex. 8 holds also if mz is negative. Obtain corresponding inequalities when x and a are both negative. [See Chrystal's Algebra, vol. ii, pp. 43-45.]

Page 166

166 LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE [V 10. Deduce from Ex. 8 that lirm x"= a' for all rational values of m, x-da except when a= 0 and m is negative. [This follows at once, when a is positive, from the inequalities of Ex. 8. For xV'- ajI <HjIx- al where H is the greater of the absolute values of mxm-1 and mna"-l1. If a is negative we write x=-y and a=-b. Then lim x"2=linm (-1)my1' = ( - l)m bm= an.] 83. The reader will probably fail to see at first that any proof of such results as those of Exs. 4,, 5 6, 7, 10 above is necessary. He may ask 'why not simply put x = 0 (or x =a)? Of course then we get a, a/a, a"o, P (a), R (a).' It is very important that he should see exactly where he is wrong. We shall therefore consider this point carefully before passing on to any further examples. The statement lim fb (x)= I is a statement about the values of + (x) when x has any value distinct from but differing by little from 0*. It is not a statement about the value of + (x) when x= O. When we 'make the statement we assert that, when x is nearly equal to 0, b (x) is nearly equal to 1. We assert nothing whatever about what happens when x is actually equal to 0. So far as we know or care c (x) may not be defined at all for x = 0; or it may have some value other than 1. For example, consider the function defined for all values of x by the equation (x) = 0. It is obvious that lim ( ) = 0................... (1). Now consider the function f (x) which differs from o (x) only in that, when x = 0, (x) = 1. In this case too lim +(x)= 0...................... (2), for, when x is nearly equal to 0, (x) is not only nearly but exactly equal to 0. But r(0)= 1. The graph of this function consists of the axis of x, with the point x = 0 left out, and one isolated point, viz. the point (0, 1). The equation (2) expresses the fact that if we move along the graph towards the axis of y, from either side, the ordinate of the curve tends to the limiting * Thus in Def. A (~ 81) we make a statement about values of y such that 0<IY Iv; the first of these inequalities being inserted expressly in order to exclude the value y=O.

Page 167

83] LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE 167 value 0: this is obvious from the fact that the ordinate is equal to 0 all the time. This fact is in no way affected by the position of the isolated point (0, 1). If the reader objects to this example on the score of artificiality it is easy to give him simple formulae representing functions which behave precisely like this near x = 0. One is, (x) = t = - X2] where [1- X2] denotes as usual the algebraically greatest integer contained in 1 - x2. For if x=O, (x)=[1] = 1; while if < x <1 or - < x< 0, 0 < 1 -2 < 1 and so r(x) = [1 - 2]= 0. Or again, let us consider the function y = x/X already discussed in Ch. II (~ 14, (2)). This function is equal to 1 for all values of x save x=0. It is not equal to 1 when x = 0: it is in fact not defined at all for x = 0. For when we say that q (x) is defined for x =0 we mean (as we explained in Ch. IT, I.c.) that we can calculate its value for x=0 by putting x=0 in the actual expression of b (x). In this case we cannot. When we put x= 0 in q (x) we obtain 0/0, which is a meaningless expression. The reader may object 'divide numerator and denominator by x.' But he must admit that when x=0 this is impossible. Thus y = xx is a function which differs from y = 1 solely in that it is not defined for x = 0. None the less lim (x/x)= 1 for x/x is equal to 1 so long as x differs from 0, however small the diference may be. Similarly (x) = {(x+ 1)2- 1}/ = + 2 so long as x is not equal to 0, but is undefined when x = 0. None the less limn (x)= 2. On the other hand there is of course nothing to prevent the limit of 0b(x) as x tends to 0 from being equal to b(0), the value of b (x) for x=0. Thus if sb(x)=x, (0) =0 and lim (x)=0. This is in fact, from a practical point of view, i.e. from the point of view of what most frequently occurs in applications, the ordinary case.

Page 168

168 LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE Examples XXXVII. 1. lim (x2 -a2)/(x - a)= 2a. 2. lim (xmn - a'n)/(x - a) = na"'-1, if rn is any integer (zero excluded). 3. Consider the case in which a>O and mn=-. [Use the equation (VJx-J/a)/(x-a)=l/(V/x+J+a): the same method may be applied to extend the result of Ex. 2 to the case in which rn is the reciprocal of any positive integer.] 4. Show that the result of Ex. 2 remains true for all rational values of m, provided a is positive. [If x=y3, a=bq, then, by Ex. 3, y — b as x —a, and lim = lim - lim x-z x - a y —b YQ - bq yb y-b I \Y- b =pb- ll/qbq-1 = (p/q) a(P/q)-l If q is odd the restriction that a is positive is unnecessary. The reader should also deduce the result directly from the inequalities given in Ex. xxxvI. 8.] 5. lim (x - 2x + 1)/(x3 - 3x2+ 2) = 1. [Divide numerator and denominator by x-1.] 6. Discuss the behaviour of ~b (x) = (aoxm + al m + n+... + ak+ k)l(bon + bl xt + 1... + bl xn + 1) as x tends to 0 by positive or negative values. [If m >n, lim q (x) =0. If n = n, lim q5 (x) = ao/bo. If mn<n and n-nw is even 5 (x)- oo or q (x)-.- co according as ao/bo > or < O0. If m <n and n-m is odd + (x) — +>,?1(x) -.-c- or q(xZ) -c-, (x) -+oo ac(x~+o) (X-) - (+O ) (X- - o) 0) cording as ao/bo > or < 0.] 7. Orders of smallness. When x is small x2 is very much smaller, X3 much smaller still, and so on: in other words lim (x2/x) = O, lim (3/x2)=.... Another way of stating the matter is to say that when x tends to 0, x2, x3,... all also tend to 0, but x2 tends to 0 much more rapidly than x, x3 than x2, and so on. It is convenient to have some scale by which to measure the rapidity with which a function, whose limit, as x tends to 0, is 0, diminishes with x, and it is natural to take the simple functions x, x2, x3,... as the measures of our scale. We say, therefore, that + (x) is of the first order of smallness if / (x)/x tends to a limit other than zero as x tends to 0. Thus 2x+3X2+x7 is of the first order of smallness, since lim (2x+ 32 +X7)/ = 2. Similarly we define the second, third, fourth,... orders of smallness. It must not be imagined that this scale of orders of smallness is in any way complete. If it were complete every function q (x) which tends to zero with x would be of either the first or second or some higher order of smallness.

Page 169

83] LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE 169 This is obviously not the case. For example ( (x) =x7/3 tends to zero more rapidly than x2 and less rapidly than x3. The reader may not unnaturally think that our scale might be made complete by including in it fractional orders of smallness. Thus we might say that x7/3 was of the 7th order of smallness. We shall however see later on that such a scale of orders would still be altogether incomplete. And as a matter of fact the integral orders of smallness defined above are so much more important in applications than any others that it is hardly necessary to attempt to make our definitions more precise. Orders of greatness. Similar definitions are at once suggested to meet the case in which O(x) is large (positively or negatively) when x is small. We shall say that (q(x) is of the kth order of greatness when x is small if 4(x)/x-k=x ( (x) tends to a limit different from zero as x tends to 0. These definitions have reference to the case in which x- O. There are of course corresponding definitions relating to the cases in which x —oo or a. Thus if xk (5 (x) tends to a limit other than zero, as x -- o, we say that q (x) is of the kth order of smallness when x is large: while if (x - a)k ( (x) tends to a limit other than zero as x- a, we say that ( (x) is of the kth order of largeness when x is nearly equal to a. *8. lim/(l+x)=lim/(l-x)=l. [Put 1+x (or 1-x)=?/, and use Ex. xxxvI. 10.] 9. lim {V/(1+x) - /(1 - )}/x=. [Multiply numerator and denominator by /(1 + x) + VJ( - ).] 10. Consider the behaviour of {,/(1 + xn) - ^(1 - x')}/.x as x- 0, m and n being positive integers. 11. lim {/(l + x + x2) - l}/x= -. 12. lim {/(1 +x)-,/( +2)}/{/(1 - 2) -. '( - )}= 1. 13. Draw a graph of the function {- 1 1 + +/+ +- 1 Has it a limit as x —O [.y=l except for = 1,,, 1,,when y is not defined, and y-l1 as.-O.] 14. lim (sin x)x = 1. [It is proved in books on elementary Trigonometry (it follows in fact from the definitions of the trigonometrical ratios and of circular measure)+ that if x is positive and less than ~T sin x<< tan x * In the examples which follow it is to be assumed that limits as x —0 are required, unless (as in Exs. 23, 27) the contrary is explicitly stated. t The definition of circular measure presupposes that any arc of a circle has associated with it a definite number called its length, and the proof of the inequalities which follow depends on certain properties of the 'length' which are taken to be geometrically intuitive, as that the length of the arc is greater than that of the chord of the arc. These questions will be discussed more fully in Ch. VII.

Page 170

170 LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE [v or cos x<(sin x)/x< or 0<1 -(sin x)/x< 1 -cos x = 2 sin2 ~x. But 2sin2gx<2 (x)2< x2. It follows at once that lim { -(sinx)/x}=0, and lim (sin x)/x=O. As (sin x)/x is an even function the result follows. x — +0 We note that 1 - (sin x)lx tends to 0 with x at least as rapidly as x2.] 15. lim(1-cosx)/x2=. 16. lim(sin ax)x =a. Is this true if a=0? 17. lim (1-cosax)/x2= a2. 18. In the figure OP=1, A(P=arcAP=x, PLV=sinx, ON=cosx, PT=tanx, OT=secx. Show that when x is small PTY, PA, PT are of the first order of smallness, VA and AT of the second, and the triangle PNT of the third. P 0 NAT Fic. 37. 19. lim (arc sin x)/x = 1. [Put x = sin y.] 20. lim (tan ax)lx =a, lim (arc tan ax)/x = a. 21. Show that (sin mx)/(sin nx), (tan nmx)/(tan nx), (arc sin mx)/(arc sin ax), (arc tan nx)/(arc tan nx), (sin mx)/(tan nx) and (arc tan mnx)/(arc sin nx) all tend to the limit m/n as x' —O. 22. lim (cosecx-cotx)/x=-. 23. lim ( +cos 7rx)/tan2 r=- (as x-). 24. How do the functions sin (1/x), (l/x)sin (/x), x sin(l/x) behave as x-0? [The first oscillates finitely, the second infinitely, the third tends to the limit 0. None is defined for x=0. See Exs. xvi.] 25. Does the function = (sin -/(sin ) tend to a limit as x tends to 0? [No. The function is equal to 1 except when sin(1/x)=0; i.e. when x=l//r, 1/27T,..., -'1/T, - 1/27r,.... For these values the expressionof y assumes the meaningless form 0/0, and y is therefore not defined for an infinity of values of x near x=0.] 26. Prove that lim [x]=0, lim [x]=-1. x —J +0 x ---0 27. Prove that, if im is any integer, [x]mn and x- [x] 0 as x — 2 +0, and [x]-m - 1, x - [x]L- as x-nmz-0. What are the corresponding limits for the functions /{x - [x]}, (x - [])2, [x] + Jx -- [x]}, [X] + ( - [x])2?

Page 171

84] LIMITS OF FUNCTIONS OF A CONTINUOUS VARIABLE 171 28. Find the limits of [x2] and [1 - 2] as x — + 0 and x- -0. 29. Functions which steadily increase or decrease as x- a +0. If there is a segment to the right of a, say (a, a + 8), such that ( (x") > p (x') if a<a+x"<a+x'<a+8, (x) is said to increase steadily as x- a+O. Similarly we define what is meant by saying that f(xc) increases steadily as x- a-0, and there are corresponding definitions for decreasing functions. Show (cf. ~~ 64, 80) that if + (x) steadily increases as x —a+0, then q (x) tends to a limit or tends to +oo as x-ca+0. State the corresponding theorems for the other cases. Do the functions considered in Exs. 26-28 steadily increase or decrease as x —a+O, or x —a-O (where a=0 or m as the case may be)? 30. Show that (sinx)/x steadily increases as x->+0 or x ---0. [Use the equation sinx sin (x +h) xsin x (1- cos)+cos x (htan x - xsinA) ] x x+h x(x+h) 84. Continuous functions of a real variable. The reader has no doubt an idea as to what is meant by a continuous curve. Thus he would call the curve C in Fig. 38 continuous, the curve C' discontinuous. Either of these curves may be regarded as the graph of a function + (x). It is natural to call a function continuous if its Y l 0 I",FIG. 38. FIG. 38. graph is a continuous curve, otherwise discontinuous. Let us take this as a provisional definition and try to distinguish more precisely some of the properties which are involved in it. In the first place it is evident that the property of the function y = G (x) of which C is the graph may be analysed into some property possessed by the curve at each of its points. To be able to define continuity for all values of x we must first

Page 172

172 CONTINUOUS AND DISCONTINUOUS FUNCTIONS [V define continuity for any particular value of x. Let us therefore fix on some particular value of x, say the value x= corresponding to the point P of the graph. What are the characteristic properties of ( (x) associated with this value of x? In the first place (b (x) is defined for x = 4. This is obviously essential. If ( (4) were not defined there would be a point missing from the curve. Secondly (> (x) is defined for all values of x near x =; i.e. we can find an interval, including x: in its interior, for all points of which ( (x) is defined. Thirdly if x approaches the value 4 from either side (x) approaches the limit ( (4). The properties thus defined are far from exhausting those which are possessed by the curve as pictured by the eye of common sense. This picture of a curve is a generalisation from particular curves such as straight lines and circles. But they are the simplest and most fundamental properties: and the graph of any function which has these properties would, so far as drawing it is practically possible, satisfy our geometrical intuition of what a continuous curve should be. We therefore select these properties as embodying the mathematical notion of continuity. We are thus led to the following DEFINITION. The function ( (x) is said to be continuous for x= 4 if it tends to a limit as x tends to 4 from either side, and each of these limits is equal to +((). We can now define continuity throughout an interval. The function ( (x) is said to be continuous throughout a certain interval of values of x if it is continuous for all values of x in that interval. It is said to be continuous everywhere if it is continuous for every value of x. Thus the function whose graph is C' is continuous for every value of x except x= ' and x= ". It is continuous in the interval (4i, 2). If we recur to the definitions of a limit we see that our definition is equivalent to ' ( (x) is continuous for x = 4 if, given e, we can choose q so that I (x)- ()j[< e if 0O1 x - 4.' 85. This form of the definition may be illustrated geometrically as follows. Draw the two horizontal lines y = + (t) + e (Fig.

Page 173

84, 85] CONTINUOUS AND DISCONTINUOUS FUNCTIONS 173 39). Then ]0 (x) - () <e expresses the fact that the point on the curve corresponding to x lies between these two lines. Y._ --- —-....... ------—...: --- —-----. y= (V ) +e ---------- ---- — y- = () - e o {0- } + X FIG. 39. Similarly Ix-\ 1: - expresses the fact that x lies in the interval (- q, ~ +v). Thus our definition asserts that if we draw two such horizontal lines, no matter how close together, we can always cut off a vertical strip of the plane by two vertical lines in such a way that all that part of the curve which is contained in the strip lies between the two horizontal lines. This is evidently true of the curve C (Fig. 38), whatever value 4 may have. We shall now discuss the continuity of some special types of functions. Some of the results which follow were (as was pointed out) tacitly assumed in Ch. II. Examples XXXVIII. 1. The sum or product of two functions continuous at a point is continuous at that point. The quotient is also continuous unless the denominator vanishes at the point. [This follows at once from Ex. xxxvI. 1.] 2. Any polynomial is continuous for all values of x. Any rational fraction is continuous except for values of x for which the denominator vanishes. [This follows from Exs. xxxvI. 6, 7.] 3. J/x is continuous for all positive values of x (Ex. xxxvI. 10). It is not continuous for x=0, although it is equal to 0 when x=0 and also tends to the limit 0 as x tends to 0 by positive values. In fact /x —.0 as x-,+0, but not as x-.-0, since ^/x is not defined for negative values of x; and both conditions are wanted for continuity. Similarly for xnm/n where n, n are any positive integers and n is even. 4. Discuss the continuity or discontinuity of x'/'", where n is odd, for x=O. 5. 1/x is not continuous for x=0. It has no value for x=0, nor does it tend to a limit as x-.0. In fact l/x- +co or - oo according as x- 0 by positive or negative values.

Page 174

174 CONTINUOUS AND DISCONTINUOUS FUNCTIONS 6. Discuss the continuity of x-1/l, where m and n are positive integers, for x = 0. 7. The standard rational function R(x)=-P (x(x)(x) is discontinuous for x=a, where a is any root of Q(x)= 0. Thus (x2+l)/(.x2-3x+2) is discontinuous for x=l. It will be noticed that in the case of these functions a discontinuity is always associated with (a) a failure of the definition for a particular value of x and (b) a tending of the function to + oo or -oo as x approaches this value from either side. Such a particular kind of point of discontinuity is usually described as an infinity of the function. 'Infinities' are far from being the only possible kind of points of discontinuity; but they are the kind of most common occurrence in ordinary work. 8. Discuss the continuity of (x-a)(b- x)}, /{(x- a)(b- )}, {(x- a)/(b -x)}, /{( -a)/l(b -)}. 9. sin x, cos x are continuous for all values of x. [For sin (x + h) - sin x = 2 sin ~h cos (x + ~h) which is numerically less than the numerical value of h. Similarly for cos X.] 10. For what values of x are tanx, cotx, secx, cosecx continuous or discontinuous? 11. If f(y) is continuous for y=rX, and 4) (x) is a continuous function of x, which is equal to j when x = -, then f{/ (x)} is continuous for x-. 12. sin (sin x) is continuous for all values of x. 13. If (h (x) is continuous for any particular value of x, any polynomial in q( (x), such as a {0 ()}m +..., is so too. 14. a cos2 x b sin2 x is continuous for all values of x. 15. Discuss the continuity of l/(acos2x+ bsin2 X), /(2+cosx),,/( +sinx), 1/^/(l+sinx). 16. sin (l/x), xsin(l/x), x2 sin (l/x) are continuous except for x=0. [Apply Exs. 9 and 11.] 17. The function which is equal to x sin (l/x) except when x=0, and to zero when x =0, is continuous for all values of x. 18. x], x- [x] are discontinuous for all integral values of x. 19. For what (if any) values of x are the following functions discontinuous: [x2], [ax], (x- [x])2, (x - [X]), [] + ( - [x]), [2x], [] + [2x], [] + [- X? 20. Classification of discontinuities. Some of the preceding examples suggest a classification of different types of discontinuity. (1) Suppose that () x) tends to a limit as x —a either by values greater than or by values less than a. Denote these limits by G and L respectively. Then, for continuity, it is necessary that 4 (x) should be defined for x=a, and that G=L=((a). I)iscontinuity may arise in a variety of ways.

Page 175

86] CONTINUOUS AND DISCONTINUOUS FUNCTIONS 175 (a) G may be equal to L, but 4 (a) may not be defined, or may differ from G and L. Thus if Q (x)=xsin (l/x) and a=0, G=L=0, but 0 (x) is not defined for x=0. Or if ~ (.x)= [1 -X2] and a=O, G=L=O, but 4)(0)=1. (/) G and L may be unequal. In this case 0 (a) may be equal to one or to neither, or be undefined. The first case is illustrated by (x) =[x], when (for a=0) G=O= =(0), L=-l: the second by ) ()=[x]-[- x], when G= 1, L= -1, 4 (0) =0, and the third by ( (x) =[x]+xsin (1/x), when G - 0, L= -, and 4f (0) is undefined. In any of these cases we say that ( (x) has a simple discontinuity at = a. And to these cases we may add those in which one of G or L exists, but ) (x) is only defined on one side of x =a, so that there is no question of the existence of the other limit. (2) It may be the case that only one (or neither) of G and L exists, but that (supposing for example G not to exist) (x)-,- +c or - oo as x-a-a+0: so that q5 (x) tends to a limit or to + oo or - oo as x approaches a from either side. Such is the case, for instance, if q (x)= l/x or ljx2, and a=0. In such cases we say (cf. Ex. 7) that x=a is an infinity of q (x). And again we may add to these cases those in which q (x)-+ oo or -oo as x —a from one side, but is not defined at all on the other side of x=a. (3) Any point of discontinuity which is not a point of simple discontinuity nor an infinity is called a point of oscillatory discontinuity. Such is the point x=0 for the functions sin(l/x), (l/x) sin (l/x). 21. What is the nature of the discontinuities at x=0 of the functions: (sin x)/x, (1-cos x)/x2, Jvx, [x]+[-x], cosec x, ^/(cosec ), cosec (l/x), sin (1/x)/sin (l/)? 22. The function which is equal to 1 when x is rational and to 0 when x is irrational (Ch. IT, Ex. xvi. 11) is discontinuous for all values of x. So too is any function which is only defined for rational or for irrational values of x. 23. The function which is equal to x when x is irrational and to /{(1 +p2)/(l +q2)} when x is a rational fraction p/q (Ch. II, Ex. xvIr. 12) is discontinuous for all negative and for positive rational values of x, but continuous for positive irrational values. [This is not very obvious, and if the reader can see it he may be sure that he understands the nature of continuity and discontinuity.] 24. For what points are the functions considered in Ch. IV, Exs. xxxIII. discontinuous, and what is the nature of their discontinuities? [Consider, e.g., the function y= lim,n (Ex. 5). Here y is only defined when - 1 <x 1: it is equal to 0 for -l<x<l and to 1 for x=I. The points x= +l are points of simple discontinuity.] 86. The fundamental property of a continuous function. It may perhaps be thought that the way in which we stated (~ 84

Page 176

176 CONTINUOUS AND DISCONTINUOUS FUNCTIONS [V above) the property which a function must have, in order that its graph may satisfy our common sense notion of what a continuous curve should be, is not the simplest or most natural way possible. Another method of analysing our idea of continuity is the following. Let A and B be two points, whose coordinates are x0, b (x0) and xi, 4 (x,) respectively, on the graph of + (x). Draw any straight line L which passes between A and B. Then common sense certainly declares that if the graph of + (x) is continuous it must cut L. If we consider this property as an intrinsic geometrical property of continuous curves it is clear that there is no real loss of generality in supposing L to be parallel to the axis of x. In this case the ordinates of A and B cannot be equal: let us suppose, for definiteness, that S (xi) > q (xo). And let L be the line y = -, where ( (x,) < < ( (xi). Then to say that the graph of + (x) must cut L is the same thing as to say that there is a value of x between x0 and xi for which ( (x)= =. We conclude then that a continuous function k (x) must possess the following property: if (x0o)=y, c (X1)=yi, and y, < r < yl, there is a value of x between Xo and xa for which (x)=x. In other words as x varies from xo to x,, y must assume at least once every value between yo and y,. We shall now prove that if b (x) is a continuous function of x in the sense defined in ~ 84 it does in fact possess this property. There is a certain range of values of x to the right of x,, for which ( (x) < x. For p (xo) < V, and so b (x) is certainly less than 7 if ) (x) - ( (xo) is numerically less than r - - (x0). But since ( (x) is continuous for x = x0 this condition is certainly satisfied if x is near enough to x0. Similarly there is a certain range of values to the left of x1 for which ( (S) > >. We can now easily prove our theorem by a reductio ad absurdum. For suppose that there is no value of x between x, and xi for which b (x) =v. Then for every x in the interval (x0, x1) 0 (x) is either greater or less than I. Let us divide the values of x between x, and x1 into two classes T, U as follows:

Page 177

86, 87] CONTINUOUS AND DISCONTINUOUS FUNCTIONS 177 (1) in the class T we put all values: of x such that 0 (x) <? for x=: and for all values of x between x, and:; (2) in the class U we put all the other values of x, i.e. all values s such that either ( () >7 or there is a value of x between x0 and i for which q (x). Then it is evident that these two classes are related like the classes T and U of Ch. I, ~ 5, i.e. that the points of the class U lie entirely to the right of those of the class T, and that there is a point P which divides the two classes. Now, ex hypothesi, if to is the abscissa of P, b (0) == r. First suppose b (0)>) > V, so that 0o belongs to the upper class: and let (f0o) = Vr + k say. Then if P' (x = ~') is any point to the left of P, no matter how close, < (') < q and so <(o)- (g)>k, which directly contradicts the condition of continuity at P. Next suppose b (0) = -k < a. Then if P'(x= ') is any point to the right of P, no matter how close, either 9 (,')> or we can find another point P" (x=:") between P and P' and such that b (I") >?7. In any case we can find a point as near to P as we please and such that the corresponding values of b (x) differ by more than k. And this again directly contradicts the hypothesis that ( (x) is continuous at P. Hence the hypothesis that < (x) is nowhere equal to y is untenable, and the theorem is established. The fact is, of course, that b (go) must be equal to r. 87. It is easy to see that the converse of the theorem just proved is not true. Thus such a function as the function (x) whose graph is represented by Fig. 40 obviously assumes at least once every value between q (xo) and p (x,): yet + (x) is obviously discontinuous. Indeed it is not even true that + (x) must be continuous when it assumes each value once and once only. Thus let fb(x) be defined as follows from x=O to x= 1. If x=O let b(x))=0; if 0< x< let b(x)= - x; if x= let (x) =; if <x< <1 let sb(x)= 3-x; and if x = 1 let q (x)=l. The graph of the function is shown in Fig. 41; it includes the points 0,,, F but not the points A, B, D, E. It is clear that, as x varies from H. A. 12

Page 178

178 CONTINUOUS AND DISCONTINUOUS FUNCTIONS [V 0 to 1, s (x) assumes once and once only every value between (0) = 0 and 0 (1) = 1; but S (x) is discontinuous for x= 0, 2, 1. D v/ A _I I X0 Xl OF II I 0 B 1 FIG. 40. FIG. 41. As a matter of fact, however, the curves which usually occur in elementary mathematics are all composed of a finite number of pieces along which y always varies in the same direction. It is easy to show that if y = b (x) always varies in the same direction (i.e. steadily increases or decreases) as x varies from x0 to x1, the two notions of continuity are really equivalent, i.e. that if b (x) takes every value between b(x0) and (x1) it must be a continuous function in the sense of ~ 84. For let: be any value of x between x0 and x,. As x-A through values less than ~, +b(x) tends to a limit 1. Similarly as x-A through values greater than: it tends to a limit 1'. Moreover I < b (:) l1', and the function will be continuous for x= if and only if l=<(~)='. But if either of these equations is untrue, say the first, it is evident that +(x) never assumes any value which lies between I and ~) (M), which is contrary to our assumption. Thus > (x) must be continuous. The net result of this and the last section is consequently to show that our commonsense notion of what we mean by continuity is substantially accurate, and capable of precise statement in mathematical terms. 88. Inverse functions. The reader is already familiar with some examples of inverse functions: thus arc sinx and arc tanx are the functions inverse to sin x and tan x. Generally, if y = +(x)

Page 179

87, 88] CONTINUOUS AND DISCONTINUOUS FUNCTIONS 179 and x can be expressed in terms of y in the form x = f (y), we call *r the function inverse of b. Thus if y =x2, x = +Vy. It will be observed in this case that the inverse function differs fundamentally from the original function in two respects: it is not defined for all values of y, and, when it is defined, it has two values. Similarly if y = sin x, x = arc sin y, x is only defined when -1 _y 1, and then has an infinity of values. Let us suppose now, however, that (x) is a function which steadily increases or decreases (suppose the former) as x varies from a to b, and let b (a) = a, b (b) = /. If < (x) is continuous, it assumes every value between a and /, and conversely: we shall suppose this to be the case. These suppositions, however, are not enough to ensure that 0(x) assumes each such value once and only once. In order to ensure this we must exclude the possibility of + (x) remaining stationary for any part of the time during which x varies from a to b. We can do this by supposing that a - xI < x"I = b involves b (x") > (x'), and not merely { (Z") - b (x'), as we supposed in defining an increasing function in ~ 80. A function which satisfies this condition we may call an increasing function in the stricter sense, as opposed to the increasing functions in the wider sense with which we have hitherto been concerned. Then, as x varies from a to b, (x) varies from a to /, assuming each value between a and /3 once and once only. Thus to a value of y between a and /3 corresponds one and only one value of x between a and b. And if we write x = r(y), +(y) is a function of y which has just one value for any value of y between a and 3. Moreover it is evident that t (y) increases steadily as y increases from a to 3, assuming in turn, once and only once, each value between a and b. Finally, by ~87, (y) is continuous throughout the interval (a, /). Thus if y = (x) is a function of x which, throughout the interval (a, b), is one-valued, continuous, and increasing in the stricter sense, then x = - (y) is a function of y which has the same properties throughout the corresponding interval of values of y. Examples XXXIX. 1. As x increases from - 7r to +rn-, y=sinx is continuous and steadily increases, in the stricter sense, from -1 to +1. Hence x= arc siny is a continuous and steadily increasing function of y from y= -1 to y=+ 1. Here arc siny denotes the value of the inverse sine which lies between - tr and +-r.

Page 180

180 CONTINUOUS AND DISCONTINUOUS FUNCTIONS [V 2. Apply similar reasoning to the function arc cosy. 3. Show that the numerically least value of arc tany is a function of y continuous for all values of y, and increasing steadily from -- ~r to + ~7 as y varies through all real values positive and negative. 89. The range of values of a continuous function*. In this paragraph we shall state and prove some general theorems concerning continuous functions. These theorems are of a very abstract character, and the reader may well be tempted to regard them as obvious: but general theorems, as he probably realises by now, are apt to be a good deal less obvious than they seem. Let us consider a function ) (x) about which we shall only assume at present that it is defined for every value of x in an interval (a, b). It may be possible to assign a number G such that +(x)- G for all these values of x, i.e. such that the value of + (x) cannot be greater than G, or, as we may say, 4 (x) cannot surpass G. In this case we shall say that (x) is limited above. If G can be assigned at all it can be assigned in an infinity of ways: for if b) (x) cannot surpass G it certainly cannot surpass any number greater than G. But there is a least number G which (x) cannot surpass. For if we divide the aggregate of real numbers into two classes T, U composed respectively of the numbers which b (x) can and cannot surpass, it is clear that when T and U are represented along the line L, as in ~ 5, T lies entirely to the left of U. As in ~ 5, there is a point P which divides the two classes. But in this case, since the two classes between them include all real numbers, the coordinate M of P must belong to one class or the other. And it must belong to U. For if it belonged to T, 4 (x) could assume a value greater than M, say M+8: and then all the numbers between M and M 1+8 could also be surpassed; so that there would be points of T to the right of P, which is not the case. Thus M is the least number which 4 (x) cannot surpass: we call M the upper limit of 4 (x) in the interval (a, b). Similarly it may be possible to find a number which - ) (x) cannot surpass; and if this is possible, it is possible to find a least * In this section I have for the most part followed the exposition of de la Vallee Poussin (Cours d'Analyse, t. i. pp. 19-21).

Page 181

89] CONTINUOUS AND DISCONTINUOUS FUNCTIONS 181 number - m which - b (x) cannot surpass: or, what is the same thing, a greatest number m below which the value of b (x) cannot sink. In this case we say that q (x) is limited below, and call m, the lower limit of b (x). If both M and m can be determined in this way we shall say that b (x) is limited throughout (a, b). And then m and M are the greatest and least numbers respectively such that m= b (ax) - M for all values of x in the interval (a, b). If M cannot be determined b (x) has no upper limit: in this case we can find values of + (x) algebraically greater than any number we can assign. In the same way + (x) may have no lower limit. THEOREM 1. If b (x) is continuous throughout (a, b) it is limited throughout. (a, b). We can certainly determine an interval (a, |), extending to the right from a, throughout which qb(x) is limited. For since b(x) is continuous for x = a, we can, given any number 8, however small, determine an interval (a, ~) throughout which b (x) lies between 5 (a) - 8 and ) (a) + 8; and obviously <b (x) is limited in this interval. Now divide the points ~ of the interval (a, b) into two classes T, U, putting in T if b (I) is limited in (a, ~), and in U if this is not the case. By what precedes T certainly exists: what we propose to prove is that U does not. Suppose that U does exist; and let 8/ be the value of: which divides T from U. Since pb (x) is continuous at x = / we can determine an interval (/ -, /3 + r) throughout which f (/) - 8 < q (x) < b (/) + 8, however small 8 may be. Thus + (x) is limited throughout (/3 -, +). But,8-q belongs to T. Therefore q(x) is limited throughout (a, i/3-):' and therefore it is limited throughout the whole interval (a, /3+^). But /3 +V belongs to U and so )b(x) is not limited throughout (a, / + -7). This contradiction shows that U does not exist. And so ( (x) is limited throughout the whole interval (a, b). THEOREM 2. If qb (x) is continuous throughout (a, b) and M and m are its upper and lower limits, q (x) assumes the values M and m at least once each in the interval.

Page 182

182 CONTINUOUS AND DISCONTINUOUS FUNCTIONS [V For since M-4-(x) assumes values less than any assignable number, l/[iM-(-(x) assumes values greater than any assignable number, and so, by Theorem 1, cannot be continuous. But il- 4)(x) is a continuous function, and so 1/{M - ( (x) is continuous at any point at which its denominator does not vanish (Ex. xxxvIII. 1). There must therefore be one such point: at this point 4 (x)= lM. Similarly for m. Examples XL. 1. If )(x)=i/x except for x=0, 4((x)=O when x=O, 4 (x) has neither an upper nor a lower limit in any interval which includes x=O in its interior, as e.g. the interval (-1, + 1). 2. If )(x) =/x2 except for x=O, q (x)=O when x=O0, 4 (x) has the lower limit 0, but no upper limit, in the interval (-1, + 1). 3. Let 4)(x)=sin (l/x) except for x=0, 4)(x)=0 when x-=0. Then +q(x) is discontinuous at x= 0. In any interval ( -, + 8) the lower limit is -1, and the upper limit + 1, and each of these values is assumed by 4) (x) an infinity of times. 4. Let ()(x)=x-[x]. This function is discontinuous for all integral values of x. In the interval (0, 1) its lower limit is 0 and its upper limit 1. It is equal to 0 when x=0 or 1, but it is never equal to 1. Thus 4 (x) never assumes a value equal to its upper limit. 5. Let (x)=0 when x is irrational, )(x)=q when x is a rational fraction p/q. In any interval (a, b), () (x) has the lower limit 0, but no upper limit. But if ) (x) =(-l)Pq when x=p/q, q) (x) has neither an upper nor a lower limit in any interval. 90. Continuous functions of several variables. The notions of continuity and discontinuity may be extended to functions of several independent variables (Ch. II, ~~ 21 et seq.). Their application to such functions, however, raises questions much more complicated and difficult than those which we have considered in this chapter. It would be impossible for us to discuss these questions in any detail here; but we shall, in the sequel, require to know what is meant by a continuous function of two variables, and we accordingly give the following definition. It is a straightforward generalisation of the last statement of ~ 84. The function 4q (x, y) of the two variables x and y is said to be continuous for x=, y=r if, given any positive number e however small, we'can choose 8 so that I 1 (a a )-<H (d,7) 1 < e if O _I x- _ Ic 8, 0o\ y 3 - i \8; that is to say if we can draw a square, whose sides are parallel to the axes of coordinates and of length 2b, and whose centre is the point ($, q), and which is such that the value of q (x, y) at any point inside it or on its boundary differs fronm ( ($, r) by less than, e*. * The reader should draw a figure to illustrate the definition.

Page 183

90] MISCELLANEOUS EXAMPLES ON CHAPTER V 183 Another method of stating the definition is this: q( (x, y) is continuous for x=-, y=q7 if p ($, 1r) is defined and / (x, y) — (, r7) wtoen x-u, y-.r. in any manner. This statement is apparently simpler; but it contains phrases the precise meaning of which has not yet been explained and can only be explained by the help of inequalities like those which occur in our original statement. It is easy to prove that the sums, the products, and in general the quotients of continuous functions of two variables are themselves continuous. A polynomial in two variables is continuous for all values of the variables; and the ordinary functions of x and y which occur in every-day analysis are generally continuous-i.e. are continuous except for pairs of values of x and y connected by special relations. MISCELLANEOUS EXAMPLES ON CHAPTER V. 1. Show that, if neither a nor b is zero, an + bxn +1.. + kaxn (1 + x), where ex is of the first order of smallness when x is large. 2. If P (x) = axn + bx-1 +... + k, then as x increases P (x) has ultimately the sign of a; and so has P (x + X) - P (x), if X is any constant. 3. Show that in general (awn + bint- 1 +... J+ k)/(Axo - + Bxn- 1 +... + ) = a + (13/) (1 + ex) where a=a/A, /=(bA - aB)/A2 and,e is of the first order of smallness when x is large. Indicate any exceptional cases. 4. Express (a2 + bx +c)/(As2+Bx+C) in the form a + (j3/x) + (y/x2) (1 + E), where ex is of the first order of smallness when x is large. 5. Show that lim x {,/(x +a) - /x} = -a. [Use the f ormula /(+a) =/{ a) }.] [Use the formula V(x + a) -./ax = a/{(x + a) + }~xl,.] 6. Show that \/(x + a)=,/x + (a/l/x) (1 +Ex), where ex is of the first order of smallness when x is large. 7. Find values of a and 3 such that /(ax2 -+ 2bx + c) - ax - 3 has the limit O as x- oo; and prove that lim x{ /(ax2+2bx +c)-ax- 2}= (ac- b2)/2a. 8. Evaluate lim x {/[x2 + V(x4 + 1)] - x,/2}. 9. If ax2+ bx y+cy2+2dx+2ey=O and A=2bde - ae2 - cd2, then one value of y is given by y=ax+ x2 +yx3 (1~ +ex), where a= - de, = A/2e3, y = (ccl - be) A/2e5, and e is of the first order of smallness when x is small. [If y-ax=7 we have -2erq = ax2 + 2bx (r+ + ax) +c (ra + a,=)2= Ax+ r 2B.x + C72,

Page 184

184 MISCELLANEOUS EXAMPLES ON CHAPTER V say. It is evident that r7 is of the second order of smallness, x/7 of the third, and r)2 of the fourth; and - 2e7 =Ax2+2ABx3, the error being of the fourth order.] 10. If x=ay+by2+cy3, then one value of y is given by y = ax + Ox2 + yX3 (1 + eF), where a=l/a, =3 - /a3, y = (2b2 -ac)/a5, and cx is of the first order of smallness when x is small. 11. If x=ay +by", where n is an integer greater than unity, then one value of y is given by y=ax++ j(3x+-yx2n'-l (1 +ex), where a= l/a, a3= - b/an +1, y = nb2/a2 + 1, and Ex is of the (n - 1)th order of smallness when x is small. 12. Prove that (secx - tanx) — -O as x- t~r. 13. Prove that 5 (x) = 1 - cos (1 - cos x) is of the fourth order of smallness when x is small; and find the limit of 6 (x)/x4 as x —0. 14. Prove that P (x) = x sin (sin x) - sin2 x is of the sixth order of smallness when x is small; and find the limit of q (x)/x'3 as x O0. 15. From a point T in a radius OA of a circle produced, a tangent TP is drawn to the circle, touching it in P, and 1N is drawn perpendicular to OA. Show that NAAAT-a1 as P moves up to A. 16. Tangents are drawn to a circular arc at its middle point and its extremities: A is the area of the triangle formed by the chord of the arc and the two tangents at the extremities, and A' the area of that formed by the three tangents. Show that, as the length of the arc tends to zero, A/A'-4. 17. For what values of a does {a+sin (l/x)}/x tend to (1) +co, (2) - oo, as x —O? [To +co if a>l, to -o if a<-1: if -1_a-l1 the function oscillates.] 18. For what values of x is the function (;x) -=- lim /(cos2 rrx) continuous or discontinuous? 19. Show that the least positive root of the equation sinx=ax is a continuous function of a throughout the interval (0, 1), and increases steadily from 0 to rr as a decreases from 1 to 0. [The function is the inverse of (sin x)lx: apply ~ 88.] 20. The least positive root of tanx=ax is a continuous function of a throughout the interval (1, o ), and increases steadily from 0 to 7rr as a increases from 1 towards oo. 21. If 0(x)==l/q when x=p/q, O(x))=O when x is irrational, O(x) is continuous for all irrational and discontinuous for all rational values of x. 22. Let b (x) =v when x is rational and 5 (x)= 1 - x when x is irrational. Show that, as x increases from 0 to 1, b (x) assumes every value, between and including 0 and 1, once and once only, although 0 (x) is discontinuous for every value of x except x= -.

Page 185

CHAPTER VI. DERIVATIVES AND INTEGRALS (A FIRST CHAPTER IN THE DIFFERENTIAL AND INTEGRAL CALCULUS). 91. Derivatives or Differential Coefficients. Let us return to the consideration of the properties which we naturally associate with the notion of a curve. The first and most obvious property is, as we saw in the last chapter, that which gives a curve its appearance of connectedness, and which we embodied in our definition of a continuous function, i.e. a function whose graph is a continuous curve, such as the curve C of Fig. 38. The ordinary curves which occur in elementary geometry, such as straight lines (which we have agreed, for convenience, to class as curves), circles and conic sections, have of course many other properties of a general character. The simplest and most noteworthy of these is perhaps that they have a definite direction at every point, or what is the same thing, that at every point of the curve we can draw a tangent to it. The reader will probably remember that in elementary geometry the tangent to a curve at P is defined to be 'the limiting position of the chord PQ, when Q moves up towards coincidence with P.' Let us consider what is implied in the assumption of the existence of such a limiting position. In the figure (Fig. 42) P is a fixed point on the curve, and Q a variable point; PM, QNr are parallel to O Y and PR to OX. We can denote the coordinates of P by x, y and those of Q by x + h, y + k: Ah will be positive or negative according as N lies to the right or left of M. We have assumed that there is a tangent to the curve at P, or that there is a definite 'limiting position' of the chord PQ. Suppose that the tangent at P, PT, makes an angle r with OX.

Page 186

186 DERIVATIVES AND INTEGRALS [VI Then to say that PT is the limiting position of PQ is equivalent to saying that the limit of the angle QPR is r, when Q approaches P along the curve from either side. R, p R 0 N M N X FIG. 42. We have now to distinguish two cases, a general case and an exceptional one. The general case is that in which + -=7r: i.e. PT is not parallel to OY. In this case RPQ tends to the limit 4, and RQ/PR = tan RPQ, tends to the limit tan*. Now RQ/PR = (Q - IMP)/MN= { (x + A) - (x)}/h. Hence lim r{J (x + h) - 5 (x)}/h = tan............ (1). The reader should be careful to note that in all these equations all lengths (such as RQ) are regarded as affected with the proper sign (e.g. RQ is negative in the figure when Q lies to the left of P), and that the convergence to the limit is unaffected by the sign of h. Thus the assumption that the curve which is the graph of x (x) has a tangent at P, which is not perpendicular to the axis of x, implies that q (x) has, for the particular value of x corresponding to P, the property that t{ (x + h)- ~ ()}/lh tends to a limit when h tends to zero. This of course implies that both of {{ ( +Jd) - { (x)}/h, {1 ( -/A)- ~ (z)}/( - A) tend to limits when h —O by positive values only, and that the two limits are equal. The reader will easily convince himself that if these limits exist but are not equal, the curve y= (x) has an angle at the particular point considered. In such a case the curve might of course be said to have two

Page 187

91, 92] DERIVATIVES AND INTEGRALS 187 directions at P, or two tangents. At any rate it has not one definite tangent or direction (see Fig. 43). Now let us suppose that the curve has (like the circle or ellipse) a tangent at every point of its length, or at any rate every portion of its length which corresponds to a certain range of variation of x. Further let us suppose this tangent never perpendicular to the axis of x: in the case of a circle this would of course restrict us to considering an arc less than a semicircle. Then an equation such as (I) holds for all values of x which fall inside this range. To each such value of x corresponds a value of tan sr: tan 4 is a function of x, which is defined for all values of x in the range of values under consideration, and which may be calculated or derived from the original function + (x). We shall call this function the derivative or derived function of 9 (x), and we shall denote it by +'(S) Another name for the derived function of + (x) is the differential coefficient of b (x); and the operation of calculating +'(x) from b(x) is generally known as differentiation: these names are by no means happily chosen, but the reader should be familiar with them. Before we proceed to consider the special case mentioned above, in which f= -7r, we shall illustrate our definition by some general remarks and particular illustrations. 92. Some general remarks. (1) The existence of a derived function 0'(x) for all values of x in the interval accx-b implies that at every point of this interval b (x) is continuous. For it is evident that lim {( (x + h)- - (x)}lh cannot exist unless lim b (x + h) = + (x), and it is this which is the property denoted by continuity. (2) It is natural to ask whether the converse is true, i.e. whether every continuous curve has a definite tangent at every point, and every function a differential coefficient for every value of x for which it is continuous*. The answer is obviously No: it * We leave out of account the exceptional case (which we have still to examine) in which the curve is supposed to have a tangent perpendicular to OX: apart from this possibility the two forms of the question stated above are equivalent.

Page 188

188 DERIVATIVES AND INTEGRALS [VI is sufficient to consider the curve formed by two straight lines meeting to form an angle (Fig. 43). The reader will verify at once that in this case {((x+h)- (x)}/h has the / limit tan a as h-0, if x < X0, but the p limit tan/3 if x > x,. And {4 (o h - b (x,)}/h has no limit as h —0. In fact it has FIG. 43. the limit tan 3 if h-0 by positive values and the limit tan a if h-0- by negative values. This is of course one of the cases in which a curve might reasonably be said to have two directions at a point. But the following example, although more difficult, shows conclusively that there are cases in which a continuous curve cannot be said to have either one definite direction or several directions at one of its points. Draw the graph of the function xsin(l/x) (Fig. 16, Ch. II). The function is not defined for x=0 and so is discontinuous for x=0. But (Exs. xxxvIIi. 17) the function defined by the equations 5 (xv)=xsin(l1/x) (x$:O), fq6(x)=O (x=O) is continuous for x= 0: and the graph of this function, if it could be drawn completely (and we can draw it quite adequately enough to obtain a general idea of its appearance) would satisfy our common sense intuition of what a continuous curve should be like*. But q (x) has no derivative for x=0. For q'(0) would be, by definition, lim {( (h) - ( (O)}/h or lim sin (l/h) and this limit does not exist. The reader, on studying the figure carefully, will probably agree that the curve does not look as if it had a tangent or a definite direction at the origin. It has even been shown that a function of x may be continuous and yet have no derivative for any value of x. But the proof of this is much more difficult; the reader who is interested in the question may be referred to Bromwich's Infinite Series, pp. 490-1, or Hobson's Thzeory of Functions of c Real Variable, pp. 620-5. (3) Rates of Variation. There is another general point of view from which the notion of the derivative of a function +(x) may be considered. Let us suppose that OM, measured along OX, represents the value of x, and ON, measured along OY, represents the value of y= (x): so that the corresponding point on the graph of S(x) is obtained by completing the rectangle OMPN. Further, let us imagine the variation of x as taking * No doubt it is a somewhat peculiar curve, but there is obviously no breach of continuity. The apparently arbitrary assignment of the value 0 to 0 (x) for x- 0 is in reality natural enough-it merely amounts to the filling in, so to say, of a single point previously missing in the curve y=x sin (l/x).

Page 189

92] DERIVATIVES AND INTEGRALS 189 place in time, so that M moves along the axis of x: it is most convenient to suppose that it moves steadily in the same direction with uniform velocity V, so that at time t, OM= x= Vt. Let us suppose that t + T is Y a time a little after or a little N'i P' before t, according as - is positive N R or negative: and let M', N', P' denote the corresponding positions of M, N, P. Finally let OM' = x + h, so that h = Vr, and let INP cut M'P' in R. o- M' X-m Then if +(x) is a function FIG. 44. such as those which we have been considering, the ratio RP'/PR converges to a limit, say I, when -0O: 1 is in fact the same as tan# in ~ 91. Also YN'/r = RP'/ = (RP'/PR)(PRT/), which has the limit Vl when r-0-. But the limit of NN'/r is precisely what is meant by the velocity of N. Thus the value of the derivative ~'(x) represents the velocity of N compared with the velocity of M. If, as is simplest, we suppose that M is moving with unit velocity, then +'(x) is actually equal to the velocity of N. Another way of expressing the same thing is to say that ' (x) represents the rate of increase of + (x), taking the rate of increase of x as our standard; if b (x) is decreasing, this 'rate of increase' is of course negative. It is evident that the geometrical apparatus, by means of which we have arrived at this notion of the meaning of +'(x), is not essential. There is no reason why we should imagine the values of y represented along a line perpendicular to OX: this representation was adopted merely to show that our present point of view is not essentially distinct from that of ~ 91. Another way of looking at the matter is as follows. Taking V=l as above we have x = t: and 'there is no need for any special geometrical representation of x at all: x is the time t, simply, and we may regard the time t itself as the independent variable. We may simply take a line L and suppose the values of y = ( (t) represented by the lengths OQ measured along this line from a fixed point 0. If, at time t, OQ = b(t), b' (t) is the velocity of Q at time t.

Page 190

190 DERIVATIVES AND INTEGRALS [vI Example. The reader is no doubt familiar with the formulae v = t +ft, s = ut + ift2 for a particle moving in a line with 'uniform acceleration f.' He will easily prove from the definition of a derivative that if s=ut+ft2=( (t), then (' (t)=u+ft, i.e. p'(t)=v. The rate of increase of s is v; and similarly the rate of increase of v is f. Examples XLI. 1. If p (x) is a constant, 0'(x)=O. Interpret this result geometrically. 2. If (x) =x, +q'(x)=1. Prove this (a) from the formal definition and (b) by geometrical considerations. 3. If ( (x) = ax + b, ''(x) = a. Again interpret geometrically. 4. If (x) =Xm, where m is a positive integer, ('(x)=~nxlm-1. [For ' (x) = lim {(x + h)7m - x'n} / = (m- - I))l^^-~n+...+ K~-ll, 1.52. The reader should observe that this method cannot be applied to p/q, where p/q is a fraction, as we have no means of expressing (x+h)S/l as a finite series of powers of h. We shall show later on (~ 98) that the result of this example holds for all rational values of m. Meanwhile the reader will find it instructive to determine q' (x) when mn has some special fractional value (e.g. -), by means of some special device.] 5. If ( (x) = sin x, (p'(x) = cos; and if p (x) =cos x, )' (x)= -sin x. [For example, if q( (x)= sinx, {q (x +h) - q( (x)l}/ ={2 sin h cos (x + h)}/h the limit of which is cosx when it -0, since lim cos (x + 'h) = cos. (the cosine being a continuous function) and lim {(sin -h)/lh} = 1 (Ex. xxxvII. 14).] 6. Equations of the tangent and normal to a curve y= ( (x). The tangent to the curve at the point (xo, Yo) is the line through (x0, yo) which makes an angle At with OX, where tan += (' (xo). Its equation is therefore Y - Yo = (x - o) ' (xo); and the equation of the normal (the perpendicular to the tangent at the point of contact) is (y- Yo) ' (o) + x - Xo = O. We have assumed that the tangent is not parallel to the axis of y. In this special case it is obvious that the tangent and normal are x=xo and y=yo respectively. 7. Write down the equations of the tangent and normal at any point of the parabola x2=4ay. Show that if xo=2al/m,?/o=a/m2 is a point of the curve, the tangent there is x = my+(a/m). 8. Find the equations of the tangent and normal at any point (in, c2/m) of the rectangular hyperbola xy = c2.

Page 191

DERIVATIVES AND INTEGRALS 191 9. Write down the equations of the tangents at any points of the curves y=sinx, y=cosx; and find where they make a given angle a with the.axis of x. 93. We have seen that if b(x) is not continuous for a value of x it cannot possibly have a derivative for that value of x. Thus 1/x2 cannot have a derivative for x= 0: nor can any function, such as sin (l/x), which is not defined for x = 0 and so is necessarily discontinuous for x=0. Or again the function [x], which is discontinuous for every integral value of x, has no derivative for any such value of x. Example. Since [x] is constant between every two integral values of x, its derivative is zero for all values of x for which it is defined. Thus the derivative of [x], which we may represent by [x]', is a function equal to zero for all values of x save integral values and undefined for integral values. It sill 7rX' is interesting to note that the functions [x]' and 1- s.n are the same sin 7r.X function. We saw too in Ex. xxxvIII. 7 that the types of discontinuity which occur most commonly, when we are dealing with the very simplest and most obvious kinds of functions, such as polynomials or rational or trigonometrical functions, are associated with an equation of the type (0 x) + o (or - c). In all these cases, as in such cases as those considered above, there is no derivative for certain special values of x. In fact, as was pointed out above, all discontinuities of Oq(x) are also discon-;tinuities of b' (x). But the converse is not true, as we may easily see by considering the special case referred to in ~ 91, in which the graph of ((x) has a tangent parallel to OY. The most typical,cases are shown in Fig. 45. In cases (c) and (d) the function is Q Q i Q Q X \ I \. RP R R PKR p R P Q Q Q (a) (b) (c) (d) 0 X FIG. 45.

Page 192

192 DERIVATIVES AND INTEGRALS two valued on one side of P and not defined on the other. In such cases we may consider the two sets of values of O (x), which occur on one side of P or the other, as defining distinct functions 0((x) and 02(x), the upper part of the curve corresponding to 0(x). The reader will easily convince himself that in (a) {f(x + h) - +(x))}h- + oo, as h- 0, and in (b) f+ (x + h)- + (x)}lh -; while in (c) {+l(x + h) - 1(x)}jlh- + oo, j,(.x + h)- +2(x)}jh - GO, and in (d) 1(X + h) - 0(x)}/A- - o, 1q2(x+ h)- 02(x)}/h + o, though of course in (c) only positive and in (d) only negative values of h can be considered, a fact which by itself would preclude the existence of a derivative at P. 94. Some general rules for differentiation. Throughout the theorems which follow we assume that the functions f(x) and F(x) have derivatives f'(x) and F'(x), for the values of x considered. (1) If ' (x) =f(x) + F (x), p (x) has a derivative ' (x) =f'(x) + F'(x). (2) If b(x)=cf(x), where c is a constant, +(x) has a derivative f'(x) = cf'(x). We leave it as an exercise to the reader to deduce these results from the general theorems on limits stated in Ex. xxxvi. 1. (3) If (f (x) =f(x) F(x), + (x) has a derivative +'(x) =f(x) F'(x) +f'(x) F(x). For b' (x) = lim {f(x + h) F(x + h) - f(x) F(x)}/h =lim f(x + h) F(x + h)- F(x) + f(x + h) — f(x) =f(x) F'(x) + F(x)f'(x). (4) b (x) == 1 f(x) has a derivative 0' (x) = -f'(X)/{f(x)}2.

Page 193

93, 94] DERIVATIVES AND INTEGRALS 193 In this theorem we of course suppose that f(x) is not zero for the particular value of x under consideration. Then,~ M,.1 \I f -f(xx)-.(x + ) f ' (x) Ih ( f(x + h)f(x) {f(X)2' (5) (x) =f(x)/F (x) has a derivative f' (x) F(x) - f(x) F'(x) W- - - {F()}2 This follows at once from (3) and (4). (6) (b(x) =f(x + a) has a derivative b'(x) =f'(x + a). (7) b (x) =f(ax) has a derivative c'(x) = af'(ax). The proof of the first of these two theorems we leave as an exercise to the reader. As regards the second, we have ' (x) = a lim {f(ax + ah) -f(ax)}/ah, or, putting ax = and ah = k, 4' (x) = a lim f{(u + k) -f(u)}/Ik a = af'(u) = f' (ax). (8)' (x) =f(ax + b) has a derivative b'(x)= af'(ax + b). This follows at once from (6) and (7). Thus iff(x) - sin x, 0 (x) = sin (ax + b), f'(x) = cos x, and ('(x) =a cos (ax + b). In particular, if a= and b = 7rr, 4 (x)=cosx and ' (x)= -sinx. The theorems (6), (7), (8) are capable of simple geometrical interpretations. Thus the graph of ] (x)=f(x+a) is obtained by translating that of f(x) through a distance a parallel to the axis OX; and the tangent to the curve y= ) (x), at the point whose abscissa is x, is parallel to the tangent to the curve y=f(x) at the point whose abscissa is x+a. The graph of 4 (x)=f(ax) may be obtained from that of f(x) by a geometrical construction which is probably sufficiently clear from Fig. 46 (where a=2). ~Y T' (, ~ / = Y ).J=f(x) C KL 0 / / xx X FIG. 46. H. A. 13;

Page 194

194 DERIVATIVES AND INTEGRALS Here '(x) = tan LP'T' and f'(2x) = tan LPT, and the equation +'(x) = 2'(2x) expresses the fact that the 'gradient' at P' of the locus of P' is twice that at P of the locus of P. Later (~ 108) we shall prove a much more general theorem of which (6), (7), (8) are particular cases. (9) If y= +(x), where 5 is the function inverse to r, so that x= r(y), and J(y) has a derivative r'(y) which is not equal to zero, then ^(x) has a derivative 0'(x)= l/ r'(y). For if (x + h) = y + k, k 0 as h-0, and \ (x) = lim (x + h) - (x) li ( + k) - y I ho0 (x + h)-x klo k+(y + k) - () - (y) The last function may now be expressed in terms of x by means of the relation y = +(x), so that b'(x) = 1/r' { (x)}. This theorem enables us to differentiate any function if we know the derivative of the inverse function. 95. Derivatives of complex functions. So far we have supposed that y = +(x) is a purely real function of x. If y is a complex function O(x)+ii#(x), we simply define the derivative of y as being +'(x) + ir'(x). The reader will have no difficulty in seeing that all the theorems proved above, except (9) (to which we cannot at present attach any meaning when y is complex), retain their validity in this case. 96. The notation of the differential calculus. We have already explained that what we call a derivative is often called a differential coefficient. Not only a different name but a different notation is often used; the derivative of the function y =(x) is often denoted by one or other of the expressions dy Of these the last is the most usual and convenient: the reader must however be careful to remember that dy/dx does not mean 'a certain number dy divided by another number dx': it means 'the result of a certain operation D1 or d/dx applied to y = b (x),' the operation being that of forming the quotient {[(x + h) - (x)}/h and making h- -0. Of course a notation at first sight so peculiar would not have been adopted without some reason, and the reason was as follows. The denomi

Page 195

94-96] DERIVATIVES AND INTEGRALS 195 nator h of the fraction {q( (x +h) - lj(x)}/h is the difference of the values x+ h, x of the independent variable x; similarly the numerator is the difference of the corresponding values q (x + h), cq (x) of the dependent variable y. These differences may be called the increments of x and y respectively, and denoted by ax and By. Then the fraction is 8y/8x, and it is for some purposes convenient to denote the limit of the fraction, which is the same thing as +'(x), by dy/dx. But this notation is purely symbolical: the dy and dx which occur in it cannot be separated; standing by themselves they would mean nothing: in particular dy and dx do not mean lim ay and lim Ax, these limits being simply zero. The reader will have to become familiar with this notation, but so long as it puzzles him he will be wise to avoid it by writing the differential coefficient in the form Dxy, or using the notation + (x), b' (x), as we have done in the preceding sections of this chapter. The first five theorems of ~ 94 may of course at once be translated into this notation. They may be stated as follows: dy_ dy. dY2 (1) if y =yL + y2, dy d (13) ~,.,~ dx dx - ddy dyi (2) if y=cyl, dx dx (3) if y-y1y2, dy Yldy+Y2dy'; dy I1 dy1 (4) if y = d- y d (5) if y4 dy dyl~ dy;Y22Y dy I dx\ (9) d= _ d) Examples XLII. 1. If y=sinxcosx=Isin2x, find dy/dx by means of Theorem (3) and also by means of Theorems (2) and (7), and verify that the results are in agreement. 2. Since (cos2x+sin2x) = 1, it follows that D (cos2x +sin2x)n=O. Verify this by expanding and differentiating each term separately. 3. If Y=YlY2Y3 then dOy dyl dy2 dy3 =?12Y3 X+ YJ3Y1 -/x + Y1i22 X, and if y =yly2...y-, dy n dy. -- I Yl Y2.* *Yr - 1 Y) + 1.. Yn x. dx' d=1. In particular, if y = z, dy/dx =nz (dz/dx). 13-2

Page 196

196 DERIVATIVES AND INTEGRALS 4. If =y y2....y, then ldy l d1 1 dY 1 dy y dx y- dx +y- dx * Y- d+' d1 dy n, dz In particular, if y = zn, I dy =n dz y dx z dx' 5. Employ Theorem (9) and Ex. 3 to show that the differential coefficient of xn is mxm-l for all rational values of m. [First suppose mn=l/q, where q is an integer. If y=-1/q, x-yq and dyl/dx=l/(/dx/dy)= (1/q)yl-=(l/q)x(l/q)-1. Next, if rn=p/q, y= z, where z=xl/q, and the proof may be completed by means of the result of Ex. 3.] 6. If y=arc sin, x=siny and dy_ d\ 1.1 d-x l cosy) - /J(1 - 2) The positive sign must be chosen if cosy is positive, and in particular if -7r=<y<7. 97. Standard forms. We shall now investigate more systematically the forms of the derivatives of a few of the simplest types of functions. A. Polynomials. If Ob(x) = aoxn + axn-l +... + a,, then ( (x) = naon-1 + (n - 1) al n-2 +... + a,_1. It is sometimes more convenient to use for the standard form of a polynomial of degree n in x what is known as the binomial form, or the form with binomial coefficients, viz. aocx + (n) axn-1 + (2) a9xn-2+... + a, which differs from the first form in that the coefficients are taken to be () a, () a2,...a instead of a,, a, In this case +'(x) = n { a,lxn- 2 (7 1) a 2n-2 1) -3+... + an-1. The binomial form of ((x) is often written symbolically as (ao, a,.,..., angs, 1)n; and then ' (x) = n (ao, a,..., a,_0ax, 1)1-1. We shall see later that ~(x) can always be expressed as the product of n factors in the form (x) = a0 (x - a,)(x -a2)... (x - an), where the a's are real or complex numbers. Then

Page 197

DERIVATIVES AND INTEGRALS 197 the notation implying that we form all possible products of n - factors, and add them all together. This form of the result holds even if several of the numbers a are equal; but of course then some of the terms on the right-hand side are repeated. The reader will easily verify that if (x) = ao (x - a1)l (x - a)m2... (x -,)then '(x) = a0 r m, (x - a)nl-l(x - a)m2... (x _ a )n. Examples XLIII. 1. Show that if ( (x) is a polynomial, (' (x) is the coefficient of h in the expansion of 4(x+h) in powers of h. 2. If ( (x) is divisible by (x - a)2, 4b'(x) is divisible by x - a: and generally, if (x) is divisible by (x-a)m, 4'(x) is divisible by (x-a)'- l. 3. Conversely, if +2(x) and 4'(x) are both divisible by x -a, +(x) is divisible by (x - a)2; and if 4 (x) is divisible by x - a and ' (x) by (x- a)lm-l, 4 (x) is divisible by (x - a)'. 4. The multiple roots (if any) of 4 (x) =0 are the values of x given by equating to zero the n.C.F. of + (x) and 02'(x). 5. Find all the roots, with their degrees of multiplicity, of 4 +-3x3 3x2 -11x-6 = O, +2x5-8x4-14x3+ 1Xx2+28x+12=0. 6. If ax2+2bx+c has a double root, i.e. is of the form a(x-aa)2, then 2(ax+b) must be divisible by x-a: i.e. a= - b/a. This value of x must satisfy a2+2bx + c=O. Verify that the condition thus arrived at is ac-b2 =0. 7. The equation l/(x-a)+1/(x- b)+l/(x-c)=0 can have a pair of equal roots only if a=b=c. (Math. Trip. 1905.) 8. Show that ax3 +3bx2 +3c + d= O, has a double root if G2+4H3=0, where H= ac- b2, G=a2d-3abc+2b3. [Put ax+b=y, when the equation reduces to y3+-3Hy+ G=O. This must have a root in common with y2+ H=0.] 9. The reader may verify that if a, 3, y, a are the roots of ax4 + 4bx3 +6c2 + 4dx+e==0, the equation whose roots are 1 2a ((a - ) (y - )-(y - a) (/3-) and two similar expressions formed by permuting a, /3, y cyclically, is 403 - g20 -g3 = 0, where g2=ae-4bd+3c2, g3=ace+2bcd-ad2 -eb2- c3. It is clear that if two of a, 3, y, 8 are equal two of the roots of this cubic will be equal. Using the result of Ex. 8 we deduce g3 - 27gs2=0.

Page 198

198 DERIVATIVES AND INTEGRALS [VI 10. If f (x) is any polynomial, then between any pair of roots of q (x) =0 lies a root of ' (x)=0. A general proof of this theorem, applying not only to polynomials but to other classes of functions, will be given later. The following is an algebraical proof valid for polynomials only. We suppose that a, 3 are two successive roots, repeated respectively m and n times, so that ) (x) = (x - a)m (y- )n 0 (x), where 0 (x) is a polynomial which has the same sign, say the positive sign, for a x _ /3. Then ) (X) = (x - a)m (y -- )n 0' (X) + {n ( - a)m-1 (y - -)n + ( - a)m (y _ -)n -1} 0 (x) -(X a)m - 1 (y- _ )n-1 [(X-a) (y- /-) ' (x) + {n ( -- y-3) +n (x- a)} ($)] =(x- a)m-1(y-) _ n-F(), say. Now F(a) =m (a -3) (a) and F(3)=n (3-a) 8O (), which have opposite signs. Hence F(x), and so ' (x), vanishes for some value of x between a and 3. 98. B. Rational Functions. If R(x) = P(x)/Q(x) where P and Q are polynomials, it follows at once from ~ 94 that R'() = {P'(x)Q(x) - P() Q'(x)}/Q(x)}2, and this enables us to write down the derivative of any rational function. The form in which we obtain it, however, may or may not be the simplest possible. It will be the simplest possible if Q(x) and Q'(x) have no common factor, i.e. if Q(x) has no repeated factor. But if Q(x) has a repeated factor the expression which we obtain for R'(x) will be capable of further reduction. It is very often convenient, in differentiating a rational function, to employ the method of partial fractions. We shall suppose that Q(x), as in ~ 97 above, is expressed in the form ao (x - al) (x - a2)m... (x -a,)V^. Then it is proved in treatises on Algebra* that R(x) can be expressed in the form T( 1,1 A X,2 A A1, II o, )2 + + X - a* (x- 1 (X -, 2,1 + A + 2,2 + + A2,2i x - a2 (x - )2 (x - aC2)n2 where II (x) is a polynomial, i.e. as the sum of a polynomial and * See e.g. Chrystal's Algebra, vol. i, pp. 151 et seq.

Page 199

98] DERIVATIVES AND INTEGRALS 199 the sum of a number of terms of the type A(x - a)P where a is a root of Q () = 0. We know already how to find the derivative of the polynomial: all that remains, therefore, is that we should determine the derivative of the function written above, in which, it must be remembered, a may be real or complex. We shall first establish the following proposition: the derivative of xm is mxm-1, for all integral or rational values of m, positive or negative. We have already seen (Ex. XLI. 4) that this is so when m is a positive integer. The more general result (which was proved indirectly in Ex. XLII. 5) is a mere corollary from Ex. xxxvII. 4. For, if \b (x)= x, '(x) = lim {(x + h)m - xm}/h hAO = lim (,m - xm)/( - x) = mxm-1. From this result it follows at once, by Theorem (6) (~ 94) that if ) (x) = A/(x - a)P then ' (x) = - pA/(x - a)P+l, provided a is real. We shall however require the result for complex as well as for real values of a. We shall therefore give a direct proof applicable to the case in which p is a positive integer and a has any value whatever. Then {(x + h - a)-P-(x-a)-Ph=-(x + h-a) -P (-a)-(+h-a)P-(x-a)P}/h which, when we expand (x +h - a)P in powers of h by the Binomial Theorem, and make h tend to 0, obviously tends to - ( -a)- 2{p(X-a)- 1= -p (I-a)-P-1. We are now able to write down the derivative of the general rational function R (x), in the form: AI ' 1,1 2Al, 2 A2,1 2A2,2 (X - a -)2 -)3 (x - -a )2 ( - "2)3 Examples XLIV. 1. Differentiate -/(l-X2), (1+x2)/(1-x2), {(1 +)/(1-x)}2, (1-x+X2)/(1+x+x2), x(1-x)/(l+.3), x(1 +x2)/(1 +x4), (1-X2)/(1+X2+4), l/(x-a)(x-b)(x-c), (x-a)/(x-b)(x-c), (x- a)(x- b)/(x-c)(xa-d), x/(x2 + a2) (2 + b2), (x2 + a2)/(2 + b2), (x - a)2/( + a)2 (2 + 2). 2. The derivative of (ax2+2bx+c)/(Ax2+2Bx+C) is {(ax + b)(Bx + C) - (bx + c)(Ax + B)}/(A2 + 2Bx + C)2.

Page 200

200 DERIVATIVES AND INTEGRALS [VI 3. Show that 1/(2 + 1)- {i/(X +i)} - {/( - i)} and 1/(1 + x) = {,/(x + 1)} + {Co/(x+ o)} + {(o2/(X+ co2)} where w and 02 are the complex cube roots of unity. Write down the derivatives of the functions and verify that they agree with the results obtained by means of the formula R'=(P'Q- PQ')/Q2. 4. If Q has a factor (x - a)", the denominator of R' (when reduced to its lowest terms) is divisible by (x - a) + 1 but by no higher power of (x - a). 5. In no case can the denominator of R' have a simple factor x-a. Hence no rational function (such as 1/x) whose denominator contains any simple factor can be the derivative of another rational function. 99. C. Algebraical Functions. We add a few examples connected with the differentiation of irrational algebraical functions. At present we are not in a position to consider this question systematically, as the theorems which we have proved do not furnish us with any simple or convenient method of finding the derivative even of so elementary a function as /(1 + x2). The further consideration of the subject must be postponed until ~~ 108-9. Examples XLV. 1. The derivative of x'*" is mnx"-I for any rational value of mn (~ 98). Deduce from Th. (8) of ~ 94 that the derivative of (ax + b)m is ma (ax + b)" -1. 2. Find the derivatives of,/(1 +), /(1 -x), s/{(l +x)/(l-x)}, V/{(ax+b)/(cx+d)}, {(l +x)/(1-x)}f, (l +,x)/(l - /x), {V/(x+ 1)+-V( -1)}/{\/(x+ 1)-,/(x- 1)}, (ax+ b)m (cx+ d)". 3. Show that if y =,/x + /(x +Vx) then x=y4/(2y+ 1)2. Show that dx/dy= 4y3 (y + 1)/(2y + 1)3 and deduce that dy 1 1 + 2s ) dxa 2^/x t 2,/(x+,J )J 100. D. Transcendental Functions. We have already proved (Ex. XLI. 5) that Dx sin x = cos x, D cos x = - sin x. By means of Theorems (4) and (5) of ~ 94 the reader will easily verify the following formulae: Dx tan x = sec2 x, D cot x = - cosec2 x, Dx sec x = tan x sec x, D) cosec x = - cot x cosec x. By means of Theorem (8) the reader can write down the derivatives of tan(ax +b), cot(ax + b), etc. And by means of Theorem (9) we can determine the derivatives of the ordinary

Page 201

99, 100] DERIVATIVES AND INTEGRALS 201 inverse trigonometrical functions, as was shown in Ex. XLII. 6 for the function arc sin x. The reader should verify the following formulae: Dx arc sin x = + 1/1(1 - x2), Dx arc cos x = T 1/4(1 - X2), Dx arc tan x = 1/(1 + X2), Dx arc cot x = - 1/(1 + X2), D arcsec x = +1/{xV(x2 - 1)}, D arccosec x = T 1/{V(x2 -1)}. In the case of the inverse sine and cosecant the ambiguous sign is the same as that of cos(arc sin x); in the case of the inverse cosine and secant the same as that of sin (arc cos x). The more general formulae Dx arc sin (x/a) = + 1/V/(a2 - x2), Dx arc tan (x/a) = a/(x2 + a2), — which are easily derived from Theorem (7) of ~ 94-are also of considerable importance. In the first of them the ambiguous sign is the same as that of a cos {arc sin (x/a)}, since aV{l - (x2/a2)} =+ V(a2 - a2) according as a is positive or negative. The classes of functions which we can now differentiate are limited, and will have to be extended later on. But the reader will find the results which have been proved sufficient for many interesting applications of the Calculus. Examples XLVI. 1. Differentiate sin2 -X, cos2(ax+b), sec2, tanx+secx, cot x+cosec x, (1 +sinx)/(l-sinx), (a+bcosx)/(a- bcosx). 2. Prove that if y = (tan x + sec x)m then dy/dx =my sec x; and establish a similar result for (cotx+cosecx)n. 3. If y =cosax + isin ax, dydx = aiy. 4. Differentiate xcosx, (sinx)/x. Show that the values of x for which the tangents to the curves y=x cos x, = (sin x)/x are parallel to the axis of x are roots of cot x=x, tan x=x respectively. These equations may be solved graphically in the way explained in ~ 20 and Exs. xvIII. 5. Verify by differentiation that arcsinx+arccosx is constant for all values of x between 0 and ~-, and arc tan x + arc cot x for all positive values of x. 6. It is easy to see (cf. Exs. xvIII.) that the equation sin x = ax, where a is positive, has no real roots except x=0 if a _ 1, and if a<l a finite number of roots which increases as a diminishes. Prove that the values of a for which the number of roots changes are the values of cos $, where $ is a positive root of the equation tan =. [The values required are the values of a for which y= ax touches y = sin x.]

Page 202

202 DERIVATIVES AND INTEGRALS [VI 7. Show that Dxarc cos (x/a)= 1//(a2 - x2), D arc sec (x/a) = ~ 1/{x/(x2 - a2)}, determining in each case how the ambiguous sign depends on that of a and the value of the function chosen. 101. Repeated differentiation. We may form a new function 0" (x) from 0' (x) just as we formed 0' (x) from f(x). This function is called the second derivative or second differential coefficient of f (x). If y = (x), we may also write b" (x) in any of the forms D., / d 2 d2y dx y, ~i~ly, dx2~ In exactly the same way we may define the nth derivative or nth differential coefficient of y = q(x), which may be written in any of the forms b(n)(x), Dxny, (d Y' df But it is only in a few cases that it is easy to write down a general formula for the nth differential coefficient of a given function. Some of these cases will be found in the examples which follow. Examples XLVII.. If (x)=x, p()(x)=m(m- 1)...(m-n+l)xm-n. 2. If q(x)=(ax+b)m, O(n)(x)=m(m-1)... (m-n+1l)an(ax+b)m-n. In these two examples m may have any rational value. If m is a positive integer and n>m it is clear that b() (x) = 0. 3. The result of Ex. 1 enables us to write down the nth derivative of any polynomial. Thus if (x) = x7 - 3x6 +x4- 11, =(3) () =7.6. 5x- 3.6.5. 43 + 4.3.2x = 210x4- 360x3 +24x. 4. The formula Dn {A/(x - a} = ( - 1)np (p + )... (p + n - 1) A/(x - a)P + n enables us to write down the nth derivative of any rational function expressed in the standard form as a sum of partial fractions. 5. Prove that the nth derivative of 1/(1 - x2) is 2 (n!) {(1- X)-n-l + (-l)n (1 + X)-n-l}. 6. Leibniz' Theorem. If y is a product yly2, and we can form the first n derivatives of y, and y2, we can form the nth derivative of y by means of Leibniz' Theorem, which gives the rule ADn(YlY2) =DxnYl.y2 + () n ix-lY1. DxY2 + (2) Dxn -2Yl. * Dx2Y2+...Dxy2 _E()V.Ad 1/ n /m\3Dxlt-ryl. D rY2 -=2( W r.

Page 203

101] DERIVATIVES AND INTEGRALS 203 In this formula Dxyl, Dx~Y2 must be interpreted as meaning simply Yi and Y2. To prove the theorem we observe that Dx (Yly2) = DxY1. 2 +1. DxY2, D2 (y1y2) = Dx21.2 + 2DxY1. DxY2 + yl D2y2; and so on. It is obvious that by repeating this process we arrive at a formula of the type Dn (yly2)- D Y2 + an, 1 D x- l y1. Dxy2 + a,, 2 Dx - 2 yl. D.x2y2 +... +yl1. Dxn Y2. Let us assume that a,r= ( ) for r=, 2,... n-, and show that if this is so then an+l,r= (+ ) for r= 1 2,... It will then follow by the principle of mathematical induction that a,,,.= () for all values of n and r in question. When we form Dxn +1 (Y1Y2) by differentiating Dxn (yY2) it is clear that the coefficient of DXn + 1- Dl.ry is (n\ n n+l anr+anr-=() + (r ) This establishes the theorem. 7. Find <(n)(x) when b (x) is any one of the functions x/( +x), x2/(l- ), x sinx, x2cosx. Verify, in the case of the first two functions, that the results agree with those obtained by the method of partial fractions. 8. The nth derivative of xmf(x) is (n ^) m-nf( )+ n+ m-n+f '(X) n (n - )! x-+2 f () 1.2 (n -n+2)! the series being continued for n +1 terms or until zero terms occur. 9. Differentiate 1/(1 -x2) ={1/(1 +x)}{1/( -x)} n times by means of Leibniz' theorem. [The result is n!; {(- 1)r/(l +X)r +(1 - )n-r + }. It is r=O easy to verify that this result agrees with that of Ex. 5.] 10. Prove that DXn cos x=cos (x+ 2 nr), Dn sin x= sin (x +n7r). 11. If y= A cos mx B sin m, then D2y + m2y =0. And if y= A cos mx + B sin mn + Pn (x), where P (x) is a polynomial of degree n, then Dxn + 3 + m2 D + ly= 0. 12. If x2Dx2y+xDy/+y=O, then x2Dxn+2y + (2n+ l)xDn+ l + (n2 + 1)Dxy = 0. [Differentiate n times by Leibniz' Theorem.] 13. If Us denotes the nth derivative of (Lx +Mi)/(x2 - 2Bx + C), then x2 -2Bx+C 2 - ( h - B) (n-)( n+2+ nUz l +l1+ U,=0. (Math. Trip. 1900.) (n + 1) (n2) 2 n+1

Page 204

204 DERIVATIVES AND INTEGRALS [VI [First obtain the equation for n=O; then differentiate n times by Leibniz' Theorem.] 14. The nth derivatives of a/(a2+x2) and x/(a2 + x2). Since a 1 / 1 1 a x 1 1 \ a2-x2 -2i' -ai x+ai' a2+x2-2 a+4x- ai) we have Dn (a2 ) = 2i (x - ai) + 1 (x +ai) +1' and a similar formula for Dyn {.x/(a2 +x2)}. If p= (2/(X a2), and 0 is the numerically smallest angle whose cosine and sine are x/p and a/p, then x+ai=pCisO and x-ai=p Cis (-), and so Dn {a/(a2+z 2)}= {(-_ l)nn!/2i}p -e-1[Cis{(n + 1)0 - Cis {- ( +1) 0] =( - )ln! (2 + a2) - (t + 1)/2 sin {(n + 1) arc tan (a/x)). Similarly DXl {x/(a2 + x2)}- ( - )" n! (x2 + a2) - (" + 1)/2 cos {(n + 1) arc tan (a/x)}. These results enable us to write down the nth derivative of (Xx +)/(x2+ a2); that of (Xx+f )/(Ax2+2Bx+ C), where B2<AC, so that the roots of the denominator are complex, may be found in the same way if we first make the substitution Ax+B=At. When the roots are real the function can be expressed in the form {H/(x - a)} + {K/(x-(3)}, where H, IK, a, ( are real, and the nth derivative can be written down immediately in a real form. 15. Obtain identities from the equations Dx (xn. xn) =DZ xm +, DX (cosxsin x)=D 'sin 2x, calculating the left-hand sides by Leibniz' theorem and the right-hand sides directly. 16. Prove that D" {(cos x)/x} = {P, cos (x + ~n.r) + Q, sin (x + n7r)}/lx + 1, Dyn {(sin x)/x} = {P, sin (x + -n7r) - Q,, cos (x + -niwr)}/ + 1, where P,, and Qn are polynomials in x of degree n and n - 1 respectively. 17. Establish the formulae dx 1/(dy d2x d2y /dy\3 d3X_ fd3y dy_3 d 2 (dyY dy / dx ' dy2 dx2 \dxr ' dy - -dx3 dx \dx2d j/ dx 18. If yz=l and y,=(l/r!)Dxy, z,=(l/s!) Dxz, then 1 Z1 Z2 1 /2 3/3 Y3 z2 3 2 Y3 Y4 z2 Z3 Z4 (Math. Trip. 1905.) 19. If W(y, z, u)= y z u, dashes denoting differentiations with y' 2' u' respect to x, then W(y, z, u)=y3 ( 1, -, -). \ 2/

Page 205

DERIVATIVES AND INTEGRALS 205 20. If ax2 2hxy + by2+2gx + 2fy+c =0, then dy/dx= - (ax + hy +g)/(hx + by + ) and d2y/ldx2 = - (abc + 2fgh - af 2 - bg2 - ch2)/(hx+ by +f )3. 102. Some general theorems concerning derived functions. In all that follows we suppose that ~ (x) is a function of x which has a derivative +'(x) for all values of x in question. This assumption of course involves the continuity of ((x) (~ 92 (1)). The meaning of the sign of ' (x). THEOREM A. If /'(x0)>O, + (x) increases with x in the neighbourhood of X=x,: it is, moreover, an increasing function of x in the stricter sense (~ 88); that is to say we can find a small interval of values of x, stretching away from x0 on both sides, and such that if x, and x2 are any values of x in this interval, and x, < x2, then b(x1) < b(x2). For, as h —0, {( (xo + h) - (xo)}/h converges to a positive limit b'(x0). This can only be the case if, for sufficiently small values of h, O(x0 + h) - ((xo) and h have the same sign; and this is precisely what the theorem states. Of course from a geometrical point of view the result is intuitive, the inequality 0'(x)>0 expressing the fact that the tangent to the curve y = b(x) makes a positive acute angle with the axis of x. COR. 1. If Tb' (x) >0 for all values of x in a certain interval, b (x) is an increasing function of x (in the stricter sense of ~ 88) throughout that interval. COR. 2. If ' (x) > 0 throughout the interval (a, b), and 0g(a) 0, then p (x) is positive throughout the interval (a, b). The reader should formulate for himself the corresponding theorems for the case in which +'(x) < 0. An immediate deduction from Theorem A is the following important theorem, generally known as Rolle's Theorem: THEOREM B. If b (a)= 0 and b (b)= 0 there must be at least one value of x which lies between a and b and for which b'(x) = 0. There are two possibilities: the first is that b (x) is equal to zero throughout the whole interval (a, b). In this case 'q(x) is also equal to zero throughout the interval. If this is not so there must be at least one value of x for which b (x) is positive or negative, say positive. By Theorem 2 of ~ 89, there is a value

Page 206

206 DERIVATIVES AND INTEGRALS f: of x, not equal to a or b, and such that b (I:) is at least as great as the value of ((x) at any other point in the interval. And ~b'(:) must be equal to zero. For if it were positive, b(4) would be an increasing function in the neighbourhood of x= 4, and so there would be greater values of b (x) than f (I). Similarly we can show that +'(4) cannot be negative. COR. If b (a) = b (b) = k there must be a value of x between a and b, such that 0'(x)= O. We have only to put +((x)-k- = (x) and apply Theorem B to (x). The reader should consider the obvious geometrical meaning of this theorem and its corollary. 103. Maxima and Minima. We shall say that the value +b(f) assumed by 4 (x) for x = f is a maximum if qb (:) is greater than any other value assumed by b (x) in the immediate neighbourhood of x = f, i.e. if we can find an interval (I- 8, + 8) of values of x such that + (t)> + (x) for - 8 <x< and < x< + 3: and similarly we define a minimum. Thus in the figure the points A correspond to maxima, the points B to minima of the function Al ___A2 \. 4JB3 B2 FIG. 47. whose graph is there shown. It is to be observed that the fact that A3 corresponds to a maximum and B1 to a minimum is in no way inconsistent with the fact that the value of the function is greater at B1 than at As. THEOREM C. A necessary condition for a maximum or minimum value of b (x) at x =: is that /' (f) = 0O. This follows at once from the fact that b(x) cannot be an increasing or a decreasing function at x = f. In geometrical language the tangent at x = 4 is parallel to the axis of x. * A function which, like those referred to in ~ 92 (2), is continuous but has no derivative, may have maxima and minima. We are of course assuming the existence of the derivative.

Page 207

102-105] DERIVATIVES AND INTEGRALS 207 That this condition is not sufficient is evident from a glance at the point C in the figure. Thus if y =x, b)'(x)=3x2, which vanishes when x = 0. But x = 0 does not give either a maximum or a minimum of x3, as is obvious from the form of the graph of x3 (~ 13, Fig. 11). But there will certainly be a maximum for x= 4 if /' (4:)= 0, ' (x) > 0 for all values of x less than but near to x, and ' (x) < 0 for all values of x greater than but near to x: and if the signs of these two inequalities are reversed there will certainly be a minimum. For then we can determine an interval (I-8, 4) throughout which b (x) increases with x, and an interval (I, + 8) throughout which it decreases as x increases: and obviously this ensures that 0b(4) shall be a maximum. This result may also be stated thus-if the sign of b'(x) changes at x from positive to negative, then x = gives a maximum of ((x): and if the sign of q'(x) changes in the opposite sense, then x = gives a minimum. 104. There is another way of stating the conditions for a maximum or minimum which is often useful. Let us assume that +(x) has a second derivative +"(x): this of course does not follow from the existence of +'(x), any more than the existence of +'(x) follows from that of (x). But in such cases as we are likely to meet with at present the condition is generally satisfied. THEOREM D. If 0'(f) = 0 and ' (I) =j 0, (b(x) has a maximum or minimum for x=: a maximum if +b"(t) < 0, a mi imum if qi (4) >o. Suppose, e.g., +"(:) < 0. Then +'(x) is decreasing near x= I:, and so its sign changes from positive to negative. Thus x= gives a maximum. 105. In what has preceded (apart from the last paragraph) we have assumed simply that (x) has a derivative for all values of x in the interval under consideration. If this condition is not fulfilled the theorems cease to be true. Thus Theorem B fails in the case of the function y=l - /(X2), where the square root is to be taken positive. The graph of this function is shown in Fig. 48. Here /(- 1)=((1)=0: but 0'(x) (as is evident from the figure) is equal to + 1 if x is negative and to -1 if x is positive, and never

Page 208

208 DERIVATIVES AND INTEGRALS [VI vanishes. There is no derivative for x =0, and no definite tangent to the graph at P. And in this case x=0 obviously gives a maximum of) (x), but ('(0), as it does not exist, cannot be equal to zero, so that the test for a Y maximum fails. P The bare existence of the derivative )' (x), however, is all that we have assumed. And there is one assumption in particular that we have not made, and that is that 5' (x) itself is a con- O tinuous function. This raises a rather subtle but still a very interesting point. FIG. 48. Can a function )q (x) have a derivative for all values of x which is not itself continuous? In other words can a curve have a tangent at every point, and yet the direction of the tangent not vary continuously? The reader if he considers what the question means and tries to answer it in the light of common sense will probably incline to the answer, No. That this answer is wrong is shown in the next set of examples (Ex. XLVIII. 44). However let us leave this point for the moment and evade the difficulty by assuming that the derivative )' (x) is continuous; an assumption which is, needed in what follows, although it was not needed in what precedes. We might of course have assumed this all along, in proving the theorems of ~~ 102-103; and there would, for practical purposes, have been no harm in making the assumption, as in all ordinary cases there is no doubt about the continuity of the derivative: but as the assumption would in no way have shortened or simplified the proofs of the theorems we should have gained nothing by making it. But in the next theorem, and some theorems which will be proved later on in this chapter, the fact that q' (x) is continuous is essential to the argument. THEOREM E. The maxirma and minima of ( (x) occur alternately. For at a maximum )'(x) changes its sign from positive to negative: and, being continuous, it can only become positive again by passing through zero and changing sign from negative to positive; and this gives a minimum of 4 (x). Examples XLVIII. 1. Verify Theorem B when r(x)==(x -a)m (x-b) or (x- a)m(x- b)n(x - c)p, where m, n, p are positive integers and a<b<c. [The first function vanishes for x=a and x=b. And )' (x) = (x - a)m - (x - b)n-1 {(m + n) x - mb - na} vanishes for wx=(mb+na)/(m+n), which lies between a and b. In the second case we have to verify that the quadratic equation (m+n+-p)x2- {m (b + c) + n(c + a) +p (a+ b)}x+ mbc+ nca+pab=O has roots between a and b, and between b and c.]

Page 209

DERIVATIVES AND INTEGRALS 209 2. Show that the polynomials 2x+3x2- 12 + 7, 3x4+8x3- 6x2- 24x+19, are positive for x>l. 3. Show that x- sin x is an increasing function for all values of x, and that tan x - x increases as x varies from - Tr to T7r. For what values of a is ax- sin x a steadily increasing or decreasing function of x? 4. Show that tan x-x also increases from X=Tir to x=wTr, from x=a|r to x= 7r,: and so on, and deduce that there is one and only one root of the equation tan x=x in each of these intervals (cf. Ex. xvIIi. 5). 5. Deduce from Ex. 3 that sinx - x <0 if x>0, from this that cosx-1+~x2>0, and from this that sinx-x+ -x3>0. And, generally, prove that if x x x2 X2m.C<7=cos-1l+^ -...-(-1)'~2m!, x^3 x2m + 1 2m+ 1= sin x- x+ -...-(-1) 1)! and x>0, then Cm > 0 and S2 + 1 > 0 according as m is odd or even. 6. Show that xsinx+cosx+Jcos2x increases as x increases fronm 0 to ~Tr. 7. If f(x) and f" (x) are continuous and have the same sign at every point of an interval (a, b), this interval can include at most one root of f() =o. 8. The functions u, v and their derivatives i', v' are continuous throughout a certain interval of values of x, and uv'-u'v never vanishes at any point of the interval. Show that between any two roots of u.=0 occurs one of v = 0, and conversely. [If v does not vanish between two roots of = 0, say a and f, the function u/v is continuous throughout the interval (a, f3) and vanishes at its extremities. Hence (u/v)'=(u'v-uv')/v2 must vanish between a and 3, which contradicts our hypotheses.] 9. Verify the preceding theorem when = cos, v = sinx. 10. Show how to determine as completely as possible the multiple roots of P(x)=O, where P(x) is a polynomial, with their degrees of multiplicity, by means of the elementary algebraical operations. [If H1 is the highest common factor of P and P', H2 the highest common factor of H1 and P", H3 that of H2 and P"', and so on, then the roots of Hi1/Hf2=0 are the double roots of P=0, the roots of H2/H3 =0 the treble roots, and so on. But it may not be possible to complete the solution of 1/H2 =0, 2/iH3=0, etc. Thus if P(x)=(x-1)35(x-x-7)2, H1/H2=xS-x-7 and H2/H3=x- 1; and we cannot solve the first equation.] 11. Show that x5 -10x2+15x-6=0 has a treble root, and find it. 12. If + (x) is a polynomial and f(x) = (x - a)r (x), then f' (a)=f" (a) =... =f-i)(a) =0, f(')(a)=r! (a). H. A. 14

Page 210

210 DERIVATIVES AND INTEGRALS [VI 13. Determine the maxima and minima (if any) of (x - 1)2l(x+ 2), x3 - 3x, 2x3 - 3x2 - 36 + 10, 4xs 3-18x2 + 27x - 7, 3x4 - 4x3+1, - 15x3 + 3. In each case sketch the form of the graph of the function. [Consider the last function, for example. Here d' (x) = 5x2 (x2 - 9), which vanishes for x = 0, x= + 3. It is easy to see that x= -3 gives a maximum, x= +3 a minimum, while x=O gives neither, as +'(x) is negative on both sides of x=0.] 14. Discuss the maxima and minima of x (x- 1), X2(X- 1)2, X3(X-1)3, 4 (x- 1)4. Sketch the graphs of the functions. 15. Discuss similarly the function (x-a)(x-b)2(x-c)3, distinguishing the different forms of the graph which correspond to different hypotheses as to the relative magnitudes of a, b, c. 16. Discuss similarly the function (x-a)m(x- b)n, where m and n are any positive integers, considering the different cases which occur according as m and Ad are odd or even. 17. Show that (ax+ b)/(cx+d), whatever values a, b, c, d may have, has no maxima or minima. Draw a graph of the function. 18. Discuss the maxima and minima of {(1-x)/(l +x)}2, 2x/(l +x2), 2x/(1 -x2), (1-X2)/(1+X2), (1 +2)/(1-x2). 19. Discuss the maxima and minima of the function y = (ax2 + 2bx + c)/(Ax2 +- 2Bx + c),,when the denominator has complex roots. [We may suppose a and A positive. The derivative vanishes if (ax + b)(Bx +c) - (Ax +B) (bx + c)=0.................. (1). This equation must have real roots. For if not the derivative would always have the same sign, and this is impossible, since y is continuous for all values of x, and y -. a/A as x — - co or x- - co. It is easy to verify that the curve cuts the line y=a/A in one and only one point, and that it lies above this line for large positive values of x, and below it for large negative values, or vice versa, according as b/a >B/A. Thus the algebraically greater root of (1) gives a maximum if b/a> B/A, a minimum in the contrary case.] 20. The maximum and minimum values themselves are the values of X for which ax2 + 2bx + c - X (Ax2+2Bx+ C) is a perfect square. [This is the condition that y=X should touch the curve.] 21. In general the maxima and minima of R(x) = P (x)/Q(x) are among the values of X obtained by expressing the condition that P(x)- XQ (x) =0 should have a pair of equal roots. 22. If Ax2+-2Bx+-C=O has real roots it is convenient to proceed as follows. We have v - (a/A) = (Xx + 4)/{A (Ax2 + 2Bx + C)}

Page 211

DERIVATIVES AND INTEGRALS 211 where X=bA-aB, li=cA-aC. Writing further 4 for Xx+,u and 7 for (A/X2)(Ay-a) we obtain an equation of the form 7 = /{( -P) (4- q)}. This transformation from (x, y) to (4, 7) amounts only to a shifting of the origin, keeping the axes parallel to themselves, a change of scale along each axis, and (if X<0) a reversal in direction of the axis of abscissae, and so a minimum of y, considered as a function of x, corresponds to a minimum of 77 considered as a function of 4, and vice versa; and similarly for a maximum. The derivative of 7 with respect to 4 vanishes if (~-p)(t — q)-4 (e-p)-4 (e-q) =0, or if 2 =pq. Thus there are two roots of the derivative if p and q have the same sign, none if they have opposite signs. In the latter case the form of the graph of 77 is as shown in Fig. 49 a. P q q _ Pq v _At 0 0 o FIG. 49 a. FIG. 49b. FIG. 49 C. When p and q are positive the general form of the graph is as shown in Fig. 49 b, and it is easy to see that =V/pq gives a maximum and 4= - /pq a minimum*. In the particular case in which p=q the Y graph of _ =7 1(4-p)2 is of the form shown in Fig. 49 c. 0 - 1 I 2 X The preceding discussion fails if X =0, i.e. if a/A = b/B. But in this case we have y - (a/A) = /{A (Ax2 + 2Bx + C)} = (j/A2)/((X-X1) (X- x2)}, say, and dy/dx=- gives the single value x= i(xl +x2). On drawing a graph it is clear that this is a maximum or minimum according as pu is positive or negative. The graph shown in Fig. 50 corresponds to the former case. [A full discussion of the general function y = (ax2 2bx + c)/(Ax2+ 2B + C), will be found in Chrystal's Algebra, vol. I. pp. 464-7: there however only purely algebraical methods are used.] 23. Show that (x - a) (x - 3)/(x - y) assumes all real values as x varies, if a lies between 3 and y, and otherwise assumes all values except those included in an interval of length 4 ^/{(a 3) (a y)}. * The maximum is - 1/(/ - /q)2, the minimum -1/(,/p+/q)2, of which the latter is the greater. 14-2

Page 212

212 DERIVATIVES AND INTEGRALS 24. Discuss (x2-x+ )/(x2+x+l), (x2-x+3)/(x2-3x+2). 25. The maximum and minimum of (x+ a)(x + b)/( - a)(x- b), where a and b are positive, are //a+ fb\2 (/a - 2 b\2 ka-^/bJ ' /a +/ Jb 26. The maximum value of (x - 1)2/(x+ 1)3 is -. 27. Discuss the maxima and minima of X(X-1)/(x2+3x+3), x4/(x-1)(x —3)3. 28. Discuss the maxima and minima of (x-1)2(32 - 2x-37)/(x + 5)2(32- 14- - 1). (Math. Trip. 1898.) [If the function be denoted by P (x)/Q(x), it will be found that P'Q- PQ'= 72 (x - 7) (x - 3) ( - 1) (x + 1) (x + 2) (x+ 5).] 29. Find the maxima and minima of a cos x+ b sin x. Verify the result by expressing the function in the form A cos (x - a). 30. Find the maxima and minima of a2 cos2 x + b2 sin2 x, A cos2 x + 2H cos x sin x + B sin2 x. 31. Show that sin (x + a)/sin (x + b) has no maxima or minima. Draw a graph of the function. 32. The least value of a2 sec2 x + b2 cosec2 x is (a + b)2. 33. Show that tan 3x cot 2x cannot lie between - and -. 34. Find the maxima and minima of (1 + 2x arc tan x)/(1 + x2). 35. The base of a triangle is equal to a, and the ratio of the other two sides is r. Show that the maximum value of its area is ~a2r/(r2 1). 36. A line is drawn through a fixed point (a, b) to meet the axes OX, 0 Y in P and Q. Show that the minimum values of PQ, OP+OQ, and OP. OQ are respectively (at + b), (a/a +/b)2, 4ab. 37. A tangent to an ellipse meets the axes in P and Q. Show that the least value of PQ is equal to the sum of the semiaxes of the ellipse. 38. Find the lengths and directions of the axes of the conic ax2 + 2hxy + by2 = 1. [The length r of the semidiameter which makes an angle 0 with the axes of x is given by l/r2 =a os22 + 2h cos 0 sin 0+ b sin2 0. The condition for a maximum or minimum value of r is tan 20 = 2h/(a - b): eliminating 0 between these two equations we find {a - (l/r.2)} {b - (l/r2))= h2.] 39. The greatest value of the product of two positive numbers whose sum is constant is obtained by supposing them equal. Extend the result to any number of numbers.

Page 213

105] DERIVATIVES AND INTEGRALS 213 40. The greatest value of xmyn, where x+y =k, is mmnnkm + /( + n)m +. 41. The greatest value of ax +by, where x + y +y2=3K2, is 2K %(a2- ab + b2). [If ax+ by is a maximum, a + b (dy/dx) =0. The relation between x and y gives (2x +y) + (x+ 2y) (dy/dx) =. Equate the two values of dy/dx.] 42. If 0 and 4) are acute angles connected by the relation a sec 0 + b sec Q = c, where a, b, c are positive, then acos +b cos ) is a minimum when 0=4q. 43. Two particles are moving with uniform velocities u, v along two straight lines which make an angle co with one another. Originally they are at distances a, b from the point of intersection of the lines, and they are moving towards the point of intersection. Show that they are nearest after time {au + bv - (av + bu) cos co}/(62 - 2uv cos c + v2) has elapsed, and that then their distance is (av ~ bZ) sin ol/ (u2 - 2uv cos o + v2). 44. Example of a function whose derivative exists for every value of x, but is discontinuous. In the case of a function whose graph has such a form as that shown in Fig. 48 the derivative is obviously discontinuous for x=0: but the derivative does not exist for x= 0 (it is obvious from the figure that there is no definite tangent at P). We shall now give an example of a function (curve) which has a derivative (tangent) everywhere, although the derivative (tangent) is discontinuous (discontinuous in direction). Let us consider how we might set about constructing such a curve geometrically. Draw a double curve (as shown in Fig. 51 a, where the upper and lower curves are y = + x2 respectively) which touches the axis of x at the origin on both sides. 0 0 FIG. 51 a. FIG. 51 b. Now draw a wavy curve, as in the figure, oscillating continually between the two curves, and forming a single continuous branch passing through the origin. The reader will easily convince himself that it is possible to draw the curve so that the tangent to the curve at P continually oscillates through an angle greater than some fixed angle as P approaches 0. Thus tan 4

Page 214

214 DERIVATIVES AND INTEGRALS [VI cannot tend to a limit as x'-O. None the less the curve has a perfectly definite tangent at 0. The common sense way of looking at it is that the curve always lies between the curves y= +x2 and y= -x2; and that as each of these has a definite direction at 0, viz. along OX, the other curve must have the same direction too. Analytically we may regard the matter as follows. If ( (x) is the function represented by the curve, q (0)=0, and - 2 < ) (X) _ 2, so that lim { (x) - (O)}/x= 0, i.e. q' (0) =0. We shall see later (Ex. L. 17) that the function +)(x) which is equal to x2sin (l/.x) when x+O0, and to 0 when x=0, is such a function. The reader may conceivably object that the tangent at 0 cuts the curve infinitely often near 0, and that this is contrary to his notion of the tangent to a curve. The reply to this is that even if this is so it does not affect the purpose of the example, which is to show that c'(x) may exist for every value of x and yet be discontinuous. And as a matter of fact it is easy to construct curves which possess the same peculiarity and for which the tangent at 0 does not meet the curve at all except at 0. The nature of such examples will be sufficiently illustrated by Fig. 51 b. 45. It is however impossible that 0'(x) should have what was in Chap. V. called a simple discontinuity: e.g. that we should have q'(x)-,a when x —0 by positive values and 0'(x) —b when x-0- by negative values, and ' (0) =c, unless a, b, c are all equal and so q/' (x) continuous. A proof of this will be given shortly (Ex. XLIX. 3); from a geometrical point of view the result is obvious, since if a curve has an angle (as in Figs. 43, 48) it cannot have a definite tangent at the vertex of the angle. 106. The Mean Value Theorem. We can proceed now to the proof of another general and exceedingly important theorem, commonly known as 'The Mean Value Theorem' or ' The Theorem of the Mean.' THEOREM. If b (x) has a derivative for all values of x in the interval (a, b), it must be possible to find a value ~ of x between B a and b, such that c (b) - < (a) = (b - a) b' (:). Before we give a strict proof A (b) of this theorem, which is perhaps the most important theorem in ( the Differential Calculus, it will be well to point out its obvious geometrical meaning. This is a 0 b X simply (see Fig. 52) that if the Fig. 52. curve APB has a tangent at all points of its length, there must

Page 215

106, 107] DERIVATIVES AND INTEGRALS 215 be a point, such as P, where the tangent is parallel to AB. For 4' (5:) is the tangent of the angle which the tangent at P makes with OX, and t{)(b)-)(a)}/(b-a) the tangent of the angle which AB makes with OX. It is easy to give a strict analytical proof. Consider the function 4) (b) -4) () - b - a { (b) - (a)}, which vanishes when x = a and x = b. It follows from Theorem B above that there is a value ~ for which its derivative vanishes. But this derivative is 4 (b)-) (a) _, ) which proves the theorem. It should be observed that in this proof it has not been assumed that )'(x) is continuous. Examples XLIX. 1. Show that the expression b —2x ( (b) - ( (x)- b- a { - (b)- ( (a)} is the difference between the ordinates of a point on the curve and the corresponding point on the chord. 2. Verify the theorem for ) (x) =x2 and for () (x) =x3. [In the latter case we have to prove that (b3-a3)/(b-a)=342, where a< <b: i.e. that if I (b2+ab+a2)=-2, then 4 lies between a and b.] 3. Prove the result of Ex. XLVIII. 45 by means of the mean value theorem. [Since 9'(O)-c we can find a small positive value of x such that {p (x) - <p (O)}/x is nearly equal to c; and therefore, by the theorem, a small positive value of 4 such that ('(() is nearly equal to c, which is inconsistent with lim ('(x)=a, unless a=c. Similarly b=c.] x-~+0 107. Another form of the Mean Value Theorem. It is often convenient to express the Mean Value Theorem in the form 4(b)= 4 (a)+ (b- a) ' {a + (b- a)} where 0 is a number lying between 0 and 1. Of. course a + 0(b- a) is merely another way of writing 'some number | between a and b.' If we put b = a + h we obtain h (a + h) = h) (a) + ho'(a + Oh) which is the form in which the theorem is most often quoted.

Page 216

216 DERIVATIVES AND INTEGRALS [VI 108. Differentiation of a function of a function. It is convenient at this stage to introduce a very important theorem which enables us to complete our results relating to the differentiation of the functions which commonly occur in analysis. THEOREM. If f(x) and F(x) are functions with derivatives f' (x) and F' (x), then F f(x)} has a derivative F'{f (x)} f' (x). For let f(x)=y, f(x+h)=y+k, F(y)=z, F(y+k)=z+l. Then k and 1 are functions of h which tend to zero with h, andf' (x) and F'(y) or F {f(x)} are by definition lim (k/h) and lim (l/h) respectively. And the derivative of Fff(x)} is by definition lim (l/h) = lim (1/k) x lim (k/h) = F' {f(x)}f' (x). The following alternative method of proof, depending on the Mean Value Theorem, is interesting. It will be noticed that in using it we have to assume the continuity and not merely the existence of the derivatives of the functions f and F. This is of course a theoretical limitation, but not one of any practical importance. The derivative of F{f(x)} is by definition im F {f(x + h) -f (x)} h But, by the Mean Value Theorem, f(x + h) =f(x) + hf' (a), where t is a number lying between x and x + h. And F {f(x) + hf' (}F)} = {f(x)} + hf () F' (), where Al is a number lying between f(x) and f(x)+hf'(:). Hence the derivative of F {f(x)} is lim f' () F' (T))= f' (x) F' {f(x)}, since — x and,-~f (x) as h-0O. The most important cases of this theorem are the following: (i) If f(x)=ax +b, f'(x)=a, and so the derivative of F(ax+b) is aF'(ax+b), as result was proved independently in ~ 94, Theorem (8). (ii) If f(x)-xm% f'(x)=mxm-l, and so the derivative of F(xm) is mx"-1F' (xm). (iii) If f(x)=sinx, f'(x)=cosx, and so the derivative of F(sinx) is cos xF' (sin x). Similarly the derivative of F(cosx) is - sin xF' (cos ).

Page 217

108, 109] DERIVATIVES AND INTEGRALS 217 109. By means of the theorem proved in the last section we are able practically to complete the rules for differentiation given in ~ 94 and to write down the derivative of any common function whatsoever. For the only ordinary class of functions with which the rules there given did not enable us to deal is precisely that to which this theorem applies, viz. what may be called composite functions, or functions of functions. The manner of applying the theorem will be seen from the ensuing examples. Examples L. 1. Consider the function q5 (x) = ^(a2 + x2). This function may be expressed in the form F{f(x)}, where f(x)=a2+x2, F(y)=^/y, and so f'(x)=2x, F'(y)=~I/y. Thus +9'(x)=x//(a2+x2). 2. Find the derivatives of V(a2 - 2), 1/V(a2+ 2), 1/V(a2-x2), x//(a2+x2), x//(a2 -2), (a2 +X2)n, (a2 - 2)m, (a2+x2)m(a2 - 2), n(a2+.2)n, /(a2 m), V/{(1 + +x2)/(1 - x + x2)}, V{(a + 2b + cx2)/(A + 2B + Cx2)}, {J(1 + X) + J/(1 - )}/{V(1 + )-(1) -x)}, {X - /(1 + x2)}m, xm(a + bx1)P. Here m, n, p are any rational numbers. 3. Find the derivatives of (sin x)%n (cos x),2, sin (xm), cos (xm), sin (cos x), cos (sin x),,/(a + b sin x), cos x//(a + b sin x), {(a cos x + b sin X)/(a cos +/3 sin x)},,/(a2 cos2 x + b2 sin2 x), sin x cos xl/(a2 cos2 x + b2 sin2 x). 4. Find the derivatives of x are sin x +, /(1 - 2), (1 + x) arc tan /sx - ^x, arc tan V/{(a - x)/(a - x)}. 5. Find the derivatives of arcsin /(1- x2), arctan(I/x), arcsin {x//(1 +x2)}, arc sin {2x V/(1 - x2)}, arc tan {2x/(1 - x2)}, ar tan {(a + x)/(1 - ax)}. How do you explain the simplicity of the results? 6. Differentiate 1 ax+b 1. ax+b ---— 2 arc tan, - arc sin J/(ac - b2) / (ac- b2) V a/(- a) s /(bs - ac)' 7. The differential coefficient of arc cos ^/{(cos 30)/cos3 0} is V/{3/(cos 30 cos 9)}. (Math. Trip. 1904.) 8. Show that each of the functions 2 arc cos V/{(a -x)/(a - 3)}, 2 arc sin ^{(x - 3)/(a - 3)}, 2 arc tan {^/(x- )/(a- x)}, arc sin [2 V/{(a -x) ( - 3)}/(a -,3)] has the derivative 1/V/{(a - x) (x- )}. * In these examples it is assumed that the constants a, b,... which occur have such values that the functions which involve them are real.

Page 218

218 DERIVATIVES AND INTEGRALS 9. Show that 1 d - /(f C(ax2+c) -_ 1,/{C(Ac-aC)} dx Lrc cs / tc(AI+ C) (A2 +C) /(ax + c) 10. Each of the functions 1 /acosx+b\ 2 f /ta-b\, 1 (a2 b2) arc cos a+ b cos x, ' /(a- b) arc tan / -b tan, ^/(a2 - b2) a+bcosxx t SA-b2) [ a+6 2 has the derivative l/(a + bcos x). 11. If X=a+ b cos x+c sin x, and 1 aX-a2 +b2+C2 ^-7(02 ^^)arc cos y = b(a2 _ b2_ c) arc co X (b2 + c2) then dy/ldx= 1/X. 12. Find the equations of the tangent and normal at the point (x0, yO) of the circle x2 +y2=a2. [Here y=VJ(a2- x2), dy/dx= -x/J(a2 - x2), and the tangent is y-yO = (x - xo) { - xo/(a2 - Xo2)}, which may be put in the form xxo+yyo=a2. The normal is Xryo-yXo=0, which of course passes through the origin.] 13. Find the equations of the tangent and normal at any point of the ellipse (xla)2+ (y/b)2= 1 and the hyperbola (x/a)2 - (:/b)2= 1. 14. The equations of the tangent and normal to the curve w=x (t), y=~(t), at the point (t), are x- q (t) Y-,k(t) +'t) +(t- {z - q (t)})q/(t) + {y-, (t)}) ' (t)= o.,'(t) q'(t) ' 15. Prove that the derivative of F [f{ q(x)}] is F' [f{o (x)}]f' {4 (x)} +'(x), and extend the result to still more complicated cases. 16. If u and v are functions of x, then D, arc tan (u/v) = (vD, u - uD v)/(u2 + v2). 17. If (x) =x2 sin (l/x) when xS 0 and q (0)=0, then () (x) = 2x sin (l/x) - cos (l/x) when xO0, and /')(0)=0. And c'(x) is discontinuous for x=0 (cf. Ex. XLVIII. 44). 110. The Mean Value Theorem furnishes us with a proof of a result which is of essential importance for what follows:-if ' (x)=0, throughout a certain interval of values of x, O (x) is constant throughout that interval. For if a and b are any two values of x in the interval, 4 (b)- + (a)= (b - a) 4' {a+ 0 (b - a) = 0. An immediate corollary is that if +'(x) = ''(x), throughout a certain interval, the functions b (x) and +t(x) differ, throughout that interval, by a constant.

Page 219

110, 111] DERIVATIVES AND INTEGRALS 219 111. Integration. We have in this chapter seen how we can differentiate a given function +)(x)-i.e. find its derivative-in a variety of cases, including all those of the commonest occurrence. It is natural to consider the converse question, that of determining a function whose derivative is a given function. Suppose that r (x) is the given function. Then we wish to determine a function such that b' (x) = (x). A little reflection shows us that this question may really be analysed into three parts. (1) In the first place we want to know whether such a function as < (x) actually exists. This is a purely theoretical question, and must be carefully distinguished from the practical question as to whether (supposing that there is such a function) we can find any simple formula to express it. (2) We want to know whether it is possible that more than one such function should exist: i.e. we want to know whether our problem is one which admits of a unique solution or not; and if not, we want to know whether there is any simple relation between the different solutions which will enable us to express all of them in terms of any particular one. (3) If there is a solution, we want to know how to find an actual expression for it. It will throw light on the nature of these three distinct questions if we compare them with the three corresponding questions which arise with regard to the differentiation of functions. (1) A function 4 (x) may have a derivative for all values of x (like xm, where m is a positive integer, or sin x). It may generally, but not always have one (like /x or tanx or secx). Or again it may never have one: for example the function considered in Ex. xvII. 11, which is nowhere continuous, has obviously no derivative for any value of x. Of course, during this chapter, we have confined ourselves to functions which are continuous except for some special values of x. The example of the function l/x, however, shows that a continuous function may not have a derivative for some special value of x (in this case x = 0). Whether there are continuous functions which never have derivatives, or continuous curves which never have tangents, is a further question

Page 220

220 DERIVATIVES AND INTEGRALS [VI which is at present beyond us. Common-sense says No: but, as we have already stated (~ 92 (2)), this is one of the cases in which higher mathematics has proved common-sense to be mistaken. But at any rate it is clear enough that the theoretical question -has 4 (x) a derivative ' (x)?-is one which has to be answered differently in different circumstances. And we may expect that the converse question-is there a function ((x) of which +(x) is the derivative?will have different answers too. We have already seen that there are cases in which the answer is No: thus if + (x) is the function which is equal to a, b, or c according as x is less than, equal to, or greater than 0, the answer is No (Exs. XLVIII. 45, XLIX. 3), unless a= b = c. This is a case in which the given function is discontinuous. In what follows, however, we shall always suppose 4r(x) continuous. And then the answer is, Yes: if r (x) is continuous there is always a function 4 (x) such that ' (x) = F (x). To prove this would take us beyond our limits, however: in Ch. VII. we shall give a proof, not perfectly general, but general enough to deal with the simplest and most interesting cases that arise. (2) The second question presents no difficulties. In the case of differentiation we have a direct definition of the derivative which makes it clear from the beginning that there cannot possibly be more than one. In the case of the converse problem the answer is almost equally simple. It is that if ( (x) is one solution of the problem 4 (x) + C is another, for any value of the constant C: and that all possible solutions are comprised in the form )(x) + C. This follows at once from ~ 110. (3) The practical problem of actually finding 4' (x) is as a rule a fairly simple one. We have already shown how it can be done in a number of cases, and the theorem of ~ 108, in conjunction with the rules of ~ 94, make the problem easy enough in the case of any function defined by some finite combination of the ordinary functional symbols. The converse problem is much more difficult. The nature of the difficulties will appear more clearly later on. DEFINITIONS. If r (x) is the derivative of 4 (x), we call 4 (x) the integral or integral function of + (x). The operation of forming f (x) from 4) (x) we call integration.

Page 221

Ill, 112] DERIVATIVES AND INTEGRALS 221 We shall use the notation C () = (x) dx. It is hardly necessary to point out that f..dx like d/dx must, at present at any rate, be regarded purely as a symbol of operation: the f and the dx no more mean anything when taken by themselves than do the d and dx of the other operative symbol d/dx. The reason for this notation will be explained in Ch. VII. 112. The practical problem of integration. The results of the earlier part of this chapter enable us to write down at once the integrals of some of the commonest functions. Thus C +XM1 + r jx mdxm -, cos xdx = sin x, sin xdx =-cos...(1) These formulae must be understood as meaning that the function on the right-hand side is one integral of that under the sign of integration. The most general integral is of course obtained by adding to the former a constant C, known as the arbitrary constant of integration. There is however one case of exception to the first formula, that in which m= 1. In this case the formula becomes nugatory, as is only to be expected, since we have already (Ex. XLIV. 5) seen that 1/x cannot be the derivative of any polynomial or rational fraction. And in fact it can be proved (though the proof is too detailed and tedious to be inserted here) that it is impossible to form, by means of a finite combination of the functional signs which correspond to any of the classes of functions which we have so far considered-signs such as +, x, -,,/, sin, arc sin,-a function of x whose derivative is 1/x. Some further discussion of this point will be found in Ch. IX. For the present we shall be content to assume that, if there is such a function, it is an essentially new function. That there really is a function F(x) such that DF(x)= 1x will be proved in the next chapter; and the properties of this function will be investigated in Ch. IX. For the present we shall simply assume the existence of such a function, and we

Page 222

222 DERIVATIVES AND INTEGRALS [VI shall call it the logarithmic function and denote it by log x; so that - = log...........................(2). The four formulae (1) and (2) are the four most fundamental standard forms of the Integral Calculus. To them should perhaps be added two more, viz. j 1 +X2arca f dx - | l do x = arc tan x, | /(1 - )- + arc sin x*...(3). 113. Polynomials. All the general theorems of ~ 94 may of course be also stated as theorems in integration. Thus we have, to begin with, the formulae f{f(x)+F(x)}dx=ff(x)dx+ F(x)dx.........(1), f(x) dx= = f (x)dx...................(2) Here it is assumed, of course, that the arbitrary constants are adjusted properly. Thus the formula (1) asserts that the sum of any integral of f(x) and any integral of F(x) is an integral of f(x) + F(x). These theorems enable us to write down at once the integral of any function of the form X A,f, (x), the sum of a finite number of constant multiples of functions whose integrals are known. In particular we can write down the integral of any polynomial: in fact C a, xn~4"1 a^x" (aOxn + a1,x+-1 a) n+ +... + a) d + x. n + — I- n 114. Rational Functions. After integrating polynomials it is natural to turn our attention next to rational functions. Let us suppose R (x) to be any rational function expressed in the standard form of ~ 98, viz. as the sum of a polynomial H (x) and a number of terms of the form A/(x - a)P. We can at once write down the integrals of the polynomial and of all the other terms except those for which p = 1, since A A 1 (x - a)p p - 1 (x- a)-l ' whether a be real or complex (~ 98). * See ~ 100 for the rule for determining the ambiguous sign.

Page 223

112-114] DERIVATIVES AND INTEGRALS 223 The terms for which p= 1 present rather more difficulty. And it is convenient at this stage to introduce another general theorem in integration. It follows immediately from Theorem (8) of ~ 94 that if j r(x)dx= (x), and a and b are real, then J (ax + b)dx =(1/a) (ax - b)............(3) Thus, for example, dax+ b=(l/a) log (ax + b), and, in particular, if a is real, = log (x - a). We can therefore write down the integrals of all the terms in R (x) for which p = 1 and a is real. There remain the terms for which p = 1 and a is complex. In order to deal with these we shall introduce a restrictive hypothesis, viz. that R(x) is a real function-i.e. that all its coefficients are real. Then if a= y + i is a root of Q(x) =0, m times repeated, so is a' = y-i8. Moreover, if the partial fractions corresponding to the factor (x-a)m are Ap/(x -a)P, those corresponding to the factor (x-a')m are Ap'/(x-a')P, where Ap' is conjugate (Ch. III, ~ 30) to Ap. This follows from the nature of the algebraical processes by means of which the partial fractions can be found, and which are explained at length in treatises on Algebra*. Thus if a term (X + i -)/(x - - i) occurs in the standard form of R (x), so will a term (X - i)/(x - 7 i8); and the sum of these two terms is 2 tX (x - )- I8}1(x - y)2 + 82}. This fraction is in reality the most general fraction of the form (Ax + B)/(ax2 + 2bx + c), where b2< ac. The reader will easily verify the equivalence of * See, for example, Chrystal's Algebra, vol. i, pp. 151-9.

Page 224

224 DERIVATIVES AND INTEGRALS the two forms, the formulae which express X,,/, y, 8 in terms of A, B, a, b, c being X = A/2a, u =-D/(2a^/V), =-b/a, 8 = /A/a, where A = ac - b2, and D = aB - bA. We shall now introduce another general theorem in integration, which follows at once from the theorem of ~ 108: viz. f {.f(x)} f' ()dx = F {()}............... (4). If in particular we suppose F{f(x)} to be logf(x), so that F' f{(x)} = 1/f(x), we obtain fJ (x) dx = log f (x); f( ) and if we further suppose that f (x) = (x - X)2 + 2 we obtain 2(x - x) dx = log {( - X)2 + ~2. Again, in virtue of the equations (3) of ~ 112 and (3) above 2) +.2 dx =- 28 arc tan ( — (X - X)+ JL2 +la2 These two formulae enable us to integrate the sum of the two terms which we have been considering in the expression of R (x); and we are thus enabled to write down the integral of any rational function, if all the factors of its denominator can be determined. The integral of any such function is composed of the sum of a polynomial, a number of rational fractions of the type - A/{(p - 1)(x-a)-} a number of logarithmic functions, and a number of inverse tangents. It only remains to add that if a is complex such a fraction as - A/{(p - 1)(x - a)P-1 always occurs in conjunction with another in which A and a are replaced by the complex numbers conjugate to them, and that the sum of the two fractions is a real rational fraction. Examples LI. 1. The integral of the function (Ax+B)/(ax2+2bx+c) may be expressed in the form ~A D A log + D [log {ax + b-(-( - A) - log {ax+ b+(- A)}] 2a z2ad(-a) (where X= ax2+2bx+c) if A<0, and in the form A log X+ D arc tan (ax+ 2a anA if A>o.

Page 225

DERIVATIVES AND INTEGRALS 225 2. In the particular case in which ac= b2 the integral is - D/{a (ax + b)} + (A/a) log {x + (b/a)). 3. Show that if the roots of Q(x)=0 are all real and distinct, and P(x) is of lower degree than Q(x), then JR () dx= X Q'(a) log (x -a) the summation applying to all the roots a of Q (x)= O. [That the fraction corresponding to a is Q(a) - follows from the fact 0 Q'(a)x-a that Q(x)/(x-a) —> Q'(a) and so (x-a)R(x) — P(a)/Q' (a), as x —a.] 4. If all the roots of Q (x) are real and one (a) is a double root, the rest being simple roots, and P(x) is of lower degree than Q(x), then the integral is A/(x -a) +A'log(x- a) + B log (x-/), where A= -2P(a)/Q"(a), A' =i{3P'(a)Q"(a)-P(a) Q"'(a)}/{Q" (a)}2, B=P(3)/Q'(3), and the summation applies to all roots / of Q(x)=0 other than a. 5. Find I - J{(X -1)(2+1)}12 [The expression in partial fractions is 1 1 i 2-i i 2+i 4(x-1)2 2(x-1) 8(x-i)2 8(x-i)8(x+i)2 8(x+i)' and the integral is 1 1 4(x- 1) -4(x + 1) - log(xi - '1) +4 log (x2+ 1) arctanx*.] 6. Integrate /{(x- a) (x - ) (x - c), x/{(x-a)2(x- b)}, x(x- a)3, X/{(X2+a2)(x2+ b2)}, 2/{(x2+a2) (X2+ b2)), X3/{(X2 + a2) (X2+b2)}, (X+1)/{2(X-1)}, (X+l)/{X(X-1)2}, (X+ )/1{(X-1)}2, (x2-1)/{x2(x2+1)}, (2 -1)/{ (x2 + )2}, (x2-1)/{x(x2+ 1)}2. 7. Prove the formulae: +dx4 4 = 2 {log (l + x2+x)- log(l — V2+x2)+2arc tan (1 ^/2), f1 + x4 X 4 log(l-x+,V2 +x2)- log (1 +ix/2+ 2) + 2 arc tan (- )} 1 +x+ =4 3 {/3[log(l+ X+x2)-log(1l- +x2)]+2arc tan ( 1j' * In this case the application of the general method of ~ 114 is fairly simple. In more complicated cases the labour involved is sometimes prohibitive, and other devices have to be used. We have, moreover, assumed that all the factors of the denominator can be determined. If this is not the case the method of partial fractions fails, and recourse must be had to other methods. For further information concerning the integration of rational functions the reader may be referred to Goursat's Cours d'Analyse, t. i, pp. 234 et seq., and to the author's tract The integration of functions of a single variable, pp. 10 et seq. H. A. 15

Page 226

226 DERIVATIVES AND INTEGRALS [VI 115. Algebraical Functions. We naturally pass on next to the question of the integration of algebraical functions. We shall confine our attention to explicit algebraical functions (Ch. II, ~ 16). We have to consider the problem of integrating y, where y is an explicit algebraical function of x. It is however convenient to consider an apparently more general integral, viz. JR (x, y) dx, where R(x, y) is any rational function of x and y. The greater generality of this form is only apparent, since (Ex. xv. 6) the function R (x, y) is itself an algebraical function of x. The choice of this form is in fact dictated simply by motives of convenience: such a function as (x + V(aX2 + 2bx + c)}/{x - V(ax2 + 2bx + c)} is far more conveniently regarded as a rational function of x and the simple algebraical function \/(ax2 + 2bx + c), than directly as itself an algebraical function of x. 116. Integration by substitution and rationalisation. It follows from equation (4) of ~ 114 that if f+(x) dx = (x), then {f(t} ' ( = ( t)............... (1). This equation supplies us with a method for determining the integral of + (x) in a large number of cases in which the form of the integral is not directly obvious. It may be stated as a rule as follows: put x =f(t), where f(t) is any function of a new variable t which it may be convenient to choose; multiply by f'(t) and determine (if possible) the integral of r {ff(t)} f '(t); express the result in terms of x. It will often be found that the function of t to which we are led by the application of this rule is one whose integral can easily be calculated. This is always so, for example, if it is a rational function, and it is very often possible to choose the relation between x and t so that this shall be the case. Thus the integral of R(V/x), where R denotes a rational function, is reduced by the substitution x=t2 to the integral of 2tR(t2), i.e. to the integral of a rational function of t. This method of integration is called integration by rationalisation, and is of extremely wide application.

Page 227

115-117] DERIVATIVES AND INTEGRALS 227 Its application to the problem immediately under consideration is obvious. If we can find a variable t such that x and y are both rational functions of t, say x = R,(t), y = R, (t), then JR (x, y) dx = f {[R1(t), R2(t)} R1'(t)dt, and the latter integral, being that of a rational function of t, can be calculated by the methods of ~ 114. It would carry us beyond our present range to enter upon any general discussion as to when it is and when it is not possible to find an auxiliary variable t connected with x and y in the manner indicated above. We shall only consider a few simple and interesting special cases. 117. Integrals connected with conics. Let us suppose that x and y are connected by an equation of the form ax2 + 2hxy + by2 + 2gx + 2fy + c = 0; in other words that the graph of y, considered as a function of x, is a conic. Suppose that (?, v) is any point on the conic, and let x-= X, y- v= = Y. If the relation between x and y is expressed in terms of X and Y it assumes the form aX2 + 2hXY+ b Y + 2GX + 2FY= 0, where F=h +br +f, G= a + h + g. In this equation put Y= tX. It will then be found that X and Y can both be expressed as rational functions of t, and therefore x and y can be so expressed, the actual formulae being x- -=-2(G+Ft)/(a+2ht+bt2), y- =-2t(G +Ft)/(a+2ht+bt2). Hence the process of rationalisation described in the last section can be carried out. The reader should verify that hx + by +f - ~ (a + 2ht + bt2) d 2 dr' sothat f dx -2 dt so that hx by +f 2 a + 2ht + bt2' a formula which will be useful later on. 15-2

Page 228

228 DERIVATIVES AND INTEGRALS 118. The integral JR tx, J(ax2 + 2bx + c)} dx. The most. important case is that in which the relation between x and y is 2 = ax2 + 2bx + c. Let:, V/(a2 + 2b6 + c) be the coordinates of any point on the conic. The relation between X and Y is aX2 - 2 + 2 (a4 + b) X - 27Y = 0, and the formulae expressing x and y in terms of t are 2 (a + b - tr) 2t (a: + b - tr) x-a - tV a - at2...-1. Consider, for example, the integral f. If A=ac-b2, there are three cases to consider, viz. those in which (i) a>O, A>0; (ii) a>0, A<0; (iii) a<O, A<0. If a<O and A>0, ax2+ 2bx+c is always negative, so that this case is not of any interest. If a>0, A>0, the conic is a hyperbola with one branch entirely above the axis of x, which is described once as t varies from -//a to +,/a. If (5,,1) lies on this branch, Y7>0 and - ~a<t<V/a. Since now h =f=O, b=-1, the last formula of ~ 117 gives Id=2 2 dt 1= {log(s/ca+t)-log (/a-t)} JY Ja-t2 t /a But it follows from the equations (1) that (x - )^Va + (y - )= -2 (a +b - tr)l/(Ja t) and so dx 1 a+t_ 1 (X - a + +(y -77) = = -logog (-) oa - (y-a........ (2), J y Va Va-t, - t =-)xa-(y-r) the logarithm being written in this form in order to avoid any possible difficulty as to the sign of the function inside the large bracket. This equation is true for all pairs of values of $ and r7 related as above. A particularly simple form of the integral may be found as follows. Since we may add any constant to the right-hand side, we have j- 2 log {(x - ) Ja - (y - r)}2 - log - Ja - ) Now suppose that a - oo. Then it is easy to see that -+ oo, a/a+rq — b/^/a, and that the contents of the last large bracket tend to * In the succeeding discussion we anticipate the fundamental properties of the logarithm, which will be proved later on, viz. that log u is continuous for all positive values of u, log 1=0, log (1/u)= -log u, log uv=log +logv.

Page 229

118, 119] DERIVATIVES AND INTEGRALS 229 unity. We thus obtain the formula f = 2 I, log ( a+y+ 2....... ' (3) If in particular a=1, b=0, c=a2 we obtain j =/( 2 g)=log + V( + a2)}.....................(4). The truth of this equation may be at once verified by differentiation of the right-hand side. If we transform the integral by the substitution x2+a2=u2, U=^(x2- a2) we obtain, on writing x again for u, (,2w-a-2 log {x + d(x2-a2)}............................ This integral may also be calculated directly by an argument similar to that used for the integral (4). It is the simplest example of case (ii) above. The reader should associate with these two formulae the third formula I /(a = arc sin (x/a)...........................(6). This integral corresponds to the case (iii) above. The formula appears very different from (4) and (5): the reader will hardly be in a position to appreciate the connection between them until he has read Chs. IX and X. In the last formula it is supposed that a is positive: if a is negative the integral function is arc sin (x/ i a ) = - arc sin (x/a) (cf. ~ 100). 119. The integral f - 2 + -dx. This integral can V(ax2 + 2bx + c) be integrated in all cases by means of the results of the preceding sections. It is most convenient to proceed as follows. Since Xx + t =- (X/a)(ax + b) + ta - (Xb/a), /( ax2 + 2b- + c) dx = V(ax2 + 2bx + c), J \/(aX2 + 2bx + c) we have f (Xx + LL) dx / f/ X \ // 9,7,fc) y dx V(ax + 2bx = ( ) V(/(ax + 2bx + c) + 7 V(ax2 + 2bx + c)' where y = - (Xb/a). In the last integral a may be positive or negative. If it is positive we put xa + (b//a) = t, when we obtain 1 f dt Va J (t2 + K) where = (ac-b2)/a. If a is negative we write A for -a and put x VA - (b//A) = t, when we obtain 1 f dt (-a) J (- _ -t2)

Page 230

230 DERIVATIVES AND INTEGRALS [VI It thus appears that in any case the calculation of the integral may be made to depend on that considered in ~ 118, viz. one or other of the three integrals J(t+ J (t2 -_ 2) J2 /V(a2 _ t2)' 120. The integral f(Xx+fL)V'(ax2+2bx+c)dx. In exactly the same way we find f(Xr+p) (ax2 + 2bx+c)dx= (X/3a) (ax2+2bx+ c)3/2 + y j(aX2 + 2bx + c) dx; and the last integral may be reduced to one or other of the three forms V(t2 + a2) dt, J(t2 - a2) dt, J(a2 -t2) dt. In order to obtain these integrals it is convenient to introduce at this point another general theorem in integration. 121. Integration by parts. The theorem of integration by parts is merely another way of stating the rule for the differentiation of a product (~ 94). It follows at once from Theorem (3) of ~ 94 that f'(x) F(x) dx =f(x) F(x) - f f(x) F' (x) dx. It may happen that the function which we wish to integrate is expressible in the form f'(x) F(x), and that f(x) F'(x) can be integrated. Thus suppose that b (x) = xr (x), where J (x) is the second derivative of a known function X(x). Then f (x)dx = xx"(x)dx = xx'(x)- X'(x)dx = x'(x) - X(x). We can illustrate the working of this method of integration by applying it to the integrals of the last section. Taking f(x)=ax+b, F(x)=V(ax2+2b6x+c)=y, we obtain ayd = (ax+b)y- (ax+ b) d-= (a+b)y - aJc d + (ac - b2)j-, so that fYdx =- y+b - b2d and we have already seen (~ 118) how to determine the last integral. and we have already seen (~ 118) how to determine the last integral.

Page 231

119-121] DERIVATIVES AND INTEGRALS 231 Examples LII. 1. Prove that, if a>O, /(x2 + a2) dx= I V(x2 +a2) + a2 log (x + (x2 + a2)}, JV(x 2-a2) dx= x (x2 - a2) - a2 log { + /(2 - a2), /(a2 - x2) dx= xV(a2 - x2) + a2 arc sin (x/a). 2. Calculate the integrals f/( - d 2) f(a2 _x2) dx by means of the j /(a2 - xys substitution x=a sin 0, and verify that the results agree with those obtained in ~ 118 and Ex. 1. 3. Calculate fx(x+a)mndx, m being any rational number, in three ways, (i) by integration by parts, (ii) by the substitution (x+a)m=t, (iii) by writing (x+a)-a for x; and verify that the results agree. 4. Prove, by means of the substitutions ax+-b= lt and x = /u, that (in the notation of ~ 118) [dx ax +b [xdx bx+c y3 z ]~-,y J, j =- y ' 5. Integrate 1/{(1+x)x)}, Vx/(l+x),' 1/{xJ(l+x)}, x/l(1+x), 1/V{x(l+x)}, 1/^{I,(x-1)}, 11/J{x(1-x)}, /J(l+x)/(x1-)}, x/( (a+bx), xl2/J(a + bx), l/{X,/(X2+ a2)}, 1/{x2N/(X2-a2)}, 1/{x3J(a2-X2)}, x3//(a2- X2). 6. Integrate l//{(x - a) (b - x) in three ways, (i) by the methods of the preceding sections, '(ii) by the substitution (b-x)/(x-a)=t2, (iii) by the substitution x= a cos2 + b sin2 0; and verify that the results agree. 7. Integrate /{(x - a) (b - x)} and /{(b - x)/(x - a)). 8. Show, by means of the substitution 2x+a+ b= =(a-b)ft2+(1/t2)), or by multiplying numerator and denominator by /,(x+a)- V(x+b), that, if a>b, iJ(x + a) + /(x + b)= l,(a - b) {t+ (1/3t3)}. 9. Find a substitution which will reduce (x+a)3/2 (x a)3/2 to the integral of a rational function. (Math. Trip. 1899.) 10. Show that JR x, /(ax+b)>dx is reduced, by the substitution ax+b=y", to the integral of a rational function. 11. Prove that ff" (x) F(x) d =f' (x) F(x) -f (x) F' () +ff(x) F" x) dx, and generally f () () F(x) dx =f (1-1) (x) F(x)-f (n)(x) F' (x) +... + (-1 )nf () ) (x) dx.

Page 232

232 DERIVATIVES AND INTEGRALS [VI 12. Hence integrate x"n q( + 1)(x), i.e. x't (x), where +, (x) is a function which can be integrated n+1 times. In particular integrate xn(ax.+b)p, n being a positive integer, and p any rational number other than - 1. 13. The integral f(l1+x)Pxqd, wherep and q are rational, can be found in three cases, viz. (i) if p is an integer, (ii) if q is an integer, (iii) if p+q is an integer. [In case (i) put x=u8, where s is the denominator of q: in case (ii) put 1 +x=ts, where s is the denominator of p: in case (iii) put 1 +x=xt8, where s is the denominator of p.] 14. The integral fx1 (ax + b)qdx can be reduced to the preceding integral by the substitution aXn=bt. [In practice it is often most convenient to calculate a particular integral of this kind by a fornmula of reduction (v. Misc. Ex. 40).] 15. The integral R{x, I(ax+b), (cx +d)} dx can be reduced to that of a rational function by the substitution 4x= - (b/a) {t + (1/t)}2 - (d/c) {t - (1/t)}2. 16. Show how to calculate the integral / d(ax2+2 by means j(x -p) \(a2 +- 2bx + c) of the substitution x -p = l/t. 17. Show by means of the substitution y= /(ax2+2bx+c)/(x-p) that J dx _ dy (x-p) (ax2+2bx+c) - /{Xy2-/}' where X=ap2 + 2bp+c, = ac- b2; and hence evaluate the integral. 18. Calculate the integrals of 1/{(X- 1), /(X2+1)}, l/{(x+ l)(1 + 2 - X2)} by means of each of the preceding methods, and verify the agreement of the results -. 19. Reduce JR(x, y)dx, where y2(x-y)=-x2, to the integral of a rational function. [Putting y=tx we obtain x=1/{t2(1 -t)}, y= l/{t( -t)}.] 20. Reduce the integral in the same way when (a) y (x - y)2 =x, (b) (x2 + y2)2 = a2 (x2 - y2). [In case (a) put x - =t: in case (b) put 2 +y2 = t ( - y), when we obtain x= a2t (t2+ a2)/(t4 + a4), = a2t (t2 -a2)/(t4 + a4).] 21. If y(x- y)2=x then fx x log{(a -y)2 -} 22. If (x2 +y2)2 = c2(x2 -y2) then I (x -2 +) - cg (Xo * See also Misc. Exs. 33 et seq.

Page 233

122-124] DERIVATIVES AND INTEGRALS 233 122. Transcendental Functions. Owing to the immense variety of the different classes of transcendental functions, the theory of their integration is a good deal less systematic than that of the integration of rational or algebraical functions. We.shall consider in order a few classes of transcendental functions whose integrals can always be found. 123. Polynomials in cosines and sines of multiples of x. We can always integrate any function which is the sum of a finite number of terms such as A (cos ax)m(sin ax)rn'(cos bx)(sin bx)n'... where m, m', n, n',... are positive integers and a, b,... any real numbers whatever. For such a term can be expressed as the sum of a finite number of terms of the types a cos{(pa + qb +...)x}, /3sin {(p'a + q'b +...)} and the integrals of these terms may be at once written down. Examples LIII. 1. Integrate sin3 x cos22x. In this case we use the formulae sin3 x (3 si - n x - sin 3x), cos 2x (1 + cos 4). Multiplying these two expressions and replacing sin x cos 4x, for example, by (sin 5x - sin 3x), we obtain l [1(7 sin x- 5sin 3x+3 sin 5 - sin 7) dx = - - COS + -4 COS 3x - cos 5x + 1 - COS 7x. The integral may of course be obtained in a different form by different methods. For example f sin3 x cos2 2xdx= (4 cos4- 4 cos2 x+ 1) (1 -cos2 x) sin xdx, which reduces, on making the substitution cos = t, to (4t6 - 84 + 5t2 - 1) dt = cos7 x - 8 cos5 x + cos3 x- cos x. It may of course be verified that this expression and the integral already obtained differ only by a constant. 2. Integrate by any method cos ax cos bx, sin ax sin bx, cos ax sin bx, cos2 x, sin3 x, cos4 x, cos x cos 2x cos 3x, cos co s x 2x, cos3 2x sin2 3x, cos5 x sin7 x. [In cases of this kind it is also sometimes convenient to use a formula of reduction (Misc. Ex. 40).] 124. The integrals n cos xdx, f sin xdx and associated integrals. The method of integration by parts enables us to

Page 234

234 DERIVATIVES AND INTEGRALS [VI generalise the preceding results. For f/x cos xdx = x sin x - n - sin dx, f x sin x dx= - xn cos x + n Xn-l cos dx, and clearly the integrals can be calculated completely by a repetition of this process. It follows that we can always calculate f x cos ax dx, | x sin axdx; and so by a process similar to that of the preceding paragraph, we can calculate P (x, cos ax, sin ax, cos bx, sin bx,...) dx, where P is any polynomial. Examples LIV. 1. Integrate x sin x, x2 cos x, (x cos x)2, (x sin n sin x)2 x sin2 x cos4 x, (x sin x)3. 2. Find polynomials P and Q such that J[(3x- 1) cosx+(l -2x) sinx] dx=Pcosx+ Qsinx. 3. Prove that Jfn cos x dx = Pn cos x + Q, sin x, where P,=nx*_-l- z (n-l)(n-2) X"-3+..., Q=x (n (- 1) n-2+.... 125. Rational Functions of cos x and sin x. The integral of any rational function of cos x and sin x may be calculated by the substitution tan x x= t. For - t2 2t dx 2 cos T s= + t2 = + i t2' dt + t2' so that the substitution reduces the integral to that of a rational function of t. Examples LV. 1. Prove that fsec x dx = log (sec x + tan xI, cosec x dx= log tan Ix. [Another form of the first integral is log tan (I7r + x); a third form is - log {(1 +sin x)/(1 - sin x)}.] 2. tan x dx - log cos x, cotx dx=log sin x, sec2x dx =tanx, cosec2 xdx=- cot x, tan x sec x dx= sec x, cot x cosec x dx- cosec x [These integrals are included in the general form, but there is no need to use a substitution, as the results follow at once from ~ 100.]

Page 235

124-126] DERIVATIVES AND INTEGRALS 235 3. Show that the integral of l/(a+ b cosx), where a +b is positive, may be expressed in one or other of the forms 2 tan //a - -b\ 1 1f NI(b + a) + t V(b - a) /(a2 b2)arc tan a b',(b2 a2) g (b + a) - t f(b - a) where t=tanrx, according as a2 b2. If a2=b2 the integral reduces to a constant multiple of that of sec2 x or cosec2 1, and its value may at once be written down. Deduce the forms of the integral when a+b is negative. 4. Show that if y is defined in terms of x by means of the equation (a +b cosx) (a- bcosy)=a2 -b2, where a2> b2, then as x varies from 0 to 7r one value of y varies from 0 to 7r. Show also that %/(a2 - b2) sin y sin x dx siny a-bcosy ' a+bcosxdy a-bcosy' and deduce that, if 0 < x < r, dx 1 a cos x+ b I -- 7 --- = / arc cos - I a + b cos x,/ (a2 - b2) ar \a+b cos b x Show that this result agrees with that of Ex. 3. 5. Show how to integrate 1/(a + b cos x + c sin x). [Express b cos x + c sin x in the form d/(b2+c2) cos (x - a).] 6. Integrate (a + b cosx+ c sinx)/(a + cos x + y sinx). [Determine X, ~L, v so that a +bcos +csin X -X+ (a + cos x+ysinx)+r ( -3 sinx+ 7 cos x). Then the integral is ^dx ix+v log(a+B3cosx+ysinx)+Xj a+co +.] a + 2 cos x+y sin 7. Integrate 1/(5 + 3 cos x), 1/(3 - 5 cos x), 1/(2 -sin x), 1/(1 - cos x + 2 sin x), (5 + 3 cos x - 7 sin x)/(ll - cos x+ sin x). 8. Integrate 1/(a cos2 x + 2b cos x sin x + c sin2 x). [The subject of integration may be expressed in the form 1/(A+B cos2x+Csin2x), where A=-(a+c), B=-(a-c), C=b: but the integral may be calculated more simply by putting tan x = t, when we obtain [ sec2 x dt a a+2btanx+ctan2x j a+2bt+ct2 ] 126. Integrals involving arc sin x, arc tan x, and log x. The integrals of the inverse sine and tangent and of the logarithm can easily be calculated by integration by parts. Thus arc sin xdx = arc sin x - ( d ) = x arc sin x (1 ) farc tan x dx = x arc tan x - x ----- = x arc tan x - log (1 + x2), log xdx= x log - dx x (logx- 1).

Page 236

236 DERIVATIVES AND INTEGRALS [vi It is easy to see that if we can find the integral of y =f(x) we can always find that of x = <(y), ( being the function inverse to f. For on making the substitution y =f(x) we obtain f (y) dy=jf '(x) dx = xf (x) -jf(x)dx. The reader should evaluate the integrals of arc sin y and arc tan y in this way. Any integrals of the form P (x, arc sin x) dx, (x log x)dx where P is a polynomial, can be found. Take the first form, for example. We have to calculate a number of integrals of the type fxim(arcsin x)dx. Making the substitution x=siny we obtain I y sinmy cos ydy, which can be found by the methods of ~ 124. In the case of the second form we have to calculate a number of integrals of the form fx (log x)l dx. Integrating by parts we obtain /* /pm~l {}r~rf r rf, C xm (log x)n dx= (log x) 1- xm (log x)n-ldx, J M ~ 1 m +1 I and it is evident that by repeating this process often enough we shall always arrive finally at the complete value of the integral*. 127. Areas of plane curves. One of the most important applications of the processes of integration which have been explained in the preceding sections is to the calculation of areas of plane curves. Suppose that POPP' (Fig. 53) is the graph of a continuous curve y = ((x), P being the point (x, y) and P' the point (x + h, y +k) and h being either positive or negative (positive in the figure). The reader is of course familiar with the idea of an 'area,' and in particular with that of an area such as ONPPo. This idea we * A more general account of the problem of integration (~~ 111-126) will be found in Goursat's Cours d'Analyse or the author's tract quoted on p. 225. The reader may also be referred to the text-books of Profs. Lamb and Gibson, to Prof. Greenhill's A Chapter in the Integral Calculus, and a paper by Mr Bromwich in vol. xxxv of the Messenger of Mathematics.

Page 237

126, 127] DERIVATIVES AND INTEGRALS 237 shall at present take for granted. It is indeed one which needs and has received the most careful mathematical analysis: later on pi P R PO P R FIG. 53 a. O NI N N' FIG. 53. we shall return to it and explain precisely what is meant by ascribing an 'area' to such a region of space as ONPPo. For the present we shall simply assume that this and similar regions have associated with them definite numbers which we call their areas, and that these areas can be compounded in the manner indicated by common sense-e.g. that (area PRP') + (area NT1'RP) = (area NN'P'P) and so on. Taking all this for granted it is obvious that the area ONPPo is a function of x; we denote it by 4((x). Also ((x) is a continuous function. For 1 (x + h) - D (x) = (area NN'P'P) = (area NN'RP) + (area PRP') = hc (x) + (area PRP'). As the figure is drawn, (area PRP')< h. RP'=hk. This is not however necessarily true in general, because it is not necessarily the case (see for example Fig. 53 a) that the arc PP' should rise (or fall) steadily from P to P'. But the area PRP' is always less than jhlXh, where Xh is the greatest distance of any point of the arc PP' from PR. Moreover, since +(x) is a continuous function, Xh —0 with h. Thus we have,(x I + A) - (x) = h {(x) + 1h4,

Page 238

238 DERIVATIVES AND INTEGRALS [vI where I/,h^ < Xh and Xh-O as h-O0. From this it follows at once that 4 (x) is continuous. And not only so but qV (x) = lim -(x + h) (x) = lim {+ (x) + t = b (x). h —0 h h-O Thus the ordinate of the curve is the derivative of the area, and the,area is the integral of the ordinate. We are thus able to formulate a rule for determining the area ONPPO. Calculate i (x), the integral of f((x). This involves an arbitrary constant, which we suppose so chosen that (0)= 0. Then the area required is 4(x). If it were the area NilVPP1 which was wanted, we should of course determine the constant so that (x1) )=0, where xl is the abscissa of P1. 128. Lengths of curves. The notion of the length of a curve (other than a straight line) is in reality a more difficult one even than that of an area. In fact the assumption that OP (Fig. 53) has a definite length (which we may denote by S (x)) does not suffice for our purposes, as did the corresponding assumption about areas. In fact we cannot even prove that S(x) is continuous-i.e. that lim{S(P') -S(P)}=0. This looks obvious enough in the large figure, but less so in such a case as is shown in the smaller figure. Indeed it is not possible to proceed further (with any degree of rigour) without a careful analysis of precisely what is meant by the length of a curve. It is however easy to see what the formula must be. The assumption that the curve has a length leads to the equation {S(x + h) - S(x)}/h = (arc PP')/h = {(chord PP')/h} x {(arc PP')/(chord PP')}. Now PP'= V(PR2 + RP'2)= h (1+ 2) and, if we assume that the curve has a tangent whose direction varies continuously, ki = ~ (x + h) - O (x) = h/'(f) where: lies between x and x + h. Hence lim (PP'/h) = lim / {l + [<' (:)]i = j / l + [' (x)].

Page 239

127, 128] DERIVATIVES AND INTEGRALS 239 If also we assume that lim {(arc PP')/(chord PP')} = 1 we obtain the result S' () = lim {S(x + h) - S (x)}/h = V/ 1 + [' (x)]2} and so S(x) = / {1 + [' (x)]2} d*. Examples LVI. 1. Calculate the area of the segment cut off from the parabola y2-4ax by the ordinate x=4, and the length of the arc which bounds it. [The area is 2 fY(4ax)dx=g3 /(a) and the arc is 2 /V{l+ - dx. Also fv dx= V{x+adx)} =x x + a) I a 1+a) X/( fx~c~ Jx, (x + a)j f =^/{xDx~a)}+ 2 IJ (d + a)j ==Vx (x + a)} + a log {VJx +,( + a)},;so that the length of the arc is 2s/{ (a + f)} + 2a log [V(f/a) + J{(a + $)/a}]. In particular, the length of the arc from one end of the latus rectum to.another is 2a {^/2 +log (1 +,2)}.] 2. Answer the same questions for the curves y= 2/4a, ay2=3, showing that the length of the arc of the latter curve, from the origin to the point for -which x=6, is (j8a) [{1 +(9$/4a)}3/2 -1]. 3. Calculate the areas and lengths of the circles a2+y2=a2, x2+y2=2ax by means of the formulae of ~~ 127-8. 4. Show that the area of the ellipse (x2/a2) +(y2/b2)= 1 is 7rab. 5. Find the area bounded by the curve y=sin x and the segment of the -axis of x from x=0 to x=27r. [Here @ (x) = - cos x, and the difference between the values of - cos x for.x=0 and x=27r is zero. The explanation of this is of course that between.x=Tr and x=27r the curve lies below the axis of x, and so the corresponding part of the area is counted negative in applying the method. The area from.x=0 to x = r is - cos 7r+cos 0=2; and the whole area required, when every part is counted positive, is twice this, i.e. is 4.] 6. Show that the curves c2y2=x2(x-a)(b -), (a2+x2)y2=(a2- 2)x2, (a_ -)y2=(a+x)X2, and a2(y-x)2=(a+x)3(a-x) each consist of a single loop, and find the area of each curve. * For some discussion of the question of the existence of areas and lengths see Ch. VII, ~ 137 et seq.

Page 240

240 DERIVATIVES AND INTEGRALS [vI [Consider the last curve, for example. Here y =x + (l/a) (a +x) /(a2 - x2). The curve lies entirely between the lines x= -a and x= +a (Fig. 54). The area of the curve is 1 (a) - 4) (- a), where q (x)= f(y -y2)dx= (2/a) f(a+x) '(a2 - x2)dx. Putting x= a cos 0, we find without difficulty that the area of the curve is a27r.] Y " O/ X FIG. 54. 7. Suppose that the coordinates of any point on a curve are expressed as functions of a parameter t by equations of the type x=q)(t), y=+k(t), p and 5+ being one-valued and differentiable functions of t. Prove that if, as t varies from to to ti, x steadily increases, the area of the region bounded by the corresponding portion of the curve, the axis of x, and two ordinates, is t +(t) p'(t)dt or j y ddt. 8. Suppose that C is a closed curve formed of a single loop and not met by any parallel to either axis in more than two points. And suppose that the coordinates of any point P on the curve can be expressed as in Ex. 7 in terms of t, and that, as t varies from to to tl, P moves in the same direction round the curve and returns after a single circuit to its original position. Show that the area of the loop is equal to the difference of the initial and final values of any one of the integrals -f ~ dr, /'dy, - t dy d2,A dt dt dt dt dt, this difference being of course taken positively. 9. Apply the result of Ex. 8 to determine the areas of the curves given by (i) x/a=(1-t2)/(l+t2), y/a=2t/(l+ t), (ii) x=acost, y=bsint, (iii) x=acos3t, y=bsin3t. Determine also the perimeters of the first and third curves (for the second see Ex. 15).

Page 241

128] DERIVATIVES AND INTEGRALS 241 10. Find the area of the loop of the curve x3+-y-3axy. [Putting y=tx we obtain x= 3at/(l+ t3), y=3at2/(l + t3). As t varies from 0 towards + oo the loop is described once. Also dx dy\ dt = - 1 d (y\ - 9a2t2 3a2 -X dt=- 2 dti dt= 2 (l+t3)2t2(l+t3), which tends to 0 as t- o o. Thus the area of the loop is -aa2.] 11. Find the area of the loop of x5+-y5=Sax2y2. 12. Find the length of one arch of the curve x= a (t - sin t), = a (1 - cos t), and the area included between the arch and the axis of x. 13. Find the area common to the two parabolas y2= 4ax, x2= 4ay. 14. The whole area between the curve y (a2 + x2)4= a54 and the positive half of the axis of x (i.e. the limit as -a-oo of the area between the curve, the axis and the ordinate x=$) is raa2/32. 15. The arc of the ellipse given by zx=acost, y=bsint, between the points t=t1 and t=t2, is F(t2)-F(tl), where F(t)=af(1- e2 sin2t) dt (e being the eccentricity). [This integral cannot however be evaluated in terms of such functions as are at present at our disposal.] 16. Polar coordinates. Show that the area bounded by the curve r=f(0), where f(0) is a one-valued function of 0, and the radii = 01, 8=02, is F(02)-F(1), where F(O)= r2dO. And the length of the corresponding arc of the curve is, (02) - 4 (01), where I~- dd o Hence determine (i) the area and perimeter of the circle r=2asinO; (ii) the area between the parabola r = I 1sec2 IO and its latus rectum, and the length of the corresponding arc of the parabola; (iii) the area of the limagon r==a+bcos8, distinguishing the cases in which a> b, a=b, and a<b; and (iv) the areas of the ellipses l/r =acos + 2hcos sin +b sin2 and l/r= -l+ecos0. [In the last case we are led to the integral (1+ de s)2 which may be calculated (cf. Ex. Lv. 4) by the help of the substitution (1 +e cos )(1 - e cos b)=-1 -e2.] 17. Trace the curve 20=(a/r) +(r/a), and show that the area bounded by the radius vector 0=f and the two branches which touch at the point r=a, 0=1 is 2a2 (2- 1)3/2. (Math. Trip. 1900.) 18. A curve is given by an equation p=f(r), r being the radius vector and p the perpendicular from the origin on to the tangent. Show that the calculation of the area of the region bounded by an arc of the curve and two r dr radii vectores depends upon that of the integral p d H.A2 _ 12)6 H. A. 16

Page 242

242 MISCELLANEOUS EXAMPLES ON CHAPTER VI MISCELLANEOUS EXAMPLES ON CHAPTER VI. 1. Denoting a, ax+b, ax2+2bx+c,... by uo, U1,? 2,..., show that Uo02 3 - 3UoU1U2+ 2u,13 and t0 t4- 4tl i3 + 3U22 are independent of x. 2. If ao, a1,..., a2, are constants and U,.=(ao, al,..., a,,x, 1)', then UO U2 -2n U1 U 2n- (2n- ) U2 U-2.+ 2n UO is independent of x. (Math. Trip. 1896.) [Differentiate and use the relation U.'=rUy._i.] 3. The first three derivatives of the function arc sin (t sin ) - x, where,u > 1, are positive for 0 _ x _ ~rr. 4. The constituents of a determinant are functions of x. Show that its differential coefficient is the sum of the determinants formed by differentiating the constituents of one row only, leaving the rest unaltered. 5. Ifif, f2, f3,f4 are polynomials of degree not greater than 4, then fl f2 f3 f4 Af' f2' f' 'f4 I f" f211" f3 A" ]fi... fo /i f3ll f4111 is also a polynomial of degree not greater than 4. [Differentiate five times, using the result of Ex. 4, and rejecting vanishing determinants.] 6. Ify3+3yx+-2x3=0 then x2(1+X3)y" -xy'+y=O. (Math. Trip. 1903.) 7. Verify that the differential equation y=/ {-(yl)}+){x- (yi)}, where yl is the derivative of y, and + is the function inverse to O', is satisfied by y = + (c) + (x - c) or by y= 2q (~x). 8. Verify that the differential equation y = {x/+ (Yi)} k ({+ (y)}, where the notation is the same as that of Ex. 7, is satisfied by y= c) (x/c) or by y- = x, where 3= 0b(a)/a and a is any root of the equation q (a) - a'(a) =O. 9. If ax+by+c=O it is clear that y2=0 (suffixes denoting differentiations with respect to x). We may express this by saying that the general differential equation of all straight lines is y2=0. Find the general differential equations of (i) all circles with their centres on the axis of x, (ii) all parabolas with their axes along the axis of x, (iii) all parabolas with their axes parallel to the axis of y, (iv) all circles, (v) all parabolas, (vi) all conics. [The equations are (i) 1 +y12+yy2=0, (ii) y12+YY2=0, (iii) y3=0, (iv) ( +y2)y3 =3yly22, (V) 5y32=3Y2Y4, (vi) 9y22yS - 45Y2Y3Y4+40Y33=O. In each case we have only to write down the general equation of the curves in question, and differentiate until we have enough equations to eliminate all the arbitrary constants.]

Page 243

MISCELLANEOUS EXAMPLES ON CHAPTER VI 243 10. Show that the general differential equations of all parabolas and of all conics are respectively DX2 (Y2 - 2/3) =, Dx3 (2 - 23) = 0. [The equation of a conic may be put in the form y= ax+ b +~ (p2 +2qx + r). From this we deduce Y2 = + (pr - 2)/(px2 + 2qx + r)3/2 If the conic is a parabola, p = 0.] _y I d2y d3y 1 d?4y 11. Denoting dy 2d2y dB d3 d X... by t, a, b, c,... and dx' 21dx2' 3!-dx' 4!dx4 dx 1 d2x 1 d3x 1 d4x dy 2! dy2 3! dy3 4! dy4' '. by T, a, A, y,., show that 4ac - 5b2= (4ay - 5/32)/r8, bt- a2= - (3r - a2)/r6. Establish similar formulae for the functions a2d - 3abc - 2b3, (1 +t2)b -2a2t, 2ct- 5ab. 12. If Yk is the kth derivative of y=sin (n arc sin x), prove that (1 - 2) k+2- (2k+1)xyk + +(n2 -k2) Yk=0. [Prove first for k=O, and differentiate k times by Leibniz's Theorem.] 13. Prove the formula vDXn u = DXn (~v) - -nDx.1 (mDxv) + DXn - 2 (uDs V)...... where n is any positive integer. [Use the method of induction.] 14. A curve is given by x=a(2cost+cos2t), y=a(2sint-sin2t). Prove (i) that the equations of the tangent and normal, at the point P whose parameter is t, are xsin t+ycost =asin 3 t, xcos t-y sin t=3acos 3t; (ii) that the tangent at P meets the curve in the points Q, R whose parameters are - ~t and Tr- t; (iii) that QR=4a; (iv) that the tangents at Q and R are at right angles and intersect on the circle x2+y2=a2; (v) that the normals at P, Q, and R are concurrent and intersect on the circle x2 + y2= 9a2; (vi) that the equation of the curve is (X2 +y2 + 12ax + 9a2)2 = 4a (2x + 3a)3. Sketch the form of the curve. 15. Show that the equations which define the curve of Ex. 14 may be replaced by 5/a=2u+(1/u2), i7/a=(2/u)+u2, where =x=+yi, -=x-yi, u= Cis t. Show that the tangent and normal, at the point defined by u, are u2- wur =a(u3-1), u2 +uq 7=3a(u3+ 1), and deduce the properties (ii)-(v) of Ex. 14. 16. Show that the condition that x4+4px3- 4qx-1=0 should have equal roots may be expressed in the form (p + q)2/3 -(p _ q)2/3= 1. (Math. Trip. 1898.) 16-2

Page 244

244 MISCELLANEOUS EXAMPLES ON CHAPTER VI 17. Show that the equation sin x= ax will have a repeated root if a = cos 4, where 4 is any root of the equation tan x=x. Determine roughly the values of a which satisfy this condition. 18. Investigate the maxima and minima of f(x), and the real roots of f(x)=0, f(x) being either of the functions x- sin x - tan a (1 - cos x), x - sin x- (a-sin a) - tan a (cos a - cos ), and a an angle between 0 and 7r. Show that in the first case the condition for a double root is that tan a-a should be a multiple of -r. 19. Show that by choice of the ratio X: we can make the roots of X (aX2+ bx + c) + - (a'2 + b'x + c') =0 real and having a difference of any magnitude, unless the roots of the two quadratics are all real and interlace; and that in the excepted case the roots are always real, but there is a lower limit for the magnitude of their difference. (Math. Trip. 1895.) [Consider the form of the graph of the function (ax2 + bx + c)/(a'x2 + b'x + c'): cf. Exs. XLVIII. 19-22.] 20. A sheet of paper is folded over so that one corner just reaches the opposite side. Show how to fold the paper in this way so that the length of the crease shall be a maximum. 21. The greatest acute angle at which the ellipse (x2/a2) + (y2/b2)= 1 can be cut by a concentric circle is arc tan ((a2 - b2)/2ab}. (Math. Trip. 1900.) 22. In a triangle the area A and the semi-perimeter s are fixed. Show that any maximum or minimum of one of the sides is a root of the equation s ( -s) 2+4A2 =0. Discuss the reality of the roots of this equation, and whether they correspond to maxima or minima. [The equations a + b + c = 2s, s (s - a) (s - b) (s - c) = A2 determine a and b as functions of c. Differentiate with respect to c, and suppose da/dc=O. It will be found that b=c, s-b=s-c= -a, from which we deduce that s (a- s) a2 — 42 = 0. This equation has three real roots if s4> 27A2, and one in the contrary case. In an equilateral triangle (the triangle of minimum perimeter for a given area) s4=27A2; thus the case of s4<27A2 cannot occur. Hence the equation in a has three real roots, and since their sum is positive and their product negative, two roots are positive and the third negative. Of the two positive roots one corresponds to a maximum and one to a minimum.] 23. The area of the greatest equilateral triangle which can be drawn with its sides passing through three given points A, B, C is 2A + (1/2V3) (a2 + b2+c2), a, b, c being the sides and A the area of ABC. (Math. Trip. 1899.) 24. If A, A' are the areas of the two maximum isosceles triangles which can be described with their vertices at the origin and their base angles on the cardioid r=a(1 + cos 0), prove that 256AA' = 25a4 V5. (Math. Trip. 1907.) [One of A, A' will be found to be negative, and algebraically a minimum.]

Page 245

MISCELLANEOUS EXAMPLES ON CHAPTER VI 245 25. Find the limiting values which (2- 4y + 8)/(y2- 6x +3) approaches as the point (x, y) on the curve x2y-4x2-4y+y2+16x-2y-7=0 approaches the position (2, 3). (Math. Trip. 1903.) [If we take (2, 3) as a new origin, the equation of the curve becomes 2r - 62+ 2 = 0, and the function given (2 + 4 - 4q)/(q2+ 6q-6 ). If we put 7 = t we obtain =(l - t2)/t, 7=1 - t2. The curve has a loop branching at the origin, which corresponds to the two values t= + 1. Expressing the given function in terms of t and making t tend to + 1 we obtain the limiting values —, -.] 26. If f(x)={l/(sinx-sina)}-{1/(x- a)}seca, then d { lf(a -im f(aX)} - lima s (x) a se a sea. da Ac4 xcc2a (Math. Trip. 1896.) 27. Show that if b (x).=l /(l + 2), n(n) (x) /=Q (l/ +x2) 1n+l, where Q (x) is a polynomial of degree n. Show also that (i) Qn+ 1 = (I1 +2) Qn-2 (n + 1) x Q,, (ii) Qn+2+2(n+2) Q+l1+(n+2)(n+l)(l+x2)Q,=0, (iii) (1 +x2) Q"- 2x Q' + n (n + 1) Q, = 0, (iv) n=a(-l t r o ae rl ad s d by t e of (v) all the roots of Q,=0 are real and separated by those of Q,-,. [To prove (ii) differentiate the equation (1+x2) q(x)=l n+2 times by Leibniz's theorem: (iii) follows from (i) and (ii); and (iv) can be deduced from (iii) or by writing q (x) in the form (1/2i)[{1/(x-i)} - {l/(x+i)}] before differentiating. Finally, to prove (v), we observe that, when Q = 0, Qn+ has the sign of Q', by (i), and that the sign of Q,, when x is numerically large is that of ( - 1)1/n + 2.] 28. If f(x), q (x), 4 (x) have derivatives for a - < b, there is a value of ~, lying between a and b, and such that f(a) +(a),(a) =0. f(b) (b) +(b) f'(t) ''(~) ~,'(&) [Consider the function formed by replacing the constituents of the third row by f (x), 4 (x), + (x).] 29. Deduce from Ex. 28 the formula {/(b) - f (a)}/{+ (b) - (a)} =f'(~)/q' ($). 30. If 0'(x)-a as x-coo, then k(x)/x-z-a. If '(x) —oo (or — oo), then q (x) —oo (or - o). [Use the formula q (x) - f (0o) = (x- Xo) ' ($), where xo< < x.] 31. If p(x) ---a as x --- cc, k'(x) cannot tend to any limit other than zero.

Page 246

246 MISCELLANEOUS EXAMPLES ON CHAPTER VI 32. If ( (x) + q'(x)- a as x-+cco, then (x).->-a and q'(x)-O. [Let ( (x)=a++(x), so that +(x)++'(x) ---0. If +'(x) is of constant sign, say positive, for all sufficiently large values of x, then +(x) steadily increases; thus +(,x) tends to a limit I or to +o0, and so +'(x) to -I or - oo, which is impossible (Exs. 30, 31) unless 1=0. But if +'(x) changes sign for infinitely many values of x, these are the maxima and minima of + (x); at these + (x) + '(x) is very small (when x is large), and +'(x) = 0, so that + (x) is very small. A fortiori are the other values of + (x) very small when x is large.] 33. Prove that dx 2 (X -P) {(x -p) (x- q)} - q-p V x p 34. Show that f- x, where y2=ax2+2bx +c, may be expressed in 3 S w- x o ) y h one or other of the forms 1 log +axxo+b(x+xo)+c+yyo} I 1 axxo +b (x + X0) + c +3Yo} j arc tan{a X + b ( Yo X - x0o z yzO according as axo2+2bxO+c is positive and equal to y02 or negative and equal to - z02. In the first case the ambiguous sign is to be chosen so as to make the contents of the bracket positive. [Put x- x=1/t, when we obtain J dx f dt (X - o)y j f{a+2 (axo + b) t +(ao2+2bxo +) t2'] 35. If ag2+ ch2= -v < 0, show that f dx 1 x (+- arc tan )l" (hx +g)/(aX2 + c) arctan ch - agx 36. Show that the integral ( ) is reduced to that of a rational J(ax2 + c) function by the substitution x/l/(ax2 + c) = u. In particular show that rf _ dx _ du j (Ax2 + C) d(ax2 + c) + (cA - aC) 2' and hence evaluate the integral. [The integral ( x dx +C)(a ) is reduced to this form by putting (Ax2+ C) IJ(aX2 m+ c) x=l/y (or it may be calculated by putting x2=t). Hence we can calculate any integral of the type (Lx+M dx ( _+ ) in terms of x/^I(ax2+c) and jAx2 ~ / j(aX +C) 1/^/(ax2 + c).] 37. Show how to integrate A + 2Bx + C /(a + 2bx c)' when Ax + 2Bx + C= 0 has complex roots. [Put x=(1it + v)/(t+ 1), where /,, v are so chosen that alv+ b(l+rv)+c=O, A1zv+B(4+v)+C=0, i.e. are the roots of (aB-bA)62-(cA-aC)6+(bC-cB)=O. It is easy to

Page 247

MISCELLANEOUS EXAMPLES ON CHAPTER VI 247 verify that p and v are real and distinct: they are in fact the maxima and minima of the function (a2+2bx +c)/(Ax2+ 2Bx+ C) considered in Exs. XLVIII. 19 et seq. It will be found that the integral now reduces to one of the form fpt, + drt +, and so may be calculated in terms of (x- p)/y J pt2+ /(rt2 + S) I and (x - v)/y, where y = /(ax2 + 2bx + c) %.] 38. Show how to reduce IR x a/( + b\ / x+ dx to j [ \mmx+/n V mx-+n) the integral of a rational function. [Put mx+n= l/t and use Ex. LII. 15.] 39. Calculate the integrals: J dx f /x-l\ dx f dx (x-l)dx (l+ 2)3 J j \\\x+1 x' Jx.I/(x+2x +3) J(x2+4) v/(2+9)' f + dx f _ xdx (5x2 + 12x+ 8)J(5x2 + 2x - 7)' +x)- /(1 + x) f a /L2+ dxb^l ^ fcosec3 x f dx, dx r_________ _ dx xc~cx2 os x sin x + 3 dxI I cos -— x- -,+ sini I -A x d cosec x,(sec 2x) dx, J(2- sin2x) (2 + sin x- sin2 x) cos4 + sin4 s x, r dx rx+\-'&x -, r f ( _7r 7 - _ _ dxs arc sec x dx (arc siln x)2 dx, J {(l+sinx)(2+sinx}' J ' +cosx f ar sinxd Cfx arc sin x ar sin arc sin xd, xarcsinxdx, j _X), J> d2lSxJ) x, i,/(I - X2) d1 x2 J (( + x)Ix, ar tan x arc tan x tlog (a2 + d2x2) log (a+,x) 2 x j (1+ 2)3/2 x )Ja2 dx2 (a+ bx)2 40. Formulae of Reduction. (i) Show that dx _ 1 x+2p (X2+pC + q)n 2 (?,- 1)(q -jp2) (x2+}-px-)-1 2n-3 f dx + 2(n- l)(q - p2) (x +p.+ q)n-l' [Put x+ p=t, q- ~ p2=X: then we obtain f dt 1 dt 1 t2dt j (2+x= fx (t2+X) — X J(t2+x)-1 f dt 1 f 1 }at X (t2+X)n-1 2X (n-) It dt (t2+x)"- 1 and the result follows on integrating by parts. * The method fails if a/A = blB; but then the integral may be reduced by the substitution ax — b=t. See Stolz, Grundziige der Diff. und Int. Calc., bd. I, pp. 331 et seq.; Goodwin, Messenger of Mathematics, vol. xxxVII, pp. 104-6. Another method of reduction has been given by Prof. Greenhill; see his A Chapter in the Integral Calculus, pp. 12 et seq., and pp. 24 et seq. of the author's tract quoted on p. 225. Reference may also be made to the paper by Mr Bromwich quoted on p. 236.

Page 248

248 MISCELLANEOUS EXAMPLES ON CHAPTER VI A formula such as this is called a formula of reduction; it enables us to calculate O-x2+p+q) in terms of (+ )+, and so to evaluate +J (XI px^+ q)Y X(2+px + q) +l the integral whenever n is a positive integer.] (ii) If rp,= (1 +x)qdx, show that Ip, q / = 1(+ -1) XP +(1 + )q —{ql/(+l)}-lp+ 1,q_l, and hence that p, q can be calculated if q is a positive or p a negative integer. Obtain a similar formula connecting Ip, with -p_1,q+1, and show that Ip,q can be calculated whenever p or q is an integer positive or negative. Finally, by means of the substitution x= -y/(l +y), show that p, a=(-l)P+lf(Jy l+Py)-P —2d and that I, q can also be calculated whenever p + q is an integer. (iii) Show that if X=a + bx, JX-113 dx= - 3 (3a - 2bx) X2/3/0b2, f2X-13 dx= 3(9a2 - 6abx + 5b2x2) X2/3/40b3. fxl -11dx= -4 (4a - 3bx) X3/4/21b2, x2X-1/4 dx=4 (32a2 - 24abx + 21b2x2) X3/4/231b3. (iv) If,,,n= J(1 +2) then 2(n- 1)'m,= - xm- (1 + 2)-(n-1) +(m - 1)I-2, n-l. (v) If I,= cos 3x dx, J = x sin 3 dx, then f3n =- xt sin f/x- n J,-, 3J = - xn cos 3x + n I,-1. (vi) If I= cos x dx and J,= sinn"xdx, then nIn = sin x cosn- x+ (- 1) -_a2, nJ = - cos x sin"n —x + (n - 1) Jn-2. (vii) If I= ftan xdx, then n +Jn-2=(tan -lO)/( -l1). (viii) If m, n= cos' Xsinn x dx, then (m + 1) I, n-= -cosn+l x sinn-lx+(n- 1) 4m,n-2 = cos - 1 sin + lx + (mq- 1) Im -2,. [For (m+1) Im, n= -fsin -1 x d (cos"' + 1 x) dx cosn+1 x sinl x +(n-1) fcosm + 2 xsin-2 x dx = -cosm+l sin-1 x+(n- 1) (,n,- 2 -In,n), which leads to the first reduction formula: similarly for the second.]

Page 249

MISCELLANEOUS EXAMPLES ON CHAPTER VI 249 (ix) Connect I,n= fsim xsin nxdx with Im2,n. (Math. Trip. 1897.) (x) If Im, n = f (cosec x)n dx, then (- 1) (n - 2) Icn= (n - 2)2..s. _2 + M (t- l) Im-2, 1-2 - xm -1 {m sin x + (n - 2) x cos x} (cosec x) - 1. (Math. Trip. 1896.) (xi) If 1 = (a+ b cos x)-dx then (n - 1) (a2 - b2) I= - b sin x(a + b cos x)-("- 1) (2n - 3) al,_1 — (n - 2) I —2. (xii) If = (a cos2 x+2h cos x sin x+ b sin2 )-n d then 12n dx2J 2 (I+l)(ab-h2)I, 2-(2n+1-)(a+b)I+ 1+2nI = 2 d 2 (Math. Trip. 1898.) (xiii) If IMp>= fx (log x) dx, (m+ 1)I,,- xm + 1 (log x)n - nIn, 1. 41. If n is a positive integer the value of | (log x)n dx is xm+l f((log x)n _ n(logx)n-1 n (n - l) (log x)n-2 ([ m+l (m+1)2 (nm+1)3 (fm+i1)?L+1l 42. Show that if u and v are functions of x such that du/dx=-v, dv/dx=u, then u2+V2 is a constant; and that if =l1 and v=0 when x=0, then u = cos x and v = sin x. 43. Show that the most general function q (x), such that q"+ a2" =0 for all values of x, may be expressed in either of the forms A cos ax + B sin ax, p cos(ax+ ), where A, B, p, e are constants. [Multiplying by 20' and integrating we obtain q'2 + a242 =const. =a2b2, say, from which we deduce d - ax=.,(b2 - 2) ] 44. Determine the most general functions y and z such that y'+o z=0, z'- y =0, where co is a constant and dashes denote differentiation with respect to x. 45. Trace the curve for which x=2a(1 -t2)/(l+t2)2, y=4at/(1+t2)2. Show that the point (ty, -tx) lies on the normals at the two points t and - I/t; and deduce that the locus of their intersection is x2+y2=ax. Also find the area of the curve. 46. The area of the curve given by x=cos + {sin asin q/(1 -cos2asin2), y= sin b - {sinacos/(1 - cos2 a sin2 )}, where a is a positive acute angle, is 2rr (1 +sin a)2/sin a. (Math. Trip. 1904.) 47. The projection of a chord of a circle of radius a on a diameter is of constant length 2a cos 3; show that the locus of the middle point of the chord consists of two loops, and that the area of either is a2 ( - cos 3 sin 3). (Math. Trip. 1903.)

Page 250

250 MISCELLANEOUS EXAMPLES ON CHAPTER VI 48. If f(x, y) =(a, b,c,f,g, hYx,y, 1)2=0 is the equation of a conic, prove that d______x________ PT -alog PT (I +m y+ n) (/I + by +f ) PT+ where PT, PT' are the perpendiculars from a point P of the conic on the tangents at the ends of the chord Ix + my + n =0, and a, / are constants. (Math. Trip. 1902.) [We may, in the formulae of ~ 117, suppose that ($, 7) is one of the points in which Ix+ my +rn=0 cuts the conic (say that corresponding to T). Then lx + my +n= (x - ) + m (y- ) 2(+t) (G + Ft) a+ 2ht+ bt2 1 dx 2 hx+ by +f dt - a + 2ht+ bt2' and so dx + dt 1 /log +m (Ix x+my + n)(hx+by+f ) (I+mt) (G+Ft) ~ mG lF g G-+Ftj' But the equations of the chord and the tangent at ($, 77) are respectively lx+my+n=0, G(x -)+-F(y —7)=O; and, if PN is the perpendicular from P on to the chord, P Ix+fmy+n l1(.x- )+m(.y-q) /(12 +m 2) = (12 + m2) Similarly PT= G (x-) + F (y-)/ J (F2 + 2i) Also t=(y-r)/(w-$), and PT. PT'=k. PN2, where k is a constant. From these relations the result follows. It is not difficult to prove that (m G -F)2= a h g I ] h b f m g f c n I me n 0

Page 251

CHAPTER VII. ADDITIONAL THEOREMS IN THE DIFFERENTIAL AND INTEGRAL CALC ULUS. 129. Higher Mean Value Theorems. In the preceding chapter we proved (~ 106) that if f(x) has a derivative f'(x) throughout the interval (a, b), then f (b) - (a) = (b - a) f' a + 0 (b - a)} where 0 lies between 0 and 1; or, if b = a + h, f(a + )-J (a) = hf'((a + Oh)...............(1). This we proved by considering the function b-x f(b)-f(x) -b - a {f(b)-f(a)} which vanishes for x = a and x = b. Let us now suppose that f(x) has also a second derivative f"(x) throughout (a, b) (an assumption which of course involves the continuity of the first derivative f'(x)), and consider the function f(b) f - - (b - x)f '() - (b {f(b) - f(a - (ba - a)f'(a)}. This function also vanishes for x = a and x= b; and its derivative is 2 (b - x) (b - a)i {f(b) -f(a) - (b - a)f'(a) - (b - a)2() (b - a))2 and this must vanish (~ 1.02) for some value of x between a and b (exclusive of a and b). Hence there is a value: of x, between a and b, and therefore capable of representation in the form a+ 0(b- a), where 0 < 0 < 1, which is such that f(b) =f(a) + (b - a)f'(a) + (b - a)2f" (f).

Page 252

252 ADDITIONAL THEOREMS IN THE CALCULUS [VII If we put b = a + h we obtain the equation f(a + h) =f(a) + hf'(a) + lh2f"(a + Oh).........(2), which is the standard form of what may be called the Mean Value Theorem of the second order. The analogy suggested by (1) and (2) at once leads us to formulate the following theorem: Taylor's or the General Mean Value Theorem. If f(x) is a function of x which has derivatives of the first n orders throughout the whole interval (a, b), then f(b) =f(a) + (b - a)f'(a) + - a)! "(a) +.. (b - a)"' - +(b - a)n + (n-1) (a) + -!) (n (n - I) I ) where a < < b: or, if b = a +h, f(a + h) =f(a) + hf'(a) + h2f"(a) +... hLn-i h + (n- )f() (a) + f (n) (a + Oh) (n - 1i/! n1 v where 0 < 0 < 1. The proof proceeds on precisely the same lines as were adopted before in the special cases in which n = 1, 2. We consider the function (X ( -b-a n (a) where Fn (x) =f(b) -f(x) -(b - x)f' (x) - - (b - )2f"(x) -... -( f, 1 ) (b - x)n-lf(n-~i (X). This function vanishes for x = a and x = b; its derivative is (b - a)- n{ (b _ t-1 X(a)-1 -(b _a)f(~) ()}' and there must be some value of x between a and b for which the derivative vanishes. This leads at once to the desired result. In view of the great importance of this theorem we shall give at the end of this chapter another proof, not essentially distinct from that given above, but different in form and depending on the method of integration by parts.

Page 253

129] ADDITIONAL THEOREMS IN THE CALCULUS 253 Examples LVII. 1. Suppose that f(x) is a polynomial of degree r. Then f ()(x) is identically zero for n > r, and the theorem leads to the algebraical identity 1 1 /(a + ) =f(a)+ f' ()+ h2f(a..+ rf () (a). Verify this when f(x) =, X2 X, m xm +pX2 q + r. 2. By applying the theorem to f(x)= /x, supposing x and x+h positive, obtain the result 1 1 h h2 (- 1)-ih-1 (-l1)n 1 _ h h2 1) 1 (_ _ )nhn A1 [Since +a(h [Since xx+ h - - x 2 +3 '"^ n X+ ( h) we can verify the result by showing that (x + h) can be put in the form (x+ Oh)n + 1, or that en + 1 < cn (x+h) < (x + h)n + 1, as is evidently the case.] 3. Obtain the formula 2 h3 sin (x + h) =sin x +h cos x - sin x - sin x+... h2n-1 h2n +(- 1)-' c(2n os x+ ( 1l)n 2n sin (x+ Oh), the corresponding formula for cos (x+h), and similar formulae involving powers of h extending up to h' + 1. 4. Show that if m is a positive integer, and is a positive integer not greater than m, (x+h)m =xn () X + (-1) x8-n+1 hnl1+ (m) ( +. )- n Show also that if the interval (x, x+h) does not include x=0, the formula holds for all real values of m and all positive integral values of n; and that, even if x<0<x+ h or x+h<O<x, the formula still holds if mn-n is positive. 5. The formula f(x+A)=f(x)+hf'(x+ Oh) is not true if f(x)=1/x and x< 0 < x +h. [For f(x+h) -f(x) >0 and hf'(x+ Oh) =-h/(x+ )2 < 0: it is evident that the conditions for the truth of the Mean Value Theorem are not satisfied.] 6. If x=- a, = 2a, f(x) = xl13 the equation f(x + h) =f(x) + hf'(x + Oh) is satisfied by 0= + J ^/3. [This example shows that the result of the theorem may hold even if the conditions under which it was proved are not satisfied.] 7. Approximation to the roots of equations. Let 4 be an approximation to a root of an algebraical equation f(x) =0, the actual root being +. Then o=f ( +A)=f () + hf'()+ h2f" (+ h), so that,=_/f(4) ih2f"(+Oh) so that -= -f-'()- 2 f'( 2 (f () It follows that in general a better approximation than x= is =I - /(+)/e c, wn If the root is a simple root, so that f'(+h) A)0, we can, when h is small

Page 254

254 ADDITIONAL THEOREMS IN THE CALCULUS [VII enough, find a positive constant K such that If'(x) I > K for all the values of x which we are considering, and then, if h is regarded as of the first order of smallness, f(4) is of the first order of smallness, and the error in taking ~-{f(4)/f' ()} as the root is of the second order*. 8. Apply this process to the equation x2=2, taking 4=3/2 as the first approximation. [We find A= -1/12, 4+h=17/12= 1417..., which is quite a good approximation, in spite of the roughness of the first. If now we repeat the process, taking 6=17/12, we obtain 6+h=577/408=1'414215..., which is correct to 5 places of decimals. The reader should also test the result of taking = 7/5.] 9. By considering in this way the equation x2- -y=0, where y is small, show that ^/(1 +y) = 1 + y - {.y2/(2 +y)} approximately, the error being of the fourth order. 10. Show that the error in taking the root to be - (fif') - (2f/f '3), where 4 is the argument of every function, is in general of the fourth order. 11. The equation sin x=ax, where a is small, has a root nearly equal to 7r. Show that (l-a) rr is a better approximation, and (1- a+a2) 7r a better still. [The method of Exs. 7-10 does not depend on f(x)=0 being an algebraical equation, so long as f' and f" arp continuous.] 12. Show that as h-.-0 the limit of the 0 which occurs in the general Mean Value Theorem is l/(n + 1), provided f(n + ) (x) is continuous. [For f(x + h) is equal to each of hn,n An +1.f(x)+...+ /f()(x+ h), f(x)+...+ f()+ + (n+l)(X+lh), where 01 as well as 0 lies between 0 and 1. Hence f(n) (X + oh) =f(n) () + {hf(n + ) ( + 01h)}/(n + 1). But if we apply the original Mean Value Theorem to the function f(")(x), taking Oh in place of h, we find f(n) (x + Oh) =f(,n) (x) + Ohf ( + 1)(x+0 402 h) where 02 also lies between 0 and 1. Hence Of(n + 1)( 002 ) = {f(n + 1) (X + 1 h)}/(n + 1), from which the result follows, since f(n+l)(x+002h) and f(n+ 1)(x+01hA) tend to the same limit f(n +1)(x) as h-0.] 13. Prove that {f(x+ 2h) - 2f(x + A) +f(x)}/h2-.f" () as h- 0, provided f"(x) is continuous. [Use equation (2), p. 252.] 14. Show that, if the first n derivatives of f(x) are continuous for x=0, and f (n) (0)=0, then f (x) = ao + a x + a2x2 +... +an (1 +), where a,.=f(r) (0)/r! and ex- 0 with x. * This method of approximation is due to Newton. For further details see Gibson's Calculus, pp. 244 et seq., and Tannery's Leqons d'Algebre et d'Analyse, pp. 302 et seq.

Page 255

ADDITIONAL THEOREMS IN THE CALCULUS 255 15. Show that if ao+alx+a2X 2+... +an (1 +ex) =bo+blx+b2 2+... +bnx (1 +rx), where ex and yx tend to zero with x, then ao0=bo, al=bl,..., an,=bn. [Making x-O0 we see that ao=bo. Now divide by x and afterwards make x —O0. We thus obtain al=bl; and this process may be repeated as often as is necessary. It follows that if f(x)=ao+alx+a2x2+... +anXn(l+E)x then a. =/(r)(0)/r!.] 130. Taylor's Series. Suppose that f(x) is a function all of whose differential coefficients are continuous in an interval (a - a, a + a) surrounding the point x = a. Then, if h is numerically less than a, hnf-1 n f(a + h) =f(a)+ hf' (a) +... +f - /(-) (a) + f (n (a +,h, where 0 < On < 1, for all values of n. Or, if nl hv h, Sn =~ X!f ()(a), Rn = h f (n) (a + Onh), o pi n! we have f (a + h)- R = Sn. Now let us suppose, in addition, that we can prove that Rn- o0 as n —3o. Then evidently f(a + h) = lim Sn =f(a) + f'(a) + 2 a)+.... n —oof + This expansion of f(a + h) is known as Taylor's Series. The particular case in which a = 0, when the formula reduces to f(h) =f(0) + hf'(0) + 2 "((0) +. is known as Maclaurin's Series. The function Rn is known as Lagrange's form of the remainder. Examples LVIII. 1. Let f(x)=sin x. Then all the derivatives of f(x) are continuous for all values of x. Also I fn (x) I 1 for all values of x and n. Hence in this case I R, _ hA/n!, which tends to zero as n —co (Ex. xxx. 12) whatever value h may have. It follows that h12 h3 h4 sin (x + h)=sin x +h cos x- sinx - cos x + sin x..., for all values of x and h. In particular 3 A5 sin h=h- +-..., 3! 5! for all values of h. Similarly we can prove that hxS 2 h3 h. 2 A4 cos(x+h)=cosx-hsinx- -.cosx+3-sinx+.., cosh=l-. + -.

Page 256

256 ADDITIONAL THEOREMS IN THE CALCULUS [VII 2. The Binomial Series. Let f(x)=(l +x) where m is any rational number, positive or negative. Then f/() (x)= m (n - 1)... (m- - n + 1)(1 + x)m-, and Maclaurin's Series takes the form (i+x)n=i+(7) X+()x2+.... When mn is a positive integer the series terminates, and we obtain the ordinary formula for the Binomial Theorem with a positive integral exponent. In the general case RRn= f(n) (Ox)= ( x)n(1 +Ox)-n, and in order to show that Maclaurin's Series really represents (1 +x)m, where m is not a positive integer, for any range of values of x, we must show that RO — for every value of x in that range. This is so in fact if - <x<1. When 0 g x < 1 this may be proved by means of the expression given above for R, since (1 +Ox)m~-n< if n>m, and (Xn -O as n-oo (Ch. IV, Misc. Ex. 5). But if -l<x<0 a difficulty arises because 1+0x<1, and (l +x)m- >l if n>m; knowing only that 0<0<1 we cannot be assured that 1 +Ox is not quite small and (1 + Ox)"^- quite large. In fact, in order to establish the Binomial Theorem properly by means of Taylor's Theorem, we need some different form for Ra, such as will be given later (~ 146). 131. Applications of Taylor's Theorem. A. Maxima and minima. Taylor's Theorem may be applied to give greater theoretical completeness to the tests of Ch. VI, ~~ 103-4, though the results are not of much practical importance. It will be remembered that, on the assumption that b (x) and its first two differential coefficients are continuous for x=:, we stated the following as being sufficient conditions for a maximum or minimum of ( (x) at x =: for a Maximum, 0'(I) = 0, '"(:) < 0; for a Minimum, p'(:) = 0, "(~) > 0. It is evident that these tests fail if +'"(f) as well as +'(t) is zero. Let us suppose that the first n derivatives ' (x), ) (x),... (n(X) are all continuous at x = I, and that all save the last vanish there. Then for sufficiently small values of h, k( + h)- f)= (n)( (n + Oh). In order that there should be a maximum or a minimum, this expression must be of constant sign for all sufficiently small

Page 257

131, 132] ADDITIONAL THEOREMS IN THE CALCULUS 257 values of h, positive or negative. This evidently requires that n should be even. And if n is even there will be a maximum or a minimum according as f(n) () is negative or positive. Thus we obtain the test: if there is to be a maximum or minimum the first derivative which does not vanish must be an even derivative, and there will be a maximum if it is negative, a minimum if it is positive. Examples LIX. 1. Verify the result when q(x)=(.x-a)m and ==a, m being a positive integer. 2. Test the function (x - a)n (x - b)n, where m and n are positive integers, for maxima and minima at the points x=a, x=b. Draw graphs of the different possible forms of the curve y=(x-a)m(x-b)n. X3 X3 X5 3. Test the functions sinx-x, sinx-x+ -, sin x- - x - - 22 X2 X4 cosx-1, cosx-1+ -, cosx-1 - ---... for maxima or minima at x==0 2i 2 24 4 132. B. The calculation of certain limits. Suppose that f(x) and (b(x) are two functions of x whose derivatives f'(x) and b'(x) are continuous for x =: and that f(~) and 0(~) are both equal to zero. Then the function W(x) =f( )W/( ) has no value for x= ~. But of course it may well tend to a limit as x —. Now f(x) =f(x) -f()= ( - ) f (), where x, lies between ~ and x; and similarly (x) = (x - ) b'(x2), where x2 also lies between x and ~. Thus t'(x) =f'(x,)/l'(x2). Then we must distinguish four cases: (1) if neitherf'(~) nor b'(g) is zero it is clear that { fx) (x) ( f '(() /f ' (W ); (2) iff'() = 0, /'(:) + 0, it is clear that f(x)/b(x)-0; (3) if f'() + 0, 0'(|) = 0, it is clear that f(x)/4f(x) becomes numerically very large as x —; but whether f(x)/,(x))-oo or - x or is sometimes large and positive and sometimes large and negative we cannot say, without further information as to the way in which 0'(x) —0 as x — (if e.g. f'() > 0, and +'(x) > 0 when x > i, it is clear that f(x)/l(x) — + oo ); H. A. 17

Page 258

258 ADDITIONAL THEOREMS IN THE CALCULUS [VII (4) iff'(i) = 0, O '(:) = 0, we can as yet say nothing about the behaviour of f (x)/ (x) as xO0. But in this case it may happen that f(x) and cf(x) have continuous second derivatives. And then f(x) =f(x) -f () - )f ( - I)/'( = ( x - f)" ), (x) = W (x) - 0 () -(- 4-) ' () = 2 (x - )2 b"(x2), where again x, and x, lie between t and x; so that * (x) = f" (xO/)O" (x2). We can now distinguish a variety of cases similar to those considered above. In particular, if neither second derivative vanishes for x= 4, we have f (x)/0(x)f (~:)/~" (:). It is obvious that this argument can be repeated indefinitely, and we obtain the following theorem: suppose that f(x) and +(x) and their derivatives, so far as may be wanted, are continuous for x=f. Suppose further that fp) (x) and (q) (x) are the first derivatives of f(x) and db(x) respectively which do not vanish for x =. Then (1) if p = q, f(x)/k(x)-f(P)(0)/(P)(4); (2) if p>q, f ()/(X)- O 0; (3) if p< q, and q-p is even, f (c)l(x) — + oo, the sign being the same as that of f(P) ()/( (q)(4); (4) if p < q and q- p is odd, f(x)/o(x)-+ ~ o as x- +, the sign being the same as that of f(P) ()/q) (4), while if x- -O0 the ambiguous sign must be reversed. This theorem is in fact an immediate corollary from the equations f() = (- (x - )xP f(p (x), () (X - ) p! q! 4 (q) (x. Examples LX. 1. Find the limit of {X - (,+ 1) n + + nxn + 2}/(1 - X)2, as x-Al. [Here the functions and their first derivatives vanish for x=1, and f"(l)=n(n+l), f"(1)=2.] 2. Find the limits as x -0- of (tan x-.)/(x- sin x), (tan nx - n tan x)/(n sin x - sin nx). 3. Find the limit of x{/(x2+a2)-x} as x-. [Put x=l/y.]

Page 259

132, 133] ADDITIONAL THEOREMS IN THE CALCULUS 259 4. Prove that lim (x - n)c cosec, lim - - x_11 rsoxn xr C- n (x- n) 7r) 6 ' n being any integer, and evaluate the corresponding limits involving cot xr. 5. Find the limits as x —0O of {cosec x - (1/x) - (x/6)}/x3, {cot x - (l/) + (x/3)}/x3. 6. (sin x arc sin x - x2)/x-, (tan x arc tan x - x2)/x6-., as x-o0. 133. C. The contact of plane curves. Let us suppose that f(x), +(x) are two functions which possess derivatives of all orders continuous for x=, and consider the curves y =f(x), y= f(x). In general f(J) and 0()) will not be equal. In this case the abscissa x = t does not correspond to a point of intersection of the curves. If however f(e) = +(), the curves intersect in the point x = A, y =f()= ( (). Let us suppose this to be the case. Then in order that the curves should not only cut but touch at this Y /0(x) point it is obviously necessary and suf- Q =f (W) ficient that the first derivatives f'(x), b'(x) should also have the same value for x =. The contact of the curves in this case may be regarded fiom a different 0 + h X point of view. In the figure the two curves are drawn touching at P, and QR is equal to (: + h) -.f ( + h), or, since b(t) =f(:), b' (:) =f'(0), to Ih2 "(+ Oh) -f"( + 0 h)] (where 0 and 01 lie between 0 and 1). Hence lin = {() ( h-O h2 In other words, when the curves touch at the point for which x= I, the difference of their ordinates for x= + h, when h is small, is at least of the second order of smallness compared with h. The reader will easily verify that when the curves cut and do not touch limr (QR/h)-= q (~) -f'(t), so that QR is of the first order of smallness only. It is evident that the degree of smallness of QR, as compared with h, may be taken as a kind of measure of the closeness of the contact of the curves. It is at once suggested that if the first 17 —2

Page 260

260 ADDITIONAL THEOREMS IN THE CALCULUS [VII n-1 derivatives of f and q all have equal values for x = ~, then QR will be of the nth order of smallness: and the reader will have no difficulty in proving that this is so and that QR 1 lim Ih -! {b('~'(I)-f(')(f)}. We are therefore led to frame the following definition: Contact of the nth order. If f(t:)= (:), f'(Q)= '(0),.../ (n-")() = (n-l))(~), but f(n, (n ) ~+ (n)(;), the cur~ves y =f(x), y = +(x) will be said to have contact of the nth order for x =. Let us consider some particular cases of this definition in greater detail. Examples LXI. I. Let q (x)=ax+b, so that y= q(x) is a straight iine. The conditions for contact at the point for which x=& are f($)=a$+b, f' ()=a. If we determine a and b to satisfy these equations we find a=f '(), b=f(f)-$f'($), and the equation of the tangent to y=f(x) at the point x= $ is y=xf'($) + (f()- -f/, (), or y-f ()-(x - )f'() (cf. Ex. XLI. 6). 2. The fact that the line is to have simple contact with the curve completely determines the line. In order that the tangent should have contact of the second order with the curve we must have f" ($) = "(f), i.e. f"($)=0. A point at which the tangent to a curve has contact of the second order is called a point of inflexion. 3. Find the points of inflexion on the graphs of the functions 3x4 - 6x3 + 1, 2x/(1 + x2), a2x/(x-a)2, sin x, cos (x - a), a cos2 x + b sin2 x, tan x, arc tan x. 4. Show that the conic ax2 + 2hxy + by2 + 2gx + 2fy + c = O cannot have a point of inflexion. [Here ax + hy+g+(hx by+f)yl=O and a +2hyl+by12+ (hx+by+f) /2=0, suffixes denoting differentiations. Thus at a point of inflexion a+2hyl + by12 = O or a (Ax+by+f )2-2h/(ax+hy+g)(h/x+by+f)+b (ax+hy+g)"=O,, or (ab-h2) {ax2 J 2hxy + by2 + 2gx + 2fy} + aqf2 - 2fgh + bg2 = 0. But this is inconsistent with the equation of the conic unless af 2 - 2fgh J+ bg2 = c (ab - h2) or abc + 2fgh-af2 - bg2-c2=; and this is the condition that the conic should degenerate into two straight lines.] 5. The curve y = (ax 2bx +c)/(ax2 + x+ y) has one or three points of inflexion according as the roots of ax2 + 2x +y=0 are real or complex. [The equation of the curve can, by a change of origin (cf. Ex. XLVIII. 22), be reduced to the form,7 = 6/(A +2 + 2B + C) =-/{A ( -_) (d- )},

Page 261

133] ADDITIONAL THEOREMS IN THE CALCULUS 261 where p, q are real or conjugate. The condition for a point of inflexion will be found to be 3-3pq +~pq (p+ q)=0, which has one or three real roots according as {pq(p- q)}2 is positive or negative; i.e. according as p and q are real or conjugate.] 6. Discuss in particular the curves y=(1 - x)/(l -x2), y=( - 2)/(1 + 2), y=(1 +x2)/(1 - 2). 7. Show that when the curve of Ex. 5 has three points of inflexion, they lie on a straight line. [The equation 3 - 3pq +pq (p +q)=0 can be put in the form ( - p) ( - q) (+p +q) + (p - q)2 = 0, so that the points of inflexion lie on the line + A (p - q)2 +p+ q =O or A- 4 (A C- B2) = 2B*.] 8. Show that the curves y=xsinx, y=(sinx)/x have each infinitely many points of inflexion. 9. Contact of a circle with a curve. The general equation of a circle, viz. (x-a)2 + (y- b)2 =r2..............................(1), contains three arbitrary constants. Let us attempt to determine them so that the circle has contact of as high an order as possible with the curve y=f(x) at the point $, r7=f($). We write 11, '72 for f' (), f"(6). Differentiating the equation of the circle twice we obtain (x- a) +(y - b) yl= 0......................... (2), + 2+ (y- b) 2 O.............................. (3). If the circle touches the curve the equations (1), (2) are satisfied by x=&, y=, yi =. This gives ($-a)/71= -(77-b)==r/^(1l+,2). If the contact is of the second order the equation (3) must also be satisfied by taking Y2 = 72. Then b = + {(1 + 12)/q}); and hence we find a= - {1(1 + 12)/72}, b=9 +{(1 + i1)/)2}, = (1 + q2)3/2 q2 The circle which has contact of the second order with the curve at the point (, 1)) is called the circle of curvature, and its radius the radius of curvature. The measure of curvature (or simply the curvature) is the reciprocal of the radius: thus the measure of curvature is f" ()/{l +[f' ()]2}3/', or 10. Verify that the curvature of a circle is constant and equal to the reciprocal of the radius; and show that the circle is the only curve whose curvature is constant. 11. Find the centre and radius of curvature at any point of the conics 2= 4ax, (x/a)2 + (y/b)2= 1. 12. In an ellipse the radius of curvature at P is CD3/ab, where CD is the semi-diameter conjugate to CP. * For more general results of this kind see a paper by Mr Bromwich, Messenger of Mathematics, vol. xxxII, p. 113.

Page 262

262 ADDITIONAL THEOREMS IN THE CALCULUS 13. Show that in general a conic can be drawn to have contact of the fourth order with the curve y =f(x) at a given point P. [Take the general equation of a conic, viz. ax2 + 2xy + by2 + 2gx + 2fy + c=O, and differentiate four times with respect to x. Using suffixes to denote differentiation we obtain ax+hy+g+ (hx+by+f) y1=0, a+ 2hyl+ by,2 + (hAx+ by +f) y2 = 0, 3 (h + bl)y2+(hA + by f)3=O, 4 (h + by) 3 + 3b22 + (/h + by +f) 4 = 0. If the conic has contact of the fourth order these five equations must be satisfied by writing 7, 7, 71, '12, r/3, 14 for x, y.... We have thus just enough equations to determine the ratios a: b: c:f g: h.] 14. An infinity of conics can be drawn having contact of the third order with the curve at P. Show that their centres all lie on a straight line. [Take the tangent and normal as axes. Then the equation of the conic is of the form 2y = ax2 + 2hxy + b/2, and when x is small one value of y may be expressed (Ch. V, Misc. Ex. 9) in the form y= ax2 +~ ahX3(1 +( ), where e —0O with x. But this expression must be the same as y=-f" (0) x2+ f"' (O) X3(1 + E), where ex' — with x, and so (Ex. LVII. 15) a=f"(0), h=f"'(0)/3f"(0). But the centre lies on the line ax+ hy =0.] 15. Determine a parabola which has contact of the third order with the ellipse (x/a)2 (y/b)2= 1 at the extremity of the major axis. 16. The locus of the centres of conics which have contact of the third order with the ellipse (x/a)2+(y/b)2=l at the point (acosa, b sin a) is the diameter x/(a cos a)=y/(b sin a). [For the ellipse itself is one such conic.] 134. Differentiation of functions of several variables. Suppose that f(x, y) is a function of two real variables x and y, and that the limits linm f(x + h, y) -f(x, y)}/h, lim {f(x, y + k) -f(x, y)}/k h -.O k --- O both exist; i.e. that f(x, y) has a derivative with respect to x, which we have agreed to denote by df/dx or Dxf(x, y), and also a derivative with respect to y, denoted similarly by df/dy or Dyf(x, y). Another notation which it is natural to use is that of fx'(x, y), fy (X, y) or simply fx', fy' or fx, fy. It is usual to call these differential

Page 263

134] ADDITIONAL THEOREMS IN THE CALCULUS 263 coefficients partial differential coefficients, and the process of forming them partial differentiation. We shall not, however, adopt this method of expression, which is apt to be misleading, and to lead the reader to imagine that there is some mystery about the process of 'partial differentiation.' In point of fact it is exactly the same process as 'ordinary differentiation.' The only novelty which occurs is the presence of a second variable y inf(x, y). But the variation of y has nothing to do with that of x, and there is no possible ambiguity about the meaning of df/dx, df/dy, which are formed in precisely the same way as f'(x) or df/dx in Ch. VI. It is also usual to write ff af ax' ay for df/dx, df/dy, when f is a function of two variables: but, for the reasons indicated above, we shall not make any use of this notation. In what precedes we have supposed x and y to be two real variables entirely independent of one another. If x and y were connected by a relation the state of affairs would be very different. In this case our definition of df/dx would fail entirely, as we could not change x into x+h without at the same time changing y. But then f(x, y) would not really be a function of two variables at all. A function of two variables, as we defined it in Ch. II, means essentially a function of two independent variables. If y depends on x, y is a function of x, say y = g(x): and then f (x, y) =f x, (x)} is really a function of the single variable x. Of course we may also represent it as a function of the single variable y. Or, as is often most convenient, we may regard x and y as functions of a third variable t (Ch. II, ~ 23), and then f(x, y), which is of the form f/{(t), +(t)}, is a function of the single variable t. Examples LXII. 1. If x=rcos, y=rsin, so that r= /(x2+y2), O =arc tan (y/x), prove that dr _ x dr y d dO x dx 2(x2+y2)' dy -/(x+y2) ' ds +nY2 =y2 +y2 dx dx dy = dr =cos, =- r sin, -=sin, rcos dr cos d dr dO

Page 264

264 ADDITIONAL THEOREMS IN THE CALCULUS [VII dr /(dx) dO [e 2. Account for the fact that 1 (d and 1/g [When we were considering a function y of one variable x it followed from the definitions that dy/dx and dx/dy were reciprocals. This is no longer the case when we are dealing with functions of two variables. Let P (Fig. 56) be the point (x, y) or (r, 0). To find dr/dx we must increase x (say by an increment MM1 = 8x) while keeping y constant. This brings P to P1. If along OP1 we take OP'=OP, the increment of r is P'P== 8r, say; and dr/dyx=lim (8r/8c). If on the other hand we want to calculate d'c/dr, x and y being now regarded as functions of r and 0, we must increase r by Ar, say, p2 keeping 0 constant. This brings P to P2, if PP2= Ar: the corresponding increment of x is MM/i1= Ax, say; and P dx/dr = lim (A.x/Ar). / Now AX= 8r*: but Ar 4 r. Indeed it is easy to see from the figure that lim ('r/bx) = lim (P'P1/PP1) = cos 0, but lim (r/Ax) =lim (PP2/PP1)=sec 0, so that lim ( =r/Ar)= cos2 0. 0 M M' M1 The fact is of course that dx/dr and FIG. 56. dr/dx are not formed upon the same hypothesis as to the variation of P.] 3. If z=f(ax + by), prove that b (dz/dx) = a (dz/dy). [For dz/x = af' (ax + by), dz/dy = bf' (ax + by).] 4. If z =f (ax + by) + F (ax - by), then b2 (d2z/dx2) = a2 (d2z/dy2). 5. Find dX/dx, dX/dy, etc. when X+ Y=x, Y=Ixy. Express x, y as functions of X, Y and find dx/dX, dx/d Y, etc. 6. Find dX/dx, etc. when X+Y+Z=-x, Y+Z=xy, Z=xyz; express x, y, z in terms of 7X, Y, Z and find dx/dX, etc. [There is of course no difficulty in extending the ideas of the last section to functions of any number of variables.] 135. Differentiation of a function of two functions. We shall not, in this volume, be much concerned with functions of two variables from the point of view of differentiation. But there is one theorem, concerning the differentiation of a function of one variable, and known generally, though not very happily, as the Theorem of the Total Differential Coefficient, which is of very great importance and depends on the notions explained in * Of course the fact that Ax=sx is due merely to the particular value of Ar that we have chosen (viz. PP2). Any other choice would give us values of Ax, A:. proportional to those used here.

Page 265

135] ADDITIONAL THEOREMS IN THE CALCULUS 265 the preceding section regarding functions of two variables. This theorem gives us a rule for differentiating f{ (t), +r(t)}, with respect to t. Let us suppose, in the first instance, that f(x, y) is a function of the two variables x and y, and that fx', fy' are continuous fiunctions of both variables (~ 90) for all of their values which come in question. And now let us suppose that the variation of x and y is restricted by supposing that (x, y) lies on a curve x = (t), y = (t), where qb and J are functions of t with continuous differential coefficients +'(t), #'(t). Then, when x and y are thus restricted, f(x, y) reduces to a function of the single variable t, say F(t). The problem is to determine F'(t). Suppose that, when t changes to t +, x and y change to x +:, y + q. Then by definition dF (t) = lim [f {(t + ), (t + 7)} -f {(t), (t)}] dt T-.o. = lim { f( +, y + )-/f(x, )}/r =ir [f(x+ y+J)-f(x, y+n) + f(x, y+v)-f(x, y) ~ But by the Mean Value Theorem {f (,f(x+, y+)-, }/=fx'(x+, Y+ V y), {f(x, y+v)-f(x, y)\l-fy'(X, y+O67), where 0 and 0, each lie between 0 and 1. As Ir-0, 4-~-0 and 7 —0O, and -/T0-4'(t), 7/T -,'(t): also fX ( + 0, y+v) -f' (, y), Jfy (x, y+0l) —>fy'(X, y). Hence F(t) = Dtf I{(t), (t)} =fx(x, y) q (t) +fy'(x, y) ' (t), where we are to put x= (t), y =(t) after carrying out the differentiations with respect to x and y. This result may also be expressed in the form df df dx df dy dt dx dt dy dt'

Page 266

266 ADDITIONAL THEOREMS IN THE CALCULUS Examples LXIII. 1. Suppose k (t) = (1 - t2)/(1 +t2), 4 (t)= 2t/(1+ t2), so that the locus of (x, y) is the circle x2+ y2=1. Then ' (t) = -4t/(1 + t2)2, ' (t) = 2(1 - t2)/(1 +.t2)2, F' (t) = DJf(x, y) {- 4t/(1 + t2)2) + Dyf(X, 2X) {2(1 - t2)/(1 + t2)2}, where x and y are to be put equal to ( -t2)/(l+t2) and 2t/(l+t2) after carrying out the differentiation. We can easily verify this formula in particular cases. Suppose e.g. f(x, y) =x2 +y2. Then fx'=2x, fy'=2y, and it is easily verified that F'(t)=2 {x+'(t)+y+' (t)} =0, which is obviously correct, since F(t)=1. 2. Verify the theorem in the same way when (a) x=tm, y=1-tm, f(x, y)=x+y; (b) x=acost, y=asint,f(x, y)=x2+y2. 3. One of the most important cases is that in which t is x itself. We then obtain D f ({, + ()}) = D f (, y) +D f (x, y),'(x), where y is to be replaced by + (x) after differentiation. It was this case which led to the introduction of the notation a/fla, aflay. For it would seem natural to use the notation df/dx for either of the functions Dxf{x, 4 (x)} and Dxf(x, y), in one of which y is put equal to (x) before and in the other after differentiation. Suppose for example that y= -x and /(f y) =x+y. Then Dxf(x, 1 -x)=Dx 1=0, but Dxf(x, /y)=1. The distinction between the two functions is adequately shown by denoting the first by df/dxr and the second by af/ax, in which case the theorem takes the form df af +fdy dx- ax ay dx' But this notation is open to the objection that it is misleading to denote the functions f {x, + (x)} and f(x, y), whose forms as functions of x are quite different from one another, by the same letterf in df/dx, af/ax. And if we prefer, as we shall do throughout this book, to regard x and y, when connected by a relation, as being each functions of an auxiliary variable t, no ambiguity can arise and the need for any such complication in our notation is not felt. Thus if x=t, y=l-t, F(t)=f(x, y)=x+y=1, the meaning of the equations dF df d df dy (1 ~ (t) = t= d d + -y - = (..- 1)=0 dt dy dt is perfectly straightforward and their truth obvious. 4. If the result of eliminating t between x=+ (t), y= (t) is f(x, y) = 0, df dx df dyithen + -..............................(1). dx dt dy dt r[Sinc (dy'\ i/dx\ dy [Since a/y ( dxt= -dy, this is equivalent to fx +f y' (dy/dx)= O....................... (2),

Page 267

136] ADDITIONAL THEOREMS IN THE CALCULUS 267 a result which of course follows directly from the discussion of Ex. 3. The equation (1) is sometimes written in the form fx'dx+f ' dy=O.................................(3). This formula as it stands means nothing, since dx and dy mean nothing by themselves: it is merely a convenient way of writing (1) or (2). There is however another point of view from which it may be regarded. Suppose that when t is changed to t + at the corresponding increments in x and y are ax, Ay. Then lim {fx, (ax/at) +fy' (y/at)} = 0; in other words, when at is small, fx 8x fy'ay is small compared with at, instead of being, as an expression of the form A ax + By prima facie appears to be, of the same order of smallness as at. And the equation (3) may be regarded as a way of expressing this fact.] 5. If x and y are functions of t, and r and 0 are the polar coordinates of (x, y), show that r'=(xx'+yy')/r, 0 =(xy'-yx')/r2, dashes denoting differentiations with respect to t. 136. The Mean Value Theorem for functions of two variables. Many of the results of the last chapter depended upon the Mean Value Theorem, expressed by the equation (x + h)- (x) = zf'(x + Oh), or as it may be written, if y = ~>(x), Sy =f'(x + O8x) Sx. Now suppose that z =f(x, y) is a function of the two independent variables x and y, and that x and y receive increments h, k or 8x, 8y respectively; and let us attempt to express the corresponding increment of z, viz. z =f(x + h, y + k) - f(x, y), in terms of h, k and the derivatives of z with respect to x and y. Let f (x + ht, y + lct) = F(t). Then f(x+h, y+k) -f(x, y) = F(1)-F(O) = '(), where < 0 < 1. But, by ~ 135, F' (t)= Dtf (x + ht, y + kt) = hf' (x+ht, y + t)+kfy'(x+ht, y+kt). Hence finally z =f(x +h, y + k)-f(x, y) = hf(x + Oh, y + )+ kfy'(x + Oh, y+ Ok), which is the formula desired. Since f,', fy' are supposed to be continuous functions of x and y, we have f'(x+O h, y+Ok)==f (x, y)+ h,k, fy'(x + Oh, y + Ok) =fy (x, y) + e', k,

Page 268

268 ADDITIONAL THEOREMS IN THE CALCULUS [VII where eh,k and e'h,k —O as h and kc-O0. Hence the theorem may be written in the form 8z = (f'+ e)8& + (.fy' + /) y, where e and e are small when 8x and 6y are small. This may be expressed by saying that the equation S = fX'8x +fyl'y is approximately true; i.e. that the difference between the two sides of the equation is small in comparison with the larger of 8x and Sy*. We must say 'the larger of 6x and 6y ' because one of them might be small in comparison with the other; one indeed might actually be zero. It should be observed that if any equation of the form 8z=Xax+p/oy is 'approximately true' in this sense, we must have X=fx,', i=fy'. For we have az -f, ' -f y,y= ea + e'sy, as- \8x - 13y = El 8. + elay where e, E', E1, c' all tend to zero as Ax and 8y tend to zero; and so (X -f/,) x + (- f-/) y = px + p'y where p- 0 and p' -0. Hence if q is any assigned positive number, we can choose o- so that I (X -f,')x+(/ -f/) ay I < Y(||x + | II81) for all values of rx and by numerically less than or. Taking ay =0 we obtain I (X -f') ax I < 7 1| ax, or i X-fx' < 7, and, as 7r may be as small as we please, this can only be the case if X=f/'. Similarly P=fy'. This observation is of very frequent use, as will be seen from the examples which follow. It is obvious that the theorems of this section are capable of immediate extension to functions of any number of variables. Examples LXIV. 1. The area of an ellipse is given by A = 7rab, where a, b are the semiaxes. Prove that if a, b receive small increments 8a, 8b, then AA/A = (8a/a) + (8b/b), approximately. 2. Express a, the area of a triangle ABU, as a function of (in the usual notation) (i) a, B, C, (ii) A, b, c, and (iii) a, b, c, and establish the approximate formulae A 6a= c8B baC 6 c8b 8c -2 — +-+ + — n -=cot A8A + b+, A a asin B a sin C' b c &A = R (cos A 6a +cos B 8b + cos C8c). * Or with I x I r y or /(ax2 + y2).

Page 269

136] ADDITIONAL THEOREMS IN THE CALCULUS 269 3. The sides of a triangle vary in such a way that the area remains constant. They are then connected by one relation, and a may be regarded as a function of b and c. Prove that da/db = - cos B/cos A, da/dc= -cos C/cos A. [This follows from the fact that both of the equations 8a= d b +da c, cosA a+cosB b+cos Cc=0, db de are approximately true.] 4. If a, b, c vary so that R remains constant, then 2(a/cos A)=0, approximately, and so da/db= - cos A/cos B, da/dc= - cos A/cos C. [Use the formulae a = 2Rsin A, etc., and the facts that R and A + B + C are constant.] 5. If f(x, y)=O, and f', jf' are continuous and fy' 0, then dy/dx= -fx'/f. 6. The equation of the tangent to the curve whose equation is f(x, y) = 0, at the point (xo, yo), is (x- xo)f'i (xo0, o) + (y -yo)fy (o, yo) = 0. 7. If z is a function of u and v, which are functions of x and y, prove that dz z dz dz d d d ddz d d dv dx- du dx dv dx' dy du dy dv dy' [We have Z =Zau +Z,. = x + Zv aaV, =v aX -Uyy, aV= X -+ Vy y, approximately, suffixes denoting differentiations. Substitute for 3u and 8v in the first equation and compare the result with the equation az=z;8x++zyy.] 8. Let z be a function of x and y, and let X, Y, Z be defined by the equations x=alX+bl Y+ciZ, y=a2X+b2 Y+c2Z, z=a3X+b3 Y+c3Z. Then Z may be expressed as a function of X and Y. Express dZ/dX, dZ/dY in terms of dz/dx, dz/dy. [Let these differential coefficients be denoted by P, Q and p, q. Then z -p8x - qy =0, approximately, or (clp + c2q - c3) Z+ (alp + a2q - a3) 3X+ (blp + b2q - b3) Y= 0. Comparing this equation with sZ- PX- Q 3 Y=0 we see that = alp+ ca2 - a3 b1= -p+b2q- b -i + = _] CIp + Cq- C3 clp +c2q-C3 -9. If (alx+bly fclz)p+(a2x+b2y+c2z)q=a3x+b3y+c3z, then (aIX+bl Y+clZ)P+(a2X+b2 Y+c2Z) Q=a3X+b3 Y+c3Z. (iMath. Trip. 1899.) 10. If u and v are functions of x and y, x and y may be expressed as functions of t and v. Establish the formulae Jxu=vyv, JXV= -Uy, Jy =-x -Z, Jyv=Ux, where J=XVy - y V. Verify these formulae when it=r, v=O, by means of the results of Ex. LXII. 1.

Page 270

270 ADDITIONAL THEOREMS IN THE CALCULUS 11. Independent and non-independent functions. Let i and v be functions of x and y: and suppose that x and y vary in such a way that u remains constant. In general, of course, v will not remain constant; in this case we shall say that u and v are independent functions. If on the other hand the constancy of u does involve the constancy of v, then to a given value of u corresponds one (or several) definite values of v, and v may be regarded as a function of u. In this case we shall say that u and v are non-independent functions of x and y, or functions of x and y connected by a functional relation. Show that the necessary and sufficient condition that u and v should be connected by a functional relation is that J= t, Vy - Uy vx should vanish for all values of x and y. [We have the approximate equations 8u =uxx +Uy 8y, 8v= vx B. +vyy. That the condition J=0 is necessary follows from the fact that vx&x+-vuby must vanish for all values of the ratio x: ay for which x8x-+-Uyby vanishes. Moreover (i) bv = (8u - uy ay) (Vx/ux) + Vy -y = ( Vx/16Z) abi, approximately, if J=0 and uz,=0. But, if we express v as a function of u and y, eliminating x by means of the relation between x, y and a, we must have (ii) av=vyy +v,6bu approximately. Comparing (i) and (ii) we see that vy=0, so that v is a function of u only. If x — 0 for all values of x and y, then either u, or vx must also vanish for all values of x and y. The first alternative would show that u was a mere constant, the second that zt and v are both functions of y only, and therefore v a function of u. It is usual to call J the Jacobian of u and v with respect to x and y. It can be shown by an extension of this argument that three functions it, v, w of three independent variables x, y, z are connected by a functional relation if and only if J= tx Vx Wx Uy Vy Wy U2 VZ WZ vanishes for all values of x, y, z: and so generally for any number of variables.] 12. Show that ax2 +2hxy + by2 and Ax2+-2Hxy+By2 are independent functions unless /A = hlH= b/B. 13. Show that ax2 + by2 + cz2 + 2fyz + 2gzx + 2hxy can be expressed as a function of two linear functions of x, y, and z if and only if abc + 2fgh - af2 - bg2 - ch2 = 0.

Page 271

1:37] ADDITIONAL THEOREMS IN THE CALCULUS 271 [Let px + qy +rz, p'x + q'y + r'z be the linear functions. Then we must have ax+hy+gz hx+by+fz gx+fy+cz 0, p q r p' q r' or aP+hQ+gR=O, hP+bQ-+fR=O, gP+fQ+cR=O, where P=qr'-q'r, etc. Eliminating P, Q, R we have the condition required.] 137. Definite Integrals and Areas. It will be remembered that, in Ch. VI,~ 127, we assumed that iff(x) is a continuous function of x the region PpqQ shown in Fig. 57 has associated with it a Y Q definite number which we call its area. If we fix Op=a, p but let Oq = x vary, it is clear that this area is a function of x, which, as in ~ 127, we denote by S(x). Making this assumption, we ____ proved in ~127 that S'(x)=f(x), o p q x and we showed how this result FIG. 57. might be used in the calculation of the areas of particular curves. But we have still to justify the fundamental assumption that there is such a number as the area S(x). We know indeed, what is meant by the area of a rectangle, and that it is measured by the product of its sides. Also the properties of triangles, parallelograms, and polygons proved by Euclid enable us to attach a definite meaning to the areas of such figures. But nothing which we know so far provides us with a direct definition of the area of a figure bounded by curved lines. We shall now show, in the case of a particular class of functions f(x), how to give a definition of S(x) by means of which we can prove its existence*. We suppose that f(x) is not only continuous, but is positive and increases steadily (as in Fig. 57) from x = a to x = b. Let P, Q be the points whose abscissae Op, Oq are a and b respectively. Draw parallels to the axes of coordinates as shown in Fig. 58 a; * I have followed very closely the argument used by Mr Bromwich, Infinite Series, Appendix II, pp. 407-9.

Page 272

272 ADDITIONAL THEOREMS IN THE CALCULUS [VII and denote the area of the rectangle pQ bounded by a thick line (which we call the outer rectangle) by So, and the area of the Y Y Y Q Q Q P p S// P | I 0 p q X O p m q X O p n q X FIG. 58 a. FIG. 58 b. FIG. 58 c. shaded rectangle Pq (which we call the inner rectangle) by so. The outer rectangle includes and the inner is included by the curvilinear region PpqQ: and So - o = rect. PQ = (b - a) ff(b) -f(a)}. Now bisect pq at m, and draw mM parallel to OY, and other lines parallel to OX as shown in Fig. 58 b. And now let us denote the sum of the rectangles pM, mQ, which are bounded by a thick line in Fig. 58 b, and which we again call the outer rectangles, by Si, and the sum of the shaded rectangles Pm, Mq (which we again call the inner rectangles) by s,. Then again the outer rectangles include and the inner are included by PpqQ; also S1 < So, 81 > So and S, - s, = rect. PM + rect. MQ = (b - a) f (b) -f(a)}. We now carry the construction one stage further by bisecting pm and mq and forming outer and inner rectangles, of total areas S2 and s2 respectively, as in Fig. 58 c. The outer rectangles again include and the inner are again included by PpqQ; and S2 < S1 < So, 52 > si > So. Also S2- s2 is the sum of four rectangles, each of breadth 4 (b - a), and of total heightf(b) -f(a), so that 2.-8s= 2 (b - a) {f(b) -f(a)}. 2~-22 =~

Page 273

137, 138] ADDITIONAL THEOREMS IN THE CALCULUS 273 It is evident that we may proceed as far as we like in this way by continually bisecting segments of the line pq: and so we define two sequences S,, S, X,...; So, S1, s,... such that (1) Sn is the sum of the areas of a set of rectangles which include the curvilinear region PpqQ, and s, is the sum of the areas of a set of rectangles included by it; (2) So > S > Sl > S3 >...; So < Sl < 2<...; 1 (3) Sn - s, = f.(b- a){f(b)-f(a)}. From (2) and the fact that every S, is greater than the corresponding n,, it follows that Sn and sn, as n —oo, each tend to limits. Also (3) shows that Sn- Sn-0, and therefore that Sn and Sn tend to the same limit, which we may call S. And (1) naturally leads us to define the area of the region PpqQ as being equal to this number S. That is to say, our definition of this area is as follows: divide pq into 2n equal parts, and fornz the two sets of rectangles, of total areas Sn, Sn, as indicated in Fig. 58 a b c; then Sn andd s, tend, as n --, to a common limit S, which we call the area of the region. 138. This definition, however, may be generalised. Suppose that instead of dividing pq into 2n equal parts we divide it into v parts of any kind, subject only to the condition that each part is less than 38, where,V is a number which tends to 0 as v —o. Q We then construct an outer and an inner area,V, a- as shown in Fig. 59. If 8n and sn are defined as in ~ 137, it is easy to see* that S, > oe- and sn < S for all values of n and v; and, as S is the limit of Sn and sn, it follows'that S o-, 8<E for all values of v. Hence But it is obvious from the figure ) ) FIG. 59. that SP-.V < 8, {f(b) -f(a)}, * The reader should draw a figure to illustrate these inequalities. H. A.

Page 274

274 ADDITIONAL THEOREMS IN THE CALCULUS [VII and so, v-rv- O: hence S- o ---O, or limo a=S; and similarly lim = S. 139. We can give an analytical form to the definition of the area as follows: divide up the interval (a, b) by the points -- o, Xti, X2,..., x* v — and let E, = (X1 - Xo)/(X1) + (X2 - Xl)f(x2) +... + (X - v-1) f (), O-v = (X1 - xo)f(xo) + (x2 - Xl)f(xl) + *. + (V - xV-l)f/(XV-). Then if, as v —>oo, all the sub-intervals xs+ —xs tend to zero, i.e. if, given 8, we can choose n so that for v > n all of X1 - X, 2X - - 1,..., * y - Xv-1 are less than 8, the two sums 2^, a'v tend to a common limit s, which we define to be the area of the region PpqQ. 140. So far we have supposed that f(x) is a positive and increasing function. It remains to generalize our definitions so as to apply to a number of other cases. (i) The'preceding analysis is in no way affected if we suppose that f(x) is a decreasing instead of an increasing function. (ii) Now let us suppose that f(x), while increasing or decreasing steadily throughout (a, b), changes its sign (as at R in Fig. 60). In this case we may still use the analytical definition of ~ 139. Y It should however be observed that, along the arc PR, f(x) is negative. If we apply the definition of ~ 139 to the regions PpR, RqQ separately, the first has a negative and the /_ _ second a positive area; and the R q X number furnished by the definition as the area of the whole region PpqQ is the difference and not the sum of these sub-regions. If we want the sum we must calculate the area of each sub-region separately and change the sign of the area of the first before adding it to that of the second. (iii) Next we may suppose that f(x) does not either increase or decrease steadily throughout the whole interval (a, b), but that

Page 275

138-140] ADDITIONAL THEOREMS IN THE CALCULUS 275 (a, b) can be divided into a finite number of regions throughout each of which f(x) increases or decreases steadily (as in Fig. 61). FIG. 61. Then we define the areas of each of the regions PprR, RrsS, SstT, TtqQ as in ~ 139, and that of PpqQ as their sum. Here again it is to be observed that if f(x) changes sign some parts of the region will be reckoned as having negative areas. (iv) The reader will now find no difficulty in seeing how to define the area of any such curvilinear region as is likely to occur in elementary work. Thus in the case of the region shown in Fig. 62 we add the areas of the regions bounded by dotted lines FIG. 62 and subtract those of the shaded regions. This gives the total area, counting the part below the axis of x negatively. To find the area of the closed curve, in the ordinary geometrical sense, we must, before carrying out this process, change the sign of the areas of the two negative sub-regions. 18-2

Page 276

276 ADDITIONAL THEOREMS IN THE CALCULUS [VII 141. Lengths of Curves. The notion of the length of a curve is capable of precise mathematical analysis just as much as that of an area, but the analysis is rather more difficult, and we shall not give any general treatment of the question here. We shall however consider briefly the case in which the curve is a circle, since the assumption that a circular arc has a length is fundamental in elementary Trigonometry (see for example p. 169, footnote). T D^ J-^ ^ ^E A B FIG. 63. Let ACB (Fig. 63) be an arc of a circle, C its middle point (i.e. the point equidistant from A and B), and A T, BT the tangents at A and B. Then it is easy to prove that the tangent at C meets AT, BT in points D, E between A, T and B, T respectively *. Then A C+ CB >AB and A T+ TB= AD+ DT+ TE+EB>AD+DE+.EB. Thus if AB=so, AC+CB=sl, AT+TB=So and AD+DE+EB=S1, we have So<sl, So>S1. Now let F and G be the middle points of the arcs AC, CB, and HFI, JGK the corresponding tangents. If we put AF+FC+CG+GB=s2, Af+HI+IJ +rJK.+KB=S2, it is easy to prove as above that So< sl< s2 and So >S1 >S2. Proceeding in this way we define two sequence of numbers So, sl, s2,... So, S1, S2,... such that s, is the perimeter of part of a polygon inscribed in the circle and S, that of part of a polygon circumscribed to the circle, while So< S1< S2<... < Sn<,.., So >S1 >S2 >... >Sn >.. It is moreover easy to see that every s. is less than every Sn. Hence s, and Sn tend to limits as n — oo. But if a is the angle subtended by the whole arc at the centre of the circle, that subtended by one of the chords by which s, is defined is a/2", and S, - S = S, {1 - cos (a/2f)} <So {1 - co (a/2o)}) and therefore tends to zero as n - oo. Hence s8 and Sn tend to the same limit 1, which we call the length of the arc AB. * For the distance of every point of the tangent, other than C, from the centre of the circle is greater than that of any point on the circumference.

Page 277

141, 142] ADDITIONAL THEOREMS IN THE CALCULUS 277 It is easy to generalise the foregoing reasoning to cover the case in which the arc is subdivided in any manner instead of by repeated bisection (cf. ~ 138). Similar reasoning may be applied to curves other than circles, the only difficulty being that of defining the precise conditions under which such a point as D always falls between A and T*. 142. The definite integral. Let us now suppose that f(x) is a function which satisfies the conditions of the preceding sections, so that the region bounded by the curve y=f(x), the ordinates x= a and x = b, and the axis of x has a definite area. Then we proved in Ch. VI (~ 127) that if we could find an 'integral function' of/f(), i.e. a function F(x) such that F' () =f (x), F() = (x) dx, then the area in question is F(b) - F(a). As we saw in Ch. VI, however, to actually determine the form of F(x) is not always practicable. It is therefore convenient to have a formula which represents the area PpqQ and contains no explicit reference to F(x). We shall write b area PpqQ= f(x)dx.....................(1). The expression on the right-hand side of this equation may then be regarded as being defined in either of two ways. We may regard it as simply an abbreviation for F(b)- F(a), where F(x) is some integral function off(x), whether an actual formula expressing it is known or not; or we may regard it as the value of the area PpqQ, as directly defined in ~ 139. It follows from ~ 127 that these two definitions are equivalent in all cases to which we have shown that the direct definition is applicable. An example of a case in which the direct definition (so far as we know at present) cannot be applied is obtained by supposing f(x)=3x2sin (l/x)-xcos (l/x) (x=0), f(x)=O (x=0), and F(x)=x3sin(1/x) (xO0), F(x)=0 (x=0). Then F'(x)=f(x) and F(b)- F(a)=b3 sin (1/b)- a3 sin (l/a). But since f (x) changes its sign infinitely often as x approaches 0, and has an infinity of maxima and minima near x=0, our proof of the applicability of the direct method of definition fails if a and b have opposite signs. * I am indebted to Mr Bromwich for the substance of this section. It enables us to complete the proof of the inequalities sin x < x < tan x assumed in Ex. xxxvii. 14.

Page 278

278 ADDITIONAL THEOREMS IN THE CALCULUS [VII As a matter of fact the scope of the direct definition may be greatly extended. In particular it may be proved applicable to all continuous functions (such as f(x) above). But the restricted proof which we have given will be sufficient for our present purposes. rb The number f(x)dx..........................(1), a is called a definite integral; a and b are called its lower and upper limits; f(x) is called the subject of integration or integrand, and the interval (a, b) the range of integration. The definite integral depends on a and b and the form of the function f(x) only. The reader should be careful to observe that it is not a function of x. On the other hand the integral function F(x) = f (x)dx is sometimes called the indefinite integral of f(x). The distinction between the definite and the indefinite integral is merely fb one of point of view. The definite integral /f(x)dx = F(b)- F(a) is a function of b, and may be regarded as a particular integral function of f(b). On the other hand the indefinite integral F(x) can always be expressed by means of a definite integral, since F(x) = F(a)+ f (t)dt. a But when we are considering 'indefinite integrals' or 'integral functions' we are usually thinking of a relation between two functions, in virtue of which one is the derivative of the other. And when we are considering a 'definite integral' we are not as a rule concerned with any possible variation of the limits; usually the limits are constants such as 0 and 1: and f(x) dx= F(1)- F(0) is not a function at all, but a mere number. If, in the definition of ~ 139, we write Bxv for x+ -Xy^, we obtain the formulae f (x) dx =lim f (xv + ) xv = lim f (xv) xv. a It was this expression of the definite integral as the limit of a sum which suggested the notation which is now invariably used. Since 1/x is a continuous and steadily decreasing function of x for all positive values of x, the investigations of the preceding paragraphs supply us with a proof of the actual existence of the function log x, which we agreed to assume provisionally in ~ 112.

Page 279

142] ADDITIONAL THEOREMS IN THE CALCULUS Examples LXV. Calculation of the definite from the indefinite integral. 1. Show that (b bn+l _ anz+l xn dx = + a n+1 and in particular X| dx -. 0o J+1' 2. f cosmxtdx = (sin-sina) mx dx= (cosma- cos mb)/m. Ja ja 3. 1 d 2 -- arc tan b - arc tan a, I - = 7r. Ja +X 2 X Jo l+2 4 [There is an apparent difficulty here owing to the fact that arc tan x is a many valued function. The difficulty may be avoided by agreeing that in formulae of this kind arc tan x is always to denote an angle lying between -~Tr and +-7r. With this convention the equation x dt fZ clt 1 2 = arc tan x Jol+t2 is true for all values of x, for the integral vanishes for x=0 and increases steadily and continuously towards + -Tr as x increases towards + o, decreases steadily and continuously towards - r as x decreases towards - c. Similarly in the equation ^ dt /(1 _ t2) =are sin x, where -1<x<1, arc sin x denotes an angle always lying between -~Tr and +7TT.] 4. If a and b are both numerically less than unity, I v(1 2 --- = arc sin b - arc sin a. 1 dx 27r 1 dx 7r J o -x+x2 3/3' J 41- x+x2 373 3 dx a 6. If -ir<a<r, -- =,except when a=0, when o 1+2x cos a+ 2 2 sin a the value of the integral is 1, which is the limit of ~a cosec a as a-o0. rl na 7. f1^/(1 -x2)dX=, I fr (a2 -x2) dx=i7ra2 (a>O). Jo dx 8. a b cos = /(a2 b2), if a > [b. [For the form of the indefinite Joa + b cos x J/(a2 -b2)' integral see Ex. Lv. 3. If lal < b the subject of integration has an infinity between 0 and 7r. What is the value of the integral when a is negative and -a> bl ] f'"' dxr 9. I a2 o+b2 2- = ' if a and b are positive. What is the J o a2 cOs2 x + Vsn2x 2ab value of the integral when a and b have opposite signs, or when both are negative?

Page 280

280 ADDITIONAL THEOREMS IN THE CALCULUS [VII 10. Fourier's integrals. Prove that if m and n are integers r27e J cos mx sin nx dx Jo is always equal to zero, and f27r f27r /cosmx cos nx d, f sin mxsin nxdx' Jo Jo are equal to zero unless m = n, when each is equal to r. 11. Prove that cosmx cosnxdx, sinmxsinnxdcx, are each equal to zero except when m = n, when each is equal to 7r; and that 7i. 2n cos mx sin nx dx- =2 2 SO o?> - m or zero, according as n - m is odd or even. 143. Calculation of the definite integral from its definition as the limit of a sum. In a few cases we can evaluate a definite integral by a direct calculation, starting from the definition of ~ 139. As a rule this is by no means as easy as by using the indefinite integral, but the reader will find it instructive to work through a few examples. fb Examples LXVI. 1. Evaluate xdx by dividing (a, b) into n. equal J a parts by the points of division a=xx, x, x1,..., x,,=b, and calculating the limit as n —>oo of (X1 - Xo) f(X0) + (X2- l)f/(Xl)+.. + + (X2n — n- l))/(n, -1). [This sum is b-a ( b-a b -a\ + + - 1) -a} a+ a+- -}+( + +2 i-4... + (n - l) —} b [na+ba {1+2+... +(n-1)]=(b-a) {a+(b-a) (- /n n 2 n2 which tends to the limit (b2 -a2) as n-uc. Verify the result by graphical reasoning.] rb 2. Calculate 2dx in the same way. J a fb 3. Calculate xdx, where 0 < a < b, by dividing (a, b) into n parts a by the points of division a, ar, ar2,... ar"-1, where r= b/a. Extend the fb result to the more general integral x mdx. Ja fb rb 4. Calculate | cosmx dx and I sin mxdx by the method of Ex. 1. a a

Page 281

143, 144] ADDITIONAL THEOREMS IN THE CALCULUS 281 n-1 1 5. Prove that n Y2 2+r2 — r as n-co. r=O -f' [This follows from the fact that qn n s f_ (1/72) n22 + n2+- + " + * n - 1)2 r=1 + (r/n)2' which tends to the limit f l d as 7- co, by the direct definition of the integral.] 1 n-i 6. Prove that n2 ^/(n2 -r2) 4 r. [The limit is /(1- x2)dx.] 144. General properties of the definite integral. The definite integral possesses the important properties expressed by the equations: J b Ja (1) (x) d f ( dp. This follows at once from the definition of the integral by means of the integral function F(x), since F(b) - F(a)= -{F(a)-F(b)}. It should be observed that in the direct definition it was essentially supposed that the upper limit was greater than the lower; thus this method of definition does ra not apply to the integral f(x) dx when a < b. If we adopt this definition as fundamental we must extend it to such cases by regarding the equation (1) as a definition of its right-hand side. (2) f (x)dx =0. (3) f/(x)dx + f(x)dx= f(x)dx. rb rb (4) /~f(x)dx= cJ f (x)dx. a ba (5) {f(x)+ (x)} dx = f ()dx + I (x)dx. The reader will find it an instructive exercise to write out formal proofs of these properties, in each case giving a proof starting from (a) the definition by means of the integral function (3) the direct definition of ~ 139. The following theorems are also important. rb (6) Iff(x)- O for a- - x:b, then f f(x)dx > 0. (i) If we start from the definition by means of F(x) this is merely another way of stating the result of ~ 102, Ch. VI.

Page 282

282 ADDITIONAL THEOREMS IN THE CALCULUS (ii) If we start from the direct definition the result is geometrically intuitive. To supply a formal proof we have only to observe that ov (~ 138) cannot be negative. (7) If, throughout the interval (a, b), L - F (x) G, then L(b- a) _ f(x)dx _ G (b - a). This follows at once if we apply (6) to f(x)-L and G-f(x). (8) f()d = (b - a)f( ), where ~ lies between a and b. This follows from (7). For we can take L to be the least and G the greatest value of f(x) in (a, b). Then the integral is equal to H (b - a), where H lies between L and G. But since f(x) is continuous there must be a value of $ for which f () = H (~ 86). If F(x) is the integral function we can write the result of (8) in the form F(b) -F(a)= (b - a) F' (), so that (8) appears now to be only another way of stating the Mean Value Theorem of Ch. VI, ~ 106. We may call (8) the First Mean Value Theorem for Integrals. (9) The Generalised Mean Value Theorem for integrals. If ((x) is positive and L and G are defined as in (7), then rb rb rb L fl (x) dx_<- f (x) ( (x) dx_- Gf (x)dx; rb rb and f (x) (x)dx =f() (x)dx, where v is defined as in (8). This follows at once by applying (6) to the integrals ff (x) - L} (x) dx, J G - f(x)} (x) dx. The reader should formulate for himself the corresponding result which holds when q (x) is always negative. Examples LXVII. 1. Show, by means of the direct definition of the definite integral, and equations (1)-(5) above, that (i) f (X2) dx=2 f (x2)dx, f X_ (X2)d=. (ii) f ( (cosx)dx = (sin )(sin qsx) dx. (iii) f ((cos2x)dx=m ( CI(cos2x)x, Jo Jo

Page 283

144] ADDITIONAL THEOREMS IN THE CALCULUS 283 m being an integer. [The truth of these equations will appear geometrically intuitive, if the graphs of the functions under the sign of integration are sketched.] 2. Prove that | sin n dO = rr or 0 according as n is odd or even. [Use the formula (sin nO)/(sin 0)= 2 cos (n - 1)+ +2 cos (n -3) +..., the last term *being 1 or 2 cos 0.] 3. Prove that sin nO cot 0 d=0 or rr according as n is odd or even. 4. If (0))= a +al icos +b sin +a2 cos 20 +...+ a, cosn +b sin n, then r27r r2r r27r f2((0)dO =27r~ao, f cos kO (O))dO =7rak, j sin k q (0) d=7rbk, if k is a positive integer less than n. If k > n the value of each of the last two integrals is zero. [Use Ex. LXV. 10.] 5. If ( (0)=ao+alcos +a2cos20+... +ancosnO then f c (0)dO =rag, cos kd (0()d= -7rak, if k is a positive integer less than n. If k$>n the value of the last integral is zero. [Use Ex. LXV. 11.] 6. If (x)! < (x) for aC x5 b, then Jf odx< 0 f dx. a Ja 7. Prove that 0 <:f2(sin O)'+ldO < f(sin n dO, 0 < (tan O)n+ dO < (tan O)n dO. 8*. If n>l, '5< f (1-x 2)- <524. [The first inequality follows from the fact that /(1 - x2n)<1, the second from the fact that /(1 - x2n) >^ (1 - x2).] 9. Prove that < | /(4- 2+x <. 0J (4 -2+X3) 6 10. Prove that (3x+8)/16< 1/V/(4-3x+x3)< 1/,/(4-3x) if 0<x<1, and hence that 19 f (4 + )< 2 o J(4 - 3x+ X3) *2 //dx 11. Prove that -573< f (4-3x+x3)<-595. [Put x=1+u: then replace 2 + 3u2 + t3 by 2 + 42 and by 2+ 3u2.] 12. If a and q are positive acute angles fdx __)____c 0 < j /(1 -sin2 a sin2 x) /(1 - sin2 a sin2 ) ~ If a-==1 rr, the integral lies between '523 and '541. * Exs. 8-12 are taken from Prof. Gibson's Elementary Treatise on the Calculus.

Page 284

284 ADDITIONAL THEOREMS IN THE CALCULUS [VII 13. Prove that | ( (x)dx i _ (x) dx. [Let /, y,... K be the values of x for which q5 (x) changes sign. Then (x)4dx\ ' (xf)x)d + (f ) dx +... + f) (x)dX J a a K 3 P =(|+ +...+| )) )|(.s)|=| |(X) I d. The reader should also attempt to prove this result directly from the definition of the integral as the limit of a sum 2v (~ 138), showing that, if 2I' is the corresponding sum for the function \ (x) |, then | 2 I c 2v'.] 14. If If(x)- | M, then | f(x) )dx M A j )d. 145. Integration by parts and by substitution. It follows from Theorem (3) of ~ 94 that -b b f (.) 0'(x)dx =f(b) ( (b) -f(a) (a) -.f' (x) (x) dx This formula is known as the formula for integration by parts, as applied to definite integrals. Again we know (~ 108) that if F(t) is the integral function of f(t), then j f{ (x)} '(x) dxF= ()}. Hence if ) (a) = c, 4 (b) = d, we have f(t)dt = F(d) - F(c) = F (b)} - F{ (a)} =f f{J (x)} )'(x)dx; which is the formula for the transformation of a definite integral by substitution. The formulae for integration by parts and for transformation often enable us to evaluate a definite integral without the labour of actually finding the integral function of the subject of integration, and sometimes even when the integral function cannot be found. Some instances of this will be found in the following examples. That the value of a definite integral may sometimes be found without a knowledge of the integral function is only to be expected, for the fact that we cannot determine the general form of a function F(x) in no way precludes the possibility that we may be able to determine the difference F(b)-F(a) between two of its particular values. But as a rule this can only be effected by the use of more advanced methods than are at present at our disposal.

Page 285

145] ADDITIONAL THEOREMS IN THE CALCULUS 285 Examples LXVIII. 1. Prove that b xf" (x)dx= {bf' (b) -f(b)} - {af' (a)-/ -f(a). a rb 2. More generally, x"nmf(+ 1)(x)dx =F(b)-F(a), where F(x)-= nf(m)(x) - _nxn-lf (m - 1) x + (m -l) xn - 2f(m - 2)X -... +( - l )'n!f(x). 3. Prove that arc sin xdx = rr-l, x arc tan xdx= 7r- -. 4. Prove that f7 x cos xsin xx dx - J o(a2 cos2 x + b2 sin2 x)2 - 4ab2 (a + b)' [Integrate by parts and use Ex. Lv. 9.] rx fx Tx 5. If fi(x)=f f(t)dt, f2(x)= fi(t)dt, *...fk()=J fkl(t)dt, Jo Jo Jo then fk (X) =(k1)! () (X- t) - t)-ldt. [Integrate repeatedly by parts.] 6. If u, n,= xm(1 - )"dx, where m and n are positive integers, prove by integration by parts that (m + n+1) Ulm, n'ftn, n-1_ and deduce that mn\ mn! n! urn, (m+n +1)!' 7. If un = (tan O)ndO, prove that u, + n-2=l/(n- 1). Hence evaluate Jo the integral for all positive integral values of n. [Put (tan O)n= (tan O)n-2(sec2 0 - 1) and integrate by parts.] 8. Deduce from the last example that u, lies between 1/{2(n-1)) and 1/12 (n+ )}. 9. If un=, (sin x)2dx, prove that Ut= {(n- l)/n}u,,_2. [Write (sin x)"- sinx for (sinx)" and integrate by parts.] 10. Deduce that u, is equal to 2.4.6.. (n-l) 1 1.3.5..(- 1) 3.5.7.. 2 r 2.4.6.. n according as n is odd or even. 11. The Second Mean Value Theorem. If f(x) is a function of x which has a differential coefficient of constant sign for all values of x from x= a to x= b, then there is a number 4 between a and b such that f () (x) dx =f (a) (x) dx +f(b) f (x) Cx. [Let (t)dt = (x). Then Ja

Page 286

286 ADDITIONAL THEOREMS IN THE CALCULUS [VII |f/(x) () dx= f(x) )' ()d=-( /( b) ~b(b) f f '() ()dx ba =f (b) 4(b) - 4 ( )f f' (x) dx, by the generalised Mean-value Theorem of ~ 144: i.e. fx f(x) (x) dx =f(b) F (b) - {f(b) -f(a)} b (), which is equivalent to the result given.] 12. Bonnet's form of the Second Mean Value Theorem. If f' (x) is of constant sign, and f(b) and f(a) -f(b) have the same sign, then /f (x) (x dx =f (a) p(x) dx a a where X lies between a and b. [For f (b) (b) + {f(a) -f(b)} ()=f (a), where i lies between (4 () and @ (b), and so is the value of, (x) for a value of x such as X. The important case is that in which 0 _f (b) _f(x)f (a).] Prove similarly that if f(a) and f(b) -f(a) have the same sign f(x) (x) dx=f () ) (x) dx rb where X lies between a and b. [Use the function 4+()=/ b (x)dx. It will be found that the integral can be expressed in the form f (a) (a) + {f(b)-f(a)} + (~). The important case is that in which 0 f/(a)f f(x) f (b).] XI xsin x 2 13. Prove that,x < X if X' >X>0. [Apply the first Ix x XX formula of Ex. 12, and note that the integral of sin x over any interval whatever is numerically less than 2.] 14. Establish the results of Ex. LxvII. 1 by means of the rule for substitution. [In (i) divide the range of integration into the two parts (- a, 0), (0, a), and put x= -y in the first. In (ii) use the substitution x=irr-y to obtain the first equation: to obtain the second divide the range (0, Tr) into two equal parts and use the substitution x=-rr+y. In (iii) divide the range into m equal parts and use the substitutions x=rx +y, 27r+y, etc.] rb b 15. Prove that I F(x)dx= f F(a+ b —x)dx. Ja Ja 16. Prove that (cos sin x)2?dx = 2- (cos x)mdx. 7 r 'rr 17. Prove that x(f (sin x)dx=Pr r b(sinx)dx. [Put x=7r-y.] Jo o 18. Prove that |r sin x dx= r2 1 + cos2 x 4

Page 287

146] ADDITIONAL THEOREMS IN THE CALCULUS 287 19. Show by means of the transformation x= a cos2 + b sin2 0 that f {(x - a)(b - )} dx= 17r (b -a)2. 20. Show by means of the substitution (a+b cosx)(a-b cosy)=a2-b2 that j(a+ bcos x)-n dx=(2 - b2)-(nf (a- b cosy)- 'ldy, if n is a positive integer and a>l b, and evaluate the integral when n=l, 2, 3. 21. If m and n are positive integers, then J(x-a)T (b-x )n dxv = (b-a) + 1 + + 1)! [Put x=a+ (b-a) y, and use Ex. 6.] 146. Proof of Taylor's Theorem by Integration by Parts. We shall conclude this chapter by giving the alternative form of the proof of Taylor's Theorem to which we alluded in ~ 129. Let f(x) be a function whose first n derivatives are continuous, and consider the integral R = (n-1)! j1 - t)-lf () (a + th)dt. Integrating repeatedly by parts we find that Rn= (- 1 (1 - t) t { (n- (a + th)}dt hn — f -) (a) + h!(1 -t)-2f(n-i)(a + th) dt (n - 1 - - 2)hn l h —2 n-( 1)!/'-(a)-( (2)! f-2') (a) - -hf'(a) + f(a+ th)dt =f (a + h) -f(a) - hf' (a)-...- f (1"-) (a). Hence hn-l f(a + h) =f(a) + hf'(a)+... + ( f (f- (a) (n - 1)! + -- ) (1 - t)n-lfn) (a + th)dt.........(1), which may be regarded as another form of Taylor's Theorem.

Page 288

288 ADDITIONAL THEOREMS IN THE CALCULUS Now, by Theorem (9) of ~ 144, f(i- t)-f(n) (a + th) dt =f(1- t)n-p(l- t)p-lf () (a + th)dt =(1- -O)-Pf ()(aa + h) ( -t)P- dt, where 0< 0 < 1, p being any positive integer not greater than n. Hence R, = (1 - O)n-pf(n(a + h) hn/p (n- 1)!.........(2). If we take p = n we obtain Lagrange's form of R,. If on the other hand we take p = i we obtain Cauchy's form, viz. = (1 -o0)n-f )(aC + Oh) h/(n- 1)!.............(3). The truth of the equation (1) may also be proved as follows. Let (in the notation of ~ 129) (b - X)n-1 f (71-1)(x). F, (x) =f (b)-f(x) - (b- x)f' (x)... - - (-X). Then Fn' (x)= - ((b- X)1() (n ) by differentiation, and so Fn,(a)==F, (b) - F'X (x) dx-( 1) | (b - x)-l f() (X) dx; and the equation (1) above follows at once on writing b = a+h, x =a +th. 147. Application of Cauchy's form to the binomial series. If f(x)=(l+x)m, where m is not a positive integer, Cauchy's form of the remainder is m= (m - 1)... (m - n+ l)(1 - O)n~ -.~., Rn- 1. 2... (n- 1) (1 + X')n*-~1 -' Now (1- O)/(1+0Ox) is less than unity, so long as - <x <1, whether x is positive or negative (it will be remembered that the difficulty in using Lagrange's form, in Ex. LVIII. 2, arose in connection with negative values of x); and (1 +0x)m"l is less than a constant K for all values of in, being in fact less than (1 +lx )m-l if n > 1 and than (1 -I x )m-1 if m < 1. Hence I R,<K ml ^ j|^| Pn say. But (Ch. Iv, Misc. Ex. 5) pn,-0, and so Rn-O0, as n- oo. The truth of the Binomial Theorem is thus established for all real values of m and all values of x between - 1 and +1. * The method used in ~ 129 can also be modified so as to obtain these alternative forms.

Page 289

146, 147] MISCELLANEOUS EXAMPLES ON CHAPTER VII 289 MISCELLANEOUS EXAMPLES ON CHAPTER VII. 1. Verify the terms given of the following Taylor's Series: (1) tanx=x+ X3+ L25 5+317+..., 1 X + (1) 3 1 315 (2) secx=l+ 1X2 + 2_ 4 + -2-1o6 +,6 (3) xcosecx=l- 1+X2+30X_4+..., (4) xcotX=l-~ —2_ I15 x4-.... 2. If/(x) and its first n + 2 derivatives are continuous, and f(n +1)(0) =0, and 06 is the value of 0 which occurs in Lagrange's form of the remainder after n terms of Taylor's Series, show that n 1= n f(76 + 2)(0) n+l 2 ( (6 + 2) f (n 1) (0) where ex —0 with x. [Follow the method of Ex. LVII. 12.] 3. Ver'ify the last result when f()= 1/(1 +x). [Here (1 + On )n+ l= 1 +X.] 4. Show that if f(x) has derivatives of the first three orders f(b)- =(a) + I (b - a) {f'(a) +/'(b)}- -1 ](b - ( a)3f"' (a), where a <a< b. [Apply to the function f(x) -f(a)- I (x- a) {f' (a) +f ()} -- /a [f(b)-f(a) - (b - a){/'(a)+f'(b)}] arguments similar to those of ~ 129.] 5. Show that under the same conditions (b)=f/(a)+(b - a)f' {(a+ b) + b - a)3f"'(a). 6. Show that iff(x) has derivatives of the first five orders f (b) =f(a) + (b - a) { f'(a) +f' (b) + 4/ (a )} - (b - a)f(5) (a). 7. Show that under the same conditions f(b) =f(a)+ (b- a){ '(a) (}1 a)2{f" (b -f"(/a) }+ 7-0 (b - a)5f()(a). 8. Establish the formulae (i) f(a) f(b) =(a-b) f(a) /'(3) g(a) g(b) g(a) g'(3) where 3 lies between a and b, and (ii) f ( ) f (b ) f=(b- (c-a) (a-b) (a) f'(a) f" (y) g(a) g(b) g(c) g(a) g'(3) g"(y) h (a) h(b) h(c) h(a) h'(3) h"(y) H. A. 19

Page 290

290 MISCELLANEOUS EXAMPLES ON CHAPTER VII where / and y lie between the least and greatest of a, b, c. [To prove (ii) consider the function ~)(x)- f (a) f (b) f(x) (x-a)(x-b) /(a) f (b) f(c) g(a) g() g(x) c-b) g(a) g(b) g(c) h(a) h(b) h(x) h(a) h(b) h(c) which vanishes for x=a, b, c. Its first derivative, by Th. B of ~ 102, must vanish for two distinct values of x lying between the least and greatest of a, b, c; and its second derivative must therefore vanish for a value y of x satisfying the same condition. We thus obtain the formula fa) f (b) f(c) =(c-a)(c-b) f(a) f(b) f"(y). g(a) g(b) g(c g(a) g() g"(y) h (a) h (b) h(c) h(a) h (b) /" (y) The reader will now complete the proof without difficulty.] 9. If F(x) is a function which has continuous derivatives of the first n orders, of which the first n —1 vanish for x=O, and A<F(I)(x) _B for 0 x — h, then A (x/n!) - F(x) B (x/n!) for 0 x _ h. Apply this result to yn -1-1 f(X) — f(O) - xf' () -... - ( f(-1(0) and deduce Taylor's Theorem. 10. If Ah J(x)=q(x)-q((x+ h+), Ah2k(x)=Ahj{Ah (x)}, and so on, and q)(x) has derivatives of the first n orders, show that Awhr ls b n (d nh. Dedue that, if (n)() is where ~ lies between x and x+%h. Deduce thatj if +(n)(x) is continuous, {AOh8(x)}/Ihn-( ()(x) as h-a-0. [This has already been proved when n=2 (Ex. LVII. 13).] 11. Deduce from Ex. 10 that xn-man xm —m(vm-1)...(m —n+l)hn as x- + -o, m being any rational number and n any positive integer. In particular prove that x^Jx {Jx - 2V(x+ 1)+ V(x+ 2)}) —. 12. Suppose that y= (x) is a function of x with continuous derivatives of at least the first four orders, and that 0 (0)=0, 0b' (0)=1, so that y= b (x) =x+ a2 2 + a33 + (a4 + Ex) X4, where ex —0 with x. Establish the formula X = (y) =y - asy2 + (2a22 - a3) y3 - (5a23 - 5a2 3 + 4 + Ey) Y4, where y-~0 with y, for that value of x which vanishes with y; and prove that, as x —0, + (X)+(X)-2 22 -42

Page 291

MISCELLANEOUS EXAMPLES ON CHAPTER VII 291 13. The coordinates (~, 7) of the centre of curvature, at the point (x, y), of the curve x=f(t), y =F(t) are given by -( - x)/' = (, -. y)/x' (X'2 +y2)/(x'y- "y') and its radius of curvature is (X2 +y'2)312/(.' y'- X/Iy'), dashes denoting differentiations with respect to t. 14. The coordinates ($, 7) of the centre of curvature, at the point (x, y), of the curve 27ay2 = 4x3 are given by 3a (f+ x) + 2=, =4y + (9ay)/. (Math. Tri. 1899.) 15. Prove that the circle of curvature at a point (x, y) will have contact of the third order with the curve if at that point y3(1 +.yl2) =3yly22. Also prove that the circle is the only curve which possesses this property at every point; and that the only points on a conic which possess the property are the extremities of the axes. 16. The conic of closest contact with the curve y= a2+ bx3 + cx4 +... + kx%, at the origin, is a3y-=a4x2+a2bxy/+(ac-b2)y2. Deduce that the conic of closest contact at the point ($, r) of the curve y=f(x) is 18q23 T= 9=24 (X - )2 + 6q2 q3 (X - T) T+ (3q274 - 4q32) T2, where T=(y- r)-il7 (x- ). (Math. Trip. 1907.) 17. Homogeneous functions*. If u=xxf(y/x, z/x,...), u is unaltered, save for a factor Xk, if x, y, z,... are all increased in the ratio: 1. In these circumstances u is called a homogeneous function of degree n in the variables x, y, z,.... Prove that if u is homogeneous and of degree n, then x +y-+z- +...=nu. Gdx v y dz This result is known as Euler's Theorem on homogeneous functions. 18. If u is homogeneous and of degree n, du/dx, du/dy,... are homogeneous and of degree n- 1. 19. Let f(x, y)=0 be an equation in x and y (e.g. xn+yn- x=0), and let F(x, y, z)=0 be the form it assumes when made homogeneous by the introduction of a third variable z in place of unity (e.g. x'n+yn - xz'-l= O). Show that the equation of the tangent at the point (e, r) of the curve f (x, y)=0 is xFE + yF, + zF = 0, where Ft, Fl, Fi denote the values of Fx, Fy, F, when x=-, y=q, z=C=l. * In this and the following examples the reader is to assume the continuity of all the derivatives which occur. 19-2

Page 292

292 MISCELLANEOUS EXAMPLES ON CHAPTER VII 20. If u and v are functions of ~ and 77, which are themselves functions of x and y, and the Jacobian (Ex. LXIV. 11) of u and v with respect to $ and 77 is denoted by d(u), then d ($, 17) d (t, v) d (u, v) d (, 7) d(x, y) cl (, ) * d(x, y)' Extend the result to any number of variables. 21. Let f(x) be a function of x whose derivative is 1/x and which vanishes for x==. Show that if u =f/(x) +(y), v = xy, then uzx Vy -UyvO =, and hence that u and v are connected by a functional relation. By putting y=1, show that this relation must be f(x) +f(y) =f(xy). 22. Prove in a similar manner that if the derivative of f(x) is 1/(1 +x2), and/(0)= 0, then f(x) must satisfy the equation f(x) +f(y) =f(x+y)/(1 - xy)}. 23. Show that if a functional relation exists between =f /()+/f()+f(z), v=f ()f() +f (z)f(x) +f(x)f(y), wo=f (X)f s)f(Z), thenf must reduce to a constant. [The condition for a functional relation will be found to be f' (Q)f' (y)f' () f(s) -f(z) {f(z) -f(W)} {f/()-f(js)}= -] 24. Suppose that x and y are functions of two variables a and 3, and that the equation of a curve is given by a relation between a and 3, say f (a, /3)=0. Then we may regard a, /3 (and therefore x, y) as functions of an auxiliary variable t. If dashes denote differentiations with respect to t, we have fa' +f/g''=, so that a' and 3' are proportional to -fe and fa; and by choosing t appropriately we can reduce this proportionality to equality, so that a'= -fe, 8'=fa. Then ' =faX -fpXa=J, =oayp-fap=K, say. Show that the measure of curvature of the curve is {Jd(, /) d(a, )- (J2+K2)32. (Jlath. Trip. 1899.) 25. If u=0, v=0, w=0 are the equations of three circles, rendered homogeneous as in Ex. 19, the equation d (, v, w)= d (x, y, z) represents the circle which cuts them all orthogonally. (iath. Trip. 1900.) 26. If A, B, C are three functions of x such that A A' A" B B' B" C C' C" vanishes identically, we can find constants X,,u, v such that XA++,B+vC vanishes identically: and conversely. [The converse is almost obvious. To prove the direct theorem let a=BC'-B'C, etc. Then a'=BC" -B"C, etc.,

Page 293

MISCELLANEOUS EXAMPLES ON CHAPTER VII 293 and in virtue of the vanishing of the determinant ty' -,f'y, ya' - ay, a' - a'23 all vanish; from which it follows that the ratios a: 3: y are constant. But aA + 3B+yC=0.] 27. Let z be a function of x and y, and zx, zy its two first derivatives. Now let x be expressed as a function of y and z, and let xy, x, denote its first derivatives. Prove that Xy = —zy/Z, X = 1 /Zx. [We have the approximate equations 8z = zx 8x + zy, 8x= Xy 8y + X z. The result of substituting for Ax in the first is 8z = (zx xy + zY) y + zx x, 8z. Since 8y and 3z are independent this can be true only if zxy +z y=0, zxx-=l.] 28. Four variables x, y, z, u are connected by two relations. Show that o )X U = z Z -n XV yS — z S y X -- 1 yfz ux, / ~ = XY -Y=ZX 1, xfzY +yfZ y -1, where yu denotes the derivative of y, when expressed as a function of z and u, with respect to z. (Math. Trip. 1897.) 29. Prove the formulae -d f f(t)dt=f (), d+ JO f(t)dt=f(x) qY(x). 30. Find A, B, C, X so that the first four derivatives of Ja +f(t) dt -x [Af(a) + Bf(a + Xx) + Cf(a +x)], vanish for x=0; and A, B, (, D, X, M/ so that the first six derivatives of J +X(t)dt - x [Af(a) +Bf(a +Xx)+ Cf(a +fx) + Df)(a +)] vanish for x=0. 31. If a >, ac - b2 > 0, and x > x, then __ ___ 1 ac tan (x - x0) (ac - b2) j, ax2 +2bx+c /(ac - b2) axxo + b (xl +x) + c the inverse tangent lying between 0 and 7rr. 32. Evaluate the integral -f sin a dx For what values of a is i 1 - 2x cos a + x,2 the integral a discontinuous function of a? (iath. Trip. 1904.) [The value of the integral is rr if 2n7r<a<(2n+l)7r, and -~rr if (2n - 1) r < < a 27rr, n being any integer; and 0 if a is a multiple of r.] 33. If ax2+2bx+c>0 for xo x xl, f(x) =^/(ax2+2bx+c), and y=f(x), yo=f(xo),?y=f(x1), X=(xl-xo)/(yi+l o), * In connection with Exs. 31-33 see Mr Bromwich's paper quoted on p. 236.

Page 294

294 MISCELLANEOUS EXAMPLES ON CHAPTER VII h edxn J d 1 1+ X Ja 2 then. y - la log 1-X/a' aarctan {X/(-a)}, according as a is positive or negative. In the latter case the inverse tangent lies between 0 and -7r. [It will be found that the substitution t=(x- o)/(y+yo) reduces the integral to the form 2 1 -2.] J 1 - at2 34. Prove that /(1 dx = r 1. 35. If a> l, then /( dx =-r {a-/(a21) / _ (1 - x dx: '{a-V7(a2- 1)}. 36. Ifp > 1, 0<q < 1, then dx 20o J[ {l + (p2 - )X) {l - (1 - q2) x] (P + q) sin X where co is the positive acute angle whose cosine is (1 +pq)/(p +q). 37. If a>b>, then j/ sb =in2 {a -/(a2-b2 (Math. Trip. 1904.) 38. Prove that, if a>V(b2+c2) and c>0, then 7r - - 2 _ arc tan j(V /) V (a2 - b2 - 02)1, Joa + b cos + c sin 0= (a2 - b2 ) arc tan /c) /(a- b2 - while if c<0 we must add 27r/^/(a2- b2 - c2) to the result, the inverse tangent in each case being in absolute value less than ~ 7r. 39. If f(x) is continuous and never negative, and f 1(x) dx=0, then f(x)=0 for a__xb. [If f(x) were equal to k(k>0) for x=, say, we could, in virtue of the continuity off, find an interval (d - 8, +a) throughout which f(x) > k; and then the value of the integral would be greater than k.] 40. If P, (x)=( _ )n! (i) {( - a) ( - X)}n Pn(x) is a polynomial of degree n, which possesses the property that J P (x) O (x) dx=, a if (x) is any polynomial of degree less than n. [Integrate by parts m + 1 times, if m is the degree of 0(x), and observe that 0(m + l)(x) 0.] 41. Prove that Pm(x) Pn(x)d =O, if mrz+n, but that if zr=n the value a of the integral is (3 - a)/(2n+ 1). 42. If Qn(x) is a polynomial of degree n, which possesses the property that Q, (x) 0(x)dx=0 if (x) is any polynomial of degree less than n, then () is a constant multiple of P ). Qn (x) is a constant multiple of Pn (x).

Page 295

MISCELLANEOUS EXAMPLES ON CHAPTER VII 295 [We can choose K so that Q - K P, is of degree n - 1: then f Q(Q,-P )d=O, K P P(Ql- KP)dx=O, J a y a and so J (Q&- KPn)2 d=O. Ja Now apply Ex. 39.] 43. Approximate Values of definite integrals. Show that the error rb in taking (b - a) {( (a) + qs (b)} as the value of the integral () (x) dx is less 2 V C~/~~\j y \/(I~ VI~VCUIL aZ than - (b -a)3, where i is the maximum of q" (x) ] in the interval (a, b); and the error in taking (b- a) q/ {2 (a+b)} is less than -24M(b-a)3. [Write f'(xv)=(x) in Exs. 4, 5.] Show that the error in taking i(6- a) q (a) +4 U(b)+4 ( ) as the value is less than -2so M(b -a)5, where 1 is the maximum of +(4)(x). [Use Ex. 6. This rule, which gives a very good approximation, is known as Simpson's Rule. It amounts to taking one-third of the first approximation given above and two-thirds of the second.] Show that the approximation assigned by Simpson's Rule is the area bounded by the lines x= a, x= b, y=O and a parabola with its axis parallel to OY and passing through the three points on the curve y= b(x) whose abscissae are a, 2((a + b), b. It should be observed that if +(x) is any cubic polynomial, +(4)(X)= 0, and Simpson's Rule is exact. That is to say, given three points whose abscissae are a, (a + b), b, we can draw through them an infinity of curves of the type y =a + x+ yx2 + x3; and all such curves give the same area. For one curve 8 =0, and this curve is a parabola. 44. Apply Simpson's Rule to the calculation of wr from the formula 47rV= 1 -. [The result is '7833... If we divide the integral into two, J - 1 +x2' from 0 to I and ~ to 1, and apply Simpson's Rule to the two integrals separately, we obtain '7853916.... The correct value is -7853981....] r5 45. Show that 8-9< / (4+x2)dx<9. (Math. Trip. 1903.) J 3 46. Calculate the integrals x dx [ dx 7r. sinx I,/ TT-\?:(sin x) dx dX, Jo1+x' Jo o(1 +4), j sm x)d J x to two places of decimals. [In the last integral the subject of integration is not defined for x=O: but if we assign to it, when x=0, the value 1, it becomes continuous throughout the range of integration.]

Page 296

CHAPTER VIII. THE CONVERGENCE OF INFINITE SERIES AND INFINITE INTEGRALS. 148. IN Ch. IV we explained what was meant by saying that an infinite series is convergent, divergent, or oscillatory, and illustrated our definitions by a few simple examples, mainly derived from the geometrical series 1 + x + x2 +... and other series closely connected with it. In this chapter we shall pursue the subject in a more systematic manner, and prove a number of theorems which enable us to determine when the simplest series which commonly occur in analysis are convergent. 149. Series of Positive Terms. We pointed out in Ch. IV (~ 70) that the easiest type of series to consider from the point of view of its convergence is that in which all the terms are positive (using positive to include zero). We shall consider such series first, not only because they are the easiest to deal with, but also because the discussion of the convergence of a series containing negative or complex terms can often be made to depend upon a similar discussion for a series of positive terms only. When we are discussing the convergence or divergence of any series we may disregard any finite number of terms. Thus if a series contains a finite number only of negative or complex terms, we may omit them, and apply the theorems which follow to the remainder. 150. It will be well to recall the following fundamental theorems established in ~ 70. A. A series of positive terms must be convergent or diverge to + oo, and cannot oscillate.

Page 297

148-151] CONVERGENCE OF SERIES AND INTEGRALS.297 B. The necessary and sufficient condition that E,.n should be convergent is that there should be a number G such that Uto + u1 +... + n < G for all values of n*. C. The comparison theorem. If 1u,n is convergent, and Vn 5 u, for all values of n, then 2v, is convergent, and Zv, s z-,U. More generallyt, if vn Ku,,, where K is a constant, 2vs is convergent and Zv,, KZu,. And if Zu, is divergent, and v} V Ku,, then Zv, is divergent. Moreover, in inferring the convergence or divergence of Zvl by means of one of these tests, it is sufficient to know that the test is satisfied for sufficiently large values of n (i.e. for all values of n greater than a definite value no). But of course the conclusion that Yv,, 5 KZu,% does not necessarily hold in this case. A particularly useful case of this theorem is D. If Zun is convergent (divergent) and un/vn tends to a limit (other than zero) as n -o c, then Zv, is convergent (divergent). 151. First applications of these tests. The one important fact which we know at present, as regards the convergence of any special class of series, is that 2rn is convergent if r < and divergent if r _ 1. It is therefore natural to try to apply Theorem C, taking Un =r'1. We at once find 1. The series Zv,, is convergent if v, = Kr", where r < 1, for all sufficiently large values of n. If K = 1, this condition may be written in the form v,,lln c r. Hence we obtain what is known as Cauchy's test for the convergence of a series of positive terms; viz. 2. The series Evn is convergent if vn1In l r, where r < 1, for all sufficiently large values of n. There is a corresponding test for divergence, viz. * It is of course a matter of indifference whether we denote our series by ulz+u2+... (as in Ch. IV) or by ulo+uL+... (as here). Later in this chapter we shall be concerned with series of the type ao +ax+a2x2+....; for these the latter notation is clearly more convenient. + Only the first part of this theorem was actually stated in ~ 70. The reader will be able to supply the proof of the second part.

Page 298

298 CONVERGENCE OF SERIES AND INTEGRALS 2 a. The series EVn is divergent if vill > 1 for an infinity of values of n. This hardly requires proof, for vl/n 1 involves v,n 1. The two theorems 2 and 2 a are of very wide application, but for some purposes it is more convenient to use a different test of convergence, viz. 3. The series ZVn is convergent if vn+i/vn < r, where r < 1, for all sufficiently large values of n. To prove this we observe that if vn+l/v, - r for n _ ni0, then -Vn. = - - rn-no.v Vn — Vn —2 Vo and the result follows by comparison with the convergent series 2r?. This is known as d'Alembert's test. We shall see later that it is less general, theoretically, than Cauchy's, in that Cauchy's test can be applied whenever d'Alembert's can, and sometimes when the latter cannot. Moreover the test for divergence which corresponds to d'Alembert's test for convergence is much less general than the test given by Theorem 2 a. It is true, as the reader will easily prove for himself, that if v+1l/vn _ r > 1 for all values of n, or all sufficiently large values, then v,,s is divergent. But it is not true (see Ex. LXIX. 10) that this is so if only vn+i/vn 2 > 1> for an infinity of values of n, whereas in Theorem 2 a our test had only to be satisfied for such an infinity of values. None the less d'Alembert's test is very useful in practice, because when Vn is a complicated function v,,+/Vn is often much less complicated, and so easier to work with. In the simplest cases which occur in analysis it often happens that vn+i/v? or Vnl/1 tends to a limit as n -> o *. If this limit is less than 1 it is evident that the Theorems 2 or 3 above are satisfied. Thus 4. If v,'1n or vZl+/vf, tends to a limit less than unity as n-o, the series svn is convergent. It is almost obvious that if either finction tend to a limit greater than unity, then 2vn is divergent. We leave the formal proof of this as an exercise to the reader. But when v/lln or vn+l/v, tends to 1 all these tests fail completely, and they fail * It will be proved in Ch. IX (Ex. LxxxIX. 33) that if v,+llVn -1, then vnl/n I, but that the converse is not true.

Page 299

151] CONVERGENCE OF SERIES AND INTEGRALS 299 also when v,,ll or v,+l/v,, oscillates in such a way that, while always less than 1, it assumes for an infinity of values of n values approaching indefinitely near to 1. And the tests which involve vn+/vn fail even when that ratio oscillates so as to be sometimes less than and sometimes greater than 1. When vl/n behaves in this way Theorem 2 a is sufficient to prove the divergence of the series. But it is clear that there is a wide margin of cases in which some more subtle tests will be needed. Examples LXIX. 1. Apply Cauchy's and d'Alembert's tests (as specialised in Theorem 4 above) to the series 2nrn. [Here vn+l/vn={(n+l)/n}r ---r, so that d'Alembert's test shows at once that the series is convergent if r<1, divergent if r>l. The test fails if r=l: but the series is then obviously divergent. Since lim nln= 1 (Ex. xxx. 11) Cauchy's test leads at once to the same conclusions.] 2. Apply these tests to 2n2r7, 2r /n, Er'/n2. 3. Consider the series 2 (Ank + Bnk- ~ +... + K) r. [Let the coefficient of rn be denoted by P(n). Then if A is positive, as we may suppose, P(n) is positive for all sufficiently large values of n; and since lim {P (n + 1)/P(n)}= 1 (Ex. XXIx. 4), d'Alembert's test gives convergence or divergence according as r < 1 or r > 1. When r= 1 the series is obviously divergent.] Ank + Bnkc-l +... + K 4. Consider Ank+Bnl+ r... antK-3-n1~.. +X [D'Alembert's test gives the same results as in Ex. 3, when r= 1. The case in which r=l requires further consideration. It is evident that the series is divergent if r= 1 and k > 1.] 5. We have seen (Ch. IV, Misc. Ex. 18) that the series 21/{n(n+l1)}, 1/{ + (n+1)...(n +p)} are convergent. Show that Cauchy's and d'Alembert's tests both fail when applied to them. [For lim un/ = lim (n + 1/u)= 1.] 6. Show that the series 2n-p, where p is an integer not less than 2, is convergent. [Since lim {n(n+ 1)...(n+p- )}/nP=l, this follows from the convergence of the series considered in Ex. 5. It has already been shown (~ 70) that the series is divergent if p=1.] 7. Show that the series Ank + Bnk1 +...+ K anz + onl-l +... + X ' is convergent if 1 >k+2, but divergent if l=k+ 1. [If v, is the general term of the series, limrn nkv=A/a.]

Page 300

300 CONVERGENCE OF SERIES AND INTEGRALS [VIII 8. If mn is a positive integer, and m, + 1 > m,, the series s r"n is convergent if r < 1, divergent if r _ 1. For example the series 1 +r+rr4+r9+... is convergent if r < 1, divergent if r _ 1. 9. Sum the series 1 2q+2l24+... to 24 places of decimals when q=-l and to 2 places when q=-9. [If q='l the first 5 terms give the sum 1-200200002000000200000000 and the error is 2q25 + 2q36 +... <2q25 + 2q36 + 2q47 +.. = 2q25/(1 - q1) < 3/1025. If q=-9 the first 8 terms give the sum 5'458..., and the error is less than 2q64/( - q17) <.003.] 10. If 0<a<b<], the series a-b+a2+b2+a3 -... is convergent. Show that Cauchy's test may be applied to this series, but that d'Alembert's test fails. [For V2n + 1/V2n = (b/a)L + 1 - + cc, V2t + 2/V2t + 1 = b (ab) + 2 0.],2 X3 x2 X3 11. The series 1l+x + + +... and 1 +l + - +... are convergent for all positive values of x. 12. If iul is convergent so is S u,/(1 +,n): and 2 u,/(1 - u,) is also convergent if u,, < 1. 13. If Eu, is convergent so is 2tu2. 14. If 22,2 is convergent so is 2u,/n. [For,2+(1/n2) >2ui/n and (1/n2) is convergent.] 1 1 15. Show that 1+ + + +...l...) and +22 + 32 + 2 + 62+2+92 + * *+ + + [To prove the first result we note that +22+ ***= 1+ I. ( ) + (+ ) I * =i+3252 +...+22 +2+... by Theorems (8) and (6) of ~ 70.] 16. Prove by a reductio ad absurdumn that (l1/n) is divergent. [If the series were convergent we should have, by the argument used in Ex. 15, +1 +i+...=(l+ ++...)+(I +1 i+...), or +++ ++...=1++ 1++... which is obviously absurd, since every term of the first series is less than the corresponding term of the second.] 152. Before proceeding further in the investigation of tests of convergence and divergence, we shall prove an important general theorem concerning series of positive terms.

Page 301

152, 153] CONVERGENCE OF SERIES AND INTEGRALS 301 Dirichlet's Theorem*. The sum of a series of positive terms is the same in whatever order the terms are taken. This theorem asserts that if we have a convergent series of positive terms, Uo + u1 + ut2 +... say, and form any other series vo + V1 + V2 +... out of the same terms, by taking them in any new order, the second series is convergent and has the same sum as the first. Of course no terms must be omitted: i.e. every uz must come somewhere among the v's, and vice versa. The proof is extremely simple. Let s be the sum of the series of u's. Then the sum of any number of terms, selected from the u's, is not greater than s. But every v is a u, and therefore the sum of any number of terms selected from the v's is not greater than s. Hence Ev,, is convergent, and its sum t is not greater than s. But we can show in exactly the same way that s < t. Thus s = t. 153. Multiplication of Series of Positive Terms. An immediate corollary from Dirichlet's Theorem is the following theorem: if Uo + u+2 +... and vo + v + v2+... are two convergent series of positive terms, and s, t are their respective sums, then the series Uo0vo (o + ) (U + (uVo + U + ( + + oV) +... is convergent and has the sum st. Arrange all the possible products of pairs umvn in the form of a doubly infinite array UOVo U v uo U2VO u3Vo... u0v V 'U1Vl 2t21 V 313 V.. 0V2 u1V2 'U2V2 3 V2... U0V3 Ut1 V3 U2 V3 U3V3... We can arrange these terms in the form of a simply infinite series in a variety of ways. Among these are the following. (1) We begin with the single term uz0v for which m + n = 0; then we take the two terms uxVo, utov for which m + n = 1; then * This theorem seems to have first been stated explicitly by Dirichlet in 1837. It was no doubt known to earlier writers, in particular to Cauchy.

Page 302

302 CONVERGENCE OF SERIES AND INTEGRALS the three terms uzVo, uxv, UoV2 for which m + n = 2; and so on. We thus obtain the series UoVo + (Uvo0 + U0ov) + (tt2 + UIv1 + U0V2) +... of the theorem. (2) We begin with the single term UoVo for which both suffixes are zero; then we take the terms uzVo, uIv1, UoV1 which involve a suffix I but no higher suffix; then the terms t2Vo, U2v1, U2v2, u1v2, uoV2 which involve a suffix 2, but no higher suffix; and so on. The sums of these groups of terms are respectively equal to WUoo, (o + 1) (Vo + v1) -oVo, (Uo + 1I + 02) (Vo + V1 + v2) - (to + ~U) (Uo + U2),.. and the sum of the first n, + 1 groups is (%to + 1 +... + Un) (Vo + V +... +. n), and tends to st as n o- oo. When the sum of the series is formed in this manner the sum of the first one, two, three,... groups comprises all the terms in the first, second, third,... rectangles indicated in the diagram on p. 301. The sum of the series formed in the second manner is st. But the two series are rearrangements of one another; and therefore, by Dirichlet's Theorem, the sum of the first series is also st. Thus the theorem is proved. Examples LXX. 1. Verify that if r< 1 1 r2+r+- r4+r6 +r3 +... =1 ++3 + 2 +r5 +7+...= 1/(1 -). 2. If either of the series uo+u1+..., vo+v1+... is divergent, so is the series Uo Vo + (ul vo + Uo v1) +t (t2 vo + tit v + z0ov2) +... 3. If the series o+ u +..., + v +..., wo + w +... (all of whose terms are positive) converge to sums r, s, t, the series Y Xk, where Xk=Unvu^,vwp, the k=o summation being extended to all sets of values of m, n, p such that m+n+p=k, converges to the sum rst. 154. Further tests for convergence and divergence. The examples which precede will suffice to show how many simple and interesting types of series of positive terms cannot be dealt with by the general tests of ~ 151. In fact, if we consider the simplest type of series, in which uz,+,/un tends to a limit as n -O cc, the tests of 151 fail completely when this limit is 1. Thus in Ex. LXIX. 6, 7 these tests failed, and we had to fall back upon a special device, which was in essence that of using

Page 303

153-155] CONVERGENCE OF SERIES AND INTEGRALS 303 the series 1/{n (n + 1)... (n +p)} as our comparison series, instead of the geometric series. The fact is that the geometric series, by comparison with which the tests of ~ 151 were obtained, is not only convergent but very rapidly convergent, far more rapidly than is necessary in order to ensure convergence. The tests derived from comparison with it are therefore naturally very crude, and far more delicate tests are often wanted. It was proved in Ex. xxx. 7 that nkrn -0 as n- -c, provided r < 1, whatever value k may have; we saw indeed in Ex. LXIX. 3 that not only does nkrn tend to zero but the series nkr" is convergent. It follows that the sequence 1, r, r2,..., el,..., where r < 1, diminishes more rapidly than the sequence 1, 2-k, 3-k..., n-k,.... This seems at first paradoxical if r is not much less than unity, and k is large. Thus of the two sequences 2 4'8 1 6 1 1_ 83 9? 2-iF7' S. I ' 4 096 '-41).*. whose general terms are (;)"f and n-12, the second seems at first sight to decrease far more rapidly. But this is far from being the case; if only we go far enough into the sequences we shall find the terms of the first sequence very much the smaller. Thus (2/3)4= 16/81 < 1/5, (2/3)t2 < (1/5)3 < (1/10)2, (2/3)1000 < (1/10)166, while 1000-2=10-36. Thus the 1000th term of the first sequence is less than the 1013~th part of the corresponding term of the second sequence, and so the series 2 (2/3)" is far more rapidly convergent than the series En-12, and even this series is very much more rapidly convergent than E n-2. 155. We shall proceed to establish two further tests for the convergence or divergence of series of positive terms, Maclaurin's (or Cauchy's) Integral Test and Cauchy's Condensation Test, which, though very far from being completely general, are sufficiently general for our needs in this chapter. In applying either of these tests we make a further assumption as to the nature of the function un, about which we have so far assumed only that it is positive. We assume that u1n decreases with n: i.e. Un+i U< n for all values of n (or at any rate all sufficiently large values). This condition is satisfied in all the most important cases. From one point of view it may be regarded as no restriction at all, so long as we are dealing with series of positive terms-for in virtue of Dirichlet's theorem above we may rearrange the terms without affecting the question of convergence or divergence: and there is nothing to prevent us rearranging the Five terms suffice to give the sum of 27-12 correctly to 7 places of decimals, whereas some 10,000,000 are needed to give an equally good approximation to ~;n-2

Page 304

304 CONVERGENCE OF SERIES AND INTEGRALS i[vm terms in descending order of magnitude, and applying our tests to the series of decreasing terms thus obtained. But before we proceed to the statement of these two tests, we shall state and prove a very simple and important theorem, Pringsheim's Theorem. This is a one-sided theorem in that it gives a sufficient test for divergence only and not for convergence, but it is essentially of a more elementary character than the two theorems mentioned above. 156. Pringsheim's Theorem. If zt is a convergent series of positive and decreasing terms, then lim ni n = 0. This is easily proved by a reductio ad absurdum. To say that nu, - 0 is the same thing as to say that, given any positive number 8, however small, we can find no so that nut, < 8 for n - no. If this is not the case there must be some such positive number 8 for which no choice of no will ensure the satisfaction of this inequality; i.e. such that there is an infinite sequence of values of n for which nu,n - 8. Let nj be the first such value of n; n2 the next such value of n which is more than twice as large as n,; n3 the next such value of n which is more than twice as large as n2; and so on. Then we have a sequence of numbers nl, n., n3,... such that n2 > 2n1, n3 > 2n,... and so n22- n > 2 n2, 3s - n1 > n3,...; and also n iUn 8, s t28,2.... But since un, decreases as n increases U0 + it1 +... + t,-1 - nu, (? 8, u,4 +... e+ 2_-1 (n2 - n,) tz > 2 }2 > 1 U, +.. + u,3-1 (n3 - n) u,3 > n3u 3 > 2 and so on; and we can bracket the terms of the series >Stn so as to obtain a new series whose terms are severally greater than those of the divergent series (8+ -8+ 8+.. and therefore `u1n is divergent. An alternative proof is as follows. Since uo0+ul+...-t+, tends to a limit s, we can choose no so that s- 8 < uo + ulz+... +u _ s for n _ no, however small be the positive number 8. Hence ''nC +1+ —n +2+ -..< 8

Page 305

155-157] CONVERGENCE OF SERIES AND INTEGRALS 305 and a fortiori un1 + 1+U + 2+... U 2n<; and still more nu2n< 6, for n no. Therefore n 2,, —0, and therefore 2nu2n —0O. And since (2n+ l)c2n + < (2n+1)u2 < {(2n + 1)/2n} 2nU2n, it is clear also that (2n+1) U2+l-~0. Hence nt — 0 as n-ao.through either odd or even values, and therefore as n -oo in any manner. Examples LXXI. 1. Use Pringsheim's theorem to show that 2 (1/n) and 2{1/(an+b)} are divergent. [Here nun-i-l or 1/a.] 2. Show that 2(1/n8) is divergent if s 1. [Compare with 2(l/n): or use Pringsheim's theorem directly.] 3. Show that Pringsheim's theorem is not true if we omit the condition that un decreases as n increases. [The series 1111 1 11 1 1+ + -+ I-+I-+-~I 22 + 32 4 52 62 + 72+9+ 102+ in which,=l1/n or l/n2, according as n is or is not a perfect square, is convergent, since it may be rearranged in the form 22 + 32 + 52 + 62 + 72 + i2 + 1o02+* + 1 + 9+..) and each of these series is convergent. But since nut=l whenever n is a perfect square, it is clearly not true that nu —, 0.] 4. The converse of Pringshein's theorem is not true, i.e. it is not true that, if u. decreases with n and lim nun =0, then 2 un is convergent. [Take the series 2(1/n) and multiply the first term by 1, the second by 2, the next two by 3, the next four by H, the next eight by, and so on. On grouping in brackets the terms of the new series thus formed we obtain i+~. I+ '(,+ ')+ -i(5- + +I + 8)+...; and this series is divergent, since its terms are greater than those of 2+.2+3 2+..+..., which is divergent. But it is easy to see that, in the series nu, -O. In fact nun = /v if 2v<n 2v+', and v — c with n.] 157. Maclaurin's (or Cauchy's) Integral Test*. If u, decreases as n increases, we can write u, = ~ (n), and can suppose that 9( (n) is the value assumed when x = n by a continuous and decreasing function b (x) of the continuous variable x. Then r2 (1) _ (x) dx) >_ 0 (2), J1 * The test was discovered by Maclaurin but forgotten, and rediscovered by Cauchy, to whom it is usually attributed. H. A. 20

Page 306

306 CONVERGENCE OF SERIES AND INTEGRALS [VIII (2) (x) dx _ (3), 2..............oooo........................ C(, - 1) (do i 0 (n), since ( (n - 1) is the greatest and ( (n) the least of the values assumed by ( (x) in the interval (n - 1, n). We have to use the sign >, and not simply the sign >, in the inequalities above, because p(v) may be equal to (v + 1) for some values of v, in which case p (x) will obviously be constant in the interval (v, v + 1). Hence by addition (1) + 2) +... + (2) (n - 1) - () )dx (2) + (3) +... + (). Let us write > (D) =f (x) dx. Then, since b (x) > 0, (I) is an increasing function of:, and so either (i) (E (~) tends to a limit I as - + oo, or (ii) I(s) - + c. In the first case (2) + (3) +... + (n)-1 and so ( (2) + (3)+... converges to a sum not exceeding 1; and therefore ( (1) + p (2) +... converges to a sum not exceeding <b(1) +l. In the second case, since, (1) +, (2) +... + (n - 1) - (n), it is evident that the series is divergent. Hence if ( (x) is a function of x which is positive and continuous for all values of x greater than unity, and steadily decreases as x increases, the series ( (1)+ (2)+... will or will not converge according as J (I) =Jf (x) dx does or does not tend to a limit I as -f + oo: and, in the first case, the sum of the series will be not greater than ( (1) + 1.

Page 307

157, 158] CONVERGENCE OF SERIES AND INTEGRALS 307 Examples LXXII. 1. This argument may be put in a geometrical form. In the figure >(n) is the area PpqQ: while (1) +(2)+...+ 4(n-1) is the sum of the exterior rectangles (bounded by a thick line) and c (2) + (3)+... + (n) is the sum of the interior rectangles (shaded). The inequalities used above follow from a glance at the figure. Y 0 1 2 3 - n FIG. 64. 2. The convergence of either of the series )(1)+0(3)+)(5) +..., < (2)+0(4)+0(6)+... is a necessary and sufficient condition for that of ( (1)+ (2) + 3(3) +.... [By using (1, 3), (3, 5),... as intervals of integration we can show as above that ( (1) + (3)+... will or will not converge according as (:)-l or ($) — + c. Or we may use the inequalities 2(k(1) > (1)+ —(2), (2)+q(3)>2q(3)>q(3)+-q(4),.... For a more general result see Ex. LXXVI. 3.] oo 1 3. Prove that 2 2 < 1 +7r. 158. The series;n-s. By far the most important application of the Integral Test is to the series 1- + 2- + 3- +... + s +..., where s is any rational number. If s _ 0 it is obvious that the series is divergent. If s > 0 Un decreases as n increases, and we can apply the test. Here (unless s = 1) -(E)=f ^ idx es-s 1 20-2

Page 308

308 CONVERGENCE OF SERIES AND INTEGRALS If s > 1, t- 0 as - + +, and (D) - 1/(s- 1)= i, say. But if s < 1, l8 - + oo with x, and so cE (d) + o. Hence we derive the result The series Yn-s is convergent if s > 1, divergent if s < 1: if s > 1 its sum is less than s/(s - 1). So far as the divergence for s< 1 is concerned, this result might (as in Ex. LXXI. 2) have been derived at once from comparison with 2(1/n), which we already know to be divergent. It is however interesting to see how the Integral Test may be applied to the series 2(1/n) (when the preceding work fails). In this case ()f, (Vdx and it is easy to see that, ($) -+ oo with 4. For, if > 2", then ()>jn d= 2 + - +...+ d; 1 x x x2 2n-1 X ' and since (putting x=-2ru) 2"+1 dX 2dt j 2r X ~ it ' we obtain, ())> n -, which shows that ($)-+ oo with $. Examples LXXIII. 1. Prove by an argument similar to that used above, and without integration, that 1 (6)= s, where s < 1, tends to + oo with ~. 2. The series en-2, s2z-3/2, n-l1110 are convergent, and their sums are not greater than 2, 3, 11 respectively. The series 2n-1/2, n-10/11 are divergent. 3. The series: ns/(n+ a), where a > 0, is convergent or divergent according as t> 1 + s or t 1 +s. [Compare with 2 ns-t.] 4. Discuss the convergence or divergence of the series 2 (al2sl + a2nS2 +... + aknS")/(bltl + b2nt2 +... + blntI), where all the letters denote positive numbers and the s's and t's are arranged in descending order of magnitude. 5. If (n)-l->l the series 2n- (n) is convergent. If j (n)-l <l it is divergent. 159. Cauchy's Condensation Test. The second of the two tests mentioned in ~ 155 is as follows: if ut = g (n) is a decreasing function of n, the series E+ (n) converges or diverges according as Z21Qb (2n) converges or diverges.

Page 309

158-160] CONVERGENCE OF SERIES AND INTEGRALS 309 We can prove this by an argument which we have already used (Ch. IV, ~ 70) in the special case of the series;(1/n). In the first place + (3) + 4 (4) 20 (4), (5) + (6)+... + (8) _ 4~ (8),.....,...................................... ( (2T1 + 1) + ( (2n + 2) +... + m (21+1) _ 2no (2?X+1), inequalities which make it obvious that if 2 (2n+1) or, what is the same thing, 2n +l(2'1+l) or 2n b(2") diverges so does $f(n). On the other hand < (2) + S (3) c 20 (2), 0 (4) + 4 (5) +... + (7) _- 40 (4), and so on. And from this set of inequalities it follows that if E21b (2'1) converges, so does Eb (n). Thus the theorem is established. For our present purposes the field of application of this test is practically the same as that of the Integral Test. It enables us to discuss the series En-s with equal ease. For:n-S will converge or diverge according as;2n 2-~S converges or diverges: i.e. according as s > 1 or s 1. Examples LXXIV. 1. Show that if a is any positive integer greater than 1, the series 5;+(n) converges or diverges with Eanb(a"). [Use the same arguments as above, taking groups of a, a2, a3,... terms.] 2. If ~ b( n) converges it is obvious that lim 2n (21) =0. Hence deduce Pringsheim's Theorem. 160. Infinite Integrals. The Integral Test of ~ 157 shows that if 0 (x) is a positive and decreasing function of x, the series S (n) is convergent or divergent according as the integral function (< (x) does or does not tend to a limit as x - +.oo Let us suppose that it does tend to a limit, and that lim rn (t) dt = 1. x -.+oo 1 Then we shall say that the integral J ~( (t) dt is convergent, and has the value I; and we shall call the integral an infinite integral.

Page 310

310 CONVERGENCE OF SERIES AND INTEGRALS [VIII So far we have supposed b (t) positive and decreasing. But it is natural to extend our definition to other cases. Nor is there any special point in supposing the lower limit to be unity. We are accordingly led to formulate the following definition: If 0 (t) is a function of t which satisfies the conditions of ~~ 137 et seq. throughout the interval (a, x), where x is any number greater than or equal to a, and if lim I (t)dt= l, we shall say that the infinite integral i ( t) dt........................... (1) a is convergent and has the value 1. The ordinary integral between limits a and A, as defined in Ch. VII, we shall sometimes in contrast call a finite integral. On the other hand when j (t) dt - + oo Ja we shall say that the integral diverges to + oo. Similarly we define divergence to - oc. Finally, when none of these alternatives occur, we shall say that as x -+ oo the integral oscillates (finitely or infinitely). The following remarks suggest themselves. (i) If we write () (t) dt = (x), a the definitions of convergence, divergence, and oscillation of the integral are precisely the same as those already formulated (Ch. V, ~ 77) for the behaviour of ~(x) as cx-. +co. (ii) In the special case in which q (t) is always positive (or zero) it is clear that (c(x) is an increasing function of x. Hence the only alternatives are convergence and divergence to + o. (iii) The integral (1) of course depends on a, but is quite independent of t, and is in no way altered by the substitution of any other letter for t. In this respect the infinite integral only resembles the finite integral (cf. Ch. VII, ~ 142). (iv) Of course the reader will not be puzzled by the use of the term infinite integral to denote something whose value is a definite number such as 2 or ~rr. The word infinite is used to distinguish the infinite integral, in which the variable of integration ranges through all values greater than a

Page 311

160, 161] CONVERGENCE OF SERIES AND INTEGRALS 311 or, as it is often expressed, whose 'upper limit is infinite,' from the finite integral, just as it is used to distinguish an infinite series from a finite series: no one supposes that an infinite series is necessarily divergent. (v) The integral q (t) dt was defined in Ch. VII, ~ 139, as a simple limit, a the limit of a finite sum Ef(xv) (xv+l-xv). The infinite integral is therefore the limit of a limit, or what is known as a repeated limit. The notion of the infinite integral is in fact an essentially more complex notion than that of the finite integral, of which it is a development. (vi) The Integral Test of ~ 157 may now be stated in the form: if p (x) is positive and steadily decreases as x increases, the infinite series 2 q (n) and the infinite integral J i (x) dx converge or diverge together. 161. The case in which G (x) is positive. It is natural to consider what are the general theorems concerning the convergence or divergence of the infinite integral (1) and analogous to theorems A-D (~ 150 above). That A is true of integrals as well as of series we have already seen (~ 160 (ii)). Corresponding to B we obviously have the theorem that the necessary and suffcient condition for the convergence of the integral (1) is that it should be possible to find a constant K such that (x) dx < K for all values of x greater than a. Similarly, corresponding to C, we have the theorem: if 00 * J (q (x) dx is convergent, and q/ (x) _ Kcb (x) for all values of x greater than a, then f (x) dx is convergent, and *j (x) dx =- K (x) dx. a a We leave it to the reader to formulate the corresponding test for divergence. We may observe that D'Alembert's test (~ 151, 3), depending as it does on the notion of successive terms, has no analogue for integrals: and the analogue of Cauchy's test (~ 151, 1) is not of much importance, and in any case could only be formulated when we have investigated in greater detail the theory of the function

Page 312

312 CONVERGENCE OF SERIES AND INTEGRALS [VIII ( (x) = r, as we shall do in Ch. IX. And the most important particular tests are the following, which are derived at once by comparison with the. integral f dx ( >O) (a > 0) a f (whose convergence or divergence we have investigated in ~ 158): viz. If, for x >_- a, < (x) < Kx-8, where s > 1, then f q (x) dx is convergent: if on the other hand, for x _ a, b (x) > Kx-s, where s = 1, the integral is divergent. In particular, if lim r b (x) = l, where 1> 0, the integral is convergent or divergent according as s>1 or s5 1. There is one exceedingly fundamental and important property of a convergent infinite series, the obvious analogue of which is not necessarily true of a convergent infinite integral. If 2q0(n) is convergent then q((n) —O; but it is not always true, even when f (x) is always positive, that if (x) dx is convergent then ( (x) -0. Consider for example the function (x) whose graph is indicated by the thick line in the figure. Here the height of the peaks corresponding to the Yi / - - 0 1 2 3 X FIG. 65. points x=l, 2, 3,... is in each case unity, and the breadth of the peak corresponding to x=n is 2/(n+1)2. The area of the peak is 1/(7n+1)2, and it is evident that, for any value of $, (x)dx< ( n+1)2 and so ( S(x) dx is convergent; but it is not true that < (x)-O0. Jo Examples LXXV. 1. If (x)dx is convergent then (|(x)dx is a A convergent for any value of A greater than a, and f|s (X) xdx=f (x) &d+ f/ (x) dx. J a J ~aJA

Page 313

161] CONVERGENCE OF SERIES AND INTEGRALS 313 f00 ax^+Xr1 ~...+X 2. The integral AxS+Bs-l+, +L dx, where a and A are positive and a is greater than the greatest root of the denominator, is convergent if s>r +l and otherwise divergent. 3. Which of the integrals dC dr dx d[ x ddx xdx Ja / ]' fa a/x' fJC2 2 a C2+2 C2+X2 a c2+X' aa+2x2 +x4 are convergent? 4. The integrals fcoC, sin, cos (ix + /c ) dx, a a ja oscillate finitely as ~$- + -. 5. The integrals ) xcosxdx, f 2sinx dx, f xncos (ax+ 3) dcx ja aj a (where n is any positive integer) oscillate infinitely as c-c+ o. ra 6. Integrals to - oo. If f (x)dx tends to a limit I as $ — oo, we say a f q (x) dx is convergent and equal to 1. Such integrals possess properties in every respect analogous to those of the integrals discussed in the preceding sections: the reader will find no difficulty in formulating them. 7. Integrals from - cc to + cc. If the integrals ra r00 f (x)dx, J (x)dx, are both convergent, and have the values k, I respectively, we say that jf (x) dx 00 is convergent and has the value k +1. 8. Prove that 0 dx r r dx = r dx__ J1 7 J +1- I+2 jo 1+X2- 2 i -l+-=27r. 9. Prove generally.that (X2) dx= 2 f (X2)dx,,-coo provided the integral j (x2) dx is convergent. 10. Prove that if xq(x2)dx is convergent, then _ x ((2) dx=O. J o J -Go

Page 314

314 CONVERGENCE OF SERIES AND INTEGRALS [VIII 11. Analogue of Pringsheim's Theorem. If 0 (x) is positive and steadily decreases, and (x) dx is convergent, ent x ] (x) -O. Prove this (a) by means of Pringsheim's Theorem and the Integral Test, (b) directly, by arguments analogous to those of ~ 156. 12. If a=<xo c<<x 2 <... and x — + -o, and u.= + (x) dx, then the XjXn convergence of q5 (x)dx involves that of 2EU. If q (x) is always positive the converse statement is also true. [That the converse is not true in general is shown by the example in which p (x)=cosx, xn=nzr.] 162. Application to infinite integrals of the rules for substitution and integration by parts. The rules for the transformation of a definite integral which were discussed in ~ 145 may be extended so as to apply to infinite integrals. (1) Transformation by substitution. Suppose that f (x) dx......................... (1) a is convergent. Further suppose that, for any value of ~ greater than a, we have, as in ~ 145, (x) dx = {f(t)}f'(t)t............(2), a b where a =f(b), =f(r). Finally suppose that the functional relation x=f(t) is such that t-+ oo with x. Then, making n and so T tend to + oo in (2) we see that the integral 0 {f(t) f' (t) dt..................... (3) b is convergent and has the same value as (1). On the other hand it may happen that as -o-+ 00, T - or T -- oo. In these cases we have the equations, (x)dx = f I{f(t)} f' (t) dt, J a J-b b (x)dx=J c {f (t)}f' (t) dt= - f(t)}f'(t)dt. a b -o In the first of these two cases we observe that the method of substitution transforms an infinite into a finite integral. There are of course corresponding results for the integrals ra d, < 0 (x) dxz, - 0 (x) dx, -oo _ -o

Page 315

162] CONVERGENCE OF SERIES AND INTEGRALS 315 which it is not worth while to set out in detail: the reader will be able to formulate them for himself. Examples LXXVI. 1. Show, by means of the substitution x= ta, that if s> and a>O then coo 0) y x-sdx=a ta (-s)-1dt; and verify the result by actually calculating the value of each side. 2. If fc (x)dx is convergent, it is equal to one or other of the integrals ja a (at+ 3)dt, - af Pi (at + 3) dt, (a-3) /a -o according as a is positive or negative. 3. If (x) is positive for all positive values of x, and a and 3 are any positive numbers, the convergence of the series x; (n) involves and is involved by that of the series 2) (ann+ ). [It follows at once, on making the substitution x=at+3t, that the integrals ro rco c] (x) dx, f 4)(at + 3) dt JO ~ ~ J{(a —) la converge or diverge together. Now use the Integral Test.] 4. Show that f(1 d_)- _ ==7r. [Put x-t2.] 5. Show, by means of the substitution x=t/(l - t), that if I and m are both positive then XI-1 rJ1 Jo (1 +) +mdx=J t-l(l -t)m-ldt A 6. Show by means of the substitution x=pt/(p+ 1 - t), that if a, b, and p are positive then xa-1 (l-)b-1 (X +_p)ab 1 d _t J tda- (1 - t)b-1 dt. Jo ' ' ^+p2c3 b (1-^"pbp J +( 7. If q (x) —h as x — + o and 4 (x) —.k asx- - oo, then j { (x -a) - (x- b)} dx =-a - ab) ( - k). -00 [For f {4(c-a)-(x -b)}do= J( (.* -a)dx - (x - b)dx -j= f (t)dt- f )(t)dt= f ) (t)dt- (t) dt. J a- b -'- b-a The first of these two integrals may be expressed in the form (a- b)kk J p dt, J,_ -'ap

Page 316

316 CONVERGENCE OF SERIES AND INTEGRALS [vITI where p —0 as +'- +oo, and the modulus of the last integral is less than or equal to I a- b I K, where K is the greatest value of p throughout the interval (-' - a, - '- b). Hence f (t) dt -(a - b) k. Similarly for the second integral.] (2) Integration by parts. The formula for integration by parts (~ 145) is f (x) ' (x) dx =f () ( (5) -f(a) b (a)- f ' (x) < (x) dx. a a Suppose now that: - + so. Then if any two of the three terms in the above equation which involve e tend to limits, so does the third, and we obtain the result f (x) '(x))dx =lim f()() () -f (a) O(a)- ' (x) b (x)dx. a >+o a There are of course similar results for integrals to - c, or from - co to + co. Examples LXXVII. 1. Show that 0 (i+ dx= (1 ) d=00 x 2 * 0 x 2. (i + 4-)-:: oJ (1 +)-) dx-=. co xm dx 3. If mn and n are positive integers and i,. = ( n then J o (lI+ )?n+ '' Inn = {mn/(n + n - 1)) Imln. Hence prove that Im, = m! (n - 2)!/(m + - 1)!. r 'I X2m + dx 4. Show similarly that if 1n,=J (f1+ 2)2m-+ then In,n= Int/(l+2 -1)} I-l, n, 21m, n = =21! (n-2)!/(m+r-1)!. Verify the result by applying the substitution x= t2 to the result of Ex. 3. 163. Other types of infinite integrals. It was assumed, in the definition of the ordinary or finite integral given in Ch. VII, that (1) the range of integration was finite, (2) the subject of integration was continuous. It is possible so to modify this definition that it applies to many cases in which these conditions are not satisfied. But this can only be done by introducing considerations of a somewhat more advanced character than those used in Ch. VII. And as a matter of fact, when condition (2) is satisfied, but not condition (1), as was the case with the infinite integrals of the preceding sections, we found it best not to attempt to give a direct definition

Page 317

162, 163] CONVERGENCE OF SERIES AND INTEGRALS 317 on the lines of Ch. VII, but to adhere to that definition and, so to say, superimpose another on it; and (as was pointed out in ~ 160, (v)) to define the infinite integral not as a simple limit but as a repeated limit. We shall now suppose that it is the second of the conditions (1), (2), that is not satisfied. It is natural to try to obtain definitions which enable us to extend the notion of an integral to some such cases. There is only one such case which'we shall consider here. We shall suppose that b (x) is continuous throughout the range of integration (a, A) except for a finite number of values of x, x = 1f 27,,..., but that 0 (x) + oo (or - oo ) as x -.. from either side. It is evident that we need only consider the case in which (a, A) contains one such point |. If there is more than one such point we can divide up (a, A) into a finite number of sub-intervals each of which contains only one; and if the value of the integral over each of these sub-intervals has been defined we naturally define the value of the integral over the whole interval as being the sum of the integrals over each sub-interval. Further, we can suppose that the one point: in (a, A) comes at one or other of the limits a, A. For if it comes between a and A we naturally rA define f c: (x) dx as a | () dx + (x) dx, assuming these integrals to have been satisfactorily defined. We shall suppose, then, that ~ = a; it is evident that the definitions to which we are led will apply, with trifling changes, to the case in which = A. Let us then suppose b (x) to be continuous throughout (a, A) except for x = a, while 5b (x) - + oo as x -- a through values greater than a. A typical example of such a function is given by (x)= ( - a)-S ( > 0), or, in particular, if a = 0, by b (x)= x-~ (s > 0). Let us therefore consider how we can define TA d (s0)). (,o0)........................(I).,,O~~~~ C' ' ''''*'''*"'

Page 318

318 CONVERGENCE OF SERIES AND INTEGRALS [VIII The integral f ys-2dy is convergent if s<l (~ 158) and means 1/A ra] lim yS-2dy. But if we make the substitution y= /x, we obtain -*-q +oJ- I/A I yS-8 dy = I xdx. J 1A 1/Al A — sA 'Thus lim a sdx, or, what is the same thing, lim j x- dx exists + -J 1/-? e — n + 0Je provided s< 1; and it is natural to define the value of the integral (1) as being equal to this limit. Similar considerations lead us to define A(x -a) -dx by the equation Ja fA FA (x-a)-scdx= lim I (x- a)s-dx. J a e-S+OJ a+e We are thus led to the following general definition: if the integral 4 (x) dx J a+e tends to a limit I as -e + 0, we shall say that the integral HA Ja (x) dx a is convergent, and has the value 1. rA Similarly, of course, we should define f (b(x)dx, if b(x) — + oo a.as x tends to the upper limit A, as being rA-e lim q (x) dx: e — +O a and then, as explained above, we can extend our definitions to cover the case in which the interval (a, A) contains any finite number of points near which ) (x) - + oo or - oo. An integral in which the subject of integration tends to + oc or - ooas x tends to some value or values included in the range,of integration will be called an infinite integral of the second kind: the first kind of infinite integrals being the class discussed in ~ 160 et seq. Examples LXXVIII. 1. If (x) is continuous except for x=a, while, as xt-a, (x)-, + 0, the necessary and sufficient condition that f q(x)dx a should be convergent is that we can find a constant K such that (x) dx < K J a+e -for all values of (, however small (compare ~ 161).

Page 319

163] CONVERGENCE OF SERIES AND INTEGRALS 319 It is clear that we can choose a number A' between a and A, such that b(x) is positive in (a, A'). If q (x) is positive in the whole interval (a, A) we can of course identify A' and A. Then rA fA' fA a - (x) dx - (x) d+ A d (x) dx. The first integral on the right-hand side of the above equation increases as e decreases, and therefore tends to a limit or to + o; and the truth of the result stated becomes evident. TA If the condition is not satisfied, I q>(x)cdx —+ oo. Ve shall then say a- e that the integral j ) (x) dx diverges to + co. It is clear that, if ) (x)- + 0o a as x -a+0, convergence and divergence to + o are the only alternatives for the integral. Similarly we may discuss the case in which 4 (x) - -o. 2. Prove directly from the definition that (x - a) -Sx = ( A-a) - /( 1 -s) a if s< 1, while the integral is divergent if s _ 1. 3. If (x)- + ooas x —a+O and >(x) < K (x-a)-s, where s<l, then fc (e) dx is convergent: but if 4( (x) > K (x- a)-, where s 1, the integral a is divergent. [This is merely a particular case of a general comparison theorem analogous to that stated in ~ 161.] 4. Are the integrals [A dX [A clx [A dx [A dx ja^/(x- a)' J(A -x)' J(a )^(x-a)' Ja (A-x)-(A-x)' a {-a) (A-x) a (x-a) /(A-x) J (A -x) /x(C-a)' fA dx by dx [A dx Bf dx a /(X2- a2)' ja,(A3-X 3) ' X -' a2" A3-x3 convergent or divergent? I Tdx [a+1 dx 5. The integrals | J:- 1 /( — a are convergent, and the value of J 5 a1 - lxJ - a each is zero. 6. The integral | ( i- ) is convergent. [The subject of integration j o (sin x) tends to +oo as x tends to either limit: like 1/,lx as x-0., and like 1/^/(7r-x) as x ---r.] 7. The integral (. dx) is convergent if and only if s < 1. j o (sin x)8 in is conerget if t <s1. 8. The integral 2 dx is convergent if t<s+l. 0 o (smin x)

Page 320

320 CONVERGENCE OF SERIES AND INTEGRALS [VIII 9. The integral f (cos x)1 (sin x) zdx is convergent if and only if 1 > - 1, Jo m> -1. 10. Such an integral as | -1I does not fall directly under any of our previous definitions, if s < 1. For the range of integration is infinite, and the subject of integration tends to +oo as x- +0. It is natural to define it as equal to the sum Jo i + j i+ ' provided these two integrals are both convergent. The first integral is a convergent infinite integral of the second kind if 0 < s < 1. The second is a convergent infinite integral of the first kind if s < 1. It should be noted that if s > 1 the first integral is an ordinary finite integral: but then the second is divergent. Thus the integral from 0 to co is convergent if and only if 0 < s < 1. X s-1 11. Prove that J -+xt dx is convergent if and only if 0 < s < t. r00 Xs-l_ Xt12. The integral -f dx is convergent if and only if 0<s< 1, 0 1 - x 0< t < 1. [It should be noticed that the subject of integration is undefined for x=l; but (x-l- xt-l)/(l - x)->-t- s as x- 1 from either side; so that if we agree to regard the subject of integration as being equal to t-s for x= 1, it becomes a continuous function of x, and there is no difficulty as regards its behaviour for x= 1. It often happens that the subject of integration has a discontinuity which is due simply to a failure in its definition at a particular point in the range of integration, and can be removed by attaching a particular value to it at that point. In this case it is usual to suppose the definition of the subject of integration completed in this way. Thus the integrals r sin mx sin nmx -dx, I -. dxr J x o sin x are ordinary finite integrals, the subjects of integration being regarded as having for x-= the value m.] 13. Substitution and integration by parts. The formulae for transformation by substitution and integration by parts may of course be extended to infinite integrals of the second as well as of the first kind. The reader should formulate the general theorems for himself, on the lines of ~ 162. 14. If s > 0, t > 1, prove by integration by parts that Xs-1(1-x)t-ldx=t- 1 f (1 -x)t-2 d. 15. If s>O, xsl-dx fJ t dt [Put x=l/t.] / I+ 1 d'j +t- s

Page 321

164] CONVERGENCE OF SERIES AND INTEGRALS 321 16. If 0<s<l, then Il+x dx=- ---.t t-Idt? J -~L'o 1+t o+t 'dx dx 17. (1 +xx ' +x) ='rr. [Put x=t2.] 18. Prove that ( = rr and x — r (a+ b) Ja s{(x - a(b-x)} J a x1(a - a) (b-x)} 2 (i) by means of the substitution x=a +(b - a) t2, (ii) by means of the substitution (b - x)/(x - a) = t, and (iii) by means of the substitution x = a cos2 t + b sin2 t. 19. Ifs>-1, then f (sin 0)8dO = -- ~- l =x- (1-) ) d8. 0o' / No\/(1-x2) o ~/(l1-x) o \x' 20. Establish the formulae ff /(xd) dx- f (sin ) do, {(x-a)(b-x)} 2 f(acs2 +bsin2 )dO, fa { /(X)} dx= 4aff (tan 0) sin 0 cos O d. 164. Some care has occasionally to be exercised in applying the rule for transformation by substitution. The following example affords a good illustration of this. Let J= (x2- 6x+13) dx. JI By direct integration we find that J= 48. Now let us apply the substitution y= -2_ 6x+13, x=3+V/(y-4). Since y= 8 when = 1 and y = 20 when x = 7, we appear to be led to the result /* C20 r/ /20 ydy V \32 120 J= I2dy = ~ 1 20 y d(y - 4)3/2+ 4 (y - = 4 + 580 J dy J 8 4J(y -4) 3 \ j which is certainly untrue whichever sign we choose. The explanation is to be found in a closer consideration of the relation between x and y. The function x2 - 6x +13 has a minimum for x=3, when y=4. As x increases from 1 to 3, y decreases from 8 to 4, and dx/dy is negative, so that dx 1 dy 2 /(y - 4)' As x increases from 3 to 7, y increases from 4 to 20, and the other sign must be chosen. Thus r 7 4 r 2 J=fydx f{2% 4)} + c' J1 8 2 -(y_ 4) 4ciy + 2 J/i _ 4) dy =[I(y- 4)3/2+ 4 (y - 4)1/2 + - (y- 4)3/2 + 4 (y - 4)/2 = 14 =48. H. A. 21

Page 322

322 CONVERGENCE OF SERIES AND INTEGRALS [VIII Similarly, if we wish to transform the integral dxc = r by the substitution x=arc sin y, we must observe that dx/dy= 1/,/(1 - y2) or - 1/V(1 _y2) according as 0 x< X 7r or r7r< x 7r. Example. Verify the results of transforming the integrals (4x2 - x + L-) dx, Cos2 dX 'by the substitutions 4x2 - x + j- =y/, x = arc sin y respectively. 165. Series of positive and negative terms. Our definitions of the sum of an infinite series, and the value of an infinite integral, whether of the first or the second kind, apply to series of terms or integrals of functions whose values may be either positive or negative. But the special tests for convergence or divergence which we have established in this chapter, and the examples by which we have illustrated them, have had reference almost entirely to the case in which all these values are positive. Of course the case in which they are all negative is not essentially different, as it can be reduced to the former by merely changing un into - un or 0 (x) into - ( (x). In the case of a series it has always been explicitly or tacitly assumed that any conditions imposed upon u, may be violated for a finite number of terms: all that is necessary is that such a condition (e.g. that all the terms are positive) should be satisfied from some definite term onwards. Similarly in the case of an infinite integral the conditions have been supposed to be satisfied for all values of x greater than some definite value or for all values of x within some definite interval (a, a + 8) which includes the value a near which the subject of integration tends to + o. Thus our tests apply to such a series as E (n2 - 10)/n4, since n2 - 10 > 0 if n - 4, and to such integrals as 00 3x - 7dx, ] (x + 1)3 Jo d since 3x-7> 0 ifx >, and 1-2x > 0 if 0 <x< -. But when the changes of sign of u, persist throughout the series, i.e. when the number of both positive and negative terms is infinite (as in the series 1- + - ~ +...)-or when (x) continually changes sign as x - + o, as in the integral *f sin dx 1 xS

Page 323

164-166] CONVERGENCE OF SERIES AND INTEGRALS 323 or as x — a, where a is a point of discontinuity of + (x), as in the integral [A. / 1 dx a san x-a ' -the problem of discussing convergence or divergence becomes more difficult. For now we have to consider the possibility of oscillation as well as of convergence or divergence. We shall not, in this volume, have to consider the more general problem for integrals. But we shall, in the ensuing chapters, have to consider certain simple examples of series containing an infinite number of both positive and negative terms. 166. Absolutely Convergent Series. Let us then consider a series zun in which any term may be either positive or negative. Let | n | = n, so that an = un if Un is positive and an =- Un if Un is negative. Further let vn = un or 0, according as un is positive or negative, and wn = - Un or 0, according as Un is negative or positive; or, what is the same thing, let Vn or wn be equal to an according as Un is positive or negative, the other being in either case equal to zero. Then it is evident that Vn and Wn are always positive, and Un = Vn - Wn, an = Vn + Wn. If, for example, our series is 1 -(1/22) + (1/32) -..., n= (- l)/(n+ 1)2 and an=l/(n+1)2, while v~=l/(n+l)2 or 0 according as n is even or odd, and w, =/(n + 1)2 or 0 according as n is odd or even. We can now distinguish two cases. A. Suppose that the series Zao is convergent. This is the case, for instance, in the example above, where an is 1 + (1/22) + (1/32) +.... Then both Evn and oWn are convergent: for (Ex. xxxII. 18) any series selected from the terms of a convergent series of positive terms is convergent. And hence (~ 70) zUn or Z (vn- wn) is convergent and equal to 2Ev - Wn. We are thus led to formulate the following definition. DEFINITION. When zan or l n I is convergent the series Eun is said to be absolutely convergent. And what we have proved above amounts to this: if zUn is absolutely convergent it is convergent: so are the series formed 21-2

Page 324

324 CONVERGENCE OF SERIES AND INTEGRALS [VIII by its positive and negative terms taken separately: and the szum of the series is equal to the sum of the positive terms plus the sum of the negative terms. The reader should carefully guard himself against supposing that the statement 'an absolutely convergent series is convergent' is a mere tautology. When we say that 2 u, is 'absolutely convergent' we do not directly assert the convergence of 2 un' we assert the convergence of another series 21 un, and it is by no means evident a priori that convergence of 2lJ1 and oscillation of 2Yt, might not go together. 167. Extension of Dirichlet's Theorem to absolutely convergent series. Dirichlet's Theorem (~ 152) shows that the terms of a series of positive terms may be rearranged in any way without affecting its sum. It is now easy to see that any absolutely convergent series has the same property. For let 2ui, be so rearranged as to become Und', and let an', v,', w, be formed from un' as an, Vn, wn were formed from un. Then Can' is convergent, as it is a rearrangement of 2an, and so are v wn', 2wu/, which are rearrangements of 2vn, Ewn. Also, by Dirichlet's Theorem, v5'= 2vn and wWn'= Ewn, and so 2n = IVn' - Wn' = 2vn - 2Wn = 2^. Examples LXXIX. 1. Verify that if 0 < r < then 1 2 -r3+. =1 - r 1+r2- r +r+G _- r3 +8 + r10 r5 +... =1 +r2+ -4 -... -r -r3 -.... 2. If 2 ac is a convergent series of positive terms, and I b] g fKa,, then 2b, is absolutely convergent. 3. If a,, is a convergent series of positive terms, the series an"n is absolutely convergent for - 1 < r < 1. 4. If 2an is a convergent series of positive terms, the series 2 a, cosnO, 2 a sin nO are absolutely convergent for all values of 0. [Examples are afforded by the series 2 r' cos nO, r sinnO of Ch. IV, ~ 76.] 5. Any series selected from the terms of an absolutely convergent series is absolutely convergent. [For the series of the moduli of its terms is a selection from the series of the moduli of the terms of the original series.] 6. Prove that if 1 lul is convergent then I 2un|l 2< 2unl and that the only case to which the sign of equality can apply is that in which every term is positive. 168. Conditionally convergent series. B. We have now to consider the second case indicated above, viz. that in which the series of moduli Ca, diverges to + oo (this being of course the only alternative to its convergence, as an is positive).

Page 325

166-169] CONVERGENCE OF SERIES AND INTEGRALS 325 DEFINITION. If Un is oonvergent, but l ul divergent, the original series is said to be conditionally convergent. In the first place we note that, if 2u,, is conditionally convergent, the series 2V, ZWn of ~ 166 must both diverge to + oo. For they obviously cannot both converge, as this would involve the convergence of E (vn + Wn) or 2an. But if one of them, say 2wn were convergent, and Zvn divergent, then since N VN N 2nU =:-Vn- (Wn..................... (1), o 0 0 the left-hand side would tend to + oo with N, which is contrary to the hypothesis that 2un is convergent. Hence 2v,, Ew, are both divergent. It is clear from equation (1) above that the sum of a conditionally convergent series is the limit of the difference of two functions each of which tends to + so with n. It is obvious too that 2u, no longer possesses the property of convergent series of positive terms (Ex. XXXII. 18) and all absolutely convergent series (Ex.LXXIX. 5) that any selection from the terms itself forms a convergent series. And it seems more than likely that the property prescribed by Dirichlet's Theorem will not be possessed by conditionally convergent series; at any rate the proof of ~ 167 fails completely, as it depended essentially on the convergence of Ev nand zw separately. We shall see in a moment that this conjecture is well founded, and that the theorem is not true for series such as we are now considering. 169. Tests of convergence for conditionally convergent series. It is not to be expected that we should be able to find tests for conditional convergence as simple and general as those of ~ 150 et seq. It is naturally a much more difficult matter to formulate tests of convergence for series whose convergence, as is shown by equation (1) above, depends essentially on the cancelling of the positive by the negative terms. In the first instance there are no comparison tests for convergence of conditionally convergent series. For suppose we wished to infer the convergence of ZVn from that of mu,. We have to compare Vo + VI +... + Vn, Uo + uZ +... + 'un.

Page 326

326 CONVERGENCE OF SERIES AND INTEGRALS If every u and every v were positive and every v less than the corresponding t, we could at once infer that Vo + V1 +... + Vn < ito +... + tUn, and so the convergence of Vn-. If the u's only were positive and every v numerically less than the corresponding it, we could infer that o I + I Vi | +... + vn < U +... + n, and so the absolute convergence of Evn. But in the general case, when the t's and v's are both unrestricted as to sign, all that we can infer is that I o i + I V1 I + --- + I a < Uo I... -+ | Un This would enable us to infer the absolute convergence of Eva from the absolute convergence of zua,: but if Nun is only conditionally convergent we can draw no inference at all. Example. We shall see shortly that the series 1-2+ — +... is convergent. But the series - + ++~+... is divergent, although each of its terms is numerically less than the corresponding term of the former series. It is therefore only natural that such tests as we can obtain should be of a much more special character than those given in the early part of this chapter. 170. Alternating Series. The simplest and most common conditionally convergent series are what is known as alternating series, series whose terms are alternately positive and negative. The convergence of the most important series of this type is established by the following theorem. If +p (n) is a positive function of n which tends steadily to zero as n - oo, the series o (0) - (1)+ (2)-... is convergent, and its sumn lies between ) (0) and ( (0) - (1). Let us write o, 01,... for b (0), b (1);... and let Sn = 0o - 01 + 02 -... + (- 1) f n. Then 82n+1 - s2n-1 = 2n - b2n+l- 0, s2n - S2n-2 = - (n-1- ~2n) _ 0. Hence so, s2, 4,..., 2n,... is a decreasing sequence, and therefore tends to a limit or to - o, and s1, 83, S5,..., 82n+i,... is an increasing sequence, and therefore tends to a limit or to + oo. But

Page 327

169, 170] CONVERGENCE OF SERIES AND INTEGRALS 327 since lim (s2+~_ - s2) = lim ( - l)2n+ 1 2n+i = 0, it is obvious that both sequences must tend to limits, and that the two limits must be the same. That is to say, the sequence So, s~,..., s,... tends to a limit. Since so = 00,, s = b00- b, it is clear that this limit lies between 00 and P0o- 0i. Examples LXXX. 1. The series -1+-+ +..., 1-2+, -1+ -2 + 3 -4 +. -+2 + JV3 - 4 2(-1)f/(n+ a), 2(-l)//(n+a), 2(- l)f/(4n+Va), (- i)/(n+Va)2, where a > 0, are conditionally convergent. 2. The series 2( - )n/(+a)8 (a>0) is absolutely convergent if s>1, conditionally convergent if 0 < sl 1, oscillatory if s_ 0. 3. The sum of the series of ~ 170 lies between s, and s + for all values of n; and the error committed by taking the sum of the first n terms instead of the sum of the whole series is numerically not greater than the modulus of the (n+ l)th term. 4. Consider the series 2(- 1)"/{/n+(- 1)n}, which we suppose to begin with the term for which n=2, to avoid any difficulty as to the definitions of the first few terms. This series may be written in the form rf (?_ (-l)n (-_)y LI [V (1 - i)7 ~+ (/ J or i } or {(V n( + (1)nn}-i The series 2 (- l)n/^Vn is convergent; but 2l/{n+(-l1)nVn} is divergent, as all its terms are positive, and lim n/{n+(- 1) Vnj}=. Hence the original series is divergent, although it is of the form q (2)- ) (3) + c5 (4)-... where 4q(n)-.0. This example shows that the condition that b(n) should tend steadily to zero is essential to the theorem. The reader will easily verify that V(2n+ 1)-1 < V(2n)+ 1, so that this condition is not satisfied. 5. If the conditions of ~ 170 are satisfied except that ) (n) tends steadily to a positive limit I, then the series 2 ( - 1)" 4) (n) oscillates finitely. 6. Alteration of the sum of a conditionally convergent series by rearrangement of the terms. Let s be the sum of the series 1 - + - 4 +..., and s2 the sum of its first 2n terms, so that lim s2=s. Now consider the series 1+~ -~+ + - i- +..............................(1) in which two positive terms are followed by one negative term. And let t3, denote the sum of the first 3n terms. Then t3n= 1 I I I I 1 =1+3+...4n -1 2 4 "'...2n I 1 1 =82n + 3 + ' + 4 - + 2n $- 1 2n 1-3 4n- 1

Page 328

328 CONVERGENCE OF SERIES AND INTEGRALS [VIII Now lim[2 1 1-2 2 +2' 3 *+4n- - 4 = 0 since the sum of the terms inside the bracket is clearly less than n/(2n+l)(2n+2); and lin ( 1 1 1=lim E l IX dxa1 ll ^2n+2 2n+4 4n n r I+ (In) I2j by ~ 138. Hence lim t3,n=s + -, and it follows that the sum of the series 1 x (1) is not s, but the right-hand side of the last equation. Later on (~ 195) we shall give the actual values of the sums of the two series. It can indeed be proved that a conditionally convergent series can always be so rearranged as to converge to any sumo whatever, or to diverge to + co or to - o. For a proof we may refer to Bromwich's Infinite Series, p. 68. 1 1 1 1 1 7. The series 1+ I- - I+ I- + - -4+... diverges to +oo. [Here 13 12 1J5 V7 -/4 1 1 1 n t3n = 82, n+ + +.. + > 52n~ V=(2n + 1) /(2n + 3) +* +(4n - 1) > /(4n - 1) 1 I where s2= 1- +...-, --- which tends to a limit as n- -oo.] /z2 iJ/2n' 171. Abel's and Dirichlet's Tests of Convergence. A more general test, which includes the test of ~ 170 as a particular test case, is the following. Dirichlet's Test. If jn satisfies the same conditions as in ~ 170, and 2 an is a series which converges or oscillates finitely, then the series ao0 al + al 1 + 2 2 +... is convergent. The reader will easily verify the identity a0 00 + al )1 +...+ Zn an-,=so(o 0- 0() +S1 ()1 -0(2) +.-. + S,_ ((_n-1- )n) +"S,)n where s,=ao+ al+...+ an. Now the series (@Po —)l)+(ql —P2)+... is convergent, since the sum to n terms is (o0-qn and limq =O; and all its terms are positive. Also since as, if not actually convergent, at any rate oscillates finitely, we can determine a constant K so that {sv < K for all values of v. Hence the series 2^(V-(v- (+1) is absolutely convergent, and so so (/o 0- (1) + s1 ()1 - -2) + * * * + Sn- 1 (on- 1 -- qn) tends to a limit as n-cc.. Also On, and therefore snn tends to zero. And therefore ao,o t+ abl 1+... + an fn tends to a limit, i.e. the series 2 av,,v is convergent. The most important case of this theorem is obtained by supposing a,,=cosnO or sin nO, in which case 2a,, oscillates finitely (Exs. xxxiv. 1, 2)

Page 329

171] CONVERGENCE OF SERIES AND INTEGRALS 329 except when 0=0 or a multiple of 27r. It follows that if ()t(n) is a positive function of n which tends steadily to zero as n — o, the series q (n) cos n0, q) (n) sin nO are convergent, except perhaps the first series when 0 =0 or a multiple of 2wr. In this case the first series reduces to E (n), which may or may not be convergent: the second series vanishes identically. If E + (n) is convergent, both series are absolutely convergent (Ex. LxXIX. 4) for all values of 0, and the whole interest of Dirichlet's Test lies in its application to the case in which 2 ) (n) is divergent. And in this case the series above written are conditionally and not absolutely convergent, as will be proved in Ex. LxxxI. 6 below. If we put 0= r in the cosine series we are led back to the result of ~ 170, since cosnr = (- l)n. Abel's Test. There is another test, due to Abel, which, though of less frequent application than Dirichlet's, is sometimes useful. Suppose that 4n,, as in Dirichlet's Test, is a positive and decreasing function of n, but that its limit as n —oo is not necessarily zero. Thus we postulate less about O)', but to make up for this we postulate more about a,, viz. that it is convergent. Then we have the theorem: if n,, is a positive and decreasing function of n, and 2 a, is convergent, then 2 a, 4n is convergent. For on has a limit as n —oo, say 1: and lim (,n-l)=0. Hence, by Dirichlet's Test, E a, (n-l) is convergent; and as 2a is convergent it follows that E an 4n is convergent. This theorem may be stated as follows: a convergent series remains convergent if we multiply its terms by any sequence of positive and decreasing factors. Examples LXXXI. 1. Show that a convergent series remains convergent if we multiply the nth term by n,,, where )n is an increasing function of n which tends to a limit I as n-coo. [Write 4,,=l-),n and consider the series 2 an n.] 2. Deduce Abel's Test directly from the identity of p. 328, in a manner similar to that in which we deduced Dirichlet's Test. 3. The series 2 (cos n0)/ns, 2 (sin nO)/ln are convergent if s> 0, unless (in the case of the first series) 0=0 or a multiple of 27r. 4. The series of Ex. 3 are in general absolutely convergent if s> l, conditionally convergent if 0 < s 1, and oscillatory if s 5 0 (finitely if s=0 and infinitely if s < 0). Mention any exceptional cases. 5. If E a n a- is convergent, a. n-t is convergent for t >s. 6. If n, is a positive function of n which tends steadily to 0 as n-aco, and 2 4n is divergent, the series 2 4)n cos nO, O sin l nO are not absolutely convergent, except the sine-series when 0=0 or a multiple of 7r. [For suppose,

Page 330

330 CONVERGENCE OF SERIES AND INTEGRALS e.g., that 2j I cos nO were convergent. Since cos2 nO I cos nO I it follows that 2 n % cos2 nO or (1 +cos 2nO) would be convergent. But this is impossible, since On is divergent and O,, cos2n0, by Dirichlet's Test, convergent, unless 0 is a multiple of 7r. And in this case it is obvious that n,, cos nO is divergent. The reader should write out the corresponding argument for the sine-series, noting where it fails if 0 is a multiple of r.] 172. Series of complex terms. So far we have confined ourselves to series all of whose terms are real. We shall now consider the series un -= (vn + iwn), where vn and wn are real. The consideration of such series does not, of course, introduce anything really novel. The series is convergent if, and only if, the series Evn,:wn, are separately convergent. There is however one class of such series so important as to require special treatment. Accordingly we give the following definition. This definition is an obvious extension of that of ~ 166. DEFINITION. The series 2zn, where un = vn + iwn, is said to be absolutely convergent if the series 2vn and 2w,, are absolutely convergent (~ 166). THEOREM. The necessary and sufficient condition for the absolute convergence of u,, is the convergence of E lI t I or Z \/(vn2 + Wn2). For if 2un is absolutely convergent, both of the series E vl, Wnit are convergent, and so E { Vn \ I Wn I } is convergent: but /(v2 + WL) < V | + Wn and therefore I un is convergent. And conversely, since |I n i - (v,+2 + W,2), Wl- V (v2 + Wn2), E Iv n and E I wn, are convergent if E I,, n is convergent. It is obvious that an absolutely convergent series is convergent, since its real and imaginary parts converge separately. And Dirichlet's Theorem (~~ 152, 167) may obviously be extended to absolutely convergent complex series, since it is true of the separate series 2v, and Zw,.

Page 331

172-174] CONVERGENCE OF SERIES AND INTEGRALS 331 173. Power Series. One of the most important parts of the theory of the ordinary functions which occur in elementary analysis (such as the sine and cosine, and the logarithm and exponential, which will be discussed in the next two chapters) is that which is concerned with their expansion in series of the form Saxn. Such a series is called a power series. We have already, in Ch. VII, in connection with Taylor's and Maclaurin's series, come across some cases of expansion in series of this kind. There, however, we were concerned only with real values of x. We shall now consider a few general properties of power series in x, where x is not restricted to be real, which will be useful in Chapters IX and X. A. A power series may be convergent for all values of x, for a certain region of values, or for 'no values except x = 0. It is sufficient to give an example of each possibility. 1. The series 2 (xn/n!) is convergentfor all values of x. For if un =xn/n!, then I,6n+ l 1/{ ~, I=| | /(n+l)-0 as n -oo, whatever value x may have. Hence, by d'Alembert's test, 2 1 U, I is convergent for all values of x, and the original series is absolutely convergent for all values of x. We shall see later on that a power series, when convergent, is generally absolutely convergent. 2. The series n!xn is not convergent for any value of x except x=O. For if u,,=n! xn, l+, I1/j n I=( n+1) I, which tends to +oo with a, unless x=O. Hence (Exs. xxx. 1, 2) the modulus of the nth term tends to +oo with n; and so the series cannot converge, except when x=0. It is obvious that any power series converges for x = 0. 3. The series 2 x" converges if I x I < 1, and not if I x I _ 1, by ~ 76. Thus we have an actual example of each of the three possibilities. 174. B. If a power series aanxn" is convergent for a particular value of x, say x, = r, (cos 0 + i sin 0i), it is absolutely convergent for all values of x such that x < ri. For since Yaxin is convergent, lim anxn = 0, and we can certainly find a constant K such that l axlnI <K for all values of n. But, if Ix =r< r,, (\ it / \n a Cnxcn | = n a n ( _1) < K (_ ); and the result follows at once by comparison with the convergent geometrical series Z (r/rt)n.

Page 332

332 CONVERGENCE OF SERIES AND INTEGRALS [VIII In other words, if the series converges at P, it converges absolutely at all points nearer to the origin than P. Examcple. Show that the result is true even if the series oscillates finitely for x=xl. [If s,=ao+alxl+...+a,x x, we can find K so that 1s,,<Kfor all values of n. But I anxl'll= Isn- sn_l Is_|+| sn <2f, and the argument can be completed as before.] 175. The region of convergence of a power series. The circle of convergence. Let x = r, be any point on the positive real axis. If the power series converges when x = ri it converges absolutely for all points inside the circle Ix r= r. In particular it converges for all real values of x less than r,. Now let us divide the points r, of the positive real axis into two classes, the class for which the series converges and the class for which it does not. The first class must contain at least the one point x = 0. The second class, on the other hand, need not exist, as the series may converge for all values of x. Suppose however that it does exist: and that the first class of points does include points besides x = 0. Then it is clear that every point of the first class lies to the left of every point of the second class. Hence there is a point A (x = R, say) which divides the two classes, and may itself belong to either one or the other. Then the series is absolutely convergent at all points inside the circle w x = R. For let P be any such point. We can draw a circle, whose centre is 0 and whose radius is less than R, so as to include P p, inside it. Let this circle cut OA in Q. Then the series is con- \ vergent at Q, and therefore, by Theorem B, absolutely convergent o Q A Q' X at P. On the other hand the series cannot converge at any point P' FIG. 66. outside the circle. For if it converged at P' it would converge absolutely at all points nearer to 0 than P: and this is absurd, as it does not converge at any point between A and Q' (Fig. 66). So far we have excepted the cases in which the power series (1) does not converge at any point on the positive real axis

Page 333

174-176] CONVERGENCE OF SERIES AND INTEGRALS 333 except x = 0, or (2) converges for all points on the positive real axis. It is clear that in case (1) the power series converges nowhere except for x= O, and in case (2) it is everywhere absolutely convergent. Thus we obtain the following result: a power series either (I) converges for x = O and for no other value of i; or (2) converges absolutelyfor all values of x; or (3) converges absolutely for all values of x within a certain circle of radius R, and does not converge for any value of x outside this circle. In case (3) the circle is called the circle of convergence and its radius the radius of convergence of the power series. It should be observed that this general result gives absolutely no information about the behaviour of the series on the circle of convergence. The examples which follow show that as a matter of fact there are very diverse possibilities as to this. Examples LXXXII. 1. The series l+ax+aa2x2+... (a >0) has a radius of convergence equal to 1/a. It does not converge anywhere on its circle of convergence, diverging for x= 1/a and oscillating for all other points on the circle. X X2 X3 2. The series f + 2 + 2+... has its radius of convergence equal to 1; it converges absolutely at all points on its circle of convergence. 3. Generally if Ia+ll/la[-,-X or lanl1/ -~X as naoo, the series ao+alx+a2 x2+... has 1/A as its radius of convergence. For in the first case; lim I an+lxel+l 1/[ a xn =Xx which is less or greater than unity according as Ix is less or greater than 1/X, so that we can use D'Alembert's Test (~ 151. 3). In the second case we can use Cauchy's Test (~ 151. 1) similarly. 4. The series x- x2 + x3-... has its radius of convergence equal to unity. It diverges for x= - 1, but is convergent (though not absolutely) for all other points on the circle of convergence, since its real and imaginary parts are cos 0 - cos 20 +..., sin 0 - sin 20 +.... 176. Uniqueness of a power series. If laxn is a power series which is convergent for some values of x at any rate besides x=0, and f(x) is its. sum, it is easy to see that f(x) can be expressed in the form ao + alxx+ a2x2 +... + (a, + x) xn, where Ex-~0 with x. For if M is any number less than the radius of convergence of the series, and I x i <, we have If(x)- a,,xl< lx Iv+l (lav+lll a,+281 +l a+312+... )< lIx 1+ 0

Page 334

334 CONVERGENCE OF SERIES AND INTEGRALS where K is a number independent of x. It follows (cf. Ex. LVII. 15) that if axn=bnxn, for all real values of x whose modulus is less than some fixed number, then an= bn for all values of n. This result is capable of considerable generalisations into which we cannot enter now. It shows that the same function f(x) cannot be represented by two different power series. 177. Multiplication of Series. We saw in ~ 153 that if Etn and Evn are two convergent series of positive terms, then xUn X EVn = Wn,- where Wn = UoVn + UiVn-1 +... + UnVo. We can now extend this result to all cases in which lun and v,, are absolutely convergent; for our proof was merely a simple application of Dirichlet's theorem, which we have already extended to all absolutely convergent series. Examples LXXXIII. 1. If I x is less than the radius of convergence of either of the series Ya,nx", l bn", then the product of the two series is 2 CXn, where n = ao b + al b - +... + a, bo. 2. If the radius of convergence of 2an x is r, and f(x) is the sum of the series when I x <r, and x is less than either r or unity, then f()/(1 -X)=-SnXn, where sn=ao+al+...+an. 3. Prove by squaring the series for 1/(1 - x) that 1/(1 - x)2= 1 + 2x + 32 +... if Ixl<l. 4. Prove similarly that l/( -x)3=1+3x+6x2+..., the general term being (n+ 1) (n +2) xn. 5. The Binomial Theorem for a negative integral exponent. If x < 1 and n is a positive integer, then -I n(+ l) ~ n (=.. +n (+)...(n +r - l) x+ 1 i+,nx.+,,n2+...+ ~ 2 XI'r,, -x)- - -- 'x'1.2 1.2+..( r [Assume the truth of the theorem for all indices up to n. Then, by Ex. 2, 1/(1-r)n +1 = 2 s,. r, where n(n+l) n+(n +1)...(n+r-1) (n+ +)(n+2)...(n+r) s=l4+n...+ 7.- ' 1.2 1.2...r 1.2...r The last identity is easily proved by induction. We leave it to the reader to supply the details.] 6. Iff(m, x) +( =+ () x+ ( ) 2+..., prove by multiplication of series that, when Ix <1, f(m, x) f(n, x)=-f(m+n, x). [This equation forms the basis of Euler's proof of the Binomial Theorem. The coefficient of Xk in the product series is /W A/ \ n / X \ -n\ zn k )- + 1 ) - 1) + \ 2 )k - 2) Jr ' *Jk 1) 1 J k)

Page 335

176, 177] CONVERGENCE OF SERIES AND INTEGRALS 335 This is a polynomial in m and n: but when m and n are positive integers this polynomial must reduce to ( + ), in virtue of the Binomial Theorem k for a positive integral exponent. And it is easy to see that if two such polynomials are equal for all positive integral values of rm and ni they must be identically equal.] x2 7. If f(x) = 1 +x + - +... then /(x)/(.y) =f(+y). [For the series for f(x) is absolutely convergent for all values of x: and if un=x/n!, Vn=yn/n! it is easy to see that w= (x +y)n/n!.] x2 I4 X 3 46 8. If C(x)=l- +4-..., Sx- +-f-.., then C(x+y)=C(x) C(y) - (x) (y), S(x+y)=S(x)C(y) + C(x)(y), and {C(X)}2 + {8(X)}2 = 1. 9. Failure of the Multiplication Theorem. That the theorem is not always true when 2un and 2ev are not absolutely convergent may be seen by considering the case in which u,,=vv=( - 1)/s%/(n+ 1). Then n, 1.= (-1) ( r + 1) (n+ 1- r)} But /{(r + 1) (n + 1 - r)}2 1 (n + 2), and so Iw t > (2n + 2)/(n + 2), which shows that 2 wn is not convergent. MISCELLANEOUS EXAMPLES ON CHAPTER VIII. 1. Discuss the convergence of the series 2nk{/(n + l) - 2/n +/( - 1)}, where k is real. (Jfath. Trip. 1890.) 2. Show that the series 1- 1 1 1 1 1+x 2 2+x 3 3+x - is convergent provided x is not a negative integer. 3. Investigate the convergence or divergence of the series 2s, n s, (sin, ()sin 2 1-cos, 2(-1)n 1-co n n n (' c n n) where a is real. 4. Discuss the convergence of the series (1+2+ 1 + 1 )sin (' O +a) where 0 and a are real. (Math. Trip. 1899.) 5. Show that, if R (n) is any rational function of n, we can determine a polynomial P (n) and a constant A such that 2 {R (n) - P (n)-(A/n)} is convergent. Consider in particular the cases in which R(n) is one of the functions 1/(an + b), (an2 + 2bn +c)/(an2+ 2i3n +y).

Page 336

336 MISCELLANEOUS EXAMPLES ON CHAPTER VIII 6. If z1, u2, u3,... is a decreasing sequence of positive numbers whose limit is zero, the series u1- (l + t2) + - (T + 2+3 )-..., 1- U (ul + 3) + (1 + 3 +5)-.. are convergent. [For if (ll+ u2 +... +u,,)/n = v, then vl, v2, V3,... is also a decreasing sequence whose limit is zero (see Ch. IV, Misc. Ex. 9). This shows that the first series is convergent; the second we leave to the reader. In particular the series 1- (1+~I) + (1+I+.I)-+ 1 I —I (1+1) +I (1l+ +T)2'2'''2 53 '' are convergent.] 7. The series E cos n, sin nO oscillate finitely, the sum of the first n terms of either series always lying between -cosec 0 and cosec 0, while we can find values of n, as large as we please, and for which the sum lies as near as we please to either of these limits, or to any number which lies between them. If an increases or decreases steadily to a limit which is not equal to zero, the series Za, cos nt, Za~ sin nO also oscillate finitely. 8. Find the sums of the series x 2x2 4x4 x x2 x4 l+x 1 —x21+x4+... l1 + 1-x4+1 8+' (in which all the indices are powers of 2), whenever they are convergent. [The first series converges only if I x <1, its sum then being x/(1 -x); the second series converges to x/(l -x) if I x <1 and to 1/(1 -x) if Ix 1>1.] 9. Find the sum of the series 2 u, where 1 X n --- 1 1( 1 n n-(X, + X —) (X+n + 1 J -,- - -1) + 1X + -n + 1 + x-n- for all real values of x for which the series is convergent. (Math. Trip. 1901.) [If | x I is not equal to unity, the series has the sum x/(x - 1) (x2+ 1)}. If x=1, then un,=0 and the sum is 0. If x=-l, then tn =-(-1)X+1 and the series oscillates finitely.] 10. If 'uo+u+ u2+... is a divergent series of positive and decreasing terms, then (uo+u+ -... + 2)/(q +u3+... + n + 1)- 1. 11. If, a, 1 for all values of n, and 1 +ax + a2x2+... is a finite series, or an infinite series convergent for x 1 <1, the equation 0=1 +a-lx +axa2+... cannot have a root whose modulus is less than I, and the only case in which it can have a root whose modulus is equal to ~ is that in which a, = - Cis (nO), when x= Cis (- 0) is a root. C 1 12. Prove that, if a>0, then lim 2 = 0. p-o n=O (-P + n)l + 13. Prove that lim a -- = 1. [Use the inequalities of ~~ 157-8 to a-+ o ] l+a show that the sum of the series lies between I/a and (1 +a)/a.

Page 337

MISCELLANEOUS EXAMPLES ON CHAPTER VIII 337 14. Recurring Series. A power series lanxn is said to be a recurring series if its coefficients satisfy a relation of the type a,+plan-_l+P2an-2+... +Pkan-k=0......................(1), where nik and Pi, p2,..., Pk are independent of n. Any recurring series is the expansion of a rational function of x. To prove this we observe in the first place that the series is certainly convergent for sufficiently small values of x. For let G be the greater of the two numbers 1, 11 + P2| +... + Pk. Then it follows from the equation (1) that \a \,__Gan, where an is the modulus of the numerically greatest of the preceding coefficients; and from this that an I<KGn, where K is independent of n. Thus the recurring series is certainly convergent for values of x numerically less than 1/G. But if we multiply the series f (x) = 2anx by pix, p2x2,... pkxk, and add the results, we obtain a new series in which all the coefficients after the (k - l)th vanish in virtue of the relation (1), so that (1 +plX+2 X2 p +2- +)=... + kxk) f (x) = Po P... Pk- 1k-1, where Po, P1,..., P- 1 are constants. The polynomial 1 +plx +p22 +... *-pkXk is called the scale of relation of the series. Conversely, it follows from the known results as to the expression of any rational function as the sum of a polynomial and certain partial fractions of the type A/(x - a)p, and from the Binomial Theorem for a negative integral exponent, that any rational function whose denominator is not divisible by x can be expanded in a power series convergent for sufficiently small values of x, in fact for Ixl<p, where p is the least of the moduli of the roots of the denominator (cf. Ch. IV, Misc. Exs. 19 et seq.). And it is easy to see, by reversing the argument above, that the series is a recurring series. Thus the necessary and sufficient condition that a power series should be a recurring series is that it should be the expansion of such a rational function of x. 15. Solution of Difference-Equations. A relation of the type of (1) in Ex. 14 is called a linear difference-equation in an with constant coefficients. Such equations may be solved by a method which will be sufficiently explained by an example. Let the equation be an- aan- -8an-2+12an3 =0. Consider the recurring power series Eanx. We find, as in Ex. 14, that its. sum is ao + (al - ao) x + (a2 - al -ao8) x2 A1 A2 B 1 - - 82 + 12x3 1 - 2x (1 - 2x)2 1+3x where Al, A2, B are numbers easily expressible in terms of a0, al, and a2. Expanding each fraction separately we see that the coefficient of xn is a,= 2n {Al+ (n + l) A2} + (- 3)n B. The values of Al, A2, B depend upon the first three coefficients ao, al, a2, which may of course be chosen arbitrarily. H. A. 22

Page 338

338 MISCELLANEOUS EXAMPLES ON CHAPTER VIII 16. The solution of the difference-equation u,, -2 cos 0 t, -_ 1+ - 2 = 0 is u, ---A cos nO +B sin nO, where A and B are arbitrary constants. 17. If u, is a polynomial in n of degree r, ~2 it,xn is a recurring series whose scale of relation is (1 - x) + 1. (Math. Trip. 1904.) 18. If f(n) is the coefficient of xn in the expansion of x/(l+x+-x2) in powers of x, prove that (1) f (n) +f (n - 1) +f (n -2) =, (2) f (n) = (c03 - Ca32)/(03 - 032), where o03 (Ch. III, ~ 41) is a complex cube root of unity. Deduce that f(n)=0 or 1 or - 1 according as n is of the form 3k or 3k + 1 or 3k + 2, and verify this by means of the identity x/(1 + x + x2)=x (1 - x)/(1 - x3). 19. A player tossing a coin is to score one point for every head he turns up and two for every tail, and is to play on until his score reaches or passes a total n. Show that his chance of making exactly the total e is 1 {2 + (- ~)n}. (Math. Trip. 1898.) [If p, is the probability, p,,=2 (p ~-+p, -_ ) alsoPo=l, P1=.] 20. If vn=(- 1)n- 1 (+l + u2 +... + tun) and s,, = +V2 +...+v,, then lim (sl + sa2 + s,)/ = (l- S,02+ % - * * ), whenever the series is convergent. [It is easy to verify that S2n - 1 = U1 +-3 "+ * * + it2nl- l S2 2 = -- 2 -4 ~ ~ -- U2n 2v-1 + S2v = 1 -,2 U-2 +U3 - U4 +... + 2-2v-1 - lt2v, so that s2-1 + s2V,-zn, where Ue is the sum of the series on the right-hand side when continued to infinity. Hence (Ch. IV, Misc. Ex. 28) lim {(S1 + S2) + (S3 + 4) +... -+ (S2 -1 + S2,)}/1 = It, which proves the result when n-e-oo through even values. Also, since u2N —O, S2n,/ —O, and so the result is true when n —oo in any way.] 21. Prove that a+1 - +2+ a+n -a+ ) - (a+l) (a+2) + if n is a positive integer and a is not one of the numbers -1, - 2,..., -n. [This follows from splitting up each term on the right-hand side into partial fractions. If a >- 1 it may be proved very simply from the equation ( xa 1 ate 1- d = (1 dx by expanding (1 - x)/(l - ) and 1 -(1- -x) in powers of x and integrating each term separately. The result, being merely an algebraical identity, must be true for all values of a save - 1, - 2,..., - n.] 22. Prove by multiplication of series that ~, Xn oo -)n-lxn a_ I ]_) X' 0 n! 1 n. i! 1 ~ 3 n

Page 339

MISCELLANEOUS EXAMPLES ON CHAPTER VIII 339 [The coefficient of x" will be found to be n -I(L +IG) Now use Ex. 21, putting a=0.] 23. If A,-A and B —.B as n —oo, then (Al B +A2Bn-1 +.. +-A Bl)/n-AB. [Let A.=A +E-. Then the expression given is equal to B+B2++.+Bn, E Bn+E2B, +... +EnB n n The first term tends to AB (Ch. IV, Misc. Ex. 28). The modulus of the second is less than /3 { 1 +1 + +... +E1 e }/n, where 3 is any number greater than the greatest value of I B, I: and the last expression tends to zero.] 24. If c= al b, b +a2s bn1+.. + a bi and A,=al+a2+-...-+a,,, B=bl+b2+...+b, n C,=l+c2+... +C, prove that Cn=a Bn+a2Bn- I +... +-,B1 =bl1A,,+b2A,-_+...+ bnA and C2 7+- 2+... + 2 =AlB+A2B^, _1 +... + AB1. Hence prove that if the series 2n, Ebn are convergent and have the sums A, B, so that An-A, B, —B, then (C1 + C2 +... + Cn)/l- AB. Deduce that if 2E, is convergent, its sum is AB. This result is known as Abel's Theorem on the multiplication of Series. We have already seen that we can multiply the series Ian, 2b, in this way if both series are absolutely convergent: Abel's Theorem shows that we can do so even if one or both are not absolutely convergent, provided only the product series is convergent. 25. Prove that I (1- +~ -...)2=~ - ( +1 (+-1 )-... (-+.-. _.)=-2 -I ( + ) + 1 (I + I + )- - [Use Ex. 6 to establish the convergence of the series.] 26. For what values of m and n, has the integral (sin)n (1 - cosx) dx a meaning? [It has a meaning if m + 1 and m +2n+ 1 are positive.] 27. Prove that, if a>l, then 1 dx, o (a - x)^(l 2) - /(2 -)'

Page 340

340 MISCELLANEOUS EXAMPLES ON CHAPTER VIII 28. Establish the formulae F{v(x2 + l)+x.} dx= (i+-) F (y)dy, fFj{ (x2+l )-x}dx=1 (1+) F 1 ) dy, where F is such a function that the integrals have a meaning. In particular, prove that if n > 1 then 00 dx 0 n o {(x+l)+x}"=o {(:2-+l)-1x} -X=n2 [First put x=sinh u and then eU=y.] 29. Show that if 2/=ax- (b/x), where a and b are positive, y increases steadily from - oo to + oo as x increases from 0 to +oo. Hence show that f ax +a $)}d= ft- f. %2( + ab) {+ b)}dy = f /{(y2+ ab)}dy. ao 0 30. Show that if 2y=ax+ (b/x), where a and b are positive, two values of x correspond to any value of y greater than J(ab). Denoting the greater of these by xl and the less by x2, show that, as y increases from J/(ab) towards oo, xl increases from J/(b/a) towards oo, and x2 decreases from V(bla) to 0. Hence show that /(y) dx& = 1 Y f + i d Y/(b/a) a (ab) ( ( 2 ab) + l d r\I(bla) r( v O(ba) a \/(ab) {vS(y2 ) }d Jb) and that ~lo~f oyb xl 2 0 /f() 2 2 ) f{ ( ax X dx - y( ) dy = _ f f f{/(2 + ab)} dz, fo 0{ \ x+)} J J =f(ab) (y2 - ab) a f denoting any function such that these integrals have a meaning in accordance with the definitions of ~~ 160 et seq. 31. Prove the formula f (sec xl + tan 1 x) 2 J -f(cosec x) $c' () 2 (sin x) xo (sin X) 32. If a and b are positive, then fo dx _r- b ' x2dx rr Jo (x2+a2) (X2+b2)= 2ab (a+b)' o (x2+a2) (x2+ b2) 2 (a+ b) Deduce that if a, 3, and y are positive, and 2> ay, then f/ dx- _ _ r f x2dx _r XJo a4+2,3x2+ 2 /(2yA)' J ax4 + 2x- 2 +y 2 ^(2aA)' where A=/3+V/(ay). Also deduce the last result from Ex. 29, by putting f(y)= l/(C2+y2). The last two results remain true when 12<ay, but their proof is then not quite so simple.

Page 341

CHAPTER IX. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS OF A REAL VARIABLE. 178. THE number of essentially different types of functions with which we have been concerned in the foregoing chapters is not very large. Among those which have occurred the most important for ordinary purposes are polynomials, rational functions, algebraical functions, explicit or implicit, and trigonometrical functions, direct or inverse. We are however far from having exhausted the list of functions which are important in mathematics. As the range of mathematical knowledge has widened so have new classes of functions, one after another, been introduced and defined, and their properties investigated. These new functions have generally been introduced because it appeared that some problem which was occupying the attention of mathematicians was incapable of solution by means of the functions already known. The process may fairly be compared with that by which the irrational and complex numbers were first introduced, when it was found that certain algebraical equations could not be solved by means of the numbers already recognised. One of the most fruitful sources of new functions has been the problem of integration. Attempts have been 'made to integrate some function f(x) in terms of functions already known. These attempts have failed; and after a certain number of failures it has begun to appear probable that the problem is insoluble. Sometimes it has been proved that this is so; but as a rule such a strict proof has not been forthcoming until later on. Generally it has happened that mathematicians have taken the impossibility for granted as soon as they have become reasonably convinced of it, and have introduced a new function F (x) defined by its

Page 342

342 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX possessing the required property, viz. that F'(x) =f(x). The properties of F (x) have been investigated by starting fromn this definition, and it has then appeared that F (x) has properties which no finite combination of the functions previously known could possibly have; and thus the correctness of the assumption that the original problem could not possibly be solved has been established. One such case has occurred in the preceding pages, when in Ch. VI we defined the function log x by means of the equation log-x = Let us consider what grounds we have for supposing logx to be a really new function. We have already seen (Ex. xLIv. 5) that it cannot be a rational function, since the derivative of a rational function is itself a rational function, whose denominator contains only repeated factors. The question whether it can be an algebraical or trigonometrical function is more difficult. But it is very easy to become convinced by a few experiments that differentiation will never get rid of algebraical irrationalities. For example if we differentiate ^/(1 +x) any number of times the derivative is always the product of ^(1 + x) by a rational function. And so generally-the reader should test the correctness of the statement by experimenting with a number of examples, such as /(x+V/x), (1-/x)/(l+1/x). Similarly with the trigonometrical functions: if we differentiate a function which involves sinx, or cosx, one or other of these functions persists in the result; while differentiation of a function which involves arcsinx or arctanx always leads to a function involving 1/(1 - x2) or 1 +x2 in a form impossible to eliminate. We have therefore, not indeed a strict proof that log x is a new function — that we do not profess to give —but a reasonable presumption that it is. We shall therefore treat it as such, and we shall find on examination that its properties are quite unlike those of any function which we have come across hitherto. 179. Definition of log x. We define log x, the logarithm of x, by the equation x dt log = t. Here x is positive: if x is negative the integral has no meaning (Ex. LXXVIII. 2). We might have chosen a lower limit other than 1; but 1 proves to be the most convenient. With this definition log 1 =0. We shall now consider how log x behaves as x varies from 0 * For such a proof see p. 35 of the author's tract quoted on p. 225.

Page 343

178, 179] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 343 towards co. It follows at once from the definition that log x is a continuous function of x which increases steadily with x and has a derivative D, log x = 1/x. Moreover (~ 158) log x tends to + co with x. If x is positive but less than 1, log x is negative. For log x =z dt 1 dt. logx= fX=- Tf 0' Moreover, if we make the substitution t = 1/u in the integral, we obtain log x = - = log (l/). Thus as x varies from 1 downwards towards 0, log x tends steadily to - o. The general form of the graph of the logarithmic function is shown in Fig. 67. Since the derivative of log x is l/x, the slope of Y 0 1 X FIG. 67. the curve is very gentle when x is very large, and very steep when x is very small. Examples LXXXIV. 1. Prove from the definition that if u > 0 then i/(l + u) < log (1 + u) < u. [For log (1 + it) = l+t' and the subject of integration lies between 1 and 1/(1 + ).] 2. Prove that, if u > 0, log (1 + ) lies between u - u2 and u - {u2/(1 +l)}. [Use the fact that log(l+i)= =it-.] 3. IfO < < 1 then u <-log (l - u)<u/(l -).

Page 344

344 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX 4. Prove that lim log x= lim log (1 + t)1 — l x-I tw-o t [Use Ex. 1.] 180. The functional equation satisfied by logx. The function log x is a solution of the functional equation. f(xy)=f ( ) +f(y).....................(1). For, making the substitution t = yu, we see that fXY dt [x du r" du 1/y du log (xy) = - = - =j - e v J1 t JlUy U Ji A J1 A = log - log () =log = + logy, which proves the theorem. Examples LXXXV. 1. It can be shown that there is no solution of the equation (1) fundamentally distinct from logx. For, if we suppose f(x) to be a function which has a differential coefficient (and other functions may be neglected as of no practical importance) we obtain by differentiating the functional equation, first with respect to x, and then with respect to y, the two equations yf' =/' (x), f'l (y) =f' (y); and so, eliminating f' (y), xf' (x) =yf' (y). But if this if true for every pair of values of x and y, we must have xf' (x) = C, or f' (x) = C/x, where C is a constant. Hence f(x)= - dx+C= Clogx+C'. Thus there is no solution fundamentally distinct from logx —except the trivial solution f(x)= O, given by taking C=0O. 2. Show in the same way that there is no solution of the equation f(x) +f(y) =f( +(y)/(l - ay)} fundamentally distinct from arc tanx. 181. The manner in which log x tends to + oo with x. It will be remembered that in Ch. V we defined functions of x which tend to + o with x in certain different ways, distinguishing between those which, when x is large, are of the first, second, third,... orders of greatness. A function f(x) was said to be of the kth order of greatness when f(x)/xk tends to a limit different from zero as x tends to + oo. It is easy to define a whole series of functions which tend to + co with x, but whose order of greatness is smaller than the first.

Page 345

180-183] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 345 Thus /x, A/x, t/w,... are such functions. We may say generally that xa, where a is any positive rational number, is of the ath order of greatness when x is large. We may suppose a as small as we please, e.g. less than -0000001. And it might be thought that by giving a all possible values we should exhaust the possible 'orders of infinity' of f(x). At any rate it might be supposed that if f(x) tends to + cc with x, however slowly, we could always find a value of a so small that xa would tend to + so more slowly still: and conversely that if f(x) tends to + oo with x, however rapidly, we could always find a value of a so great that x- would tend to +oo more rapidly still. Perhaps the most interesting feature of the function log x is its behaviour as x tends to + c. It shows that the presupposition stated above, which seems so natural, is unfounded. The logarithm of x tends to + cc with x, but more slowly than any positive power of x, integral or fractional. In other words log x -+ 4 but (log x)/xa - 0 for all positive values of a. This fact is sometimes loosely expressed by saying that 'the order of infinity of logx is infinitely small'-but to say that anything is 'infinitely small' is, strictly speaking, just as meaningless as to say that anything is 'infinitely great,' or that x = o, and we therefore advise the reader to avoid such modes of expression. 182. Proof that (log x)lxa - 0 as x — + oc. Let /3 be any positive number. Then if t > 1, 1/t < l/tl-, and so rX dt [X dt log x = < i t, or log x < (X - 1)//, < x9. Now if a is any positive number (e.g. '01) we can choose a smaller positive value of / (e.g. '001). And then 0 < (log x)csa < x-a//3 (x > 1). But since a > /, xa-a//3 - 0 as x - + oo, and therefore (log x)/xa -- 0. 183. Since (log x)/xa, = - ya log y if x = l/y, it follows from the theorem proved above that lim ya log y = - lim (log x)/la = 0. y --- +0 aX --- +

Page 346

346 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX Thus log x tends to - o and log (1/x) =- log x to + oo as x tends to zero by positive values, but log (l/x) tends to + oo more slowly than any positive power of 1/x, integral or fractional. 184. Scales of infinity. The logarithmic scale. Let us consider once more the series of functions x, Jx, x/x,..., /,.., which possesses the property that, if f(x) and + (x) are any two of the functions contained in it, f(x) and (x) both tend to +oo with x, while f(x)/5 (x) tends to 0 or to + oc according as f(x) occurs to the right or the left of 0 (x) in the series. We can now continue this series by the insertion of new terms to the right of all those already written down. We can begin with logx, which tends to infinity more slowly than any of the old terms. Then N/(logx) tends to + oo more slowly than logx, (/(logx) than,/(logx), and so on. Thus we obtain a series, Vx /x, x,..., / x,... logx, ^(logx, S/(logx),....... g/(log),... formed of two simply infinite series arranged one after the other. But this is not all. Consider the function loglogx, the logarithm of logx. Since (logx)/xa- O, for all positive values of x, it follows on putting x=logy that (log log y)/(log y)a = (log x)/xa ->0. Thus log logy tends to + o with y, but more slowly than any power of logy. Hence we may continue our series in the form x, v/x, /x,... log x, J/(log x), /(logx),... log log x, ^/(log logx),... /(log logx),...; and it will by now be obvious that by introducing the functions log log logx, loglogloglogx,... we can prolong the series to any extent we like. By putting xc= l/y we obtain a similar scale of infinity for functions of y which tend to + cc as y tends to 0 by positive values. Examples LXXXVI. 1. Between any two terms f(x), F(x) of the series we can insert a new term b(x) such that f (x) tends to + co more slowly than f(x) and more rapidly than F(x). [Thus between V/x and ~/x we could insert x5/12: between,/(log x) and N/(log x) we could insert (log x)5/12. And, generally, j (x) = /{f(x) F(x)} satisfies the conditions stated.] 2. Find a function which tends to + o more slowly than V/x, but more rapidly than xa, where a is any rational number less than 1/2. [^/x/(logx) is such a function; or /x/(logx)9, where 3 is any positive rational number.] 3. Find a function which tends to + o more slowly than Vx, but more rapidly than,/x/(log x), where a is any rational number. [Such a function is,x/(log log x). It will be gathered from these examples that incompleteness is an inherent characteristic of the logarithmic scale of infinity.] 4. How does the function f(x)= {xa (logx)a' (log log x)a"}/{xP (log x)P' (log log x)$"}

Page 347

183-185] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 347 behave as x tends to +co? [If a=/3 the behaviour of f (x) = x'a-P (log x)a'-' (log log x)a"-P" is dominated by that of xa-/. If a=3 the power of x disappears and the behaviour of f(x) is dominated by that of (log x)'-P', unless a'=t', when it is dominated by that of (log log x)""-P". Thus f(x) -- +oo if a> 3, or a =/3, a'> 3', or a= =, a'-=3', a" > 3", and f(x). O if a < 3, or a=13, a' < ', or a=3, a' =1', (" </ ".] 5. Arrange the functions x/^/(log x), x (log x)/loglog x, xlologog x//(logx), (xlogloglogx)/l(loglogx) according to the rapidity with which they tend to + co with x. 6. Arrange log log x/(x log x), (log x)/x, x log log x^/(x2 + 1), {/(x + 1)}/x (log x)2 according to the rapidity with which they tend to zero as x tends to + oo. 7. Arrange x log log (1/X), [x/{log (1/X)}3], {x sinxlog(1/.)}, (1-cosx)log(1/x) according to the rapidity with which they tend to zero as x — + 0. 8. Show that Dxlog log x= 1/( log x), Dx log log log x = l/( log x log log x), and so on. 9. Show that DC (logx)a =a/{x (log x) - a}, Dx (log log x) = a/{x log x (log logx)1 - a}, and so on. 185. The number e. We shall now introduce a number, usually denoted by e, which is of immense importance in higher mathematics. It is, like Tr, one of the fundamental constants which perpetually occur in analysis. We define e as the number whose logarithm is 1. In other words e is defined by the equation;e dt t t Since log x continually increases with x, it can only pass once through the value 1. Hence our definition does in fact define one definite number. Example. Prove that 2 < e < 3. [In the first place it is evident that 2 dt J1 t

Page 348

348 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX and so 2 < e. Also 3 dt (51/4 3/2 7/4 2 3 dt J1 t \ 1 J5/4 3/2 7/4 11/4 t > 1 1.++$+- +4 + 2 +A-+} >, so that e < 3.] Now log (xy) = log x + log y and in particular log x2 = 2 log x, log X3 = 3 log x,..., log x = n log x, where n is any positive integer. Hence log en = n log e = n. Again, if p and q are any positive integers, and ePlq denotes the positive qth-root of ep, we have p = log eP = log (ePlq)Q = q log ep/q, so that log (eP/) =p/q. Thus if y has any positive rational value and ey denotes the positive yth-power of e, we have log e = y..........................(1), and log e- =-loge =-y. Hence the equation (1) is true for all rational values of y, positive or negative. In other words the equations y = log x, x= e........................ (2) are consequences of one another so long as y is rational and ey has its positive value. At present we have not given any definition of a power such as ey in which the index is irrational, and the function ey is defined for rational values of y only. 186. The exponential function ey. We now define the exponential function eY for all real values of y as the inverse of the logarithmic function. In other words, if y = log x, we write X = ey. We saw that, as x varies from 0 towards + co, y increases steadily (in the stricter sense) from - oo towards + oo. To one value of x corresponds one value of y and conversely. Also y is a continuous function of x, and it follows from 88 that x is likewise a continuous function of y. A direct proof of the continuity of the exponential function is easily given. For if x = ey and x + = e +, it is clear that -f:dt'

Page 349

185-187] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 349 Thus 1 771 is greater than /(x + ) if > 0, and than |I l/x if < 0; and if 77 is very small ~ must also be very small. Thus ey is a continuous function of y which increases steadily from 0 towards + oo as y increases from - oo towards + oo. Moreover, by the results of ~ 185, eY is the positive yth-power of the number e, according to the elementary definitions, whenever y is a rational number. In particular ey = 1 when y = 0. The general form of the graph of ey is therefore as shown in Fig. 68. It is to be observed that ey is positive for all values of y. Y O X FIG. 68. 187. The principal properties of ey. (1) If x = ey, so that y = log x, we have dy/dx = 1/x and dx/dy = x = ey. Thus the derivative of the exponential function is equal to the function itself. In other words the exponential function is a function whose rate of increase is always equal to its own value. More generally, if y = eax then dy/dx = aeax. (2) The exponential function satisfies the functional equation f (x + ) =f(x)f(y). This is evident if x and y are rational, by the ordinary rules of indices. If x or y, or both, are irrational we can choose two series of rational numbers x., x2 -, X..., x; y, y,..., yn..., such that lim xn = x, lim y, = y. Then, since the exponential function is continuous, ex x ey = lim ex" x lim eY = lim eL+Sn = ex+y. In particular ex x e- = e~= 1, or e- = 1/ex. Or we may deduce the functional equation satisfied by ey from

Page 350

350 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX that satisfied by logx. For if y, = log x1, y2 log x2, so that x1 = eyl, x2 = eY2, we have y/ + y/2= log xI + log x2 = log (x1x2) and eyl+y2 = e log(xlx2) =,12 =.ey x ey2. Examples LXXXVII. 1. If dx/dy=x then x=iKey, where K is a constant. 2. There is no solution of the equation f(x +y)=f(x)() f) fndamentally distinct from f(y) =ey. [For, differentiating the equation with respect to x and y in turn, we obtain f'(x+y)=f'(x)f(y), f'(x+y)=f(x)f'(/) and so f' (x)/f(x) =f' (y)/f(y), and therefore each is constant. Thus if z=f(y), dz/dy =z/A, or y=A =A logz + B, A and B being constants; so that z=e(Y-B)/A.] 3. Prove that (e"a- l)/x-a as x —O. [Applying the Mean Value Theorem, we obtain eax- 1 =aeat where 0 < 4 < x.] 188. (3) The function ey tends to + co with y more rapidly than any power of y, or lim ya/ey = lim e-Yy = 0 as y -- + co, for all values of a, however great. We saw that (log x)/xg -G 0 as x - + oo, for any positive value of f3, however small. Hence, if a = 1/3, (log x)a/x 0 for any value of a, however large. The result follows by putting x = ey. From this result it follows that we can construct a 'scale of infinity' similar to that constructed in ~ 184, but extended in the opposite directioni.e. a scale of functions which tend to + oo with x more and more rapidly. This scale is X, X2 X3I... ex, e2X... e,..., e,..., eex,... eee 5..., where of course ex2,..., ee',... denote e(X2),..., e().... The reader should try to apply the remarks made in ~ 184 and Exs. LXXXVI, about the logarithmic scale, to this 'exponential scale' also. The two scales may of course (if the order of one is reversed) be combined into one scale... loglogx,... logx,... x,... ex,... ee... 189. The general power ax. The function ax has been defined only for rational values of x, except in the particular case when a = e. When a is rational and positive, the positive value of the power ax is given by the equations ax = (e log a)x = ex log a We take this as our definition of ax when x is irrational. Thus 10^ = eW2. log 10. It is to be observed that ax, when x is irrational,

Page 351

187-190] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 351 is defined only for positive values of a, and is itself essentially positive. The reader will find no difficulty in verifying the following statements. (1) Whatever value a may have, ax x ay = ax+y and (aX)y = acxY. In other words the laws of indices hold for irrational no less than for rational indices. (2) If a > 1, ax = e log a = ex, where a is positive. The graph of ax is in this case similar to that of ex, and ax - + oc with x, more rapidly than any power of x. For if a > 0 and ax=y, then x"e-aX = (y/a)me-.- 0 as x and y tend to + oo. If a < 1, ax = ex log a = e-a where a is positive. The graph of ax is then similar in shape to that of ex, but reversed as regards right and left, and ac - 0 as x -+ oo, more rapidly than any power of 1/x. (3) ax is a continuous function of x, and DxaX = ax log a. (4) ax is also a continuous function of a, and Daa = xax-l. (5) (ax - l)/x - log a as x 0. This of course is a mere corollary from the fact that D:al = ax log a, but the particular form of the result is often useful; it is of course equivalent to the result (Ex. LXXXVII. 3) that (ex - 1)/x -> a as x -> 0. In the course of the preceding chapters a great many results involving the function ax have been stated with the limitation that x is rational. The definition which we have now given, and the theorems proved above, enable us to remove this restriction. 190. The representation of ey as a limit. In Ch. IV, ~ 67, we proved that {1 + (l/n)} tends, as n oo, to a limit which we provisionally denoted by e. We shall now identify this limit with the number e of the preceding sections. We can however establish a more general result, viz. that expressed by the equations liam + =li+ I - = ey............(1). n % n n oo x 1e As the result is of very great importance, we shall indicate alternative lines of proof. (1) Since Dx log (1 + yx) = y/(l + yx), it follows that lim {log (1 + yh)}/h = y, as h - 0; or, putting h = 1/, lim [I log {1 + (y/~)}] = y

Page 352

352 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX as - + so or - - oo. Since the exponential function is continuous it follows that {1 + (y/)} = e log{l+(y/t)} -. e asf - + oo or - o --- o: i.e. that lim {1 + (y/:)} = lim {t + (y/l)}- = e. - -> + Xo - o If we suppose that -- + cc or - oo through integral values only, we obtain the result expressed by the equations (1). (2) If n is any positive integer, however large, and x > 1, we have Ax dt fx dt rX dt J1 tl + (l/) <J t J1 tl-(L,/n) or (1 - x- /) < log x < n (xl/- 1).....................(2) from which we deduce << (1 -............................(3) if y=logx>0. It is easy to prove, as in ~67, that the first of these two functions of y is an increasing and the second a decreasing function of n, and therefore that each tends to a limit as n — oo; and the two limits must be equal. For if l+(/yn) =7 and 1/{1-(y/n)}=172, we have (Ex. xxxvI. 8), at any rate for sufficiently large values of n, 2 - 't < 7 -1 (772 - '1) =y2 l"/nl which evidently tends to 0 as n- c oo. 191. The representation of logx as a limit. We can also prove (cf. ~ 68) that lim n (1- X- i/n) = lim n (xl/' 1) = log x. For n (xl - 1) -2 (1 — x-1/) = n (Xl -- 1)(1 - x-l/n) which tends to zero as n —co, since n (x1/ -1) tends to a limit (~ 68) and x-l'/' to 1 (Ex. xxx. 10). The result follows from the inequalities (2) of ~ 190 (2). Examples LXXXVIII. 1. Prove, by taking n1=6 in the inequalities (3) of ~ 190, that 2.5 < e < 2.9. 2. Prove that, if t> 1, (tl/- t-l/l)/(t-t-l) < /n, and so that, if x> 1, x dt x dt 1 X ( I\dt I He 1 ce t,-/) '1 t+J t )<1 (- ) t_ - =L(x+1-2). tI -1 tlj + 'ini) V t t n X Hence deduce the results of ~ 190. 3. If e,, is a function of n such that n$n,,-.l as n-eoo, then (1+ f)n-.el. [Writing n log (1 + en) in the form (n) log (1 + n,) and using Ex. LXXXIV. 4, we see that n log (1+ 4,) —.] 4. If ne —+co then (1+$&,)-n+ oo, and if -,-oo- then (1+)n-O0.

Page 353

190-192] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 353 192. Common logarithms. The reader is probably familiar with the idea of a logarithm and its use in numerical calculation. He will remember that in elementary algebra loga x (the logarithm of x to the base a) is defined by the equations =ay, y=loga X. This definition of course, at the stage of knowledge when it is usually given, can be regarded as applying only to rational values of y, though this point is often passed over in silence. Our logarithms are therefore logarithms to the base e. For numerical work logarithms to the base 10 are used. If y = log x = log, x, z = log0o x, then x = ey and also x = 10 = eZ loge 10, so that log10 x = (loge x)/(log 10). Thus it is easy to pass from one system to the other when once loge 10 has been calculated. It is no part of our purpose in this book to go into details concerning the practical uses of logarithms. If the reader is not familiar with them he should consult some text-book on Elementary Algebra or Trigonometry*. Examples LXXXIX. 1. Show that De(x)=0e(x) ' (x), D_ f (ex) = 'exq(ex). 2. Show that Dx eax cos bx = reax cos (bx + 0), Dx eax sin bx = reax sin (bx + 0) where r=^ /(a2+ b2), cos = a/r, sin 0=b/r. Hence determine the nth derivatives of the functions eax cos bx, ea sin bx, and show in particular that Dxn eax = an eax. 3. If y = eau, Dxy = ea (Dxu + au), and Dxn=eax {Dxnu+ () al -l+ () a2Dxn-24u... +aql}. [Proceed by induction.] 4. Integrals containing the exponential function. Prove that co bd a cos bx + b sin bx f a sin bx - b cos bx yeaxc~ bode= a2+b - eax year sinbxdxc - aeaX a2 + b2 a2 ~ b2 * See for example Chrystal's Algebra, vol. I., ch. xxI. The value of logg 10 is 2-302... and that of its reciprocal -434.... H. A. 23

Page 354

354 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX [Denoting the two integrals by I, J, and integrating by parts, we obtain al= eax cos bx + bJ, aJ= eax sin bx - b. Solve these equations for I and J.] 5. Prove that, if a>O, then e- ax cos b x = e - a sin bx dx - - fe aob2' a2abb2 6. If In,=eaxx$dx then aIleaxXn-nL,_1. [Integrate by parts. It follows that In can be calculated for all positive integral values of n.] 7. Prove that, if a is a positive integer, then e-xxndx=q!e-t (-1 —2!**n Jo0 2!. n! and f e-xx' dx=n!. Jo 8. Show how to find the integral of any rational function of ex. [Put x=logu, eX=u, dx/du=l/u, and the integral is transformed into that of a rational function of u.] 9. Prove that we can integrate any function of the form P(x, eax, ebx,...), where P denotes a polynomial. [This follows from the fact that P can be expressed as the sum of a number of terms of the type Axmnekx, where rn is a positive integer.] 10. Integrate ex {(x (x) + b' (x)}, x - ex (x+n), and eX (1 + sin x)/(1 + cos x). 11. Prove that e-xR( x) dx, where X>0 and a is greater than the greatest root of the denominator of R (x), is convergent. [This follows from the fact that eX-s + co more rapidly than any power of x.] 12. Prove that e-X2+,X dx is convergent, if X>0, for all values of /, -00 and that the same is true of f e-AXx2+x xn dx, where n is any positive integer. 13. Draw the graphs of ex2 -x,, e-X, exe x, xe- xe e-X2 and xlogx, determining any maxima and minima of the functions and any points of inflexion on their graphs. 14. Show that the equation eax=bx, where a and b are positive, has two real roots, one, or none, according as b > ae, b =ae, or b < ae. [The tangent to the curve y=eax at the point ($, eat) is y- et = aeat (x-), which passes through the origin if at= 1, so that the line y=aex touches the curve at the point (1/a, e). The result now becomes obvious on drawing the line y= bx. The reader should discuss the cases in which a or b or both are negative.] 15. Discuss in a similar manner the equation bxeax=l. [Put x=-z.]

Page 355

192] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 355 16. Show that the equation eX=lx + has no real root except x=0, and that eX x + x+ x2 has three real roots. 17. Draw the graphs of the functions log {x+ /(x2+ 1)}, log (i +), eaX cos (bx+ c) /, e-l/, e[XI, e-Cx2 os2 X+ 2 sin2 x. 18. Determine roughly the positions of the real roots of the equations X 2+ 1 log {+x ^+( +)}-1, = 0 ex-0 e in x= 7, eX2 sinx= 10000. 100' 2-x 10000' 19. The hyperbolic functions. The hyperbolic functions cosh x*, sinhx, etc. are defined by the equations cosh x = (ex +e-), sinh x= (eX-e-x), tanh x=(sinh x)/(cosh x), coth x=(cosh x)/(sinh x), sech x= l/(cosh x), cosech x= 1/(sinh x). Draw the graphs of these functions. 20. Establish the formulae cosh(- x) = cosh x, sinh(-x)= -sinh x, tanh (-x)= -tanh x, cosh2 x - sinh2 x= 1, sech2 x+ tanh2 x= 1, coth2 - cosech2 x= 1, cosh 2x = cosh2 x + sinh2 x, sinh 2x = 2 sinh x cosh x, cosh (x +y) = coshxcoshy +sinh inhiy, sinh (x +y) =sinh xcosh + cosh x sinhy. 21. Verify that these formulae may be deduced from the corresponding formulae in cos x and sin x, by writing cosh x for cos x and i sinh x for sin x. [It follows that the same is true of all the formulae involving cos nx and sin nx which are deduced from the corresponding elementary properties of cos x and sin x. The reason of this analogy will appear in Ch. X.] 22. Prove that DX cosh x= sinh x, Dx sinh x= cosh x, Dx tanh x = sech2 x, DX coth x=- cosech2, Dx sech x = - sech x tanh x, DX cosech x = - cosech x coth x, Dlog cosh x = tanh x, D.log sinh x= coth x, D, (2 arc tan e) = sechx, Dx log tanh -x= cosech x. [All these formulae may of course be transformed into formulae in integration.] 23. Prove that cosh x > 1 and -1 < tanh x < 1. 24. If y=coshx, x=log{y /(y2- 1)}. Ify=sinhx, x=log{y+/(y2 - )}. If y=tanh, x=log {(l +y)/(l -y)}. Account for the ambiguity of sign in the first case. * 'Hyperbolic cosine': for an explanation of this phrase see Hobson's Trigonometry, ch. xvi. 23 —2

Page 356

356 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 25. We shall denote the functions inverse to coshx, sinhx, tanhx by arg cosh x*, arg sinh x, arg tanh x. Show that arg cosh x is defined only for x? 1, and is two-valued, while arg sinh x is defined for all real values of x, and arg tanh x for - 1 < x < 1, and both of the two latter functions are onevalued. Sketch the graphs of the functions. 26. Show that if - I 7r < x < ~rr and y is positive, and cos x cosh y = 1, then y = log (sec + tan x), D y=sec x, Dyx=sechy. 27. Prove that, if a >0, then /( 2+ -- = arg sinh (x/a), and f//( _x) =arg cosh (a), - argcosh ( -/a), according as x > 0 or x < 0. 28. Prove that, if a > 0, then fs2 -a2-= -(1/a) arg tanh (x/a), (l/a) argcoth (x/a), according as x is numerically less or greater than a. [The results of Exs. 27 and 28 furnish us with an alternative method of writing a good many of the formulae of Ch. VI.] 29. Solve the equation acoshx+bsinhx=c, where c>0, showing that it has no real roots if b2 + c2 - a2< 0, while if b2 + c2 - a2>0 it has two, one, or no real roots according as a+ b and a - b are both positive, of opposite signs, or both negative. Discuss the case in which b2+ c2- a2 O. 30. Solve the simultaneous equations cosh x cosh y = a, sinh x sinh y = b. 31. As x —+oo, llX-.l. [For x1/x=e(logx)/x, and (logx)/x —0. Cf. Ex. xxx. 11.] 32. As x — +0, xx -l 1. 33. If {f(n + l)}/{f(n)} —., where 1> 0, as n-noo, then l/{f(n)} -l. [For logf(n + 1) - logf (n) -log i, and so (l/n) log (n)- log 1 (Ch. IV, Misc. Ex. 29).] 34. (n!)/ln — 1/e as -- o. [If f(n) = -! then {f(n+ l)}/{f(n)}={1 +(/n)}-.] 35. {(2n)!/(n!)2} - 4 as n- o. 36. Discuss the approximate solution of the equation e=x1000000~. [It is easy to see by general graphical considerations that the equation has two positive roots, one a little greater than 1 and one very large, and one negative root a little greater (algebraically) than - 1. To determine roughly the size of the large positive root we may proceed as follows. If ex=x=100000 then = 106log, log x=1382 + looglogx, loglogx= 263+log +loglg2 ), roughly, since approximate values of log 106 and log log 106 are 13'82 and 2-63 respectively. It is easy to see from these equations that the ratios logx: 13-82 and loglogx: 2-63 are not very far from ratios of equality, and that x= 106 (13'82 +log logx)= 106 (13-82 + 2 63) = 16450000 * The argument whose hyperbolic cosine is x.'

Page 357

193] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 357 gives a tolerable approximation to the root, the error involved being roughly measured by 106 (log log x - 2 63), which is not very far from being in a ratio of equality with (1061oglogx)/13'8 or (106x2'63)/13'8, which is less than 200,000. The approximations are of course very rough, but suffice to give us a good idea of the scale of magnitude of the root.] 37. Discuss similarly the equations eX=1000000x1~00~~~, e =l~000000000. 193. Logarithmic tests of convergence for series and integrals. We showed in Ch. VIII (~~ 158 et seq.) that 2, S la (a>0) i ns) ] a Xs are convergent if s > 1 and divergent if s < 1. Thus E (1/n) is divergent, but S(1/n1+a) is convergent, if a is any positive number, however small. We saw however in ~ 184 that with the aid of logarithms we can construct functions which tend to zero, as n c- o, more rapidly than l/n, yet less rapidly than l/nI+a, however small a may be (provided of course it is positive). For example l/(n log n) is such a function, and the question as to whether the series E (1/n logn) is convergent or divergent cannot be settled by comparison with any series of the type (1/n"). The same is true of such series as > {1/n (log n)2}, 2 {(log log n)/n/(log n)}. It is a question of some interest to find tests which shall enable us to decide whether series such as these are convergent or divergent: and such tests are easily deduced from the Integral Test of ~ 157. For since Dx (log x)1- = (1 - s)x (log x), Dx log log x= 1/x log x, we have dx (log )1 - (log a)s f dx log log:- log log a, x (logx)" 1-s x ogx if a > 1. The first integral tends to the limit - (log a)-8/(l - s) as — + oo, if s> 1, and to + oo if s < 1. The second integral tends to + oo. Hence the series and integral,0.-n (log n)s' J x (log X)"

Page 358

358 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX where no and a are greater than unity, are convergent if s> 1, divergent ifs s 1. It follows, of course, that ZSq (n) is convergent if b (n) is positive and less than K/{n (log n)S}, where s> 1, for all values of n greater than some definite value, and divergent if b (n) is positive and greater than K/(n log n) for all values of n greater than some definite value. And there is a corresponding theorem for integrals which we may leave it to the reader to formulate. Examples XC. 1. The series 1 (log n)l0 n2-1 1 n(log n)2' n1000 ' 2 +1 n(log n)7/6 are convergent. [The convergence of the first series is a direct consequence of the theorem of the preceding section. That of the second follows from the fact that (log n)00 is less than n? for sufficiently large values of n, however small 3 may be (provided it is positive). And so, taking /= 1/200, (log n)100 -101/100 is less than n-201/200 for sufficiently large values of n. The convergence of the third series follows from the comparison test at the end of the last section.] 2. The series 1 1 nlogn n (log n)6/7' nl~/10~1 (log n)0' 2 (nlog n) +1 are divergent. 3. The series (log n)P (log n)P (log log n)q (log log n) l+8 2 n +8 ' n (log n)l + where s > 0, are convergent for all values of p and q; similarly the series 1 1 1 nl -8 (log n)' n1 -8 (log n)p (log log n) 2 n (log n)1-8 (log log n)p are divergent. 4. The question of the convergence or divergence of such series as 2 1 1 log log log r n log n log log n' n log n /(log log n) cannot be settled by the theorem of ~ 193, since in each case the function under the sign of summation tends to zero more rapidly than l/(n log n) yet less rapidly than l/{n (log n)l +}, where a is any positive number, however small. For such series we need a still more delicate test. The reader-should be able, starting from the equations Dx (logk )1-8= (1 -s)/{x log x log2... logk _ 1 x (logk )S}, /D logk + = l/(x log x log2 x... logk- x logk x),

Page 359

193] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 359 where log2 x = log log x, log3 x = log log log x,..., to prove the following theorem: the series and integral v, 1__1 [dx no nlogn log n...logk- n (lgkn)8 a logx log2x... log1 _ (log )8 are convergent if s>l, divergent if sl1, no and a being any numbers sufficiently great to ensure that logkn and logkx are positive for n no or xHa. [These values of no and a increase very rapidly as k increases: thus log x >0 requires x >1, log2x >0 requires x >e, log log x >0 requires x>ee, and so on; and it is easy to see that ee>l0, ee>e10 > 20,000, eeee > eooo > 08000. The reader should observe the extreme rapidity with which the higher exponential functions, such as eex and eeex, increase with x. The same remark of course applies to such functions as aax and aaa, where a has any value greater than unity. It has been computed that 999 has 369,693,100 figures, while 1010~ has of course 10,000,000,000. Conversely, the rate of increase of the higher logarithmic functions is extremely slow. Thus to make log log log log x>l we have to suppose x a number with over 8000 figures.] 5. Prove that the integral f log(-)} dx, where 0<a<l, is convergent if s<- 1, divergent if s8_ - 1. [Consider the behaviour of e {og ()}d as -- + 0. This result also may be refined upon by the introduction of higher logarithmic factors.] 6. Prove that - {log ()} dx has no meaning for any value of s. [The last example shows that s<- 1 is a necessary condition for convergence at the lower limit: but, if s< 0, {log(l/x)}8 tends to + co as x-1 -0, like (1 - x), and so the integral diverges at the upper limit if s< -1.] 7. The necessary and sufficient conditions for the convergence of xa-i {log ()} dxre a>0, s> -1. Examples XCI. 1. Euler's limit. Show that 11 1 (n)-l+ - + +...+ — logn tends to a limit as n - oo. [For ( (n+ )- -l(n)= - og = >O. n n n >0. Hence /q (n) is an increasing function. But 11 - <1 == d tI1 1 or 0<0(n)<l-(1/n)<l. Hence 4(n)-l where 0<O _1. The value of I is in fact '577..., and I is usually called Euler's constant.]

Page 360

360 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX 2. Show similarly that if a and b are positive then 1 1 1 1 1 I + II..+og(1og (a+ nb) a + a+b+a+2b+ a+ +(n- 1)b (a+nb) tends to a limit as n - oo. 3. Show similarly that if 0<s <1 then (n)=1 + 2-8 + 38- +... + (n- 1)s - 8 {-8/(1 - )} tends to a limit as n -~ oo. 4. Show that the series 1 1 1 i+2(i+ +3(l+~+)+... 1 (1+) +3) is divergent. [Compare with E (1/n log n).] Show also that the series derived from 2 (l/n8), in the same way that the above series is derived from 2 (1/n), is convergent if s>l and otherwise divergent. 5. Prove generally that if 2u, is a series of positive terms, and Sn=Ul +U2+...- +n, then (u1/s,) is convergent or divergent according as 26n is convergent or divergent. [If ~2 is convergent, s. tends to a positive limit I, and so 2 (u,/s,) is convergent. If 2u, is divergent, s - + oo, and UzlSn >log {1 + (unls)} =log (sn + /S.) (Ex. LXXXIV. 1); and it is evident that log (S2/1l)+log (3/2)+... +log (n +l/sn)==log (s +/sl) tends to + o as n —o.] 6. Sum the series 1 - +-... [We have 1 11 11\ 1 +... + =log (2n+l)++2+ (, 2 - ++.. = 2 log (n + )++y + n 2 2n ' 24 by Ex. 1, y denoting Euler's constant, and E,,,' being numbers which tend to zero as n -. Subtracting and making n - oo we see that the sum of the given series is log 2. See also ~ 195.] 194. Series connected with the exponential and logarithm. Expansion of ex by Taylor's Theorem. Since all the derivatives of the exponential function are equal to the function itself, we have 2 n-1 Xn eX= 1+ + + + - 1+ -- + eea 2! (n -1! n1 where 0 < 8 < 1. But xn/n! - 0 as n - o, whatever be the value of x (Ex. xxx. 12); and e0e < ex. Hence, making n tend to oo, we have yx2 Xn eX= 1 + +... + + +............(1). 2! (n!

Page 361

194] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 361 The series on the right-hand side of this equation is known as the exponential series. In particular we have 1 1 e=- +1+ +..+ +... +............(2); 2! n! and so 1 + 1 + +...) 1+... + + + -...++ -...(3) 21 n! 2 1 n! a result known as the exponential theorem. Also, if a > 0, ax ex oga (x log a) + ( lg a) + (4) The reader will observe that the exponential series has the property of reproducing itself when every term is differentiated, and that no other series of powers of x would possess this property (for some further remarks in this connection see Appendix II). The power series for ex is so important that it is worth while to establish it by an alternative method which does not depend upon Taylor's Theorem. Let X2 Xn 2 n E~(x)=l+x+2+...+'.T, and suppose that x>0. Then (1+- = 1+n (n - 1) 2.. n(n-1...1 n which is less than E, (x). And, provided n>x, we have also, by the binomial theorem for a negative integral exponent, (i- (-l+ ) +n (ix) +... >En (x). Thus (1+ ) <E () < (-). But (~ 190) the first and last functions tend to the limit ex as n oo, and therefore E (x) must do the same. From this the equation (1) follows when x is positive; its truth when x is negative follows from the fact that the exponential series, as was shown in Ex. LXXXIII. 7, satisfies the functional equation f(x)f(y)=f/(x+y), so that f(x)f(- )=f(0)=1. Examples XCII. 1. Calculate e, e2, 1/e, 1/e2 to six places of decimals by means of the exponential series. X2 x4 2 3 X5 2. Show that cosh x=l + + +..., sinh x=x+ + +.... 3. If x is positive, the greatest term in the exponential series is the ([x] + l)-th, unless x is an integer, when the preceding term is equal to it. 4. Show that n! >(n/e)n. [For na/n! is one term in the series for en.]

Page 362

362 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX 5. Show that e is not a rational number. [If e=p/q, where p and q are integers, we must have P 1 1 1 p=2l+ +3! + or, multiplying up by q!, Y p l, 1 ] \ 1\ 1!\q _ 12! q)' q+ 1 (q+ 1) (q+2) and this is absurd, since the left-hand side is integral, and the right-hand side less than {1/(q+ l)}+{l/(q + 1)}2+...= 1/g.] oo Xn 6. Sum the series Pr (n) where P,. (n) is a polynomial of degree r 0 in n. [We can express P,. (n) in the form Ao+Aln+A2n (n- 1)+... +A.n (n- )... (n-r + 1), and 00 /yln 00 00 00 ~oo Sign Xnz oo X0 n oo Xn 0)! on 2( 1)! (nr) =(Ao+Alx+A2X+... + +Arx) e.] 7. Show that ~~n3 xn - ~ n4 n= (. n ( 332 x x3) + e, 2.xn = (I+7X2+63x+xA4)eX; and that if S= 13 + 23 +.. + n3 then 1;n =.- (4x+ 14n2 + 8X3+ - 4) ex. In particular the last series is equal to zero when x= - 2. (Math. Trip. 1904.) 8. Prove that 2 (n/n!) e, (n2/n!) = 2e, 2 (13/!) = 5e, and that 2 (nk/n!), where k any positive integer, is a positive integral multiple of e. 9. Prove that 2 (n = {(x2 - 3x 3) e +I2 - 3}/x2. 1 (n + 2) n! [Multiply numerator and denominator by n +1, and proceed as in Ex. 6.] 10. Determine a, b, c so that {(x+a) ex+(b +c)}/x3 tends to a limit as bx+c x - 0, evaluate the limit, and draw the graph of the function eX++ --- 11. Draw the graphs of 1+x, 1 +x+ x2, 1 +X+x2 +i, and compare them with that of ex. Xn Xn 12. Prove that e-X-l+- 2! +.-(-1)~! is positive or negative according as n is odd or even. Deduce the exponential theorem. 13. If Zo=ex Z1=ex - 1, Z2=ex - 1-x, Z3=ex- 1 —(x2/2!),... then dZv/dx=Z^1. Hence prove that if t>0 then t (tt t ft t2 ZI (t)= Zodxr<tet, Z2(t)= Z < xexdx < el xd.e, and generally Z, (t)< tv etv!. Deduce the exponential theorem.

Page 363

195] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 363 195. The logarithmic series. Another very important expansion in powers of x is that of log(l +). Since log(1 +)= i t' and 1/(1 + t)= 1 - t + t2 -... if t is numerically less than unity, it is natural to expect* that log(l + x), when - 1 <x < 1, will be equal to the series obtained by integrating each term of the series between 0 and x, i.e. to the series x - X2 + 3 -.... And this is in fact the case. For 1/(1 + t= 1 - t + t2-... + (-1)m-1 + {(- l)'nt/(l + t)} and so, if > - 1, log (1 X+ x):f:dt+ (- 1 ) m\og(~+x)^ _x-: +.+. +(-1)yR'R, jx in Mn where J ftmt -Rm = 1 +t' We require to show that the limit of Rm, when m tends to oo, is zero. This is almost obvious when x is positive and less than or equal to unity; for then Rm is positive and less than Jorx (xm+1) f tmdt = __1_ and therefore less than 1/(m + 1). If -1 x< 0 we put t=-u and x=-a, so that t / [ du Rm = ( -1) 1 1 -- 'itJ which shows that Rm has the sign of (- l). Also, since the greatest value of 1/(1 - u) in the range of integration is 1/(1 -), we have 1 ______ __ 1 I Rm 1-^o (m+1)(1-:) <(m +l)(l- ) and so lim Rn = 0. Hence log (1 + 7) = x - x2 + 3 -..., provided- 1 <x 1. If x lies outside these limits the series is not convergent. If x= 1 we obtain log2=-1 +-..., a result already proved otherwise (Ex. xcI. 6). * See Appendix II for some further remarks on this subject.

Page 364

364 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX 196. The series for the inverse tangent. It is easy to prove in a precisely similar manner that are tan x= -1 t2- (1 - t + t4-...) dt = x - 'xI + x5 -..., provided - 1 _ x 1. The only difference is that the proof is a little simpler: for since arc tan x is an odd function of x we need only consider positive values of x. And the series is convergent for x =- 1 as well as for x= + 1. We leave the discussion to the reader. The value of arc tan x which is represented by the series is of course that which, when - 1 x 1, lies between - ~r and + r, and which h we saw in Ch. VII (Ex. LXV. 3) to be the value represented by the integral. If x = 1 we obtain the formula = 1 - +.... Examples XCIII. 1. log{l/(l -x)}=x+x2+-Lx3+... if -lx<l. 2. arg tanh x- log{(l+x)/(l-x)}=x+ x3+x5+... if -1 <x<1. 3. Obtain the series for log ( +x) and arc tan x by means of Taylor's theorem. [Since Dnlog(l +x)=DZ)-l {1/(1 +x), the coefficients in the first series are easily calculated. A difficulty presents itself in the discussion of the remainder when x is negative, if Lagrange's form R =(- l)-l x"/{n (1 + x)n} is used; Cauchy's form, viz. Rn,= (- 1)-l (1- O)n-l xn/(l + O)n', should be used (cf. the corresponding discussion for the Binomial Series, Ex. LVIII. 2 and ~ 147). In the case of the second series we have Dn arc tan x=Dx-1 '{/(1 +x2)} =(- 1)-1 (n- 1)! (x2 + 1)-n/2 sin {nare tan (l/x)}, (Ex. XLVII. 14) which reduces to (- 1)- (n - 1)! sin In7r for x=O. In this case there is no difficulty about the remainder, which is obviously not greater in absolute value than 1/n.] 4. If y>0 then log/ = 2 y- + 1 - Y - -1 + I } [Use the identity y=(l+ Y+l\ ) (1 - ). This series may be used to calculate log 2, a purpose for which the series 1- + - —..., owing to the slowness of its convergence, is practically useless. Put y=2 and find log2 to 3 places of decimals.]

Page 365

196, 197] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 365 5. Find log 10 to 3 places of decimals from the formula log 10=3 log 2+ log (1 + ). Deduce the values of log1o e, logo1 2. 6. Show that 1log2=7a+5b+3c, 1log3=lla+8b+5c, ~log5=16a+12b+7c, where a=argtanh (1/31), b=argtanh (1/49), c=argtanh (1/161). [These formulae enable us to find log 2, log 3, and log 5 rapidly and with any degree of accuracy.] 7. Show that 7r = arc tan (1/2) +arc tan (1/3)=4 arc tan (1/5) - arc tan (1/239), and calculate 7r to 6 places of decimals. 8. Show that I — log (1 =+(1+ ) 2+(1+ +I)x3+..., if -1<x<l. [Use Ex. LXXXIII. 2.] 'x dt 9. Show that | = X-X+ l9-... if -1 < 1. J 0 5" t4 9 Deduce that 1- +I - **=4"2 {lr+2log (/2+1)}. (Math. Trip. 1896.) [Proceed as in ~ 195 and use the result of Ex. LI. 7.] 10. Prove similarly that 1 t2dt I - - +- 1. = t4 r - 2 log (./2 + 1)}. 11. Prove generally that if a and b are positive integers then 1 1 1 1 l t-ldt a a+b a+2b " Jo 1+tb and so that the sum of the series can be found. Calculate in this way the sums of 1 -+7-... and1 r- +-.... 12. Verify the inequalities of Ex. LXXXIV. 2, viz. u-~u2<log (1 +u)<u- {u2/(1 +u)}, with the help of the logarithmic series, assuming 0<u< 1. 197. The Binomial Series. We have already investigated the Binomial Theorem (1 )m 14 () += +(2+..., assuming -1 < x < 1 and mn rational. When m is irrational (1 + )m = em log (+x), DX (1 + X)m = {m/(1 + X)} eml~g (1+X) = m (1 + X)m-,1 so that the rule for the differentiation of (1 + x) remains the

Page 366

366 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX same, and the proof of the theorem given in ~ 147 retains its validity. We shall not discuss the question of the convergence of the series when x = 1 or - 1 *. Examples XCIV. 1. Prove that if - 1 <x< then 1 1 x1.3 1 1 1.3x =1 X2 + 2 * 3 =1 + 12 + I x4 +.... /(1l+x2) 2 2.: /(1-x2)- 2 2.4 2. Approximation to quadratic and other surds. Let V/N be a quadratic surd whose numerical value is required. Choose the nearest square to 1V (above or below N), say M2: and let N=M2+d or M2-d, d being positive. Since d cannot be greater than M, d/M2 is comparatively small, and the surd ^IV=M /{1 +~(d/lM2)} can be expressed in a series =rM i + 2 -..4 +2 ' which is at any rate fairly rapidly convergent, and may be very rapidly so. Thus, e.g., /67=v/(64+3)=8{1+2 () -2 ()...} Let us consider the error committed in taking 8y3a (the value given by the first two terms) as an approximate value. After the second term the terms alternate in sign and decrease. Hence the error is one of excess, and is numerically less than 32/(8. 642)< 0003. 3. If x be small compared with 12 then +( N2 + x)= N (x/4N) + {(Nx/2 (2 2 +x)}, the error being of the order x4/r7. Apply the process to V/997, /1031. [Expanding by the binomial theorem x X2 x3 /(N2+x)=N+2N 833 + 16 5- ' the error being less than the numerical value of the next term, viz. 5x4/128N7. Also VNX X X \-1 X X2 XS3 2 (2NV2+x) = 4 - \ ) 4- - 8N3 + 16N5' the error being less than x4/32l7. The result follows. No difference in method is needed for surds other than quadratic.] 4. If p differs from N3 by less than 1 percent. of either then /p differs from N-+(3p/N2) by less than N/90000. (Math. Trip. 1882.) 5. If p = N4+x, and x is small compared with N, a good approximation for /p is _ 51 s p 27vx P56 56 N3 14 (7p+5NV4) Show that when N= 10, x 1 this approximation is accurate to 16 places of decimals. (Math. Trip. 1886.) * See Bromwich, Infinite Series, pp. 150 et seq.

Page 367

197, 198] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 367 6. Show how to sum the series 00 /M\ E 0-(a) (n ) Xn where 0,. (n) is a polynomial of degree r in n. [Express r, (n) in the form Ao+Aln+A2n(n-1)+... as in Ex. xcII. 6.] 7. Sum the series 2 n ( n x n2 x n and prove that o \i o0 00 /on\ 2 n3 ( ') {n{mM33 + m(3m- l)+x2 + n (1 +X.)-3. o \f/ 198. An alternative method of development of the theory of the exponential and logarithmic functions. We shall now give an outline of a method of investigation of the properties of ex and logx entirely different in logical order from that followed in the preceding pages. This method x2 starts from the exponential series 1 +x + +.., which we know to be convergent for all values of x. We may therefore define the function exp x by the equation x2 expx=l +x++. (.............................(1). We then prove as in Ex. LXXXIII. 7, that expx. expy= exp (x+y)...........................(2). h A2 Again (exp-l )/h=1+-.+ -.+...=l+ph, 2! where Ph is numerically less than I l I +I 12 + 1A 13 +.. 1h 1/(1 -Il l ), so that ph->O with h. And so {exp (x + h) - exp x}lh = exp x {(exp A - l)/h} -exp x as hA —, or Dxexpx=expx.................................(3). Incidentally we have, of course, proved that exp x is a continuous function. We have now a choice of procedure. Writing y=exp x and observing that exp 0- 1, we have Sdy [fY dt dx=.y, X=j: and if we define the logarithmic function as the function inverse to the exponential function we are brought back to the point of view adopted in this chapter. But we may proceed differently. From (2) it follows that if n is a positive integer then (ex)exp x n) = 1exp n (exp 1 exp n. If x is a positive rational fraction m/n {exp (m/n)}n = exp m =(exp 1)%",

Page 368

368 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [IX and so exp (mr/n) is equal to the positive value of (exp l)ml/. By means of the equation exp x. exp (- x)=l this result is extended to negative rational values of x, and so we have exp x =(exp 1)x = e, say, where e= exp 1 = 1 + 1+! + +... for all rational values of x. Finally we define eX, when x is irrational, as being equal to exp x. The logarithm is then defined as the function inverse to exp x or ex. Example. Develop the theory of the Binomial Series f(m,)=Il+(mx +( + X where -l<x<1, in a similar manner, starting from the equation f(m, x)f( ) f ( m', x)=f x) (Ex. LXXXIII. 6). MISCELLANEOUS EXAMPLES ON CHAPTER IX*. 1. Given that logioe= 4343 and that 210, 321 are nearly equal to powers of 10, calculate logo0 2, log o 3 to four places of decimals. (Math. Trip. 1905.) 2. Determine which of ( (e/3, (^/2)~ is the greater. [Take logarithms and observe that ^/3/(^,3 + 7r) < 2 %/3 < 6929 < log 2.] 3. Show that logio0n cannot be a rational number, if n is any positive integer not a power of 10. [If n is not divisible by 10, and log10 n=p/q, we have 1OP=nq, which is impossible, since 0l ends with 0 and nq does not. If n= 10"N, where N is not divisible by 10, log0 N and therefore logo10 n=a+log1o N cannot be rational.] 4. For what values of x are the functions log x, log log x, log log log x,.. (a) equal to 0, (b) equal to 1, (c) not defined? Consider also the same question for the functions Ix, llx, llix,..., where Ix=log Ix. 5. Show that logx-() log (x+l )+ () log (x+2)-... +(-)log(x+n) is negative and increases steadily towards 0 as x increases from 0 towards + oo. [The derivative of the function is:~ (-M 1 = ' o r x+r x(x + 1)... (x+n)' as is easily seen by splitting up the right-hand side into partial fractions. This expression is positive, and the function itself tends to zero as x- + o, since log (x+r)= log x+, where ex 0, and 1 - ) + 2)...=0.] * A considerable number of these examples are taken from Bromwich's Infinite Series.

Page 369

198] MISCELLANEOUS EXAMPLES ON CHAPTER IX 369 6. If x> -1 then x2 > (1 +X) {log (1 +x)}2. (Math. Trip. 1906.) [Put 1 +x=et, and use the fact that sinh E>6 if >>0.] 7. Show that {log(1+x)}/x and x/{(l +x) log(l +x)} both decrease steadily as x increases from 0 towards + oo. 1 1 1 8. Show that l - --- as x- 0. [Use Exs. LXXXIV., or the log (1+ X) x 2 logarithmic series.] 1 1 9. Show that - decreases steadily from 1 to 0 as x increases log (1 + x) x from - 1 towards + oo. [The function is undefined for x=0, but if we attribute to it the value -1 for x=0 it becomes continuous there. Use Ex. 6 to show that the derivative is negative.] 10. Show that ex>MfxN, where M and N are large positive numbers, if x is greater than the greater of 2 log M and 16 N2. [Since log x<2^x, the inequality given is certainly satisfied if x> log M- + 2Vx, and therefore certainly satisfied if -x >log Al, ~x>2NV/x.] 11. If f(x) and q (x) increase steadily as x — + o, and f' (x)/0q' (x) — + o, then f (x)/q (x) — + o. [Use the result of Ch. VI, Misc. Ex. 29.] By taking f (x) =x, ( (x) = log x, prove that (log x)/-a-O for all positive values of a. 12. If p and q are positive integers then 1 1 II og() pn +1 pn++2 '+q g as tn-aooo. [Cf. Exs. LXVI. 5, 6.] 13. Prove that if x is positive then n log { ~ (1 +I /)} -- log x as n - oo. [We have n log og {1 (1 + - x)} = n (1 log (1- u) where u= (1-xl/8'). Now use ~ 191 and Ex. LXXXIV. 4.] 14. Prove that, if a and b are positive, then {2 (abl/n+bl/n)}n- - /(ab). [Take logarithms and use Ex. 13.] 15. Show that 11 1 1+ +...+ 2 1=log n + log 2+~ 7+ where y is Euler's constant (Ex. xci. 1) and e, -0 as tn-a o. 16. Show that 1+-+I+l+L-I++...=3log2, the series being formed from the series 1- + -... by continually taking two positive terms and then one negative. [The sum of the first 3n terms is 11 1, 1 1\ +3 5+ 42 - 1 2 +4n 1 + 2 = log (2n) + log 2 + ty n+ e- (log n + + t'), where En and En' tend to 0 as ln co.] H. A. 24

Page 370

370 MISCELLANEOUS EXAMPLES ON CHAPTER IX 17. Show that 1 - 1-+3 —+- 1- -...=1 log 2. 18. Prove that I -33 +` - S3n-1 1 v (36v2 - )1 1 1 1 1 where S, = 1 +... —, + 2n=1 -...+-1. Hence prove that the sum 2 n 3 2n- -1' of the series when continued to infinity is - 3+ 3log 3 + 2 log 2. (Math. Trip. 1905.) 19. Show that ct 1 2 21 ~ 1 2o=2 log 2 -1 -(log3-1). 1 nt (4n2 -1) i n (9n2- 1) 2 20. Prove that the sums of the four series o )+1 oo 1 (- )n+1 E 4n2-1' 1 4n2 1 ' 1 (2n+1)2-1' 1 (2n+l)2- 1 are i, 7r-,, log 2-1 respectively. 21. Prove that n! (a/n)"-. 0 or + ca according as a < e or a > e. [For -',+l/u=a ({l+ (/n)}) — a/e. It can be shown that the function tends to + oo when a= e: for a proof, which is rather beyond the scope of the theorems 'of this chapter, see Bromwich's Infinite Series, pp. 461 et seq.] 22. Find the limit as x —+ oo of (ao+alx+...+ar2 Ao+Al X Vbo +b1 +...+b,. distinguishing the different cases which may arise. (Math. Trip. 1886.) 23. Determine a and a so that the series 2 (ell a-) is convergent. 24. If a, 2,... are different numbers, no equation of the type Ae + Be~x +.. =o, where A, B,... are polynomials, can hold for all values of x. [If a is the algebraically greatest of a, 2,..., the term Aeax outweighs all the rest as x -+ o.] 25. Show that the sequence al=e, a2= ee2 a =ee3 tends to infinity more rapidly than any member of the exponential scale. [Let e (x)=ex, e (x)=ee)(x), and so on. Then, if ek(x) is any member of the exponential scale, an > ek (n) if n > k.] 26. Prove that d d d A {0 (x)}W )x = dx {p ( )}ar +d{3 {()} where a is to be put equal to (x) and /3 to q)(x) after differentiation. Establish a similar rule for the differentiation of 6 (x) [{' (x)}x(x)J.

Page 371

MISCELLANEOUS EXAMPLES ON CHAPTER IX 371 27. Prove that if Dx e- X =e x2,(x) then (i),n(.x) is a polynomial of degree n, (ii) n +1 = - 2xqn +,', and (iii) all the roots of On =0 are real and distinct, and separated by those of,,,_i=0. [To prove (iii) assume the truth of the result for n=1, 2,... K, and consider the signs of fK+ for the n values of x for which = 0 and for large (positive or negative) values of x.] 28. Show that the most general function ( (x), such that q" - a2O = 0 for all values of x, is Aeax+Be-a", where A and B are constants (cf. Ch. VI, Misc. Ex. 43). [Let = -e - ax; then 4"-2a+'=0 or +"/+'= 2a. If +' >0 it follows that log +'-==2ax + C, where C is a constant, or +'=De2a, where D is a positive constant. If,' < 0 we obtain ( - +")/(- +') = 2a, or log ( - ')= 2ax+ C, or '= - De2ax. Hence we may write generally,'=Ae2x, where A is any constant, positive or negative. Thus + = Ae2ax B, q = Aeax + Be- ax An equivalent form of solution is G cosh ax+ Hsinh ax, where G and H are constants. On the other hand the form L cosh (ax-a) is not sufficiently general, for it is equivalent to Aea +Be-ax, where A= Le -, B =Lea, so that A and B have the same sign, instead of being perfectly arbitrary. The reader should also attempt to solve the equation "" - a2O =O as in Ch. VI, Misc. Ex. 43, viz. by multiplication by 25'.] 29. If " - a2q =0 for all values of x, and / (0)=X, (' (0)=, then q = X cosh ax + (p/a) sinh ax. 30. The general solution of f(xy) =f(x)f(y), where f is a differentiable function, is xa, where a is a constant: and that of f (x+y) +f (x - y)= 2f (x)f (y) is cosh ax or cos ax, according as f" (0) is positive or negative. [In proving the second result assume that f has derivatives of the first three orders. Then 2f(x) +y2 {f" (x) + Y} = 2f (x) [f(O) +yf' 0)+/(0 ) +Y' () }], where ey and ey' tend to zero with y. It follows that f(0)=l, f'(0)=0, f"(x) =f" (O)f(x), so that a= /{f" (O)} or /{-f" (O)}.] 31. How do the functions sin(I/x), xsin2(1/x), cosec(l/x) behave as x —.+0? 32. Trace the curves y = tan x etanx y-sin x log tan x. 33. Sketch the forms of the graphs of the functions exe-`, e-xex, xe-elx, (l/x)e-(e-x/x). 34. The equation ex=ax+b has one real root if a<O or a=0, b>0. If a>0 it has two real roots or none according as a log aX b -a. 35. Show by graphical considerations that the equation eX=ax2 +2bx-+c has one, two, or three real roots if a>0, none, one, or two if a<0; and show how to distinguish between the different cases. 24-2

Page 372

372 MISCELLANEOUS EXAMPLES ON CHAPTER IX 36. Trace the curve = log ), showing that the point (0, ~) is a centre of symmetry, and that as x increases from - co to + c, y steadily increases from 0 to 1. Deduce that the equation 1 ( ex? a log(eX )=a has no real root unless O<a<l, and then one, whose sign is the same as that of a -. [In the first place - =1 0 {log log (e I /s=inh x l is clearly an odd function of x. Also d - {2 x coth x- 1-log ti - ]) ' dx x2 [2 2 \ x The function inside the large bracket tends to zero as x- 0; and its derivative is I1 ( 2} x[L ~sinh- which has the sign of x. Hence dy >0 for all values of x.] dx 37. Trace the curve y=el/x^/(x2 +2x), and show that the equation el/,(x2 + 2x) = a has no real roots if a is negative, one negative root if O<a<a = e1/2^//(2+ 2V/2), and two positive roots and one negative if a >a. 38. Show that the equation f(x)=1+x+2!+... +.=0 has one real root if n is odd and none if n is even. [Assume this proved for n=l1, 2,... 2k. Thenf2k+1 (x)=0 has at least one real root, since its degree is odd, and it cannot have more since, if it had, f' 2k+l () =f2k(x) would have to vanish once at least. Hence f2k+1 (x)=O0 has just one root, and so fk + 2(x) =0 cannot have more than two. If it has two, say a and f3, f'2k+2(X)=f2k+l(x) must vanish once at least between a and i3, say at y. And y2k + 2 f~k (+ 22())!f^ +1()" (-2) >0 But f+2k 2(X) is also positive when x is large (positively or negatively), and a glance at a figure will show that these results are contradictory. Hence f2k+2(x)=0 has no roots.] 39. If a and b are positive and nearly equal, prove that log = a1 -b) +, approximately, the error being about ( {(a-b)/a)3. [Use the logarithmic series. This formula is interesting historically as having been employed by Napier for the numerical calculation of logarithms.]

Page 373

MISCELLANEOUS EXAMPLES ON CHAPTER IX 373 40. Show by means of the exponential series that xk ex —0 as x -- + oo, for iall values of k, and that (log x)/xa —O for all positive values of a. [Use the fact that ex > x'/n! for all positive values of x and all positive integral values of n: then put x=logy.] 41. Prove by multiplication of series that if - 1< x <1 then {log (1 +X)2= X2- (1+ )X3+ (1+ 1 + )x4-..., -l log (l +X) log (-X) = I 2+ (1- +I ) X+ I (1- + - +..., (are tan )2= 2 l -,)+I ( +1 - ( 1... arc tan x log, l-x)= 2X2+ L (1 - + )6+ (1- + I-++)xlO+... arc tan x log(l4-2)=3 (l+- ) 3- 1 + 1- 1).... 42. Prove that log (1 + x) = x (1 - x) + x2 (1 - x2) + ~x3 (1 - 3) +..., if -1<x<l. 43. Prove that if - rr<0< 7r then tan- tan4 + -tanO-... = sin20 + sin4 + sin6 +.... 44. Prove that (1 + ax)l/x = ea {1- a2 X + —1- (8 +3a) a3X2 (1 + Ec)}, where Ec -- 0 with x. 45. The first n + 2 terms in the expansion of log (I + + -2+... + in powers of x are xn+ 1 1 X X2 Xt \ ~ n? +1 1!(n +2) +2! (n+3) + n! (2?+1) (Math. Trip. 1899.) x2 Xn [Let 1 +x+ +... + -=s= (x), e=s, (x) +,,(x), andf(x) =log s (x). Then d d x e-.f (x)= log {el - r ()} = 1 + log {1 -e-Xr, (x)} = 1 -— e " v / dx & (- dx /-> ^ &v v /-' n! 1-e-"ry(x) x7 I x2 ' X2 =l-n! 2! -+...t+(-1) n! +p(x), where p (x) is of the (n+ l)th order of smallness when x is small. This equation gives the first n + 2 terms of the Taylor's Series for f' (x), and we can at once deduce those of the expansion off (x). For we must have f' (0) = 1, f" (0) = f' (0)...=(n) (0) = 0, /(n+1)(0)= —, /(n+2)(0)=n+1, f(n+3)(0)= —~(n+1)(n+2),... 2n! f(2n+ 1) (0)=(-1)n+1-2; (n!)2 and clearly f(0)=0. The same argument enables us to prove generally that if the Taylor's Series for (x) is ao + alx+a2x2+... then the Taylor's Series for f'(x) is a + 2a2+ 3a3 2+... the series obtained by differentiating separately each of the terms of the former series; and conversely.]

Page 374

374 MISCELLANEOUS EXAMPLES ON CHAPTER IX 46. Show that, if -1 < x < 1, then 1 1.4 22x 1.4.7 32(x+3) 3 3.6 3.6.9 9+.. (- X)7/3' 1 1.4 1 4.7 X (2 + 18x + 9) - + x 2x2+-f-w33x3 +. (..+18+9) 3 3.6 3.6.9 27(1- )103 [Use the method of Ex. xciv. 6: or start from the equation 1 1.4 1.4.7x+ = 1 + 3. 6 3.6+ 3. 9 3 - X/3 and differentiate each side as explained above; then multiply the resulting equation by x and differentiate again; and so on.] 47. Prove that - ) - log ( (+a (a+ )xb) a-b b f00 __dx 1 b (a\\ o(x+a)(x+b)2 -(a - b)2 b a-b-blog f00 xd l a Jo ( +a) (x +b)2 (a-b)2 a log (a- b) o (x +a) (2 + b2) (a2 + b2) b { -b log xdx 1 { (x+a) (2 +b2) a2+b2 {2 ralog provided a and b are positive. Deduce, and verify independently, that each of the functions a- -loga, aloga-a+1, lrra-loga, -7r+aloga is positive for all positive values of a. 48. Prove that if a, /, y are all positive and B2>ay, then [ d _ 1 - lo 3+~/(I32-ay). o ax2+2xx+-y /(32-ay) log a J while if a is positive and ay > 32 the value of the integral is ( ) arc tan f/(y - 12)1.,/(a,/ - 32a- la) I that value of the inverse tangent being chosen which lies between 0 and 7r. Are there any other really different cases in which the integral is convergent? 49. Prove that, if a > - 1, then r X dx r dt du J (x+ac)/(x2-1)- 0 cosht+a J t2+2au+ and deduce that the value of the integral is (1 _ arc tan /(1 if -1<a<l, and 1,/( a+1)+/(a - 1) 2 I/a -1 1(a2 -1) /(ca+l)- /(a- 1) arg tanh / a-+1 if a> 1. Discuss the case in which a= 1.

Page 375

MISCELLANEOUS EXAMPLES ON CHAPTER IX 375 50. Transform the integral | d7 where a > 0, in the same 0 (x + a),/(2 + 1)' ways, showing that its value is,+1. 1 a+ I +,,/(a2_ 1) 2 +1,./(a2 + 1) log a+ arg tanh /a)} d/(a2 + 1) a+ I -,d(a +I=d( +l) T f +l 51. Prove that arc tan x dx - - log 2. fo 52. IfO<a<l, </<l, then [1 _dx 1 log 1+ +/(a/3) Ji/{(1 - 2ax+ a)(1- 2x +32)} - /(a /) 1 -J(a3)' 53. Prove that, if a> b > 0, then. i dO _7r _, a cosh 8 + b sinh 8 (a2 - b2)' 54. Prove that f logxfl og fI Olog x — dx=- I- ~" d. 0=o; ol-+X2 =-J I1+g j 2 o 1+X 2d=O; and deduce that, if a > 0, then f~logx ao dx = l- log a. o a2 +X2 = 2a l [Use the substitutions x= 1t and x=au.] 55. Prove that | log (l +) dx= ra if a> 0. [Integrate by parts.] 56. Prove that the successive areas bounded by the curve = e a sin bx, where a and b are positive, and the positive half of the axis of x, form a geometrical progression, and that their sum is b 1+e- a7/b a2 + b2 1 -e - ar/b

Page 376

CHAPTER X. THE GENERAL THEORY OF THE LOGARITHMIC, EXPONENTIAL, AND CIRCULAR FUNCTIONS. 199. Functions of a complex variable. In Oh. III we defined the complex variable z = x + yi, and we considered a few simple properties of some classes of expressions involving z, such as the polynomial P (z). It is natural to describe such expressions as functions of z, and in fact we did describe the quotient P (z)/Q (z), where P (z) and Q (z) are polynomials; as a 'rational function.' We have however given no general definition of what is meant by a function of z. It might seem natural to define a function of z in the same way as that in which we defined a function of the real variable x; i.e. to say that Z is a function of z if any relation subsists between z and Z in virtue of which a value or values of Z corresponds to some or all values of z. But it will be found, on closer examination, that this definition is not one from which any profit can be derived. For if z is given, so are x and y, and conversely: to assign a value of z is precisely the same thing as to assign a pair of values of x and y. Thus the 'function of z,' according to the definition suggested, is precisely the same thing as a complex function f(x, y) + ig (x, y), of the two real variables x and y. For example x - yi, xy, I z = V(x2 + y2), amz = arc tan (y/x) are 'functions of z.' The definition, although perfectly legitimate, is futile because it does not really define a new idea at all. It is

Page 377

199-201] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 377 therefore more convenient to use the expression 'function of the complex variable z' in a more restricted sense: in other words to pick out, from the general class of complex functions of the two real variables x and y, a special class to which the expression shall be restricted. But if we were to attempt to explain how this selection is made, and what are the characteristic properties of the special class of functions selected, we should be led far beyond the limits of this book. We shall therefore not attempt to give any general definitions, but shall confine ourselves entirely to special functions defined directly. 200. We have already defined polynomials in z (~ 34), rational functions of z (~ 37), and roots of z (~ 40). There is no difficulty in extending to the complex variable the definitions of algebraical functions, explicit and implicit, which we gave (~ 16, 17) in the case of the real variable x. In all these cases we shall call the complex number z, the argument (~ 35) of the point z, the argument of the function f(z) under consideration. The question which will occupy us in this chapter is that of defining, and determining the principal properties of, the logarithmic, exponential, and trigonometrical or circular finctions of z. These functions are of course so far defined for real values only, the logarithm indeed for positive values only. We shall begin with the logarithmic function. It is natural to attempt to define it by means of some extension of the definition log x t ( > 0); and in order to do this we shall find it necessary to consider briefly some extensions of the notion of an integral. 201. Integrals along a curve. Let AB be an arc C of a curve defined by the equations x=(t), y = (t); and suppose that as t varies from to to t1 the point (x, y) moves along the curve, in the same direction, from A to B. Then we define the curvilinear integral f{g (, y)dx 4- (, y) dy}............... (1), a

Page 378

378 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X as being equivalent to the ordinary integral obtained by effecting the formal substitutions x = j (t), y = 4 (t), i.e. to t{s (s, Of) + h (, ) *'} At. In giving this definition we tacitly assume that all the functions involved satisfy the restrictions of ~~ 137 et seq.-viz. that they are continuous and have only a finite number of maxima and minima. We call C the path of integration. The functions b, + are of course. essentially real. But we may suppose g and h complex, if we please, the integral of a complex function of a real variable t being naturally defined by the equation to to to N: Gt) iII (t~l dtf G U d + i H(t)dt. We now define f /(z) dz................ (2), where z = x + iy, f (z) = u + iv, as being equivalent to (u + iv) (dx + idy), c which is itself defined as being equivalent to f(dx - vd y) + if (v dx udy), each of these latter integrals being defined as above. 202. It will not be necessary for us to consider in any detail the properties of curvilinear integrals in general or of those of the type (2) in particular. If we were to attempt to do so we should be led beyond the range of this volume into what is known as the Theory of Functions of a Complex Variable. But there are one or two simple properties of such integrals which we shall have occasion to use. In the first place, if F(t)= G (t)-+ iH (t) is a complex function of the real variable t, and to < tj, we have t F(t)dt t_ jF(t)\dt. For the left-hand side is the limit (~ 139) of a sum of the type 2 {G (t^) + iHt(t~)} (t+ l-t) and the right-hand side is the limit of 2 J[{G (t^)}2 + {H(tv)}2] (t + 1-t), and the second sum is at least as great as the modulus of the first (~ 36, (6)).

Page 379

201, 202] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 379 We can now prove a relation of inequality satisfied by the modulus of the integral (2). For I f(z) dz =t (U+)(++it)dl i t ^ /{(n2+V2) ('2+2}2j dt C Mt2 '/(0'2+ '2) dt, J r2 where NMis the maximum of If =l/(t2+ v2) for all points on the path of integration. But the length of the path of integration is given by the integral f v/{l + ( 2}x -/('2-+ +12) dt taken between the appropriate limits, and so the modulus of the integral (2) is certainly not greater than Ml, where 1 is the length of the path. We shall also require the value of the integral i z dz, where m is a positive integer, taken along any path in the plane of z. In order to determine this we observe first that if + and + are real functions of the real variable t, then d ( +i ) = m (p + i )-l ( i )...................... (1); or in other words that the formulae d fC )ff f ()c t-)}?l Id {(t)}m=m {f (t)}m-l f' (t), f (t)} f ' (t) dt= rn =t) still hold when f(t) is a complex function of t. For suppose that (1) holds for m= 1, 2,..., k. We know already (~ 95) that the rule for the differentiation of a product (~ 94) still holds when the functions involved are complex. Thus Dt ( +3 i+)~ +l = Dt {(() + i+) (0 + i,)-} = (m + l) (q) + i+), (' + i' ); and so (1), which certainly holds for m=O and m==l, is established by induction. Hence IC j t0o m1 4 ) + 1 z m +1 - z + where z d (+i4,)rn( q 'y~AJ) Ldt1H 1 m l m where zo and zl are the arguments of the ends of the path C. The formula is the same as when the path is real, and it can be shown that all the ordinary formulae of integration are capable of similar extension: but this need not concern us at present. Finally we shall have occasion to use the equation f (z) dz=/ f (Z+a) dZ, where a is any complex number and C1 is the path in the plane of Z obtained by copying the path C in the plane of z and then giving it the displacement represented by the complex number - a. The truth of this equation follows immediately from the definitions. In particular it dz C+ dZ Ji I o1+Z2

Page 380

380 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X the paths of integration being the straight lines from 1 to 1+ and from 0 to g respectively. 203. The definition of Log t. Now let ==+in^ be any complex number. We define Log;, the general logarithm of, by the equation Log = z the path of integration being any curve x = p (t), y = (t) in the plane (x, y), subject only to the restrictions (i) that it does not pass through the origin, (ii) that q and * satisfy the conditions of ~~ 137 et seq. The first condition is of course required because of the discontinuity of 1/z for z = 0. Thus (Fig. 69) the paths (a), (b), (c) Y, 0, 1 X (c)( (b) FIG. 69. are paths such as are contemplated in the definition. The value of Log z is thus defined when the particular path of integration has been chosen. But at present it is not clear how far the value of Logz resulting from the definition depends upon what path is chosen. Suppose for example that 5 is real and positive, say equal to:. Then one possible path of integration is the straight line from 1 to A, a path which we may suppose to be defined by the equations x= t, y= 0. In this case and with this particular choice of the path of integration r dt Log = t i.e. Log = log A, the ordinary arithmetical logarithm of } defined in the last chapter. Thus one value at any rate of Log, when:

Page 381

202-204] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 381 is real and positive, is log I. But in this case, as in the general case, there is an infinite variety of possible choice of the path of integration. There is nothing to show that every value of Log: is equal to log ~; and in point of fact we shall see that this is not the case. This is why we have adopted the notation Log I, Log C instead of log g, log ~. Log ~ is (possibly at any rate) a many valued function, and log | is only one of its values. To return to the general case: so far as we can see at present, three alternatives are equally possible:(1) we may always get the same value of Log g, by whatever path we go from 1 to ~; (2) we may get a different value corresponding to every different path; (3) we may get a number of different values each of which corresponds to a whole class of paths:and the truth or falsehood of any one of these alternatives is in no way implied by our definition. 204. The values of Log r. Let us suppose that the polar coordinates of the point z = are (p, ), so that = p (cos q + i sin f). We suppose for the present that - r < b < 7r, while p may have any positive value. Thus ' may have any value other than real negative values. Further, we suppose the coordinates (x, y) of any point on the path expressed as in ~ 23 in terms of t. Then r and 0, the polar coordinates of (x, y), may also be expressed in terms of t, by such equations as r =f(t), 0= F (t). Now / dz t1 1 (dx.dy\ Log== -= d +z t, L J1 Jt P0+iy \dt dt dt by the definition. But if x = r cos 0, y = r sin 0, we have dx.dy ( dr dO\ dr d\ dt +- = cos 0 d- r sin 0- +i sin 0o-+rcos0 - dt dt \ dt dt) / dt dt = (cos 0 + i sin ) dt r + dt) t l dr Lf t d0t and so Log ' = - r, dt + i d t Jrdt 'o 0dt

Page 382

382 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X Since 1 d-7tr dt = log r, (r being positive), we obtain Log '= [log r] + i [0], where [log r] and [0] denote the differences between the final and initial values of log r and 0 respectively. It is clear that [log r] = log p - log 1 = log p; but the value of [0] requires a little more consideration. Let us suppose first that the path of integration is the straight line from 1 to g. The initial value of 0 is the amplitude of 1, or rather one of the amplitudes of 1 (viz. 0 and 2k7r, where k is any integer). Let us suppose that initially O=22k7r. It is evident from the figure that as t moves along the line 0 increases from 2k7r to 2k7r +. Thus [0] = (2k7r + ) - 2kr =, and, when the path of integration is a straight line, Log = log p + ib. FIG. 70. We shall denote this particular value of Log ' the principal value. When ~ is real and positive, 5= p and, = 0, so that the principal value of Log ' is the ordinary logarithm log f. Hence it will be convenient in general to denote the principal value of Log ~ by log S. Thus log = log p + i. Next let us consider any path (such as those shown in Fig. 71) such that the area or areas included Y between the path and the straight line from 1 to ' does not include -' the origin. --- It is easy to see that [0] is / still equal to c. Along the thick / Q x curve shown in the figure, for example, 0, initially equal to 2k7r, first decreases to the value P 2krr - X6P FIG. 71.

Page 383

204] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 383 and then increases again, being equal to 2k7r at Q, and finally to 2k7r + f. The dotted curve shows a similar but slightly more complicated case in which the straight line and the curve bound two areas, neither of which includes the origin. Thus if the path of integration is such that the closed curve formed by it and the line from 1 to ' does not include the origin, then Log = log = logp + ib. On the other hand it is easy to construct paths of integration. - such that [0] is not equal to /. Consider, for example, the thick curve shown in Fig. 72. If 0 is initially equal to 2k7r, it will evidently have increased by 27r when we get to P and by 4vr when we get to Q: its final value will be 2k7r + 47r + 4, and so [0] = 47r + Q and FIG. 72. Log g= log p + i (47r + O). In this case the path of integration winds twice round the origin in the positive sense. If we had taken a path winding m times round the origin we should have found, in a precisely similar way, that [0] = 2m7r + 0 and Log '= log p + i (2mr + ). Here m is positive. By making the path wind round the origin in the opposite direction (as shown in the dotted path in Fig. 72), we obtain a similar series of values in which m is negative. Since I |]=p, and the different angles 2mr + cQ are the different values of am i, we conclude that every value of log I I +i am ' is a value of Log ': and it is clear from the preceding discussion that every value of Log ' must be of this form. We may summarise our conclusions as follows: the general value of Log T is log I 1 + i am = log p + i (2m7r + ), where m is any positive or negative integer. The value of m is determined by the path of integration chosen. If this path is a

Page 384

384 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X straight line m = 0, and Log = log p + ib. This particular value of Log ', which we denote by log S, is called the principal value; it is characterised by the fact that its imaginary part lies between - r and + r. In what precedes we have used ' to denote the argument of the function Log g, and (I, r) or (p, b) for the coordinates of ~, and z, (x, y), (r, 0) for an arbitrary point on the path of integration and its coordinates. There is however no reason now why we should not revert to the natural notation in which z is used as the argument of the function Log z, and this we shall do in the following examples. Examples XCV. 1. We supposed above that -7r<0<7r, and so excluded the case in which z is real and negative. In this case the straight line from 1 to z passes through 0, and is therefore not admissible as a path of integration. Both wr and -rr are values of am z, and 0 is equal to one or other of them: also r- z. The values of Log z are still the values of log z | + i am z, viz. log (-z)+(2m+1) Wi where n is an integer. The values log (-z) ~ ri correspond to paths from 1 to z lying respectively entirely above and below the real axis. Either of them may be taken as the principal value of Log z, as convenience dictates. We shall choose the value log ( - ) + 7ri corresponding to the first path. 2. Any value of Log z is a continuous function of both x and y, except for x=0, y=0. 3. The functional equation satisfied by Log z. The function Log z satisfies the equation Log (Zl12)-Log zl + Log z2.....................(1), that is to say every value of either side of this equation is one of the values of the other side. This follows at once by putting zl = rl (cos 01 + i siln 0), 2 = r2 (Cos 02 + i sin 02), and applying the formula of ~ 204. It is not however true that log (Z Z2) =log +log 2...........................(2) in all circumstances. If, e.g., Z1 = Z2 = (-1 i- /3) -cos 2 7r+ isin 3 r, then log zl = log z2 = r-i, and log z1 + log 2 = 4 7ri, which is one of the values of Log (zlZ2), but not the principal value. In fact log (ZlZ2)= - ri. An equation such as (1), in which every value of either side is a value of the other, we shall call a complete equation, or an equation which is completely true. 4. The equation Log zm = m Log z, where m is an integer, is not completely true: every value of the right-hand side is a value of the left-hand side, but the converse is not true.

Page 385

204, 205] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 385 5. The equation Log(l/z)= -Logz is completely true. It is also true that log (I/z)= -log z, except when z is real and negative. 6. Show that the equation (2) of Ex. 3 is true unless the line joining the points z=zl, z=z2 cuts the negative half of the real axis. 7. The equation log ( =log (z - a)- log (z - b) is true if z lies outside the region bounded by the line joining the points z=a, z=b, and lines through these points parallel to OX and extending to infinity in the negative direction. 8. The equation g z l o l o o is true if z lies outside the triangle formed by the three points 0, a, b. 9. The function of the real variable x defined by r/ (x) =_p + (q -p) I (log x), (where I (u), as usual, denotes the imaginary part of u) is equal to p when x is positive and to q when x is negative. 10. The function of x defined by rf (x) =p7r + (q -p) I {log (x - 1) + (r- q) I (log x) is equal to p (l<x), to q (O<x<l), and to r (x<0). 11. Draw the graph of the function I(Log x) of the real variable x. [The graph consists of the positive halves of the lines y= 2kr and the negative halves of the lines y=(2k+ 1) 7r.] 12. For what values of z is (i) log (ii) any value of Logz (a) real or (b) purely imaginary? 13. If z=x +iy then Log Logz= logR+(+- 2k' r) i, where R2 = (log r)2 + (O + 2kr)2 and e is the least positive angle determined by the equations cosO: sin e: 1:: logr: + 2krr: /{(log r)2 + ( + 2kr)2}. Plot roughly the doubly infinite set of values of Log Log (1 +i-/3), indicating which of them are values of log Log (l+i4/3) and which of Log log (1 +^i/3). 205. Note on the logarithm of a real negative number. According to the definitions adopted in this chapter a negative number x has no real logarithm, but an infinite number of complex ones, whose values may be expressed in any one of the equivalent forms log(-x)+(2+l1)7ri, log ] x+ (2k+ 1) ri, 1 logx2 +(2k+1) wri, where k is an integer. The question remains as to how this is to be reconciled with the definition which we gave in Ch. VI. We there defined logx, or rather log x+ C, where C is an arbitrary constant, as the function whose derivative is 1/x. H. A. 25

Page 386

386 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X X x dt It follows from ~~ 137 et seq. that when x is positive or -, Ji t a t where a is any positive number, is such a function. This function is, in virtue of the definition of log x given in Ch. IX, equal to log x - log a and so included in the form log x+ C. If x is negative the integral ceases to have x dt any meaning, but the integral f -, where b is any negative number, has a J b t meaning, and its derivative is 1/x. Now, making the substitution t=-u, we obtain [x dt -x du i x- d ie= =log (- )-log - b), which is included in the form log (- ) + C. We therefore conclude that the most general real function whose derivative is 1/x is logx+C (x>0), log(-x)+C (x<O); and we may write generally J = log (~x)+ C= log IxI+ C=I log x + C, the last two forms applying equally to positive or negative values of x, while in the first that sign is to be chosen which makes + x positive. In other words log x, as defined in Ch. VI, is really log Ix or ~ log x2 as defined in Ch. IX. The results of this chapter show that, when x is negative, log x differs from log I x [ or I log x2 by an imaginary constant, and so the two definitions are equivalent so far as the determination of integral functions is concerned. In order, however, to preserve an absolute consistency between the notation of Oh. VI and that of these later chapters, we should modify some of the formulae of Ch. VI by the introduction of an ambiguous sign, or of the sign of the absolute value, before the arguments of the logarithms involved. Thus we should write dx -l( f dx 1 =log ( x), ax+b=alog {+~(ax+b)}, fX2- a2 = 2a log { +( —a)} -sec dx= log {(sec x+tan x)}, |cosec xdx=log (+tan x), flog (~ ) d=x log(~ ) - x, the ambiguous sign being in each case chosen so as to make the contents of the bracket positive: or, instead of log (x), log {+ (ax +b)}, etc. we may write log x, log ax + b, etc. or og2, log( ax + )2, etc. In ~ 118 we did actually adopt this course (see equations (2) et seq. on pp. 228-9). 206. The exponential function. In Ch. IX we defined a function ey of the real variable y as the inverse of the function y = logx. It is naturally suggested that we should define a function which is the inverse of the function Log z.

Page 387

205-207] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 387 DEFINITION. If any value of Log z is equal to C, we shall call z the exponential of C and write z = exp ~. Thus z = exp ~ if f = Log z. It is certain that to any given value of z correspond infinitely many different values of ^. It would not be unnatural to suppose that, conversely, to any given value of I correspond infinitely many values of z, or in other words that exp 5 is an infinitely many-valued function of C. This is however not the case, as is proved by the following theorem. THEOREM. The exponential function exp ' is a one-valued function of C. For suppose that z2 = exp ' and z = exp. Then = Log z, = Log z. If zi = r1 (cos 01 + i sin 01) and z = r2 (cos 02 + i sin 02) we deduce log r1 + (2m7r + 0?) i = log r2 + (2nr + 02) i, where m and n are integers. This involves log r1 = log r2, 2m7r + 01 = 2nr + 0. Thus r1=-2 and 01 and 02 differ by a multiple of 27r. Hence ZI =- Z2 COROLLARY. Whez is real exp ' = e, the real exponential function of -defined in Ch. IX. For if z = e, logz=, i.e. one of the values of Logz is. Hence z = exp. 207. The value of exp,. Let -=: + iy and z = exp ' = r (cos 0 + i sin 0). Then t + ir = Log z = log r + (0 + 2m7r) i, where m is an integer. Hence | = log r, 77 = 0 + 2m7r, or r = et, 0 = - 2mw7r; and accordingly exp ( + i7) = et (cos ' + i sin s ). If V7 = 0, exp a = et, as we have already inferred (~ 206). It is clear that exp ( + iv) is a continuous function of: and y7 for all values of ~ and?.

Page 388

388 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X 208. The functional equation satisfied by exp ~. Let 1= 1 + i, 2 = 2 + i+2. Then exp i x exp 2 = et (cos w + i sin 1) x et2 (cos '2 + i sin 72) = etl+t2 {cOS (1 +?2) + i sin (X1 + %2)} = exp (~ + ~2). The exponential function therefore satisfies the functional relation f(' + 2) =f(?)f/(2), as we have already seen (Ch. IX, ~ 187) to be the case for real values of ri and '2. 209. The general power aQ It might seem natural, as exp = e; when ' is real, to adopt the same notation when ' is complex and to drop the notation exp ' altogether. We shall not follow this course because we shall have to give a more general definition of the meaning of the symbol ed: we shall find then that e; represents a function with infinitely many values of which exp 3 is only one. We have already defined the meaning of the symbol a; in a considerable variety of cases. It is defined in elementary Algebra in the case in which a is real and positive and ' rational, or a real and negative and ~ a rational fraction whose denominator is odd. According to the definitions there given aL has at most two values. In Ch. III we extended our definitions to cover the case in which a is any real or complex number and ' any rational number p/q: according to the new definition a; had in all cases (unless a = 0) exactly q values, and we showed that this definition is consistent with and includes the previous definition. In Ch. IX we gave a new definition, expressed by the equation a; = ealog which applies whenever T is real and a real and positive. Thus we have, in one way or another, attached a meaning to such expressions as 37/12, (- 1)1/3, (V3 + ) --- 11/2 (3-125)1+W2, but we have as yet given no definitions which enable us to attach one to such expressions as (1 + i)42, it, (3 + 2i)2+i. We shall now give a general definition of a; which applies to all values of a and I, real or complex, with the one limitation that a must not be equal to 0.

Page 389

208-210] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 389 DEFINITION. The function a; is defined by the equation a; = exp (C Log a) where Log a is any value of the logarithm of a. We must first satisfy ourselves that this definition is consistent with the previous definitions and includes them all as particular cases. (1) If a is positive and [ real one value of CLog a, viz. Clog a, is real: and exp ( log a) = e log a, which agrees with the definition adopted in Ch. IX. The definition of Ch. IX was, as we then saw, consistent with the definition given in elementary Algebra; and so our new definition is so too. (2) If a = e' (cos A + i sin A), then Log a= a + (A + 2m7r) i, exp {(p/q) Log a} = ePal Cis {(p/q) (A + 2m7r)}, where m may have any integral value. It is easy to see that, if m assumes all possible integral values, this expression assumes q and only q different values, which are precisely the values of aP/q found in Ch. III, ~ 41. Hence our new definition is consistent also with that of Ch. III. 210. The general value of ad. Let = -t i, a = p (cosA + isinA) where -7r<A 7r, so that, in the notation of ~209, p = e or a = log p. Then [Log a = (I + i1) {log p + (A + 2m7r) i} = L + iM, where = log p -q (A + 2m7r), M1 = log p + f (A + 2m7r); and a = exp (' Log a) = eL (cos M + i sin M). Thus the general value of ad is t log p- (A+2mr) [cos { log p + ( (A + 2m7r)} + i sin {a log p + 4 (A + 2m7}r)}]. In general a; is an infinitely many-valued function. For I a;I = et log p-7 (A + 2mr) has a different value for every value of m unless v = 0. If v = 0, on the other hand, the moduli of all the different values of ad are

Page 390

390 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X the same. But any two values differ unless their amplitudes also are the same. This requires that t (A + 2m7r) and 5 (A + 2n7r), where m and n are different integers, shall differ, if at all, by a multiple of 27r. But if t (A + 2m7r) -, (A + 2nr) = 2k7r, it is clear that A = k/(n - n) is rational. We conclude that a; is infinitely many-valued unless is real and rational. If ~ is complex the values have all different moduli: if 5 is real and irrational there are infinitely many values, all of which have the same modulus, so that their representative points lie on a circle. On the other hand we have already seen that when ' is real and rational a; has but a finite number of values. The principal value of a~=exp ( Loga) is obtained by giving Log a its principal value, i.e. by supposing m=0 in the general formula. Thus the principal value of a; is et log p- 2-A {cos (7 log p + ~A) +i sin (a log p +:A)}. Two particular cases are of especial interest: (i) if a is real and positive and C real, then p =a, A =0, = C, 7=0, and the principal value of a; is e log a, which is the value defined in the last chapter: (ii) if I a I=1 and C is real, then p =l, 4 = 0, 7=0 and the principal value of (cos A + i sin A); is cos CA + i sin (A. This is a further generalisation of De Moivre's Theorem (Ch. III, ~~ 36, 42). Examples XCVI. 1. Find all the values of ii. [By definition i=exp (i Log i). But i=cosr T+i sin 1-T, Log i = (2k7r +r) i where k is any integer. Hence i= exp {- (2k +~) 7r} =e-(2k+i)*. All the values of ii, therefore, are real and positive.] 2. Find all the values of ( +i)i, il+i, ( +i)1+i. 3. The values of a(, when plotted in the Argand diagram, are the vertices of an equiangular polygon inscribed in an equiangular spiral whose angle is independent of a. (Math. Trip. 1899.) [If aW=r (cos 0 + i sin 0) we find r=e log p- (A+2mr1) 0 =q logp + (A+2nr)); and all the points lie on the spiral r=p(< +?2)l/ e-_O/t.] 4. The function e. If we write e for a in the general formula, so that logp = 1, A =0, we obtain e~= e~- 2ml {cos (r + 2m7r$) + i sin ( + 2nmrr)}

Page 391

210] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 391 The principal value of e~ is et (cos i7 + i sin 7), which is the same as exp (~ 207). In particular, if ( is real, so that 7=0, we obtain e (cos 2m7rr+i sin 2mrr) as the general and e; as the principal value, e( denoting here the positive value of the exponential defined in Ch. IX. 5. Show that Log e=(1+2mnri) (+22nri, m and n being any integers, and that in general Log a; has a double infinity of values. 6. The equation l/a;=-a- is completely true (Ex. xcv. 3): it is also true of the principal values. 7. The equation aCxb= (ab)( is completely true but not always true of the principal values. 8. The equation a x a'=a+!' is not completely true, but is true of the principal values. [Every value of the right-hand side is a value of the lefthand side, but the general value of aSx a', viz. exp {J (log a + 2nrri) + ' (log a + 2n7ri)}, is not as a rule a value of a~+;' unless m=n.] 9. What are the corresponding results as regards the equations Log a= Log a, (a;) = (a) = a 2 10. For what values of a is (a) any value (b) the principal value of e; (i) real (ii) purely imaginary (iii) of unit modulus? 11. The necessary and sufficient conditions that all the values of as should be real are that 4, {7log la + amra}/rr, should both be integral. What are the corresponding conditions that all the values should be of unit modulus? 12. The general value of axi+x-ix, where x>0, is e -(~ - ~ 4n/[2 {cosh 2 (m +n) r +cos (2 log s)}]. 13. Explain the fallacy in the following argument: since emri=e2i, where m and n are any integers, therefore, raising each side to the power i, we obtain e- m e-2n,. 14. In what circumstances are any of the values of xx, where x is real, themselves real? [If x>0 then XX = exp (x Log x)= exp (x log x) Cis 2m7rx, the first factor being real. The principal value, for which m=0, is always real. If x is a rational fraction p/(2q+ 1), or is irrational, there is no other real value. But if x is of the form p/2q there is one other value, viz. - exp (xlogx), given by mn= q.

Page 392

392 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X If x= - $<0 then x =exp {- Log (-:)} =exp ( - log) Cis { - (2m + 1) 7r}. The only case in which any value is real is that in which =pl/(2q + 1), when m =q gives the real value exp ( - log:) Cis ( -pr)=( - 1)P-. The cases of reality are illustrated by the examples (a)9=1//3, ( ) -)1/2 (, (-.))2 (-1-=-/3.] 15. Logarithms to any base. We may define (=Logrz in two different ways: (i) we may say that (=Log, z if the principal value of a~ is equal to z; (ii) we may say that C= Log, z if any value of a( is equal to z. Thus if a=e, then g=Logez, according to the first definition, if the principal value of e; is equal to z, or if exp = z; and so Loge z is identical with Logz. But, according to the second definition, (=Logez if en=exp (Loge)=z, Log e=Logz, or = (Log z)/(Log e), any values of the logarithms being taken. Thus -L log I z + (am z+2mrnr)i (= Logr z+= I + 2nrri so that ( is a doubly infinitely many valued function of z. And generally, according to this definition, Logc, = (Log z)/(Log a). 16. Logel=2m7ri/(l +2nrri), Log( - l)=(2m +l) 7ri/(l + 2nri), where mr and n are any integers. 17. We know that, as g-O, h being real, {log(1 + )}/-(-1. This result may be extended to complex values of C. For log (1 + ()=, the path of integration being the straight line from 1 to 1 +. This line may be represented by the equations x=- + tpcos, y-=tpsin 5, (Ot<l 1), where p is the modulus and q the amplitude of (. Thus log (1 + )= p Cis dt o (,1 +,- / ~0 1 + tp C is q'f Ilog:is dt =dt and -log(l+))=1 +t p Cis p 1+ tp Cis d The modulus of the last term is less than tdt I1 tdt p P P o (1 2tp cos + t2p2)< Poo < 1 p < P -tp z o 2(1-p)' which tends to zero with p, and so {log (1 +)}/-1. If c=-$+iq, and $ and 77 are each made to tend to zero, then c approaches the origin along a path the nature of which depends on the way in which

Page 393

210] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 393 j and q tend to zero, or on the relations which hold between them in the course of this process. Thus if ~ were always equal to 7 this path would be.a straight line bisecting the angle between the axes. If a function f(C) has the property of tending to a limit I as — 0 and 77 -0, whatever these relations may be (cf. ~ 90) or along whatever path C approaches the origin, we shall say that limf(C)= or f () —l as C-0. Thus {log (1 + )}/C-1 as — 0. Similarly we define the meaning of 'f(C)-l as ~ —0'. 18. Generalisation of the exponential limit. In Ch. IX we proved that limr + =expz when z is real. We shall now prove that the result remains true for complex values of z. For, by the last example, {log(1 + )}/~-l1 as C-0. If we put -=z/n, where n is a positive integer, we see that, as n oo, n log (l + -z; n ) and (l+z) =exp { log (l +)} -exp z, since expz, where z=x +iy, is a continuous function of x and y. 19. More generally, if Zn is a function of z, such that nZZ, —z as n-co, then (1 + Z,)n -exp z. [For flog (1 +Zn)~ log (+ Z,)=(nZ) {log (- + )Z 20. Prove that if t is real then Dt log (1 +tz)=z/(l +tz), for all values of z real or complex. [This is of course the same formula as is given by Th. (8) of ~ 94 when z is real. In the general case we have d log (l + tz)= + lim / h log (1 + ), ilogil~tr)= l~my~log K i/~z) I l+mt { [_ d-t h-v0 ~' log-/-t — h 0\1-+ -/ ~ h-0 ~ l 14-o u), where u =hz/(1 + tz), and the result follows from Ex. 17.] 21. Show that the formula Dt log 4 (t) = )' (t)/ (t) holds generally when b is a complex function of the real variable t. [Write =t +4 +x, log 0= log (+2 +X2)+iarctan (X/+) and differentiate according to the ordinary rules.] 22. Prove the formula Dt exp e (t) = ' (t) exp [ (t). [Write = + +ix, expq=exp +(cosx+isinx) and differentiate according to the ordinary rules.] 23. Prove that t (1 +tz)m=nmz ( lt)m-l, where t is real, z and m have any values, and the principal value of each side is taken. [Write (1 -tz)m=exp {m log (1 +tz)} and apply the results of Exs. 20 and 22.]

Page 394

894 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X 211. The exponential values of the sine and cosine. From the formula exp (I + it) = exp I (cos J + i sin r), we can deduce a number of extremely important subsidiary formulae. Taking = = 0 we obtain exp (i) = cos q + i sin r; and changing the sign of q, exp (- ij) = cos q - i sin 7. Hence cos q = - {exp (in) + exp (- i)}, sin - = - ~i {exp (in) - exp (- i1)}. We can of course deduce expressions for any of the trigonometrical ratios of r in terms of exp (in). 212. Definition of sin? and cos? for all values of?. We saw in the last section that, when? is real, cos -= I {exp (it) + exp (- i)1) sin =- i {exp (i) - exp (- i~)} The left-hand sides of these equations are defined, by the ordinary geometrical definitions adopted in elementary Trigonometry, only for real values of g. The right-hand sides have, on the other hand, been defined for all values of ~, real or complex. We are therefore naturally led to adopt the formulae (1) as the definitions of cos ' and sin ' for all values of. These definitions agree, in virtue of the results of ~ 211, with the elementary definitions for real values of f. Having defined cos [ and sin ' we define the other trigonometrical ratios by the equations sin r cos 5 1 tan = sin cot i sec =- cosec...(2). cos t' sin f" cos ~' sin "' It is evident that cos ' and sec ' are even functions of ', and sin ', tan g, cot r, and cosec ' odd functions. Also, if exp (i)= t, we have cos I= 2 {t + (1/t), sin = - ~it - (l/t)}, cos2 g+ sin2?=! [{t + (/t)}2 - {t - (l/t)2] = 1......(3) Moreover we can express the trigonometrical functions of?+ g' in terms of those of ' and?' by precisely the same formulae as those which hold in elementary trigonometry. For if exp (i') = t, exp (i"') = t', we have

Page 395

211-214] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 39.5 t t' {(t + ) (t + t+ + -t (t' - = cos cos co - sin {sin g............(4); and similarly we can prove that sin ( + 0') sin ' cos +' + cos f sin '............ (5). In particular cos ( + ~ 7r)= i - sin, sin ( + 1r) =cos......(6). All the ordinary formulae of elementary trigonometry are algebraical corollaries of the equations (2)-(6); and so all such relations hold also for the generalised trigonometrical functions defined in this section. 213. The generalised hyperbolic functions. In Ch. IX, Ex. LXXXIX. 19, we defined cosh f and sinh b, for real values of f, by the equations cosh =-: {exp b+exp (- f)}, sinh = = {exp f- exp (- f)}......(1). We can now extend this definition to complex values of the variable; i.e. we can agree that the equations (1) are to define cosh f and sinh f for all values of f real or complex. The reader will easily verify the following relations: cos i= cosh f, sin i = i sinh f, cosh i = cos f, sinh i = i sin f. We have seen that any elementary trigonometrical formula, such as the formula cos 2 = cos2 C - sin2, remains true when f is allowed to assume complex values. It remains true therefore if we write cos id for cos f, sin if for sin C and cos 2if for cos 2C; or, in other words, if we write cosh f for cos A, cosh 29 for cos 2(, and i sinh f for sin f. Hence cosh 2C= cosh2 f+ sinh2 C. The same process of transformation may be applied to any trigonometrical identity. It is of course this fact which explains the correspondence noted in Ex. LXXXIX. 21 between the formulae for the hyperbolic and those for the ordinary trigonometrical functions. 214. Formulae for cos(+i?7), sin(4+i/), etc. It follows from the addition formulae that cos (i + i)) = cos 4 cos iq - sin 4 sin ii7 =cos 4 cosh r - i sin 4 sinh a, sin (4 + i/) = sin 4 cos iq + cos 4 sin ir] = sin 4 cosh q7 + i cos 4 sinh y. These formulae are true for all values of 4 and a. The interesting case is that in which 4 and q are real. They then give expressions for the real and imaginary parts of the cosine and sine of a complex number. Examples XCVII. 1. Determine the values of for which cos f and sin C are (i) real, (ii) purely imaginary. [For example cos is real when -=0 or when 4 is any multiple of 7r.]

Page 396

396 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X 2. cos (4 + i1) I =^/(cos2 4 + sinh2 7) = /{ (cosh 2 + cos 2)}, sin (4 + i7) = -/(sin2 4 + sinh2 ) = /{2 (cosh 27 - cos 2S)}. [Use the equation cos (4 +ir) -J= V{cos (4 + i7) cos (4- ir)}.] sin 24+ i sinh 2r c. sin 24 - i sinh 27 3. tan(+i )=7cosh 2 +cos 2 cosh 2 1-cos 2~ [For example sin ( + ir) cos (4 - i) sin 24 + sin 2i77 tan(+ )() os = (-r ) c os ( cos24 + cos 2it1 ' which leads at once to the result given.] se cos 4 cosh 7 + i sin 4 sinh q 4. sec(4+i7)= (cosh 2q +cos2) ' sin 4 cosh 7 - i cos 4 sinh r cosec (4 + it) = } (cosh 27 - cos 24) 5. If Icos(4+i77)1=l then sin2$=sinh277 and if Isin(4+i/1)l=1 then cos2 = sinh2 7. 6. If icos (4+iq) =1, then sin {am cos (+ + ir)} = + sin2 = ~ sinh2 7. 7. Prove that Log cos (4 + i7) = A + iB, where A = log {(cosh 277 + cos 24)} and B is any angle such that cos B sin B 1 cos 4 cosh 77 sin 4 sinh7 = /{~ (cosh 27 +cos 2)} Find a similar formula for Log sin (4+ iq). 8. Solution of the equation cos — a, where a is real. Putting (=4+ iv, and equating real and imaginary parts, we obtain cos ~ cosh 7 = a, sin ~ sinh r = 0. Hence either 7=0 or 4 is a multiple of rr. If (i) r7=0 then cos= a, which is impossible unless - 1c ac 1. This hypothesis leads to the solution (= 2k7r + arc cos a, where arc coS a lies between 0 and rr. If (ii) =m7r then cosht =(- 1)ma, so that either a l and m is even, or a_ -1 and m is odd. If a= 1 then 7 =0, and we are led back to our first case. If a > 1 then cosh r = a I, and we are led to the solutions C= 2krr~ilog a+^/(a2- 1)} (a>), (=(2k+1)7+ ilog {-.a+/(a2-1)} (a<- ). For example, the general solution of cos = - - is = (2k + 1) ir i log 3. 9. Solve sin == a, where a is real. 10. Solution of cos =a+ i3, where 39 0. We may suppose /3>0, since the results when / < 0 may be deduced by merely changing the sign of i. In this case cos: cosh 77 =a, sin ~ sinh = - -3.....................(1), and (a/cosh 7)2+ (/3/sinh 7)2 = 1.

Page 397

214] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 397 If we put cosh2 77=x we find that X2 -(1 + a2 +2) x + a2 = 0 or x=(A1 + A)2, where Al- - /{(a+ 1)2 +2}, A2 = /{(a- 1)2+/32}. Suppose a > 0. Then A > A2>0 and cosh =A1 +A2. Also cos = a/(cosh 77) =A1 A2, and since cosh 7 > cos we must take cosh =Al A2, cos = A - A2 The general solutions of these equations are = 2kwr + arc cos M, 77= ~ log {L+ /(L2 - 1)}............(2), where L=A1+A2, i=A1-A2, and arccos X lies between 0 and ~Tr. The values of 77 and 6 thus found above include, however, the solutions of the equations cos a cosh = a, sin 6 sinh = 3.....................(3), as well as those of the equations (1), since we have only used the second of the latter equations after squaring it. To distinguish the two sets of solutions we observe that the sign of sin $ is the same as the ambiguous sign in the first of the equations (2), and the sign of sinh q is the same as the ambiguous sign in the second. Since 3 > 0 these two signs must be different. Hence the general solution required is = 2kTr + [arc cos J- i log {L + (L2 - 1)}]. 11. Work out the cases in which a < 0 and a=O in the same way. 12. If /3=0, LZ=-a+ll+~ja-1l, [-M=la+1 —l-i a-1. Verify that the results thus obtained agree with those of Ex. 8. 13. Show that, if a and / are positive, the general solution of sin = a+ i/3 is a= k7 +( - )k [arc sin n+ilog {L + /(L2 - 1)}], where arcsin A lies between 0 and ~7r. Obtain the solution in the other possible cases. 14. Solve tan P=a, where a is real. [All the roots are real.] 15. Show that the general solution of tan ==a+if, where 3 4 0, is -kw=+~O+ ilog a+(1- 2 where 0 is the numerically least angle such that cos: sin 0: 1:: - a2- /2: 2a: /{(1 - a2 - 2)2 4a2}. 16. If z=t exp ( Tri), where is relan al, and c is also real, then the modulus of cos 2rz - cos 2rrc is./[1- {1 + cos 47rc+ cos (27r /2) + cosh (27r,)/2) - 4 cos 27rc cos (rV^/2) cosh (r6V,/2)}].

Page 398

398 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X 17. Prove that I exp exp (i + i) = exp (exp 4 cos 7), R {cos cos ( + iq)} = cos (cos 4 cosh 77) cosh (sin 4 sinh 71), I {sin sin (4 + il)} = cos (sin 4 cosh a) sinh (cos 4 sinh 7q). 18. Prove that I exp [ I tends to + oo if C moves away towards infinity along any straight line through the origin making an angle less than 7Tr with OX, and to 0 if C moves away along a similar line making an angle greater than Ir with OX. 19. Prove that Icos | and [sin tend to +c-o if C moves away towards infinity along any line through the origin other than either half of the real axis. 20. Prove that tan --- t i, according as the line lies above or below OX. 215. The connection between the logarithmic and the inverse trigonometrical functions. We found in Ch. VI that the integral of a rational or algebraical function p (x, a, 3,...), where a, 3,... are constants, often assumes different forms according to the values of a, /3,...; sometimes it can be expressed by means of logarithms, and sometimes by means of inverse trigonometrical functions. Thus dx 1 x dx 1 j — xarctan-_ log +'()...... J a2 - a n ' a)= x - /( - a '''a)} ' according as a>0 or a<0, and x +a =log {x+^/(x2+a )}, (x - =are)sin - (2) according as a>0 or a<0. These facts suggest the existence of some functional connection between the logarithmic and the inverse circular functions. That there is such a connection may also be inferred from the facts that we have expressed the circular functions of C in terms of exp i, and that the logarithm is the inverse of the exponential function. Let us consider more particularly the equation J dx 1 (t x-a\ '_2 =-alog V which holds when a is real and (x - a)/(x+a) is positive. If we could write ia instead of a in this equation we should be led to the formula arc tan (-) = log ( - + const...................(3), and the question is suggested whether, now that we have defined the logarithm of a complex number, this equation will not be found to be actually true. Now (~ 204) Log (x + ia)= ^ log (.2 + a2) + (4 + 2k/r) i, where k is an integer and q is the numerically least angle such that cos b = //(x2+ a2) and sin q=a/J(x2 + a2). Thus Log = - ( +1 ), where I is an integer, and this does in fact differ by a constant from any value of arc tan (x/a).

Page 399

215, 216] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 399 The standard formula connecting the logarithmic and inverse circular functions is 1i I - /..z... arc tan x= Log ( )...........................(4), where x is real. It is most easily verified by putting x = tan y, when the righthand side reduces to 2 \og -Li siy og ((exp 2iy) =y + k7r, k being any integer, so that the equation (4) is 'completely' true. The reader should also verify the formulae ar cos x= - i Log {x ~ i^/(l - x2)}, arc sin x - Log {ix + /(1 - x2)}...(5), where - 1 _ x 1, each of which is also 'completely' true. Example. Solving the equation c= cos x = 2 y +(l/y)}, where y=exp (ix), with respect to y, we obtain y=c ~i/(1 - c2). Thus x= - Logy= -iLog {c+_ i/(l -c2)}, which is equivalent to the first of the equations (5). Obtain the remaining equations (4), (5) by similar reasoning. 216. The power series for exp zi. We saw in Ch. IX, ~ 194, that when z is real Z2 exp = 1+z+ +.....................(1). Moreover we saw in Ch. VIII, ~173, that the series on the righthand side remains convergent (indeed absolutely convergent) when z is complex. It is naturally suggested that the equation (1) remains true, and we shall now prove that this is the case. Let the series (1) be denoted by F(z). The series being absolutely convergent, it follows by direct multiplication (as in Ex. LXXXIII. 7) that F(z) satisfies the functional equation F () F ( )= F(z + ').....................(2). Now let z = iy, where y is real, and F(z) =f(y). Then f(y)f(y') =f(y + y'); and so f y ( y)-f (y) - (y +') = y' * It will be convenient now to use z instead of- as the argument of the exponential function.

Page 400

400 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X But f(y,)- 1 i Y' (iy+_...; But yl4+ + 2! 3! and so, if y'< 1, ^ -^QI I ^ -) 2). f/(y~)- El<+. + 1< < (e-2). Hence {/f(y')- l1/y'-.i as y' —O, and so ()= lim (Y + ') -f(Y) if (). dy f = liO Y = Incidentally we have proved thatf(y) is a continuous function of y. Now If(y) I= F (iy) I = I/{F(iy)F(-iy)} = /{F(0)} = 1, and therefore f(y)= cos Y + i sin Y where Y is an angle depending on y. Since f(y) is continuous and has a differential coefficient, its real and imaginary parts must satisfy the same conditions; and so Y must satisfy them. Hence f' (y) = (- sin Y + i cos Y) DyY. But we have already seen that f'(y) = if(y) = - sinY + i cos Y. Hence DyY= I, Y= y + C, where C is a constant, and f(y)= cos (y + C) + i sin (y + ). But when y= 0, f(0)=-1; hence C is a multiple of 27r and f(y)=cosy+isiny. Thus F(iy)=cosy+isiny for all real values of y. And if x also is real F (x + iy) = F (x) F (iy) = exp x (cos y + i sin y) = exp (x + iy), z2 or exp z=1 z+ +..., for all values of z. 217. The power series for cos z and sin z. From the result of the last section and the equations (1) of ~ 212 it follows at once that z2 Z4 Z. Z cosz= 1- + -, sin z = -!2! 4! 3! 5**

Page 401

216, 217] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 401 for all values of z. These results were proved for real values of z in Ex. LVIII. 1. This is of course by far the most interesting case. Examples XCVIII. 1. Calculate cos i, sin (l+i), and sin (r- 1- i) to two places of decimals by means of the power series for cos z and sin z. 2. Prove that Icos z ] cosh I z I and j sin z I _ sinh I z 1. 3. Prove that if z I < 1 then I cos z \ < 2 and | sin I < - [X \. 4. Since sin 2z = 2 sin z cos z we have (2z)3 (22)5) (1 2! (2-?+ -...=2 -.+... -. Prove by multiplying the two series on the right-hand side (~ 177) and equating coefficients (~ 176) that 21n+1) + (2n+1) +... + 2 \+. 1 3;+* * \,2n+1 Verify the result by means of the binomial theorem. Derive similar identities from the equations cos2 z + sin2 = 1, cos 2z= 2 cos2 - 1 = - 2 sin2 z. 5. Show that exp {(1 +i) z}= 21 exp (4nwi) -. 6. Expand cos z coshz in powers of z. [We have cosz coshz-i sinz sinhz=cos (1 +i) z= [exp ((1 +i) z}+exp{- (1+i) z}] =~ 2{ 2{1 +(- 1i)n} exp (n-ri) ~-,! and a co similarlyn z sinh = cos (1 - i) z -=' 22 1 l + (-_ 1)I} exp (- 'n7ri)! 0 Hence cos z cosh z =2 22 {1+( 1)j} cos ^r-j=1- _ + 2 8 2.... 7. Expand sin z sinh z, cos z sinh z, and sin z cosh z in powers of z. 8. Expand sin2z and sin3z in powers of z. [Use the formulae sin22- (1 - cos 2z), sin3 z= (3 sin z- sin 3),.... It is clear that the same method may be used to expand cos z and sin z, where n is any integer.] 9. Sum the series cos z cos 2z cos 3z sin z sin 2z sin 3z 0=l+ 1-+ -! +2 +.3! 1! +! + +.... H. A. 26

Page 402

402 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X [Here C+ iS= I +exp (iz) + exp (2iz) exp exp (iz) [Here C+iS=1+ 1! + 2! +...=exp{exp(iz)} =- exp (cos z) {cos (sin z) +i sin (sin z)}, and similarly C- iS-= ep exp ( - iz)} = exp (cos z) {cos (sin z) - i sin (sin z)}. Hence C=exp (cos z) cos (sin z), S=exp (cosz) sin (sin a).] a cos0 a2 cos 2z a sin z a2 sin 2z 10. Sum + 1! + 2! 1! + 2! cos 2z co 4 cos4z os z cos 3z 11. Sum -1 + ~~, +., 11. Sum 1- 2! + 4! ' 1! - 3! and the corresponding series involving sines. 12. Show that cos 4z cos 8z 1 + + ~- +...4 - { cos c z) cosh (sin z) + cos (sin z) cosh (cos z)}. 13. Show that the expansions of cos (x+ h) and sin (x +h) in powers of h (Ex. LVIII. 1) are valid for all values of x and h, real or complex. 218. The logarithmic series. We found that when z is real and numerically less than unity log(1 + z)= z- _ 2 + 1z3...................(1). The series on the right-hand side is convergent (indeed absolutely convergent) when z has any complex value whose modulus is less than unity. It is naturally suggested that the equation (1) remains true for such complex values of z. That this. is in fact true may be proved by an easy modification of the argument of ~ 195. It will be remembered that log (1 + z) is the principal value of Log (1 + z), and that /+z dt [z du log (1 + )= J1 t J o 1 + the paths of integration in the planes of t and u being the straight lines from 1 to 1 +z and from 0 to z respectively. But, as in 195, we have 1 = I - u + u-2... +(- 1)m —1l -1 + ( ) 1 1+ U ) and from this (using the results of ~ 202) we deduce log (1 +)= z - Z2 + Z3 (- + (-1) +, m where RM( -1)m1 udu R, =-l~mJol0

Page 403

218, 219] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 403 Now [ 1 + u [, which is equal to the length of the line drawn from the point z =-1 to the position of u, is, as a reference to Fig. 73 shows, always greater than w, the perpen- / dicular from A on to the line OP. Hence, if z = rCis 0 and u = p Cis b, we have, by ~ 202, A 1 Tr rm+l - 1\ / 1I iR I<- p: dp \ I Wo ( P (m+l)' \ Hence RB-O as m —oo, and therefore log(1 +z) =z- Z2+1 Z3- (2). FIG. 73. If z \= 1 the real and imaginary parts of the logarithmic series are each conditionally convergent (unless z=-1). The proof given above is applicable in this case also. Thus the equation (2) holds for all points inside or on the circle I z = 1, except z =-1. Changing z into -z we obtain log (1-) =-log(1- )=z+Lz2+ z3+......(). 219. Now let z=r(cosO+isinO) where Or1l. Then log (1 + z)= log {(1 + r cos 0) + ir sin 0} = log (1 + 2r cos 0 + r2) + i ar tan (, r sin \1 + r cos That value of the inverse tangent must be taken which lies between -1 r and I 7. For since 1 + z is the vector represented by the line from - 1 to z, the principal value of am (1 + z) always lies between these limits when z lies within the unit circle. Since z = rm (cosm0 + i sin mO) we obtain, on equating the real and imaginary parts in (2), log (1 + 2r cos 0 + r2)= r cos 0 - r2 cos 20 + r3 cos 30-..., r sin r arc tan +rs = sin 0- 2 sin 20 + Ir sin 30-.... I + r cos 6 2 32 These equations hold for 0 O r < 1 and all values of 0, except that, when r = 1, must not be equal to an odd multiple of 7r. It is easy to see that they also hold for -1- r O 0 and all values of 0, 26-2

Page 404

404 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X except that, when r =- 1, 0 must not be equal to an even multiple of 7r. A particularly interesting case is that in which r= 1. In this case, if -7r < 0<7r, we have log(1 + z)= log(1 + Cis 0)=log (2 + 2cos)+iarctan ( +sin ) = log (4 cos2 ) + i, and so cos 0 - cos 20 + 3 cos 30 -... = Ilog (4 cos2 20) sin 0- Isin 20 +- sin 30 -... = O. The sums of the series, for other values of 0, are easily found from the consideration that they are periodic functions of 0 with the period 27r. Thus the sum of the cosine series is -log (4 cos2 0) for all values of 0 save odd multiples of 7r (for which the series is divergent), while the sum of the sine series is -(0-2k7r) if (2k - 1) r < 0 < (2k + 1) 7r, and zero if 0 is an odd multiple of 7r. The graph of the function represented by the sine series is shown in Fig. 74. It is discontinuous for =(2k + 1) r. 7rr 6 iir r27 3 7 FIG. 74. If we write + iz for z in (2) we obtain log (1 +~i)= ~iZ-sZ2T z3 4 + 1-4~+i-.... Hence -I{log ( Z log (1 + i (1 - i)}=z - 3 + 1 -.... If z is real and numerically less than unity we are led, by the results of ~ 215, to the formula arctanz=z- 3+ 5 -..., already proved in a different manner in ~ 196. Examples XCIX. 1. Prove that, in any triangle for which a>b, b b2 log c=log a — cos C- cos 2C-.... a 2a2 [Use the formula log c = log (a2 + b2 - 2ab cos C).] 2. Prove that if -1 < r <1 and - T7r< 0 < r then rsin20-r2 sin 40+ r3sin60-... = -arc tan (1 tan 9, the inverse tangent lying between - ~r and ~Tr. Determine the sum of the series for all other values of 0.

Page 405

219, 220] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 405 3. Prove, by considering the expansions of log (1 + iz) and log (1 - iz) in powers of z, that if -1 < r< 1 then rsin + r2 cos 2 - r3 sin 30 r4 cos 4 +...= log (1 + 2r sin 0 + r2), rcos t ~ r2 sin 20 - r3 cos 30 r4 sin 40 +... =arc tan (- cos *8'~3 38a~ A,11 ( 1+ l2rsin 4 +r2 rsin0- r3sin 30.=log(+2 0+ 2 n31n+...- og 1- 2r sin 0+ r2 r - r3 Cos 3 +... = arc tan (2r cos8)- the inverse tangents lying between - ~r and I Tr. 4. Prove that CO os 8 c - 2 cos 20 cos2 09 + cos 30 cos3 0 -.- I log (1 + 3 cos2 0) sin 0 sin 0 - ~ sin 28 sin20 + sin 30 sins 0 -.. =arc cot (I +cot 0 +cot2 0), the inverse cotangent lying between - 1 and ~r; and find similar expressions for the sums of the series cos 0 sin - cos 20 sin2 +..., sin cos - sin 20 cos2 +.... 220. The general form of the Binomial Theorem. We have already proved (~ 197) that the sum of the series 1+ Z + () 2 + is (1 + z) = exp {m log (1 + z)}, for all real values of m and all real values of z between - 1 and + 1. If an is the coefficient of z" then an+l _ rn-n an n+l1 whether in is real or complex. Hence (Ex. LXXXII. 3) the series is always convergent if the modulus of z is less than unity, and we shall now prove that its sum is still exp {m log (1 + z)}, i.e. the principal value of (1 + z)m. For (Ex. xcvI. 23) we know that if t is real then Dt (1 + tz)" = mz (1 + t)z'3-l, z and m having any real or complex values and each side having its principal value. Hence, if p (t) = (1 + tz)m, we have (n) (t) = m (m - 1)..(m - n + 1) zn (1 + tz)- n. This formula still holds if t= 0, so that (X) (0)/n! = () Zn. * Some examples of the cases in which r = 1 will be found in the Miscellaneous Examples at the end of the chapter.

Page 406

406 THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS [X Now (1)-= (o) + ' (0) + 2(0) + + ( ) R where Rn = - 1 (1 - t)n~-l () (t) dt; for (~ 146) this formula holds for either the real or the imaginary parts of b (t), and therefore, by addition, for b (t) itself. But, if z = r Cis 0, we have I 1 + tz = V(1 + 2tr cos 0 + t2) _ (1 - tr), and therefore R nz (m - 1)...(m-n+ 1) frn (l t) (n - 1)! o (I - tr)n-m m (m| - 1)...(m - +l 1) r (1- 0)n-1 (n - 1)! (1 - r)n- ' where 0<0<1; and it follows, exactly as in ~ 147, that RE —0 as n —aoo. Hence we arrive at the following theorem: THEOREM. The sum of the binomial series 1+ z+ ( Z2 +... is exp {m log (1 + z)}, where the logarithm has its principal value, for all values of m, real or complex, and all values of z such that jzI<1. A more complete discussion of the binomial series, including a discussion of the more difficult case in which I z = 1, will be found on pp. 225 et seq. of Mr Bromwich's Infinite Series. Examples 0. 1. Suppose m real. Then since log (1 +z)- log(1 +2r cosO+r2)+i arc tan ( rsi we obtain 2 (in) z= exp {m log ( +2r cos +r2)} Cis arctan ( r cos 0 =(1 +2r cos 0+r2)m Cis {marc tan (1r so 0

Page 407

220] THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 407 all the inverse tangents lying between - rr and ~ir. In particular, if we suppose 0=7r, z=ir, and equate the real and imaginary parts, we obtain 1 - (2 2+ (4 ) ^. * (1 + 2)m cos (m arc tan r), () r-() r () r -...= (1 +r2) sin (m arc tan r). 2. Verify the formulae of Ex. 1 when m= 1, 2, 3. [Of course when m is a positive integer the series is finite.] 3. Prove that if - <r< then 2.4 2.4.6.8 2 ( 2(1 +r2)+1 1 1.3.5. 1.3.5.7.9 _ /^(4r2)-1 - +. -' " = -; ' 2 2.4.6 2.4.6.8.10 '" 2(l+r2) [Take m= - in the last two formulae of Ex. 1.] 4. Prove that if -I7r<0<7r then co m = cosm {1 - (2) tan2 0 (4) tan4 -..., sin m = cos"S {( tan 0 - ( tan3+... for all real values of m. [These results follow at once from the equations cos me + i sin m = (cos 0+i sin O)m = cosm 0 (1 + i tan O)m.] 5. We proved (Ex. LXXXIII. 6), by direct multiplication of series, that f (n, z)= () zn (where I z<1) satisfies the functional equation f(m, z)f(m', z)=f(m+m', z). Deduce, by an argument similar to that of ~ 198, and without assuming the general result of ~ 220, that if m is real and rational then f(m, z) =exp {m log (1 +z)}. 6. If z and p are real, and -1<z<1, then E (l) n = cos {f1 log (l) + z) + i sin {f log (I + z)}. MISCELLANEOUS EXAMPLES ON CHAPTER X. 1. Show that the real part of lo(1 +i) is e -(4 +l2 cos { (4k + 1) log 2}, where k is any integer. Find a general formula for the real part of iLog(l+).

Page 408

408 MISCELLANEOUS EXAMPLES ON CHAPTER X 2. If a cos 0 + b sin 0 + c = 0, where a, b, c are real and c2 > a2+ b2, then -- qT +'-a + i log I c I + /(c2 - a2 - b2) =mir +a~i log- d(a2 3+b2) where m is any odd or any even integer, according as c is positive or negative, and a is the least angle whose cosine and sine are a//(a2 + b2) and b/J(a2 +b2). 3. Prove that if 0 is real and sin 0 sin = 1 then 0=(k+~) r +ilogcot (k r+), where k is any even or any odd integer, according as sin 0 is positive or negative. 4. Show that if x is real then d exp {(a +ib)x = (a +ib) e( + ib), exp {(a + ib)x} dx = exp a+ib) Deduce the results of Ex. LXXxIX. 4. 5. Show that if a > 0 then exp- (a + ib) x} dx=, and deduce the results of Ex. LXXXIX. 5. 6. Show that if x is real then excos b = { W () a-2 b2 + ()a-4 b4 -.. where there are (n +1) or I (n +2) terms in the brackets. Find a similar series for eax sin bx. 7. If (x/a)2+(y/b)2= 1 is the equation of an ellipse, and f(x, y) denotes the terms of highest degree in the equation of any other algebraic curve, show that the sum of the eccentric angles of the points of intersection of the ellipse and the curve differs by a multiple of 2rr from -i {log (a, ib) - log (a, - ib)}. [The eccentric angles are given by f(a cos a, b sin a) +... =0 or by I/{a(+), -'( i - )}+,..=0, if u = exp ia; and 2a is equal to one of the values of - i Log P, where P is the product of the roots of this equation.] 8. Determine the number and approximate positions of the roots of the equation tan z=az, where a is real. [We know already (Ex. xvIII. 5) that the equation has infinitely many real roots. Now let z=x +iy, and equate real and imaginary parts. We obtain sin 2x/(cos 2x + cosh 2y) =ax, sinh 2y/(cos 2x + cosh 2y) = ay, so that, unless x or y is zero, we have (sin 2x)/2x= (sinh 2y)/2y. This is impossible, the left-hand side being numerically less, and the righthand side numerically greater than unity. Thus x=0 or y=O. If y=O we come back to the real roots of the equation, which have already been discussed.

Page 409

MISCELLANEOUS EXAMPLES ON CHAPTER X 409 If x=0, tanh y= ay. It is easy to see by graphical methods that this equation has no real root other than zero if a = 0 or a > 1, and two such roots if 0 < a < 1. Thus there are two purely imaginary roots if 0 < a < 1; otherwise all the roots are real.] 9. The equation tan z=az+b, where a and b are real and b= 0, has no complex roots if a < 0. If a > 0 the real parts of all the complex roots are numerically greater than I b/2a 1. 10. The equation tan z=a/z, where a is real, has no complex roots, but has one purely imaginary root if a<0. 11. The equation tan z= a tanh (cz), where a and c are real, has an infinity of real and of purely imaginary roots, but no complex roots. 12. Transformations. In Ch. III (Exs. 21 et seq., and Misc. Exs. 29 et seq.) we considered some simple cases of the geometrical relations between figures in the planes of two variables z, Z connected by a relation z=f(Z). We shall now consider some cases in which the relation involves logarithmic, exponential, or circular functions. Suppose firstly that z=exp (rZ/a), Z= (a/r) Log z where a is positive. To one value of Z corresponds one of z, but to one of z infinitely many of Z. If x, y, r, 0 are the coordinates of z and X, Y, R, e those of Z, we have the relations x=e /arXa cos (r Y/a) X=(a/7r)logr l y=eX/la sin (rr Y/a)) Y=(aO/rr)+2 a a' where k is any integer. If we suppose that - rr < 0 rrT and that Log z has its principal value log z, then k= 0, and Z is confined to a strip of its plane parallel to the axis OX and extending to a distance a from it on each side, one point of this strip corresponding to one of the whole z-plane, and conversely. By taking a value of Log z other than the principal value we obtain a similar relation between the z-plane and another strip of breadth 2a in the Z-plane. To the lines in the Z-plane for which X and Y are constant correspond the circles and radii vectores in the z-plane for which r and 0 are constant. To one of the latter lines corresponds the whole of a parallel to OX, but to a circle for which r is constant corresponds only a part (of length 2a) of a parallel to OY. To make Z describe the whole of the latter line we must make z move continually round and round the circle. 13. Show that to a straight line in the Z-plane corresponds an equiangular spiral in the z-plane. 14. Discuss similarly the transformation z = cosh (rZ/a), showing in particular that the whole z-plane corresponds to any one of an infinite number of strips in the Z-plane, each parallel to the axis OX and of breadth 2a. Show also that to the line X= X0 corresponds the ellipse cosh (x c s yh (X ) {c Cosh (~rrXo/a)) sinh (qriYo/a)J8

Page 410

410 MISCELLANEOUS EXAMPLES ON CHAPTER X and that for different values of X0 these ellipses form a confocal system; and that the lines Y= Yo correspond to the associated system of confocal hyperbolas. Trace the variation of z as Z describes the whole of a line X= X0 or Y-= Yo. How does Z vary as z describes the degenerate ellipse and hyperbola formed by the segment between the foci of the confocal system and the remaining segments of the axis of x? 15. Verify that the results of Ex. 14 are in agreement with those of Ex. 12 and those of Ch. III, Misc. Ex. 34. [The transformation z=ccosh( r-Z/a) may be regarded as compounded from the transformations Z= c1, = -'{Z2+ (1/Z2)}, Z2=eXp (rZ/a).] 16. Discuss similarly the transformation z=ctanh (7rZ/a), showing that to the lines X=X0 correspond the coaxal circles {x - c coth (wrXo/a)2 + y2 = c2 cosech2 (rYo/a) and to the lines Y= Yo the orthogonal system of coaxal circles. 17. The Stereographic and Mercator's Projections. The points of a unit sphere whose centre is the origin are projected from the S. pole (whose coordinates are 0, 0, -1) on to the tangent plane at the N. pole. The coordinates of a point on the sphere are ~, 7, (, and Cartesian axes OX, 0 Y are taken on the tangent plane, parallel to the axes of ~ and r7. Show that the coordinates of the projection of the point are x=2S/(1+ ), y=2q/(l+C), and that x+iy y=2 tan I0 Cis q5, where q is the longitude (measured from the plane ri=0) and 0 the north polar distance of the point on the sphere. This projection gives a map of the sphere on the tangent plane, generally known as the Stereographic Projection. If now we introduce a new complex variable Z=X+iY= -ilog z= - ilog (x+iy) so that X=-, Y= log cot 0O, we obtain another map in the plane of Z, usually called Mercator's Projection. In this map parallels of latitude and longitude are represented by straight lines parallel to the axes of X and Y respectively. 18. Discuss the transformation given by the equation z= Log {(Z- a)/(Z- b)}, showing that the straight lines for which x and y are constant correspond to two orthogonal systems of coaxal circles in the Z-plane. 19. Discuss the transformation z=Log d(Z-a) + d(Z-b) showing that the straight lines for which x and y are constant correspond to sets of confocal ellipses and hyperbolas whose foci are the points Z=a and Z= b.

Page 411

MISCELLANEOUS EXAMPLES ON CHAPTER X 411 [We have,^(Z-a)+^ (Z- b)=^(b-a) exp (x +iy), /(Z- a) - I(Z- b) =I(b - a) exp (- x - iy); and it will be found that iZ-aj+JZ-bl=Ib-a\cosh2x, IZ-al -Z-bl=lb-alcos2y.] 20. The transformation z=Zi. If z=Zi, where the imaginary power has its principal value, we have exp (log r + iO) = z=exp (i log Z) = exp (i log R - ), so that log r =- e, =log R+ 2krr, where k is an integer. As all values of k give the same point z we shall suppose k 0, so that logr= -e, = logR..............................(1). The whole plane of Z is covered if we allow R to vary from 0 to oo and 0 from - rr to r: then r has the range e-~ to e" and 0 ranges through all real values. Thus to the Z-plane corresponds the ring in the z-plane bounded by the circles r=e -r, r= ei; but this ring is covered infinitely often. If however we restrict the variation of 0 to the range (- 7r, + 7r), so that the ring is covered only once, then R is restricted to the range (e-, e"), so that the variation of Z is restricted to a ring similar in all respects to that within which z varies. Each ring, moreover, must be regarded as having a barrier along the negative real axis which z (or Z) must not cross, as its amplitude must not transgress the limits - r, + r. We thus obtain a correspondence between two rings, given by the pair of equations z=Zi, Z=z-i where each power has its principal value. To circles whose centre is the origin in one plane correspond straight lines through the origin in the other. 21. If Z, starting at the point eW moves round the larger circle in the positive direction to - e7, along the barrier, round the smaller circle in the negative direction, back along the barrier, and round the remainder of the larger circle to its original position, trace the corresponding variation of z. 22. Suppose each plane to be divided up into an infinite series of rings by circles of radii..) e- (2nl)rr,... e e, ~,..., ~.... e-(2n+1)7r e-T err 3r e(2n+1)7r Show how, by taking suitable values of the powers in the equations z=Zi, Z=z-i, to make any ring in one plane correspond to any ring in the other. 23. If z=Zi (any value of the power being taken) and Z moves along an equiangular spiral whose pole is the origin in its plane, then z moves along an equiangular spiral whose pole is the origin in its plane. 24. How does Z=Zai, where a is real, behave as z approaches the origin along the real axis. [Z moves round and round a circle whose centre is the origin (the unit circle if zai has its principal value), and the real and imaginary parts of Z both oscillate finitely.]

Page 412

412 MISCELLANEOUS EXAMPLES ON CHAPTER X 25. Discuss the same question for Z=z + bi, where a and b are any real numbers. 26. Show that the region of convergence of a series of the type E anai -oo00 where a is real, is an angle, i.e. a region bounded by inequalities of the type o < am z < 61. [The angle may vanish, or include all possible values of am z.] 27. Level Curves. Iff(z) is any function of the complex variable z, we shall call the curves for which /f(z) is constant the level curves of f(z). Sketch the forms of the level curves of z- a (concentric circles), (z - a) (z - b) (Cartesian ovals), (z - a)/(z- b) (coaxal circles), exp z (straight lines). FIG. 75. 28. Sketch the forms of the level curves of (z- a) (z - b) (z -c), (1+ z^/3+ 2)/z. [Some of the level curves of the latter function are drawn in Fig. 75, the curves marked I-VIT corresponding to the values *10, 2 -/3= 27, '40, 1-00, 2-00, 2+^3 =3-73, 4-53

Page 413

MISCELLANEOUS EXAMPLES ON CHAPTER X 413 of f (z) 1. The reader will probably find but little difficulty in arriving at a general idea of the forms of the level curves of any given rational function; but to enter into details would carry us into the general theory of functions of a complex variable.] 29. Sketch the forms of the level curves of (i) zexpz, (ii) sinz. [See Fig. 76, which represents the level curves of sin z. The curves marked I-VIII correspond to k='35, '50, -71, 1-00, 1'41, 2-00, 2-83, 4-00.] l IVI Vlv Vill I — 3 -1 t FIG. 76. FIG. 77. 30. Sketch the forms of the level curves of exp z - c, where c is a real constant. [Fig. 77 shows the level curves of lexpz-1, the curves I-vII corresponding to the values of k given by log = - 1'00, - 20, -'05, 0'00, ~05, 20, 1'00.] FIG. 78. FIG. 79.

Page 414

414 MISCELLANEOUS EXAMPLES ON CHAPTER X 31. The level curves of sin z-c, where c is a positive constant, are sketched in Figs. 78, 79. [There are two kinds of curves corresponding to the cases in which c < 1 and c > 1. In Fig. 78 c= 5, and the curves I-VIII correspond to k=-29, -37, -50, -87, 1'50, 2-60, 4-50, 7-79. In Fig. 79 c=2, and the curves I-vII correspond to k=-58, 1'00, 1'73, 3'00, 5-20, 9-00, 15-59. If c= 1 the curves are the same as those of Fig. 76, except that the origin and scale are different.] 32. Prove that if 0 < 0 < 7r then cos 0+ 1 cos 30+ os 50+... = og cot2, sin 0+ sin 30+ sin 50 +... == r, and determine the sums of the series for all other values of 0 for which they are convergent. [Use the equation z+ 3 +... log (+Z) where z= cos 0 +i sin 0. When 0 is increased by ir the sum of each series simply changes its sign. It follows that the first formula holds for all values of 0 save multiples of 7r (for which the series diverges), while the sum of the second series is 1 r if 2kir<0<(2k+ 1)rr, - r if (2k+1) 7 <O<(2k+2) r, and O if 0 is a multiple of 71.] 33. Prove that if 0 < 0 < I rr then cosO - cos30+1 Cos50-... =-7 sin 0 - -sin 30 + L si 50-... = log (sec 0 + tan 0)2; and determine the sums of the series for all other values of 0 for which they are convergent. 34. Prove that cos 0 cos a+-cos 20 cos 2a+ cos 3 cos 3a +...=- log {4(cos -cos a)2, unless 0 -a or 0+a is a multiple of 27r. 35. Prove that if neither a nor b is real then GO dx loga-logb jo (x-a)(x-b) a-b ' each logarithm having its principal value. Verify the result when a=ci, b = - ci, where c is positive. Discuss also the cases in which a or b or both are real and negative. 36. Prove that if a and / are real, and / > 0, then Is dx 7ri o x2 - (a+i3)2 2 (a+ i3) What is the value of the integral when / < 0? 37. Prove that 00 dv iri J A2 + 2Bx + B C d/(B2-A C) ' the-sign of /(B2 - AC) being so chosen that the real part of {,/(B2-AC)}IAi is positive.

Page 415

APPENDIX I. (To CHAPTERS III, IV, V.) The Proof that every Equation has a Root. LET Z= P(z)=-aozn+alzn-1 +... +an be any polynomial in z, with real or complex coefficients. We can represent the values of z and Z by points in two planes, which we may call the z-plane and the Z-plane respectively. If z describes a closed path y in the z-plane it is evident that Z describes a corresponding closed path r in the Z-plane. When Z returns to its original position its amplitude may be the same as before, as will certainly be the case if r does not enclose the origin, like path (a) in Fig. B, or it may differ from its original value by any multiple of 2r. Thus /// / (~' "' (a) FIG. A. FIG. B. if its path is like (b) in Fig. B, winding once round the origin in the positive direction, its amplitude will have increased by 2rr. We shall now prove that if the amplitude of Z is not the same when Z returns to its original position, the path of z must contain inside or on it at least one point at which Z=0. We can divide y into a number of smaller contours by drawing parallels to the axes at a distance a1 from one another (see Fig. C). If there is, on the boundary of any one of these contours, any point for which Z=0, what we wish to prove is already established. We may therefore suppose that this is not the case. Then the increment of am Z, when z describes y, is equal to the sum of all the increments of am Z obtained by supposing z- to describe each of these smaller contours separately in the same sense as y

Page 416

416 APPENDIX I For if z describes each of the smaller contours in turn, in the same sense, it will ultimately (see Fig. D) have described the boundary of y once, and each P Q /,1/' FIG. C. FIG. D. part of each of the dividing parallels twice and in opposite directions. Thus PQ will have been described twice, once from P to Q and once from Q to P. As z moves from P to Q, am Z varies continuously (since Z does not pass through the origin); and if the increment of am Z is 0 its increment when z moves from Q to P will be -; so that when we add up the increments of am Z due to the description of the various parts of the smaller contours all cancel one another save the increments due to the description of parts of y itself. Hence, if am Z is changed when z describes y, there must be at least one of the smaller contours (say yi) such that am Z is changed when z describes yl. This contour may be a square bounded wholly by the auxiliary parallels or may be bounded in part by the boundary of y. In any case it is entirely included within a square A1 whose sides are of length 81 and parallel to the axes. We can now further subdivide y1 by the help of parallels to the axes at a smaller distance 82 from one another, and we can find a contour y2, entirely included in a square A2 of side 82 (itself included inside A1), and such that am Z is changed when z describes it. Now let us take an infinite sequence of decreasing numbers 81, 82,..., am,..., whose limit is zero. By repeating the argument used above we can determine a series of squares A1, A2,..., A,,,... and a series of contours y1, 72,..., 7m,... such that (i) Am+l lies entirely inside Am, (ii) Y7 lies entirely inside A,, (iii) am Z is changed when z describes y,. If (xm, ym) and (xm + m, ym+ m) are the lower left-hand and upper righthand corners of Am, it is clear that xl, x2,..., xM,,... is an increasing and x+8l1, x 2+ 2,..., xm+m,... a decreasing sequence, and that they have a common limit x0. Similarly y, and y + m approach a common limit Yo, and (xo, yo) is the one and only point situated inside every square. However small be 8, we can draw a square which includes (xo, y0), and whose sides are parallel to the axes and of length 8, and inside this square a closed contour such that am Z is changed when z describes the contour.

Page 417

APPENDIX I 417 It can now be shown that P (o + iyo)=0. For if not let P (xo+iyo)=a, where a 0, and let I a =p. Since P (x +iy) is a continuous function of x and y we can draw a square 'whose centre is (x0, 0o) and whose sides are parallel to the axes, and which is such that IP (+ iy)- P (xo++ iy) I< 2P for all points x +iy inside it or on its boundary. For all such points P (x+iy)=a+f), where I < p. Now let us take any closed contour lying entirely inside this square. As z describes it, Z= a+ q5 describes a closed contour. But this contour evidently lies inside the circle whose centre is a and whose radius is p, and this circle does not include the origin. Hence the amplitude of Z is unchanged. But this contradicts what was proved above, viz. that inside each square A. we can find a closed contour the description of which by z changes am Z. Hence P (x + iyo) = 0. All that remains now is to show that we can always find some contour the description of which by z changes am Z. Now Z= n I+(l+ a+2 +... + a. (aK a0z ao / We can choose R so that lai la211 la Jao0R I aoIR2+ I" IaoIRn <E where e is any positive number, however small; and then, if y is the circle whose centre is the origin and whose radius is R, Z=ao Z (1 +p), where I p 1 <, for all points on y. But so long as I p I < 1 it follows, by an argument similar to that used above, that am(1 +p) is unchanged as z describes y in the positive sense, while am ez on the other hand is increased by 2n7r. Hence am Z is increased by 2nwr, and the proof that Z=0 has a root is completed. We leave it as an exercise to the reader to infer, from the discussion which precedes and that of ~ 34, that when z describes any contour y in the positive sense the increment of am Z is 2kwr, where k is the number of roots of Z=O inside y, multiple roots being counted multiply. There is another proof, proceeding on different lines, which is often given. It depends, however, on an extension to functions of two or more variables of the results of ~ 89. We define, precisely on the lines of ~ 89, the upper and lower limits of a function f(, y), for all pairs of values of x and y corresponding to any point of any region in the plane of (x, y) bounded by an ordinary closed curve. And we can prove, much as in ~ 89, that a continuous function f(x, y) attains its upper and lower limits in any such region, H. A, 27

Page 418

418 APPENDIX I Now IZl=IP(x+iy) is a continuous and positive function of x and y. If m is its lower limit for points on and inside y, there must be a point Zo for which IZ =m, and this must be the least value assumed by I ZI. If = 0, P(zo) =0, and we have proved what we want. Suppose, therefore, that m > 0. If we put z=zo +, and rearrange P(z) according to powers of I, we obtain P(z)=P(z0)+Al'+A2 2+... +An%, say. Let Ak be the first of the coefficients which does not vanish, and let AI A =I, I = p. We can choose p so small that Ak+l |p+|Ak+2i p2+...+l|Aqj pnk< I-. Then P(z) -P (z)-A k <2 pk and | P() I < |P (,o)+A kI + 1pk. Now suppose that z moves round the circle whose centre is zo and radius p. Then P (Zo) +Ak (k moves k times round the circle whose centre is P(zo) and radius ] Akk I =ypk, and passes k times through the point in which this circle is intersected by the line joining P(zo) to the origin. Hence there are k points on the circle described by z for which I P(zo)+Akk I = I p(zo) I - pk and so I P(z) I <P(Zo) -~ Mpk= -~tpk; and this contradicts the hypothesis that in is the lower limit of I P (z) j. The preceding argument would fail if zo were on the boundary of y: but if y is a circle whose centre is the origin, and whose radius R is large enough, the last hypothesis is untenable, since I P(z) 1 + co with I l. In this case it must therefore be true that m=0, and our conclusion follows. EXAMPLES ON APPENDIX I. 1. Show that the number of roots of f()=0, which lie within a closed contour which does not pass through any root, is equal to the increment of {log/(z)}/27ri when z describes the contour. 2. Show that if R is any number such that all I a~2 Ila,l, R R2 Rib then all the roots of n + a lXn +...+ a,=O0 are in absolute value less than R. In particular show that all the roots of X5- 13x - 7=0 are in absolute value less than 2-1. 3. Determine the numbers of the roots of the equation x2+a+ax+b=O, where a and b are real and p odd, which have their real parts positive and negative. Show that if a > 0, b > 0 the numbers are p - 1 and p + 1, if a < 0, b>0 they are p + 1 and p-1, and if b < 0 they are p and p. Discuss the

Page 419

EXAMPLES ON APPENDIX I 419 particular cases in which a= or b =. Verify the results when p= 1. [Trace the variation of am (X2p + ax+ b) as x describes the contour formed by a large semicircle whose centre is the origin and whose radius is R, and the part of the imaginary axis intercepted by the semicircle.] 4. Consider similarly the equations x4q+ax+b=O, x4q-l+ax+b=0, x4q+l+ax+b=0. 5. Show that, if a and /3 are real, the numbers of the roots of the equation 2n + a2 X2-1+ 2 = 0 which have their real parts positive and negative are n - 1 and n +1 or n and n respectively, according as n is odd or even. (Math. Trip. 1891.) 6. Show that when z moves along the straight line joining the points Z=Z1, z=z2, from a point near zl to a point near Z2, the increment of am —+ -- - Z1 Z-Z2 is nearly equal to 7r. 7. A contour enclosing the three points z=zl, Z=Z2, Z=Z3 is defined by parts of the sides of the triangle formed by Z1, Z2, Z3, and the parts exterior to the triangle of three small -circles with their centres at those points. Show that when z describes the contour the increment of am 1 + 1- 1 + \z-1z z-Z2 z-XZ3 is equal to - 27r. 8. Prove that a closed oval path which surrounds all the roots of a cubic equation f(z) =0 also surrounds those of the derived equation f'(z)=O. [Use the equation f'(z)=/() _(2 -12+-21 +- ), where Zl, Z2, Z3 are the roots off(z) =0, and the result of Ex. 7.] 9. Show that the roots of f'(z) = O are the foci of the ellipse which touches the sides of the triangle (zl, 22, Z3) at their middle points. [For a proof see Cesaro's Elementares Lehrbuch der Algebraischen Analysis, p. 352.] 10. Extend the result of Ex. 8 to equations of any degree. 11. If f(z) and q (z) are two polynomials in z, and y is a contour which does not pass through any root of either, and I P (z) I < If(z) I for all points on y, then the numbers of the roots of the equations f/()=0, /(z) + ()=0 which lie inside y are the same.

Page 420

APPENDIX II. (To CHAPTERS IX, X.) A Note on Double Limit Problems. IN the course of Chapters IX and X we came on several occasions into contact with problems of a kind which invariably puzzle beginners and are indeed, when treated in their most general forms, problems of great difficulty and of the utmost interest and importance in higher mathematics. Let us consider some special instances. In ~ 195 we proved that log (1 +x)= x - 2 +- x3... by integrating the equation 1/(1 +t)=l - t+t2-... between the limits 0 and x, and what we proved amounted to this, that t-f dt- tdt+ t2dt-...; or 1 + t - o to 0 or in other words that the integral of the sum of the infinite series 1 - t+ t2 -..., taken between the limits 0 and x, is equal to the sum of the integrals of its terms taken between the same limits. Another way of expressing this fact is to say that the operations of summation from 0 to cc, and of integration from 0 to x, are commutative when applied to the function (- 1)n tn, i.e. that it does not matter in what order they are performed on the function. Again in ~~ 187, 194 we proved that the differential coefficient of the exponential function x2 exp x=l x+-!+... is itself equal to exp x, or that DX (l+x+!+..*)-D(1)+D.(x)+D. (2!)+*...; that is to say that the differential coefficient of the sum of the series is equal to the sum of the differential coeficients of its terms, or that the operations of summation from 0 to co and of differentiation with respect to x are commutative when applied to x/nz!. Finally, we proved incidentally in the same sections that the function expx is a continuous function of x, or in other words that lim (l+x+ +...2 +++...* lim( x+lim(.; x-(->f 2 - 2! X o_>_; x '~ \."

Page 421

APPENDIX II 421 i.e. that the limit of the sum of the series is equal to the sum of the limits of the terms, or that the sum of the series is continuous for x=, or that the operations of summation from 0 to oo and of making x-,- are commutative when applied to Xn/n!. In each of these cases we gave a special proof of the correctness of the result. We have not proved, and in this volume shall not prove, any general theorem from which the truth of any one of them could be immediately inferred. In Ex. xxxvIII. 1 we saw that the sum of a finite number of continuous terms is itself continuous: in ~ 94 that the differential coefficient of the sum of a finite number of terms is equal to the sum of their differential coefficients: and in ~ 144 we stated the corresponding theorem for integrals. Thus we have proved that in certain circumstances the operations symbolised by lim..., Dx..., f...dx x —) are commutative with respect to the operation of summation of afinite number of terms. And it is natural to suppose that, in certain circumstances which it should be possible to define precisely, they should be commutative also with respect to the operation of summation of an infinite number. It is natural to suppose so: but that is all that we have a right to say at present. A few further instances of commutative and non-commutative operations may help to elucidate these points. (1) Multiplication by 2 and multiplication by 3 are always commutative, for 2x3xx=3x2xx for all values of x. (2) The operation of taking the real part of z is never commutative with that of multiplication by i, except when z = 0. For ix R( iy) =ix, R {ix (x+iy)}=-y. (3) The operations of proceeding to the limit zero with each of two variables x and y, applied to a function f(x, y) may or may not be commutative. Thus lim {lim (x +y)}= 0, lx=0, lim {lim (x +y)} =limy=0; x —O y — 0 — o0 ' -O y —0 but on the other hand lim (lim y-f0 = lim x- lim (1) =1, x o0\yo X+y X-0 x x- — o lim (lim -!) = li m (- 1)= -1. Y-#0 x —,O y+ Sy-a-0 y — )0 (4) The operations 2..., lim... may or may not be commutative. Thus, 0 x —1 if x-l through values less than 1, -im { — x} — =lim log ( +x)= log 2, x-~ l t x-~l { lir -1 } X =log2: ~1 x —1 s 1 27-3 27-3

Page 422

422 APPENDIX II but on the other hand lim 1)} (X - X2)+ lim (1) = 1, x —l1 x-l; lim( - X +1) =(1-1) =: (O) = 0. 1 x —l 1 1 The preceding examples suggest that there are three possibilities with respect to the commutation of two given operations, viz.: (1) the operations may always be commutative; (2) they may never be commutative, except in very special circumstances; (3) they may be commutative in most of the ordinary cases which occur practically. The really important case (as is suggested by the instances which we gave from Ch. IX) is that in which each operation is one which involves a passage to the limit, such as a differentiation or the summation of an infinite series: such operations are called limit operations. The general question as to the circumstances in which two given limit operations are commutative is one of the most important in all mathematics. But to attempt to deal with general questions of this character by means of general theorems would carry us far beyond the scope of this volume. We may however remark that the answer to the general question is on the lines suggested by the examples above. If L and L' are two limit operations the numbers LL'z and L'Lz are not generally equal, in the strict theoretical sense of the word 'general.' We can always, by the exercise of a little ingenuity, find z so that LL'z.LLz. But they are equal generally, if we use the word in a more practical sense, viz. as meaning 'in a great majority of such cases as are likely to occur naturally' or in ordinary cases. Of course, in an exact science like pure mathematics, we cannot be satisfied with an answer of this kind; and in the higher branches of mathematics the detailed investigation of these questions is an absolute necessity. But for the present the reader may be content if he realises the point of the remarks which we have just made. In practice, a result obtained by assuming that two limit-operations are commutative is probably true: it at any rate affords a valuable suggestion as to the answer to the problem under consideration. But an answer thus obtained must, in default of a further study of the general question or a special investigation of the particular problem, such as we gave in the instances which occurred in Ch. IX, be regarded as suggested only and not proved. Detailed investigations of a large number of important double limit problems will be found in Mr Bromwich's book on Infinite Series.

Page 423

INDEX [The numbers refer to pages. References to the text are given in ordinary type (e.g. 329), to the examples in clarendon type (e.g. 339)] Abel's test for convergence, 329 Abel's theorem on the multiplication of series, 339 Algebraical functions: explicit, 41; implicit, 42; differentiation of, 200, 217 et seq.; integration of, 226 et seq., 246 et seq. - numbers: see Numbers, algebraical Amplitude of a complex number, 82 Areas of curves, 236 et seq., 239 et seq., 249; in polar coordinates, 241; proof of existence of, 271 et seq. Argand diagram, 81 Argument, of a point, 82; of a function, 377 Axes of coordinates, 29, 51; change of, 29 et seq. - of a conic, 63, 212 Binomial theorem, 256, 288, 334, 365 et seq., 368, 405 et seq. Cauchy's tests for convergence: general test, 297; integral test, 303, 305 et seq.; condensation test, 308 Cis 0, defined, 83 Classes, finite and infinite, 110 Coaxal circles, 89 Comparison theorems, for series, 297, 325; for integrals, 311, 319 Condition that three points lie on a straight line, 88 - that four points lie on a circle, 92, 93 Cone, 58 Conjugate complex numbers, 77 Contact of plane curves, 259 et seq. Continuity, 171 et seq.; geometrical illus tration of, 173; of sums and products, 173; of polynomials and rational functions, 173; of cos x and sin x, 174; of functions of two variables, 182 Continuous functions without derivatives, 187 et seq. Continuum, the, 15 et seq. Contour maps, 57 Convergence, circle and radius of, 333 Coordinates, 29, 51; polar, 34 cos zn and sin n0, formulae for, 95 et seq., 103; applications, 97, 98, 104 et seq. cos x, sin x, series for, 255, 400 Cross ratios, 92 Curvature, 261, 291 Cylinder, 56 d'Alembert's test for convergence, 298 Decimals, terminating, 3; infinite, 146 et seq. De Moivre's theorem, 82 et seq., 101, 390; applications, 96 et seq., 103 et seq. Derivative, 185 et seq.; of a constant, 190; of xrn, 190, 199, 351; of cos x and sin x, 190; of a sum, a product, etc., 192 etseq., 195; off(ax+b), 193; of an inverse function, 194,195; of a complex function, 194; of arc sin x, etc., 196, 201; of a polynomial, 196 et seq.; of a rational function, 198 et seq.; of an algebraical function, 200, 217 et seq.; of a transcendental function, 200 et seq., 217 et seq.; of tan x, etc., 200; meaning of sign of, 205; of a function of a function, 216; of a determinant, 242; of a function of two functions, 264 et seq.; of log x, 343; of ex, 349; of ax, 351; of exp (a+ ib)x}, 408

Page 424

424 INDEX Derivatives: general theorems concerning, 205 et seq.; discontinuous, 213; of functions of several variables, 262 et seq. —, higher: 202 et seq.; of a/(a2+x2) and x/(a2+x2), 204; of ea", etc., 353 Difference-equations, 337 Differential coefficient, 187; repeated, 202 et seq.; partial, 263; total, 264: see also Derivative - equations: of straight lines, circles, etc., 242, 243; 0"=-a2O, 249, 371 Dirichlet's test for convergence, 328 Dirichlet's theorem on the rearrangement of series, 301, 324, 330 Discontinuity, types of, 175 Displacements, 66 et seq. Double limit problems, 420 et seq. e, number, 347 et seq. Equation, of a locus in a plane: 30 et seq., 53 et seq.; of a straight line, 30; of a circle, 32, 61; of a conic, 63 -, of a locus in space: 51 et seq.; of a plane, 52; of a sphere, 53; of a surface in general, 55; of a cylinder, 56; of a cone, 58; of a surface of revolution, 58; of a ruled surface, 59 -, complete, 384 -, general: of first degree, 31, 52; of second degree, 63 -, quadratic: geometrical construction for roots of, 8; graphical solution of, 50; complex roots of, 79; with complex coefficients, 88, 102 -, cubic or biquadratic: 15 et seq., 42 et seq., 60, 102; with complex coefficients, 89, 102; repeated roots of, 197, 243 - x=f(x), 155 Equations: of a line or curve in space, 55; of a section of a cone, 62; of the tangent and normal to a curve, 190, 218 -, algebraical: rational roots of, 7, 20; graphical solution of, 51, 60; number of roots of, 80, 415 et seq.; repeated roots of, 197, 209, 243; approximation to roots of, by Newton's method, 253; proof of existence of roots of, 415 et seq. Equations, algebraical: with rational coefficients, irrational roots occur in pairs, 11, 86; with real coefficients, complex roots occur in pairs, 85; with complex coefficients, linear, 88, quadratic, 88, 102, cubic, 89 -, differential: see Differential equations -, functional, 292, 344, 344, 349, 350, 371, 384, 388 -, transcendental, 51, 61, 201, 209, 244, 254, 354 et seq., 371 et seq., 396 et seq., 408 et seq. Euler's Constant, 359 Euler's Theorem on homogeneous functions, 291 Expansions: see Power-series Exponential function, 348 et seq., 386 et seq.; graph of, 349; continuity of, 349, 367, 387; derivative of, 349, 367; functional equation satisfied by, 349, 388; order of infinity of, as x -- + o, 350, 356, 369, 372; representation of, as a limit, 351 et seq., 393; integrals involving, 353 etseq.; power-series for, 360 et seq., 399 et seq. - limit, 140 et seq., 154, 351 etseq., 39S - series, 300, 331, 335, 360 et seq., 373, 399 et seq. - theorem, 361 - values of cos x and sin x, 394 Factor theorem for complex numbers, 77 Formulae of reduction, 247 et seq., 316, 354 Fourier's integrals, 280 Function, continuous, 171 et seq.; fundamental property of, 175; range of values of, 180 et seq.; attains its upper and lower limits, 181; of several variables, 182 Functions, of a continuous real variable. 25 et seq.; of several variables, 51 et seq.; complex, of a real variable, 94; of a positive integral variable, 108 et seq.; of a complex variable, 376 et seq.

Page 425

INDEX 425 Functions, graphical representation of, 28 et seq., 51 et seq. -, increasing and decreasing, 135 et seq., 162, 171, in stricter sense, 179; inverse, 178 et seq.; independent and non-independent, 270; homogeneous, 291 -, expression of as limits, 149,150, 155 et seq. -, rational, 38 et seq.; algebraical, explicit, 41, implicit, 42; transcendental, 44 et seq.; trigonometrical or circular, 44 et seq., 394 et seq., inverse, 45 et seq., 398 et seq.; logarithmic, 222 et seq., 342 et seq., 380 et seq., 392; exponential, 348 et seq., 386 et seq.; ax, 350 et seq., 388 et seq.; hyperbolic, 355, 395, inverse, 356 -: see also Algebraicalfusnctions; Exponentialfunction; Function, continuous; Logarithmic function; Oscillation; Polynomials; Rational functions Geometrical series, 145 et seq., 153; allied series, 148 et seq., 154, 156 et seq., 299; tests of convergence derived from comparison with, 297 et seq.; rapidity of convergence of, 303 Harmonic relation of four points, 92 Homogeneous functions, 291 Hyperbolic functions, 355, 395; inverse, 356 Inequalities mxm-1 (x - a) xm - am am-l (x - a), 165 Infinity, 114 et seq.; of a function, 174, 175; scales of, logarithmic and exponential, 346, 350, 370 Inflexion, point of, 260 Integral, curvilinear: 377 et seq. —, definite: 277 et seq.; limits of, 278; calculation of, from indefinite integral, 279, direct, 280; general properties of, 281 et seq.; approximation to, 295; infinite, of first kind, 309 et seq., of second kind, 316 et seq.; finite, 310 Integral function, 220 -, indefinite, 278 - f x-dx, 307, 312; x-dx, etc., Ja Jo 317 et seq. Integrals connected with conics, 227 et seq., 246 et seq., 250 -, convergent and divergent, 309 et seq. Integration, 219 et seq.: arbitrary constant of, 221; of polynomials, 222; of rational functions, 222 et seq., 247, explicit, 225; of (Ax + B)/(ax2 +2bx + c), explicit, 224; of algebraical functions, 226 et seq., 246 et seq.; of Rix, ^/((IX2+2bx+c)}, 228 et seq., 246; of 1/,/(x2+a2), etc., 229; of ^/(x + a2), etc., 231; of xP (1 + x)q, 232, 248; of transcendental functions, 233 et seq.; of trigonometrical functions, 233 et seq.; of (cos x) (sin x), etc., 233, 248; of xl cosx, etc., 233, 248; of R (cos x, sin x), 234 et seq.; of tanx, etc., 234, 248; of 1/(a+bcosx), etc., 235, 249; of arc sin, etc., 235 et seq.; of logx, etc., 235 et seq., 249; of eaxcosbx, etc., 353 et seq.; of eaxxn, etc., 354; of exp {(a + ib) x}, 408 - by parts: 230, 234, 247 et seq.; for definite integrals, finite, 284 et seq., 287, infinite, 316, 320 -, by substitution: 226 et seq., 231 et seq., 234 et seq., 246 et seq., 354; for definite integrals, finite, 284 et seq., infinite, 314 et seq., 320, 321 et seq. -, range of, 278; subject of, 278; path of, 378 Interpolation, functional, 109 Irrational numbers: see Numbers, irrational Jacobians, 270 'Large values' of n, 111 et seq. Leibniz's theorem, 202

Page 426

426 INDEX Lengths of curves, 238 et seq.; in polar coordinates, 241; existence of, 276 Level curves, 412 et seq. Limit: of f (n), as n- oo, 116 et seq.; and value, 121, 166 et seq.; of a sum, a product, etc., 127 et seq., 161, 164; of an increasing or decreasing function, 135 et seq., 162, 171; of a complex function, 151 et seq.; of f (x), as x-.+c, 159 et seq., as x — -o, 160, as x —0, 162, as x —a, 163; of a function of a complex variable, 393 - as n -- a: of nk, 122; of R (n) and R{0 (n), (n),..., 134 et seq.; of xn, 138, 153; of {1 + (1/n)n, 140 et seq., 154, of 1l+(x/n)}n, 351 et seq., 393; of n'xn, n-rx", 140; of xi/n, 140; of nl/n, 140; of X//n!, 140; of (n!)1/n, 140; of is (xIl/- 1), 141 et seq., 352; of ( ) xt, 155; of xn, where xn1+=f(x), 155; of (sl + 2 +... + s)/n, 158; of f (n +1)-f(n) and { f(n) }/n, 158; of {f(n+ 1)}/{f(n)} and l/{f(}n)}, 356; of,(n!)/n, 356; of 1 1 1 +... + —logn, 2 ns etc., 359, 369; of n! (a/ln), 370 - as x -- 0: of xm, 165,168; of (sin x)/x, etc., 169 et seq.; of {log(l+x)}/sx, 344, 392 et seq.; of (a -- 1)/x, 350, 351 - as x-.a: of xm, 165, 166, 173; of P (x), R (x), 165; of (.xa- an)/( - a), 168 -, logarithmic, 141 et seq., 352; exponential, 140 et seq., 154, 351 et seq., 393; Euler's, 359, 369 Limits, geometrical illustrations of definitions of, 119, 161; calculation of by differentiation, 257 et seq. -, upper and lower, of a function, 180 et seq.; of an integral, 278 Locus, in a plane, 31 et seq.; in space, 54 et seq. Logarithm, 342 et seq.; common, 353; of a complex number, 380 et seq.; principal value of, 382; of a negative number, 384, 385 et seq.; of a complex number to any base, 392: see also Log arithmic function; Logarithmic limit; Logarithmic series Logarithmic function, 222 et seq., 341 et seq., 380 et seq., 392; graph of, 343; order of infinity of, as x --- + oo, 344 et seq., 369, 373; functional equation satisfied by, 344, 384; representation of, as a limit, 142, 352; power-series for, 363 et seq., 402 et seq. - - and inverse trigonometrical functions, 398 et seq. - limit, 142, 352 - scale of infinity, 346 - series, 363 et seq., 402 et seq. - tests for convergence, 357 et seq. Maclaurin's integral test for convergence, 303, 305 et seq. Maclaurin's series, 255: see also Taylor's series Maxima and minima, 206 et seq., 256 et seq.; discrimination between, 207, 257; occur alternately, 208; examples of, 209 et seq., 244; of (ax2+2bx +c )(A2 2+2Bx + C), 210 et seq. Mean Value Theorem, 214 et seq.; of second order, 252; general, 252; for functions of two variables, 267; for integrals, first, 282, generalised, 282, second, 285, Bonnet's form, 286 Measure of curvature, 261 Mercator's projection, 410 Modulus, 82; of a product, 83; of a sum, 84, 86, 324 Multiplication of series, 301 et seq., 334, 335, 339, 373 ' n - o ', 114 et seq. Newton's method of approximation to the roots of an equation, 253 Normal to a curve, 190, 218 Number,,/2, 5 et seq., 12 et seq.; 7r, 17, 65; i, 77 et seq.; e, 347 et seq. -, infinite, 111 et seq. Numbers, algebraical, 23 et seq.: see also Numbers, irrational; Quadratic surds; Surds -, complex, 75 et seq.; equivalence,

Page 427

INDEX 427 addition, multiplication, and division of, 75 et seq.; conjugate, 77; factor theorem for, 77; real and imaginary parts of, 82; modulus and amplitude of,. 82; rational functions of, 84 et seq.; geometrical applications of, 86 et seq., 101 et seq.; trigonometrical applications of, 95 et seq.; roots of, 98 et seq. Numbers, irrational, 4 et seq.; expression of as decimals, 147: see also Numbers, algebraical; Quadratic surds; Surds -, rational, 1 et seq.; expression of as decimals, 3, 146 et seq. Orders of smallness and greatness, 168 et seq., 183 et seq., 344 et seq., 350 Oscillation, of a function of n, 123 et seq.; finite and infinite, 124; examples of, 124 et seq.; of sin nOr, etc., 125; theorems connected with, 126, 129 et seq. -, of a function of x, 160 et seq. Partial fractions, 198 et seq., 222 et seq. Polygons, regular, constructions for, 100 Polynomials, 35 et seq.; having given values, 61; in a complex variable, 80; limits of, as x —a, 165; continuity of, 173; differentiation of, 196 et seq.; integration of, 222; in x, cos ax, etc., integration of, 233 et seq., 248; in x, log x, etc., integration of, 236, 249; in x, eax, etc., integration of, 354 Power-series, 331 et seq.; circle and radius of convergence of, 332 et seq.; uniqueness of, 333; multiplication of, 334; recurring, 337 et seq. -: Taylor's and Maclaurin's, 255, 287 et seq.; for cos x and sin x, 255, 400 et seq.; for arc tan x, 364, 404: see also Binomial theorem; Exponential series; Logarithmic series Principal value, of am z, 82; of Log z, 382; of aZ, 390; of eZ, 391 Pringsheim's theorem, 304 et seq., 309 analogue of, for integrals, 314 Quadratic surds, 5 et seq.; geometrical constructions for, 8 et seq., 64; theorems concerning, 9 et seq.; approximation to, by binomial theorem, 366 Quadrature of circle, approximate, 65 Radius of convergence, 333 - of curvature, 261 Rates of variation, 188 et seq. Rational functions, 38 et seq., 61; of a complex variable, 84 et seq.; of 0 (n), V (n), etc., limits of, as n -- oo, 134; of n, limits of, as n -a- oo, 135; of x, limits of, as x - a, 165; continuity of, 173; differentiation of, 198 et seq.; integration of, 222 et seq.; of cos x and sinx, integration of, 234; of eX, integration of, 354 - numbers: see Numbers, rational Rationalisation, integration by, 226 et seq., 231 et seq., 234 et seq., 246 et seq., 354 Rearrangement of series, 301, 324, 327, 328, 330 Recurring series, 337 et seq. Rolle's theorem, for polynomials, 198; in general, 205 Roots: see Equations Scales of infinity, 346, 350, 370 Series, infinite: 142 et seq., 296 et seq., 322 et seq.; convergence, divergence, and oscillation of, 143; general theorems concerning, 144 et seq.; harmonic, 144, 149, 300, 305, 308, 335, 359 et seq., 369 et seq.; arithmetic, 148 -, -, of positive terms: 145, 296 et seq.; comparison theorem for, 297; Cauchy's and d'Alembert's tests of convergence for, 297 et seq.; Pringsheim's theorem concerning, 304 et seq.; integral test of convergence for, 305 et seq.; condensation test of convergence for, 308 -,, of positive and negative terms: 322 et seq.; absolutely convergent, 323 et seq.; conditionally convergent, 325 et seq.; alternating, 326; Abel's

Page 428

428 INDEX and Dirichlet's tests of convergence for, 328 et seq. Series, infinite, of complex terms: 330 et seq.; absolutely convergent, 330 1 1 -, - special: -, 1 b, 144, 149, s n an+b' 300, 305, 308, 335, 359 et seq., 369 etseq.;.. 156, 299; n (n + l)...(n +- k) ' Z R (n), 299, 335; Zn-8, 299, 307 et seq., 309; Za cos n, Za sin n, cosn0 324, 328 et seq., 329; (- 1)n-1 n sinn0 Z( )n-1 in, etc., 404, 414, —:, Z nx'n, ns P (n) x>, / R (n) x", X Xn Xn X2n Z(-l) -ln, z n Zn' Z(-1)a l' X2n+l /Wrn' z(- 1) 2n+1)' ( xI ", see other (2n+ 1)! \n entries mentioned below; I xz1 cos nO, X2n+l Z xn sin n0, 153, 154; Z ( - 1) 2n — ' 364, 404; ZP(n)xn! 362; Z P(n) x8, 367 -, -: see also Binomial theorem; Exponential series; Geometrical series; Logarithmic series; Multiplication of series; Power-series; Rearrangement of series; Taylor's series Simpson's Rule, 295 Stereographic projection, 410 Surds, 14 et seq., 21 et seq.; approximation to, by binomial theorem, 366: see also Numbers, algebraical; Quadratic surds Surface, 54; of revolution, 58; ruled, 59 Symmetric functions of the roots of a trigonometrical equation, 96 et seq. Tangent to a curve, 185 et seq.; equation of, 190, 218 Taylor's series, 255, 287 et seq.; remainder, Lagrange's form, 255, Cauchy's form, 288; for derivative or integral, 373 Taylor's theorem, 252 et seq., 287 et seq. Transformation, 90 et seq., 104 et seq., 409 et seq.; z = (aZ + b)/(cZ + d), 90 et seq., 104; z=Zm, 94; z=-{Z+(1/Z)},105; az2 + 2hzZ + bZ2 + 2gz + 2fZ + c=0, 105; z=exp (7rZ/a), z=ccosh (rZ/a), etc., 409 et seq.; z=Zi, 411 Triangles, geometrical properties of, 86, 87, 101, 419 Unity, roots of, 99 et seq. Variable, continuous real, 18 et seq.; positive integral, 19, 108 et seq. Velocity, 189 CAMBRIDGE: PRINTED BY JOHN CLAY, M.A. AT THE UNIVERSITY PRESS.

Do you have questions about this content? Need to report a problem? Please contact us.