F X y G X h Y Where G and H Are Continuously Differentiable Functions of One Variable
Results from Elementary Analysis
R.F. Hoskins Research Professor , in Delta Functions (Second Edition), 2011
1.4.3
For differentiable functions in general the following results hold:
- (i)
-
If u and v are differentiable functions, and a and b are constants, then w = au + bv is differentiable and
- (ii)
-
If u and v are differentiable then so also is the product function uv and
Similarly
- (iii)
-
If y = f(x) where x = g(t) then, provided the derivatives concerned do exist, we have
- (iv)
-
If y = f(t) is a differentiable function with a well-defined inverse function, t = f –1(y), then
- (v)
-
Rolle's Theorem. Let f be continuous on the closed interval [a, b] and differentiable at least in the open interval (a, b). If f(a) = f(b) = 0, then there exists a point t 0 in (a, b) such that f′(t 0) = 0.
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/B9781904275398500017
Differentiation—Applications
Colin McGregor , ... Wilson Stothers , in Fundamentals of University Mathematics (Third Edition), 2010
9.5 Rates of Change
If u is a differentiable function of s, we sometimes refer to as the rate of change of u with respect to s. For example, if A is the area of a circle of radius r, so that A = πr 2, then = 2 πr is the rate of change of the area with respect to the radius. If v is a differentiable function of time t then , which is often written , is simply called the rate of change of v.
The reason for the terminology can be understood by considering a particle moving on the x-axis. The rate of change of the position of the particle is its velocity. Suppose the x-coordinate of the particle at time t is x(i) where x is a differentiable function of t. The average velocity between times t and t + h (h ≠ 0) is
See Figure 9.5.1. The limit of this quotient, as h approaches 0, is the velocity of the particle at time t. It is also . Thus
Figure 9.5.1.
If x is a twice differentiable function of t then, taking the rate of change of the velocity of the particle to be its acceleration and arguing as above, we get
Note the extended use of the 'dot notation' for differentiation with respect to t.
Example 9.5.1 At time t the x-coordinate of a particle moving on the x-axis is given by
Find (a) the position and velocity of the particle when its acceleration is 0, (b) the acceleration when the velocity is 0.
Solution Here,
(a) The acceleration is 0 when t = 2. Then
(b) The velocity is 0 when t = 1 or t = 3. When t = 1,
and when t = 3,
□
Example 9.5.2 An object is propelled, from ground level, vertically upwards with an initial velocity u. The deceleration of the object, due to gravity, is g (constant). Disregarding any other factors, when will the object return to earth?
Solution Let s, v and a be, respectively, the height, velocity and acceleration of the object at time t. Then
Hence, using Corollary 9.4.8,
But v(0) = u. So K = u giving
Hence, using Corollary 9.4.8,
But s(0) = 0. So C = 0 giving
Thus the object returns to earth at time t = 2u/g. □
In problems on related rates, one rate of change, (say), is given and another, (say), has to be found. The first step is to relate u and v. This might be possible in one of several ways. For example, u = or v = or u2 + v2 = 1 or u = cos w, v = sin w where w is a third function of t. Then differentiating with respect to t relates and .
Example 9.5.3 Water is running out of a conical funnel at the rate of 0∙002 cubic metres per second. The radius of the top of the funnel is 0∙25 metres and its height is 0∙5 metres. At what rate is the water level changing when it is 0∙3 metres from the top?
Solution Let r and h be, respectively, the radius and height of the surface of the water at time t. See Figure 9.5.2. Let V be the volume of water left in the tank at time t.
Figure 9.5.2.
Here we are given = –0∙002 and have to find .
By similar triangles,
Hence,
so that, differentiating with respect to t,
So, when h = (0∙5 – 0∙3) = 0∙2,
i.e. the water level is dropping at 0∙2/π metres per second. □
Example 9.5.4 A weight W is being raised vertically by a rope through a block B, 30 metres above a point A at ground level. The rope is 70 metres long and is held by a man M who walks away from A at 0∙75 metres per second. How fast is the weight rising when it is 10 metres above A?
Solution We ignore the dimensions of the weight and the block (pulley), the thickness of the rope and the height of the man. Let x and y be the distances from A to M and W, respectively. See Figure 9.5.3.
Figure 9.5.3.
Here we are given = 0∙75 and have to find .
From triangle ABM, we get
Differentiating with respect to time t gives.
Hence, when y = 10 so that x = 40 (see Figure 9.5.3),
i.e. the weight is rising at 0∙6 metres per second. □
Example 9.5.5 The volume of a spherical balloon is increasing at the rate of 0∙05 cubic metres per second. How fast is the surface area increasing when the radius is 0∙4 metres?
Solution Let V, S and r be the volume, surface area and radius, respectively, at time t.
Here we are given = 0∙05 and have to find .
So, differentiating with respect to t,
Hence, when r = 0∙4,
i.e. the surface area is increasing at 0∙25 square metres per second. □
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/B9780857092236500092
Random Matrices
In Pure and Applied Mathematics, 2004
Lemma 2.6.2
If three continuous and differentiable functions f k (x), k = 1, 2, 3, satisfy the equation
(2.6.1)
then they are necessarily of the form a ln x + bk (k = 1, 2, 3), with b 1 = b 2 + b 3.
Proof. Differentiating (2.6.1) with respect to x, we have
which, on integration with respect to y, gives
(2.6.2)
where g(x) is still arbitrary. Substituting f 1(xy) from (2.6.2) into (2.6.1),
(2.6.3)
Therefore the left-hand side of (2.6.3) must be independent of x; this is possible only if
that is, only if
where a, b 2 and b 3 are arbitrary constants.
Now (2.6.3) gives
and finally (2.6.1) gives
Let us now examine the consequences of the statistical independence of the various components of H. Consider the particular transformation
(2.6.4)
where
(2.6.5)
or, in quaternion notation (provided N is even),
(2.6.6)
This U is, at the same time, orthogonal, symplectic, and unitary.
Differentiation of (2.6.4) with respect to θ gives
(2.6.7)
and by substituting for U, UT , ∂U/∂θ and ∂UT /∂θ from (2.6.5) or (2.6.6) we get
(2.6.8)
where
(2.6.9)
or, in quaternion notation, A is diagonal.
(2.6.10)
If the probability density function
(2.6.11)
is invariant under the transformation U, its derivative with respect to θ must vanish; that
(2.6.12)
Let us write this equation explicitly, say, for the unitary case. Equations (2.6.8) and (2.6.12) give
(2.6.13)
The braces at the left-hand side of this equation depend on mutually exclusive sets of variables and their sum is zero. Therefore each must be a constant; for example,
(2.6.14)
On dividing both side of (2.6.14) by H (0) 1k H (0) 2k and applying the Lemma 2.6.2, we conclude that the constant must be zero, that is,
(2.6.15)
which on integration gives
(2.6.16)
In the other two cases we also derive a similar equation. Now because the off-diagonal elements come on as squares in the exponential and all invariants are expressible in terms of the traces of powers of H, the function P H) is an exponential that contains traces of at most the second power of H.
Because P(H) is required to be invariant under more general transformations than we have here considered, one might think that the form of P(H) is further restricted. This, however, is not so, for
(2.6.17)
is already a product of functions, each of which depends on a separate variable. Moreover, because we require P(H) to be normalizable and real, a must be real and positive and b and c must be real.
Therefore we have proved the following theorem (Porter and Rosenzweig, 1960a).
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/S0079816904800928
Differential Calculus with Several Independent Variables
Robert G. Mortimer , in Mathematics for Physical Chemistry (Fourth Edition), 2013
8.7 Maximum and Minimum Values of Functions of Several Variables
A point at which either a maximum or a minimum value in a function occurs is called an extremum. For example, Figure 8.3 shows a perspective view of a graph of the function . The surface representing the function has a "peak" at the origin, representing a maximum value of the function.
Figure 8.3. The surface representing the function with the absolute maximum and a constrained maximum shown.
The figure also shows a curve at which the surface intersects with a plane representing the equation . On this curve there is also a maximum, which has a smaller value than the maximum at the peak. We call this value a constrained maximum subject to the constraint that . We discuss the constrained maximum later.
The maximum at the origin in Figure 8.3 is called a local maximum or a relative maximum, because the value of the function at such a peak is larger than at any other point in the immediate vicinity. However, a complicated function can have more than one local maximum. Also, if we consider a finite region, the function might have a larger value somewhere on the boundary of the region that is larger than the value at a local maximum. To find the absolute maximum of the function for a given region, we must consider all local maxima and any points on the boundary of the region that might have greater values. The peak at the origin in Figure 8.3 is the absolute maximum of this function. Points of minimum value are completely analogous to points of maximum value.Local minima are located at the bottom of depressions or valleys in the surface representing the function. To find an absolute minimum for a given region, you must consider all local minima and any points on the boundary of the region that might have smaller values.
To locate a local maximum or minimum, we use the fact that the plane that is tangent to the surface will be horizontal at any local maximum or minimum. Therefore, the curve representing the intersection of any vertical plane with the surface will have a local maximum or a local minimum at the same place. The partial derivative with respect to one independent variable gives the slope of the tangent to the curve in the plane corresponding to a constant value of the other independent variable, so we can find a local maximum or minimum by finding the places where all the partial derivatives of the function vanish simultaneously.
Our method for a differentiable function of two variables is therefore to
- 1.
-
Solve the simultaneous equations
(8.46)
- 2.
-
Calculate the value of the function at all points satisfying these equations, and at the boundaries of the region being considered and any cusps or discontinuities. The maximum or minimum value in the region being considered must be in this set of values.
Example 8.14
Find the maximum value of the function shown in Figure 8.3, .
At a local maximum or minimum
The only solution for finite values of x and y is , . Since no restricted region was specified, we consider all values of x and y. For very large magnitudes of x or y, the function vanishes, so we have found the desired absolute maximum, at which .
In the case of one independent variable, a local maximum could be distinguished from a local minimum or an inflection point by determining the sign of the second derivative. With more than one variable, the situation is more complicated. In addition to inflection points, we can have points corresponding to a maximum with respect to one variable and a minimum with respect to another. Such a point is called a saddle point, and at such a point, the surface representing the function resembles a mountain pass or the surface of a saddle. Such points are important in the transition-state theory of chemical reaction rates.
For two independent variables, we calculate the following quantity:
(8.47)
The different cases are as follows:
- 1.
-
If and , then we have a local minimum.
- 2.
-
If and , then we have a local maximum.
- 3.
-
If , then we have neither a local maximum nor a local minimum.
- 4.
-
If , the test fails, and we cannot tell what we have.
Exercise 8.12
Evaluate D at the point (0, 0) for the function of the previous example and establish that the point is a local maximum.
For more than two independent variables, the method is similar, except that there is one equation for each independent variable.
8.7.1 Constrained Maximum/Minimum Problems
Sometimes we must find a maximum or a minimum value of a function subject to some condition, which is called a constraint. Such an extremum is called a constrained maximum or a constrained minimum. Generally, a constrained maximum is smaller than the unconstrained maximum of the function, and a constrained minimum is larger than the unconstrained minimum of the function. Consider the following example:
Example 8.15
Find the maximum value of the function in the previous example subject to the constraint .
The constraint corresponds to the specification of y as a function of x by
This function is given by the line in the x–y plane of Figure 8.3. We are now looking for the place along this curve at which the function has a larger value than at any other place on the curve.
Since y is a function of x on the curve, the direct way to proceed is to replace y by :
Since f is now a function only of x, the local maximum is now at the point where vanishes:
This equation is satisfied by and by . The constrained maximum corresponds to and the minimum corresponds to . At the constrained maximum and the value of the function at the constrained maximum is
As expected, this value is smaller than the unconstrained maximum, at which .
8.7.2 Lagrange's Method of Undetermined Multipliers 3
If we have a constrained maximum or minimum problem with more than two variables, the direct method of substituting the constraint relation into the function might not be practical. Lagrange's methodfinds a constrained maximum or minimum without substituting the constraint relation into the function. If the constraint is written in the form , the method for finding the constrained maximum or minimum in is as follows:
- 1.
-
Form the new function
(8.48)
where is a constant called an undetermined multiplier. Its value is unknown at this point of the analysis. - 2.
-
Form the partial derivatives of u, and set them equal to zero,
(8.49)
(8.50)
- 3.
-
Solve the set of equations consisting of and these two equations as a set of simultaneous equations for the value of x, the value of y, and the value of that correspond to the local maximum or minimum.
Example 8.16
Find the constrained maximum of the previous example by the method of Lagrange.
The constraint equation is written
The function u is
The three equations to be solved are
We begin by solving for in terms of x and y. We multiply the first equation by y and the second equation by x and add the two equations. The result can be solved to give
Substitute this into the first equation to obtain
The exponential factor is not zero for any finite values of x and y, so
When the expression for is substituted into the second equation in the same way, the result is
The difference of these two equation is
which is solved for y in terms of x to obtain
This is substituted into the third simultaneous equation to obtain
which gives
This is the same result as in the previous example. In this case, the method of Lagrange was more work than the direct method. In problems involving more variables, the method of Lagrange will usually be easier.
Exercise 8.13
- (a)
-
Find the local minimum of the function
- (b)
-
Without using the method of Lagrange, find the constrained minimum subject to the constraint,
- (c)
-
Find the constrained minimum using the method of Lagrange.
The method of Lagrange also works if there is more than one constraint. Assume that we desire the local maximum or minimum of the function
(8.51)
subject to the two constraints
(8.52)
and
(8.53)
The procedure is similar, except that two undetermined multipliers are used. One forms the function
(8.54)
and solves the set of simultaneous equations consisting of Eqs. (8.52) and (8.53), and the three equations:
(8.55)
(8.56)
(8.57)
The result is a value for , a value for , and values for , and z which locate the constrained local maximum or minimum.
Example 8.17
Find the minimum in the function
subject to the constraints
We form the function
We need to solve the equations
plus the two constraint equations. This is a simple case, since the first equation gives
and the constraint equations give the values of y and z. The constrained minimum occurs at
The value of the function at this point is .
Exercise 8.14
Find the minimum of the previous example without using the method of Lagrange.
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/B9780124158092000082
Differentiation
Agamirza Bashirov , in Mathematical Analysis Fundamentals, 2014
7.2 Differentiation and Continuity
The following establishes that the differentiable functions are more delicate than the continuous functions.
Theorem 7.4
If the function is differentiable at , then it is continuous at .
Proof
Take any . Then there exists such that and imply
Let . Then and imply
Thus, . This proves the theorem.
Example 7.5
The converse of Theorem 7.4 fails. Indeed, the function is continuous at (see Example 5.5). But does not exist, because for the function
we have and , implying the nonexistence of . The graph of the function is given in Figure 7.2. One can observe that at the point the graph of this function has a "corner," which is typical for being continuous and nondifferentiable.
In 1872 Weierstrass shocked the mathematical world by constructing an example of a function that is continuous at every , being differentiable nowhere. Another such pathological example was known by Bolzano earlier. Since that time, several such functions have been constructed. Next we present one of them due to Van der Waerden. 64
Example 7.6
The idea of the continuous nowhere differentiable function that we are going to construct consists of increasing the number of "corner points" on the graph of the function from Example 7.5. For this, define the function
and extend it to by , where and . Then is a continuous periodic function with the period that is not differentiable at the points , where . Moreover, satisfies for every . Let
and define
(7.3)
By Theorems 3.30 and 3.34 Theorem 3.30 Theorem 3.34 , the series in Eq. (7.3) converges for every sinceTherefore, the function is well defined for every . Figure 7.3 illustrates how the number of "corner points" on the graphs of the functions increases and how this effects the graph of . We assert that the function is uniformly continuous and nowhere differentiable on .
For uniform continuity, note that each of the functions is uniformly continuous on . Indeed, for , we can take . Then from , we deduce
This proves the uniform continuity of . Furthermore, for , we have
For , we can take sufficiently large such that and let . Then from , we obtain
This proves the uniform continuity of .
To prove that is nowhere differentiable on , assume the contrary: there exists at which is differentiable. Then for every sequence with the terms ,
We will deduce a contradiction by construction of a sequence for which the preceding limit does not exist. Since has the period 1, for every and ,
Fix . Then for ,
where is a possible value of . Therefore,
On the other hand, implies
Therefore, at least one of the equalities
and
holds for every . Let if the first of the preceding equalities hold. Otherwise, let . Then
Thus, if exists, then , where the terms of the series are either or . By Corollary 3.27, this series diverges, contradicting the existence of .
Figure 7.2. Function .
Figure 7.3. Example 7.6.
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/B978012801001300007X
Processing, Analyzing and Learning of Images, Shapes, and Forms: Part 2
Christos Sakaridis , ... Petros Maragos , in Handbook of Numerical Analysis, 2019
3.1 Geometric gradient approximation on graphs
The first term of level set-based active contour evolution models to be approximated is the gradient of the bivariate embedding function. A general method is thus developed in Sakaridis et al. (2017) for calculating the gradient of a real-valued, bivariate function that is implicitly defined on a continuous domain, although its values are known only at a finite set of 2D points, which constitute the vertices of the graph. The context is the same as in the preceding work of Drakopoulos and Maragos (2012).
3.1.1 Formulation
Compared to the gradient approximations proposed in Drakopoulos and Maragos (2012), the geometric gradient approximation of Sakaridis et al. (2017) leverages the local spatial configuration of vertices in the formulation of the approximation. More specifically, it introduces the concept of the neighbour angle, i.e., the angle around a vertex which is "occupied" by each of its neighbours. The motivation for this approach comes from the following lemma in bivariate calculus.
Lemma 1
The gradient of a differentiable function at point x is
(3)
where e ϕ is the unit vector in direction ϕ and D ϕ u(x) is the directional derivative of u at x in this direction, defined by
Based on Lemma 1, the goal is to approximate the gradient at a vertex of the graph by substituting the integral
(4)
with a sum over all the neighbours of the vertex. To this end, let us first introduce several key concepts.
The Euclidean distance between vertices v and w of a graph is denoted by d(v, w) and the unit vector in the direction of the edge vw starting at v is denoted by e vw . We define ϕ(w) ∈ [0, 2π) as the angle between the vector e vw and the horizontal axis, as in Fig. 1. A vertex v will be alternatively denoted by v to declare its position vector. Moreover, we denote by the set of neighbours of v in , with cardinality N(v). For the sake of brevity in notation, this cardinality will be written simply as N. We write so that the angles ϕ(w i ) are in ascending order. Based on this ordering, we define the angle around v "occupied" by w i , which is called neighbour angle, as
Fig. 1. Angles ϕ(w i ), Δϕ(w i ) and ω(w i ).
From Sakaridis, C., Drakopoulos, K., Maragos, P. 2017. Theoretical analysis of active contours on graphs. SIAM J. Imaging Sci. 10 (3), 1475–1510.(5)
In a similar fashion, we define the angle corresponding to the bisector between two consecutive neighbours as
(6)
A visual representation of the neighbour angle is provided in Fig. 1. Using the above notation, the geometric gradient approximation at v is given by the following formula:
(7)
The directional derivative term in (3) is approximated by the difference quotient of the function along each edge. On the other hand, the angle differential is handled through the neighbour angles, which effectively constitute a Voronoi tessellation of the circle around v, created from its neighbours. The reasoning behind this approach is to use information about the change of u along each particular direction that comes from the neighbour which is closest to this direction.
If the neighbour angles were not taken into account, we would place equal importance on all neighbours of v and return to an approximation similar to the weighted sum that was introduced in Drakopoulos and Maragos (2012):
(8)
3.1.2 Convergence for random geometric graphs
In the remaining theoretical analysis of this section, we mainly focus on a certain type of graphs, namely random geometric graphs (Penrose, 2003).
Definition 1
A random geometric graph (RGG) is comprised of a set of vertices and a set of edges. The set consists of n points distributed uniformly at random and independently in a bounded region . The set of edges is defined through the radius ρ(n) of the graph: an edge connects two vertices v and w if and only if their distance is at most ρ(n), i.e., d(v, w) ≤ ρ(n).
An instance of an RGG is given in Fig. 2. For RGGs, the approximation of (7) converges in probability to the true value of the gradient as the number of vertices increases, under some conditions on the radius which constrain the density of the graph. Before stating the related theorem, we remind the reader of some definitions for the asymptotic notations which are used in the analysis.
Fig. 2. A random geometric graph embedded in D = [0, 1]2, with n = 80 vertices and radius ρ = 0.25.
From Sakaridis, C., Drakopoulos, K., Maragos, P. 2017. Theoretical analysis of active contours on graphs. SIAM J. Imaging Sci. 10 (3), 1475–1510.Definition 2
Let f and g be two nonnegative functions. Then,
Theorem 1
Let be a differentiable function and an RGG embedded in D = [0, 1]2, with . For every vertex v of , the gradient approximation of (7) converges in probability to ∇u(v).
The proof of Theorem 1 is provided in Sakaridis et al. (2017).
3.1.3 Asymptotic analysis of approximation error
Note that Drakopoulos and Maragos (2012) also proved that two of their gradient approximations converge in probability to the true value of the gradient. Going one step further, Sakaridis et al. (2017) obtain an asymptotic bound on the rate of convergence to the true gradient as the size of the RGG grows large. Since the framework in RGGs is stochastic, their result involves the expectation of the approximation error.
Let us denote the error in approximating with
(9)
by . Comparing the two expressions, we deduce that the approximation with is threefold:
- 1.
-
Directional derivatives along edges are approximated with difference quotients.
- 2.
-
The approximate value for the directional derivative along each edge is used as a constant estimate for all the directions "falling into" the respective neighbour angle.
- 3.
-
The unit vector in the direction of each edge is also used for all the directions corresponding to the respective neighbour angle.
The calculation of the error that is performed in the proof of the following theorem involves construction of intermediate expressions between and , bounding the magnitudes of the resulting differences and combining these individual bounds using the triangle inequality.
Theorem 2
Let be a differentiable function and an RGG embedded in D = [0, 1]2, with . For the gradient approximation error at every vertex v of , it holds that
(10)
The full proof of Theorem 2 is given in Sakaridis et al. (2017). The radius of the graph is effectively the factor that determines the strictness of the asymptotic bound. To provide better intuition, we study the case when . Substituting in (10), we obtain
(11)
The strictest upper bound is , it is achieved for a = 1/3 and it constitutes a trade-off between minimizing the first error term, which calls for small radii, and the other two terms, which require more neighbours and consequently larger radii. On the other hand, when a∉(0, 1/2), the conditions of Theorems 1 and 2 for ρ(n) are not met and convergence to the true value of the gradient is not guaranteed in general. For instance, for a = 0, we get ρ(n) ∈ Θ(1), which means that the radius does not approach 0 in the limit. In turn, this implies that the difference quotients are not guaranteed to converge to the respective directional derivatives, since the distance d(v, w) ≤ ρ(n) does not go to 0 in the limit. In addition, for a = 1/2, it holds that ρ(n) ∈Θ(n −1/2) and hence ρ 2(n) ∈ Θ(1/n). As a result, the expected number of neighbours is E[N] = (n − 1)πρ 2(n) ∈ Θ(1). In other words, the sum is finite in the limit, which forbids convergence to the integral .
In Theorems 1 and 2, the domain D of the RGG is assumed to be the unit square. It is, however, straightforward to generalize the results of both theorems to arbitrary rectangular regions, since their proofs in Sakaridis et al. (2017) only use this assumption for calculating the area of D. This generalization involves applying a uniform scaling to both coordinates by , so that the transformed region has unitary area. The values of u are also scaled by the same factor. These steps ensure that all terms in (7) remain unaffected by the transformation. The two theorems are then applicable to the transformed input. In order to transfer the results back to the original input, one just needs to scale the radius ρ by the constant factor , which leaves the asymptotic bounds in both theorems unaffected.
3.1.4 Practical application
Despite convergence of the geometric gradient approximation to the true value of the gradient in the case of RGGs, in practice there is a nonnegligible error for graphs with finite number of vertices. This error is propagated to the embedding function of the active contour after each update and may be accumulated after several iterations. To mitigate this, Sakaridis et al. (2017) apply smoothing filtering on the approximate gradient values at a local, neighbourhood level as an empirical means to eliminate potential outliers by taking into account the values at neighbouring vertices. This smoothing is also applicable to curvature, as we discuss in Section 3.2. The smoothing filter can be either an average or a median filter, receiving as input the set of function values at the vertex itself and all its neighbours. In the case of curvature this is straightforward, while for gradient, each of the two vector components are filtered separately. Application of smoothing filtering is statistically motivated for the case of smooth (differentiable) functions by showing that using the neighbours of a vertex to form an ensemble of estimators of the approximated quantity at that vertex reduces the variance of the estimation compared to the basic approximation while not changing the bias.
Experimental validation of the gradient and curvature approximations as well as the aforementioned smoothing filtering is performed by using closed-form functions defined on RGGs. In the experiments that follow in the rest of Sections 3 and 5, the radius of an RGG is chosen as ρ(n) = 0.6n −1/3 unless otherwise mentioned, so as to achieve the strictest asymptotic bound for gradient approximation error according to the results of Section 3.1.3. Besides, in Sections 3.1.4 and 3.2.2, all RGGs are embedded in [0, 1]2. For each graph, the function's gradient and the curvature of its level sets are approximated at each vertex and afterwards the results are filtered with an average or median filter. The analytical expressions of the function's gradient and curvature are then compared to the estimates. Performance is measured using a global, graph-level error metric which is called relative error and denoted by e r . The error at each individual vertex of the graph is defined as the difference between the approximate value and the true analytical value, and the relative error is simply the ratio of the energy of the error signal to the energy of the true signal on the entire graph:
(12)
The relative error of the geometric gradient approximation is evaluated for an isotropic Gaussian on RGGs whose size ranges from 1000 to 10,000 vertices. The analytical form of the Gaussian is
(13)
with σ = 0.25 and x 0 = y 0 = 0.5. Fig. 3A shows average values of e r over 10 different graphs for each size to reduce variance in the reported performance. Using either an average or a median filter reduces the relative error substantially irrespective of size. Based on this result, smoothing filtering is generally applied for gradient in practice when performing active contour evolution.
Fig. 3. (A) Relative error of gradient approximations for a Gaussian function defined on RGGs of increasing size. The geometric approximation with no filtering is compared to its filtered versions with an average or median filter. (B) Relative error of curvature approximations for a conic function defined on RGGs of increasing size. All four combinations of type of approximation (geometric or gradient-based) and smoothing filter (average or median) are compared.
Modified from Sakaridis, C., Drakopoulos, K., Maragos, P. 2017. Theoretical analysis of active contours on graphs. SIAM J. Imaging Sci. 10 (3), 1475–1510.Another practical consideration about the geometric gradient approximation (7) is that its absolute error increases as the magnitude of the true gradient grows large, i.e., when the function exhibits abrupt variations. This does not pose a problem for the calculation of gradient direction (which is relevant as input for approximating the curvature), since the latter does not depend on the range of the function's variation around the examined vertex. Utilizing all incident edges in the weighted sum of (7) ensures that all available information in the neighbourhood of the vertex is used to estimate which direction the gradient points to, as emphasized in Drakopoulos and Maragos (2012). However, the estimated gradient magnitude with the geometric gradient approximation (7) is prone to greater error, as it depends on the range of the function's variation. The use of difference quotients in (7) accentuates this effect for dense graphs, where distances between neighbouring vertices that appear in the denominator of the quotients approach zero. Thus, the preferable approximation for gradient magnitude in practice is the maximum absolute difference of values of the function along edges that are incident on v, introduced in Drakopoulos and Maragos (2012):
(14)
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/S1570865919300158
Stochastic Dynamics
Don Kulasiri , Wynand Verwoerd , in North-Holland Series in Applied Mathematics and Mechanics, 2002
2.3 What is Stochastic Calculus?
In standard calculus we deal with differentiable functions which are continuous except perhaps in certain locations of the domain under consideration. To understand the continuity of the functions better we make use of the definitions of the limits. We call a function f, a continuous function at the point t = t0 if
regardless of the direction t approaches t0 . A right-continuous function at t0 has a limiting value only when t approaches t0 from the right direction, i.e. t is larger than t0 in the vicinity of t 0. We will denote this as
Similarly a left-continuous function at t0 can be represented as
These statements imply that a continuous function in both right-continuous and left-continuous at a given point of t. Often we encounter functions having discontinuities; hence the need for the above definitions. To measure the size of a discontinuity, we define the term "jump" at any point t to be a discontinuity where the both f(t +) and f(t-) exist and the size of the jump be ∆f (t)=f(t +)− f(t −). The jumps are the discontinuities of the first kind and any other discontinuity is called a discontinuity of the second kind. Obviously a function can only have countable number of jumps in a given range. From the mean value theorem in calculus it can be shown that we can differentiate a function in a given interval only if the function is either continuous or has a discontinuity of the second kind during the interval. Stochastic calculus is the calculus dealing with often non-differentiable functions having jumps without discontinuities of the second kind. One such example of a function is the Wiener process (Brownian motion). One realization of the standard Wiener process is given in Figure 2.1.
Figure 2.1. An example of a function dealt in stochastic calculus. This function is continuous but not differentiable at any point.
Without going into details of how we computed this function- we will do that in Chapter 3 – we can see that the increments are irregular and we can not define a derivative according to the mean value theorem. This is because of the fact that the function changes erratically within small intervals, however small that interval may be, and we can not define a derivative at a given point in the conventional sense. Therefore we have to devise new mathematical tools that would be useful in dealing with these irregular non-differentiable functions.
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/S0167593102800031
Calculus
Huw Fox , Bill Bolton , in Mathematics for Engineers and Technologists, 2002
Sums of functions
Consider a function which can be considered to be a sum of a number of other functions, e.g. y = f(x) + g(x):
[11]
The derivative of the sum of two differentiable functions is the sum of their derivatives.
Key point
The derivative of the sum of two differentiable functions is the sum of their derivatives.
As an illustration, consider the differentiation of the hyperbolic function y = sinh x. This function (see Section 1.8) can be written as ½(e x − e−x ). Thus:
[12]
In a similar way we can differentiate sinh ax and cosh ax, obtaining:
Key point
The hyperbolic function tanh ax can be differentiated using the quotient rule (see later in this Section).
[13]
[14]
Example
Determine the derivatives of:
- (a)
-
y = 2x 3 + x 2,
- (b)
-
y = sin x + cos 2x,
- (c)
-
y = e4x + x
- (a)
-
- (b)
-
- (c)
-
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/B9780750655446500059
Deterministic Partial Differential Equations
Jinqiao Duan , Wei WANG , in Effective Dynamics of Stochastic Partial Differential Equations, 2014
2.5 Sobolev Inequalities
We now review some inequalities for weakly differentiable functions [264, Ch. 7], [287, Ch. II] or [144, Ch. 7]. Let be a domain in . The Lebesgue spaces , are the spaces of measurable functions that are th order Lebesgue integrable on a domain in . The norm for in is defined by
In particular, is a Hilbert space with the following scalar product and norm :
for in . Now we introduce some common Sobolev spaces. For define
Here with 's being nonnegative integers, , and . Each of these is a Hilbert space with scalar product
and the norm
For and , we further define another class of Sobolev spaces,
with norm
Recall that is the space of infinitely differentiable functions with compact support in the domain . Then is the closure of in Hilbert space (under the norm ). It is a Hilbert space contained in . Similarly, is the closure of in Banach space (under the norm ). It is a Banach space contained in .
Standard abbreviations are used for these common Sobolev spaces.
Let us list several useful inequalities.
Cauchy–Schwarz inequality
For ,
Hölder inequality
For and with , and ,
Minkowski inequality
For ,
Poincaré inequality
For ,
where is the Lebesgue measure of the domain , and is the volume of the unit ball in in terms of the Gamma function :
It is clear that and .
Similarly, for , and a bounded domain,
Let , and a bounded convex domain. Take , be any measurable subset, and define the spatial average of over by (with being the Lebesgue measure of ). Then
where is the diameter of .
Agmon inequality
Let be an open domain with piecewise smooth boundary. There exists a positive constant , depending only on domain , such that
In particular, for and ,
Moreover, for and ,
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/B9780128008829000020
Half-Linear Differential Equations
In North-Holland Mathematics Studies, 2005
5.6.1 Asymptotic formula for distribution of zeros
Here we present an asymptotic formula for number of zeros of oscillatory solutions of the equation
(5.6.1)
It is supposed that c(t) > 0 for large t and the results are based on the generalized Prüfer transformation from Section 1.2. In this transformation, a nontrivial solution and its derivative are expressed via the generalized half-linear sine and cosine functions. Recall that the half-linear sine function, denoted by sin p t, is the solution of the equation
satisfying the initial condition x(0) = 0, x'(0) = 1 and the half-linear cosine function is denned by cos p t = sin' p t. Generalized π, denoted by π p , is introduced in Subsection 1.1.2.
Theorem 5.6.1
Suppose that c is a differentiable function such that c(t) > 0 on an interval [T, ∞), and
(5.6.2)
holds. Then (5.6.1) is oscillatory. Moreover, if N[x; T] denotes the number of zeros of a solution x of (5.6.1) in the interval [a, T], then
(5.6.3)
where P[x; T] is the principal term given by
and R[x; T] is the remainder which is of smaller order than P[x; T] as T → ∞ and satisfies
Proof. Set and define
(5.6.4)
Then C *(t) is nonincreasing and satisfies lim t→∞ C *(t) = 0 by (5.6.2). We have
which implies that
It follows that lim t→∞ t −1[c(t)]−1/p = 0, or equivalently, lim t→∞ t p c(t) = ∞. This implies, by Theorem 1.4.5, that (5.6.1) is oscillatory.
Now we turn our attention to the proof of the asymptotic formulas for numbers of zeros. By the Sturmian comparison theorem (Theorem 1.2.4) we have that N[x 1; T] and N[x 2; T] differ at most by one for any solutions x 1 and x 2 of (5.6.1), so we may restrict our attention to the solution x 0 of (5.6.1) determined by the initial conditions x 0(a) = 0, x'0(a) = 1. This solution is oscillatory by the first part of the our theorem.
We introduce the polar coordinates ρ(t), φ(t) for x 0(t) by setting
(5.6.5)
It can be shown without difficulty that ρ(t) and φ(t) are continuously differentiable on [a, ∞) and satisfy the differential equations
(5.6.6)
We use the notation
in terms of which (5.6.6) is written as
(5.6.7)
From the first equation in (5.6.5) we see that x 0(t) = 0 if and only if φ(t) = jπ p , j ∈ ℤ. We may suppose that φ(a) = 0. In view of (5.6.2) there is no loss of generality in assuming that
where C *(t) is defined by (5.6.4). Since
(5.6.8)
we have
which implies that φ'(t) > 0, so that φ(t) is increasing for t ≥ a.
We now integrate (5.6.7) over [a, T], obtaining
(5.6.9)
where
From (5.6.8) it is clear that
(5.6.10)
Noting that the number of zeros of x 0(t) in [a, T] is given by
where [u] denotes the greatest integer not exceeding u, we see from (5.6.9) and (5.6.10) that the conclusion of the theorem holds with the choice
That the term R[x 0; T] = N[x 0; T] − P[x 0; T] is of smaller order than P[x 0; T] follows from the observation that
This completes the proof.
Example 5.6.1
Consider the equation
(5.6.11)
where β is a constant with p + β > 0. The function c(t) = t β satisfies
and so we conclude from Theorem 5.6.1 that the quantity P[x; T] can be taken to be
and (5.6.3) holds with this P[x; T] and R[x; T] satisfying
Remark 5.6.1
- (i)
-
The results of this subsection cannot be applied to the generalized Euler equation 1.4.20
shows that both of them are of the same logarithmic order as T → ∞. - (ii)
-
In [317], M. Piros has investigated a similar problem under a more stringent restriction on c(t), namely he supposed that c v (t) is a concave function of t for some v > 0. Then he proved that the error term R[x; T] in (5.6.3) is O(1). Exactly, the differential equation (5.6.11) with β = 1/v plays the exceptional role in determining the precise value of R[x; T].
- (iii)
-
It is not difficult to see how the results can be extended to the case of a general r in (1.1.1), which satisfies ƒ∞ r 1-q (s) ds = ∞, by using the transformation of independent variable, see Subsection 1.2.7.
Read full chapter
URL:
https://www.sciencedirect.com/science/article/pii/S0304020805800064
Source: https://www.sciencedirect.com/topics/mathematics/differentiable-function
0 Response to "F X y G X h Y Where G and H Are Continuously Differentiable Functions of One Variable"
Post a Comment