Newton Interpolation 1

Newton Interpolation Polynomial: Project NIP_1

Given the following n+1 measurement points:

No.	0	1	2	...	i	...	n
X-value	X₀	X₁	X₂	...	X_i	...	X_n
Y-value	Y₀	Y₁	Y₂	...	Y_i	...	Y_n

Table: Values (X_i,Y_i) i=0,n; X_i≠X_k ∀i≠k

All (X,Y) values are assumed to be related by an (unknown) function y=f(x), of which only a few value pairs have been collected (measured). An example of such a measurement is the resistivity of a metal (R = y-values) as a function of its temperature (T = x-values). Over a moderate temperature range, electrical resistivity usually increases with temperature and can be approximated by a relationship of the form R(T) = c₀ + c₁.T + c₂.T².

A general objective is to derive a functional relationship y=F(x) from the set of known values {(X_i,Y_i)}, such that F(x)≈f(x) ∀x in the range of interest. There are many methods to define F. The Newton Interpolation Polynomial method (NIP) is based on the following assumptions:

NIP is an interpolation method:

eq. [1] F(x_i)=y_i, ∀i, i=0,n

F(x) is a polynomial:

eq. [2] F(x) = c₀ + c₁.x + ... + c_i.xⁱ + ... + c_n.xⁿ
- c_i, i=0,n, called the polynomial coefficients, are constants ∈ ℝ (real numbers) or more generally ∈ ℂ (complex numbers),
- xⁱ, i=0,n are powers of x, which define the polynomial basis.
eq. [2] writes F(x) in canonical form. Newton and Lagrange solve the interpolation problem by rewriting F(x) in equivalent non-canonical forms.

Newton rewrites F(x) as follows:

eq. [3]

F(x) = N(x) = Nⁿ(x) = b₀ + b₁(x-x₀) + ... + b_i(x-x₀)(x-x₁)...(x-x_(i-1)) + ... + b_n(x-x₀)...(x-x_(n-1)), with b₀ = y₀

Nⁿ(x) is the Newton Interpolation Polynomial of degree n associated with the n+1 measurements (x_i,y_i), i=0,n.

Background Knowledge

The following results of Numerical Analysis may help to better understand the mathematics behind NIP.

Fundamental theorem of algebra

Every non-constant single-variable polynomial with complex coefficients has at least one complex root.

eq. [4] P_n(X) = c₀ + c₁.X + ... + c_i.Xⁱ + .. + c_n.Xⁿ = 0

with
- i=0,n; n ∈ ℕ, n ≥ 1
- Xⁱ is the i-th power of the complex variable X
- c_i are constant coefficients ∈ ℂ (the field of complex numbers), c_n ≠ 0
eq. [4] has at least one solution X₀ (a root of the equation).

The following consequences are derived from the fundamental theorem of algebra (or can be considered alternative formulations):
1. (Through successive polynomial division)
  In the field of complex numbers, eq. [4] has exactly n roots X_i, i=1,n, each counted with its multiplicity.
2. Let P_n(X) and Q_n(X) be two polynomials of degree at most n over the complex field.
  If both polynomials yield equal outputs for at least n+1 distinct values, i.e. P_n(x_i) = Q_n(x_i) for x_i, i=0,n, with x_i ≠ x_j for i ≠ j,
  then P_n(X) = Q_n(X), i.e. both polynomials are identical and are defined by the same coefficients (their difference has degree at most n but n+1 roots, so it must be the zero polynomial).

Taylor development

The Taylor series of a real or complex-valued function f(x) that is infinitely differentiable at a real or complex number a is the power series

eq. [5-a]	f(x) = f(a) + (f'(a) ⁄ 1!) . (x-a) + ... + (f^(k)(a) ⁄ k!) . (x-a)^k + ...
eq. [5-b]	f(x) = ∑_k=0,∞ (f^(k)(a) ⁄ k!) . (x-a)^k

If a equals 0, the following power series development of f(x) (the Maclaurin series) is obtained:

eq. [5-c]

f(x) = ∑_k=0,∞ (f^(k)(0) ⁄ k!) . x^k

In words, if an analytic function f(x) and all its derivatives are exactly known at one point of its domain, then f(x) is known everywhere within the radius of convergence of the series around that point, where it can be approximated by a polynomial (the truncated series) to whatever precision is targeted.

Lagrange Interpolation

Besides Newton, Lagrange proposed another solution, the Lagrange Interpolation Polynomial (LIP):

eq. [6-a]

L(x) = ∑_k=0,n y_k.L_k(x)

L(x) obviously interpolates all points if the L_k satisfy the following equations:

eq. [6-b]	L_k(x_k) = 1
eq. [6-c]	L_k(x_i) = 0, for i≠k

Lagrange proposes to write:

eq. [6-d]	L_k(x) = l_k.(x-x₀)...(x-x_(k-1)).(x-x_(k+1))...(x-x_n)
eq. [6-e]	l_k = 1 ⁄ ((x_k-x₀)...(x_k-x_(k-1)).(x_k-x_(k+1))...(x_k-x_n))

According to consequence 2 of the fundamental theorem of algebra, N(x) and L(x), both of degree at most n and interpolating the same set (x_i,y_i), i=0,n, are identical.
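The Lagrange construction of eq. [6-a] to [6-e] can be sketched in a few lines. The sketch below is written in Python with exact fractions, purely as an illustration; it is not part of the perl sources of project NIP_1, and the function names and sample points are hypothetical.

```python
from fractions import Fraction

def lagrange_weights(xs):
    """Return l_k = 1 / prod_{i != k} (x_k - x_i) for every node (eq. [6-e])."""
    weights = []
    for k, xk in enumerate(xs):
        denom = Fraction(1)
        for i, xi in enumerate(xs):
            if i != k:
                denom *= (xk - xi)
        weights.append(1 / denom)
    return weights

def lagrange_eval(xs, ys, x):
    """Evaluate L(x) = sum_k y_k * L_k(x) (eq. [6-a] and [6-d])."""
    weights = lagrange_weights(xs)
    total = Fraction(0)
    for k, (yk, lk) in enumerate(zip(ys, weights)):
        basis = lk                      # L_k(x) = l_k * prod_{i != k} (x - x_i)
        for i, xi in enumerate(xs):
            if i != k:
                basis *= (x - xi)
        total += yk * basis
    return total

# Hypothetical sample: three points taken from y = x^2 + 1
xs = [Fraction(0), Fraction(1), Fraction(3)]
ys = [Fraction(1), Fraction(2), Fraction(10)]
print(lagrange_eval(xs, ys, Fraction(2)))   # prints 5, i.e. 2^2 + 1
```

Because the weights depend only on the x-values, `lagrange_weights` could be computed once and reused for several y-sets measured at the same nodes.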

Solutions and discussion

The interpolation problem in canonical form reads as a set of inhomogeneous linear equations

eq. [7-a]

P_n(x_i) = ∑_k=0,n x_i^k . c_k = y_i

Transposed in the matrix formalism:

eq. [7-b]

[A_(i,j)].[C_(j)] = [Y_(i)]

a_(i,j) = x_i^j	powers of the x-coordinates of the measurement points
c_j	polynomial coefficients to calculate
y_i	y-coordinates of the measurement points

The equations system eq. [7] has a particular form. Its determinant det[A], called the Vandermonde determinant, equals ∏_i>j (x_i - x_j): a unique solution exists if and only if x_i≠x_k for i≠k. In addition, the differences (x_i-x_j) should be of comparable magnitude; otherwise the matrix becomes ill-conditioned and the computed coefficients lose precision.
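As an illustration of eq. [7], the following Python sketch builds the Vandermonde matrix, solves the system by plain Gauss-Jordan elimination on exact fractions, and evaluates the closed-form determinant. It is only a sketch under these assumptions (small systems, exact arithmetic); the helper names are hypothetical and nothing here is taken from NIP_lib.pm.

```python
from fractions import Fraction

def vandermonde(xs):
    """Matrix A with a[i][j] = x_i ** j (eq. [7-b])."""
    n = len(xs)
    return [[x ** j for j in range(n)] for x in xs]

def solve_linear(a, y):
    """Gauss-Jordan elimination with row exchange; returns the solution vector."""
    n = len(y)
    m = [row[:] + [rhs] for row, rhs in zip(a, y)]    # augmented matrix
    for col in range(n):
        # first row at or below the diagonal with a non-zero entry in this column
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        m[col] = [v / m[col][col] for v in m[col]]    # normalize the pivot row
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [vr - factor * vc for vr, vc in zip(m[r], m[col])]
    return [m[i][n] for i in range(n)]

def vandermonde_det(xs):
    """Closed form: product of (x_i - x_j) over all i > j."""
    det = Fraction(1)
    for i in range(len(xs)):
        for j in range(i):
            det *= xs[i] - xs[j]
    return det

# Hypothetical sample taken from y = 1 - x + 3x^2
xs = [Fraction(0), Fraction(1), Fraction(2)]
ys = [Fraction(1), Fraction(3), Fraction(11)]
coeffs = solve_linear(vandermonde(xs), ys)            # canonical c_0, c_1, c_2
print(coeffs, vandermonde_det(xs))
```

With two identical x-values the determinant vanishes and `solve_linear` finds no pivot, which mirrors the condition x_i≠x_k stated above.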

Neither N(x) nor L(x) is expressed in canonical form ∑_k=0,n c_k.x^k.
The advantage? They avoid solving the linear equations system in its general form (eq. [7]). This was particularly valuable before computers existed, and it still is when programmers have no library at hand to solve (linear) equations systems in their general form.

Transposed into the Lagrange basis, the general equations system (eq. [7]) simplifies radically: only the main diagonal remains (each l_k coefficient (eq. [6-e]) is calculated directly and independently of the others).
Since the l_k coefficients depend only on the x-values, they can be reused when new measurements are taken at the same x-values.
A major drawback is that each Lagrange basis function L_k(x) is a polynomial of degree n. Should the number of measurement points change, all basis functions must be recalculated. Besides, the explanatory potential is rather poor.

Transposed into the Newton basis (eq. [3]), the general equations system (eq. [7]) becomes a lower triangular system. One way to proceed is to calculate the divided differences. In this case, N(x) can be reformulated as follows:

eq. [8]

N(x) = [y₀] + [y₀,y₁](x-x₀) + ... + [y₀...y_i](x-x₀)(x-x₁)...(x-x_(i-1)) + ... + [y₀...y_n](x-x₀)...(x-x_(n-1))

with

[y₀]	= y₀	= b₀
[y₀,y₁]	= ([y₁] - [y₀]) ⁄ (x₁ - x₀) = (y₁ - y₀) ⁄ (x₁ - x₀)	= b₁
[y₀...y_(i+1)]	= ([y₁...y_(i+1)] - [y₀...y_i]) ⁄ (x_(i+1) - x₀)	= b_(i+1)
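The recursion on divided differences can be carried out in place on a single list. The Python sketch below illustrates this under the assumption that all x-values are distinct; the function name is hypothetical and the code is not the one used in NIP.pl.

```python
from fractions import Fraction

def divided_differences(xs, ys):
    """Return [b_0, ..., b_n] with b_i = [y_0 ... y_i] (eq. [8])."""
    coeffs = list(ys)                 # order 0: [y_i] = y_i
    n = len(xs)
    for order in range(1, n):
        # Walk from the bottom up so that coeffs[i - 1] still holds the
        # lower-order difference when coeffs[i] is updated.
        for i in range(n - 1, order - 1, -1):
            coeffs[i] = (coeffs[i] - coeffs[i - 1]) / (xs[i] - xs[i - order])
    return coeffs

# Hypothetical sample: three points taken from y = x^2 + 1
xs = [Fraction(0), Fraction(1), Fraction(3)]
ys = [Fraction(1), Fraction(2), Fraction(10)]
bs = divided_differences(xs, ys)
print(bs)   # b_0 = b_1 = b_2 = 1, i.e. N(x) = 1 + (x-0) + (x-0)(x-1) = x^2 + 1
```

After the pass of a given order, entry i holds the divided difference over the nodes x_(i-order) to x_i, so the final list contains exactly the b_i of eq. [3].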

If all x-values are equidistant, simplifications occur:

(x_(i+1)-x_i)	= constant = h,	∀i=0,n-1
(x_i-x₀)	= i.h
x	= x₀ + h.t,	t ∈[0,n] (x in the domain of interpolation)

eq. [9]

x-x_i = h.(t-i)

Consequently, the Newton Interpolation Polynomial can be rewritten as follows:

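eq. [10-a]	N(x) = y₀ + Δy₀.t + ... + (Δ^k y₀ ⁄ k!) . t(t-1)...(t-(k-1)) + ...
eq. [10-b]	N(x) = ∑_k=0,n (Δ^k y₀ ⁄ k!) . t(t-1)...(t-(k-1))
eq. [10-c]	N(x) = ∑_k=0,n Δ^k y₀ . (t(t-1)...(t-(k-1)) ⁄ k!)

Here b_k = Δ^k y₀ ⁄ (k! . h^k), and (x-x₀)...(x-x_(k-1)) = h^k . t(t-1)...(t-(k-1)) according to eq. [9], so that the factor h^k cancels. The forward differences are defined by:

eq. [11-a]	Δ⁰y_i	= y_i
eq. [11-b]	Δ^(k+1)y_i	= Δ^k y_(i+1) - Δ^k y_i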
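For equidistant x-values the whole calculation reduces to forward differences. Expressed in the variable t = (x - x₀)/h, the step h cancels: b_k.(x-x₀)...(x-x_(k-1)) = Δ^k y₀ . t(t-1)...(t-(k-1)) ⁄ k!. The following Python sketch illustrates this; the function names and the sample data are hypothetical and the code is not taken from the perl sources.

```python
from fractions import Fraction

def forward_differences(ys):
    """Return the leading differences [Δ^0 y_0, Δ^1 y_0, ..., Δ^n y_0]."""
    row = list(ys)
    leading = [row[0]]
    while len(row) > 1:
        row = [b - a for a, b in zip(row, row[1:])]   # next difference order
        leading.append(row[0])
    return leading

def newton_forward(x0, h, ys, x):
    """Evaluate N(x) = sum_k Δ^k y_0 * t(t-1)...(t-k+1)/k!, with t = (x - x0)/h."""
    t = (x - x0) / h
    total, binom = 0, 1            # binom holds t(t-1)...(t-k+1)/k!
    for k, delta in enumerate(forward_differences(ys)):
        total += delta * binom
        binom = binom * (t - k) / (k + 1)
    return total

# Hypothetical sample: y = x^2 at x = 0, 1/2, 1
ys = [Fraction(0), Fraction(1, 4), Fraction(1)]
value = newton_forward(Fraction(0), Fraction(1, 2), ys, Fraction(3, 4))
print(value)   # prints 9/16, i.e. (3/4)^2
```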

It is interesting to note the formal similarity between eq. [10-b] (the Newton development) and eq. [5-b] (the Taylor development).
In fact, the Newton approximation may be considered a sort of discrete Taylor development: the previously calculated coefficients keep their values as the number of basis functions, i.e. the number of measurement points, increases.

perl implementation

perl tips

Before going into details, a few tips about perl that proved helpful.

Command Options

A wide range of options can be used to control perl at invocation. One of my favorites is the following:

perl -c NameOfPerlScript
perl checks the syntax of the perl script named NameOfPerlScript without executing the program. Beware, a successful syntax check does not guarantee a successful run.

Literals

Special literals have a dedicated meaning

__END__ Legal end of code before the actual EOF
__FILE__ filename
__LINE__ line number
__PACKAGE__ package name

Named Code Blocks

A total of 5 named code blocks (BEGIN, UNITCHECK, CHECK, INIT, END) are, if present, executed at the beginning or at the end of a running perl program. Two of them:

BEGIN{}
is executed as soon as possible, that is the moment it is completely defined, even before the rest of the containing file is parsed.
END{}
is executed as late as possible, after perl has finished running the program and just before the interpreter exits.

Table: perl tips used in this project

Coding guide lines

This perl implementation of the NIP (project NIP_1) is merely an exercise: the way is as important as the objective. For this reason I will briefly mention my programming guidelines. The topic is controversial if treated out of context: criteria for code evaluation are manifold and partially contradict one another. Therefore each company, project and programmer must define their priorities, given the circumstances. And so do I.

In project NIP_1 the priority is code readability, based on the following pillars:
- Modularity of code
  As far as reasonable, the code will be broken down into (handy) functions (the unitary modules). Functions themselves will be grouped according to their purpose into different 'libraries' (that may be separately stored into different files).
- Self-explanatory code
  Especially the naming of entities (variables, routines) should be meaningful.
- Comments
  Comments should add value to the bare code. If the syntax is easy to understand, there is no need for extensive comments.
  If the syntax is ambiguous or complex, comments help readers understand what has been written and why.
  Comments should also disclose critical information that cannot otherwise be expressed in code, e.g. general concept, boundary conditions, scheduling, authors, references.
These guidelines also greatly improve maintenance, reusability, extension and upgrading of the code.

Some other implementation criteria were ignored:
- Minimize code (laconism)
  perl is known to allow extreme code concision: one line of code can amount to an entire subroutine.
  Since the amount of information required to perform a treatment is basically determined by its nature, extreme concision relies on implicit assumptions. Sometimes extremely short code is only possible because of bugs or unspecified features in the environment providing resources (cf. hacks).
- Minimize resource allocation
  Available resources determine the run time and limit the (transient) storage available.
Under certain circumstances (e.g. quick patch, little storage or processing capacity) the latter criteria may play a role. Not in this project: pursuing them tends to impair code readability, maintenance and portability.

Program description

The NIP resolution discussed here (called NIP_1) is coded into three files:

NIP.pl stores only the main program (which is not labeled as such in the PL file).
It consists almost exclusively of (library) subroutine calls.
NIP_lib.pm stores all NIP-specific subroutines and functions.
Because their number is limited, a single PM file suffices.
NIP_input.txt is the input file.
Its name can be redefined on the command line (CL).

NIP.pl

main program

It consists of the following structuring subroutines (in order of invocation):

NIP_lib::read_CL_Options
parses the command line, which may include the following options (in any order).
1. -if AlternativeInputFileName
  By default NIP.pl reads NIP_input.txt.
2. -of AlternativeOutputFileName
  By default NIP.pl writes NIP_output.txt.
3. -mf AlternativeMessageFileName
  By default NIP.pl writes NIP_messages.txt.
4. -v(erbose)
  Extensive messages are output (into the messages file).
  By default, the verbose mode is deactivated.
5. -d(ebug)
  Generates even more messages than the verbose mode.
  By default, deactivated.
NIP_lib::read_Section_INPUT_DATA
parses section INPUT_DATA of the input file, describing the NIP problem.

Beware: NIP.pl assumes that all x_i-values are equidistant. Users need only enter x₀ and Delta-x=(x_(i+1)-x_i) to define the set of measurement values.
NIP_lib::solve_NIP
evaluates all b_i coefficients, i=0,n (n = number of measurements - 1) (see eq. [3]).
NIP_lib::read_Section_ANALYSIS
parses section ANALYSIS of the input file, describing how to assess the calculated NIP.

Beware: To define the set of interpolated points, the same method applies as for the measurements set: Users need only enter x₀ and Delta-x=(x_i+1-x_i).
NIP_lib::calc_Ys_NIP
calculates an entire set of values (x,y) according to formula:
N^k(x)= ∑_i=0,k (b_i.∏_j=0,i-1 (x-x_j)) with k≤n (n= number of interpolation points - 1).

If k=n, N^k(x)=N(x) is the complete NIP. Otherwise, it is a partial NIP. (It is interesting to observe how partial NIPs converge toward the complete one as k→n.)

Note: Values y=N(x) are calculated with Horner's method.
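Horner's method applied to the Newton form nests the factors (x-x_j): N^k(x) = b₀ + (x-x₀)(b₁ + (x-x₁)(b₂ + ...)). The Python sketch below shows the idea; it is an illustration only, with a hypothetical function name, and does not reproduce the perl code of NIP_lib::calc_Ys_NIP.

```python
def newton_horner(bs, xs, x, k=None):
    """Evaluate the partial NIP N^k(x) by nested multiplication.

    bs: Newton coefficients b_0..b_n, xs: nodes x_0..x_n,
    k: degree to use (defaults to n, i.e. the complete NIP).
    """
    if k is None:
        k = len(bs) - 1
    result = bs[k]
    for i in range(k - 1, -1, -1):
        result = result * (x - xs[i]) + bs[i]
    return result

# Hypothetical sample: N(x) = 1 + (x-0) + (x-0)(x-1) = x^2 + 1
bs = [1.0, 1.0, 1.0]
xs = [0.0, 1.0, 3.0]
print(newton_horner(bs, xs, 2.0))        # complete NIP: 5.0
print(newton_horner(bs, xs, 2.0, k=1))   # partial NIP N^1(x) = 1 + x: 3.0
```

Each evaluation costs k multiplications and 2k additions or subtractions, and the same routine serves for complete and partial NIPs.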

NIP_lib.pm

perl package (PM module),
containing all NIP-specific subroutines mentioned above (and a few more).

NIP_input.txt

Input file
containing two sections: INPUT_DATA, ANALYSIS.

NIP_input.txt is the file name by default.
The user determines a specific name using CL option -if SpecificFileName.txt (for Input File).

Table: General program structure

The program produces following output.

NIP_output.txt

This is the main output:

Displays the input data from Section INPUT_DATA that actually affect the resolution.
Displays the b_i-values (characterizing the NIP) in format %6.20f, i.e. with 20 decimal places!
(This gives a rough idea of the numeric precision of the results.)
Displays interpolated (and extrapolated) (x_i,y_i)-values as determined in Section ANALYSIS, in format %6.9f, i.e. with 9 decimal places (expected precision of the calculation).

NIP_output.txt is the file name by default.
The user determines a specific name using CL option -of SpecificFileName.txt (for Output File).

NIP_messages.txt

Displays warnings and errors, depending on parameters debug (mode), verbosity.

The debug mode is set by CL option -d(ebug).
I activated the option to extensively check what the program reads and how.
CL option -v(erbose) also increases the amount of messages - less than the debug mode though.

NIP_messages.txt is the file name by default.
The user determines a specific name using CL option -mf SpecificFileName.txt (for Message File).

Note: Results and messages are always displayed on screen (i.e. standard output). Users only need files if they want to store results or process them further.

Table: Output files of NIP.pl

The program is invoked as follows:

Minimum syntax: perl NIP.pl
Maximum syntax: perl NIP.pl -if InputFN.txt -of OutputFN.txt -mf MessageFN.txt -v(erbose) -d(ebug)

Discussion

Solving the NIP problem means calculating the b_i coefficients, as formulated in eq. [3].

eq. [8] and eq. [10] evaluate the coefficients of the NIP, in the general case and in the case where all x_i, i=0,n (n = number of measurements - 1), are equidistant, respectively.
Both equations are valuable from an epistemological point of view: they help to better understand the implications of the NIP formulation. They may also be coded into spreadsheet applications like Microsoft Excel or LibreOffice Calc. However, I would not recommend such a method.

The perl implementation discussed here relies directly on the fact that the matrix A_(i,j) of the equations system associated with the NIP (eq. [3]) is lower triangular: b_i can be calculated directly, provided that all b_k, k=0,i-1 are known. There is no need for dedicated subroutines operating on matrices. More precisely, perl function NIP_lib::solve_NIP relies on the following equations:

eq. [12-a]	N⁰(x) = b₀ = y₀
eq. [12-b]	b_i = (y_i - N^(i-1)(x_i)) ⁄ ((x_i-x₀)...(x_i-x_(i-1))), ∀ i=1,n
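The forward substitution of eq. [12-a] and [12-b] can be sketched as follows. This is a Python illustration written from the two equations alone; `solve_nip` is a hypothetical name and the code is not a transcription of the perl function NIP_lib::solve_NIP.

```python
def solve_nip(xs, ys):
    """Forward substitution on the lower triangular Newton system."""
    bs = [ys[0]]                                   # eq. [12-a]: b_0 = y_0
    for i in range(1, len(xs)):
        # Accumulate N^(i-1)(x_i) and the product (x_i - x_0)...(x_i - x_(i-1))
        # in a single pass over the coefficients already known.
        partial, product = 0, 1
        for j in range(i):
            partial += bs[j] * product
            product *= xs[i] - xs[j]
        bs.append((ys[i] - partial) / product)     # eq. [12-b]
    return bs

# Hypothetical sample: three points taken from y = x^2 + 1
bs = solve_nip([0.0, 1.0, 3.0], [1.0, 2.0, 10.0])
print(bs)   # [1.0, 1.0, 1.0]
```

Each b_i reuses all previous b_k, which is why an error in an early coefficient propagates into every later one.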

Numerical imprecision will grow as more coefficients must be calculated, for at least the following (interrelated) reasons:

- Values in matrix A_(i,j) of the equations system in eq. [3] grow increasingly heterogeneous: the matrix becomes worse and worse conditioned.
- In most cases, the higher b_i-coefficients become increasingly tiny and soon disappear into the numerical noise.
- Imprecisions in b_i values are amplified in the calculation of subsequent b_k, k>i.
  This phenomenon applies specifically to NIP.pl and may greatly reduce the number of coefficients correctly estimated.

Above a limit to be determined, I recommend solving the canonical equations system (eq. [7]) and using algorithms that calculate all b_i with the same degree of precision, which includes a preliminary matrix conditioning.

Study cases

Objectives

To show how NIP.pl calculates.
To assess validity and precision of results.
To show how the Newton Interpolation Polynomial converges in selected cases: N^k(x) → f(x), k → ∞.

The trick consists in defining a function f(x) in advance and then comparing true and interpolated values. The following cases will be analyzed:

Case Name	Definition of f(x) (to be interpolated)	List of TXT-files associated to NIP.pl
Case 1: polynomial	f1(x)=(1 + x + 0.5 x² + 0.25 x³ + 2 x⁵)	NIP_input_StudyCase_1_Polynomial.txt NIP_output_StudyCase_1_Polynomial.txt NIP_message_StudyCase_1_Polynomial.txt
Case 2: exponential function	f2(x)=e^x	NIP_input_StudyCase_2_exp_B.txt NIP_output_StudyCase_2_exp_B.txt NIP_message_StudyCase_2_exp_B.txt
Case 3: trigonometric function	f3(x)=sin(x)	NIP_input_StudyCase_3_sin.txt NIP_output_StudyCase_3_sin.txt NIP_message_StudyCase_3_sin.txt
Case 4-1: square root	f4_1(x)=sqrt(x+1)	NIP_input_StudyCase_4_sqrt_1.txt NIP_output_StudyCase_4_sqrt_1.txt NIP_message_StudyCase_4_sqrt_1.txt
Case 4-2: square root	f4_2(x)=sqrt(x)	NIP_input_StudyCase_4_sqrt_2.txt NIP_output_StudyCase_4_sqrt_2.txt NIP_message_StudyCase_4_sqrt_2.txt

Table: Study cases overview

Rem: Message files are only mentioned for the sake of completeness. (In all cases presented NIP.pl ran without warnings: the message files are empty.)

About the methodology applied:

Input values y_i=f(x_i) were calculated with LibreOffice calc and pasted into section INPUT_DATA of associated NIP_input.txt.
ODS spreadsheet NIP_StudyCases.ods contains all tables and graphics available. Some of them are discussed below.
NIP N(x) associated with set s_K={(X₀,Y₀),...,(X_K,Y_K)} of K+1 measurement points is named N^K(x).
NIP N(x) associated with superset s_K+L={(X₀,Y₀),...,(X_K,Y_K),..., (X_K+L,Y_K+L)} is named N^K+L(x).

It is interesting to observe, how N^k(x) evolves, as k increases:
If s_K is associated with an analytic function f(x) i.e. Y_i=f(X_i), ∀i=0,K, then N^k(x) should converge to f(x).

For each study case two NIPs will be compared: N⁵(x) and N^K(x), K≈10, where K+1 is the total number of measurements considered.
The criterion used for comparison is the associated set {c_i} of coefficients of the polynomials in canonical form.
The numerical precision available, in perl as well as in LibreOffice Calc, was presumed to be about 10^-10 (no inquiry has been made to validate this assumption). Therefore results are output with 10 decimal places.
Interpolation results y=N(x) are considered good if the relative error, expressed as (N(x)-f(x))/f(x) * 100, does not exceed 1% in absolute value.
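The 1% criterion can be written down as a small helper. The Python sketch below is an illustration with hypothetical names; the handling of f(x)=0 (where the relative error is undefined, e.g. sin(0) or sqrt(0)) is an assumption of this sketch and is not described in the study itself.

```python
import math

def relative_error_percent(approx, exact):
    """Relative error (N(x) - f(x)) / f(x) * 100."""
    if exact == 0:
        # Assumption of this sketch: report 0 for an exact hit, infinity otherwise.
        return 0.0 if approx == 0 else math.inf
    return (approx - exact) / exact * 100.0

def is_good(approx, exact, limit_percent=1.0):
    """True if the absolute relative error does not exceed the limit."""
    return abs(relative_error_percent(approx, exact)) <= limit_percent

print(is_good(1.005, 1.0))   # True: error of about 0.5%
print(is_good(1.02, 1.0))    # False: error of about 2%
```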

Study Case: Interpolate a polynomial

f1(x)= 1 + x + 0.5 x² + 0.25 x³ + 2 x⁵, examined for x ∈ [0,1], (X_i+1-X_i)= Delta_X= 0.1

perl NIP.pl -if NIP_input_StudyCase_1_Polynomial.txt -of NIP_output_StudyCase_1_Polynomial.txt (-mf NIP_message_StudyCase_1_Polynomial.txt)

Graphic: NIP of polynomial f1(x)=1+ x+ 0.5 x²+ 0.25 x³+ 2 x⁵

Input data (see NIP_input_StudyCase_1_Polynomial.txt)

10 (X_i,Y_i)-values were used to calculate the NIP, N⁹(x): X₀=0.00 to X₉=0.9, Delta-X=0.10.
41 values were recalculated: X₀=0.00 to X₄₀=2.0, Delta-X=0.05.
Rem: Some y-values have been interpolated (for x ≤ 0.9), others extrapolated (for 0.9 < x ≤ 2.0).

Results (see NIP_output_StudyCase_1_Polynomial.txt)

All N⁹(X_i) fit exactly the original f(X_i): According to calc all differences equal to zero. The same result applies actually to N⁵.

c_i coefficients of N⁵ and N⁹

Index of c_i	N⁵(x)	N⁹(x)
0	+1.0000000000	+1.0000000000
1	+1.0000000000	+1.0000000000
2	+0.5000000000	+0.5000000000
3	+0.2500000000	+0.2500000000
4	+0.0000000000	+0.0000000000
5	+2.0000000000	+1.9999999999
6		+0.0000000002
7		-0.0000000002
8		+0.0000000001
9		-0.0000000000

Table: Comparison of the c_i-coefficients for N⁵(x) and N⁹(x)

In this particular case, where f1(x) is a polynomial, both NIP N⁵, N⁹ should ideally produce exactly the same result - which is the original set of all c_i-coefficients in f1. Practically, N⁵(x) gives a better interpolation. The slight degradation is due to numerical imprecision (The values of coefficients c₆ to c₉ in N⁹ are obviously numerical noise).
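The comparison above relies on transposing the Newton form (b_i-coefficients) into canonical form (c_i-coefficients). A possible way to do this is to expand the products (x-x₀)...(x-x_(i-1)) step by step. The Python sketch below illustrates the idea for N⁵ of study case 1, using exact fractions so that no numerical noise appears; all names are hypothetical and the code is not the conversion routine of NIP.pl.

```python
from fractions import Fraction

def divided_differences(xs, ys):
    """Newton coefficients b_0..b_n by in-place divided differences."""
    coeffs = list(ys)
    for order in range(1, len(xs)):
        for i in range(len(xs) - 1, order - 1, -1):
            coeffs[i] = (coeffs[i] - coeffs[i - 1]) / (xs[i] - xs[i - order])
    return coeffs

def newton_to_canonical(bs, xs):
    """Expand b_0 + b_1(x-x_0) + ... into c_0 + c_1 x + ... + c_n x^n."""
    n = len(bs)
    cs = [Fraction(0)] * n
    basis = [Fraction(1)]     # coefficients of prod_{j<i}(x - x_j), lowest power first
    for i in range(n):
        for power, coeff in enumerate(basis):
            cs[power] += bs[i] * coeff
        # Multiply the basis polynomial by (x - x_i) for the next term.
        shifted = [Fraction(0)] + basis
        scaled = [-xs[i] * c for c in basis] + [Fraction(0)]
        basis = [s + t for s, t in zip(shifted, scaled)]
    return cs

def f1(x):
    return 1 + x + Fraction(1, 2) * x**2 + Fraction(1, 4) * x**3 + 2 * x**5

xs = [Fraction(i, 10) for i in range(6)]      # X_0 = 0.0 to X_5 = 0.5
ys = [f1(x) for x in xs]
cs = newton_to_canonical(divided_differences(xs, ys), xs)
print([str(c) for c in cs])   # ['1', '1', '1/2', '1/4', '0', '2']
```

With floating point numbers instead of fractions, the same expansion would show small deviations of the kind visible in the N⁹ column.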

Study Case: Interpolate e^x

f2(x)= e^x, examined for x ∈ [0,4], (x_i+1-x_i)= Delta_x= 0.4

perl NIP.pl -if NIP_input_StudyCase_2_exp_B.txt -of NIP_output_StudyCase_2_exp_B.txt (-mf NIP_message_StudyCase_2_exp_B.txt)

Graphic: NIP of f2(x) = e^x

Input data (see NIP_input_StudyCase_2_exp_B.txt)

15 (X_i,Y_i)-values were used to calculate N¹⁴(x): X₀=0.00 to X₁₄=5.6, Delta-X=0.40.
50 values were recalculated: X₀=0.00 to X₄₉=9.8, Delta-X=0.20.
Rem: Some y-values have been interpolated (for x ≤ 5.6), others extrapolated (for 5.6 < x≤ 9.8).

Results (see NIP_output_StudyCase_2_exp_B.txt)

N¹⁴ produces good results over the whole recalculation range [0-9.8] (Its interpolation range is [0-5.6]).
N¹⁰ produces good results within a range [0-7.0] (Its interpolation range is [0-4.0]).
N⁵ produces good results within a range [0-2.8] (Its interpolation range is [0-2.0]).

c_i coefficients of N⁵, N¹⁰, N¹⁴ and T[e^x]

Index of c_i	N⁵	N¹⁰	N¹⁴	Taylor series T[e^x]
0	+1.0000000000	+1.000000000	+1.0000000000	+1
1	+1.0041565261	+0.9999361463	+0.9999973245	+1
2	+0.4768052674	+0.5004629399	+0.5000216050	+0.5
3	+0.2123995333	+0.1652975971	+0.1665925953	+0.1666666667
4	+0.0015571296	+0.0438795661	+0.0418118925	+0.0416666667
5	+0.0234191137	+0.0061452376	+0.0081494533	+0.0083333333
6		+0.0027799887	+0.0015491374	+0.0013888889
7		-0.0003780932	+0.0000987721	+0.0001984127
8		+0.0001776319	+0.0000698567	+0.0000248016
9		-0.0000214749	-0.0000121544	+0.0000027557
10		+0.0000021764	+0.0000038663	+0.0000002756
11			-0.0000005916	+0.0000000251
12			+0.0000000744	+0.0000000021
13			-0.0000000051	+0.0000000002
14			+0.0000000002	+0.0000000000

Table: Convergence of the c_i-coefficients toward the Taylor series T[e^x]

The Taylor series T[e^x]= ∑_n=0,∞ (xⁿ/n!) is the limit to which N^k should converge, when k→∞.

The table above shows roughly how the c_i-coefficients of N^K converge toward the Taylor series, as K increases.
Beware: For i ≥ 13, the c_i values can no longer be clearly distinguished from the numerical noise (≈10^-10)!
The graphic below shows the errors in the estimation of the c_i coefficients for N⁵, N¹⁰ and N¹⁴.

Graphic: Errors in the estimation of the c_i-coefficients of selected NIP

For coefficients c₁ to c₇ there is a clear progression N⁵ > N¹⁰ > N¹⁴ > T[e^x]. However, as the c_i index increases (i.e. the c_i value decreases), numerical imprecision gains influence. The relative errors in % shown on the secondary Y-axis may be an indicator of the phenomenon: (absolute) values greater than 100% have been removed for the sake of readability.

Study Case: Interpolate sin(x)

f3(x)= sin(x), examined for x ∈ [0, 6.6], (x_i+1-x_i)= Delta_x= 0.6 (rem: 2.π ≈ 6.283)

perl NIP.pl -if NIP_input_StudyCase_3_sin.txt -of NIP_output_StudyCase_3_sin.txt (-mf NIP_message_StudyCase_3_sin.txt)

Graphic: NIP of f3(x)= sin(x)

Input data (see NIP_input_StudyCase_3_sin.txt)

12 (X_i,Y_i) values were used to calculate N¹¹(x): X₀=0.00 to X₁₁=6.6, Delta-X=0.60.
42 values were recalculated: X₀=0.00 to X₄₁=12.3, Delta-X=0.30.
Rem: Some y-values have been interpolated (for x ≤ 6.6), others extrapolated (for 6.6 < x ≤ 12.3).

Results (see NIP_output_StudyCase_3_sin.txt)

N¹¹(x) produces good values within x-range [0, 7.80] (Its interpolation scope is [0, 6.60]).
N⁵(x) produces good values within x-range [0, 3.00] - which corresponds to its interpolation scope.

c_i coefficients of N⁵, N¹¹, and T[sin(x)]

Index of c_i	N⁵(x)	N¹¹(x)	Taylor series T[sin(x)]
0	0.0000000000	0.0000000000	0
1	+0.9884512209	+1.0000246629	+1
2	+0.0431998861	-0.0001428261	0
3	-0.2237806822	-0.1663142431	-0.1666666667
4	+0.0332862131	-0.0004939005	0
5	+0.0005467594	+0.0087732791	+0.0083333333
6		-0.0002630934	0
7		-0.0000900869	-0.0001984127
8		-0.0000308189	0
9		+0.0000086901	+0.0000027557
10		-0.0000007253	0
11		+0.0000000210	-0.0000000251

Table: Convergence of the c_i-coefficients toward the Taylor series T[sin(x)]

The Taylor series T[sin(x)]= ∑_n=0,∞ ((-1)ⁿ ⁄ (2n+1)!) . x^(2n+1) is the limit to which N^k should converge, when k→∞.

The table above shows roughly how the c_i-coefficients of N^K converge, as K increases.
Note: All c_i-coefficient values can still be clearly distinguished from the numerical noise but a few more coefficients will break the limit.
The graphic below shows the estimation errors for all c_i in N⁵ and N¹¹.

Graphic: Estimation errors for the c_i-coefficients of NIP applied to sin(x)

For coefficients c₁ to c₇ there is a clear improvement from N⁵ to N¹¹ toward T[sin(x)]. As the c_i index increases (i.e. the c_i value decreases), the relative errors in % shown on the secondary Y-axis oscillate more and more wildly. For the sake of readability, (absolute) values greater than 100% have been removed.

Study Case: Interpolate sqrt(1+x)

f4_1(x)= sqrt(x+1), calculated for x∈[0,2] and (x_i+1-x_i)=Delta_x=0.2

perl NIP.pl -if NIP_input_StudyCase_4_sqrt_1.txt -of NIP_output_StudyCase_4_sqrt_1.txt (-mf NIP_message_StudyCase_4_sqrt_1.txt)

Graphic: NIP of f4_1(x)= sqrt(x+1)

Input data (see NIP_input_StudyCase_4_sqrt_1.txt)

11 (X_i,Y_i) values were used to calculate N¹⁰(x): X₀=0.00 to X₁₀=2.0, Delta-X=0.20.
42 values were recalculated: X₀=0.00 to X₄₁=4.1, Delta-X=0.10.
Rem: Some y-values have been interpolated (for x ≤2.0), others extrapolated (for 2.0 < x ≤4.1).

Results (see NIP_output_StudyCase_4_sqrt_1.txt)

N¹⁰(x) produces good values within x-range [0, 3.3] (Its interpolation scope is [0, 2.0]).
N⁵(x) produces good values within x-range [0, 2.1] (Its interpolation scope is [0, 1.0]).

c_i coefficients of N⁵, N¹⁰, and T[sqrt(x+1)]

Index of c_i	N⁵(x)	N¹⁰(x)	Taylor series T[sqrt(x+1)]
0	+1.0000000000	+1.0000000000	+1
1	+0.4998720084	+0.4999941272	+0.5
2	-0.1234444216	-0.1249091931	-0.125
3	+0.0553503036	+0.0619097542	+0.0625
4	-0.0224153672	-0.0368784515	-0.0390625
5	+0.0048510391	+0.0220985976	+0.02734375
6		-0.0116379230	-0.0205078125
7		+0.0048049210	+0.0161132813
8		-0.0013969429	-0.013092041
9		+0.0002488773	+0.0109100342
10		-0.0000202043	-0.0092735291

Table: Convergence of the c_i-coefficients toward the Taylor series T[sqrt(x+1)]

The Taylor series T[sqrt(1+x)]= ∑_n=0,∞(0.5(0.5-1)...(0.5-n+1)/n!) xⁿ is the limit to which N^k should converge, when k→∞.
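The reference column of the table can be recomputed from this formula with a simple recurrence: each coefficient equals the previous one multiplied by (0.5-(n-1))/n. The Python sketch below is an illustration with a hypothetical function name; it uses exact fractions and converts to decimals only for display.

```python
from fractions import Fraction

def binomial_series_coeffs(alpha, count):
    """Coefficients alpha(alpha-1)...(alpha-n+1)/n! of (1+x)^alpha, n = 0..count-1."""
    coeffs = [Fraction(1)]
    for n in range(1, count):
        coeffs.append(coeffs[-1] * (alpha - (n - 1)) / n)
    return coeffs

coeffs = binomial_series_coeffs(Fraction(1, 2), 11)
for n, c in enumerate(coeffs):
    print(f"{n:2d} {float(c):+.10f}")
```

The first values are 1, 1/2, -1/8, 1/16, -5/128 and 7/256, which match the Taylor column above.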

The table above shows roughly how the c_i-coefficients of N^K converge, as K increases.
Note: In contrast to the situation in previous examples - e^x, sin(x) - all c_i-coefficient values are widely above the numerical noise: the interpolation potential is not yet exhausted.
The graphic below shows the estimation errors for all c_i in N⁵ and N¹⁰.

Graphic: Estimation errors for the c_i-coefficients of NIP applied to sqrt(x+1)

The graphic confirms the conclusions drawn before from the table: the N^p sequence converges smoothly toward T[sqrt(x+1)]. Increasing the number of measurement points used for calculating the NIP should further improve the precision of the results.

Study Case: Interpolate sqrt(x)

f4_2(x)= sqrt(x), calculated for x∈[0,2] and (x_i+1-x_i)=Delta_x=0.2

perl NIP.pl -if NIP_input_StudyCase_4_sqrt_2.txt -of NIP_output_StudyCase_4_sqrt_2.txt (-mf NIP_message_StudyCase_4_sqrt_2.txt)

Graphic: NIP of f4_2(x)= sqrt(x)

Input data (see NIP_input_StudyCase_4_sqrt_2.txt)

11 (X_i,Y_i) values were used to calculate N¹⁰(x): X₀=0.00 to X₁₀=2.0, Delta-X=0.20.
42 values were recalculated: X₀=0.00 to X₄₁=4.1, Delta-X=0.10.
Rem: Some y-values have been interpolated (for x ≤2.0), others extrapolated (for 2.0 < x ≤4.1).

Results (see NIP_output_StudyCase_4_sqrt_2.txt)

N¹⁰(x) produces good values within x-range [0, 2.0] (Its interpolation scope is [0, 2.0]).
N⁵(x) produces good values within x-range [0, 1.0] (Its interpolation scope is [0, 1.0]).

Outside the interpolation range, the estimation error exceeds the 100% limit almost immediately: in order to preserve the readability of the graphical representation, the dotted curves showing errors in % (linked to the secondary Y-axis) have been additionally truncated.
Also interesting to notice is the deviation peak occurring systematically at the first x value interpolated (x=0.1).

c_i coefficients of N⁵, N¹⁰

Index of c_i	N⁵(x)	N¹⁰(x)
0	+0.0000000000	+0.0000000000
1	+3.6887261304	+4.2280735342
2	-10.2108662533	-17.0372885063
3	+17.5071419536	+50.8437458399
4	-14.8116527292	-98.8271330060
5	+4.8266508984	+127.3064766291
6		-109.6564207910
7		+62.4351224588
8		-22.5408190453
9		+4.6711946380
10		-0.4229517514

Table: Comparison of the c_i-coefficients for N⁵(x) and N¹⁰(x)

There is no Taylor series available for sqrt(x) about x=0, since its derivative tends to +∞ as x tends to 0. In other words, there is no a priori convergence of the N^K sequence when K grows.

Table and graphic show that the c_i values swing up and down. Presumably, and in contrast to all previous cases, this swinging effect will amplify as the number of measurement points used to calculate N(x) increases (to be clarified).

Graphic: Estimation of the c_i-coefficients of NIP applied to sqrt(x)

Conclusions

Both functions f4_1(x)=sqrt(1+x) and f4_2(x)=sqrt(x) are similar: a translation of the x values (an affine transformation) turns one function into the other. Nevertheless, their NIP resolutions behave quite differently:
- For f4_1(x)=sqrt(1+x), it shows a very good response.
- For f4_2(x)=sqrt(x), it shows a latent propensity toward instability.
The difficulty is due to the fact that f4_2(x)=sqrt(x) is not differentiable at zero (its tangent is vertical there), so a polynomial, whose derivative is finite everywhere, struggles to reproduce its curve near x=0.

Study Case: General conclusions

Program NIP.pl can correctly estimate the Newton Interpolation Polynomial N(x) for a given set of P+1 measurements (X_i,Y_i), i=0,P.
- It will give acceptable results if the measured values can, a priori, fit into the polynomial field, i.e. if the unknown function f(x) to interpolate is analytic within the observed range.
  - If the set of input values is extracted from a polynomial, NIP.pl will retrieve the polynomial in question.
  - More generally, in the cases studied, the N^K sequence approaches the Taylor series T[f(x)] as K (i.e. the number of measurements considered) increases.
- The number of input points that can be handled (i.e. the highest degree of N(x) that can be calculated adequately) is limited by the numerical precision available. Most probably, NIP.pl can retrieve up to 15 coefficients.
The study cases have focused so far on the adequate use of NIP. In cases where measurement points show ragged, chaotic patterns, alternative methods are more appropriate, e.g. Fourier series.

Notwithstanding, it would be fun - and from the theoretical point of view just as rewarding - to analyze NIP solutions for input sets that clearly violate the required conditions of use. In order to study such cases in depth, it is proposed to provide NIP.pl with the ability to tackle heterogeneous sets - see chapter Further developments.

References

Following links may help increase one's understanding on the subject.

Web sites:

1	Mathews J. H.	Lecture on Numerical Methods	English
2	Chandra P. et al.	Lecture on Numerical Methods	English
3	Wikipedia	Polynominterpolation	(German, English)
4	FH Kärnten Informatikprojekt	Newton Interpolationsformel	German

Download pdfs:

1	Shestopalov Y.	Lecture on Interpolation Methods	English

Please beware: the above documentation results from a quick search and can almost be considered a random output. In other words, a thorough investigation might have produced a different list.

Books (that I read on the subject):

1	Piskounov	Calcul différentiel et intégral Tome 1 p266-268	Translation into French of the Russian original
2	Engeln-Müllges G., Reutter F.	Numerische Mathematik für Ingenieure	German

Downloads

Contents (NIP_1)

The perl sources used and all discussed results are available for download in ZIP_Archive_NIP_1 from project reference Project_NIP_1.

ZIP_Archive_NIP_1 contains the following files:

NIP.pl, NIP_lib.pm	perl sources: main program and library
NIP_input.txt	An example of input file, according to the default syntax.
\Study_Cases\	Sub-folder including all input and output files mentioned in the study cases. NIP_StudyCases.ods (LibreOffice Calc) is also included: it contains input and output data of all study cases, as well as their analysis (especially the graphical output). Feel free to retrace every step by yourself.

Table: List of contents from project NIP_1 available for download

Version	date	Description
0.0	May 2015	First version available for download: Calculates the Newton interpolation polynomial from a set of points (X_i, Y_i) Reads X_i: X₀, Delta-X= (X_i+1-X_i). For each X_i: Y_i. Reads a set of recalculation points whose X_i are defined similarly to the input X values.
0.1	June 2015	Following extensions have been implemented: Each NIP selected for recalculation/validation will be automatically transposed into canonical form (Estimation of the c_i-coefficients out of the b_i-coefficients).

Table: Brief history of versions

Rem: At any given time, only one version will be available for download: the last published update.

Portability

I do not expect any portability issues:
- I developed NIP.pl on OS X Yosemite version 10.10.3 using perl version v5.18.2.
- I successfully ran NIP.pl, as is, on Microsoft Windows 7 Ultimate, Service Pack 1 using Strawberry Perl (64-bit) version v5.20.2.1.
If someone gets into trouble, please send a message to Feedback explaining the circumstances. I will update this document accordingly.

Further developments

Here are a few ideas to implement in further versions of NIP.pl, in order of preference.
1. Read heterogeneous measurement sets:
  Freely defined (non-equidistant) X_i sets open new possibilities for studying NIP: what happens if the X_i set is not monotonic, if some areas are more densely measured than others, etc. Programming hint: only a few lines of code need to be modified (reading section INPUT_DATA).
2. Introduce normalized intervals:
  All X_i values shall be projected into a given interval before treatment, e.g. [0,1], [1,2]. In my opinion, this transformation should work like some sort of data conditioning (to be clarified). Programming hint: in this case, one function performing an (affine) transformation must be coded.
3. Advanced resolution of the linear equations system:
  A radical modification is to solve the NIP linear equations system (lower triangular) with algorithms that ensure that all b_i are estimated with the same precision (unlike the current implementation). Beware: a great deal of preliminary coding is required, or a good mathematical library (does anybody know of perl libs for matrix manipulation?).
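Idea 2 (normalized intervals) essentially requires one affine map and its inverse. The Python sketch below shows one possible shape of such a function; the name `make_affine_map`, its parameters and the default target interval [0,1] are assumptions of this sketch, not a description of a planned NIP.pl routine.

```python
def make_affine_map(x_min, x_max, a=0.0, b=1.0):
    """Return (forward, inverse) mapping [x_min, x_max] onto [a, b] and back."""
    if x_max == x_min:
        raise ValueError("x_min and x_max must differ")
    scale = (b - a) / (x_max - x_min)

    def forward(x):
        return a + (x - x_min) * scale

    def inverse(u):
        return x_min + (u - a) / scale

    return forward, inverse

# Hypothetical usage: project measurements taken on [0, 4] into [0, 1]
forward, inverse = make_affine_map(0.0, 4.0)
print(forward(2.0))    # 0.5
print(inverse(0.5))    # 2.0
```

The NIP would then be calculated on the transformed x-values, and any recalculation point would pass through `forward` before being evaluated.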