Need help with FastTransforms.jl?

Click the “chat” button below for chat support from the developer who created it, or find similar developers for support.

136 Stars 17 Forks Other 401 Commits 22 Opened issues

:rocket: Julia package for fast orthogonal polynomial transforms :snowboarder:

Readme

FastTransforms.jlallows the user to conveniently work with orthogonal polynomials with degrees well into the millions.

This package provides a Julia wrapper for the C library of the same name. Additionally, all three types of nonuniform fast Fourier transforms are available, as well as the Padua transform.

Installation, which uses BinaryBuilder for all of Julia's supported platforms (in particular Sandybridge Intel processors and beyond), may be as straightforward as:

pkg> add FastTransformsjulia> using FastTransforms, LinearAlgebra

The 34 orthogonal polynomial transforms are listed in

FastTransforms.kind2string.(0:33). Univariate transforms may be planned with the standard normalization or with orthonormalization. For multivariate transforms, the standard normalization may be too severe for floating-point computations, so it is omitted. Here are two examples:

julia> c = rand(8192);julia> leg2cheb(c);

julia> cheb2leg(c);

julia> norm(cheb2leg(leg2cheb(c; normcheb=true); normcheb=true)-c)/norm(c) 1.1866591414786334e-14

The implementation separates pre-computation into an

FTPlan. This type is constructed with either

plan_leg2chebor

plan_cheb2leg. Let's see how much faster it is if we pre-compute.

julia> p1 = plan_leg2cheb(c);julia> p2 = plan_cheb2leg(c);

julia> @time leg2cheb(c); 0.433938 seconds (9 allocations: 64.641 KiB)

julia> @time p1*c; 0.005713 seconds (8 allocations: 64.594 KiB)

julia> @time cheb2leg(c); 0.423865 seconds (9 allocations: 64.641 KiB)

julia> @time p2*c; 0.005829 seconds (8 allocations: 64.594 KiB)

Furthermore, for orthogonal polynomial connection problems that are degree-preserving, we should expect to be able to apply the transforms in-place:

julia> lmul!(p1, c);julia> lmul!(p2, c);

julia> ldiv!(p1, c);

julia> ldiv!(p2, c);

Let

Fbe an array of spherical harmonic expansion coefficients with columns arranged by increasing order in absolute value, alternating between negative and positive orders. Then

sph2fourierconverts the representation into a bivariate Fourier series, and

fourier2sphconverts it back. Once in a bivariate Fourier series on the sphere,

plan_sph_synthesisconverts the coefficients to function samples on an equiangular grid that does not sample the poles, and

plan_sph_analysisconverts them back.

julia> F = sphrandn(Float64, 1024, 2047); # convenience methodjulia> P = plan_sph2fourier(F);

julia> PS = plan_sph_synthesis(F);

julia> PA = plan_sph_analysis(F);

julia> @time G = PS*(P*F); 0.090767 seconds (12 allocations: 31.985 MiB, 1.46% gc time)

julia> @time H = P(PA*G); 0.092726 seconds (12 allocations: 31.985 MiB, 1.63% gc time)

julia> norm(F-H)/norm(F) 2.1541073345177038e-15

Due to the structure of the spherical harmonic connection problem, these transforms may also be performed in-place with

lmul!and

ldiv!.

The NUFFTs are implemented thanks to Alex Townsend: -

nufft1assumes uniform samples and noninteger frequencies; -

nufft2assumes nonuniform samples and integer frequencies; -

nufft3 ( = nufft)assumes nonuniform samples and noninteger frequencies; -

inufft1inverts an

nufft1; and, -

inufft2inverts an

nufft2.

Here is an example:

julia> n = 10^4;julia> c = complex(rand(n));

julia> ω = collect(0:n-1) + rand(n);

julia> nufft1(c, ω, eps());

julia> p1 = plan_nufft1(ω, eps());

julia> @time p1*c; 0.002383 seconds (6 allocations: 156.484 KiB)

julia> x = (collect(0:n-1) + 3rand(n))/n;

julia> nufft2(c, x, eps());

julia> p2 = plan_nufft2(x, eps());

julia> @time p2*c; 0.001478 seconds (6 allocations: 156.484 KiB)

julia> nufft3(c, x, ω, eps());

julia> p3 = plan_nufft3(x, ω, eps());

julia> @time p3*c; 0.058999 seconds (6 allocations: 156.484 KiB)

The Padua transform and its inverse are implemented thanks to Michael Clarke. These are optimized methods designed for computing the bivariate Chebyshev coefficients by interpolating a bivariate function at the Padua points on

[-1,1]^2.

julia> n = 200;julia> N = div((n+1)*(n+2), 2);

julia> v = rand(N); # The length of v is the number of Padua points

julia> @time norm(ipaduatransform(paduatransform(v)) - v)/norm(v) 0.007373 seconds (543 allocations: 1.733 MiB) 3.925164683252905e-16

[1] D. Ruiz—Antolín and A. Townsend. A nonuniform fast Fourier transform based on low rank approximation, *SIAM J. Sci. Comput.*, **40**:A529–A547, 2018.

[2] S. Olver, R. M. Slevinsky, and A. Townsend. Fast algorithms using orthogonal polynomials, *Acta Numerica*, **29**:573—699, 2020.

[3] R. M. Slevinsky. Fast and backward stable transforms between spherical harmonic expansions and bivariate Fourier series, *Appl. Comput. Harmon. Anal.*, **47**:585—606, 2019.

[4] R. M. Slevinsky, Conquering the pre-computation in two-dimensional harmonic polynomial transforms, arXiv:1711.07866, 2017.