Author Topic: Lightweight sin(2·π·x), cos(2·π·x) for MCUs with float hardware (Read 5839 times)

Nominal Animal · « **Reply #25 on:** June 18, 2024, 07:51:50 pm »

Quote from: NorthGuy on June 18, 2024, 05:47:20 pm

By using more Taylor members you can decrease the error to any arbitrary value.

No, you cannot. That only works if you have sufficient precision in your representation of individual terms, compared to that arbitrary error limit.

Essentially, each term needs to have less error (compared to the exact value) than the arbitrary limit; typically less than half the arbitrary limit value.
With floating point types, this means that you generally cannot get less than ±1 ULP of error, and to get within the standard 0.5 ULP of error, you need to use higher precision for the terms. If you have N terms similar in magnitude, you can expect ±N ULP of error.

Simply put, the problem is that with floats, 10000000 + 0.5 == 10000000.
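
A minimal C sketch of that absorption, assuming IEEE 754 binary32 floats with the default round-to-nearest-even mode. The helper name is_absorbed is made up for illustration:

```c
#include <stdbool.h>

/* Returns true if adding small to big leaves big unchanged in float
   arithmetic. Hypothetical helper, for illustration only. */
static bool is_absorbed(float big, float small)
{
    /* volatile forces the sum to be rounded and stored as a float,
       even on targets that evaluate expressions in higher precision. */
    volatile float sum = big + small;
    return sum == big;
}
```

Here is_absorbed(10000000.0f, 0.5f) is true: floats between 2^23 and 2^24 are spaced 1 apart, and the exact tie rounds to the even neighbor. By contrast, is_absorbed(1000000.0f, 0.5f) is false, because the spacing near one million is 0.0625.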

NorthGuy · « **Reply #26 on:** June 18, 2024, 09:00:08 pm »

Quote from: Nominal Animal on June 18, 2024, 07:51:50 pm

Quote from: NorthGuy on June 18, 2024, 05:47:20 pm
By using more Taylor members you can decrease the error to any arbitrary value.
No, you cannot. That only works if you have sufficient precision in your representation of individual terms, compared to that arbitrary error limit.

Essentially, each term needs to have less error (compared to the exact value) than the arbitrary limit; typically less than half the arbitrary limit value.
With floating point types, this means that you generally cannot get less than ±1 ULP of error, and to get within the standard 0.5 ULP of error, you need to use higher precision for the terms. If you have N terms similar in magnitude, you can expect ±N ULP of error.

It's a matter of technique. You start from the back (from last members) where floating point numbers have excess precision because the values are much smaller than the final result and then you move towards bigger members. This way the rounding error does not accumulate.

Postal2 · « **Reply #27 on:** June 18, 2024, 09:48:20 pm »

Angle sensor for your project:
https://www.allegromicro.com/~/media/files/datasheets/a1333-datasheet.pdf

Nominal Animal · « **Reply #28 on:** June 19, 2024, 06:53:36 am »

Quote from: NorthGuy on June 18, 2024, 09:00:08 pm

It's a matter of technique. You start from the back (from last members) where floating point numbers have excess precision because the values are much smaller than the final result and then you move towards bigger members. This way the rounding error does not accumulate.

Again, no. That, and similar techniques like Kahan summation, will help you get down to ±1 ULP or so, but no more.
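
For reference, a minimal sketch of Kahan (compensated) summation next to plain accumulation. It assumes float expressions are evaluated in float precision (FLT_EVAL_METHOD 0) and that the compiler is not allowed to reassociate floating-point operations (no -ffast-math). The function names are illustrative only:

```c
#include <stddef.h>

/* Plain left-to-right accumulation in float. */
static float sum_naive(float term, size_t count)
{
    float sum = 0.0f;
    for (size_t i = 0; i < count; i++)
        sum += term;
    return sum;
}

/* Kahan (compensated) summation: c carries the low-order bits
   that were lost when the previous term was added to sum. */
static float sum_kahan(float term, size_t count)
{
    float sum = 0.0f, c = 0.0f;
    for (size_t i = 0; i < count; i++) {
        float y = term - c;   /* apply the stored correction */
        float t = sum + y;    /* low-order bits of y are lost here */
        c = (t - sum) - y;    /* recover what was lost */
        sum = t;
    }
    return sum;
}
```

Summing 0.1f ten million times, sum_naive drifts far from 1,000,000, while sum_kahan stays close to the exact sum. The compensated result is still a float, though, so it cannot be more accurate than the float nearest to the exact sum.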

This is because the terms largest in magnitude cannot have too much error for the sum to be within the acceptable error bounds. Lesser terms cannot compensate for loss of information from the greater terms.

In mathematical terms, we are computing polynomials of form
$$S_n = \sum_{k=0}^n s_k x^k$$
but with floating point types, we actually calculate
$$\tilde{S}_n = \sum_{k=0}^n \left( s_k + \epsilon_{0,k} \right) \left( x^k + \epsilon_{1,k} \right)$$
where \$\epsilon_{0,k}\$ is the error in the coefficient \$s_k\$, and \$\epsilon_{1,k}\$ is the error in \$x^k\$. Expanding the sum, we can separate out the error terms:
$$\begin{aligned}
\tilde{S}_n &= \sum_{k=0}^n s_k x^k \\
~ &+ \sum_{k=0}^n \epsilon_{0,k} x^k \\
~ &+ \sum_{k=0}^n \epsilon_{1,k} s_k \\
~ &+ \sum_{k=0}^n \epsilon_{0,k} \epsilon_{1,k} \\
\end{aligned}$$
The product of the errors \$\epsilon_{0,k} \epsilon_{1,k}\$ is obviously very small, and is generally not a problem at all.

Because \$s_k\$ are the constant coefficients, \$\epsilon_{0,k}\$ are constant as well; but since \$x^k\$ depends on \$x\$, so does \$\epsilon_{1,k}\$.

That means the error sum \$\sum_{k=0}^n \epsilon_{0,k} x^k\$ depends on \$x\$, as does the error sum \$\sum_{k=0}^n \epsilon_{1,k} s_k\$, but their dependence on \$x\$ is so different that you cannot make them cancel out.

Increasing \$n\$ will not cause the already accrued error to go away.

With fixed point arithmetic, summing \$n\$ terms of \$B\$ bits each will yield a result with \$B + \lceil \log_2 n \rceil\$ bits. Only one additional bit is needed for proper half-ULP rounding, so generally, you can get the absolute error below any arbitrary limit by using just a few additional bits compared to the absolute error limit.
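
The bit-growth rule can be sketched in C as follows. This assumes unsigned terms, and the helper names ceil_log2 and sum_bits are invented for this example:

```c
#include <stdint.h>

/* Smallest b such that 2^b >= n, i.e. ceil(log2(n)) for n >= 1. */
static unsigned ceil_log2(uint32_t n)
{
    unsigned b = 0;
    while (((uint64_t)1 << b) < n)
        b++;
    return b;
}

/* Bits sufficient to hold the sum of n unsigned terms of term_bits each. */
static unsigned sum_bits(unsigned term_bits, uint32_t n)
{
    return term_bits + ceil_log2(n);
}
```

For example, sum_bits(16, 8) is 19, and sum_bits(16, 65536) is 32, so a 32-bit accumulator can hold the exact sum of 65536 unsigned 16-bit terms.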

Nominal Animal · « **Reply #29 on:** June 19, 2024, 07:18:08 am »

NorthGuy: To clarify, the techniques you mentioned are excellent, and I too do recommend their use. Thank you for pointing them out!

I am only arguing that "to an arbitrary limit with simply adding more terms" is not correct, when using limited precision representations like floating-point and fixed-point formats. Because of the limited precision in each term, there is a limit to how many terms can affect the sum; and no term can "repair" the error in other terms.
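
A small sketch of that limit, using a forward-summed Taylor series for sin(x) evaluated entirely in float. The function name taylor_sin and the way each term is derived from the previous one are choices made for this illustration, not something taken from the thread:

```c
/* Forward-summed Taylor series for sin(x) with n terms, all in float.
   Each term is derived from the previous one:
   t_k = -t_{k-1} * x^2 / ((2k) * (2k + 1)). */
static float taylor_sin(float x, int n)
{
    float term = x;
    float sum = x;
    for (int k = 1; k < n; k++) {
        term *= -(x * x) / (float)((2 * k) * (2 * k + 1));
        sum += term;  /* terms far below half an ULP of sum change nothing */
    }
    return sum;
}
```

With this sketch, taylor_sin(1.0f, 10) and taylor_sin(1.0f, 20) return exactly the same float: the later terms are far smaller than half an ULP of the running sum, so adding them has no effect, and whatever rounding error the first terms left behind stays in the result.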

ali_asadzadeh · « **Reply #30 on:** June 19, 2024, 08:43:17 am »

Nominal Animal, thanks for sharing, would you explain what ±1 ULP means?

Picuino · « **Reply #31 on:** June 19, 2024, 08:59:43 am »

Quote from: Nominal Animal on June 16, 2024, 07:40:35 pm

...
over all \$1,056,964,609\$ normal (non-subnormal) IEEE 754 single-precision (float aka Binary32) values between 0 and 1.

How do you manage to test all floating values between 0 and 1? (If you post the test program, the rest of us can test our routines with the same pattern).

gf · « **Reply #32 on:** June 19, 2024, 09:05:19 am »

Quote from: Picuino on June 19, 2024, 08:59:43 am

How do you manage to test all floating values between 0 and 1?

In C/C++, you can use nextafter() or nextafterf() to iterate over representable floating point numbers.
https://en.cppreference.com/w/c/numeric/math/nextafter

Nominal Animal · « **Reply #33 on:** June 19, 2024, 09:43:06 am »

Quote from: ali_asadzadeh on June 19, 2024, 08:43:17 am

Nominal Animal, thanks for sharing, would you explain what ±1 ULP means?

±1 ULP = ±1 Unit in the Last Place, i.e. one unit in the least significant digit of the representation. It is the smallest possible representable change in a floating-point value.

In C, nextafterf(a, HUGE_VALF) increments a by 1 ULP, yielding the closest value greater than a.
nextafterf(a, -HUGE_VALF) decrements a by 1 ULP, yielding the closest value less than a.

The exact size of that change depends on the magnitude of a.

To find the error of a float value compared to the expected nonzero value expect, in units of ULPs of the expected value (somewhat like value-expect, but scaled), you can use

