科技回声

12 条评论

corsix大约 1 年前

AArch64 NEON has the URSQRTE instruction, which gets closer to the OP's question than you might think; view a 32-bit value as a fixed-precision integer with 32 fractional bits (so the representable range is evenly spaced 0 through 1-ε, where ε=2^-32), then URSQRTE computes the approximate inverse square root, halves it, then clamps it to the range 0 through 1-ε. Fixed-precision integers aren't quite integers, and approximate inverse square root isn't quite square root, but it might get you somewhere close.The related FRSQRTE instruction is much more conventional, operating on 32-bit floats, again giving approximate inverse square root.

评论 #39961911 未加载

fjfaase大约 1 年前

Is it possible in a single clock-cycle. Yes, with a very large lookup table. It is probably possible to reduce the size depending on how many serial logical gates can be executed within the clock-cycle. Think about that the binary root of 10000 is rather similar to that of 100 only with respect to different number of zero's.

评论 #39960499 未加载

评论 #39960361 未加载

评论 #39964434 未加载

评论 #39960352 未加载

评论 #39962554 未加载

msarnoff大约 1 年前

If you wanted to expand the definition of “processor” to electromechanical contraptions, the Friden SRQ could perform square roots using just additions and shifts, with not a single electronic component other than a motor. And since you had to position the decimal points manually, it would _technically_ be an integer operation…Video: <a href="https://youtu.be/o44a1ao5h8w" rel="nofollow">https://youtu.be/o44a1ao5h8w</a>

评论 #39964388 未加载

treprinum大约 1 年前

Can't you use the sequence 1 + 3 + 5 + ... + 2k + 1 to get the integer square root of any integer number? It's basically the k of the nearest lower number to your number in this sequence.

评论 #39960485 未加载

评论 #39961253 未加载

评论 #39960695 未加载

评论 #39960312 未加载

kelnos大约 1 年前

This bit in an answer further down made me chuckle:> My implementation of square root using binary search, that doesn't depend on a multiplier. Only basic ALU instructions are used. It is vigorously undocumented. I have no idea what I wrote but it seems to work.A fine reminder that if we write clever code, we're probably not going to remember how it works.

moomin大约 1 年前

You need to read down a bit, but the answer “ENIAC” is hilarious.

评论 #39965353 未加载

评论 #39960944 未加载

Dwedit大约 1 年前

2 ^ (1/2 * Log2(X)) = sqrt(X)You can get a really really rough approximation if you replace Log2(x) with 'count leading zeroes'. With a better approximation of Log(2), you can get closer to the answer.

dahart大约 1 年前

For an approximate (very rough) answer, as opposed to one accurate to the nearest integer, a right shift by half the number of bits of the leading 1’s position will do, and of course nearly every processor has a shift instruction. I’m not sure how often processors haven’t had something like FLO (Find Leading One) or FFS (Find First Set) instruction, those seem ubiquitous as well.The super rough approximation for some uses can be approximately as good as an accurate answer. When you just need a decent starting place for some further Newton-Raphson iteration, for example. (Of course the right-shift trick is a nice way to seed a more accurate square root calculation. :P)

评论 #39961135 未加载

评论 #39962374 未加载

bryanlarsen大约 1 年前

IIRC, most (all?) fixed point DSP's have a square root instruction and/or helper instructions.

jlarcombe大约 1 年前

Semi-related and of interest to 6502 fans, exhaustive analysis of square root algorithms: <a href="https://github.com/TobyLobster/sqrt_test">https://github.com/TobyLobster/sqrt_test</a>

pajko大约 1 年前

ARM VFP has VSQRT<a href="https://developer.arm.com/documentation/dui0473/m/vfp-instructions/vsqrt" rel="nofollow">https://developer.arm.com/documentation/dui0473/m/vfp-instru...</a>

Pet_Ant大约 1 年前

I'm sure the VAX must have? (if we are including microcode)

评论 #39972198 未加载

12 条评论

corsix大约 1 年前

评论 #39961911 未加载

fjfaase大约 1 年前

评论 #39960499 未加载

评论 #39960361 未加载

评论 #39964434 未加载

评论 #39960352 未加载

评论 #39962554 未加载

msarnoff大约 1 年前

评论 #39964388 未加载

treprinum大约 1 年前

Can't you use the sequence 1 + 3 + 5 + ... + 2k + 1 to get the integer square root of any integer number? It's basically the k of the nearest lower number to your number in this sequence.

评论 #39960485 未加载

评论 #39961253 未加载

评论 #39960695 未加载

评论 #39960312 未加载

kelnos大约 1 年前

moomin大约 1 年前

You need to read down a bit, but the answer “ENIAC” is hilarious.

评论 #39965353 未加载

评论 #39960944 未加载

Dwedit大约 1 年前

dahart大约 1 年前

评论 #39961135 未加载

评论 #39962374 未加载

bryanlarsen大约 1 年前

IIRC, most (all?) fixed point DSP's have a square root instruction and/or helper instructions.

jlarcombe大约 1 年前

Semi-related and of interest to 6502 fans, exhaustive analysis of square root algorithms: <a href="https://github.com/TobyLobster/sqrt_test">https://github.com/TobyLobster/sqrt_test</a>

pajko大约 1 年前

ARM VFP has VSQRT<a href="https://developer.arm.com/documentation/dui0473/m/vfp-instructions/vsqrt" rel="nofollow">https://developer.arm.com/documentation/dui0473/m/vfp-instru...</a>

Pet_Ant大约 1 年前

I'm sure the VAX must have? (if we are including microcode)

评论 #39972198 未加载

Did any processor implement an integer square root instruction?

12 条评论

Did any processor implement an integer square root instruction?

12 条评论