TechEcho

12 comments

kazinator over 1 year ago

<pre><code>$ cat parse256.c
#include <stdio.h>
#include <stdint.h>
#include <string.h>

int parse_uint8_fastswar(const char *str, size_t len, uint8_t *num) {
  union { uint8_t as_str[4]; uint32_t as_int; } digits;
  memcpy(&digits.as_int, str, sizeof(digits));
  digits.as_int ^= 0x30303030lu;
  digits.as_int <<= (4 - (len & 0x3)) * 8;
  uint32_t y = ((UINT64_C(0x640a0100) * digits.as_int)>>32)&0xff;
  *num = (uint8_t)(y);
  return (digits.as_int & 0xf0f0f0f0) == 0 && y < 256 && len != 0 && len < 4;
}

int main(int argc, char **argv) {
  if (argv[1]) {
    uint8_t val = 0;
    if (parse_uint8_fastswar(argv[1], strlen(argv[1]), &val)) {
      printf("value of %s is %d\n", argv[1], (int) val);
    } else {
      printf("%s is invalid, value stored %d\n", argv[1], (int) val);
    }
  }
  return 0;
}
$ ./parse256 123
123 is invalid, value stored 225
$ uname -a
Linux gcc1-power7.osuosl.org 3.10.0-862.14.4.el7.ppc64 #1 SMP Wed Sep 26 20:38:32 GMT 2018 ppc64 ppc64 ppc64 GNU/Linux
</code></pre>

Oops!
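
The `uname` output shows a big-endian ppc64 host. The shift and the multiplier in `parse_uint8_fastswar` assume that `str[0]` lands in the least-significant byte of the word, which a raw `memcpy` only gives you on little-endian machines. A minimal sketch of one way around that, assuming it is acceptable to assemble the word byte by byte. The names `load_le32` and `parse_uint8_le` are hypothetical and this is not the article author's code. It only addresses byte order: like the posted function it reads 4 bytes, and it keeps the same loose digit check and does no range check.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: place str[0] in the least-significant byte
   regardless of the host's byte order. Reads exactly 4 bytes. */
static uint32_t load_le32(const char *str) {
    const unsigned char *p = (const unsigned char *)str;
    return (uint32_t)p[0]
         | ((uint32_t)p[1] << 8)
         | ((uint32_t)p[2] << 16)
         | ((uint32_t)p[3] << 24);
}

/* Same arithmetic as the posted function, fed by load_le32.
   The caller must guarantee that str[0..3] is readable. */
static int parse_uint8_le(const char *str, size_t len, uint8_t *num) {
    if (len == 0 || len > 3) return 0;   /* also rules out a shift by 32 */
    uint32_t digits = load_le32(str) ^ UINT32_C(0x30303030);
    digits <<= (4 - len) * 8;            /* drop bytes past the string */
    uint32_t y = (uint32_t)((UINT64_C(0x640a0100) * digits) >> 32) & 0xff;
    *num = (uint8_t)y;
    return (digits & UINT32_C(0xf0f0f0f0)) == 0;
}
```

On a little-endian host compilers can usually turn `load_le32` into a plain load, though that is an expectation rather than something measured here.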

vinkelhake over 1 year ago

The SWAR algorithm accepts the 6 ASCII characters after '9'. It'll parse ":>" as 114.

<pre><code>int res = parse_uint8_fastswar(":>\0", 2, &num);
</code></pre>

Returns true and num is 114.
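
The arithmetic behind that result: ':' is 0x3a and '>' is 0x3e, so XOR with 0x30 leaves 10 and 14, and 10*10 + 14 = 114. The posted check only tests that the high nibble of each byte is zero, which admits 10 through 15 as "digits". A sketch of a stricter check follows, assuming the word has already been XORed with 0x30303030. The function names are hypothetical, and this is one possible fix rather than necessarily the one the article uses.

```c
#include <stdint.h>

/* Check as posted: every byte is in 0..15. The six characters
   ':' through '?' map to 10..15 and therefore pass. */
static int loose_digit_check(uint32_t x) {
    return (x & UINT32_C(0xf0f0f0f0)) == 0;
}

/* Stricter variant: adding 6 pushes a byte in 10..15 up to 16..21,
   which sets a high-nibble bit, while 0..9 stays within 6..15.
   OR-ing with x keeps bytes that were already above 15 rejected,
   even if the addition carries from one byte into the next. */
static int strict_digit_check(uint32_t x) {
    return ((x | (x + UINT32_C(0x06060606))) & UINT32_C(0xf0f0f0f0)) == 0;
}
```

With the little-endian word for ":>" after the XOR (0x0e0a), `loose_digit_check` accepts and `strict_digit_check` rejects.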

jstanley over 1 year ago

I tried this out but it parsed the string "123\n" as 32.

Also it parses "400" as 144, when the reference implementation considers it not-a-uint8, but I don't mind so much about that.

EDIT: Ah, I think it assumes the string contains only a uint8, rather than trying to parse a uint8 from the start of a string. So you need to zero out the "\n" separately, and then it works.
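
The 144 is 400 modulo 256. In the function posted at the top of the thread, `y` is masked with `0xff` before the `y < 256` test, so that test can never fail. A sketch of a range check that works on the digit values themselves, under the assumption that the three digits have already been validated as 0..9. Both helper names are hypothetical.

```c
#include <stdint.h>

/* d1, d2, d3 are digit values (0..9), most significant first.
   Packed one digit per byte, the words order the same way the
   numbers do, so one comparison against packed 2,5,5 is enough. */
static int fits_in_uint8(unsigned d1, unsigned d2, unsigned d3) {
    uint32_t packed = ((uint32_t)d1 << 16) | ((uint32_t)d2 << 8) | (uint32_t)d3;
    return packed <= UINT32_C(0x020505);
}

/* What the masked multiply effectively stores: the value modulo 256. */
static uint8_t truncated_value(unsigned d1, unsigned d2, unsigned d3) {
    return (uint8_t)((100u * d1 + 10u * d2 + d3) & 0xffu);
}
```

For the digits 4, 0, 0 `truncated_value` gives 144 and `fits_in_uint8` returns 0.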

firebaze over 1 year ago

Looking forward to "parsing bit sequences in roman literals quickly" <a href="https://hn.algolia.com/?q=lemire+parsing" rel="nofollow noreferrer">https://hn.algolia.com/?q=lemire+parsing</a>

zokier over 1 year ago

The use of union here is a bit confusing (I think it's unnecessary?), although I don't imagine it making any difference in the generated code.

nasretdinov over 1 year ago

I imagine that you are not allowed to allocate a constant array that would contain a mapping between ASCII values of integers and the actual ints :)? There are just 255 of them needed. Or would it be slower?

blacklion over 1 year ago

Guess the site by title :-) Good as usual.

IshKebab over 1 year ago

Presumably this also only works well if the data is 4-byte aligned.

nvartolomei over 1 year ago

dlemire, you note that the read "overflows". Why can't you copy just `len` bytes? Does it slow things down too much because of the branch or the extra load/store operations?
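
For reference, a sketch of what copying just `len` bytes could look like, so the trade-off in the question is concrete. The helper name `load_exact` is hypothetical and not taken from the article. The variable-length `memcpy` is where a branch or a library call may appear; whether that costs measurable time is not something this sketch settles.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical alternative load: copy only the bytes the caller owns
   into a zeroed buffer, so nothing past the end of the string is read.
   The word is assembled with str[0] in the least-significant byte. */
static uint32_t load_exact(const char *str, size_t len) {
    unsigned char buf[4] = {0, 0, 0, 0};
    if (len > sizeof(buf)) len = sizeof(buf);  /* never overrun buf */
    memcpy(buf, str, len);
    return (uint32_t)buf[0]
         | ((uint32_t)buf[1] << 8)
         | ((uint32_t)buf[2] << 16)
         | ((uint32_t)buf[3] << 24);
}
```

Because the unused bytes are zero rather than whatever follows the string, trailing bytes such as a newline never reach the word.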

amelius over 1 year ago

Fetching the data should be the bottleneck (by far), so why is the naive approach 2x slower than this smarter approach?

Sounds like the CPU should be designed in a smarter way, not the code.

doubloon over 1 year ago

I am so confused. "You are given a string and its length"... I don't understand. Is the string like "1,1,22,2,189,3,12,2,120,3"?

aunwick over 1 year ago

Wow! 1st year digital logic question makes news on YC?

Parsing 8-bit integers quickly
