Skip to content

Instantly share code, notes, and snippets.

@vurtun
Last active December 25, 2019 10:55
Show Gist options
  • Save vurtun/2fb0ed9d3319f6cf21c2aebd42da8dfc to your computer and use it in GitHub Desktop.
Save vurtun/2fb0ed9d3319f6cf21c2aebd42da8dfc to your computer and use it in GitHub Desktop.
static char*
str_chr(char * str, int chr)
{
char *s = str;
int c = chr & 0xFF;
unsigned m = (c << 24)|(c << 16)|(c << 8)|c;
for (;;) {
while (((uintptr_t)s) & 3) {
chk1: if (s[0] == c) return s;
chk2: if (s[0] == 0) return s;
++s;
}
for (;;) {
unsigned v = *(unsigned*)s;
unsigned c = (~v) & 0x80808080;
if (((v ^ m) - 0x01010101) & c) goto chk1;
if ((v - 0x01010101) & c) goto chk2;
s += 4;
}
}
}
static int
str_len(const char *str)
{
const char *s = str;
while (((uintptr_t)s & 3)) {
chk: if (*s == 0) return s-str;
s++;
}
for (;;) {
unsigned int v = *(unsigned int*)s;
unsigned int c = ~v & 0x10101010;
if ((v - 0x01010101) & c) goto chk;
s += 4;
} return s-str;
}
@dumblob
Copy link

dumblob commented Jan 5, 2018

Hm, the line https://gist.github.com/vurtun/2fb0ed9d3319f6cf21c2aebd42da8dfc#file-bit_trick-c-L5 (unsigned m = (c << 24)|(c << 16)|(c << 8)|c;) assumes a 32bit or bigger integer. Is this a correct assumption?

@vurtun
Copy link
Author

vurtun commented Jan 5, 2018

Yeah this whole thing is undefined behavior. In theory this line:

 unsigned v = *(unsigned*)s;

Is already undefined. As for unsigned m = (c << 24)|(c << 16)|(c << 8)|c; at this point all
architectures outside microcontrollers have int/unsigned 4 byte so I don't care anymore. Heck if you
look at all commonly supported hardware for linux you will not find a single instance with int/unsigned being smaller
than 4 byte.

@dumblob
Copy link

dumblob commented Jan 6, 2018

...is already undefined.

Didn't read that far ;)

Btw where do you (want to) use this code? I'm certain, that the library functions are faster on most/all CPUs.

@vurtun
Copy link
Author

vurtun commented Jan 7, 2018

Btw where do you (want to) use this code? I'm certain, that the library functions are faster on most/all CPUs.

The trick is from here: https://github.com/nothings/stb/blob/master/stb_sprintf.h#L306.
Based on: https://graphics.stanford.edu/~seander/bithacks.html#HasLessInWord.

I spend some time trying to understanding how it works. So I wrote these two to:
1.) Confirm I understood how it works
2.) Create a reference if I ever need it

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment