From: Adam Dickmeiss Date: Fri, 1 Apr 2011 11:48:25 +0000 (+0200) Subject: Avoid skip of non-ASCII chars in skip article code X-Git-Tag: v1.5.6~7^2^2~1 X-Git-Url: http://lists.indexdata.dk/?a=commitdiff_plain;h=311cf61ceacaa95dd68019f8345faf17b652a218;p=pazpar2-moved-to-github.git Avoid skip of non-ASCII chars in skip article code The code used isalnum to skip leading white space. This could lead to skip of whole string if that contained anything but 7-bit characters - and as a result it would skip the whole string buffer (strstr would find " "). --- diff --git a/src/charsets.c b/src/charsets.c index 7bbe102..d70c11d 100644 --- a/src/charsets.c +++ b/src/charsets.c @@ -184,8 +184,6 @@ void pp2_relevance_first(pp2_relevance_token_t prt, char *pout = firstword; char articles[] = "the den der die des an a "; // must end in space - while (*p && !isalnum(*(unsigned char *)p)) - p++; for (; *p && *p != ' ' && pout - firstword < (sizeof(firstword)-2); p++) *pout++ = tolower(*(unsigned char *)p); *pout++ = ' ';