[Yazlist] Fields ending in combining diacritics
adam at indexdata.dk
Thu Mar 8 11:05:54 CET 2007
Gary Anderson wrote:
> I recently ran some tests using records from the National Library of
> Canada. Of the 600,000+ records in their name and subject authority
> file, six records had 670 tags where the subfield a data ended in a
> combining diacritic character with no following character.
> Submitting that data string
> (indicators+subfieldmark+subfieldcode+data+fieldmark) to siconvert
> resulted in an output string that did not contain the diacritic
> character. It was dropped. The field mark character was retained. Can
> you suggest a means for notifying the caller when this condition
> occurs? Byte counts don't really work because UTF8 is one side or the
> other of the conversion transaction.
> The ending diacritic values were: 0xE2, 0xE5, 0xE8, 0xEA, and 0xF6.
Did you use yaz-marcdump for the conversion?
Or did you do something else ? (such as programming towards the siconv
> Yazlist mailing list
> Yazlist at lists.indexdata.dk
More information about the Yazlist