[Yazlist] Fields ending in combining diacritics

Adam Dickmeiss adam at indexdata.dk
Thu Mar 8 11:05:54 CET 2007


Gary Anderson wrote:
> I recently ran some tests using records from the National Library of 
> Canada.  Of the 600,000+ records in their name and subject authority 
> file, six records had 670 tags where the subfield a data ended in a 
> combining diacritic character with no following character.
> 
> Submitting that data string 
> (indicators+subfieldmark+subfieldcode+data+fieldmark) to siconvert 
> resulted in an output string that did not contain the diacritic 
> character.  It was dropped.  The field mark character was retained.  Can 
> you suggest a means for notifying the caller when this condition 
> occurs?  Byte counts don't really work because UTF8 is one side or the 
> other of the conversion transaction.
> 
> The ending diacritic values were:  0xE2, 0xE5, 0xE8, 0xEA, and 0xF6.
> 
Did you use yaz-marcdump for the conversion?

Or did you do something else ? (such as programming towards the siconv 
interface)?

/ Adam

> Thanks
> Gary
> 
> _______________________________________________
> Yazlist mailing list
> Yazlist at lists.indexdata.dk
> http://lists.indexdata.dk/cgi-bin/mailman/listinfo/yazlist




More information about the Yazlist mailing list