Adam Dickmeiss [Thu, 8 Nov 2007 17:15:13 +0000 (17:15 +0000)]
Avoid locale member in icu_tokenizer.
The locale member of icu_tokenizer is not needed.
Adam Dickmeiss [Thu, 8 Nov 2007 17:08:10 +0000 (17:08 +0000)]
Function icu_casemap_create no longer takes locale parameter.
Function icu_casemap_create no longer takes locale parameter. Instead,
icu_casemap_casemap takes a locale parameter.
Adam Dickmeiss [Thu, 8 Nov 2007 13:35:13 +0000 (13:35 +0000)]
Refactor ICU WRT locale.
The locale parameter is no longer a parameter for icu_chain_xml_config
and locale is read as an attribute for top XML node. The icu_casemap
locale is no longer a member and a parater for icu_casemap_casemap
instead. This saves a copy of a locale in each casemap.
Adam Dickmeiss [Thu, 8 Nov 2007 09:29:38 +0000 (09:29 +0000)]
Public header icu.h, private header icu_I18N.h .
Adam Dickmeiss [Thu, 8 Nov 2007 09:07:14 +0000 (09:07 +0000)]
Indent
Adam Dickmeiss [Thu, 8 Nov 2007 09:03:11 +0000 (09:03 +0000)]
Changed header - part of YAZ not Pazpar2
Adam Dickmeiss [Thu, 8 Nov 2007 08:17:17 +0000 (08:17 +0000)]
For ICU enabled YAZ defined YAZ_HAVE_ICU instead of HAVE_ICU.
Adam Dickmeiss [Wed, 7 Nov 2007 10:19:12 +0000 (10:19 +0000)]
For ICU, set ICU status to OK for some public functions.
The ICU status code, UErrorCode, is now set to U_ZERO_ERROR in
the preamble of functions icu_chain_create, icu_chain_xml_config
and icu_chain_next_token.
Adam Dickmeiss [Wed, 7 Nov 2007 09:50:24 +0000 (09:50 +0000)]
ICU element index and sortkey no longer available.
Adam Dickmeiss [Tue, 6 Nov 2007 10:27:30 +0000 (10:27 +0000)]
For ICU, locale, is a char ptr rather than uint8_t ptr.
The type uint8_t is a ICU type that we don't want to use if we
can avoid it.
Adam Dickmeiss [Thu, 1 Nov 2007 10:32:43 +0000 (10:32 +0000)]
Fixed printf of pointer
Adam Dickmeiss [Thu, 1 Nov 2007 10:29:28 +0000 (10:29 +0000)]
Fixed link to OID tools
Adam Dickmeiss [Wed, 31 Oct 2007 21:58:07 +0000 (21:58 +0000)]
Fixed bug #1122: Certain record IDs are not found by searches.
The CQL parser preserves terms completely and \\s are no longer
removed. This changes CQL output for some test cases and so these
are updated accordingly. The CQL to PQF transform only treats
* and ? operators if they aren't escaped. A memory leak was also
fixed for the ANSIZ39.58 pattern case.
Marc Cromme [Mon, 29 Oct 2007 13:50:57 +0000 (13:50 +0000)]
changed ICU chain to return all tokens, also those which became empty under normalization. These might still have some display terms attached to them, which are to be used by the snippet display stuff or the scan term display stuff.
Marc Cromme [Mon, 29 Oct 2007 10:22:22 +0000 (10:22 +0000)]
renamed ICU chain functions to the following names relation the operation
to the actually processed token
int icu_chain_token_number(struct icu_chain * chain);
const char * icu_chain_token_display(struct icu_chain * chain);
const char * icu_chain_token_norm(struct icu_chain * chain);
const char * icu_chain_token_sortkey(struct icu_chain * chain);
Adam Dickmeiss [Mon, 29 Oct 2007 09:16:32 +0000 (09:16 +0000)]
Extended auth command so that NULL User, pass may be set.
This is to solve bug #1820.
Marc Cromme [Thu, 25 Oct 2007 10:04:32 +0000 (10:04 +0000)]
added special case with an empty ICU chain for 'raw' index processing without call of transliterators, case maps or tokenizators. It's optimized such that the following happens:
if (!chain->sort), no utf8 to utf16 translation occurs, and the original cstring is returned for the icu_chain-get_norm8() call.
if (chain->sort), utf8 to utf16 translation is made, the original cstring is returned for the icu_chain-get_norm8() call, and the utf16->sortkey conversion is run previous of calling icu_chain-get_sort8().
Marc Cromme [Thu, 25 Oct 2007 08:42:21 +0000 (08:42 +0000)]
simplified test case XML config root element tags
Marc Cromme [Thu, 25 Oct 2007 08:40:06 +0000 (08:40 +0000)]
removed dead code from ICU files
Marc Cromme [Thu, 25 Oct 2007 08:32:50 +0000 (08:32 +0000)]
added sortkey flag to icu_chain constructor
Adam Dickmeiss [Wed, 24 Oct 2007 22:10:24 +0000 (22:10 +0000)]
Avoid per-target CPPFLAGS.
If we use per-target flags, AM_PROG_CC_C_O is required or we just
dont it.
Marc Cromme [Wed, 24 Oct 2007 14:48:17 +0000 (14:48 +0000)]
Isolated ICU locale out of XML config file. Locale is now the second argument to chain = icu_chain_xml_config(xml_node, (uint8_t *) "en", &status);
Adam Dickmeiss [Wed, 24 Oct 2007 13:50:02 +0000 (13:50 +0000)]
Added yaz_match_glob
Marc Cromme [Wed, 24 Oct 2007 13:23:34 +0000 (13:23 +0000)]
fixed wrong token count when tokens disappear with ICU normalization
Marc Cromme [Wed, 24 Oct 2007 07:41:48 +0000 (07:41 +0000)]
fixed memory violation in ICU chain code, restructuring tokenizer to allow for non-empty tokens still ongoing work
Marc Cromme [Tue, 23 Oct 2007 07:51:57 +0000 (07:51 +0000)]
surpressed ICU test tokenization output
Adam Dickmeiss [Mon, 22 Oct 2007 17:52:09 +0000 (17:52 +0000)]
File m4/ac_check_icu.m4 part of dist.
Adam Dickmeiss [Mon, 22 Oct 2007 17:32:07 +0000 (17:32 +0000)]
Make ICU stuff compatible with old C comps. YAZ License.
Adam Dickmeiss [Mon, 22 Oct 2007 12:50:52 +0000 (12:50 +0000)]
Fixed lalibs output for ICU mode.
Adam Dickmeiss [Mon, 22 Oct 2007 12:40:04 +0000 (12:40 +0000)]
Debian package 3.0.15-1 with ICU.
Adam Dickmeiss [Mon, 22 Oct 2007 12:21:38 +0000 (12:21 +0000)]
Added ICU chain component - which used to be part of Pazpar2.
Adam Dickmeiss [Tue, 16 Oct 2007 10:45:53 +0000 (10:45 +0000)]
Removed material about Object identifiers for YAZ 2.
Adam Dickmeiss [Mon, 15 Oct 2007 20:45:46 +0000 (20:45 +0000)]
Added a record for testing of bug #1778.
Adam Dickmeiss [Mon, 15 Oct 2007 20:45:05 +0000 (20:45 +0000)]
Fixed bug #1778: Extended ASCII not escaped always (UTF-8 to MARC-8).
When in non-ASCII mode, YAZ now escapes combined character sequences.
A test was added in tsticonv.c.
Adam Dickmeiss [Mon, 15 Oct 2007 12:00:31 +0000 (12:00 +0000)]
Increase intermediate buffers for wrbuf+iconv.
The intermediate buffer for wrbuf+iconv is increased from 12 to 16
bytes. There may be MARC-8 sequence overflows overwise - with at
least reset (flush).
Adam Dickmeiss [Fri, 12 Oct 2007 14:22:19 +0000 (14:22 +0000)]
Re-organized MARC-8 charset creating code a bit.
Re-organized MARC-8 charset creating code a bit.. A define + code to work
on bug #1778 was added.
Adam Dickmeiss [Tue, 9 Oct 2007 06:00:55 +0000 (06:00 +0000)]
Revised types for functions cs_complete_auto, etc.
The functions cs_complete_auto, cs_complete_auto_head takes 'const char *'
as buffer rather than 'const unsigned char *'. This makes some casts
unnecessary. This also fixes tst_comstack which used to test
cs_complete_http. This function, howver is no longer public, so
cs_complete_auto is used instead.
Adam Dickmeiss [Sun, 7 Oct 2007 08:53:26 +0000 (08:53 +0000)]
Added HTTP tunnel facility for COMSTACK, bug #1752.
This is a facility that allows a Web proxy, such as squid, to tunnel
Z39.50 traffic. This facility is "transparent" to must applications
using YAZ. It's enabled by using pseudo transport connect: followed bý
the we proxy address, followed by command, then follwed by he regular
"virtual" addresss. For example,
connect:webproxy.com:3128,tcp:z3950.loc.gov:7090/voyager .
Adam Dickmeiss [Fri, 5 Oct 2007 16:46:55 +0000 (16:46 +0000)]
Implemented function cs_create_host.
Implemented function cs_create_host parses a YAZ host component. This
is to be used for HTTP Connect tunneling (bug #1752).
Adam Dickmeiss [Fri, 5 Oct 2007 15:11:36 +0000 (15:11 +0000)]
Changed blank (0x20) handling conversion to MARC-8.
The conversion to MARC-8 now always escapes the BLANK character outside
Basic ASCII. This is stricly speaking not necessary - blank is part
of "all" pages (sets). But some tools cannot handle it. YAZ, however,
treats blank as blank in all pages when MARC-8 is read (converted from).
Adam Dickmeiss [Wed, 26 Sep 2007 19:14:04 +0000 (19:14 +0000)]
Bump to 3.0.15.
Adam Dickmeiss [Tue, 25 Sep 2007 07:34:03 +0000 (07:34 +0000)]
Combine test from tstmarciso.sh and tstmarcxml.sh to tstmarc.sh .. Mostly
because the latter requires output from the former (so make -j2 could
break).
Adam Dickmeiss [Mon, 24 Sep 2007 22:22:13 +0000 (22:22 +0000)]
Update this test due to fix for bug #1666.
Adam Dickmeiss [Mon, 24 Sep 2007 12:51:10 +0000 (12:51 +0000)]
Fixed bug with shadowing of variable in lookup_marc8. Bug introduced
in previous commit (1.44).
Adam Dickmeiss [Sun, 23 Sep 2007 07:40:13 +0000 (07:40 +0000)]
Document more yaz-marcdump options
Adam Dickmeiss [Sun, 23 Sep 2007 07:21:51 +0000 (07:21 +0000)]
Added MARC-8 encoded Arabic record, marc9.marc, to test that blanks are
preserved.
Added UTF-8 encoded Chinese record, marccol3.u8.marc, which has the
Ideographic space sequence it.
Adam Dickmeiss [Sat, 22 Sep 2007 18:56:22 +0000 (18:56 +0000)]
Added test for conversion of utf8 e38080 to marc8 212321 (Ideographic
space).
Adam Dickmeiss [Sat, 22 Sep 2007 18:55:02 +0000 (18:55 +0000)]
Remove duplicates (already existing) in flat entries for MARC-8
tries. Previously only duplicates in index tries were treated.
This ensures that only first sequence (of multiple) is used for
conversion and takes precedence. For example utf E38080 is
converted then converted to marc8 212321 (not marc8 212320).
Adam Dickmeiss [Sat, 22 Sep 2007 18:49:55 +0000 (18:49 +0000)]
Treat space (20X) as present in all MARC-8 charsets
Adam Dickmeiss [Fri, 21 Sep 2007 16:04:48 +0000 (16:04 +0000)]
Fixed compilation for absent Libxml2.
Adam Dickmeiss [Fri, 21 Sep 2007 09:24:17 +0000 (09:24 +0000)]
Towards 3.0.14
Adam Dickmeiss [Fri, 21 Sep 2007 07:10:08 +0000 (07:10 +0000)]
Initialize diag_messÃage member of ZOOM_record
Adam Dickmeiss [Thu, 20 Sep 2007 17:22:45 +0000 (17:22 +0000)]
Fixed bug in conversion to XML from MARC: Non-XML characters in control
fields were not removed.
Adam Dickmeiss [Tue, 18 Sep 2007 19:19:16 +0000 (19:19 +0000)]
Removed reference to url.srw
Adam Dickmeiss [Tue, 18 Sep 2007 13:05:18 +0000 (13:05 +0000)]
Versino 3.0.13
Adam Dickmeiss [Mon, 17 Sep 2007 19:18:27 +0000 (19:18 +0000)]
Make an MARC conversion trie for each characterSet section rather than
codeTable. This is ensure we can deal with G0/G1 sequences. Bug #1464.
Adam Dickmeiss [Mon, 17 Sep 2007 19:15:22 +0000 (19:15 +0000)]
Log iconv tests that fail
Adam Dickmeiss [Mon, 17 Sep 2007 19:14:26 +0000 (19:14 +0000)]
Added wrbuf_verbose_str.
Adam Dickmeiss [Fri, 14 Sep 2007 21:39:36 +0000 (21:39 +0000)]
Use updated codetables.xml as provided by Larry Dixson (bug #1464).
Adam Dickmeiss [Fri, 14 Sep 2007 20:55:57 +0000 (20:55 +0000)]
Bump version to 3.0.13
Adam Dickmeiss [Wed, 12 Sep 2007 06:49:37 +0000 (06:49 +0000)]
Towards 3.0.12
Adam Dickmeiss [Tue, 11 Sep 2007 16:41:37 +0000 (16:41 +0000)]
More news
Adam Dickmeiss [Tue, 11 Sep 2007 16:36:51 +0000 (16:36 +0000)]
ZOOM C announces SRU version 1.2.
Adam Dickmeiss [Tue, 11 Sep 2007 15:42:43 +0000 (15:42 +0000)]
More news
Adam Dickmeiss [Tue, 11 Sep 2007 15:16:20 +0000 (15:16 +0000)]
Handle NULL SRU addinfo in printing
Adam Dickmeiss [Tue, 11 Sep 2007 08:40:28 +0000 (08:40 +0000)]
Fixed problem with record cache and schema/diagnostics not being
set at all times.
Adam Dickmeiss [Tue, 11 Sep 2007 08:37:53 +0000 (08:37 +0000)]
Using odr_strdup_null a couple of places
Adam Dickmeiss [Tue, 11 Sep 2007 08:35:42 +0000 (08:35 +0000)]
Added odr_strdup_null which calls nmem_strdup_null. This function accepts
a NULL ptr as source in which the result is a NULL ptr too.
Adam Dickmeiss [Sun, 9 Sep 2007 05:54:45 +0000 (05:54 +0000)]
Handle surrogate diagnostics for SRU
Adam Dickmeiss [Sat, 8 Sep 2007 06:17:45 +0000 (06:17 +0000)]
Function ZOOM_record_get allows type_spec=schema in which case schema
for record is returned.
Mike Taylor [Fri, 7 Sep 2007 17:42:12 +0000 (17:42 +0000)]
Rolling
Mike Taylor [Fri, 7 Sep 2007 17:41:47 +0000 (17:41 +0000)]
yaz_diag_srw_to_bib1 now looks up SRW diagnostic codes first in a new
explicit reverse-mapping table, falling back to reverse-lookup in the
old table only if this does not succeed.
Mike Taylor [Thu, 6 Sep 2007 17:11:59 +0000 (17:11 +0000)]
Add z_HTTP_header_add_basic_auth()
Uses new static yaz_base64encode()
Mike Taylor [Thu, 6 Sep 2007 17:10:35 +0000 (17:10 +0000)]
Add HTTP Basic authentication to SRU/GET, SRU/POST and SRW requests.
Mike Taylor [Thu, 6 Sep 2007 17:09:56 +0000 (17:09 +0000)]
Add z_HTTP_header_add_basic_auth()
Mike Taylor [Thu, 6 Sep 2007 16:38:38 +0000 (16:38 +0000)]
HTTP Basic clienty authentication.
Mike Taylor [Thu, 6 Sep 2007 12:40:53 +0000 (12:40 +0000)]
Set SRU request structure's username/password from connection options.
Adam Dickmeiss [Fri, 31 Aug 2007 21:23:45 +0000 (21:23 +0000)]
Option sru_version holds negotiated SRU/SRW version.
Adam Dickmeiss [Fri, 24 Aug 2007 13:26:03 +0000 (13:26 +0000)]
yaz_srw_get returns 1.1 PDU (as in earlier versions)
Adam Dickmeiss [Fri, 24 Aug 2007 13:25:13 +0000 (13:25 +0000)]
yaz-client command 'sru' may set both SRU method (SOAP/GET/POST) and
version.
Adam Dickmeiss [Thu, 23 Aug 2007 14:28:07 +0000 (14:28 +0000)]
Fixed typo
Adam Dickmeiss [Thu, 23 Aug 2007 14:23:23 +0000 (14:23 +0000)]
Added new ZOOM connection option "sru_version" which specifies SRU
version - say 1.1 or 1.2. Fixed YAZ server's SRU GET/POST decoding
WRT version checking. Anything but 1.1 was rejected and this turned out
to be a mistake. For the GFS, anything but version 1.1 is negotiated to
1.2, except 1.1 which is negotiated to 1.1.
Adam Dickmeiss [Thu, 23 Aug 2007 12:24:38 +0000 (12:24 +0000)]
For SRU sarchRetrieve responses option resultSetId is set for result set
Adam Dickmeiss [Wed, 22 Aug 2007 09:09:03 +0000 (09:09 +0000)]
Version 3.0.10
Adam Dickmeiss [Wed, 22 Aug 2007 08:58:16 +0000 (08:58 +0000)]
Updated RPM spec file with some enhancements from Martin Armbrecht.
Adam Dickmeiss [Wed, 22 Aug 2007 08:11:28 +0000 (08:11 +0000)]
Revert last change regarding display_term. There still exists systems
with old behavior.
Adam Dickmeiss [Tue, 21 Aug 2007 13:20:51 +0000 (13:20 +0000)]
Always honor display_term being set by backend handler. There is no
need to be pre YAZ 2.0 backwards compatible.
Adam Dickmeiss [Tue, 21 Aug 2007 13:19:44 +0000 (13:19 +0000)]
Fixed scan continuation line not being set for displayTerm being ste
Adam Dickmeiss [Thu, 16 Aug 2007 10:09:36 +0000 (10:09 +0000)]
Added support for SRU scan.
Mike Taylor [Wed, 15 Aug 2007 17:53:11 +0000 (17:53 +0000)]
COmment.
Adam Dickmeiss [Tue, 14 Aug 2007 12:22:51 +0000 (12:22 +0000)]
For XML ILL Request, YAZ attaches a dummy XML record as the ItemRequest
EXTERNAL in the targetPart structure.
Adam Dickmeiss [Tue, 14 Aug 2007 12:21:14 +0000 (12:21 +0000)]
For XML ILL Response, YAZ prints Octet aligned buf (presumably XML)
if that is the ItemRequest EXTERNAL in the targetPart structure.
Mike Taylor [Mon, 13 Aug 2007 16:58:46 +0000 (16:58 +0000)]
Document ccl2rpn setting.
Mike Taylor [Mon, 13 Aug 2007 16:58:27 +0000 (16:58 +0000)]
Add entity reference-tools-ccl-qualifiers
Better section title.
Mike Taylor [Mon, 13 Aug 2007 16:58:02 +0000 (16:58 +0000)]
Better section title.
Mike Taylor [Mon, 13 Aug 2007 16:57:40 +0000 (16:57 +0000)]
Add entity reference-tools-ccl-qualifiers
Mike Taylor [Mon, 13 Aug 2007 16:46:47 +0000 (16:46 +0000)]
Add code to compile CCL qualifiers if specified using <ccl2rpn>.
Mike Taylor [Mon, 13 Aug 2007 16:46:19 +0000 (16:46 +0000)]
Add ccl_transform element to struct gfs_server.
Mike Taylor [Mon, 13 Aug 2007 16:43:59 +0000 (16:43 +0000)]
If a CCL qualifier mapping file has been specified, then incoming CCL
queries are now transformed into Type-1 before being being handed off
to the back-end callback. This is supported for Z39.50 searches, but
not for Scan, nor for SRU (where it's impossible to send CCL anyway).
Mike Taylor [Mon, 13 Aug 2007 16:41:59 +0000 (16:41 +0000)]
Describe CCL support in GFS.