pkgsrc/textproc/xapian/Makefile

25 lines
576 B
Makefile
Raw Normal View History

Update to 1.2.8. From the changelog: 1.2.8: API: * Add support to TermGenerator and QueryParser for indexing and searching CJK text using n-grams. Currently this is only enabled when the environmental variable XAPIAN_CJK_NGRAM is set to a non-empty value. portability: + Some fixes for warnings when cross-compiling to mingw. * tests/soaktest/soaktest.cc: With Sun's compiler, random() and srandom() aren't in <cstdlib> so we need to use <stdlib.h> instead. 1.2.7: API: * Document objects now track whether any document positions have been modified so that replacing a modified document can completely skip considering updating positions if none have changed. Currently the flint, chert, and brass backends implement this optimisation. A common case this speeds up is adding and/or removing boolean filter terms to/from existing documents - for example this gives an 18% speedup for adding tags in notmuch. portability: * Fix -Wshadow warnings from GCC 4.6. * Fix warning from GCC 3.3. 1.2.6: API: * QueryParser: + Add new set_max_wildcard_expansion() method to allow limiting the number of terms a wildcard can expand to. (ticket#350) + If default_op is OP_NEAR or OP_PHRASE then disable stemming of the terms, since we don't index positional information for stemmed terms by default. * Spelling correction was failing to correctly handle words which had the same trigram in an even number of times. portability: * Fix to build for mingw. 1.2.5: API: * Enquire::get_eset() now accepts a min_wt argument to allow the minimum wanted weight to be specified. Default is 0, which gives the previous behaviour. * QueryParser: Handle NEAR/<offset> and ADJ/<offset> where offset isn't an integer the same way at the end of the query as in the middle. * Replication: + Only keep $XAPIAN_MAX_CHANGESETS changeset files when generating a new one (previously this variable only controlled if we generated changesets or not). Closes ticket#278. + $XAPIAN_MAX_CHANGESETS is reread each time, rather than only when the database is opened. + If you build Xapian with DANGEROUS mode enabled, changeset files now actually have the appropriate flag set (the reader will currently throw an exception, but that's better than quietly handling them incorrectly). portability: * api/compactor.cc: Add missing header <ctime> for time() (ticket#530). * api/compactor.cc: Use msvc_posix_rename() under __WIN32__ to atomically update stub file after compaction (ticket#525). * Fix uninitialised variable warnings with gcc -O3. * Eliminate std::string member of global static object used when compiled with --enable-log which was causes problems on Mac OS X. * Fix some issues highlighted by clang++ warnings. 1.2.4: API: * QueryParser: + Avoid a double free if Query construction throws an exception in a particular case. Fixes ticket#515. + Allow phrase generators between a probabilistic prefix and the term itself (e.g. path:/usr/local). + The correct window size wasn't being set in some cases when default_op was set to OP_PHRASE. * Enquire::get_mset(): + Avoid pointlessly trying to allocate lots of memory if the first document requested is larger than the size of the database. + An empty query now returns an MSet with firstitem set correctly - previously firstitem was always 0 in this case. * Document: Initialise docid to 0 when creating a document from scratch, as documented. * Compactor: + Move the database compaction and merging functionality into this new class, and make xapian-compact a simple wrapper around this class. (ticket#175) + Inputs can now be stub database directories or files, in which case the databases in the stub are used as inputs. + Add support for compacting to a stub database, which can be one of the inputs (for atomic update). + If spellings and/or synonyms were only present in some source databases, they weren't copied to the output database, but now they are. portability: * configure: Add support for --enable-sse=sse and --enable-sse=sse2 to allow control of which SSE instructions to use. * configure: Enable use of SSE maths on x86 by default with Sun's compiler. * configure: Beef up the test for whether -lm is required and add a special case to force it to be for Sun's C++ compiler - there's some interaction with libtool and/or shared objects which means that the previous configure test didn't think -lm is needed here when it is. * Fix to build on OpenBSD 4.5 with GCC 3.3.5. * Need to avoid excess precision on m68k when targeting models 68010, 68020, 68030 as well as 68000. * Fix compilation with Sun's C++ compiler. * Fix testsuite to build on Solaris < 10. 1.2.3: API: * Database::get_spelling_suggestion() will now suggest a correction even if the passed word is in the dictionary, provided the correction has at least the same frequency. Partly addresses #225. * QueryParser: + Fix handling of groups of terms which are all stopwords - in situations where this causes a problem we now disable stopword checks for such groups. (ticket#245) + Fix to be smarter about handling a boolean filter term containing ".." in the presence of valuerangeprocessors. portability: * configure: Don't pass -mtune=generic unless GCC >= 4.2 is in use (ticket#492). * Fix handling of some obscure cases of resolving relative paths on Microsoft Windows. (ticket#243). * Optimise closing of all unwanted file descriptors after forking by using closefrom() if available, and otherwise providing our own implementation (optimised to some extent for many platforms). * Fix test harness to build under Microsoft Windows (ticket#495). 1.2.2: portability: * Revert 1.2.1 change to visibility of Xapian::Weight's copy constructor as it making it private broke compilation with GCC 4.1 (which seems to be a bug in this compiler version). * tests/harness/testsuite.cc: Need <cstdio> for sprintf(). Fixes compilation error which was masked if valgrind was installed. (ticket#489) pkgsrc changes: * Remove options (the "quartz" backend was unrelated to Darwin and no longer exists). * Unconditionally buildlink libuuid. If that's overzealous, improve its builtin detection.
2012-01-10 02:03:28 +01:00
# $NetBSD: Makefile,v 1.14 2012/01/10 01:03:28 schmonz Exp $
#
Update to 1.2.8. From the changelog: 1.2.8: API: * Add support to TermGenerator and QueryParser for indexing and searching CJK text using n-grams. Currently this is only enabled when the environmental variable XAPIAN_CJK_NGRAM is set to a non-empty value. portability: + Some fixes for warnings when cross-compiling to mingw. * tests/soaktest/soaktest.cc: With Sun's compiler, random() and srandom() aren't in <cstdlib> so we need to use <stdlib.h> instead. 1.2.7: API: * Document objects now track whether any document positions have been modified so that replacing a modified document can completely skip considering updating positions if none have changed. Currently the flint, chert, and brass backends implement this optimisation. A common case this speeds up is adding and/or removing boolean filter terms to/from existing documents - for example this gives an 18% speedup for adding tags in notmuch. portability: * Fix -Wshadow warnings from GCC 4.6. * Fix warning from GCC 3.3. 1.2.6: API: * QueryParser: + Add new set_max_wildcard_expansion() method to allow limiting the number of terms a wildcard can expand to. (ticket#350) + If default_op is OP_NEAR or OP_PHRASE then disable stemming of the terms, since we don't index positional information for stemmed terms by default. * Spelling correction was failing to correctly handle words which had the same trigram in an even number of times. portability: * Fix to build for mingw. 1.2.5: API: * Enquire::get_eset() now accepts a min_wt argument to allow the minimum wanted weight to be specified. Default is 0, which gives the previous behaviour. * QueryParser: Handle NEAR/<offset> and ADJ/<offset> where offset isn't an integer the same way at the end of the query as in the middle. * Replication: + Only keep $XAPIAN_MAX_CHANGESETS changeset files when generating a new one (previously this variable only controlled if we generated changesets or not). Closes ticket#278. + $XAPIAN_MAX_CHANGESETS is reread each time, rather than only when the database is opened. + If you build Xapian with DANGEROUS mode enabled, changeset files now actually have the appropriate flag set (the reader will currently throw an exception, but that's better than quietly handling them incorrectly). portability: * api/compactor.cc: Add missing header <ctime> for time() (ticket#530). * api/compactor.cc: Use msvc_posix_rename() under __WIN32__ to atomically update stub file after compaction (ticket#525). * Fix uninitialised variable warnings with gcc -O3. * Eliminate std::string member of global static object used when compiled with --enable-log which was causes problems on Mac OS X. * Fix some issues highlighted by clang++ warnings. 1.2.4: API: * QueryParser: + Avoid a double free if Query construction throws an exception in a particular case. Fixes ticket#515. + Allow phrase generators between a probabilistic prefix and the term itself (e.g. path:/usr/local). + The correct window size wasn't being set in some cases when default_op was set to OP_PHRASE. * Enquire::get_mset(): + Avoid pointlessly trying to allocate lots of memory if the first document requested is larger than the size of the database. + An empty query now returns an MSet with firstitem set correctly - previously firstitem was always 0 in this case. * Document: Initialise docid to 0 when creating a document from scratch, as documented. * Compactor: + Move the database compaction and merging functionality into this new class, and make xapian-compact a simple wrapper around this class. (ticket#175) + Inputs can now be stub database directories or files, in which case the databases in the stub are used as inputs. + Add support for compacting to a stub database, which can be one of the inputs (for atomic update). + If spellings and/or synonyms were only present in some source databases, they weren't copied to the output database, but now they are. portability: * configure: Add support for --enable-sse=sse and --enable-sse=sse2 to allow control of which SSE instructions to use. * configure: Enable use of SSE maths on x86 by default with Sun's compiler. * configure: Beef up the test for whether -lm is required and add a special case to force it to be for Sun's C++ compiler - there's some interaction with libtool and/or shared objects which means that the previous configure test didn't think -lm is needed here when it is. * Fix to build on OpenBSD 4.5 with GCC 3.3.5. * Need to avoid excess precision on m68k when targeting models 68010, 68020, 68030 as well as 68000. * Fix compilation with Sun's C++ compiler. * Fix testsuite to build on Solaris < 10. 1.2.3: API: * Database::get_spelling_suggestion() will now suggest a correction even if the passed word is in the dictionary, provided the correction has at least the same frequency. Partly addresses #225. * QueryParser: + Fix handling of groups of terms which are all stopwords - in situations where this causes a problem we now disable stopword checks for such groups. (ticket#245) + Fix to be smarter about handling a boolean filter term containing ".." in the presence of valuerangeprocessors. portability: * configure: Don't pass -mtune=generic unless GCC >= 4.2 is in use (ticket#492). * Fix handling of some obscure cases of resolving relative paths on Microsoft Windows. (ticket#243). * Optimise closing of all unwanted file descriptors after forking by using closefrom() if available, and otherwise providing our own implementation (optimised to some extent for many platforms). * Fix test harness to build under Microsoft Windows (ticket#495). 1.2.2: portability: * Revert 1.2.1 change to visibility of Xapian::Weight's copy constructor as it making it private broke compilation with GCC 4.1 (which seems to be a bug in this compiler version). * tests/harness/testsuite.cc: Need <cstdio> for sprintf(). Fixes compilation error which was masked if valgrind was installed. (ticket#489) pkgsrc changes: * Remove options (the "quartz" backend was unrelated to Darwin and no longer exists). * Unconditionally buildlink libuuid. If that's overzealous, improve its builtin detection.
2012-01-10 02:03:28 +01:00
DISTNAME= xapian-core-1.2.8
PKGNAME= ${DISTNAME:S/-core//}
CATEGORIES= textproc
Update to 1.2.8. From the changelog: 1.2.8: API: * Add support to TermGenerator and QueryParser for indexing and searching CJK text using n-grams. Currently this is only enabled when the environmental variable XAPIAN_CJK_NGRAM is set to a non-empty value. portability: + Some fixes for warnings when cross-compiling to mingw. * tests/soaktest/soaktest.cc: With Sun's compiler, random() and srandom() aren't in <cstdlib> so we need to use <stdlib.h> instead. 1.2.7: API: * Document objects now track whether any document positions have been modified so that replacing a modified document can completely skip considering updating positions if none have changed. Currently the flint, chert, and brass backends implement this optimisation. A common case this speeds up is adding and/or removing boolean filter terms to/from existing documents - for example this gives an 18% speedup for adding tags in notmuch. portability: * Fix -Wshadow warnings from GCC 4.6. * Fix warning from GCC 3.3. 1.2.6: API: * QueryParser: + Add new set_max_wildcard_expansion() method to allow limiting the number of terms a wildcard can expand to. (ticket#350) + If default_op is OP_NEAR or OP_PHRASE then disable stemming of the terms, since we don't index positional information for stemmed terms by default. * Spelling correction was failing to correctly handle words which had the same trigram in an even number of times. portability: * Fix to build for mingw. 1.2.5: API: * Enquire::get_eset() now accepts a min_wt argument to allow the minimum wanted weight to be specified. Default is 0, which gives the previous behaviour. * QueryParser: Handle NEAR/<offset> and ADJ/<offset> where offset isn't an integer the same way at the end of the query as in the middle. * Replication: + Only keep $XAPIAN_MAX_CHANGESETS changeset files when generating a new one (previously this variable only controlled if we generated changesets or not). Closes ticket#278. + $XAPIAN_MAX_CHANGESETS is reread each time, rather than only when the database is opened. + If you build Xapian with DANGEROUS mode enabled, changeset files now actually have the appropriate flag set (the reader will currently throw an exception, but that's better than quietly handling them incorrectly). portability: * api/compactor.cc: Add missing header <ctime> for time() (ticket#530). * api/compactor.cc: Use msvc_posix_rename() under __WIN32__ to atomically update stub file after compaction (ticket#525). * Fix uninitialised variable warnings with gcc -O3. * Eliminate std::string member of global static object used when compiled with --enable-log which was causes problems on Mac OS X. * Fix some issues highlighted by clang++ warnings. 1.2.4: API: * QueryParser: + Avoid a double free if Query construction throws an exception in a particular case. Fixes ticket#515. + Allow phrase generators between a probabilistic prefix and the term itself (e.g. path:/usr/local). + The correct window size wasn't being set in some cases when default_op was set to OP_PHRASE. * Enquire::get_mset(): + Avoid pointlessly trying to allocate lots of memory if the first document requested is larger than the size of the database. + An empty query now returns an MSet with firstitem set correctly - previously firstitem was always 0 in this case. * Document: Initialise docid to 0 when creating a document from scratch, as documented. * Compactor: + Move the database compaction and merging functionality into this new class, and make xapian-compact a simple wrapper around this class. (ticket#175) + Inputs can now be stub database directories or files, in which case the databases in the stub are used as inputs. + Add support for compacting to a stub database, which can be one of the inputs (for atomic update). + If spellings and/or synonyms were only present in some source databases, they weren't copied to the output database, but now they are. portability: * configure: Add support for --enable-sse=sse and --enable-sse=sse2 to allow control of which SSE instructions to use. * configure: Enable use of SSE maths on x86 by default with Sun's compiler. * configure: Beef up the test for whether -lm is required and add a special case to force it to be for Sun's C++ compiler - there's some interaction with libtool and/or shared objects which means that the previous configure test didn't think -lm is needed here when it is. * Fix to build on OpenBSD 4.5 with GCC 3.3.5. * Need to avoid excess precision on m68k when targeting models 68010, 68020, 68030 as well as 68000. * Fix compilation with Sun's C++ compiler. * Fix testsuite to build on Solaris < 10. 1.2.3: API: * Database::get_spelling_suggestion() will now suggest a correction even if the passed word is in the dictionary, provided the correction has at least the same frequency. Partly addresses #225. * QueryParser: + Fix handling of groups of terms which are all stopwords - in situations where this causes a problem we now disable stopword checks for such groups. (ticket#245) + Fix to be smarter about handling a boolean filter term containing ".." in the presence of valuerangeprocessors. portability: * configure: Don't pass -mtune=generic unless GCC >= 4.2 is in use (ticket#492). * Fix handling of some obscure cases of resolving relative paths on Microsoft Windows. (ticket#243). * Optimise closing of all unwanted file descriptors after forking by using closefrom() if available, and otherwise providing our own implementation (optimised to some extent for many platforms). * Fix test harness to build under Microsoft Windows (ticket#495). 1.2.2: portability: * Revert 1.2.1 change to visibility of Xapian::Weight's copy constructor as it making it private broke compilation with GCC 4.1 (which seems to be a bug in this compiler version). * tests/harness/testsuite.cc: Need <cstdio> for sprintf(). Fixes compilation error which was masked if valgrind was installed. (ticket#489) pkgsrc changes: * Remove options (the "quartz" backend was unrelated to Darwin and no longer exists). * Unconditionally buildlink libuuid. If that's overzealous, improve its builtin detection.
2012-01-10 02:03:28 +01:00
MASTER_SITES= http://oligarchy.co.uk/xapian/1.2.8/
MAINTAINER= schmonz@NetBSD.org
HOMEPAGE= http://xapian.org/
COMMENT= Search engine based on Probabilistic Information Retrieval
Update to 1.0.12. From the changelog: * WritableDatabase::remove_spelling() now works properly. * The QueryParser now treats NON_SPACING_MARK Unicode characters as phrase generators, which improves handling of Arabic. This is a stop-gap solution for 1.0.x which will work with existing databases without requiring reindexing - in 1.1.0, NON_SPACING_MARK will be regarded as part of a word. (ticket#355) * Fix undefined behaviour in distribution of OP_NEAR and OP_PHRASE over a non-leaf subquery (indentified by valgrind on testcase nearsubqueries1). (ticket#349) * Enhance distribution of OP_NEAR/OP_PHRASE over non-leaf subqueries to work when there are multiple non-leaf subqueries (ticket#201). * Enquire::get_mset() no longer needlessly checks if the documents exist. * PostingIterator::get_description() output improved visually in some cases. * Enquire::get_mset(): + Now throws UnimplementedError if there's a percentage cutoff and sorting is primarily by value - this has never been correctly supported and it's better to warn people than give incorrect results. + No longer needlessly copies the results internally. + When searching multiple databases, now recalculates the maximum attainable weight after each database which may allow it to terminate earlier. (ticket#336). + Fix inconsistent percentage scores when sorting primarily by value, except when a MatchDecider is also being used; document this remaining problem case. (ticket#216) * Enquire::set_sort_by_value() (and similar methods): Rename the wrongly named "ascending" parameter to "reverse", and note that its value should always be explicitly given since defaulting to "reverse=true" is confusing and the default will be deprecated in 1.1.0. (ticket#311) * Database::allterms_begin(): Fix memory leak when iterating all terms from more than one database. * Query::get_terms_begin(): Don't return "" from the TermIterator (happened when the query contained or was Query::MatchAll). * Add QueryParser::FLAG_DEFAULT to make it easier to add flags to those set by default.
2009-04-21 00:25:12 +02:00
LICENSE= gnu-gpl-v2
2009-07-07 23:12:53 +02:00
PKG_DESTDIR_SUPPORT= user-destdir
GNU_CONFIGURE= yes
USE_LIBTOOL= yes
USE_LANGUAGES= c c++
Update to 1.0.12. From the changelog: * WritableDatabase::remove_spelling() now works properly. * The QueryParser now treats NON_SPACING_MARK Unicode characters as phrase generators, which improves handling of Arabic. This is a stop-gap solution for 1.0.x which will work with existing databases without requiring reindexing - in 1.1.0, NON_SPACING_MARK will be regarded as part of a word. (ticket#355) * Fix undefined behaviour in distribution of OP_NEAR and OP_PHRASE over a non-leaf subquery (indentified by valgrind on testcase nearsubqueries1). (ticket#349) * Enhance distribution of OP_NEAR/OP_PHRASE over non-leaf subqueries to work when there are multiple non-leaf subqueries (ticket#201). * Enquire::get_mset() no longer needlessly checks if the documents exist. * PostingIterator::get_description() output improved visually in some cases. * Enquire::get_mset(): + Now throws UnimplementedError if there's a percentage cutoff and sorting is primarily by value - this has never been correctly supported and it's better to warn people than give incorrect results. + No longer needlessly copies the results internally. + When searching multiple databases, now recalculates the maximum attainable weight after each database which may allow it to terminate earlier. (ticket#336). + Fix inconsistent percentage scores when sorting primarily by value, except when a MatchDecider is also being used; document this remaining problem case. (ticket#216) * Enquire::set_sort_by_value() (and similar methods): Rename the wrongly named "ascending" parameter to "reverse", and note that its value should always be explicitly given since defaulting to "reverse=true" is confusing and the default will be deprecated in 1.1.0. (ticket#311) * Database::allterms_begin(): Fix memory leak when iterating all terms from more than one database. * Query::get_terms_begin(): Don't return "" from the TermIterator (happened when the query contained or was Query::MatchAll). * Add QueryParser::FLAG_DEFAULT to make it easier to add flags to those set by default.
2009-04-21 00:25:12 +02:00
TEST_TARGET= check
.include "../../devel/libuuid/buildlink3.mk"
.include "../../devel/zlib/buildlink3.mk"
.include "../../mk/bsd.pkg.mk"