4974 commits

Author SHA1 Message Date
markd
e16360e8b4 Update to Sphinx-0.6.6
many bug fixes.
2010-06-01 21:22:10 +00:00
obache
513ba113c1 Update py-enchant to 1.6.2.
Based on PR#43384 by Wen Heping.

    * Migrated from SWIG to ctypes since 1.5.0:
        * Add dependency on devel/py-ctypes for python2.4
        * Fix paths for libenchant so that exactly the suitable one is loaded.
        * No need for a C compiler or buildlink with textproc/enchant.

Changes:
1.6.2:

    * Upgraded bundled enchant to v1.6.0.
    * Fixed bug in printf() utility function; all input args are now converted
      to strings before printing.

1.6.1:

    * Fixed loading of enchant DLL on win32 without pkg_resources installed.
    * Fixed HTMLChunker to handle unescaped < and > characters that are
      clearly not part of a tag.

1.6.0:

    * Upgraded to enchant v1.5.0:
        * new Broker methods get_param() and set_param() allow
          runtime customisation of provider data
    * Added the concept of 'chunkers' to enchant.tokenize.get_tokenizer().
      These serve to split the text into large chunks of checkable tokens.
        * implemented a simple HTMLChunker class
    * Moved error classes into 'enchant.errors' for easier importing
    * Moved testcases into separate files so they're not loaded by default
    * Allowed SpellChecker to use default language if none is specified
    * Improved compatibility with Python 3

1.5.3:

    * Fixed termination conditions in English tokenization loop.
    * Improved unicode detection in English tokenizer.
    * Made enchant spellcheck all of its docstrings as part of the
      unittest suite.

1.5.2:

    * Modify utils.get_resource_filename and utils.win32_data_files for
      compatibility with py2exe (which was broken in the move to ctypes).
      Thanks to Stephen George for the fix.

1.5.1:

    * SpellChecker.add_to_personal renamed to SpellChecker.add and fixed
      to use the corresponding Dict method.

1.5.0:

    * Migrated from SWIG to ctypes:
        * now runs under PyPy!
        * also opens possibilities for Jython, IronPython, ...
    * Compatibility updates for Python 3.0, mostly around unicode strings
    * Dropped compatibility with Python 2.2
2010-06-01 05:28:11 +00:00
joerg
af55fed87a Fix build 2010-05-28 16:18:23 +00:00
wiz
22b5a55d2f Prefer same section order as mdocml. Bump version. 2010-05-26 08:31:59 +00:00
seb
3869510e04 Update p5-Pod-Simple from version 3.13 to version 3.14.
Upstream changes:
2009-04-27
	* Release 3.14

	Removed explicit loading of UNIVERSAL. RJBS.

	Reversed the change applied in release 3.09 to fix RT #12239. POD
	tag found inside a complex POD tag (e.g., "C<<< I<foo> >>>") is
	again parsed as a tag embedded in a tag instead of text and
	entities. The previous interpretation of `perldoc perlpod` was
	mistaken. (RT #55602 from Christopher J. Madsen).
2010-05-24 21:33:45 +00:00
taca
0854314141 It seems that something is wrong with distinfo, DIST_SUBR directory name. 2010-05-20 00:45:43 +00:00
obache
b07455b637 + cabocha, p5-cabocha, py-cabocha, ruby-cabocha 2010-05-19 12:42:08 +00:00
obache
6b2d7b4774 Import ruby-cabocha-0.53 as textproc/ruby-cabocha.
CaboCha is a Japanese dependency analyzer based on Support Vector
Machines. As of June 2001, it is the most accurate statistical Japanese
dependency analyzer (89.29%). Moreover, it adopts a deterministic analysis
algorithm (Cascaded Chunking Model) that does no backtracking, so analysis
is comparatively efficient.

This package is the Ruby module for CaboCha.
2010-05-19 12:39:53 +00:00
obache
550241e405 Import py-cabocha-0.53 as textproc/py-cabocha.
CaboCha is a Japanese dependency analyzer based on Support Vector
Machines. As of June 2001, it is the most accurate statistical Japanese
dependency analyzer (89.29%). Moreover, it adopts a deterministic analysis
algorithm (Cascaded Chunking Model) that does no backtracking, so analysis
is comparatively efficient.

This package is the Python module for CaboCha.
2010-05-19 12:37:04 +00:00
obache
1f5dda473d Import p5-cabocha-0.53 as textproc/p5-cabocha.
CaboCha is a Japanese dependency analyzer based on Support Vector
Machines. As of June 2001, it is the most accurate statistical Japanese
dependency analyzer (89.29%). Moreover, it adopts a deterministic analysis
algorithm (Cascaded Chunking Model) that does no backtracking, so analysis
is comparatively efficient.

This package is the Perl module for CaboCha.
2010-05-19 12:35:26 +00:00
obache
c6e994dd24 Import cabocha-0.53 as textproc/cabocha.
CaboCha is a Japanese dependency analyzer based on Support Vector
Machines. As of June 2001, it is the most accurate statistical Japanese
dependency analyzer (89.29%). Moreover, it adopts a deterministic analysis
algorithm (Cascaded Chunking Model) that does no backtracking, so analysis
is comparatively efficient.
2010-05-19 12:16:25 +00:00
obache
a35c66f849 + yamcha, p5-yamcha, py-yamcha, ruby-yamcha 2010-05-19 12:12:55 +00:00
obache
e6d12e9fc8 Import ruby18-yamcha-0.33 as textproc/ruby-yamcha.
YamCha is a generic, customizable, open source text chunker aimed at
many NLP tasks, such as POS tagging, Named Entity Recognition, base NP
chunking, and Text Chunking. YamCha uses a state-of-the-art machine learning
algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in
1995.

This package is the Ruby module for YamCha.
2010-05-19 12:10:51 +00:00
obache
9c1d8d0ff8 Import py26-yamcha-0.33 as textproc/py-yamcha.
YamCha is a generic, customizable, open source text chunker aimed at
many NLP tasks, such as POS tagging, Named Entity Recognition, base NP
chunking, and Text Chunking. YamCha uses a state-of-the-art machine learning
algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in
1995.

This package is the Python module for YamCha.
2010-05-19 12:10:10 +00:00
obache
08a5e39342 Import p5-yamcha-0.33 as textproc/p5-yamcha.
YamCha is a generic, customizable, open source text chunker aimed at
many NLP tasks, such as POS tagging, Named Entity Recognition, base NP
chunking, and Text Chunking. YamCha uses a state-of-the-art machine learning
algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in
1995.

This package is the Perl module for YamCha.
2010-05-19 12:09:13 +00:00
obache
b281a7f515 Import yamcha-0.33 as textproc/yamcha.
YamCha is a generic, customizable, open source text chunker aimed at
many NLP tasks, such as POS tagging, Named Entity Recognition, base NP
chunking, and Text Chunking. YamCha uses a state-of-the-art machine learning
algorithm called Support Vector Machines (SVMs), first introduced by Vapnik in
1995.

YamCha is exactly the same system which performed the best in the CoNLL2000
Shared Task, Chunking and BaseNP Chunking task.
2010-05-19 12:08:02 +00:00
morr
c8aae00e74 Add uni2ascii 2010-05-17 22:30:27 +00:00
morr
973ade2233 uni2ascii and ascii2uni convert between UTF-8 Unicode and any of a
variety of 7-bit ASCII equivalents including: hexadecimal and decimal
HTML numeric character references, \u-escapes, standard hexadecimal
and raw hexadecimal.

Packaged by bartosz.kuzma, import from pkgsrc-wip.
2010-05-17 22:27:26 +00:00
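The kinds of conversion described in the uni2ascii entry above can be illustrated with a minimal Python sketch. It is not the tool's implementation, and the exact output formats of uni2ascii are not reproduced here; the function names `to_html_decimal`, `from_html_decimal`, and `to_u_escape` are hypothetical and chosen only for this example.

```python
import re

def to_html_decimal(text: str) -> str:
    # Replace every non-ASCII character with a decimal HTML numeric
    # character reference such as &#233; and leave ASCII untouched.
    return "".join(ch if ord(ch) < 128 else f"&#{ord(ch)};" for ch in text)

# Matches decimal numeric character references only (hypothetical helper).
_HTML_DECIMAL = re.compile(r"&#(\d+);")

def from_html_decimal(text: str) -> str:
    # Reverse direction: turn &#NNN; back into the character it names.
    return _HTML_DECIMAL.sub(lambda m: chr(int(m.group(1))), text)

def to_u_escape(text: str) -> str:
    # Replace non-ASCII characters with a backslash-u escape; code points
    # above U+FFFF get the eight-digit \U form (an assumption of this sketch).
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 128:
            out.append(ch)
        elif cp <= 0xFFFF:
            out.append(f"\\u{cp:04X}")
        else:
            out.append(f"\\U{cp:08X}")
    return "".join(out)
```

For example, `to_html_decimal("café")` yields `caf&#233;`, and `from_html_decimal` restores the original string.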
wiz
d130e31584 Bump PKGREVISION for config file path fix in a2x. 2010-05-17 15:31:56 +00:00
gls
d132431fe7 Fix a2x CONF_DIR.
Thanks wiz!
2010-05-16 17:16:27 +00:00
obache
0b30927370 LICENSE=ruby-license 2010-05-16 12:19:11 +00:00
obache
13f72e144c Update ruby-json-pure to 1.4.3.
2010-05-05 (1.4.3)
  * Fixed some test assertions, from Ruby r27587 and r27590, patch by nobu.
  * Fixed issue http://github.com/flori/json/issues/#issue/20 reported by
    electronicwhisper@github. Thx!
2010-05-16 11:50:57 +00:00
obache
17ed76bf3b Update ruby-json to 1.4.3.
2010-05-05 (1.4.3)
  * Fixed some test assertions, from Ruby r27587 and r27590, patch by nobu.
  * Fixed issue http://github.com/flori/json/issues/#issue/20 reported by
    electronicwhisper@github. Thx!
2010-05-16 09:12:30 +00:00
obache
01596ab8a2 Note commented out LICENSE.
mainly ICOT Free Software License.
2010-05-16 01:11:46 +00:00
obache
6955a76f03 LICENSE=gnu-lgpl-v2.1 2010-05-16 00:37:37 +00:00
obache
f55db0556e LICENSE=modified-bsd 2010-05-16 00:25:26 +00:00
obache
c928556e5d LICENSE=gnu-gpl-v2 2010-05-15 23:57:42 +00:00
obache
9471200bf6 Add comment about "used by suikyo-docs". 2010-05-15 23:56:00 +00:00
gls
373aa54ae9 Updated to 8.5.3.
Patches no longer required.
2010-05-15 20:26:02 +00:00
gls
d21c510f39 Updated to 8.5.3.
Changelog way too long to mention here.
Refer to http://www.methods.co.nz/asciidoc/CHANGELOG.html for complete
list of changes.
2010-05-15 20:23:38 +00:00
obache
5e0945f681 Note commented out LICENSE (Ruby's). 2010-05-15 13:20:13 +00:00
obache
d95d30bcb4 Note commented out LICENSE (Ruby's). 2010-05-15 13:09:40 +00:00
wiz
e9fde33254 + gutcheck. 2010-05-08 20:47:31 +00:00
wiz
73a4738cbb Initial import of gutcheck-0.991:
Gutcheck is a plain-text checking program that specializes in
reporting the problems that spellcheckers don't--errors like
mismatched quotes, misplaced punctuation, unintended blank lines.
It is specifically tuned for checking texts for submission to
Project Gutenberg, though it can be useful elsewhere as well.
2010-05-08 20:46:36 +00:00
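As a rough illustration of the kind of problem the gutcheck entry above mentions, the following Python sketch flags two of them: a space before punctuation, and an odd number of double quotes within a paragraph. It is an assumption-laden toy, not gutcheck's actual logic; `check_text` and the rule wording are hypothetical.

```python
import re

# Whitespace directly before a punctuation mark that ends a word.
SPACE_BEFORE_PUNCT = re.compile(r"\s[,.;:!?](?=\s|$)")

def check_text(text: str) -> list:
    # Returns (line number, message) pairs; paragraphs are separated
    # by blank lines, and quotes are counted per paragraph.
    problems = []
    quotes = 0
    para_start = None
    # The extra empty line closes the final paragraph.
    for lineno, line in enumerate(text.split("\n") + [""], start=1):
        if line.strip():
            if para_start is None:
                para_start = lineno
            quotes += line.count('"')
            if SPACE_BEFORE_PUNCT.search(line):
                problems.append((lineno, "space before punctuation"))
        else:
            if para_start is not None and quotes % 2:
                problems.append((para_start, "unbalanced double quotes"))
            quotes = 0
            para_start = None
    return problems
```

Unbalanced quotes are reported against the first line of the paragraph, since the sketch cannot tell which quote is the stray one.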
drochner
046e303957 add a patch originating from eric4 (the IDE) which supposedly fixes
a problem with foreign characters
bump PKGREVISION
2010-05-08 15:28:02 +00:00
ahoka
d7fc9709e7 Drop maintainership on these packages, I am no longer interested. 2010-05-07 08:34:22 +00:00
sbd
ed23d65120 Work around the inconsistent behavior of "cp -r" when copying symbolic links.
Closes PR 41142.
2010-05-06 09:36:28 +00:00
seb
4a97de57d6 Update p5-YAML-LibYAML from version 0.32 to version 0.33.
Pkgsrc changes:
- this package's minimum required Perl version (PERL5_REQD) can be
dropped as pkgsrc requires a newer version anyway
- placate pkglint: whitespace fix

Upstream changes:
version: 0.33
date:    Wed Apr 14 17:57:10 PDT 2010
changes:
- Applied Gisle Aas' patch from https://rt.cpan.org/Ticket/Display.html?id=43115
2010-05-05 21:56:11 +00:00
minskim
f1a358653a Update tex-xmltex{,-doc} to Revision 17012.
- Fixed the options to build a format.
2010-05-04 23:51:22 +00:00
taca
bd446eb478 Update ruby-json-pure from 1.2.0 to 1.4.2.
2010-04-26 (1.4.2)
  * Applied patch from naruse Yui NARUSE <naruse@airemix.com> to make building with
    Microsoft Visual C possible again.
  * Applied patch from devrandom <c1.github@niftybox.net> in order to allow building of
    json_pure if extensiontask is not present.
  * Thanks to Dustin Schneider <dustin@stocktwits.com>, who reported a memory
    leak, which is fixed in this release.
  * Applied 993f261ccb8f911d2ae57e9db48ec7acd0187283 patch from josh@github.
2010-04-25 (1.4.1)
  * Fix for a bug reported by Dan DeLeo <dan@kallistec.com>, caused by T_FIXNUM
    being different on 32bit/64bit architectures.
2010-04-23 (1.4.0)
  * Major speed improvements and building with simplified
    directory/file-structure.
  * Extension should at least be compatible with MRI, YARV and Rubinius.
2010-04-07 (1.2.4)
  * Trigger const_missing callback to make Rails' dynamic class loading work.
2010-03-11 (1.2.3)
  * Added a State#[] method which returns an attribute's value in order to
    increase duck type compatibility to Hash.
2010-02-27 (1.2.2)
  * Made some changes to make the building of the parser/generator compatible
    to Rubinius.
2009-11-25 (1.2.1)
  * Added :symbolize_names option to Parser, which returns symbols instead of
    strings in object names/keys.
2010-05-04 02:48:21 +00:00
taca
1711260fae Update ruby-json package from 1.2.0 to 1.4.2.
2010-04-26 (1.4.2)
  * Applied patch from naruse Yui NARUSE <naruse@airemix.com> to make building with
    Microsoft Visual C possible again.
  * Applied patch from devrandom <c1.github@niftybox.net> in order to allow building of
    json_pure if extensiontask is not present.
  * Thanks to Dustin Schneider <dustin@stocktwits.com>, who reported a memory
    leak, which is fixed in this release.
  * Applied 993f261ccb8f911d2ae57e9db48ec7acd0187283 patch from josh@github.
2010-04-25 (1.4.1)
  * Fix for a bug reported by Dan DeLeo <dan@kallistec.com>, caused by T_FIXNUM
    being different on 32bit/64bit architectures.
2010-04-23 (1.4.0)
  * Major speed improvements and building with simplified
    directory/file-structure.
  * Extension should at least be compatible with MRI, YARV and Rubinius.
2010-04-07 (1.2.4)
  * Trigger const_missing callback to make Rails' dynamic class loading work.
2010-03-11 (1.2.3)
  * Added a State#[] method which returns an attribute's value in order to
    increase duck type compatibility to Hash.
2010-02-27 (1.2.2)
  * Made some changes to make the building of the parser/generator compatible
    to Rubinius.
2009-11-25 (1.2.1)
  * Added :symbolize_names option to Parser, which returns symbols instead of
    strings in object names/keys.
2010-05-04 02:46:50 +00:00
wiz
a790a2adc7 Update to 2.2.6:
2.2.6 (2010-03-02)
==================

Bugs fixed
----------

* Fixed several Python 3 regressions by building with Cython 0.11.3.


2.2.5 (2010-02-28)
==================

Features added
--------------

* Support for running XSLT extension elements on the input root node
  (e.g. in a template matching on "/").

Bugs fixed
----------

* Crash in XPath evaluation when reading smart strings from a document
  other than the original context document.

* Support recent versions of html5lib by not requiring its
  ``XHTMLParser`` in ``htmlparser.py`` anymore.

* Manually instantiating the custom element classes in
  ``lxml.objectify`` could crash.

* Invalid XML text characters were not rejected by the API when they
  appeared in unicode strings directly after non-ASCII characters.

* lxml.html.open_http_urllib() did not work in Python 3.

* The functions ``strip_tags()`` and ``strip_elements()`` in
  ``lxml.etree`` did not remove all occurrences of a tag in all cases.

* Crash in XSLT extension elements when the XSLT context node is not
  an element.
2010-05-02 13:38:23 +00:00
wiz
7ad81a87e4 Update to 4.2.1:
Sed 4.2.1

* fix parsing of s/[[[[[[[[[]//

* security contexts are preserved by -i too under SELinux

* temporary files for sed -i are not made group/world-readable until
  they are complete
2010-05-02 11:09:41 +00:00
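The last item in the sed 4.2.1 entry above (temporary files for in-place editing stay private until complete) describes a general pattern that can be sketched in Python. This is not sed's code; it only assumes a POSIX-like system, and `rewrite_in_place` is a hypothetical name.

```python
import os
import tempfile

def rewrite_in_place(path, transform):
    # Apply transform() to each line of the file at path, replacing it
    # only once the new contents are completely written.
    directory = os.path.dirname(os.path.abspath(path))
    original_mode = os.stat(path).st_mode & 0o777
    # mkstemp creates the file readable and writable by the owner only,
    # so partial output is never visible to group or others.
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "w") as tmp, open(path) as src:
            for line in src:
                tmp.write(transform(line))
        # Widen permissions only after the file is complete.
        os.chmod(tmp_path, original_mode)
        # Replace the original in one step.
        os.replace(tmp_path, path)
    except BaseException:
        os.unlink(tmp_path)
        raise
```

Creating the temporary file in the same directory as the target keeps the final `os.replace` on one filesystem.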
wiz
45a59344c7 Update to 1.0.5: 1.5 years of bug fixes. 2010-05-02 11:01:41 +00:00
wiz
614f2a889b Update to 1.6.0:
    * Fix bug 12567: the ispell sources aren't licensed under the LGPL
    * Add a function to get enchant's version (enchant_get_version)
    * Disable zemberek plugin by default, as it's known to cause issues/crashes with WebKit
    * Fix bug 12472: Win32 DLL dependency not found popup occurs when module has unmet dependencies
    * Possibly fix Ubuntu bug 474062
    * Fix bug 12409: Registry handle not closed in enchant_get_registry_value_ex
    * Fix bug 12406: Leak in _enchant_get_user_home_dirs() on Windows
    * Fix bug 12007: Update FSF address
    * Fix bug 12305: Zemberek module lists a Turkish dictionary even without Zemberek installed
    * Don't assert if passed a null string list
    * Fix bug 12350: enchant_pwl_init_with_file truncates pwl file
    * Fix a double-free memory corruption bug
    * Fix bug 12173: fix some small memory leaks
    * Fix bug 12174: mis-acceptance of dictionaries which start with a partial match of the lang id
    * Fix bug 12160: enchant 1.5.0 always looks in "lib" dir for plugins
    * Fix the build with the MSVC compiler
    * Add a --with-system-myspell option
    * Package missing compile-resource file
    * Compare paths ignoring case sensitivity on windows
2010-05-02 10:59:23 +00:00
wiz
6d1ef79356 + py-Whoosh. 2010-05-01 20:45:48 +00:00
wiz
95063ccedf Initial import of py-Whoosh-0.1.13:
Whoosh is a fast, featureful full-text indexing and searching
library implemented in pure Python.

Some of Whoosh's features include:

    * Pythonic API.
    * Pure-Python. No compilation or binary packages needed, no
    mysterious crashes.
    * Fielded indexing and search.
    * Fast indexing and retrieval -- much faster than any other
    pure-Python solution.
    * Pluggable scoring algorithm (including BM25F), text analysis,
    storage, posting format, etc.
    * Powerful query language parsed by pyparsing.
    * Pure Python spell-checker (as far as I know, the only one).

Whoosh takes much inspiration (and sometimes translates code) from
other open-source search engines. The fundamental design is similar
to (but does not entirely work like and is not compatible with)
Lucene, but uses KinoSearch's indexing algorithm, some scoring
algorithms from Terrier, and the English morphological variation
generator from Minion.
2010-05-01 20:45:05 +00:00
wiz
b9feef60cf + dadadodo. 2010-04-30 10:05:23 +00:00
wiz
993ed1b00f Initial import of dadadodo-1.04:
dadadodo analyses text files and generates Markov chains of word
frequencies; it can then generate random sentences based on that
data.
2010-04-30 10:04:56 +00:00
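The idea in the dadadodo entry above, building a Markov chain from word sequences and generating random text from it, can be sketched in a few lines of Python. This is a minimal first-order word chain written for illustration, not dadadodo's algorithm; `build_chain` and `generate` are hypothetical names.

```python
import random
from collections import defaultdict

def build_chain(text):
    # Map each word to the list of words seen directly after it.
    # Repeated followers stay in the list, so frequent pairs are
    # proportionally more likely to be chosen later.
    chain = defaultdict(list)
    words = text.split()
    for current, following in zip(words, words[1:]):
        chain[current].append(following)
    return chain

def generate(chain, start, max_words=12, rng=None):
    # Walk the chain from start, picking a random observed follower
    # at each step, until max_words or a word with no followers.
    rng = rng or random.Random()
    word = start
    out = [word]
    while len(out) < max_words and chain.get(word):
        word = rng.choice(chain[word])
        out.append(word)
    return " ".join(out)
```

Passing a seeded `random.Random` as `rng` makes the output repeatable, which helps when comparing runs on the same input.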
obache
c61dccb749 Add patch-aj to include resolv.h conditionally.
https://bugzilla.gnome.org/show_bug.cgi?id=617053
Fixes build on Interix-6.0.
2010-04-28 08:54:30 +00:00