Commit graph

1142 commits

Author SHA1 Message Date
Cheng-Lung Sung
c301a34f24 - Ferret is a high-performance, full-featured text search engine library written for Ruby. 2006-08-10 09:15:42 +00:00
Cheng-Lung Sung
f6858c0da4 - CLucene is a C++ port of Lucene: the high-performance,
full-featured text search engine written in Java.
  CLucene is faster than lucene as it is written in C++.
2006-08-10 01:45:03 +00:00
Cheng-Lung Sung
adcd8a85aa - YamCha is a generic, customizable, and open source
text chunker oriented toward a lot of NLP tasks.
2006-08-03 03:03:18 +00:00
Andrew Pantyukhin
3c07d6686e Add port textproc/p5-Text-Flowed:
This module provides functions that deals with formatting data with
Content-Type 'text/plain; format=flowed' as described in RFC2646
(http://www.rfc-editor.org/rfc/rfc2646.txt). In a nutshell,
format=flowed text solves the problem in plain text files where it
is not known which lines can be considered a logical paragraph,
enabling lines to be automatically flowed (wrapped and/or joined)
as appropriate when displaying.

In format=flowed, a soft newline is expressed as " \n", while hard
newlines are expressed as "\n". Soft newlines can be automatically
deleted or inserted as appropriate when the text is reformatted.

WWW: http://search.cpan.org/dist/Text-Flowed/

Justification:	socialtext dependency
2006-08-02 11:03:38 +00:00
Andrew Pantyukhin
4e80309686 Add port textproc/p5-Plucene-Simple:
This provides a simple interface to Plucene. Plucene is large and multi-
featured, and it expected that users will subclass it, and tie all the
pieces together to suit their own needs. Plucene::Simple is, therefore,
just one way to use Plucene. It's not expected that it will do exactly
what *you* want, but you can always use it as an example of how to
build your own interface.

WWW: http://search.cpan.org/dist/PluceneSimple/

Justification:	socialtext dependency
Quirks:		1/6 test fails
2006-08-02 10:06:15 +00:00
Ying-Chieh Liao
1eb712f0b3 add p5-Text-Quote 0.3
Quotes strings as required for perl to eval them back correctly

PR:		101018
2006-08-02 01:20:01 +00:00
Cheng-Lung Sung
d8ddbeefc3 Add p5-Text-Delimited 2.00, module for parsing delimited text files.
PR:		ports/101158
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-01 12:29:47 +00:00
Cheng-Lung Sung
b83b882d46 Add p5-XML-Literal 0.01, syntax suppor for XML literals.
PR:		ports/101155
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-01 12:26:26 +00:00
Cheng-Lung Sung
c63b7a1917 - p5-XML-RSS-Feed provide the ability of Persistant XML RSS Encapsulation 2006-08-01 03:33:57 +00:00
Cheng-Lung Sung
cd4aa0dc19 Add pear-XML_DTD 0.4.2, parsing of DTD files and DTD validation of XML
files.

PR:		ports/101081
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-08-01 01:37:09 +00:00
Roman Bogorodskiy
455e551f27 This stem extension for PHP provides stemming capability for a variety of
languages using Dr. M.F. Porter's Snowball API, which can be found at:
http://snowball.tartarus.org

WWW: http://pecl.php.net/package/stem/

PR:		ports/101106
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-31 18:51:37 +00:00
Cheng-Lung Sung
7e583b11b0 Add p5-XML-Flow 0.81, store (restore) perl data structures in XML
stream.

PR:		ports/100906
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-27 07:29:15 +00:00
Andrew Pantyukhin
d99033e15c Add port textproc/p5-Text-Bastardize:
Bastardize provides an magical object into which text can be charged
and then returned in various, slighty modified ways.

Among others, bastardize has the following methods:

rdct converts english to hyperreductionist english
    (ex. "english" becomes "")

pig pig latin
    (ex. "hi there" becomes "ihay erethay")

k3wlt0k a k3wlt0kizer developed originally by Fmh

rot13 implements rot13 "encryption" in perl
    (ex. "foo bar" becomes "sbb one")

rev reverses the arrangement of characters

censor attempts to censor text which might be innaproriate

n20e performs numerical abbreviations
    (ex. "numerical_abbreviation" becomes "n20e")

WWW: http://search.cpan.org/dist/Text-Bastardize/
2006-07-25 22:07:05 +00:00
Andrew Pantyukhin
17248c23f7 Add port textproc/p5-Unicode-CheckUTF8:
This is an XS wrapper around some Unicode Consortium code to check if
a string is valid UTF-8, revised to conform to what expat/Mozilla
think is valid UTF-8, especially with regard to low-ASCII characters.

Note that this module has NOTHING to do with Perl's internal UTF8 flag
on scalars.

This module is for use when you're getting input from users and want
to make sure it's valid UTF-8 before continuing.

WWW: http://search.cpan.org/dist/Unicode-CheckUTF8/
2006-07-25 21:50:41 +00:00
Martin Wilke
2d16749363 Text::Emoticon::GoogleTalk is a text filter that replace text emoticons like
":-)", "<3", etc. with the icons of Google Talk (or Gmail Chat), detailed in
http://mail.google.com/support/bin/answer.py?answer=34056

WWW:    http://search.cpan.org/dist/Text-Emoticon-GoogleTalk/

PR:		ports/100676
Submitted by:	chinsan <chinsan.tw(at)gmail.com>
Approved by:	krion (mentor)
2006-07-25 21:13:31 +00:00
Martin Wilke
7e8c3594a3 Add uncrustify , highly configurable source code beautifier.
The goals of this project are simple:
Create a highly configurable, easily modifiable source code beautifier.
What it does:
 * Ident code, aligning on parens, assignments, etc
 * Align on '=' and variable definitions
 * Align structure initializers
 * Align #define stuff
 * Align backslash-newline stuff
 * Reformat comments (a little bit)
 * Fix inter-character spacing
 * Add or remove parens on return statements
 * Add or remove braces on single-statement if/do/while/for statements
 * Highly configurable - 118 configurable options as of version 0.0.15

 WWW: http://uncrustify.sourceforge.net

PR:		ports/100604
Submitted by:	Dmitry Marakasov <amdmi3 at mail.ru>
2006-07-22 09:23:56 +00:00
Cheng-Lung Sung
0d8b6e60d3 Add p5-Lingua-EN-NamedEntity 1.7, basic Named Entity Extraction
algorithm.

PR:		ports/100675
Submitted by:	Jin-Shan Tseng <tjs at cdpa.nsysu.edu.tw>
2006-07-22 04:54:01 +00:00
Rong-En Fan
97bbe4de85 Add p5-CSS 1.07, object oriented access to Cascading Style Sheets (CSS).
PR:		ports/100640
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-21 12:23:04 +00:00
Alex Dupre
ec97c639b2 XStream is a simple library to serialize Java objects
to XML and back again.

WWW: http://xstream.codehaus.org/
2006-07-19 19:10:17 +00:00
Cheng-Lung Sung
6a9f73102a Add p5-Text-Trac 0.03, perl extension for formatting text with Trac Wiki
Style.

PR:		ports/100528
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-19 13:50:26 +00:00
Cheng-Lung Sung
1e0f2c09d7 Add p5-XML-SAXDriver-Excel 0.06, perl SAX driver for converting Excel
files to XML.

PR:		ports/100509
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-19 07:17:54 +00:00
Cheng-Lung Sung
8423c37d63 Add p5-XML-SAXDriver-CSV 0.07, perl SAX driver for converting CSV files
to XML.

PR:		ports/100510
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-19 07:15:52 +00:00
Rong-En Fan
6a070b4106 Add p5-YAML-Tiny 0.05, YAML::Tiny - Read/Write YAML files with as little
code as possible.

PR:		ports/100444
Submitted by:	Yuan-Chung Hsiao <ychsiao at ychsiao.org>
2006-07-17 18:23:22 +00:00
Thierry Thomas
8d1905f704 All dictionaries can be installed separately:
- by default, textproc/aspell installs the English dictionaries (no
    change);
  - thereafter you can install any foreign dictionary;
  - when you install a foreign dictionary, i.e. french/aspell or
    textproc/da-aspell, it installs only the dictionaries, and depends
    upon textproc/aspell for the programs;
  - if you don't need the English dictionaries, you can define
    WITHOUT_DICTEN or install textproc/aspell-without-dicten;
  - add a new port for textproc/en-aspell: if aspell had been installed
    without the English dictionaries, they can be added thereafter;
  - add a missing port for german/alt-aspell;
  - foreign dictionaries are almost independent from textproc/aspell,
    and their maintainership is available.

Credits:	special thanks to Serge Gagnon <ser_gagnon (at) sympatico.ca>
2006-07-15 14:28:44 +00:00
Ion-Mihai Tetcu
bd0196ca13 Slave port for redland python bindings.
PR:		ports/99487
Submitted by:	Jordi Haarman
2006-07-15 09:45:43 +00:00
Cheng-Lung Sung
9001c210aa Add p5-XML-Mini 1.2.8, perl implementation of the XML::Mini XML
create/parse interface.

PR:		ports/100265
Submitted by:	Vivek Khera <vivek at khera.org>
2006-07-15 07:54:56 +00:00
Cheng-Lung Sung
f4bb75c77a Add p5-Text-Format+NWrap 0.52, various subroutines to format and wrap
text.

PR:		ports/100264
Submitted by:	Vivek Khera <vivek at khera.org>
2006-07-15 06:37:53 +00:00
Cheng-Lung Sung
6f2a187950 Add p5-XML-DOM2 0.03, DOM controlled, strict XML module for extentable
xml objects.

PR:		ports/100247
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-15 05:07:13 +00:00
Cheng-Lung Sung
1cc7ffbe07 Add p5-SGML-DTDParse 2.00, parse an SGML or XML DTD.
PR:		ports/100082
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-14 06:16:17 +00:00
Roman Bogorodskiy
c6e2885a32 This package reads and writes any document that conforms to the PDF
specification generously provided by Adobe at
http://partners.adobe.com/public/developer/pdf/index_reference.html

The file format is well-supported, with the exception of the
"linearized" or "optimized" output format, which this module can read
but not write.  Many specific aspects of the document model are not
manipulable with this package (like fonts), but if the input document
is correctly written, then this module will preserve the model
integrity.

This library grants you some power over the PDF security model. Note
that applications editing PDF documents via this library MUST respect
the security preferences of the document. Any violation of this
respect is contrary to Adobe's intellectual property position, as
stated in the reference manual at the above URL.

WWW:	http://search.cpan.org/dist/CAM-PDF/

PR:		ports/100182
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-13 06:56:16 +00:00
Cheng-Lung Sung
e1a85577ba Add p5-Pod-XML 0.96, module to convert POD to XML.
PR:		ports/100095
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-13 03:34:56 +00:00
Rong-En Fan
7519952217 Add p5-Pod-Xhtml 1.52, generate well-formed XHTML documents from POD
format documentation.

PR:		ports/100174
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-13 01:51:48 +00:00
Ion-Mihai Tetcu
ff0a9fbdea EXML is an abstracted, generalized interface to an xml parser/writer, using
ecore data structures and making things generally easy to get around in.

The functions detailed in EXML.h are fairly self explanatory, and the io
interfaces are also generalized and independent (open from a socket, write
to in memory xml image).

WWW: http://www.enlightenment.org/

PR:		ports/100002
Submitted by:	Stanislav Sedov <ssedov at mbsd.msk.ru>
2006-07-12 19:29:28 +00:00
Rong-En Fan
801697b326 Remove textproc/p5-JSON-Syck, which is removed before
and I added it back yesterday.

Requested by:	tobez
2006-07-12 14:13:37 +00:00
Rong-En Fan
1531284261 Add csvdiff 1.3, compare/diff two (comma) seperated files with each
other.

PR:		ports/100097
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-12 05:53:15 +00:00
Cheng-Lung Sung
e41fc5114b Add p5-XML-Handler-Dtd2DocBook 0.41, generate a DocBook documentation
from a DTD.

PR:		ports/100077
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-12 05:46:25 +00:00
Cheng-Lung Sung
96490a9b80 Add p5-XML-Handler-Dtd2Html 0.41, SAX2 handler for generate a Html
documentation from a DTD.

PR:		ports/100075
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-12 03:41:58 +00:00
Rong-En Fan
4e18bce7be You might want to know the difference between JSON and JSON::Syck.
Since JSON is a pure-perl module and JSON::Syck is based on libsyck,
JSON::Syck is supposed to be very fast and memory efficient. See
chansen's benchmark table at
http://idisk.mac.com/christian.hansen/Public/perl/serialize.pl

JSON.pm comes with dozens of ways to do the same thing and lots of
options, while JSON::Syck doesn't. There's only Load and Dump.

Oh, and JSON::Syck doesn't use camelCase method names :-)

Author:	Audrey Tang <autrijus@autrijus.org>
	Tatsuhiko Miyagawa <miyagawa@gmail.com>
WWW:	http://search.cpan.org/dist/JSON-Syck/

PR:		ports/100071
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 20:58:42 +00:00
Rong-En Fan
4d5dc1b29b Add p5-Regexp-Common-profanity_us 2.2, provide regexes for U.S.
profanity.

PR:		ports/100070
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 20:42:43 +00:00
Rong-En Fan
cc79b6fa3b Add p5-Search-Estraier 0.06, pure perl module to use Hyper Estraier
search engine.

PR:		ports/100073
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 15:11:14 +00:00
Cheng-Lung Sung
108d2727e4 Add p5-XML-XPathEngine 0.03, a re-usable XPath engine for DOM-like
trees.

PR:		ports/100074
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 14:14:26 +00:00
Roman Bogorodskiy
374fa97a34 This module transforms HTML into PDF, using an assortment of XML
transformations implemented in PDF::FromHTML::Twig.

There is also a command-line utility, html2pdf.pl, that comes with this
distribution.

WWW:	http://search.cpan.org/dist/PDF-FromHTML/

PR:		ports/100060
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 11:43:29 +00:00
Cheng-Lung Sung
4936834788 Add p5-Text-CSV_PP 1.01, comma-separated values manipulation routines PP
version.

PR:		ports/100064
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 10:40:48 +00:00
Cheng-Lung Sung
aeedefcad6 Add p5-Text-Emoticon 0.04, factory class for Yahoo! and MSN emoticons.
PR:		ports/100068
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 10:34:48 +00:00
Cheng-Lung Sung
acef33bd90 Add p5-Text-Hatena 0.14, perl extension for formatting text with Hatena
Style.

PR:		ports/100066
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 10:25:27 +00:00
Roman Bogorodskiy
477effd0a0 This is a generalized API that allows a module that generates PDFs to
transparently target multiple backends without changing its code.

WWW:	http://search.cpan.org/dist/PDF-Writer/

PR:		ports/100058
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 10:14:27 +00:00
Rong-En Fan
7ff8f72b14 Add p5-XML-RSS-Liberal 0.01, XML::RSS With A Liberal Parser.
PR:		ports/99967
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 05:26:06 +00:00
Rong-En Fan
877f6dd7ea Add p5-XML-Liberal 0.12, super liberal XML parser that parses broken
XML.

PR:		ports/99966
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-11 05:15:55 +00:00
Cheng-Lung Sung
71a956d060 Add p5-Text-Tags 0.04, parses folksonomy space-separated tags.
PR:		ports/99978
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-10 15:47:09 +00:00
Ion-Mihai Tetcu
73cc6eb34d An integrated solution for XML-based publishing in print and web.
It is specifically targeted at producing technical documentation
in the field of computer science.

Documents are written in an XML-based markup language and translated
to different formats with XSL-transformations. At this time, eCromedos
supports the target formats XHTML and LATEX. Where LATEX output can be
further processed into high-quality printable formats by use of the
TEX typesetting system (http://www.ctan.org).

Author:	Tobias Koch <tkoch@ecromedos.net>
WWW:	http://www.ecromedos.net/

PR:		ports/98895
Submitted by:	Nicola Vitale	<nivit at email.it>
2006-07-09 13:08:17 +00:00
Renato Botelho
77887bedcc SDCV(StarDict under Console Version) is simple, cross-platform text-based
utility for work with dictionaries in StarDict's format.

The word from "list of words" may be string with leading '/' for using Fuzzy
search algorithm, string may contain '?' and '*' for using regexp search.
It work in interactive and not interactive mode.

WWW: http://sdcv.sourceforge.net/

PR:		ports/96836
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-05 19:29:51 +00:00
Vasil Dimov
f4212e1cd8 Remove expired leaf ports:
2006-07-01 emulators/linux_base-fc3
2006-06-15 misc/linux-opengroupware
2006-07-01 net/opengk
2006-07-01 security/p5-Crypt-OpenPGP
2006-07-01 textproc/sed_inplace
2006-07-01 textproc/xml4j
2006-07-01 x11-wm/aewm++
2006-07-04 14:04:31 +00:00
Aaron Dalton
05ec2ba8f0 Adding port textproc/p5-Games-Dissociate, A Dissociated Press algorithm
and filter.

Approved by:	tobez (implicit)
2006-06-22 00:53:33 +00:00
Aaron Dalton
6d8c618d09 Adding port textproc/p5-CSS-Squish, Compact many CSS files into one big
file.

Approved by:	tobez (implicit)
2006-06-19 05:15:55 +00:00
Aaron Dalton
f2a2bcfef8 Adding port textproc/p5-XML-Atom-Stream, A client interface for
AtomStream.

Approved by:	tobez (implicit)
2006-06-13 02:20:39 +00:00
Cheng-Lung Sung
b004a5d473 - textproc/p5-Text-MagicTemplate has been expired and replace by
textproc/p5-Template-Magic
2006-06-07 05:14:38 +00:00
Aaron Dalton
b684da0d06 Adding port textproc/p5-Text-Decorator, Apply a chain of filters to text.
Approved by:	tobez (implicit)
2006-06-06 21:02:42 +00:00
Ying-Chieh Liao
057262ad0f add p5-XML-RSS-SimpleGen 11.11
For writing RSS files
2006-06-05 07:50:53 +00:00
Aaron Dalton
9dee8b7d6e Adding port textproc/p5-Parse-Flex, a fast, XS-based flex parser.
- minor 'make test' failure reported to vendor

Approved by:	tobez (implicit)
2006-05-30 22:13:43 +00:00
Aaron Dalton
1ec64150cc Adding port textproc/p5-Syntax-Highlight-Engine-Kate, a Perl port of Kate's syntax highlighting engine.
Approved by:	tobez (implicit)
2006-05-29 22:36:48 +00:00
Aaron Dalton
0d8a30e598 Adding port textproc/p5-Text-Diff3, Compute three-way differences between texts.
Approved by:	tobez (implicit)
2006-05-29 20:36:28 +00:00
Pav Lucistnik
c4d35f2348 - Rename ports
textproc/gtkspell2 -> textproc/gtkspell
  textproc/gtkspell2-reference -> textproc/gtkspell-reference
2006-05-28 15:54:10 +00:00
Pav Lucistnik
870d888d37 - Rename port textproc/gnomespell -> textproc/gnome-spell 2006-05-28 15:45:41 +00:00
Pav Lucistnik
5d8e925c01 - Rename port textproc/gnomedocutils -> textproc/gnome-doc-utils 2006-05-28 15:41:18 +00:00
Aaron Dalton
25f54cb85b Adding port textproc/p5-Petal-Utils, Useful template modifiers for Petal.
Approved by:	tobez (implicit)
2006-05-26 18:44:51 +00:00
Aaron Dalton
fd75220bc8 Adding port textproc/p5-Petal-Mail, Format text e-mail using Petal.
Approved by:	tobez (implicit)
2006-05-26 18:28:08 +00:00
Aaron Dalton
79170ca035 Adding port textproc/p5-Petal-CodePerl, Make Petal go faster by compiling the expression.
Approved by:	tobez (implicit)
2006-05-26 18:16:31 +00:00
Aaron Dalton
6ded5906ef Adding port textproc/p5-Petal, Perl Template Attribute Language - TAL for Perl!
Approved by:	tobez (implicit)
2006-05-25 23:16:05 +00:00
Aaron Dalton
21d57492ac Adding port textproc/p5-MKDoc-XML, The MKDoc XML Toolkit.
Approved by:	tobez (implicit)
2006-05-25 22:55:23 +00:00
Aaron Dalton
a9bfbc5cfc Adding port textproc/p5-Text-Unaccent, Remove accents from a string.
Approved by:	tobez (implicit)
2006-05-25 19:41:00 +00:00
Aaron Dalton
309551a294 Adding port textproc/p5-MathML-Entities, Convert XHTML+MathML Named Entities to Numeric Character References
Approved by:	tobez (implicit)
2006-05-25 16:56:45 +00:00
Aaron Dalton
fb3f2c8920 Adding port textproc/p5-Algorithm-RabinKarp, an implementation of Rabin and Karp's streaming hash, as described
in "Winnowing: Local Algorithms for Document Fingerprinting" by Schleimer,
Wilkerson, and Aiken.

Approved by:	tobez (implicit)
2006-05-24 17:47:11 +00:00
Anton Berezin
cc4567f3d4 Add p5-Data-Report - a framework for flexible reporting. 2006-05-24 11:10:48 +00:00
Sergey Matveychuk
67ce86e3d6 XML::Xerces is the Perl API to the Apache project's Xerces XML
parser. It is implemented using the Xerces C++ API, and it provides
access to most of the C++ API from Perl.

WWW: http://xerces.apache.org/xerces-p/

PR:		ports/95296
Submitted by:	Ken Menzel <kenm@icarz.com>
2006-05-20 19:14:02 +00:00
Aaron Dalton
9f830715e5 Adding port textproc/p5-String-Fraction, convert fractions into unicode chars.
Approved by:	tobez (implicit)
2006-05-16 22:42:55 +00:00
Aaron Dalton
449d06989f Adding port textproc/p5-TeX-Encode, Encode/decode Perl utf-8 strings into TeX
Approved by:	tobez (implicit)
2006-05-15 16:39:18 +00:00
Jean-Yves Lefort
2915abb3bb Move the libxslt programming reference to the libxslt-reference port. 2006-05-11 17:36:53 +00:00
Jean-Yves Lefort
3fb5ddd692 Move the libxml2 programming reference to the libxml2-reference port. 2006-05-11 17:25:12 +00:00
Jean-Yves Lefort
acc7ca62ee Move the gtkspell2 programming reference to the gtkspell2-reference port. 2006-05-11 03:51:11 +00:00
Aaron Dalton
d7c266e3cd Add p5-Pod-LaTeX 0.58, convert Pod to LaTeX.
PR:		ports/93913
Submitted by:	Paul Dlug <paul@aps.org>
Approved by:	tobez (implicit)
2006-05-10 01:59:49 +00:00
Andrey Slusar
1dc59b2a33 Add dictem-xemacs21-mule: dictEm is a dict client for [X]Emacs.
Submitted by:	Max N. Boyarov <m.boyarov@gmail.com>
2006-05-08 20:37:34 +00:00
Andrey Slusar
92e2b085f8 Add dictem: DictEm is a dict client for [X]Emacs.
Submitted by:	Max N. Boyarov <m.boyarov@gmail.com>
2006-05-08 20:14:31 +00:00
Vanilla I. Shu
31720e6c6d Add pecl-ctemplate, it's php extension for google ctemplate library. 2006-05-07 13:39:49 +00:00
Aaron Dalton
9b2b727c3f Add gladtex 0.3, a program to replace LaTeX formulas in HTML files with
images.

PR:		ports/92820
Submitted by:	Nicola Vitale	<nivit@email.it>
Approved by:	tobez
2006-05-05 06:34:05 +00:00
Aaron Dalton
019b4069c7 Add textogif 1.1, a Perl script to convert LaTeX formulas into GIFs.
PR:		ports/92800
Submitted by:	Nicola Vitale	<nivit@email.it>
Approved by:	tobez
2006-05-05 06:32:30 +00:00
Pav Lucistnik
a8bec7153e Divmod Reverend is a simple, general purpose Bayesian classifier,
written in Python.
It is designed to be easy to adapt and extend for your application.

Stuff you can do with the Reverend:

    * classify RSS stories
    * classify recipes by cuisine
    * who do you write like? Shakespeare, Dickens or Austen
    * detect the language of a document
    * is your code more like Guido's or Peter's

Author:	Amir Bakhtiar <amir@divmod.org>
WWW:	http://www.divmod.org/trac/wiki/DivmodReverend

PR:		ports/96531
Submitted by:	Nicola Vitale <nivit@email.it>
2006-05-01 12:56:49 +00:00
Ying-Chieh Liao
d6bad8c46b add iksemel 1.2
XML parser library mainly designed for Jabber applications

PR:		96283
Submitted by:	trasz <trasz@pin.if.uz.zgora.pl>
2006-04-28 07:34:00 +00:00
Ying-Chieh Liao
220e8ece45 add google-ctemplate 0.1
A simple but powerful template language for C++

pass maintainership to clsung
2006-04-27 01:52:15 +00:00
Anton Berezin
aae8188e7e Add p5-Text-Markdown 1.0.3, markdown Perl5 module.
PR:		ports/96292
Submitted by:	Fernan Aguero <fernan@iib.unsam.edu.ar>
2006-04-25 09:08:55 +00:00
Erwin Lansing
3ec36ea94c KinoSearch is a loose port of the Java search engine library Apache Lucene,
written in Perl and C. The archetypal application is website search, but it
can be put to many different uses.

Features
 * Extremely fast and scalable - can handle millions of documents
 * Full support for 12 Indo-European languages.
 * Support for boolean operators AND, OR, and AND NOT; parenthetical
   groupings, and prepended +plus and -minus
 * Algorithmic selection of relevant excerpts and highlighting of search terms
   within excerpts
 * Highly customizable query and indexing APIs
 * Phrase matching
 * Stemming
 * Stoplists

WWW: http://www.rectangular.com/kinosearch/

PR:		ports/96115
Submitted by:	Vivek Khera <vivek@khera.org>
2006-04-23 08:44:40 +00:00
Yen-Ming Lee
21687c1105 - add p5-XML-RSS-Parser 4.0
XML::RSS::Parser is a lightweight liberal parser of RSS feeds. This parser
is "liberal" in that it does not demand compliance of a specific RSS version
and will attempt to gracefully handle tags it does not expect or understand.
The parser's only requirements is that the file is well-formed XML and
remotely resembles RSS. Roughly speaking, well formed XML with a channel
element as a direct sibling or the root tag and item elements etc.

There are a number of advantages to using this module then just using
a standard parser-tree combination. There are a number of different RSS
formats in use today. In very subtle ways these formats are not entirely
compatible from one to another. XML::RSS::Parser makes a couple assumptions
to "normalize" the parse tree into a more consistent form. For instance,
it forces channel and item into a parent-child relationship.

WWW: http://search.cpan.org/dist/XML-RSS-Parser/
2006-04-21 04:09:07 +00:00
Marc G. Fournier
3c0df914cb add pecl-xdiff 2006-04-20 17:37:23 +00:00
Cheng-Lung Sung
30b40f4cf2 - p5-Tree-Suffix is a perl interface to the libstree library (C library) 2006-04-19 08:38:22 +00:00
MANTANI Nobutaka
be9825e4f4 Add uim-qt. QT modules of uim input method.
PR:		ports/95785
Submitted by:	NIIMI Satoshi <sa2c@sa2c.net>
2006-04-16 16:47:40 +00:00
MANTANI Nobutaka
e1cb76089d Add uim-m17nlib. Uim plugin using m17n library input method.
PR:		ports/95785
Submitted by:	NIIMI Satoshi <sa2c@sa2c.net>
2006-04-16 16:46:19 +00:00
MANTANI Nobutaka
bf53e8b27e Add uim-gtk. GTK+ modules of uim input method.
PR:		ports/95785
Submitted by:	NIIMI Satoshi <sa2c@sa2c.net>
2006-04-16 16:44:32 +00:00
MANTANI Nobutaka
aafe137582 Add uim-gnome. GNOME applets of uim input method
PR:		ports/95785
Submitted by:	NIIMI Satoshi <sa2c@sa2c.net>
2006-04-16 16:43:18 +00:00
Norikatsu Shigemura
d40ad636ad Add p5-WWW-Google-SiteMap 1.08, is a perl extention to manage
Google SiteMaps.

The Sitemap Protocol allows you to inform search engine
crawlers about URLs on your Web sites that are available
for crawling.

WWW:	http://search.cpan.org/dist/WWW-Google-SiteMap/
2006-04-14 23:06:16 +00:00
Mark Linimon
4c9725dfc2 Finish removing p5-JSON-Syck.
Pointy hat to:	tobez
2006-04-14 19:18:58 +00:00
Anton Berezin
0c3674981c Add textproc/p5-Makefile-Parser 0.11, a Perl module that parses makefiles. 2006-04-11 13:30:05 +00:00
Alexey Dokuchaev
9573492329 Add ssddiff, a diff application for semistructured data such as XML files.
WWW: http://ssddiff.alioth.debian.org/
2006-04-11 04:05:13 +00:00
Renato Botelho
52140fb40c nXML is a C library for parsing, writing and creating XML 1.0 and 1.1 files
or streams. It supports utf-8, utf-16be and utf-16le, ucs-4 (1234, 4321,
2143, 2312).

Author:	Andrea Marchesini <bakunin@autistici.org>
WWW:	http://www2.autistici.org/bakunin/libnxml/doc/

PR:		ports/95539
Submitted by:	Nicola Vitale	<nivit@email.it>
2006-04-09 18:02:57 +00:00
Renato Botelho
43cf76f912 Move recently added textproc/pyenchant to textproc/py-enchant for consistency
with other Python ports

Noted by:	danfe
Approved by:	maintainer
2006-04-06 15:05:08 +00:00
Renato Botelho
f5e19826c5 PyEnchant is a set of language bindings and some wrapper classes to make
the excellent Enchant spellchecker available as a Python module.
The bindings are generated using SWIG. It includes all the functionality
of Enchant with the flexibility of Python and a nice 'Pythonic'
object-oriented interface. It also aims to provide some higher-level
functionality than is available in the C API.

Author:	Ryan Kelly <ryan@rfk.id.au>
WWW:	http://pyenchant.sourceforge.net/

PR:		ports/95284
Submitted by:	Nicola Vitale	<nivit@email.it>
2006-04-06 11:39:20 +00:00
Anton Berezin
090b0675d0 Add textproc/p5-Lingua-EN-Fathom 1.09, a Perl module that measures
readability of English text.
2006-03-28 11:48:30 +00:00
Pav Lucistnik
30e54bba97 btparse is a C library for parsing and processing BibTeX files.
It provides a lexical scanner and LR parser (constructed by PCCTS),
both of which are efficient and offer good error detection and
recovery; a set of functions for traversing the AST (abstract
syntax tree) generated by the parser; and utility functions for
manipulating strings according to BibTeX conventions.

WWW:	http://www.gerg.ca/software/btOOL

PR:		ports/94686
Submitted by:	Kay Lehmann <kay_lehmann@web.de>
2006-03-20 18:36:06 +00:00
Pav Lucistnik
4513766db9 Emacs Muse is an authoring and publishing environment for Emacs. It
simplifies the process of writings documents and publishing them to
various output formats.

Muse consists of two main parts: an enhanced text-mode for authoring
documents and navigating within Muse projects, and a set of publishing
styles for generating different kinds of output.

WWW: http://www.emacswiki.org/cgi-bin/wiki/MuseMode

PR:		ports/93716
Submitted by:	Dryice Liu <dryice@dryice.name>
2006-03-16 11:04:55 +00:00
Xin LI
d6c3971615 Add fixrtf, an utility that is intended for use with the
Documentation Project, which can embed PNGs into RTF
files.

Obtained from:	The FreeBSD Simplified Chinese Project
Approved by:	clive
2006-03-16 07:50:15 +00:00
Renato Botelho
0a4fa21bc3 Add libstree 0.4.2, a Generic Suffix Tree Library.
PR:		ports/94008
Submitted by:	clsung
2006-03-15 10:40:38 +00:00
Emanuel Haupt
2d095b145e Add p5-Text-vCard, a package to parse, edit and create multiple
vCards (RFC 2426).
2006-02-22 14:42:39 +00:00
Aaron Dalton
4733ac6446 Adding port textproc/p5-AI-Categorizer, a tool to aid in the automatic categorization of text documents.
Approved by:	tobez (implicit)
2006-02-20 17:47:38 +00:00
Aaron Dalton
89182e000d Adding port textproc/p5-CQL-Parser, compiles CQL strings into parse trees of Node subtypes.
Approved by:	tobez (implicit)
2006-02-19 23:26:45 +00:00
Aaron Dalton
4b9c802530 Adding port textproc/p5-String-Tokenizer, A simple string tokenizer.
Approved by:	tobez (implicit)
2006-02-19 23:05:28 +00:00
Aaron Dalton
a21ec186c9 Adding port textproc/p5-Parse-PhoneNumber.
Approved by:	tobez (implicit)
2006-02-18 22:34:37 +00:00
Aaron Dalton
8debde9f8b Adding port textproc/p5-Unicode-Normalize, implements Unicode normalization forms.
Committed by:	aaron
Approved by:	tobez (implicit)
2006-02-18 09:31:07 +00:00
Aaron Dalton
3c581302ff Add p5-Text-Capitalize 0.5, capitalize strings ("to WORK AS titles"
becomes "To Work as Titles").

PR:		ports/92856
Submitted by:	Dmitry Marakasov <amdmi3@mail.ru>
Approved by:	tobez (implicit)
2006-02-17 23:36:38 +00:00
Eric Anholt
a26cccc0b7 New port: asciidoc, a tool to create man, html, pdf, etc. from short, formatted
plain-text documents.
2006-02-15 02:22:31 +00:00
Anton Berezin
dcfa97dfcd Add textproc/p5-HTML-EscapeEvil, a Perl module that will escape HTML
tags considered evil.
2006-02-14 08:28:35 +00:00
Jean-Yves Lefort
879a16d795 Add xmldiff.
xmldiff uses xmlprpr and diff to display meaningful differences in XML
files in an easy to read format. Output formats available include HTML,
ANSI colour, and regular diff. The coloured modes are particularly
useful for viewing small differences in context within large XML files.

WWW: http://software.decisionsoft.com/tools.html

PR:		ports/92947
Submitted by:	Paul Chvostek <paul+ports@it.ca>
2006-02-11 07:35:58 +00:00
Jean-Yves Lefort
7e63466156 Add xmlprpr.
An XML pretty printer created to format XML that doesn't make use of
mixed content. In the default mode each element is put on a separate
line with consistent indentation. It can also separate attributes onto
individual lines, sort attributes in a specified or alphabetic order,
expand self closing tags, and more.

Note that the distribution calls this tool "xmlpp", but it has been
renamed so as not to conflict with an xmlpp already in the ports tree.

WWW: http://software.decisionsoft.com/tools.html

PR:		ports/92946
Submitted by:	Paul Chvostek <paul+ports@it.ca>
2006-02-11 07:14:24 +00:00
Cheng-Lung Sung
ff5ead4435 Add p5-Text-WordDiff 0.02, text::WordDiff - Track changes between
documents.

- required for Bricolage 1.10.0

PR:		ports/92472
Submitted by:	Rod Taylor <ports@rbt.ca>
2006-01-30 14:41:28 +00:00
Jun Kuriyama
e480c1064c Add rast 0.3.1, full-text search system using N-gram. 2006-01-30 13:53:33 +00:00
Tilman Keskinoz
dab6f540ea Add py-chardet, Mark Pilgrim's "Universal Encoding Detector" 2006-01-25 22:46:32 +00:00
Andrey Slusar
34ffb5a9cd tools for helping translation of documentation
The po4a (po for anything) project goal is to ease translations
(and more interestingly, the maintenance of translations) using
gettext tools on areas where they were not expected like documentation.

This package contains the main libraries of po4a, and the following sub-modules:

  - KernelHelp: Help messages of each kernel compilation option.
  - Man: Good old manual page format.
  - Pod: Perl documentation format.
  - Sgml: either debiandoc or docbook DTD.
  - Dia: uncompressed Dia diagrams.
  - LaTeX: generic TeX or LaTeX format

WWW: http://packages.debian.org/unstable/text/po4a

PR:		ports/91532
Submitted by:	Meno Abels <meno.abels@adviser.com>
2006-01-24 19:48:09 +00:00
Andrey Slusar
239bbd6172 This module is a substitution for Text::Wrap, supporting
multibyte characters such as UTF-8, EUC-JP, and GB2312, fullwidth
characters such as east Asian characters, combining characters
such as diacritical marks and Thai, and languages which don't
use whitespaces between words such as Chinese and Japanese.

WWW: http://packages.debian.org/unstable/perl/libtext-wrapi18n-perl

PR:		ports/91532
Submitted by:	Meno Abels <meno.abels@adviser.com>
2006-01-24 19:42:34 +00:00
Andrey Slusar
c81c300815 This is a part of fakeroot project.
Fakeroot runs a command in an environment were it appears to have
root privileges for file manipulation, by setting LD_PRELOAD to a
library with alternative versions of getuid(), stat(), etc. This
is useful for allowing users to create archives (tar, ar, .deb .rpm
etc.) with files in them with root permissions/ownership. Without
fakeroot one would have to have root privileges to create the
constituent files of the archives with the correct permissions and
ownership, and then pack them up, or one would have to construct
the archives directly, without using the archiver.

WWW: http://freshmeat.net/projects/fakeroot

PR:		ports/91532
Submitted by:	Meno Abels <meno.abels@adviser.com>
2006-01-24 19:36:51 +00:00
Aaron Dalton
e428e89715 Add p5-XML-OPML 0.26, creates and updates OPML (Outline Processor Markup
Language) files.

PR:		ports/89350
Submitted by:	ijliao
Approved by:	tobez
2006-01-22 23:55:04 +00:00
Cheng-Lung Sung
54ffed78bb Add openvanilla-modules 0.7.1.20050902, input method modules for
OpenVanilla.

PR:		ports/85731
Submitted by:	Leland Wang <llwang@infor.org>
2006-01-21 06:12:20 +00:00
Cheng-Lung Sung
928d8277e9 Add scim-openvanilla 0.7.1.20050902, SCIM OpenVanilla input method
(IM)/output filter (OF) framework.

PR:		ports/85731
Submitted by:	Leland Wang <llwang@infor.org>
2006-01-20 10:08:11 +00:00
Cheng-Lung Sung
9d2c34fea1 Add openvanilla-framework 0.7.1.20050902, development header files for
OpenVanilla.

PR:		ports/85731
Submitted by:	Leland Wang <llwang@infor.org>
2006-01-20 08:07:05 +00:00
Thierry Thomas
bdab6f5f0e - Add a Gujarati (gu) dictionary;
- Update Lithuanian (lt) & Uzbek (uz) dictionaries.
2006-01-19 23:09:42 +00:00
Alex Dupre
f31cb16e15 Add php5-xmlwriter, the xmlwriter shared extension for php. 2006-01-16 12:09:32 +00:00
Cheng-Lung Sung
2181e34108 Add p5-XML-RSS-LibXML 0.15, XML::RSS with XML::LibXML.
PR:		ports/91819
Submitted by:	Gea-Suan Lin <gslin@gslin.org>
2006-01-16 02:39:06 +00:00
Anton Berezin
4d49b00821 Add textproc/p5-JSON-Syck 0.01, a fast, lightweight JSON loader and
dumper.
2006-01-10 19:37:06 +00:00
Anton Berezin
f1f8252ece Add textproc/p5-YAML-Syck 0.03, a Perl module that implements fast,
lightweight YAML loader and dumper.
2005-12-28 09:46:03 +00:00
Anton Berezin
12c4d9cda6 Add textproc/p5-String-Koremutake 0.30, a Perl module to convert to/from
Koremutake Memorable Random Strings.
2005-12-27 13:34:49 +00:00
MANTANI Nobutaka
25e658047c Add uim-el-xemacs21-mule. uim-el-xemacs21-mule is XEmacs lisp for uim input method library. 2005-12-25 06:10:54 +00:00
MANTANI Nobutaka
5363593c75 Add uim-el. uim-el is Emacs lisp for uim input method library. 2005-12-25 05:55:30 +00:00
Anton Berezin
83e2e3f0ff Add textproc/p5-HTML-Truncate 0.03, a Perl module that truncates HTML by
character count while preserving well-formedness.
2005-12-23 13:03:36 +00:00
Lars Thegler
c586332276 Add p5-XML-SAX-Expat-Incremental 0.04, XML::SAX::Expat subclass for
non-blocking (incremental).
2005-12-22 14:49:28 +00:00
Pav Lucistnik
7604ffa399 The RomDict project includes (for now) only an English -- Romanian dictionary.
This port installs it in the data format for use with dictd server.

WWW: http://romdict.sourceforge.net/

PR:		ports/90620
Submitted by:	Ion-Mihai "IOnut" Tetcu <itetcu@people.tecnik93.com>
2005-12-21 00:15:32 +00:00
Jun Kuriyama
55b7c5608a Add p5-Text-CSV-Simple 1.00, perl module for Simpler parsing of CSV
files.
2005-12-17 16:46:46 +00:00
Sam Lawrance
dad50d99b1 Remove expired ports. 2005-12-15 03:18:03 +00:00
Lars Thegler
1fdfe397dd Add p5-Devel-TraceSAX 0.021, trace SAX events. 2005-12-09 20:30:55 +00:00
Alex Dupre
39519c176a Add new XMLReader php5 extension. 2005-12-09 19:45:41 +00:00
Renato Botelho
4285e2a206 The Excel::Template perl module is a module used for templating Excel files.
Its genesis came from the need to use the same datastructure as HTML::Template,
but provide Excel files instead. The existing modules don't do the trick, as
they require replication of logic that's already been done within
HTML::Template.

WWW: http://search.cpan.org/dist/Excel-Template/

PR:		ports/90044
Submitted by:	Espen Tagestad <espen@tagestad.no>
2005-12-07 15:03:07 +00:00
Hiroki Sato
2777d9e407 Add textproc/heirloom-doctools:
The Heirloom Documentation Tools provide troff, nroff, and related
 utilities to format manual pages and other documents for output
 on terminals and printers.  They are portable and enhanced versions
 of the utilities released by Sun as part of OpenSolaris, and, for
 pic, grap, mpm, and some minor parts, by Lucent as part of Plan 9.
2005-12-05 15:45:27 +00:00
MANTANI Nobutaka
b2f9d34f45 Move japanese/uim port to textproc category after repocopy.
PR:		ports/89725
Submitted by:	nobutaka
2005-11-30 16:32:14 +00:00
Anton Berezin
c0491943cb Add textproc/p5-String-Divert 0.96, a Perl module that provides a
scalar-like string object that facilitates nested generation of
structured output.
2005-11-29 13:27:21 +00:00
Cheng-Lung Sung
cf18e651e0 - Critique Perl source for style and standards, which give suggestions
about your perl source code.
- ijliao made this port, and I will maintain it :)
2005-11-28 06:20:52 +00:00