1070 commits

Author SHA1 Message Date
Cheng-Lung Sung
4e8eca8e54 Add p5-Parse-CSV 1.00, highly flexible CSV parser for large files.
PR:		ports/104559
Submitted by:	Jin-Shan Tseng <tjs at cdpa.nsysu.edu.tw>
2006-11-01 01:24:23 +00:00
Alejandro Pulver
e4774a0c80 XML version of the DocBook DTD V4.5.
WWW: http://www.docbook.org/xml/4.5/index.html
2006-10-31 19:43:38 +00:00
Dirk Meyer
8a76e3c97e Duncan is an English-Thai dictionary. It was developed on Mac OS X, using the
Cocoa libraries. The GNUstep port that can be found here was done by me. It
was very easy to do, primarily requiring only new interface files and build
files.
PR:		104964
Submitted by:	Gürkan Sengün
2006-10-31 17:26:33 +00:00
Cheng-Lung Sung
8b3c859c2a Add dwdiff 1.2, a delimited word diff program.
PR:		ports/103840
Submitted by:	Alexander Logvinov <ports at logvinov.com>
2006-10-07 16:17:50 +00:00
Cheng-Lung Sung
1060cd5ded Add p5-Regexp-DefaultFlags 0.01, set default flags on regular
expressions.

PR:		ports/103337
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-10-02 00:28:30 +00:00
Sergey Skvortsov
e34dada69d Remove p5-XML-LibXML-XPathContext because it was merged into p5-XML-LibXML
Approved by:	maintainer (mat)
2006-09-30 14:08:36 +00:00
Anton Berezin
06ff9c0db3 Add p5-Pod-WSDL 0.04, a Perl module to create WSDL documents from
extended pod.
2006-09-30 11:23:16 +00:00
Cheng-Lung Sung
671e48e5a0 Add p5-XML-Compile 0.05, compilation based XML processing.
PR:		ports/102626
Submitted by:	Jin-Shan Tseng <tjs at cdpa.nsysu.edu.tw>
2006-09-26 08:59:45 +00:00
Cheng-Lung Sung
6a772478f0 Add p5-POSIX-Regex 0.89, OO interface for the GNU regex engine.
PR:		ports/102253
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-09-26 08:18:26 +00:00
Martin Wilke
f98ad549cc Text::Language::Guess guesses a document's language. Its implementation
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
it determines how many of the known stopwords the document contains for
each language supported by "Lingua::StopWords".

Each word in the document recognized as stopword of a particular
language scores one point for this language.

The "language_guess()" function takes a document as a parameter and
returns the abbreviation of the language that it is most likely written
in.
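
The scoring scheme described above can be sketched in Python. This is only an illustration of the idea, not the module's implementation: the tiny `STOPWORDS` table and the `guess_language` name are hypothetical stand-ins for the lists that "Lingua::StopWords" provides.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword lists for illustration only.
STOPWORDS = {
    "en": {"the", "and", "is", "of", "to", "in", "it"},
    "de": {"der", "die", "das", "und", "ist", "nicht", "ein"},
    "fr": {"le", "la", "les", "et", "est", "un", "une"},
}


def guess_language(text):
    """Return the language whose stopwords occur most often, or None."""
    # Split the document into lowercase alphabetic words.
    words = re.findall(r"[^\W\d_]+", text.lower())
    scores = Counter()
    for word in words:
        for lang, stops in STOPWORDS.items():
            if word in stops:
                scores[lang] += 1  # one point per recognized stopword
    if not scores:
        return None  # no stopword of any known language was seen
    return scores.most_common(1)[0][0]


if __name__ == "__main__":
    print(guess_language("The cat is in the house and it sleeps"))
```

With lists this small the guess is only as good as the overlap between languages; real stopword lists are much longer.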

Author: Mike Schilli <cpan@perlmeister.com>
WWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/

PR:		ports/103571
Submitted by:	Masahiro Teramoto <markun@onohara.to>
2006-09-24 19:51:59 +00:00
Martin Wilke
15a6efbb01 Parse FOAF (Friend of a Friend) data
Author: Benjamin Trott <ben+cpan@stupidfool.org>
WWW: http://search.cpan.org/~btrott/XML-FOAF-0.03/

PR:		ports/103579
Submitted by:	markun@onohara.to
2006-09-24 19:48:48 +00:00
Cheng-Lung Sung
d13c67f51e - a fast, middleweight template engine.
- this port is required by databases/p5-SQLite-Work
2006-09-20 04:51:29 +00:00
Andrew Pantyukhin
697d566341 Add port textproc/ffe:
ffe is a program for extracting fields from flat file records and
displaying them in different formats. ffe relies on the configuration file
to control input file structure and the output format.

WWW: http://sourceforge.net/projects/ff-extractor/
Author: Timo Savinen <tjsa@iki.fi>
2006-09-19 17:45:25 +00:00
Martin Wilke
bb7c6094dc This module allows you to extract Hatena keywords used in an
arbitrary text and also allows you to mark up a text as HTML
with the keywords.

A Hatena keyword is an element in a suite of web sites
*.hatena.ne.jp having blogs and social bookmarks among others.
Please refer to http://d.hatena.ne.jp/keyword/ (in Japanese) for details.

In Hatena Diary, a blog hosting service, a Hatena keyword found in
a posting is linked to the keywords page automatically.
You can implement the same kind of feature outside Hatena using this module.

It queries the Hatena Keyword Link API internally to retrieve terms.

Author: Naoya Ito <naoya@bloghackers.net>
WWW:    http://search.cpan.org/~naoya/Hatena-Keyword-0.04/

PR:		ports/102794
Submitted by:	Masahiro Teramoto <markun(at)onohara.to>
2006-09-12 21:11:30 +00:00
Sergey Skvortsov
385e0bc7ea Add diffmark 0.05, XML diff and merge package.
2006-09-05 17:54:18 +00:00
Cheng-Lung Sung
4dfc31d7e0 - not fetchable, remove it.
2006-09-04 02:32:12 +00:00
Andrew Pantyukhin
309d264df7 Add port textproc/minised:
This is a smaller, cheaper, faster SED implementation. Minix uses it. GNU
used to use it, until they built their own sed around an extended (some
would say over-extended) regexp package.

For embedded use we searched for a tiny sed implementation especially for
use with the dietlibc and found Eric S. Raymond's sed implementation quite
handy, though it suffered from several bugs and was no longer under active
maintenance. After sending a bunch of fixes we agreed to continue maintaining
this lovely, historic sed implementation.

Along with a lot of fixes and cleanups, further speedups, and some missing features
and POSIX conformance, we also added a test-suite to the package, so
regressions are quickly and easily uncovered.

WWW: http://www.exactcode.de/oss/minised/
Author: ExactCode <info@exactcode.de>
2006-09-02 17:49:07 +00:00
Ying-Chieh Liao
8700f476c4 add libyaml 0.0.1
A YAML 1.1 parser and emitter written in C

PR:		102712
2006-09-01 09:50:25 +00:00
Andrey Slusar
c21f5cae29 Support code for editing reStructuredText with Emacs.
Basically, this package contains:

- Functions to automatically adjust and cycle the section underline
  decorations;
- A mode that displays the table of contents and allows you to jump anywhere
  from it;
- Functions to insert and automatically update a TOC in your source
  document;
- A mode which supports font-lock highlighting of reStructuredText
  structures;
- Some other convenience functions.

This package is the result of merging:

- restructuredtext.el
- rst-mode.el
- rst-html.el

Those files are now OBSOLETE and have been replaced by this single
package file (2005-10-30).

WWW: http://docutils.sourceforge.net/docs/user/emacs.html

PR:		ports/102384
Submitted by:	Denis Shaposhnikov <dsh at vlink.ru>
2006-08-23 05:35:57 +00:00
Cheng-Lung Sung
702ee46358 Add p5-Text-SwiftTemplate 0.0701, a fast, lightweight template engine.
PR:		ports/102280
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-22 14:53:05 +00:00
Munechika SUMIKAWA
008bee4237 Plagger is a pluggable RSS/Atom feed aggregator written in
Perl. Everything is implemented as a small plugin and you can mash
them up together using Plagger core API and plugin hooks. You can
think of Plagger as a blosxom or qpsmtpd for RSS aggregator.

WWW: http://plagger.org/

WARNING: This port depends on thousands of ports, especially with
full options.
2006-08-22 09:20:26 +00:00
Cheng-Lung Sung
0545f06ee3 Add p5-XML-DOM-Lite 0.11, lite Pure Perl XML DOM Parser Kit.
PR:		ports/102326
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-22 07:08:15 +00:00
Cheng-Lung Sung
76d7bca3b5 Add p5-ack 1.24, a grep-like text finder for large trees of text.
PR:		ports/102073
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-08-16 05:00:47 +00:00
Cheng-Lung Sung
6c1fbb75cc Add p5-XML-TreePP 0.18, pure Perl implementation for parsing/writing xml
files.

PR:		ports/101990
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-14 07:23:45 +00:00
Andrew Pantyukhin
e0688cabc2 Add port textproc/xxdiff-scripts:
xxdiff is a computer program that allows a user (usually a software
developer of some sort) to easily visualize the differences between
files. The manner and goal for which this process is applied over
multiple files is highly dependent on the application, and most of
the time is driven by custom user scripts.

For example, a configuration management engineer in a company might
provide some kind of merge policing environment, that allows software
developers to review changes in files for the purpose of accepting or
rejecting a submitted changeset to a codebase. Another example is
that of a developer wishing to review the changes he made to a
checkout of files from a source-code management system such as CVS,
Subversion, ClearCase, Perforce, etc.

WWW: http://furius.ca/xxdiff/doc/xxdiff-scripts.html
2006-08-13 19:09:58 +00:00
Andrew Pantyukhin
7acd3ed4da Add port textproc/flex:
Flex is a tool for generating scanners. A scanner, sometimes called a
tokenizer, is a program which recognizes lexical patterns in text. The
flex program reads user-specified input files, or its standard input
if no file names are given, for a description of a scanner to generate.
The description is in the form of pairs of regular expressions and C
code, called rules. Flex generates a C source file named, "lex.yy.c",
which defines the function yylex(). The file "lex.yy.c" can be compiled
and linked to produce an executable. When the executable is run, it
analyzes its input for occurrences of text matching the regular
expressions for each rule. Whenever it finds a match, it executes the
corresponding C code.
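
The rule-driven scanning described above can be illustrated with a small Python sketch. It is not flex and does not generate C. The `RULES` table, the token names, and the `scan` function are hypothetical, chosen only to show how pattern/action pairs drive a scanner. Like flex, it prefers the longest match and breaks ties in favour of the earliest rule.

```python
import re

# Each rule pairs a regular expression with an action, mirroring the
# pattern/action rules in a flex input file. An action of None discards the match.
RULES = [
    (r"[ \t\n]+", None),                        # skip whitespace
    (r"\d+", lambda s: ("NUMBER", int(s))),
    (r"[A-Za-z_]\w*", lambda s: ("IDENT", s)),
    (r"[-+*/=()]", lambda s: ("OP", s)),
]
COMPILED = [(re.compile(pattern), action) for pattern, action in RULES]


def scan(text):
    """Tokenize text by repeatedly applying the best-matching rule."""
    pos = 0
    tokens = []
    while pos < len(text):
        best = None
        for pattern, action in COMPILED:
            m = pattern.match(text, pos)
            # Keep the longest match; on a tie the earlier rule stays.
            if m and m.end() > pos and (best is None or m.end() > best[0].end()):
                best = (m, action)
        if best is None:
            raise ValueError(f"unexpected character {text[pos]!r} at offset {pos}")
        m, action = best
        if action is not None:
            tokens.append(action(m.group()))  # run the rule's action
        pos = m.end()
    return tokens


if __name__ == "__main__":
    print(scan("x = 42 + y1"))
```

A generated flex scanner compiles all rules into one automaton instead of trying each pattern in turn; the loop here trades speed for readability.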

WWW: http://flex.sourceforge.net/

Note that there's flex 2.5.4 in the base system. This port provides
a newer version for programs that require it, textproc/xxdiff for one.
2006-08-13 15:30:22 +00:00
Cheng-Lung Sung
647f97515d Add msort 8.27, sorting files in sophisticated ways.
PR:		ports/101777
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-11 08:18:00 +00:00
Cheng-Lung Sung
b1aeba77f3 Add metaf2xml 1.8, parses and decodes aviation routine weather reports
and stores the components in XML.

PR:		ports/101779
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-11 06:52:58 +00:00
Cheng-Lung Sung
c301a34f24 - Ferret is a high-performance, full-featured text search engine library written for Ruby.
2006-08-10 09:15:42 +00:00
Cheng-Lung Sung
f6858c0da4 - CLucene is a C++ port of Lucene: the high-performance,
full-featured text search engine written in Java.
  CLucene is faster than Lucene as it is written in C++.
2006-08-10 01:45:03 +00:00
Cheng-Lung Sung
adcd8a85aa - YamCha is a generic, customizable, and open source
text chunker oriented toward a lot of NLP tasks.
2006-08-03 03:03:18 +00:00
Andrew Pantyukhin
3c07d6686e Add port textproc/p5-Text-Flowed:
This module provides functions that deal with formatting data with
Content-Type 'text/plain; format=flowed' as described in RFC2646
(http://www.rfc-editor.org/rfc/rfc2646.txt). In a nutshell,
format=flowed text solves the problem in plain text files where it
is not known which lines can be considered a logical paragraph,
enabling lines to be automatically flowed (wrapped and/or joined)
as appropriate when displaying.

In format=flowed, a soft newline is expressed as " \n", while hard
newlines are expressed as "\n". Soft newlines can be automatically
deleted or inserted as appropriate when the text is reformatted.
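
The soft/hard newline distinction can be shown with a short Python sketch. It is a simplification, not the module's code: the `unflow` name is hypothetical, and quoting, space-stuffing, and signature separators from RFC 2646 are deliberately ignored.

```python
def unflow(text):
    """Join soft-broken lines into paragraphs.

    A line ending in a space carries a soft newline and continues on the
    next line; any other line ends with a hard newline and closes the
    paragraph. Simplified: no quote-depth or space-stuffing handling.
    """
    lines = text.split("\n")
    if lines and lines[-1] == "":
        lines.pop()  # ignore the empty piece after a final newline
    paragraphs = []
    current = ""
    for line in lines:
        if line.endswith(" "):
            current += line  # trailing space doubles as the word separator
        else:
            paragraphs.append(current + line)
            current = ""
    if current:
        # Input ended on a soft break; emit what was collected.
        paragraphs.append(current.rstrip(" "))
    return paragraphs


if __name__ == "__main__":
    print(unflow("This is a \nflowed paragraph.\nSecond line.\n"))
```

Once the logical paragraphs are recovered, a display program is free to re-wrap them to any width.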

WWW: http://search.cpan.org/dist/Text-Flowed/

Justification:	socialtext dependency
2006-08-02 11:03:38 +00:00
Andrew Pantyukhin
4e80309686 Add port textproc/p5-Plucene-Simple:
This provides a simple interface to Plucene. Plucene is large and
multi-featured, and it is expected that users will subclass it, and tie all the
pieces together to suit their own needs. Plucene::Simple is, therefore,
just one way to use Plucene. It's not expected that it will do exactly
what *you* want, but you can always use it as an example of how to
build your own interface.

WWW: http://search.cpan.org/dist/PluceneSimple/

Justification:	socialtext dependency
Quirks:		1/6 test fails
2006-08-02 10:06:15 +00:00
Ying-Chieh Liao
1eb712f0b3 add p5-Text-Quote 0.3
Quotes strings as required for perl to eval them back correctly

PR:		101018
2006-08-02 01:20:01 +00:00
Cheng-Lung Sung
d8ddbeefc3 Add p5-Text-Delimited 2.00, module for parsing delimited text files.
PR:		ports/101158
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-01 12:29:47 +00:00
Cheng-Lung Sung
b83b882d46 Add p5-XML-Literal 0.01, syntax support for XML literals.
PR:		ports/101155
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-08-01 12:26:26 +00:00
Cheng-Lung Sung
c63b7a1917 - p5-XML-RSS-Feed provides persistent XML RSS encapsulation.
2006-08-01 03:33:57 +00:00
Cheng-Lung Sung
cd4aa0dc19 Add pear-XML_DTD 0.4.2, parsing of DTD files and DTD validation of XML
files.

PR:		ports/101081
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-08-01 01:37:09 +00:00
Roman Bogorodskiy
455e551f27 This stem extension for PHP provides stemming capability for a variety of
languages using Dr. M.F. Porter's Snowball API, which can be found at:
http://snowball.tartarus.org

WWW: http://pecl.php.net/package/stem/

PR:		ports/101106
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-31 18:51:37 +00:00
Cheng-Lung Sung
7e583b11b0 Add p5-XML-Flow 0.81, store (restore) perl data structures in XML
stream.

PR:		ports/100906
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-27 07:29:15 +00:00
Andrew Pantyukhin
d99033e15c Add port textproc/p5-Text-Bastardize:
Bastardize provides a magical object into which text can be charged
and then returned in various, slightly modified ways.

Among others, bastardize has the following methods:

rdct converts english to hyperreductionist english
    (ex. "english" becomes "")

pig pig latin
    (ex. "hi there" becomes "ihay erethay")

k3wlt0k a k3wlt0kizer developed originally by Fmh

rot13 implements rot13 "encryption" in perl
    (ex. "foo bar" becomes "sbb one")

rev reverses the arrangement of characters

censor attempts to censor text which might be inappropriate

n20e performs numerical abbreviations
    (ex. "numerical_abbreviation" becomes "n20e")
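
Three of the transformations listed above are simple enough to sketch in Python. These are illustrative re-implementations, not the module's code: the function names are hypothetical, and the pig latin rule used here (move the leading consonants to the end and append "ay") is an assumption that merely reproduces the example given.

```python
import codecs

VOWELS = "aeiou"


def rot13(text):
    """Rotate each ASCII letter by 13 places; other characters pass through."""
    return codecs.encode(text, "rot_13")


def pig_latin_word(word):
    """Assumed rule: move leading consonants to the end, then append 'ay'."""
    for i, ch in enumerate(word):
        if ch.lower() in VOWELS:
            return word[i:] + word[:i] + "ay"
    return word + "ay"  # no vowel found: just append the suffix


def pig_latin(text):
    return " ".join(pig_latin_word(w) for w in text.split())


def rev(text):
    """Reverse the arrangement of characters."""
    return text[::-1]


if __name__ == "__main__":
    print(rot13("foo bar"))
    print(pig_latin("hi there"))
    print(rev("hi there"))
```

Because rot13 is its own inverse, applying `rot13` twice returns the original text.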

WWW: http://search.cpan.org/dist/Text-Bastardize/
2006-07-25 22:07:05 +00:00
Andrew Pantyukhin
17248c23f7 Add port textproc/p5-Unicode-CheckUTF8:
This is an XS wrapper around some Unicode Consortium code to check if
a string is valid UTF-8, revised to conform to what expat/Mozilla
think is valid UTF-8, especially with regard to low-ASCII characters.

Note that this module has NOTHING to do with Perl's internal UTF8 flag
on scalars.

This module is for use when you're getting input from users and want
to make sure it's valid UTF-8 before continuing.
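
As a rough Python analogue of this kind of input check, the sketch below validates a byte string before use. It is not the module's algorithm: the `is_valid_utf8` name is hypothetical, and the extra rule rejecting control characters below 0x20 other than tab, LF, and CR is an assumption about what the stricter low-ASCII handling might look like.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data is well-formed UTF-8 without stray control bytes."""
    try:
        # Strict decoding rejects truncated sequences, overlong encodings,
        # and encoded surrogates.
        text = data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    # Assumption: also reject control characters below 0x20 except
    # tab, line feed, and carriage return.
    return all(ch >= " " or ch in "\t\n\r" for ch in text)


if __name__ == "__main__":
    print(is_valid_utf8("héllo".encode("utf-8")))
    print(is_valid_utf8(b"\xc0\xaf"))
```

Checking at the point where user input arrives keeps invalid bytes from reaching later processing steps.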

WWW: http://search.cpan.org/dist/Unicode-CheckUTF8/
2006-07-25 21:50:41 +00:00
Martin Wilke
2d16749363 Text::Emoticon::GoogleTalk is a text filter that replaces text emoticons like
":-)", "<3", etc. with the icons of Google Talk (or Gmail Chat), detailed in
http://mail.google.com/support/bin/answer.py?answer=34056

WWW:    http://search.cpan.org/dist/Text-Emoticon-GoogleTalk/

PR:		ports/100676
Submitted by:	chinsan <chinsan.tw(at)gmail.com>
Approved by:	krion (mentor)
2006-07-25 21:13:31 +00:00
Martin Wilke
7e8c3594a3 Add uncrustify, a highly configurable source code beautifier.
The goals of this project are simple:
Create a highly configurable, easily modifiable source code beautifier.
What it does:
 * Indent code, aligning on parens, assignments, etc
 * Align on '=' and variable definitions
 * Align structure initializers
 * Align #define stuff
 * Align backslash-newline stuff
 * Reformat comments (a little bit)
 * Fix inter-character spacing
 * Add or remove parens on return statements
 * Add or remove braces on single-statement if/do/while/for statements
 * Highly configurable - 118 configurable options as of version 0.0.15

 WWW: http://uncrustify.sourceforge.net

PR:		ports/100604
Submitted by:	Dmitry Marakasov <amdmi3 at mail.ru>
2006-07-22 09:23:56 +00:00
Cheng-Lung Sung
0d8b6e60d3 Add p5-Lingua-EN-NamedEntity 1.7, basic Named Entity Extraction
algorithm.

PR:		ports/100675
Submitted by:	Jin-Shan Tseng <tjs at cdpa.nsysu.edu.tw>
2006-07-22 04:54:01 +00:00
Rong-En Fan
97bbe4de85 Add p5-CSS 1.07, object oriented access to Cascading Style Sheets (CSS).
PR:		ports/100640
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-21 12:23:04 +00:00
Alex Dupre
ec97c639b2 XStream is a simple library to serialize Java objects
to XML and back again.

WWW: http://xstream.codehaus.org/
2006-07-19 19:10:17 +00:00
Cheng-Lung Sung
6a9f73102a Add p5-Text-Trac 0.03, perl extension for formatting text with Trac Wiki
Style.

PR:		ports/100528
Submitted by:	Gea-Suan Lin <gslin at gslin.org>
2006-07-19 13:50:26 +00:00
Cheng-Lung Sung
1e0f2c09d7 Add p5-XML-SAXDriver-Excel 0.06, perl SAX driver for converting Excel
files to XML.

PR:		ports/100509
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-19 07:17:54 +00:00
Cheng-Lung Sung
8423c37d63 Add p5-XML-SAXDriver-CSV 0.07, perl SAX driver for converting CSV files
to XML.

PR:		ports/100510
Submitted by:	chinsan <chinsan.tw at gmail.com>
2006-07-19 07:15:52 +00:00